Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2013 Jan 23;33(4):1337–1343. doi: 10.1523/JNEUROSCI.1226-12.2013

Uncovering the Mechanisms of Conscious Face Perception: A Single-Trial Study of the N170 Responses

Joaquin Navajas 1,, Maryam Ahmadi 1, Rodrigo Quian Quiroga 1,2
PMCID: PMC6618736  PMID: 23345210

Abstract

When a face is flashed to an observer, a large negative component is elicited in the occipitotemporal cortex at ∼170 ms from the onset of presentation (N170). Previous studies have shown that the average N170 is correlated with conscious face perception; however, the single-trial mechanisms underlying such modulation remain largely unexplored. Here, we studied in human subjects the average and the single-trial N170 responses to briefly flashed faces, coupled with backward masking and varying degrees of Gaussian noise. In the average evoked responses we observed that, at fixed levels of noise, supraliminal faces exhibited significantly larger N170 amplitudes than subliminal faces. Moreover, the average N170 amplitude decreased with noise level both for the perceived and the nonperceived faces. At the single-trial level, the N170 amplitude was modulated by conscious recognition, which allowed predicting the subjects' perceptual responses above chance. In contrast, the single-trial N170 amplitudes were not modulated by the amount of noise and the effect found in the average responses was due to different latency jitters, as confirmed with latency-corrected averages. Altogether, these results suggest that conscious face perception is correlated with a boost in the activity of face-selective neural assemblies, whereas the stimulus uncertainty introduced by the added noise decreases the timing consistency (but not the amplitude) of this activation.

Introduction

Previous studies aimed at identifying the neural correlates of face perception have reported the existence of an event-related potential (ERP) in the occipitotemporal cortex, which is larger for faces than other objects, at ∼170 ms from the onset of stimulus presentation (N170) (Bentin et al., 1996). The types of stimuli and conditions that elicit the N170 response have been extensively studied (for review, see Eimer, 2011; Rossion and Jacques, 2011), along with its modulation by attention (Furey et al., 2006; Landau et al., 2007; Engell and McCarthy, 2010) and visual expertise (Rossion et al., 2002; Gauthier et al., 2003; McKone et al., 2007), among other factors.

Over the past years, several studies have reported that the average N170 is correlated with conscious face perception (Fisch et al., 2009; Harris et al., 2011; Rodríguez et al., 2012). However, with one exception (Fisch et al., 2009; see Discussion), previous works reporting differences between perceived and nonperceived faces induced lack of awareness by changing physical properties of the stimuli, such as the time of presentation (Liddell et al., 2004; Genetti et al., 2009; Pegna et al., 2011), the temporal separation to a mask (Bacon-Macé et al., 2005; Harris et al., 2011), the stimulus uncertainty given by its phase coherence (Philiastides and Sajda, 2006) or the amount of noise added to the faces (Jemel et al., 2003). These studies have provided valuable insights on how the N170 correlates with face perception, but have the caveat that the effect of conscious face perception is mixed up with the one given by processing stimuli with different physical properties. Moreover, the single-trial underpinnings of the N170 modulation by conscious face perception and by stimulus uncertainty remain largely unexplored. This is particularly important considering that an increase in the average ERP could be due to an increase of the amplitude of the single-trial responses, or due to a better time locking to the onset of the stimulus, thus reflecting different mechanisms (Quian Quiroga, 2000; Quian Quiroga and Garcia, 2003; Atienza et al., 2005).

Here, we studied the average and single-trial N170 responses to briefly flashed faces, coupled with backward masking and varying degrees of Gaussian noise. The amount of noise was tuned throughout the experiment using a double-staircase procedure. After completing the experiment, we could determine for each subject three levels of noise at which they failed to recognize the flashed image as a face in ∼30%, 50%, and 70% of the trials. This provided us the opportunity to simultaneously assess the effect of conscious face perception using identical stimuli (comparing the N170 for seen and unseen trials at fixed noise levels) and the effect of stimulus uncertainty (comparing the N170 for different levels of noise for each perceptual outcome: recognized or not). Moreover, we implemented a single-trial analysis to further understand the mechanisms underlying these two different modulations.

Materials and Methods

Stimuli.

Twenty pictures of faces in frontal view were selected from the Psychological Image Collection at Stirling (http://pics.psych.stir.ac.uk/), and 20 front views of cars were downloaded from different websites. A mask was created with randomly shuffled pieces taken from different images. All pictures were converted to grayscale and cropped to a resolution of 411 × 527 pixels. The mean brightness was normalized across images. Zero-mean Gaussian noise was added to the car and face images, with 10 different degrees of variance, linearly increasing from 0 to 1, expressed in units relative to the mean brightness of the picture. Pictures were presented in the center of a screen with a gray surrounding background, on a CRT Monitor with a resolution of 1024 × 768 pixels and a refresh rate of 70 Hz. The viewing distance was 50 cm.

Paradigm.

Each trial (Fig. 1) began with a fixation cross presented for a variable time (500–700 ms), followed by a picture of a face or a car shown for 57 ms, and a mask that was shown for 443 ms. Participants were instructed to click the left mouse button whenever they saw a face and click the right mouse button otherwise. Throughout the experiment, a double-staircase procedure was implemented—only for face images—by changing the variance of the Gaussian noise. The descending staircase started with a variance of 1, and the ascending staircase with a variance of 0 (no noise), with a step-size of 0.1.

Figure 1.

Figure 1.

Structure of a trial. A fixation cross was shown for 500–700 ms, followed by the presentation of a picture containing either a face or a car with varying degrees of Gaussian noise, flashed for 57 ms. Each stimulus was backward masked for 443 ms, and, after the mask presentation, the participant was requested to report whether the picture was a face or not.

Overall, the experiment consisted of 1200 trials, randomly interleaved: 200 corresponded to the presentation of faces without noise, 200 to the presentation of cars without noise, 350 to the ascending staircase, 350 to the descending staircase, and in the remaining 100 trials cars were presented with the level of noise determined by the staircase procedure for the current trial. These last set of “catch trials” were added to assess the reliability of the subjects' reports as established by the number of false positives (i.e., trials in which the participant reported seeing a face when a car was presented). After blocks of 240 trials, subjects were able to take a short rest break. The experiment lasted ∼1 h and 15 min.

After completing the experiment, we selected for each subject a level of noise in which the face-detection performance was ∼50% (i.e., threshold condition). The double-staircase procedure also allowed us to select higher and lower levels of noise in which the performances were ∼30% and 70%, respectively (i.e., sub-threshold and supra-threshold condition). We then separated the trials according to the level of noise and to the subjects' report (seen or unseen), leading to six conditions (i.e., sub-threshold unseen, sub-threshold seen, threshold unseen, threshold seen, supra-threshold unseen, and supra-threshold seen).

Participants.

Twenty-two participants (mean age 27.9 years; age range 21–37 years; 2 were left handed; 12 females) volunteered for this study. Subjects were students and researchers at the University of Leicester. All of them had normal or corrected-to-normal vision and no known neurological impairments. Data from another six participants were discarded due to an excessive number of false positives (≥10%) compared with an average of 1.7% (range 0–5%) for the remaining 22 subjects. All participants considered in this study had a minimum of 50 trials in each of the six conditions.

Recording and ERP analysis.

Signals were recorded with a sampling rate of 256 Hz using a BIOSEMI Active Two 64-channel EEG system. The reference was set off-line to average electrodes. Epochs including 500 ms before and after stimulus onset were extracted for further analysis and bandpass filtered with a second-order Butterworth filter between 1 and 70 Hz. Trials with eye movements, blinks, and other artifacts were rejected off-line by visual inspection. After examination of the evoked topographies, the PO7 and PO8 electrode sites were selected for further analysis as these showed the best P1, N170, and P2 responses. For statistical analysis, we computed for each channel and each subject the mean average ERP between 80 and 120 ms for the P1, between 150 and 190 ms for the N170, and between 200 and 240 ms for the P2 component. For each subject, these values were averaged across channels PO7 and PO8 in each of the six conditions. We then compared the different conditions with a two-factor within-subjects ANOVA with factors “noise level” (NL; three levels: sub-threshold, threshold, or supra-threshold; dfNL = 2) and “conscious report” (CR; two levels: face seen or unseen; dfCR = 1). The degrees of freedom of the error term for testing the significance of the “noise level” factor were dfNL×S = dfNL × dfS = 2 × 21 = 42. Similarly, the degrees of freedom of the error term for testing the significance of the conscious report factor were dfCR×S = dfCR × dfS = 1 × 21 = 21. Statistical differences were assessed separately for each ERP component (P1, N170, and P2).

Correct rejections of the catch trials (i.e., trials in which we presented a car with noise and the subject reported not seeing a face) were used to calculate a non-face object-evoked response with the level of noise of the threshold condition. To compare the response to cars with the one to seen and unseen faces, we used a one-factor within-subjects ANOVA with main factor “Selectivity” (i.e., Sel, three levels: face seen, face unseen, or car, dfSel = 2, dfSel×S = 42). Post hoc comparisons were performed with Scheffé's tests.

To compare between face and car presentations without noise, we also implemented a one-factor within-subjects ANOVA with main factor “Object” (i.e., Obj, two levels: face and car, dfObj = 1, dfObj×S = 21). This analysis was performed to study the face selectivity of the N170 with perfect visibility.

Single-trial analysis.

Single-trial ERPs were extracted by means of a denoising algorithm that uses a wavelet decomposition of the single-trial traces and denoises the ERPs by reconstructing the signal using only the wavelet coefficients related to the evoked responses (Quian Quiroga, 2000; Ahmadi and Quian Quiroga, 2013). This method was previously shown to significantly improve the estimation of the single-trial ERPs compared with the original (non-denoised) single-trial traces, as assessed visually with real visual and auditory evoked potentials, and quantitatively with simulated data resembling the components obtained with pattern visual evoked potentials (Quian Quiroga and Garcia, 2003). In the latter case, for different noise levels covering a wide range of signal-to-noise ratios found in real recordings, the single-trial denoising significantly decreased the error in the estimation of the single-trial amplitudes and latencies of the different components, compared with the non-denoised data (Quian Quiroga and Garcia, 2003). The set of wavelet coefficients selected to denoise the single-trial data was kept constant for all channels and subjects. The single-trial N170 responses were identified as the local minimum between 120 and 200 ms, and those of the P1 and P2 peaks were identified as the local maximum between 80 ms and the latency of the N170, and between the N170 peak and 240 ms, respectively. Similar results were obtained with slightly smaller or larger window sizes. To assess the correlation between the single-trial responses and the subjects' report, we implemented a linear discriminant analysis (Fisher, 1936). To evaluate the information provided by each component, in a separate analysis we decoded the behavioral responses by the subjects (face recognized or not) using the values of the P1, N170, and P2 peaks in channels PO7 and PO8. For validation of decoding performance, we used a leave-one-out cross-validation. The p value of the decoding performance was estimated by comparing the number of hits to those obtained from a binomial distribution under the assumption of chance guessing (Quian Quiroga and Panzeri, 2009). To further validate the results, statistical significance was also assessed with a nonparametric permutation test. In each permutation, the subjects' reports were shuffled, and the same linear discriminant analysis was implemented. This procedure was repeated 1000 times for each subject and component, thus producing a distribution of number of hits under the null hypothesis. The p value was estimated as the proportion of times that the number of hits from the shuffled distribution exceeded the observed one.

For each condition, the single-trial amplitudes of each component were calculated and latency jitters were computed as the SD of the single-trial latencies of each peak. Statistical differences for the single-trial amplitudes and latency jitters were assessed by the same ANOVA test used for the average evoked responses. Post hoc comparisons were performed using Scheffé's tests.

Results

Behavior

The implementation of the double-staircase procedure gave a mean noise variance of 0.45 ± 0.21 for the sub-threshold condition; 0.32 ± 0.17 for the threshold condition; and 0.20 ± 0.12 for the supra-threshold condition. After rejection of trials with blinks, eye-movements, and other artifacts, the proportion of seen trials was 31.3 ± 7.1% for the sub-threshold condition; 49.6 ± 6.6% for the threshold condition, and 73.1 ± 11.8% for the supra-threshold condition.

Average evoked responses

We first replicated the existence of the N170 face-selective responses by comparing the ERPs elicited by cars and faces without noise. An ANOVA revealed a significant effect for the N170 (F(1,21) = 27.71, p < 0.001), but not for the P1 (F(1,21) = 0.12, p = 0.73) and the P2 (F(1,21) = 0.55, p = 0.47) components.

To assess the dependence of the N170 with conscious face perception and with the uncertainty introduced by the Gaussian noise, we compared the ERPs for the seen and unseen trials at the three noise levels. Figure 2 shows the grand average ERPs at channel PO8 elicited by these six conditions, as well as the scalp topographies for the seen and unseen faces at the time of the N170 response. The P1 was neither modulated by conscious face perception nor by noise level. In contrast, we observe a larger N170 and, to a lesser degree, a larger P2 for the seen trials, especially for the threshold and supra-threshold conditions. Altogether, the responses were also larger for the presentations with a lower noise level (i.e., supra-threshold condition). Statistical analysis revealed that the N170 component presented a significant effect of noise level (F(2,42) = 13.93, p < 0.001) and conscious report (F(1,21) = 24.92, p < 0.001). The interaction between these factors was not significant (F(2,42) = 0.01, p = 0.98). The P1 and P2 components did not show a significant effect of noise level (P1: F(2,42) = 0.3, p = 0.74; P2: F(2,42) = 0.91, p = 0.41), conscious report (P1: F(1,21) = 1.14, p = 0.29; P2: F(1,21) = 2.55, p = 0.12) or interaction (P1: F(2,42) < 0.01, p > 0.99; P2: F(2,42) < 0.01, p > 0.99).

Figure 2.

Figure 2.

Neural correlates of conscious versus unconscious face perception. a, Grand average ERPs recorded at electrode PO8 for the sub-threshold, threshold, and supra-threshold conditions. For the threshold condition, we also display the car evoked potential (black line). Bands around mean values denote SEM. b, Scalp topography of activation for seen and unseen trials in the threshold condition, 170 ms after stimulus onset. Units of the bottom colorbars are microvolts.

We also compared the activity elicited by cars with the one by the seen and unseen faces at the threshold condition (Fig. 2a). An ANOVA showed a significant effect of selectivity (F(2,42) = 8.18, p < 0.001). Post hoc comparisons with Scheffé's tests revealed that the N170 elicited by cars was not significantly different to the one triggered by the unseen faces (t(21) = 2.7< tscheffé = 4.04) and was significantly smaller than the one for the seen faces (t(21) = 4.2 > tscheffé = 4.04, p < 0.001).

Single-trial analysis

Differences observed in the average responses could be due to different mechanisms. For instance, an increase of the average ERP could be due to an increase of the amplitude of the single-trial responses, or due to a better time locking to the onset of the stimulus, which reduces the latency jitter. To distinguish between these possibilities, we implemented a single-trial analysis. The single-trial P1, N170, and P2 peaks were extracted by identifying the local maxima (minima) of the denoised responses in different time windows (Fig. 3a; see Materials and Methods). The single-trial amplitude of the N170 was significantly larger for the trials in which the faces were recognized (F(1,21) = 64.12, p < 0.001). In contrast, this difference was not significant for the P1 (F(1,21) = 0.09, p = 0.75) and the P2 (F(1,21) = 2.1, p = 0.13) components. Post hoc comparisons revealed that the differences between seen and unseen trials in the N170 component were significant for the threshold (t(21) = 9.7 > tscheffé = 8.01, p < 0.001) and supra-threshold (t(21) = 9.74 > tscheffé = 8.01, p < 0.001) conditions, but not for the sub-threshold condition (t(21) = 2.25 < tscheffé = 8.01).

Figure 3.

Figure 3.

Decoding of the conscious reports with the single-trial N170. a, Average ERP and 15 denoised single-trial ERPs at electrode PO8 for a typical subject. The identification of the single-trial peaks is shown with the markers. b, Distribution of single-trial N170 responses in a typical participant used for decoding. Each blue (red) dot represents the single-trial N170 amplitudes measured at electrodes PO7 and PO8 in a seen (unseen) trial. The black line represents the Fisher's linear discriminant. The blue and red lines show the normalized distributions for seen and unseen trials projected along the axis perpendicular to the linear discriminant. c, Black dots show in a logarithmic scale the individual p values of the decoding performance with different components. The red lines indicate the median of the distribution of p values for each component.

Given the single-trial amplitude differences for the N170 responses, we examined whether we could use this feature to decode the participants' report (face seen or unseen). For this, we used a linear discriminant analysis with two features (single-trial N170 at the electrodes PO7 and PO8) and a leave-one-out cross-validation. Figure 3b shows the single-trial N170 amplitude for one typical subject at the two electrodes selected for decoding. Decoding performance was significantly higher than chance for most subjects (18/22 with p < 0.05), with a mean decoding performance of 60.1% (SD 4.2%). In contrast, using the P1 and P2 components, for most subjects (16/22 in both cases), the decoding performance was at chance level (P1: mean = 50.4%, SD = 11.0%; P2: mean = 54.8%, SD = 6.3%). The median of the distribution of p values was 0.01 for the N170, 0.26 for the P1, and 0.11 for the P2 components (Fig. 3c). The nonparametric permutation test (see Materials and Methods) gave very similar results (Median of p value distributions: 0.01 for the N170, 0.28 for the P1, and 0.16 for the P2 components).

Next, we studied the single-trial responses for the different noise levels. Figure 4 shows the mean and SD of the single-trial N170 amplitude (Fig. 4a) and latency jitter (Fig. 4b) at channel PO8 for all conditions. Similar results were obtained with electrode site PO7. Despite the differences seen in the average responses, an ANOVA test showed that the differences in the single-trial amplitudes for the three noise levels were not significant for any component: N170 (F(2,42) = 2.15, p = 0.13); P1 (F(2,42) = 0.35, p = 0.71); P2 (F(2,42) = 0.55, p = 0.58). The interaction between noise level and conscious report was also not significant for the N170 (F(2,42) = 0.03, p = 0.97), P1 (F(2,42) = 0.01, p = 0.99), and P2 (F(2,42) = 0.02, p = 0.98). For the N170, the lack of a difference in the single-trial amplitudes, but the significant amplitude difference in the average, suggested the presence of different degrees of latency variability for the three noise conditions. As expected, we found a significant negative correlation between the N170 latency jitter and the mean amplitude of the grand average N170 (r = −0.98, p = 0.002). Moreover, an ANOVA revealed that the N170 latency jitter was significantly modulated by noise level (F(2,42) = 4.96, p = 0.01) but not by conscious report (F(1,21) = 0.9, p = 0.35). Post hoc comparisons revealed that the modulation by noise level was explained by a significant difference between the sub-threshold and supra-threshold condition (t(21) = 3.21 > tscheffé = 3.14, p < 0.01). Differences in latency jitter were not significant for the P1 (noise level: F(2,42) = 1.83, p = 0.17; conscious report: F(1,21) = 1.24, p = 0.27) and the P2 (noise level: F(2,42) = 2.55, p = 0.09; conscious report: F(1,21) = 1.62, p = 0.21) components. The mean latency of the N170 was modulated neither by conscious recognition (F(1,21) = 0.62, p = 0.44) nor by noise level (F(2,42) = 2.23, p = 0.12), thus indicating that the stimulus uncertainty introduced by the noise modulated the timing consistency across trials but not the mean timing of occurrence of the N170 responses.

Figure 4.

Figure 4.

Single-trial analysis of ERP responses. a, Mean and SD of the single-trial N170 amplitude at channel PO8. b, Mean and SD of the N170 latency jitter at channel PO8. c, The noise-level effect (electrode site: PO8) measured at the average ERP level (top-left panel) vanished after latency-correction (top-right panel). In contrast, the conscious-report effect (bottom-left panel) persisted after latency correction (bottom-right panel). Bands around mean values denote SEM.

The single-trial analysis also allowed us to remove latency jitters and to compute latency-corrected averages —i.e., averages in which each single trial is shifted in time to have the same latency as the average response. Figure 4c shows the grand average and the latency-corrected grand average ERPs at channel PO8 for the three conditions in which subjects reported the presence of a face. In line with the findings described above, the noise-level effect vanished after latency correction of the N170, and the differences between the different noise levels became nonsignificant (F(2,42) = 1.77, p = 0.18). In contrast, the latency-corrected N170 still exhibited a significant modulation by conscious report (Fig. 4c, F(1,21) = 8.82, p = 0.007). In summary, conscious face perception modulated the single-trial amplitudes of the N170, whereas the uncertainty introduced by the Gaussian noise increased the variability of the N170 latency across trials, thus giving a reduced peak in the average ERP.

Discussion

The neural correlates of object recognition have been explored with different brain signals, going from EEG (Lamy et al., 2009; Fabre-Thorpe, 2011; Martens et al., 2011), MEG (Liu et al., 2002; Tanskanen et al., 2007), fMRI (Malach et al., 1995; Grill-Spector et al., 2000; Bar et al., 2001), and electrocorticography (ECoG) (Fisch et al., 2009; Gaillard et al., 2009), to single-cell recordings in human patients (Quian Quiroga et al., 2008) and nonhuman primates (for review, see Macknik, 2006). Among the subset of works that focused on face processing, there is an agreement that the inferotemporal cortex around the fusiform gyrus plays a decisive role in face recognition. This area is known to display face-selective responses in single-cell recordings (Tsao et al., 2006) as well as during noninvasive imaging (Kanwisher et al., 1997), and it was shown to be the source of the evoked N170 component measured with EEG recordings (Shibata et al., 2002).

Several works have studied the N170 in response to face presentations compared with other stimuli (Eimer, 2011). In line with these works, in our study we explored whether the N170 responses were modulated by conscious face perception and by the stimulus uncertainty introduced by different levels of noise. Using identical visual stimuli, we found that the amplitude of the average N170 was larger for recognized than for nonrecognized faces. Moreover, the average N170 diminished with increasing levels of noise. These two effects were specific for the N170 as they were absent in earlier (P1) and later (P2) ERP components. Furthermore, we investigated the mechanisms underlying these modulations by analyzing the single-trial amplitudes and latencies of the different components.

In general, ERPs are very small compared with the ongoing EEG oscillations, and their analysis relies on the identification of different peaks after averaging several trials. Although averaging improves the signal-to-noise ratio, important information about single-trial variations is completely dismissed, despite the fact that trial-to-trial changes can provide useful information about the mechanisms underlying the modulations observed in the average responses (Quian Quiroga et al., 2007). For instance, sleep-dependent enhancements of the average mismatch negativity—a component observed in auditory oddball paradigms—that were at first thought to be due to a recruitment of larger populations of neurons during sleep (Atienza et al., 2004), were later found to be due to a completely different mechanism after a single-trial analysis, namely, a decrease in the latency jitter of the single-trial responses (Atienza et al., 2005). Similar to the effect reported in that study, here we showed that the stimulus uncertainty introduced by the Gaussian noise decreased the average N170 by impoverishing the timing consistency (i.e., increasing the latency jitter) but not the amplitude of the single-trial responses. In turn, the face-recognition effect observed in the average N170 was linked to an attenuation of the single-trial responses for the unseen trials.

Previous studies have also implemented a single-trial approach to study the face versus non-face object sensitivity of the N170 (Philiastides and Sajda, 2006; Rousselet et al., 2007; Gaspar et al., 2011). For instance, it was found that this effect is correlated with an increase of the single-trial power at the 5–15 Hz band (Rousselet et al., 2007). Moreover, a single-trial decoding approach was also used to distinguish between face and object presentations with different levels of uncertainty (Philiastides and Sajda, 2006). While these works were focused on the face versus non-face object modulation, here we studied the single-trial underpinnings of the face-recognition effect (i.e., the difference between seen and unseen faces). Interestingly, it was recently shown that N170 grand averages might be misleading, suggesting that a single-subject analysis is essential to a correct interpretation of the results (Gaspar et al., 2011). Our study provides further evidence along this line, showing that even more information can be extracted from the single-trial analysis of the responses.

Most of the previous studies linking the average N170 to conscious face perception compared the activity elicited by “recognized” versus “not-recognized” stimuli inducing the different perceptual outcomes by changing physical properties of the stimuli (Jemel et al., 2003; Liddell et al., 2004; Bacon-Macé et al., 2005; Genetti et al., 2009; Pegna et al., 2011). The caveat of this approach is that it mixes up the effect of conscious recognition with the one given by processing different types of stimuli. In contrast, in this work the effect of conscious face perception was assessed by comparing the activity elicited by identical stimuli.

Only few previous works have used identical stimuli to study the N170 modulation by conscious face perception (Fisch et al., 2009; Genetti et al., 2010; Rodríguez et al., 2012). Genetti et al. (2010) reported differences due to face recognition but only at latencies larger than the one of the N170, probably due to the use of very short stimulus durations (16 ms). Another recent work (Rodríguez et al., 2012) showed that only fully perceived faces have an N170 activity significantly stronger than scrambled faces, while the activity of unseen faces was consistent with the one elicited by scrambled faces (though in this work seen and unseen faces were not directly compared with each other). In line, here we showed that seen, but not unseen, faces elicit an N170 significantly larger than cars. To the best of our knowledge, only one previous ECoG study, which focused on the analysis of gamma-induced oscillations, reported differences in the N170 elicited by seen and unseen faces using identical stimuli (Fisch et al., 2009). Building on the results of this study, here we described further evidence about the relationship of the N170 component with conscious face processing by showing that: (1) the intracranial responses observed by Fisch et al. (2009) are observable at the level of the scalp EEG, which has been by far the most used signal to study N170 in face perception; (2) this effect was specific to the N170 as it was absent in earlier and later components (the P1 and P2, respectively); (3) at the single-trial level the N170 could predict the behavioral responses better than chance; and (4) two different mechanisms underlie the modulation of the N170 responses with conscious recognition and with the stimulus uncertainty introduced with the Gaussian noise.

Summarizing, the current study provides evidence that the average N170 is modulated by conscious face perception, as well as by the amount of noise added to the stimulus. However, these two effects modulate the average N170 through different mechanisms, as shown by the single-trial analysis. While conscious face perception was correlated with a boost in the activity of face-selective neural assemblies, the stimulus uncertainty introduced by the Gaussian noise induced a decrease in the timing consistency (but not in the amplitude) of this activation.

Footnotes

This work was funded by the Engineering and Physical Sciences Research Council, UK. We thank the Psychological Image Collection at Stirling (http://pics.psych.stir.ac.uk/) for providing facial stimuli, and Kaliilah Bolkiah for assistance in data collection.

The authors declare no competing financial interests.

References

  1. Ahmadi M, Quian Quiroga R. Automatic denoising single-trial evoked potentials. Neuroimage. 2013;66:672–680. doi: 10.1016/j.neuroimage.2012.10.062. [DOI] [PubMed] [Google Scholar]
  2. Atienza M, Cantero JL, Stickgold R. Posttraining sleep enhances automaticity in perceptual discrimination. J Cogn Neurosci. 2004;16:53–64. doi: 10.1162/089892904322755557. [DOI] [PubMed] [Google Scholar]
  3. Atienza M, Cantero JL, Quian Quiroga R. Precise timing accounts for posttraining sleep-dependent enhancements of the auditory mismatch negativity. Neuroimage. 2005;26:628–634. doi: 10.1016/j.neuroimage.2005.02.014. [DOI] [PubMed] [Google Scholar]
  4. Bacon-Macé N, Macé M, Fabre-Thorpe M, Thorpe SJ. The time course of visual processing: backward masking and natural scene categorisation. Vision Res. 2005;45:1459–1469. doi: 10.1016/j.visres.2005.01.004. [DOI] [PubMed] [Google Scholar]
  5. Bar M, Tootell RB, Schacter DL, Greve DN, Fischl B, Mendola JD, Rosen BR, Dale AM. Cortical mechanisms specific to explicit visual object recognition. Neuron. 2001;29:529–535. doi: 10.1016/s0896-6273(01)00224-0. [DOI] [PubMed] [Google Scholar]
  6. Bentin S, Allison T, Puce A, Perez E, McCarthy G. Electrophysiological studies of face perception in humans. J Cogn Neurosci. 1996;8:551–565. doi: 10.1162/jocn.1996.8.6.551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Eimer M. The face-sensitive N170 component of the event-related brain potential. In: Calder AJ, Rhodes G, Johnson M, Haxby J, editors. The oxford handbook of face perception. Oxford, UK: Oxford UP; 2011. pp. 329–344. [Google Scholar]
  8. Engell AD, McCarthy G. Selective attention modulates face-specific induced gamma oscillations recorded from ventral occipitotemporal cortex. J Neurosci. 2010;30:8780–8786. doi: 10.1523/JNEUROSCI.1575-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Fabre-Thorpe M. The characteristics and limits of rapid visual categorization. Front Psychol. 2011;2:243. doi: 10.3389/fpsyg.2011.00243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Fisch L, Privman E, Ramot M, Harel M, Nir Y, Kipervasser S, Andelman F, Neufeld MY, Kramer U, Fried I, Malach R. Neural “ignition”: enhanced activation linked to perceptual awareness in human ventral stream visual cortex. Neuron. 2009;64:562–574. doi: 10.1016/j.neuron.2009.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Fisher RA. The use of multiple measurements in taxonomic problems. Ann Hum Genet. 1936;7:179–188. [Google Scholar]
  12. Furey ML, Tanskanen T, Beauchamp MS, Avikainen S, Uutela K, Hari R, Haxby JV. Dissociation of face-selective cortical responses by attention. Proc Natl Acad Sci U S A. 2006;103:1065–1070. doi: 10.1073/pnas.0510124103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gaillard R, Dehaene S, Adam C, Clémenceau S, Hasboun D, Baulac M, Cohen L, Naccache L. Converging intracranial markers of conscious access. PLoS Biol. 2009;7:e61. doi: 10.1371/journal.pbio.1000061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gaspar CM, Rousselet GA, Pernet CR. Reliability of ERP and single-trial analyses. Neuroimage. 2011;58:620–629. doi: 10.1016/j.neuroimage.2011.06.052. [DOI] [PubMed] [Google Scholar]
  15. Gauthier I, Curran T, Curby KM, Collins D. Perceptual interference supports a non-modular account of face processing. Nat Neurosci. 2003;6:428–432. doi: 10.1038/nn1029. [DOI] [PubMed] [Google Scholar]
  16. Genetti M, Khateb A, Heinzer S, Michel CM, Pegna AJ. Temporal dynamics of awareness for facial identity revealed with ERP. Brain Cogn. 2009;69:296–305. doi: 10.1016/j.bandc.2008.08.008. [DOI] [PubMed] [Google Scholar]
  17. Genetti M, Britz J, Michel CM, Pegna AJ. An electrophysiological study of conscious visual perception using progressively degraded stimuli. J Vis. 2010;10(14):10 1–14. doi: 10.1167/10.14.10. [DOI] [PubMed] [Google Scholar]
  18. Grill-Spector K, Kushnir T, Hendler T, Malach R. The dynamics of object-selective activation correlate with recognition performance in humans. Nat Neurosci. 2000;3:837–843. doi: 10.1038/77754. [DOI] [PubMed] [Google Scholar]
  19. Harris JA, Wu C, Woldorff MG. Sandwich masking eliminates both visual awareness of faces and face-specific brain activity through a feedforward mechanism. J Vis. 2011;11(7):3 1–12. doi: 10.1167/11.7.3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Jemel B, Schuller AM, Cheref-Khan Y, Goffaux V, Crommelinck M, Bruyer R. Stepwise emergence of the face-sensitive N170 event-related potential component. Neuroreport. 2003;14:2035–2039. doi: 10.1097/00001756-200311140-00006. [DOI] [PubMed] [Google Scholar]
  21. Kanwisher N, McDermott J, Chun MM. The fusiform face area: a module in human extrastriate cortex specialized for face perception. J Neurosci. 1997;17:4302–4311. doi: 10.1523/JNEUROSCI.17-11-04302.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lamy D, Salti M, Bar-Haim Y. Neural correlates of subjective awareness and unconscious processing: an ERP study. J Cogn Neurosci. 2009;21:1435–1446. doi: 10.1162/jocn.2009.21064. [DOI] [PubMed] [Google Scholar]
  23. Landau AN, Esterman M, Robertson LC, Bentin S, Prinzmetal W. Different effects of voluntary and involuntary attention on EEG activity in the gamma band. J Neurosci. 2007;27:11986–11990. doi: 10.1523/JNEUROSCI.3092-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Liddell BJ, Williams LM, Rathjen J, Shevrin H, Gordon E. A temporal dissociation of subliminal versus supraliminal fear perception: an event-related potential study. J Cogn Neurosci. 2004;16:479–486. doi: 10.1162/089892904322926809. [DOI] [PubMed] [Google Scholar]
  25. Liu J, Harris A, Kanwisher N. Stages of processing in face perception: an MEG study. Nat Neurosci. 2002;5:910–916. doi: 10.1038/nn909. [DOI] [PubMed] [Google Scholar]
  26. Macknik SL. Visual masking approaches to visual awareness. In: Martinez-Conde S, Macknik S, Martinez MM, Alonso J-M, Tse PU, editors. Visual perception, Pt 2, Vol 155: fundamentals of awareness: multi-sensory integration and high-order perception. New York: Elsevier; 2006. pp. 177–215. [Google Scholar]
  27. Malach R, Reppas JB, Benson RR, Kwong KK, Jiang H, Kennedy WA, Ledden PJ, Brady TJ, Rosen BR, Tootell RB. Object-related activity revealed by functional magnetic-resonance-imaging in human occipital cortex. Proc Natl Acad Sci U S A. 1995;92:8135–8139. doi: 10.1073/pnas.92.18.8135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Martens U, Trujillo-Barreto N, Gruber T. Perceiving the tree in the woods: segregating brain responses to stimuli constituting natural scenes. J Neurosci. 2011;31:17713–17718. doi: 10.1523/JNEUROSCI.4743-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. McKone E, Kanwisher N, Duchaine BC. Can generic expertise explain special processing for faces? Trends Cogn Sci. 2007;11:8–15. doi: 10.1016/j.tics.2006.11.002. [DOI] [PubMed] [Google Scholar]
  30. Pegna AJ, Darque A, Berrut C, Khateb A. Early ERP modulation for task-irrelevant subliminal faces. Front Psychol. 2011;2:88. doi: 10.3389/fpsyg.2011.00088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Philiastides MG, Sajda P. Temporal characterization of the neural correlates of perceptual decision making in the human brain. Cereb Cortex. 2006;16:509–518. doi: 10.1093/cercor/bhi130. [DOI] [PubMed] [Google Scholar]
  32. Quian Quiroga R. Obtaining single stimulus evoked potentials with wavelet denoising. Physica D. 2000;145:278–292. [Google Scholar]
  33. Quian Quiroga R, Garcia H. Single-trial event-related potentials with wavelet denoising. Clin Neurophysiol. 2003;114:376–390. doi: 10.1016/s1388-2457(02)00365-6. [DOI] [PubMed] [Google Scholar]
  34. Quian Quiroga R, Panzeri S. Extracting information from neuronal populations: information theory and decoding approaches. Nat Rev Neurosci. 2009;10:173–185. doi: 10.1038/nrn2578. [DOI] [PubMed] [Google Scholar]
  35. Quian Quiroga R, Atienza M, Cantero JL, Jongsma M. What can we learn from single-trial event-related potential. Chaos Complexity Lett. 2007;2:345–363. [Google Scholar]
  36. Quian Quiroga R, Mukamel R, Isham EA, Malach R, Fried I. Human single-neuron responses at the threshold of conscious recognition. Proc Natl Acad Sci U S A. 2008;105:3599–3604. doi: 10.1073/pnas.0707043105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Rodríguez V, Thompson R, Stokes M, Brett M, Alvarez I, Valdes-Sosa M, Duncan J. Absence of face-specific cortical activity in the complete absence of awareness: converging evidence from functional magnetic resonance imaging and event-related potentials. J Cogn Neurosci. 2012;24:396–415. doi: 10.1162/jocn_a_00137. [DOI] [PubMed] [Google Scholar]
  38. Rossion B, Jacques C. The N170: understanding the time-course of face perception in the human brain. In: Luck S, Kappenman E, editors. The oxford handbook of ERP components. Oxford, UK: Oxford UP; 2011. pp. 115–142. [Google Scholar]
  39. Rossion B, Gauthier I, Goffeaux V, Tarr MJ, Crommelinck M. Expertise training with novel objects leads to left-lateralized facelike electrophysiological responses. Psychological Science. 2002;13:250–257. doi: 10.1111/1467-9280.00446. [DOI] [PubMed] [Google Scholar]
  40. Rousselet GA, Husk JS, Bennett PJ, Sekuler AB. Single-trial EEG dynamics of object and face visual processing. Neuroimage. 2007;36:843–862. doi: 10.1016/j.neuroimage.2007.02.052. [DOI] [PubMed] [Google Scholar]
  41. Shibata T, Nishijo H, Tamura R, Miyamoto K, Eifuku S, Endo S, Ono T. Generators of visual evoked potentials for faces and eyes in the human brain as determined by dipole localization. Brain Topogr. 2002;15:51–63. doi: 10.1023/a:1019944607316. [DOI] [PubMed] [Google Scholar]
  42. Tanskanen T, Näsänen R, Ojanpää H, Hari R. Face recognition and cortical responses: effect of stimulus duration. Neuroimage. 2007;35:1636–1644. doi: 10.1016/j.neuroimage.2007.01.023. [DOI] [PubMed] [Google Scholar]
  43. Tsao DY, Freiwald WA, Tootell RB, Livingstone MS. A cortical region consisting entirely of face-selective cells. Science. 2006;311:670–674. doi: 10.1126/science.1119983. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES