Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Dec 1.
Published in final edited form as: Neuroimage. 2018 Aug 13;183:412–424. doi: 10.1016/j.neuroimage.2018.08.027

Processing of Auditory Novelty Across the Cortical Hierarchy: An Intracranial Electrophysiology Study

Kirill V Nourski a,b, Mitchell Steinschneider c, Ariane E Rhone a, Hiroto Kawasaki a, Matthew A Howard III a,b,d, Matthew I Banks e
PMCID: PMC6207077  NIHMSID: NIHMS993514  PMID: 30114466

Abstract

Under the predictive coding hypothesis, specific spatiotemporal patterns of cortical activation are postulated to occur during sensory processing as expectations generate feedback predictions and prediction errors generate feedforward signals. Establishing experimental evidence for this information flow within cortical hierarchy has been difficult, especially in humans, due to spatial and temporal limitations of non-invasive measures of cortical activity. This study investigated cortical responses to auditory novelty using the local/global deviant paradigm, which engages the hierarchical network underlying auditory predictive coding over short (‘local deviance’; LD) and long (‘global deviance’; GD) time scales. Electrocorticographic responses to auditory stimuli were obtained in neurosurgical patients from regions of interest (ROIs) including auditory, auditory-related and prefrontal cortex. LD and GD effects were assayed in averaged evoked potential (AEP) and high gamma (70–150 Hz) signals, the former likely dominated by local synaptic currents and the latter largely reflecting local spiking activity. AEP LD effects were distributed across all ROIs, with greatest percentage of significant sites in core and non-core auditory cortex. High gamma LD effects were localized primarily to auditory cortex in the superior temporal plane and on the lateral surface of the superior temporal gyrus (STG). LD effects exhibited progressively longer latencies in core, non-core, auditory-related and prefrontal cortices, consistent with feedforward signaling. The spatial distribution of AEP GD effects overlapped that of LD effects, but high gamma GD effects were more restricted to non-core areas. High gamma GD effects had shortest latencies in STG and preceded AEP GD effects in most ROIs. This latency profile, along with the paucity of high gamma GD effects in the superior temporal plane, suggest that the STG plays a prominent role in initiating novelty detection signals over long time scales. Thus, the data demonstrate distinct patterns of information flow in human cortex associated with auditory novelty detection over multiple time scales.

Keywords: Averaged evoked potential, human auditory cortex, electrocorticography, high gamma, predictive coding

1. Introduction

Far from being passive receivers of sensory information, humans are actively engaged in the process of sensation. Strategies that are maximally efficient for extracting behaviorally relevant cues from the environment rely on contextual modulation of sensory perception and motor responses based on attention and expectation (den Ouden et al., 2012). According to the predictive coding hypothesis (Bar, 2009; Clark, 2013; Friston, 2005), this is accomplished in the brain by varying the parameters of a generative model, whose output is predictions carried by feedback (FB) afferents, to best match the observed sensory input. Parameter optimization is driven by error signals arising from mismatches between expectations and observations, which are projected back to higher levels via cortical feedforward (FF) connections (Bastos et al., 2012; Mumford, 1992; Rao and Ballard, 1999). These error signals can arise at the circuit level from local comparisons between FF and FB inputs, as well as from local synaptic mechanisms, such as stimulus specific adaptation (Nelken and Ulanovsky, 2007). In both cases, however, error signals are envisioned to be projected via strong coupling between adjacent levels in the cortical hierarchy, either directly or via a thalamic loop (Sherman and Guillery, 2011). This form of bidirectional message passing is an important element of leading theories of brain function and consciousness (Dehaene and Changeux, 2011; Friston, 2010; Mashour, 2013; Tononi et al., 2016).

Although there is extensive circumstantial support for the predictive coding hypothesis (Bastos et al., 2012; Heilbron and Chait, 2017), direct experimental evidence is only beginning to emerge (Blank and Davis, 2016; Egner et al., 2010). Because the sensory environment exhibits nested, hierarchical complexity, with events unfolding over multiple time scales (e.g. phonemes, words, sentences in spoken language), the optimal generative model for deviance detection is also likely to be hierarchical and operate over multiple temporal integration windows. This idea is supported by computational and theoretical work (Hinton, 2010; Hohwy et al., 2008; Rao and Ballard, 1999). However, we still do not have a detailed understanding of the spatial distribution, timing, and directionality of information flow during the processing of auditory novelty that would place deviance detection firmly within the framework of predictive sensory processing. According to the predictive coding model, activation of higher-order cortical areas precedes that of lower-order areas in generating expectations, while prediction error signals unfold in the opposite order. This spatio-temporal profile should exhibit further complexity when FF error signals trigger model updates and refined FB predictions. To date, there has been limited investigation of the temporal sequence of activation of cortical regions during predictive coding. Thus, it is still unclear whether predictions are generated in higher-order areas and projected down the cortical hierarchy. Even the identity of these interacting regions remains ambiguous (Deouell, 2007), and depends on the time scale over which predictive coding operates and the level of concurrent engagement of attentional networks (Wacongne et al., 2011).

Auditory novelty detection is a sensory task postulated to rely on predictive coding (Garrido et al., 2009). The present study took advantage of the local/global deviant (LGD) stimulus paradigm (Bekinschtein et al., 2009). This paradigm is an elaboration of the classic oddball paradigm designed to investigate auditory novelty detection over multiple time scales and engage both pre-attentive and conscious perceptual processes (Bekinschtein et al., 2009; Strauss et al., 2015). Stimuli consist of repeating tokens, with repetition within the stimulus (“local standards”; LS) establishing expectation for the final token over short time scales (<1 s). Repeated stimulus sequences (“global standards”; GS) establish expectation on a longer time scale (5–10 s). Violations of expectation by “local deviants” (LD) and “global deviants” (GD) trigger enhanced responses that are interpreted as prediction error signals. LD effects have been associated with the scalp-recorded mismatch negativity (MMN) response (Näätänen and Alho, 1995), whose generators are reported to be localized primarily to the auditory cortex (Bekinschtein et al., 2009; Joos et al., 2014). GD effects have been associated with the P3b component of the event-related potential (Kok, 2001), generated within a broader network that includes temporo-parietal and prefrontal regions (Bekinschtein et al., 2009; Halgren et al., 1998; Smith et al., 1990).

Key questions remain about the generators of LD and GD effects. For example, although some studies localized the MMN and LD effects to auditory cortex (Alho, 1995; Bekinschtein et al., 2009; El Karoui et al., 2015), others describe a broader network including auditory cortex, surrounding temporo-parietal auditory-related areas and prefrontal cortex (PFC) (Durschmid et al., 2016; Rinne et al., 2000; Schonwiesner et al., 2007). Furthermore, even basic information about the sequence of activation of these regions is unclear, with the timing of frontal relative to temporal generators of the MMN ranging from lag to lead (Rinne et al., 2000; Schonwiesner et al., 2007; Yago et al., 2001). It is well-established that GD effects manifest with longer latencies (Bekinschtein et al., 2009), suggesting greater involvement of higher order processing compared to the LD effect. Whether the response to the GD stimulus is generated first in PFC, auditory-related or auditory cortex is unclear, as is the associated timing of GD effects within the cortical hierarchy. Here, we provide a detailed picture of the spatial and temporal activation patterns during auditory novelty detection over multiple time scales, taking advantage of the superior spatial and temporal resolution offered by intracranial recordings. This work builds upon previous electrocorticography (ECoG) studies of auditory novelty detection (Durschmid et al., 2016; Edwards et al., 2005; El Karoui et al., 2015) by analyzing high density ECoG data obtained simultaneously from multiple cortical levels envisioned to be involved in auditory predictive coding, from core (primary) auditory cortex to PFC.

2. Methods

2.1. Subjects

Study subjects were six neurosurgical patients with medically refractory epilepsy who had been implanted with intracranial ECoG electrodes to identify resectable seizure foci. Research protocols were approved by the University of Iowa Institutional Review Board and the National Institutes of Health, and written informed consent was obtained from all subjects. Research participation did not interfere with acquisition of clinically necessary data, and subjects could rescind consent for research without interrupting their clinical management. Experiments were performed when the patients returned to the operating room to undergo electrode removal and seizure focus resection surgery. The demographic and seizure focus data for each subject are presented in Table 1. Recording sites that were confirmed to be involved in seizure onset were excluded from analysis. All subjects were native English speakers, right-handed and had left language dominance as determined by Wada tests. All subjects had pure-tone thresholds within 30 dB hearing level between 125 Hz and 8 kHz (Supplementary Fig. 1). Word recognition scores, as evaluated by spondees presented via monitored live voice, were 88% or higher, and speech reception thresholds were within 20 dB in all tested subjects. Cognitive function, as determined by standard neuropsychological assessments, was in the average range in all subjects.

Table 1.

Subject demographics.

Subject1 Age Sex2 Seizure focus Surgical procedure
R369 30 M R medial temporal lobe R anterior and medial temporal lobectomy
L372 34 M L temporal pole L anterior and medial temporal lobectomy
R376 48 F R medial temporal lobe R anterior and medial temporal lobectomy
R394 24 M R amygdala R anterior and medial temporal lobectomy
R399 22 F R temporal lobe with early
propagation to R inferior lateral
frontal lobe
R temporal lobectomy and R inferior lateral
frontal lobectomy
L400 59 F L amygdala L anterior and medial temporal lobectomy
1

Letter prefix of the subject code denotes the side of electrode implantation over auditory cortex, and the side of seizure focus (L = left; R = right). Most subjects had, to varying degrees, bilateral coverage of other regions of the brain.

2

F = female; M = male

Subject R394 had previously undergone resection of a cavernoma in the anterior medial temporal lobe, in which part of the amygdala and head of hippocampus were removed. Adhesion of the dura mater to the cortical surface secondary to that surgery precluded subdural placement of surface electrode arrays in this subject. Instead, surface arrays were placed epidurally. While the epidural recordings had diagnostic utility for seizure focus detection, their signal-to-noise ratio and localization of electrode contacts in relation to the cortical surface anatomy were unfavorable for research data acquisition. Consequently, recordings obtained from the epidural contacts were excluded from analyses in the present study. In this subject, cortex that corresponded to all studied brain regions (see below) was spared, with the exception of planum polare. Additionally, this subject had normal hearing, normal intelligence, and exhibited above-average performance on the behavioral task. This provided a justification for inclusion of data obtained from this subject in the study.

2.2. Stimulus and procedure

Stimulus generation was controlled by a TDT RZ2 real-time processor (Tucker-Davis Technologies, Alachua, FL). Experimental stimuli were vowels /ɑ/ and /i/, presented in an LGD paradigm (Bekinschtein et al., 2009; El Karoui et al., 2015; Strauss et al., 2015) (Fig. 1). The vowels were excised from the steady-state vowel portions of consonant-vowel stimuli /hɑd/ and /hid/, spoken by a female (F0 = 232 Hz and 233 Hz, respectively; Hillenbrand et al., 1995). On each trial, five 100 ms vowels, normalized to the same root-mean-square amplitude, gated with 5 ms on/off ramps and separated by 50 ms silent intervals, were presented, with the fifth vowel being either the same as the first four (LS) or different (LD; Fig. 1a). This difference constituted local deviance: /ɑɑɑɑɑ/ and /iiiii/ were LS stimuli, while /ɑɑɑɑi/ and /iiiiɑ/ were LD stimuli.

Figure 1.

Figure 1.

Local/global deviant experimental paradigm. a: Schematic of the four experimental stimuli. b: Stimulus sequences. c: Comparisons between trials to characterize local and global deviance responses. Adapted from Strauss et al. (2015).

The stimuli were presented in four sequences, with the order of the sequences randomized across subjects (Fig. 1b). Each sequence began with a recorded instruction that defined the task and the target (GD) stimulus to the subject, e.g., for Sequence 2: “This time, press the button when you hear this sound: /ɑɑɑɑɑ/. Once again, press the button when you hear this sound: /ɑɑɑɑɑ/.” This was followed by a habituation sequence of 10 trials that established the GS stimulus (e.g. /ɑɑɑɑi/ for Sequence 2), and then by 72 GS trials and 18 GD trials, presented in a pseudorandom order. The difference in presentation frequency thus constituted the global deviance, and the identity of the GD stimulus changed across the four sequences within each block. This design allowed for the simultaneous evaluation of responses to auditory novelty on two different time scales (local and global; Fig. 1c). The intertrial interval varied within a Gaussian distribution (onset-to-onset mean 1500 ms, S.D. = 10 ms) to reduce heterodyning in the recordings secondary to power line noise. The duration of the experimental block was 11 minutes (400 ~1.5 s trials and four 15 s instruction segments).

Stimuli were delivered to both ears via insert earphones (ER4B, Etymotic Research, Elk Grove Village, IL) that were integrated into custom-fit earmolds. Acoustic stimulation was performed at a comfortable level, typically 60–65 dB SPL. The target detection task was used to control for and provide measures of the subjects’ level of attention. The hand ipsilateral to the hemisphere from which recordings were made was used to operate the response button. This was done to minimize contributions of preparatory (motor planning), motor and somatosensory responses associated with the button press to recorded neural activity as opposed to auditory deviance processing per se.

The data presented here were collected within the context of a larger study on effects of general anesthesia on auditory cortical responses during a pre-drug baseline period (Nourski et al., 2018). As part of that study, the subjects’ overall level of alertness was evaluated over the course of the experiment using the Observer’s Assessment of Alertness/Sedation (OAA/S) scale, which ranges from OAA/S = 5 for fully awake to OAA/S = 1 for unresponsive even to noxious stimuli (Chernik et al., 1990). Because we observed that task performance even prior to administration of anesthesia was related to level of alertness, we present the task performance data along with OAA/S scores measured immediately before and after the recordings presented here.

2.3. Recording

ECoG recordings were made using subdural and depth electrode arrays (Ad-Tech Medical, Racine, WI) placed on the basis of clinical requirements to identify seizure foci (Nagahama et al., 2017; Reddy et al., 2010). Electrode implantation, recording and ECoG data analysis have been previously described in detail (Howard et al., 1996; Howard et al., 2000; Nourski and Howard, 2015; Reddy et al., 2010). In brief, the subdural arrays consisted of platinum-iridium disc electrodes (2.3 mm exposed diameter, 5–10 mm inter-electrode distance) embedded in a silicon membrane. Subdural strip and grid arrays were implanted over lateral and ventral surfaces of temporal and frontal lobe, and lateral parietal cortex. Depth electrode arrays (8–12 macro contacts, spaced 5 mm apart) targeting superior temporal plane, including Heschl’s gyrus (HG), were stereotactically implanted along the anterolateral-to-posteromedial axis of the gyrus. Additional arrays targeted insular cortex and provided coverage of posteromedial HG (HGPM), planum temporale (PT) and planum polare (PP). This configuration was clinically warranted, as it bracketed epileptogenic zones from dorsal, ventral, medial and lateral aspects, providing a more accurate assessment of suspected temporal lobe seizure foci than could be achieved with subdural electrodes alone. Depth electrodes that targeted mesial temporal lobe structures (amygdala and hippocampus) provided additional coverage of auditory-related cortex within superior temporal sulcus. A subgaleal electrode was used as a reference in all subjects.

Reconstruction of the anatomical locations of implanted electrodes and their mapping onto a standardized set of coordinates across subjects was performed using FreeSurfer image analysis suite (Version 5.3; Martinos Center for Biomedical Imaging, Harvard, MA) and in-house software (see Nourski et al. (2014) for details). In brief, subjects underwent whole-brain high-resolution T1-weighted structural magnetic resonance imaging (MRI) scans (resolution and slice thickness ≤1.0 mm) before electrode implantation. After electrode implantation, subjects underwent MRI and thin-slice volumetric computerized tomography (CT) (resolution and slice thickness ≤1.0 mm) scans. Contact locations of the depth and subdural electrodes were first extracted from post-implantation MRI and CT scans, respectively. These were then projected onto preoperative MRI scans using non-linear three-dimensional thin-plate spline morphing, aided by intraoperative photographs. Data from multiple subjects were pooled by transforming the electrode locations into standard Montreal Neurological Institute (MNI) coordinates. This was done for each contact using linear co-registration to the MNI152 T1 average brain, as implemented in FMRIB Software library (Version 5.0; FMRIB Analysis Group, Oxford, UK). Left hemisphere MNI x-axis coordinates (xMNI) were then multiplied by (−1) to map them onto the right-hemisphere common space. Contacts were then projected onto the right lateral hemispheric surface and right superior temporal plane of the FreeSurfer average template brain.

Regions of interest (ROIs) included, in roughly ascending hierarchical order, core auditory cortex in HGPM, non-core auditory areas (anterolateral portion of Heschl’s gyrus [HGAL], PT, PP, superior temporal gyrus [STG]), auditory-related (insular cortex, superior temporal sulcus, middle temporal gyrus [MTG], supramarginal and angular gyrus), and PFC, including inferior, middle and superior frontal gyrus, orbital gyrus and transverse frontopolar gyrus (Table 2). Assignment of recording sites to ROIs was based upon anatomical reconstructions of electrode locations in each subject. For subdural arrays, it was informed by automated parcellation of cortical gyri (Destrieux et al., 2010; Destrieux et al., 2017) as implemented in the FreeSurfer software package. For depth electrodes, ROI assignment was informed by MRI sections along sagittal, coronal and axial planes. Recording sites identified as seizure foci or characterized by excessive noise, depth electrode contacts in white matter or outside brain, and epidurally implanted arrays in subject R394 were excluded from analyses and thus are not listed in Table 2. These criteria led to exclusion of 34/249 contacts in subject R369, 33/214 in L372, 27/226 in R376, 49/74 in R394, 50/238 in R399, and 52/210 in L400.

Table 2.

Electrode coverage.

Subject HGPM HGAL PT PP STG Auditory-
related
PFC Other Total
R369 8 5 4 6 20 77 44 51 215
L372 6 4 4 4 25 56 34 48 181
R376 7 4 3 3 21 73 30 58 199
R394 8 0 2 0 0 6 2 7 25
R399 3 4 2 1 23 46 47 62 188
L400 4 5 1 1 6 24 56 61 158
Total 36 22 16 15 95 282 213 287 966

ECoG data acquisition was performed using a TDT RZ2 real-time processor (Tucker-Davis Technologies, Alachua, FL) in the operating room. Collected ECoG data were amplified, filtered (0.7–800 Hz bandpass, 12 dB/octave rolloff), digitized at a sampling rate of 2034.5 Hz and stored for subsequent offline analysis.

2.4. Analysis

Behavioral performance in the target detection task was described in terms of accuracy (hit rate, expressed as % of detected target stimuli), sensitivity index (d’, calculated as Zhit rateZfalse alarm rate, where Z(p) is the inverse of the cumulative distribution function of the normal distribution) and reaction times (RTs). Only button presses that occurred between the onset of the 5th vowel and the onset of the next trial (i.e. ~900 ms after the 5th vowel onset; see Fig. 1) were counted as hits. Button presses that overlapped with the next non-target trial were counted as false alarms. The behavioral results thus likely somewhat underestimated the actual target detection rate and biased the RTs towards faster responses.

ECoG data obtained from each recording site were downsampled to 1000 Hz. To minimize contamination from power line noise, ECoG waveforms were de-noised using a demodulated band transform-based procedure (Kovach and Gander, 2016). Data analysis was performed using custom software written in MATLAB Version 7.14 programming environment (MathWorks, Natick, MA, USA).

Two signals of interest were extracted from the recorded data: averaged evoked potentials (AEPs) and high gamma event-related band power (ERBP). AEPs are dominated by low frequency signals, most notably synaptic currents flowing in the vicinity of the electrode, and thus roughly index inputs to the cortical site from which recordings are made. High gamma ERBP is closely related to unit activity and thus largely reflects the output signal from the recorded region (Mukamel et al., 2005; Nir et al., 2007; Ray et al., 2008; Steinschneider et al., 2008; Whittingstall and Logothetis, 2009). Voltage deflections greater than 5 S.D. from the across-block mean for each recording site were considered artifacts; trials containing such deflections were excluded from further analysis. In 95% of recording sites, 62 or fewer trials (i.e. ≤15.5%) were rejected using this approach.

High gamma power was calculated for each recording site by bandpass filtering the ECoG signal (300th order finite impulse response filter, 70–150 Hz passband), followed by Hilbert envelope extraction. Power envelope waveforms were then log-transformed and, for each of the four sequences within the experimental block (see Fig. 1b), normalized to the mean power over the entire duration of the sequence. LFP and high gamma ERBP waveforms were smoothed using a 4th order Butterworth lowpass filter (30 Hz cutoff). AEPs and high gamma ERBP representing responses to standard and deviant stimuli were computed by time-domain averaging of single-trial LFP waveforms and high gamma ERBP envelopes.

Recording sites that responded to the first four vowels were identified based on the approach used previously by Nourski et al. (2014). First, data within each trial were baseline-corrected by subtracting the mean value in the 100 ms interval immediately preceding the onset of the first vowel. For AEP data, sites were considered responsive to the stimulus onset if either the upper or the lower bound of the 95% confidence interval of the AEP mean was below or above zero µV, respectively, for at least 30 ms, and the following peak exceeded the mean value at the threshold-crossing time point by at least twofold. For high gamma data, the threshold criterion was based on the lower bound of the 95% confidence interval exceeding 0 dB for at least 30 ms, and the following peak exceeded the mean value at the threshold-crossing time point by at least twofold.

LD and GD effects were defined as significant increases in averaged responses to the corresponding deviant vs. standard stimuli within the time interval between 0 and 800 ms following the onset of the fifth vowel. Statistical significance was established using a non-parametric cluster-based permutation test introduced by Maris and Oostenveld (2007). The test statistic was based on grouping adjacent time points that exhibited a significant difference between experimental conditions. The cluster statistic was constructed by first computing two-sample t-statistics across all time points for each recording site. For each time point, t-values were compared to a threshold corresponding to the 1st percentile tail of the T-distribution. The threshold was the 99.5th percentile for two-tailed tests for AEP data and 99th percentile for the one-tailed tests for high gamma data (one-tailed, as high gamma LGD effects were defined as increases in high gamma ERBP in the deviant vs. standard condition). Clusters were defined as consecutive time points for which the t-statistic exceeded the threshold. The cluster-level statistic was computed as the sum of the t-values within each cluster. The significance level (p-value) of those statistics was calculated using permutation tests. To construct the permutation distribution, 10,000 random partitions of experimental conditions were made, shuffled with respect to experimental conditions (standard or deviant), the cluster statistics were calculated, and the largest cluster-level statistic was identified for each partition. This yielded a 10,000-sample distribution set of the test statistics. Monte Carlo p-values (Phipson and Smyth, 2010) were calculated for each cluster based on this permutation distribution. The rationale for the maximum cluster-level statistic was to reduce the false alarm rate and control for multiple comparisons at a single-contact level.

Although non-parametric clustering can be used for grouping across spatial dimensions, spatial clustering was not feasible in the present study, as recording sites in individual subjects were not evenly spaced, and electrode coverage typically included a combination of multiple depth and subdural arrays. Therefore, the clustering approach was used only in the time domain to account for temporal adjacency of significant effects. To correct for multiple comparisons across recording sites, all p-values were adjusted by controlling the false discovery rate (Heller et al., 2006), following the approach of El Karoui et al. (2015). False discovery rate correction was applied to all the p-values in each of the four test categories (LD AEP, LD high gamma, GD AEP, GD high gamma) using the method introduced by Benjamini and Hochberg (1995). The differences between responses to standard and deviant stimuli were considered significant at q<0.05.

Recording sites with at least one significant cluster were considered as exhibiting the deviance effect. The time course of deviance effects was described in three ways. First, onset latencies of deviance effects were defined as the first time point of the earliest significant cluster in each recording site. Second, the detailed time course of each deviance effect was characterized by summing significant clusters across recording sites in all subjects at each time point between 0 and 800 ms following the fifth vowel onset. Third, sites that exhibited LGD effects at four representative time points (100, 225, 400 and 700 ms after the onset of the fifth vowel onset) were plotted in MNI coordinate space.

Differences in onset latencies of LGD effects were evaluated using non-parametric statistical analysis (Kruskal-Wallis tests and two-tailed Wilcoxon rank sum tests). For sites that exhibited both LD and GD effects, comparisons between the effects’ onset latencies were made using two-tailed Wilcoxon signed rank tests.

3. Results

3.1. GD stimulus target detection task performance in awake subjects

Subjects exhibited variable performance on the GD stimulus target detection task (Fig. 2a). Target hit rates, computed over the course of the entire block, varied from 51.4% (L372) to 90.3% (R369) (Fig. 2b, upper panel), while sensitivity (d’) varied from 1.62 in subject L400 to over 5 in subject R369 (who had no false alarm responses) (Fig. 2b, middle panel). The grand median RT across subjects was 463 ms (Fig. 2b, lower panel).

Figure 2.

Figure 2.

GD stimulus target detection task performance. Summary of data from six subjects. a: Timing of button presses in each subject. Hits and false alarm responses are shown as long and short vertical lines, respectively. Schematic of the 11-minute-long experimental block is shown on top; recorded instructions and LGD stimulus sequences (see Fig. 1b) are represented by white and gray rectangles, respectively. Numbers represent OAA/S scores, as assessed before and after the experimental block. b: Hit rates (% correctly detected target stimuli), sensitivity (d’) and reaction times (RTs) (upper, middle and lower panel, respectively) for each of the six subjects. Box-and-whiskers RT plot depicts median values and 10th, 25th, 75th and 90th percentiles; median values for each subject are shown inside boxes. Dashed line corresponds to the grand median value across all subjects and hit trials (0.463 s).

Four subjects out of six (R369, R376, R394, R399) had maximal OAA/S scores of 5 before and after completion of the experimental block, indicative of a fully awake and alert state. Task performance for subject L372 was characterized by a relatively low hit rate, yet the second highest sensitivity. Analysis of this subject’s task performance over the course of the experimental block showed a marked decrease in button presses indicative of decreased attention to the task (see Fig. 2a). This paralleled the decrease in that subject’s level of awareness as indexed by OAA/S score. Thus, within the block, when the subject was alert and responding, performance accuracy was high. Likewise, the baseline level of alertness in subject L400 was diminished compared to the rest of subjects, as evidenced by OAA/S scores. Thus, as expected, task performance was related to the overall behavioral state of the subjects.

3.2. Neural responses to the vowels prior to the onset of deviance

The first four vowels of the experimental five-vowel stimuli occurred prior to the emergence of either local or global deviance. Examination of neural activity during this initial portion of the stimuli provided a means to examine responses elicited by sound, regardless of auditory novelty. The vowel tokens elicited AEPs and high gamma responses in all ROIs (Fig. 3). Responses simultaneously recorded from four exemplary right hemisphere sites (Fig. 3a) in subject R369 who had both the highest hit rate and sensitivity in the behavioral task are shown in Figure 3b. Responses to the first four vowels varied as a function of ROI. Within HGPM (Fig. 3b, upper row), AEP waveforms were time-locked to the onset of each vowel. High gamma increases were primarily associated with the onset of the five-vowel stimulus. AEPs and high gamma responses were of lower magnitude in other ROIs (Fig. 3b, second – fourth rows). The overall incidence of significant AEPs (see Methods) varied across cortical regions. Specifically, across all six subjects, 100% sites in HGPM, HGAL and PT, 60.0% in PP, 81.1% in STG, 48.2% in auditory-related cortex and 35.7% in PFC exhibited significant AEPs to the first four vowels. High gamma activation was more spatially restricted; significant responses were present in 94.0% of HGPM sites, 54.5% in HGAL, 62.5% in PT, 13.3% in PP, 40.0% in STG, 4.96% in auditory-related cortex and 0% in PFC.

Figure 3.

Figure 3.

Cortical responses to standard and deviant stimuli along the auditory processing hierarchy. a: MRI top-down view of superior temporal plane and side view of the hemispheric surface showing the locations of four representative recording sites (sites A, B, C, D in HG, STG, MTG and inferior frontal gyrus [IFG], respectively) in subject R369. Colors represent different ROIs used in the present study. b: AEP and high gamma responses (cyan/turquoise and magenta/purple waveforms, respectively) recorded from the four cortical sites in response to standard and deviant stimuli (lighter and darker waveforms, respectively). Lines and shading represent mean values and the 95% confidence intervals, respectively. LD and GD effects are presented on the left and right, respectively. Horizontal bars underneath response waveforms highlight significant differences between responses to standard and deviant stimuli (q<0.05, non-parametric cluster-based permutation test, false discovery rate-adjusted).

3.3. Spatial properties of LD and GD effects

Auditory novelty elicited AEPs and increases in high gamma ERBP beyond those that might represent offset responses to the final token. These deviance effects were defined as significant differences between responses to standard and deviant stimuli (cyan and magenta bars in Fig. 3b). In subject R369, significant LD effects were seen in the AEP at all ROIs, while high gamma increases were restricted to core and non-core auditory cortex (see Fig. 3b, left column). GD AEP effects were also present at all four sites, with the earliest changes occurring in the STG, followed by activity in PFC. Significant high gamma GD effects were present in the STG and auditory-related cortex in the MTG (see Fig. 3b, right column).

Exemplar data are representative of the overall spatial distribution of LD and GD effects in this subject (Fig. 4). AEPs reflecting LD were seen throughout all ROIs as well as other brain regions (e.g. amygdala; inset in Fig. 4a), whereas increases in high gamma ERBP were restricted to auditory cortex ROIs (Fig. 4a). GD effects were associated with a different spatial distribution compared to LD effects (Fig. 4b). High gamma GD effects were not common in auditory cortex and instead were more prominent in auditory-related cortex and PFC, reflecting activity originating within higher levels of the cortical processing hierarchy. Comparable patterns were observed in subject R376 (Supplementary Fig. 2). Especially notable was the prevalence of LD, but not GD high gamma effects within superior temporal plane, and the opposite pattern (GD, but not LD high gamma effects) in PFC.

Figure 4.

Figure 4.

Spatial distribution of sites exhibiting LD and GD effects (panels a and b, respectively) in a representative subject (R369). Sites that feature significant differences between responses to standard and deviant stimuli, as measured by AEP and high gamma responses, are shown in cyan and magenta, respectively. Sites that didn’t exhibit either significant difference are shown in white. Dashed outlines indicate absence of response to the initial portion of the stimuli (“No response”). Sites that were not included in the analysis are marked with X. Insets: MRI coronal section through the right temporal lobe (section plane indicated by a dashed line) showing location of depth electrode contacts in the right amygdala that exhibited AEP local and deviance effects.

While responses to the initial portion of the acoustic stimuli (i.e., the first four vowels) were present in each ROI (sites with solid outlines in Fig. 4 and Supplementary Fig. 2), not every recording site that featured LGD effects responded to the acoustic stimulus prior to the emergence of deviance (sites with dashed outlines in Fig. 4 and Supplementary Fig. 2). An example of such site is shown in Figure 3b (PFC site ‘D’) that did not respond to the first four stimuli, nor even to the fifth stimulus when it was a local standard. This deviance-selective response pattern was observed predominantly in higher-order areas, but not in core auditory cortex (HGPM) and adjacent non-core auditory areas.

Across the entire subject cohort, there was a substantial spatial overlap in LD and GD effects, consistent with involvement of many sites in both LD and GD networks (Fig. 5). AEP LD effects were prominent in the superior temporal plane and were usually associated with GD effects (Fig. 5a). In contrast, sites exclusively responsive to GD were rare here. Beyond the superior temporal plane, AEP LD effects were widely distributed throughout all ROIs and often co-located with GD effects. Sites exclusively exhibiting AEP GD effects were rare in auditory cortex on the STG, yet prevalent in surrounding areas, including MTG, SMG, angular gyrus, as well as PFC.

Figure 5.

Figure 5.

Spatial distribution of sites exhibiting significant LD and GD effects as measured by the AEP and high gamma activity (panels a and b, respectively). Summary of data from six subjects, plotted in MNI coordinate space and projected onto FreeSurfer average template brain. Top-down views of the right superior temporal plane are plotted underneath side views of the right lateral hemispheric convexity, aligned with respect to the yMNI coordinate. Sites exhibiting significant LD effects only, both LD and GD effects, and GD effects only, are depicted in the left, middle and right column, respectively. Sites are color-coded based on their ROI assignment in each individual subject.

A sparser representation of LGD effects was seen in high gamma activity (Fig. 5b). In contrast to AEP, sites in the superior temporal plane largely exhibited only high gamma LD effects, and not GD effects. High gamma GD effects (with or without LD effects) were more prevalent in the STG (25 out of 95 STG sites) compared to all other ROIs. Within the PFC, high gamma GD effects, but not LD effects, were observed.

3.4. Temporal properties of LD and GD effects

Onset latencies (i.e. the timing of the first significant difference after the fifth vowel stimulus) of the LD effects were shortest in auditory cortex located in the superior temporal plane (HGPM, HGAL and PT), and became progressively longer in non-core auditory cortex on the lateral STG, auditory-related cortex and PFC (Fig. 6a, left column). AEP and high gamma onset latencies within the superior temporal plane were comparable between core (HGPM) and non-core regions (HGAL, PT) (AEP: p = 0.986, high gamma: p = 0.714; Kruskal-Wallis test). In contrast, onset latencies of AEP LD effects increased from HGPM to non-core auditory cortex on the lateral STG, from STG to auditory-related cortex, and from auditory-related cortex to PFC (HGPM vs. STG p = 0.0229, z = −2.28, W = 1585; STG vs. auditory-related p = 0.001006, z = −3.29, W = 5321; auditory-related vs. PFC p = 0.00155, z = −3.17, W = 9655; two-tailed Wilcoxon rank sum tests). Likewise, onset latencies of high gamma LD effects increased from HGPM to STG (p = 0.00382, z = −2.89, W = 724), although no significant difference was found between STG and auditory-related cortex (p = 0.176, z = −1.35, W = 1085; two-tailed Wilcoxon rank sum tests), and high gamma LD effects were absent in PFC. The relatively low occurrence of LGD effects within PP (LD AEP: 6 sites out of 15 across 5 subjects; LD high gamma: 2 sites; GD AEP: 9 sites; GD high gamma: 0 sites) precluded comparisons of LGD effect onset latencies in this region with other ROIs.

Figure 6.

Figure 6.

Temporal properties of LD and GD effects. a: Onset latencies of LD and GD effects (left and right column, respectively) across ROIs. Data from six subjects. Onset latencies of AEP and high gamma effects are shown in cyan and magenta, respectively. Box and whiskers plots (across-contact medians, quartiles, 10th and 90th percentiles) are shown for ROIs with at least ten sites across the six subjects exhibiting significant deviance effects. Gray box represents RTs across the six subjects. b: Time course of LD and GD effects (left and right column, respectively). Total number of sites across all subjects exhibiting significant effects is plotted as a function of time after the fifth vowel onset for AEP and high gamma in cyan and magenta, respectively. Dashed lines represent four exemplar time points depicted in panel c. c: Spatial distribution of LD and GD effects (upper and lower row) at four representative time points (left to right: 100, 225, 400 and 700 ms after fifth vowel onset). Sites that exhibited significant deviance effects at these time points are shown in cyan and magenta for AEP and high gamma, respectively. Data from six subjects, plotted in MNI coordinate space and projected onto FreeSurfer average template brain.

A markedly different latency profile was seen for GD effects (Fig. 6a, right column). Onset latencies were generally longer compared to LD effects (Table 3). A total of 176 sites exhibited both LD and GD AEP effects, while 35 sites were characterized by presence of both LD and GD high gamma effects. For these sites, pairwise comparisons revealed that LD effects were significantly earlier (AEP: p = 2.84×10−14, z = −7.61, W = 2640; high gamma: p = 6.645×10−6, z = −4.51, W = 40; two-tailed Wilcoxon signed rank tests).

Table 3.

Comparison of LD and GD effect onset latencies across ROIs.

ROI AEP LD vs. GD High gamma LD vs. GD
LD effect GD effect LD Vs. GD
p-value
LD effect GD effect LD vs. GD
p-value
Median
lat. (ms)
nsites Median
lat. (ms)
nsites Median
lat. (ms)
nsites Median
lat. (ms)
nsites
HGPM 88.5 36 214 27 2.89×10−5 57 28 192 4 0.00333
HGAL 71 21 406 19 1.71×10−7 64 11
PT 78 16 405 14 2.04×10−5 61 10 215.5 4 0.00199
PP 217.5 6 353 9 0.627 143 2
STG 104.5 70 350 25 3.31×10−7 73 39 163.5 24 4.44×10−9
Auditory−related 147 114 240 64 2.75×10−8 84.5 20 208 25 8.14×10−5
PFC 194 75 307 44 0.00326 476 11

High gamma GD effects were relatively uncommon in the superior temporal plane (4/36 sites in HGPM, 0/22 in HGAL, 4/16 sites in PT and 0/15 in PP), precluding statistical analyses of onset latencies in these ROIs. Of the remaining ROIs, median latencies of high gamma GD effects were shortest in lateral STG, though they did not differ significantly from those in auditory-related cortex (STG vs. auditory-related p = 0.0854, z = −1.72, W = 513.5). Median latencies of high gamma GD effects were longest in PFC (STG vs. PFC p = 7.43×10−5, z = −3.96, W = 320; auditory-related vs. PFC: p = 0.00251, z = −3.02, W = 374).

The paucity of high gamma GD effects in HGPM, HGAL, PT and PP contrasted with presence of AEP GD effects. Onset latencies of AEP GD effects were different across ROIs (p = 3.91×10−5; Kruskal-Wallis test). Interestingly, AEP GD onset latencies were the shortest in HGPM (p < 0.05 in all pairwise comparisons; two-tailed Wilcoxon rank sum tests), even though the shortest-latency high gamma GD effects were found on STG, and high gamma GD effects were sparse in core auditory cortex. High gamma GD effects preceded AEP GD effects in all ROIs except HGPM (HGPM: p = 0.304, z = −1.03, W = 569; p < 0.05 for pairwise comparisons in all other ROIs; two-tailed Wilcoxon rank sum tests). Finally, high gamma activity in STG and auditory-related cortex preceded the subjects’ behavioral responses (STG: p = 3.65×10−14, z = −7.57, W = 590; auditory-related: p = 6.89×10−10, z = −6.17, W = 1387), while high gamma GD effect onset latencies in PFC overlapped with the subjects’ reaction times (p = 0.809, z = −−0.242, W = 1787).

Overall timing of LD and GD effects provides evidence for region- and effect-specific activation of underlying generators. AEP LGD effects had a more extended and complex time course compared to high gamma effects (Fig. 6b). Figure 6b shows the estimated overall probability density function of cortical activation during LD and GD effects, computed by summing across subjects the number of sites exhibiting significant effects at each time point relative to the onset of the fifth vowel. AEP LG and GD effects were characterized by multi-peaked time courses, while high gamma effects exhibited simpler time courses. AEP LD effects were most prominent at 200–250 ms after the fifth vowel onset, while AEP GD effects were dominated by a peak at 400 ms. The overall time course of the high gamma LD effect was characterized by a single peak around 125 ms after the fifth vowel onset, while the high gamma GD effect peaked at 250 ms.

The spatiotemporal evolution of LGD effects is summarized in Figure 6c for four exemplar time points (see dashed lines in Fig. 6b). Early LD effects were represented by distributed AEPs in all ROIs and high gamma effects in all ROIs except PFC. Over time, LD effects became progressively restricted to AEPs within temporal cortex. GD effects were more sparsely represented, and featured high gamma activity that developed at longer latency and was more broadly distributed across all time points. Taken together, these results demonstrate that LD and GD effects are distinct both in their spatial and temporal profiles.

4. Discussion

4.1. Summary of findings

The data reveal distinct spatial and temporal profiles of neural responses underlying auditory novelty detection at short (LD effect) vs. long (GD effect) time scales. Both effects, as indexed by AEPs, were broadly distributed across auditory, auditory-related and prefrontal cortical regions. In contrast, high gamma LD effects were focused mainly in auditory core and non-core areas, while high gamma GD effects were more concentrated in auditory-related cortex and PFC. The regional distribution of latencies of LD effects can be interpreted within the context of the predictive coding model. Latencies were earliest within core and non-core regions of auditory cortex and became progressively longer along the processing hierarchy. This pattern is consistent with FF signal propagation of prediction errors. By contrast, the shortest latencies of high gamma GD effects were observed in STG and auditory-related cortex, and preceded the onset of AEP GD effects both within those areas and in other ROIs. Thus, signal propagation, and presumably the composition of the network driving the predictive coding process, differs for LD versus GD effects. The latter appear to be initiated in higher order regions of auditory cortex and in auditory-related temporo-parietal cortex rather than in PFC. These findings help clarify the involvement of specific brain regions in the detection of auditory novelty consistent within a hypothesized predictive coding network.

4.2. Cortical generators of LD and GD effects

While auditory cortex has been consistently observed to contribute to LD effects, there is conflicting evidence for the additional involvement of frontal cortical generators. Some studies reported that LD effects are confined to temporal cortex (Baudena et al., 1995; Bekinschtein et al., 2009; El Karoui et al., 2015), whereas others have also shown involvement of PFC using ECoG (Liasis et al., 2001; Rosburg et al., 2005) as well as EEG/MEG and fMRI (reviewed in Deouell, 2007). The data presented here indicate widespread cortical activation associated with LD effects: AEPs were observed in all ROIs, including parietal and frontal cortex, and high gamma signals observed in all areas except PFC. These data suggest involvement of multiple levels of the auditory cortical hierarchy, though the absence of high gamma LD effects in PFC suggests a passive role of the frontal generator for novelty detection over this time scale.

We observed prominent GD effects (AEPs and high gamma) in STG, auditory-related cortex and in PFC, consistent with previous reports (Bekinschtein et al., 2009; Chennu et al., 2013; El Karoui et al., 2015). Although there was a substantial spatial overlap of AEP GD and LD effects, recording sites exhibiting only GD effects were largely outside of auditory cortex ROIs. The spatial distributions of high gamma GD and LD effects exhibited less overlap than for AEPs, suggesting that the populations of cells initiating these effects are largely distinct, with a common node in both networks located in STG.

4.3. Auditory novelty detection over multiple time scales

There are multiple electrophysiological signatures of auditory novelty detection, distinct in their generators, latency, and dependence on brain state (Friedman et al., 2001; Kok, 2001; Näätänen and Alho, 1995; Näätänen et al., 2011). At a mechanistic level, responses to auditory novelty within auditory cortex can be ascribed in part to stimulus-specific adaptation (Eliades et al., 2014; Fishman and Steinschneider, 2012; Ulanovsky et al., 2003). This process, in which the long-term statistics of preceding sounds modulate responses to subsequent sounds in a context-dependent manner, likely contributes to both LD and GD effects within auditory cortex (Holt, 2006; Ulanovsky et al., 2004). However, modeling studies (Wacongne et al., 2012), as well as evidence from omission responses (Wacongne et al., 2011) and from MMNs generated in response to abstract rule violations (Paavilainen, 2013), suggest that higher order and especially FB-mediated processes are also involved in generating these novelty signals. The presence of AEP GD effects (a proxy of synaptic inputs) coupled with the scarcity of high gamma GD effects (a surrogate for neuronal firing) in core auditory cortex and surrounding auditory cortical areas in the superior temporal plane support this interpretation.

While it is necessary to be cautious when comparing intracranial data with scalp-recorded potentials or MEG fields, it is noteworthy that the AEP LD effect reported here was prominent at 150–250 ms after the fifth vowel onset. This timing is consistent with previous reports identifying the LD effect with the MMN (Bekinschtein et al., 2009; Wacongne et al., 2011). More broadly, onset latencies of AEP LD effects across ROIs had a wide range (50 – 500 ms; see Fig. 6a), comparable to that reported in a previous intracranial study (70 – 440 ms; El Karoui et al., 2015), and overall time course of AEP LD effects extended over nearly the entire 800 ms analysis window (see Fig. 6b). Such a wide range of latencies is consistent with involvement of a broad network in detection of local deviance. Indeed, AEP LD effects were observed in all ROIs investigated. Thus, the present findings are in disagreement with previous studies that reported focal expression of the LD effect in STG (Bekinschtein et al., 2009; Liebenthal et al., 2003; Sabri et al., 2004; Wacongne et al., 2011), but are consistent with studies that reported LD effects within PFC (Durschmid et al., 2016; Rinne et al., 2000; Uhrig et al., 2016). Some of these disparities may result from differences in signal-to-noise ratios in the recordings. LD effects were prominent and of large amplitude in auditory cortex and adjacent regions, and it is these areas where differences in responses between standard and deviant stimuli are most likely to reach statistical significance.

In the current study, a progressive increase in onset latency of both AEP and high gamma LD effects was observed from core to non-core to auditory-related cortex to PFC (see Fig. 6a). This latency profile is consistent with hierarchical schemes of auditory cortical processing (e.g. Hackett, 2015; Rauschecker and Scott, 2009; Romanski and Averbeck, 2009), patterns of connectivity across processing stages (Brugge et al., 2003; Garell et al., 2013; Kumar et al., 2011), and temporal sequence of activation along this hierarchy (e.g. Celesia, 1976; Liegeois-Chauvel et al., 1994; Nourski et al., 2014). It could be argued that this onset latency profile can be accounted for by a combination of progressively longer response latencies to the acoustic stimulus and error signals that are generated independently of message passing between cortical areas (e.g. mechanisms such as stimulus-specific adaptation). This is likely to be a contributing factor within the classically defined auditory cortex, where responses to each vowel that precedes the deviant stimulus are robust (e.g., sites A and B in Fig. 3b). However, at sites that do not respond to the acoustic stimulus in the absence of deviance (e.g. site D in Fig. 3b), the emergence of the error signal would necessarily rely on integration of inputs from other cortical regions. Importantly, even error signals generated in part by stimulus-specific adaptation will be propagated along the cortical hierarchy. Additional evidence supporting feedforward propagation of the LD signal is found in recording sites in PFC, where AEP LD effects (a proxy of synaptic inputs) are observed in the absence of high gamma effects (a surrogate for neuronal firing).

However, the temporally and spatially distributed response profiles suggest mechanisms more complex than a simple FF process. The range of latencies in each ROI, especially the longer latencies of AEPs relative to high gamma ERBP, and the extended overall time course of LD effects, are consistent with bidirectional signal flow in a broad network operating over multiple spatial and temporal scales, part of which is automatic/pre-attentive, and part of which is active/predictive (El Karoui et al., 2015; Näätänen et al., 2011; Sculthorpe et al., 2009; Strauss et al., 2015). The LD effects that persist beyond the onset components illustrated in Figure 6a may reflect the iterative process whereby the error signals are utilized to update predictions, modifying both FB signals and subsequent error signals.

Unlike the LD effect, the GD effect is dominated by response components with onset latencies greater than 200 ms, and has been associated with the P3b component of the event-related potential (Bekinschtein et al., 2009; Wacongne et al., 2011), a task- and attention-dependent index of contextual updating that operates over time scales of seconds to minutes (Kok, 2001; Polich, 2007). In the present study, the latency distributions for LD and GD effects overlapped (see Fig. 6a), but the mean latencies of GD effects were significantly longer than those of LD across a wide range of recording sites (see Table 3). Previous reports have emphasized a distributed network engaged by global deviance detection, involving anterior cingulate, parietal, temporal and prefrontal regions (Bekinschtein et al., 2009; El Karoui et al., 2015; Uhrig et al., 2014). This overlaps with the postulated global workspace network subserving conscious sensory processing (Dehaene and Changeux, 2011; Dykstra et al., 2017). Similar to these studies, the present study revealed a GD network widely distributed in space and time, with significant responses observed in all regions of interest, from core auditory cortex to PFC (see Fig. 5), and over the time window 100 – 800 ms after the onset of the fifth vowel (see Fig. 6b). The spatial overlap of LD and GD networks may either be representative of a commonality in mechanisms underlying the two effects (e.g. forward masking at multiple time scales) or the co-location of two distinct processes that remain to be elucidated (see Fig. 5).

The shortest onset latencies of GD effects corresponded to high gamma ERBP in the STG and adjacent auditory-related cortex. By contrast, high gamma GD effects were uncommon in the superior temporal plane ROIs (HGPM, HGAL, PT and PP), compared to reliably observed AEP GD effects (see Fig. 5). Given that high gamma is thought to represent a population-level surrogate for unit activity (Mukamel et al., 2005; Nir et al., 2007; Steinschneider et al., 2008), the onset latency data support a model in which GD effects trigger earliest spiking activity in STG and auditory-related cortex, and signals propagate both down the hierarchy via FB projections triggering AEPs in lower areas and up the hierarchy to PFC via FF projections where they trigger both AEPs and spiking activity. This parallels the results of a recent intracranial study demonstrating the involvement of STG in predictive coding for speech sequences (Leonard et al., 2015). This is in contrast to models in which long time scale predictive coding relies on FB projections from frontal cortex (cf. Durschmid et al., 2016), and may be relevant to the current debate over the importance of frontal versus parietal regions for the neural basis of consciousness (Boly et al., 2017; Odegaard et al., 2017; Siclari et al., 2017). Functional connectivity analyses may be necessary to further identify specific sources of GD effects.

At present, it is unclear to what degree GD effects occur independently from LD effects in STG. Emergence of GD effects in auditory-related cortex can occur in the absence of LD effects (see Fig. 5). This observation suggests that temporo-parietal auditory-related cortex is operating over temporal windows of integration greater than those that characterize auditory cortex. This is parsimonious with regional differences in speech processing at acoustic-phonetic and lexico-semantic levels, wherein the former is processed in STG and the latter in surrounding auditory-related regions (Nourski et al., 2016).

AEP GD effects had relatively short latencies in HGPM (see Fig. 6a). The time course of AEP GD effects in HGPM was similar to the overall time course of AEP GD effects, with the early components most prominent at 100–200 ms followed by components centered at 400 and 700 ms (see Fig. 6b). As noted previously (El Karoui et al., 2015; Wacongne et al., 2011), the early components likely arise due to the partial interdependence of LD and GD effects. This interaction, or context dependence, arises because the magnitude of LD effects is inversely correlated with the probability of the deviant stimulus. Thus, trial sequences in which the LD stimulus is also the GD (see Fig. 1c) will have large, short-latency responses to that stimulus. This may simply reflect the difference in probability of occurrence of the LD stimulus, but could also reflect violation of a second order prediction about the vowel sequence that operates on a longer time scale (Ulanovsky et al., 2004). Modifications of the classic LGD paradigm that include token omission as a means to elicit global deviance may help clarify these outstanding issues (Wacongne et al., 2011).

4.4. Caveats and limitations

Principal caveats of the present study are related to the subject population and limitations of the experimental paradigm. The former is an issue of concern inherent to all human intracranial studies. Experimental subjects have a neurologic disorder, and their brain responses may not be representative of a healthy population (Nourski and Howard, 2015). To address this issue, recordings from epileptic foci in each subject were excluded from the analyses. Most importantly, results were replicated in multiple subjects, who had different neurologic histories, seizure foci and antiepileptic drug regimens. Cognitive function in each subject was in the average range, and all subjects were able to perform the experimental task successfully.

Another potential limitation inherent to studying patients undergoing chronic invasive monitoring is electrode coverage based solely on clinical needs. Accordingly, sampling of cortical regions that might be involved in auditory predictive coding is determined by the electrode coverage in each subject. In particular, this limitation precluded investigating the contribution of thalamocortical loops to the cortical processing of auditory novelty.

In general, electrode coverage is unilateral. This issue precludes within-subject comparisons across cerebral hemispheres. Four of the six subjects studied here had coverage of the non-dominant right hemisphere. This raises the question of whether recordings in the non-dominant hemisphere are directly relevant to the experimental task. Results of non-invasive studies including those using fMRI, EEG and MEG all demonstrate bihemispheric processing of auditory novelty (Bekinschtein et al., 2009; Wacongne et al., 2011; Strauss et al., 2015). The relatively poorer task performance of the two subjects with left hemisphere seizure foci and left hemisphere electrode coverage likely reflected their overall lower level of alertness as indexed by OAA/S scores. The extent of electrode coverage and spatial distribution of LGD effects in these two subjects was comparable with the rest of the subject cohort.

It is also important to consider the possibility that during the target detection task, neural activity associated with the motor act of button press may temporally overlap with responses associated with deviance detection per se. This confound was minimized in the present study by having subjects operate the response button with the hand ipsilateral to the hemisphere from which recordings were made (see Nourski et al. (2016), for a further discussion of this issue).

Finally, while there is a progression of response latencies along the hypothesized cortical hierarchy (see Fig. 6a), and known anatomical connectivity between these ROIs, the results of this study do not unambiguously establish propagation of error signals across this hierarchy. As discussed above (Section 4.3), it is unlikely that the observed LGD effects are solely due to each brain region independently generating its own error signals with different latency profiles. However, the relative contribution of locally generated error signals vs. signals propagated across the hierarchy to the measured LGD effects is yet to be determined. Ongoing work seeks to clarify the role of hierarchical signal propagation by characterizing functional connectivity patterns associated with LGD detection (Banks et al., 2017). Studies in animal models are likely needed to elucidate the specific computations at each level of the proposed hierarchy.

4.5. Functional significance

The LGD experimental paradigm represents an important empirical test of the predictive coding model. First, the first four vowels within an LGD sequence are always the same and thus by themselves would not generate an error signal in this framework. Cortical responses to this initial portion of the LGD stimuli become progressively weaker along the ascending auditory hierarchy, in accordance with the predictive coding model. Second, the relative involvement of lower and higher hierarchical stages in processing of local and global deviance, respectively, highlights the utility of the results obtained in this experimental paradigm, and may enhance novelty response generator identification as seen in non-invasive studies (e.g. Symonds et al. 2017). This paradigm has potential use as part of a noninvasive metric of conscious processing. For example, evidence suggests that clinical evaluation alone is insufficient to fully evaluate patients with disorders of consciousness (Bayne et al., 2017; Bernat, 2017; Naccache, 2017; Schnakers et al., 2009). The present study also serves as a foundation for ongoing and future studies that seek to gain better understanding of sensitivity of auditory cortical novelty processing to general anesthesia (Nourski et al., 2018). In turn, knowledge gained from this work will contribute to improvements in accuracy of assessing conscious processing in clinical and non-clinical settings.

Supplementary Material

Supplementary material

Acknowledgements

We are grateful to Haiming Chen, Phillip Gander, Bradley Hindman, Christopher Kovach, Bryan Krause, Rashmi Mueller, Caitlin Murphy, Yuri Saalmann, Beau Snoad and Xiayi Wang for help with data collection, analysis and comments on the manuscript.

Funding

This work was supported by the National Institutes of Health (grant numbers R01-DC04290, R01-GM109086, UL1-RR024979); National Science Foundation (grant number CRCNS-IIS-1515678); the Hoover Fund.

Abbreviations:

AEP

averaged evoked potential

CT

computerized tomography

ECoG

electrocorticography

ERBP

event-related band power

FB

feedback

FF

feedforward

GD

global deviant

GS

global standard

HG

Heschl’s gyrus

HGAL

anterolateral Heschl’s gyrus

HGPM

posteromedial Heschl’s gyrus

LGD

local/global deviant

LD

local deviant

LS

local standard

MMN

mismatch negativity

MNI

Montreal Neurological Institute

MRI

magnetic resonance imaging

MTG

middle temporal gyrus

OAA/S

Observer’s Assessment of Alertness/Sedation

PFC

prefrontal cortex

PP

planum polare

PT

planum temporale

ROI

region of interest

RT

reaction time

STG

superior temporal gyrus

REFERENCES CITED

  1. Alho K, 1995. Cerebral generators of mismatch negativity (MMN) and its magnetic counterpart (MMNm) elicited by sound changes. Ear Hear 16, 38–51. [DOI] [PubMed] [Google Scholar]
  2. Banks MI, Nourski KV, Kawasaki H, Howard III MA, 2017. Effects of propofol anesthesia on connectivity within the human auditory cortical hierarchy: An intracranial electrophysiological study. Society for Neuroscience 47th Annual Meeting, 489.19. Society for Neuroscience, Washington, DC. [Google Scholar]
  3. Bar M, 2009. Predictions: a universal principle in the operation of the human brain. Introduction. Philos Trans R Soc Lond B Biol Sci 364, 1181–1182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bastos AM, Usrey WM, Adams RA, Mangun GR, Fries P, Friston KJ, 2012. Canonical microcircuits for predictive coding. Neuron 76, 695–711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Baudena P, Halgren E, Heit G, Clarke JM, 1995. Intracerebral potentials to rare target and distractor auditory and visual stimuli. III. Frontal cortex. Electroencephalogr Clin Neurophysiol 94, 251–264. [DOI] [PubMed] [Google Scholar]
  6. Bayne T, Hohwy J, Owen AM, 2017. Reforming the taxonomy in disorders of consciousness. Ann Neurol 82, 866–872. [DOI] [PubMed] [Google Scholar]
  7. Bekinschtein TA, Dehaene S, Rohaut B, Tadel F, Cohen L, Naccache L, 2009. Neural signature of the conscious processing of auditory regularities. Proc Natl Acad Sci U S A 106, 1672–1677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Benjamini Y, Hochberg Y, 1995. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J Royal Stat Soc Series B Stat Methodol 57, 289–300. [Google Scholar]
  9. Bernat JL, 2017. Nosologic considerations in disorders of consciousness. Ann Neurol 82, 863–865. [DOI] [PubMed] [Google Scholar]
  10. Blank H, Davis MH, 2016. Prediction Errors but Not Sharpened Signals Simulate Multivoxel fMRI Patterns during Speech Perception. PLoS Biol 14, e1002577. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Boly M, Massimini M, Tsuchiya N, Postle BR, Koch C, Tononi G, 2017. Are the Neural Correlates of Consciousness in the Front or in the Back of the Cerebral Cortex? Clinical and Neuroimaging Evidence. J Neurosci 37, 9603–9613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Brugge JF, Volkov IO, Garell PC, Reale RA, Howard MA 3rd, 2003. Functional connections between auditory cortex on Heschl’s gyrus and on the lateral superior temporal gyrus in humans. J Neurophysiol 90, 3750–3763. [DOI] [PubMed] [Google Scholar]
  13. Celesia GG, 1976. Organization of auditory cortical areas in man. Brain 99, 403–414. [DOI] [PubMed] [Google Scholar]
  14. Chennu S, Noreika V, Gueorguiev D, Blenkmann A, Kochen S, Ibanez A, Owen AM, Bekinschtein TA, 2013. Expectation and attention in hierarchical auditory prediction. J Neurosci 33, 11194–11205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chernik DA, Gillings D, Laine H, Hendler J, Silver JM, Davidson AB, Schwam EM, Siegel JL, 1990. Validity and reliability of the Observer’s Assessment of Alertness/Sedation Scale: study with intravenous midazolam. J Clin Psychopharmacol 10, 244–251. [PubMed] [Google Scholar]
  16. Clark A, 2013. Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav Brain Sci 36, 181–204. [DOI] [PubMed] [Google Scholar]
  17. Dehaene S, Changeux JP, 2011. Experimental and theoretical approaches to conscious processing. Neuron 70, 200–227. [DOI] [PubMed] [Google Scholar]
  18. den Ouden HE, Kok P, de Lange FP, 2012. How prediction errors shape perception, attention, and motivation. Front Psychol 3, 548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Deouell LY, 2007. The Frontal Generator of the Mismatch Negativity Revisited. Journal of Psychophysiology 21, 188–203. [Google Scholar]
  20. Destrieux C, Fischl B, Dale A, Halgren E, 2010. Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature. Neuroimage 53, 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Destrieux C, Terrier LM, Andersson F, Love SA, Cottier JP, Duvernoy H, Velut S, Janot K, Zemmoura I, 2017. A practical guide for the identification of major sulcogyral structures of the human cortex. Brain Struct Funct 222, 2001–2015. [DOI] [PubMed] [Google Scholar]
  22. Durschmid S, Edwards E, Reichert C, Dewar C, Hinrichs H, Heinze HJ, Kirsch HE, Dalal SS, Deouell LY, Knight RT, 2016. Hierarchy of prediction errors for auditory events in human temporal and frontal cortex. Proc Natl Acad Sci U S A 113, 6755–6760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Dykstra AR, Cariani PA, Gutschalk A, 2017. A roadmap for the study of conscious audition and its neural basis. Philos Trans R Soc Lond B Biol Sci 372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Edwards E, Soltani M, Deouell LY, Berger MS, Knight RT, 2005. High gamma activity in response to deviant auditory stimuli recorded directly from human cortex. J Neurophysiol 94, 4269–4280. [DOI] [PubMed] [Google Scholar]
  25. Egner T, Monti JM, Summerfield C, 2010. Expectation and surprise determine neural population responses in the ventral visual stream. J Neurosci 30, 16601–16608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. El Karoui I, King JR, Sitt J, Meyniel F, Van Gaal S, Hasboun D, Adam C, Navarro V, Baulac M, Dehaene S, Cohen L, Naccache L, 2015. Event-Related Potential, Time-frequency, and Functional Connectivity Facets of Local and Global Auditory Novelty Processing: An Intracranial Study in Humans. Cereb Cortex 25, 4203–4212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Eliades SJ, Crone NE, Anderson WS, Ramadoss D, Lenz FA, Boatman-Reich D, 2014. Adaptation of high-gamma responses in human auditory association cortex. J Neurophysiol 112, 2147–2163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Fishman YI, Steinschneider M, 2012. Searching for the mismatch negativity in primary auditory cortex of the awake monkey: deviance detection or stimulus specific adaptation? J Neurosci 32, 15747–15758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Friedman D, Cycowicz YM, Gaeta H, 2001. The novelty P3: an event-related brain potential (ERP) sign of the brain’s evaluation of novelty. Neurosci Biobehav Rev 25, 355–373. [DOI] [PubMed] [Google Scholar]
  30. Friston K, 2005. A theory of cortical responses. Philos Trans R Soc Lond B Biol Sci 360, 815–836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Friston K, 2010. The free-energy principle: a unified brain theory? Nat Rev Neurosci 11, 127–138. [DOI] [PubMed] [Google Scholar]
  32. Garell PC, Bakken H, Greenlee JD, Volkov I, Reale RA, Oya H, Kawasaki H, Howard MA, Brugge JF, 2013. Functional connection between posterior superior temporal gyrus and ventrolateral prefrontal cortex in human. Cereb Cortex 23, 2309–2321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Garrido MI, Kilner JM, Stephan KE, Friston KJ, 2009. The mismatch negativity: a review of underlying mechanisms. Clin Neurophysiol 120, 453–463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Hackett TA, 2015. Anatomic organization of the auditory cortex. Handb Clin Neurol 129, 27–53. [DOI] [PubMed] [Google Scholar]
  35. Halgren E, Marinkovic K, Chauvel P, 1998. Generators of the late cognitive potentials in auditory and visual oddball tasks. Electroencephalogr Clin Neurophysiol 106, 156–164. [DOI] [PubMed] [Google Scholar]
  36. Heilbron M, Chait M, 2017. Great expectations: Is there evidence for predictive coding in auditory cortex? Neuroscience, in press. [DOI] [PubMed]
  37. Heller R, Stanley D, Yekutieli D, Rubin N, Benjamini Y, 2006. Cluster-based analysis of FMRI data. Neuroimage 33, 599–608. [DOI] [PubMed] [Google Scholar]
  38. Hillenbrand J, Getty LA, Clark MJ, Wheeler K, 1995. Acoustic characteristics of American English vowels. J Acoust Soc Am 97, 3099–3111. [DOI] [PubMed] [Google Scholar]
  39. Hinton GE, 2010. Learning to represent visual input. Philos Trans R Soc Lond B Biol Sci 365, 177–184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hohwy J, Roepstorff A, Friston K, 2008. Predictive coding explains binocular rivalry: an epistemological review. Cognition 108, 687–701. [DOI] [PubMed] [Google Scholar]
  41. Holt LL, 2006. The mean matters: effects of statistically defined nonspeech spectral distributions on speech categorization. J Acoust Soc Am 120, 2801–2817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Howard MA 3rd, Volkov IO, Abbas PJ, Damasio H, Ollendieck MC, Granner MA, 1996. A chronic microelectrode investigation of the tonotopic organization of human auditory cortex. Brain Res 724, 260–264. [DOI] [PubMed] [Google Scholar]
  43. Howard MA, Volkov IO, Mirsky R, Garell PC, Noh MD, Granner M, Damasio H, Steinschneider M, Reale RA, Hind JE, Brugge JF, 2000. Auditory cortex on the human posterior superior temporal gyrus. J Comp Neurol 416, 79–92. [DOI] [PubMed] [Google Scholar]
  44. Joos K, Gilles A, Van de Heyning P, De Ridder D, Vanneste S, 2014. From sensation to percept: the neural signature of auditory event-related potentials. Neurosci Biobehav Rev 42, 148–156. [DOI] [PubMed] [Google Scholar]
  45. Kok A, 2001. On the utility of P3 amplitude as a measure of processing capacity. Psychophysiology 38, 557–577. [DOI] [PubMed] [Google Scholar]
  46. Kovach CK, Gander PE, 2016. The demodulated band transform. J Neurosci Methods 261, 135–154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kumar S, Sedley W, Nourski KV, Kawasaki H, Oya H, Patterson RD, Howard MA 3rd, Friston KJ, Griffiths TD, 2011. Predictive coding and pitch processing in the auditory cortex. J Cogn Neurosci 23, 3084–3094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Leonard MK, Bouchard KE, Tang C, Chang EF, 2015. Dynamic encoding of speech sequence probability in human temporal cortex. J Neurosci 35, 7203–7214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Liasis A, Towell A, Alho K, Boyd S, 2001. Intracranial identification of an electric frontal-cortex response to auditory stimulus change: a case study. Brain Res Cogn Brain Res 11, 227–233. [DOI] [PubMed] [Google Scholar]
  50. Liebenthal E, Ellingson ML, Spanaki MV, Prieto TE, Ropella KM, Binder JR, 2003. Simultaneous ERP and fMRI of the auditory cortex in a passive oddball paradigm. Neuroimage 19, 1395–1404. [DOI] [PubMed] [Google Scholar]
  51. Liegeois-Chauvel C, Musolino A, Badier JM, Marquis P, Chauvel P, 1994. Evoked potentials recorded from the auditory cortex in man: evaluation and topography of the middle latency components. Electroencephalography & Clinical Neurophysiology 92, 204–214. [DOI] [PubMed] [Google Scholar]
  52. Maris E, Oostenveld R, 2007. Nonparametric statistical testing of EEG- and MEG-data. J Neurosci Methods 164, 177–190. [DOI] [PubMed] [Google Scholar]
  53. Mashour GA, 2013. Cognitive unbinding: a neuroscientific paradigm of general anesthesia and related states of unconsciousness. Neurosci Biobehav Rev 37, 2751–2759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Mukamel R, Gelbard H, Arieli A, Hasson U, Fried I, Malach R, 2005. Coupling between neuronal firing, field potentials, and FMRI in human auditory cortex. Science 309, 951–954. [DOI] [PubMed] [Google Scholar]
  55. Mumford D, 1992. On the computational architecture of the neocortex. II. The role of cortico-cortical loops. Biol Cybern 66, 241–251. [DOI] [PubMed] [Google Scholar]
  56. Näätänen R, Alho K, 1995. Mismatch negativity--a unique measure of sensory processing in audition. Int J Neurosci 80, 317–337. [DOI] [PubMed] [Google Scholar]
  57. Näätänen R, Kujala T, Winkler I, 2011. Auditory processing that leads to conscious perception: a unique window to central auditory processing opened by the mismatch negativity and related responses. Psychophysiology 48, 4–22. [DOI] [PubMed] [Google Scholar]
  58. Naccache L, 2017. Minimally conscious state or cortically mediated state? Brain 141, 949–960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Nagahama Y, Kovach CK, Ciliberto M, Joshi C, Rhone AE, Vesole A, Gander PE, Nourski KV, Oya H, Howard MA 3rd, Kawasaki H, Dlouhy BJ, 2017. Localization of musicogenic epilepsy to Heschl’s gyrus and superior temporal plane: case report. J Neurosurg, 129, 157–164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Nelken I, Ulanovsky N, 2007. Mismatch negativity and stimulus-specific adaptation in animal models. J Psychophysiol 21, 214–223. [Google Scholar]
  61. Nir Y, Fisch L, Mukamel R, Gelbard-Sagiv H, Arieli A, Fried I, Malach R, 2007. Coupling between Neuronal Firing Rate, Gamma LFP, and BOLD fMRI Is Related to Interneuronal Correlations. Current Biology 17, 1275–1285. [DOI] [PubMed] [Google Scholar]
  62. Nourski K, Rhone AE, Steinschneider M, Kawasaki H, Howard III MA, Banks MI, 2018. Cortical hierarchical processing of auditory novelty in awake and anesthetized states: An intracranial electrophysiology study., Assoc. Res. Otolaryngol. Abs, San Diego, CA, p. 727. [Google Scholar]
  63. Nourski KV, Howard MA 3rd, 2015. Invasive recordings in the human auditory cortex. Handb Clin Neurol 129, 225–244. [DOI] [PubMed] [Google Scholar]
  64. Nourski KV, Steinschneider M, McMurray B, Kovach CK, Oya H, Kawasaki H, Howard MA 3rd, 2014. Functional organization of human auditory cortex: investigation of response latencies through direct recordings. Neuroimage 101, 598–609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Nourski KV, Steinschneider M, Rhone AE, Howard Iii MA, 2016. Intracranial Electrophysiology of Auditory Selective Attention Associated with Speech Classification Tasks. Front Hum Neurosci 10, 691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Odegaard B, Knight RT, Lau H, 2017. Should a Few Null Findings Falsify Prefrontal Theories of Conscious Perception? J Neurosci 37, 9593–9602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Paavilainen P, 2013. The mismatch-negativity (MMN) component of the auditory event-related potential to violations of abstract regularities: a review. Int J Psychophysiol 88, 109–123. [DOI] [PubMed] [Google Scholar]
  68. Phipson B, Smyth GK, 2010. Permutation P-values should never be zero: calculating exact P-values when permutations are randomly drawn. Stat Appl Genet Mol Biol 9, 39. [DOI] [PubMed] [Google Scholar]
  69. Polich J, 2007. Updating P300: an integrative theory of P3a and P3b. Clin Neurophysiol 118, 2128–2148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Rao RP, Ballard DH, 1999. Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci 2, 79–87. [DOI] [PubMed] [Google Scholar]
  71. Rauschecker JP, Scott SK, 2009. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat Neurosci 12, 718–724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Ray S, Crone NE, Niebur E, Franaszczuk PJ, Hsiao SS, 2008. Neural correlates of high-gamma oscillations (60–200 Hz) in macaque local field potentials and their potential implications in electrocorticography. J Neurosci 28, 11526–11536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Reddy CG, Dahdaleh NS, Albert G, Chen F, Hansen D, Nourski K, Kawasaki H, Oya H, Howard MA 3rd, 2010. A method for placing Heschl gyrus depth electrodes. J Neurosurg 112, 1301–1307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Rinne T, Alho K, Ilmoniemi RJ, Virtanen J, Näätänen R, 2000. Separate time behaviors of the temporal and frontal mismatch negativity sources. Neuroimage 12, 14–19. [DOI] [PubMed] [Google Scholar]
  75. Romanski LM, Averbeck BB, 2009. The primate cortical auditory system and neural representation of conspecific vocalizations. Annu Rev Neurosci 32, 315–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Rosburg T, Trautner P, Dietl T, Korzyukov OA, Boutros NN, Schaller C, Elger CE, Kurthen M, 2005. Subdural recordings of the mismatch negativity (MMN) in patients with focal epilepsy. Brain 128, 819–828. [DOI] [PubMed] [Google Scholar]
  77. Sabri M, Kareken DA, Dzemidzic M, Lowe MJ, Melara RD, 2004. Neural correlates of auditory sensory memory and automatic change detection. Neuroimage 21, 69–74. [DOI] [PubMed] [Google Scholar]
  78. Schnakers C, Vanhaudenhuyse A, Giacino J, Ventura M, Boly M, Majerus S, Moonen G, Laureys S, 2009. Diagnostic accuracy of the vegetative and minimally conscious state: clinical consensus versus standardized neurobehavioral assessment. BMC Neurol 9, 35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Schonwiesner M, Novitski N, Pakarinen S, Carlson S, Tervaniemi M, Näätänen R, 2007. Heschl’s gyrus, posterior superior temporal gyrus, and mid-ventrolateral prefrontal cortex have different roles in the detection of acoustic changes. J Neurophysiol 97, 2075–2082. [DOI] [PubMed] [Google Scholar]
  80. Sculthorpe LD, Ouellet DR, Campbell KB, 2009. MMN elicitation during natural sleep to violations of an auditory pattern. Brain Res 1290, 52–62. [DOI] [PubMed] [Google Scholar]
  81. Sherman SM, Guillery RW, 2011. Distinct functions for direct and transthalamic corticocortical connections. J Neurophysiol 106, 1068–1077. [DOI] [PubMed] [Google Scholar]
  82. Siclari F, Baird B, Perogamvros L, Bernardi G, LaRocque JJ, Riedner B, Boly M, Postle BR, Tononi G, 2017. The neural correlates of dreaming. Nat Neurosci 20, 872–878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Smith ME, Halgren E, Sokolik M, Baudena P, Musolino A, Liegeois-Chauvel C, Chauvel P, 1990. The intracranial topography of the P3 event-related potential elicited during auditory oddball. Electroencephalogr Clin Neurophysiol 76, 235–248. [DOI] [PubMed] [Google Scholar]
  84. Steinschneider M, Fishman YI, Arezzo JC, 2008. Spectrotemporal Analysis of Evoked and Induced Electroencephalographic Responses in Primary Auditory Cortex (A1) of the Awake Monkey. Cereb Cortex 18, 610–625. [DOI] [PubMed] [Google Scholar]
  85. Strauss M, Sitt JD, King JR, Elbaz M, Azizi L, Buiatti M, Naccache L, van Wassenhove V, Dehaene S, 2015. Disruption of hierarchical predictive coding during sleep. Proc Natl Acad Sci U S A 112, E1353–1362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Symonds RM, Lee WW, Kohn A, Schwartz O, Witkowski S, Sussman ES, 2017. Distinguishing Neural Adaptation and Predictive Coding Hypotheses in Auditory Change Detection. Brain Topogr 30, 136–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Tononi G, Boly M, Massimini M, Koch C, 2016. Integrated information theory: from consciousness to its physical substrate. Nat Rev Neurosci 17, 450–461. [DOI] [PubMed] [Google Scholar]
  88. Uhrig L, Dehaene S, Jarraya B, 2014. A hierarchy of responses to auditory regularities in the macaque brain. J Neurosci 34, 1127–1132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Uhrig L, Janssen D, Dehaene S, Jarraya B, 2016. Cerebral responses to local and global auditory novelty under general anesthesia. Neuroimage 141, 326–340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Ulanovsky N, Las L, Farkas D, Nelken I, 2004. Multiple time scales of adaptation in auditory cortex neurons. J Neurosci 24, 10440–10453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Ulanovsky N, Las L, Nelken I, 2003. Processing of low-probability sounds by cortical neurons. Nat Neurosci 6, 391–398. [DOI] [PubMed] [Google Scholar]
  92. Wacongne C, Changeux JP, Dehaene S, 2012. A neuronal model of predictive coding accounting for the mismatch negativity. J Neurosci 32, 3665–3678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Wacongne C, Labyt E, van Wassenhove V, Bekinschtein T, Naccache L, Dehaene S, 2011. Evidence for a hierarchy of predictions and prediction errors in human cortex. Proc Natl Acad Sci U S A 108, 20754–20759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Whittingstall K, Logothetis NK, 2009. Frequency-band coupling in surface EEG reflects spiking activity in monkey visual cortex. Neuron 64, 281–289. [DOI] [PubMed] [Google Scholar]
  95. Yago E, Escera C, Alho K, Giard MH, 2001. Cerebral mechanisms underlying orienting of attention towards auditory frequency changes. Neuroreport 12, 2583–2587. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

RESOURCES