Abstract
The timing of slow auditory cortical activity aligns to the rhythmic fluctuations in speech. This entrainment is considered to be a marker of the prosodic and syllabic encoding of speech, and has been shown to correlate with intelligibility. Yet, whether and how auditory cortical entrainment is influenced by the activity in other speech–relevant areas remains unknown. Using source-localized MEG data, we quantified the dependency of auditory entrainment on the state of oscillatory activity in fronto-parietal regions. We found that delta band entrainment interacted with the oscillatory activity in three distinct networks. First, entrainment in the left anterior superior temporal gyrus (STG) was modulated by beta power in orbitofrontal areas, possibly reflecting predictive top-down modulations of auditory encoding. Second, entrainment in the left Heschl's Gyrus and anterior STG was dependent on alpha power in central areas, in line with the importance of motor structures for phonological analysis. And third, entrainment in the right posterior STG modulated theta power in parietal areas, consistent with the engagement of semantic memory. These results illustrate the topographical network interactions of auditory delta entrainment and reveal distinct cross-frequency mechanisms by which entrainment can interact with different cognitive processes underlying speech perception.
Keywords: Auditory entrainment, Speech processing, MEG, Delta band, Prosodic parsing
Graphical abstract
Highlights
-
•
We study auditory cortical speech entrainment from a network perspective.
-
•
Found three distinct networks interacting with delta-entrainment in auditory cortex.
-
•
Entrainment is modulated by frontal beta power, possibly indexing predictions.
-
•
Central alpha power interacts with entrainment, suggesting motor involvement.
-
•
Parietal theta is modulated by entrainment, suggesting working memory compensation.
1. Introduction
While listening to speech, rhythmic auditory cortical activity aligns to the quasi-rhythmic regularities arising from stress, syllabic rate, or phonemes. This entrainment of brain activity to speech is particularly prominent in the delta (below 4 Hz) and theta (4–8 Hz) frequency bands (Ahissar et al., 2001, Aiken and Picton, 2008, Di Liberto et al., 2015, Giraud and Poeppel, 2012, Gross et al., 2013, Kayser et al., 2015, Luo and Poeppel, 2007, Ng et al., 2012) and is particularly strong in the auditory cortex (Aiken and Picton, 2008, Gross et al., 2013). Given that the degree of entrainment is predictive of speech intelligibility and comprehension rates (Ahissar et al., 2001, Ding et al., 2014, Ghitza et al., 2012, Ghitza and Greenberg, 2009, Luo and Poeppel, 2007, Peelle and Davis, 2012, Peelle et al., 2013), the alignment of auditory cortical activity to the speech envelope has been proposed to subserve a number of important functions, such as the parsing or encoding of acoustic and phonological features (Cogan and Poeppel, 2011, Ghitza, 2013, Ghitza et al., 2012, Giraud and Poeppel, 2012, Howard and Poeppel, 2010, Peelle and Davis, 2012), or the selection of sensory streams (Schroeder and Lakatos, 2009). While auditory entrainment is frequently used as a marker for speech encoding, the network-level interactions that shape the underlying neural processes remain unknown.
Language processing depends on a large network of interconnected brain areas (Bornkessel-Schlesewsky et al., 2015, Friederici and Gierhan, 2013, Hickok and Poeppel, 2007, Poeppel, 2014, Rauschecker and Scott, 2009), with ventral and dorsal pathways linking auditory cortex with fronto-parietal regions implicated in extracting acoustic, lexical or categorical information (Alho et al., 2014, Bornkessel-Schlesewsky et al., 2015, Friederici and Gierhan, 2013, Hickok and Poeppel, 2007, Schomers et al., 2015, Smalle et al., 2015, Wilson et al., 2004). Interestingly, many studies have shown that auditory perception depends on the state of rhythmic activity, such as theta, alpha or beta activity, in auditory or frontal regions (Henry et al., 2014, Henry and Obleser, 2012, Neuling et al., 2012, Ng et al., 2012, Strauss et al., 2015). Furthermore, oscillatory power in different frequency bands and fronto-parietal brain regions has been quantified in numerous studies investigating cognitive control or attention (de Lange et al., 2008, Hipp et al., 2011, Schroeder and Lakatos, 2009, Stoll et al., 2016, van de Vijver et al., 2011). In the context of speech perception, the state of alpha power has been found to co-vary with experimental manipulations such as attention (Kelly et al., 2006, Klimesch et al., 1998), listening effort (Obleser et al., 2012), and speech intelligibility (Obleser and Weisz, 2012). In addition, changes in theta power have been linked to lexical retrieval processes and semantic working memory (Bastiaansen and Hagoort, 2006, Bastiaansen et al., 2005), while changes in beta and gamma power have been associated with a predictive coding framework, representing top-down predictions and bottom-up prediction errors during language processing (Arnal and Giraud, 2012, Arnal et al., 2011, Lewis and Bastiaansen, 2015). However, the functional interpretation of the processes indexed by the oscillatory activity extracted from fronto-parietal regions usually rests on the implicit assumption that changes in this oscillatory activity correlate with, or even causally relate to, changes in the auditory cortical speech representations provided by the entrainment of neural activity to the speech envelope (Arnal and Giraud, 2012, Kayser et al., 2015, Park et al., 2015). For example, changes in frontal alpha during enhanced listening effort have been suggested to relate to changes in the precision of auditory cortical entrainment (Kayser et al., 2015). However, no study to date has directly tested the hypothesis that changes in fronto-parietal oscillatory processes directly correlate with the fidelity of dynamic speech representations in temporally entrained auditory cortical activity.
To address the functional dependence between auditory speech entrainment and the state of oscillatory activity in fronto-parietal regions, we used source-localised MEG data obtained while participants listened to a continuous story (Gross et al., 2013). We chose an unbiased approach, in that we did not limit our analyses to specific and pre-defined anatomical regions of interest within frontal or parietal lobes. The reason for this was that previous studies have implied a vast number of regions in language processes, or provided only very coarse localizations of these. Likewise, we included a wide range of frequency bands when assessing the state of oscillatory activity in fronto-parietal regions (from delta to gamma), as previous studies have implied a wide range of rhythmic processes in speech perception. As a result, we systematically quantified the relation between auditory cortical speech entrainment and states of oscillatory power in fronto-parietal regions across a wide range of frequency bands.
2. Materials and methods
2.1. Participants and data acquisition
MEG data were acquired from 23 healthy, right-handed participants (12 female, mean age 26.9±7.9 years [M±SD]) as part of a previous study (Gross et al., 2013). All participants provided written informed consent prior to testing. The experiment was approved by a local ethics committee (University of Glasgow, Faculty of Information and Mathematical Sciences), and conducted in compliance with the Declaration of Helsinki.
MEG-recordings were obtained with a 248-magnetometers whole-head MEG system (MAGNES 3600 WH, 4-D Neuroimaging; sampling rate: 1017 Hz). The participants’ head positions were measured at the beginning and end of each run via 5 coils placed on the forehead and behind the ears. Head position was co-digitised with head-shape (FASTRAK®, Polhemus Inc., VT, USA). Participants sat upright and fixated a cross projected centrally on screen with a DLP projector while listening to two auditory presentations in a pseudo-randomised order. Sounds were presented binaurally via plastic earpieces and 5-m long plastic tubes connected to a sound pressure transducer. Stimulus presentation was controlled with Psychtoolbox (Brainard, 1997) for MATLAB (The MathWorks, Inc.).
2.2. Auditory stimuli
The ‘forward’ story condition consisted of an approximately 7-min long narration (‘Pie-man’ told by Jim O’Grady, recorded at ‘The Moth’ storytelling event in New York, 2012). This narration comprises approximately 950 words, with syllables spoken at an average rate of 6.8 Hz. This is in line with studies that establish the syllabic rate typically in the theta-range (4–8 Hz) (Cotton, 1936, Hyafil et al., 2015, Poeppel, 2003). In natural speech, the rate of prominent, or stressed, syllables is approximately one third of all syllables (e.g., Kochanski et al., 2005), essentially placing it in the delta-range (1–4 Hz) (Goswami and Leong, 2013, Greenberg et al., 2003). Consequently, entrainment in the delta band has been proposed to reflect prosodic fluctuations in speech (Ghitza, 2013, Ghitza et al., 2012). The ‘backward’ condition consisted of the same story played backwards and served as an unintelligible control condition.
2.3. Extraction of speech envelope
From the waveform of the acoustic stimulus we computed the wideband speech envelope by band-pass filtering (3rd order Butterworth filter, forward and reverse) into eight bands in the range of 100–10,000 Hz, equidistant on the cochlear frequency map (Smith et al., 2002). Individual band-limited envelopes were obtained using the magnitude of the Hilbert transform and were subsequently averaged to obtain the wide band speech envelope (Chandrasekaran et al., 2009, Drullman, 1995, Gross et al., 2013, Kayser et al., 2015, Smith et al., 2002). The envelope was resampled to 150 Hz for subsequent analysis.
2.4. MEG data processing
Data were analysed using MATLAB (The MathWorks, Inc.), including external toolboxes, such as FieldTrip (Oostenveld et al., 2011), and custom-written routines. The MEG signal was detrended and resampled to 150 Hz. Data were band-limited to seven frequency bands (delta 1–4 Hz, theta 4–8 Hz, alpha 8–12 Hz, low beta 12–18 Hz, beta 18–24 Hz, high beta 24–36 Hz, gamma 30–48 Hz), using FIR filters (forward and reverse, with 60 dB stop-band attenuation, 1-Hz transition bandwidth, and 0.01 dB pass-band ripple).
2.5. MEG source localisation
Individual, T1-weighted structural magnetic resonance images (MRIs) were manually co-registered to the MEG coordinate system by using participants’ digitised head shapes. MRIs were further realigned with individual head shapes through an iterative closest point (ICP) algorithm (Besl and Mckay, 1992). MRIs were then segmented to obtain a representation of the brain, including grey and white matter, and cerebrospinal fluid. A single-shell model was used to construct a volume conduction model (Nolte, 2003). Individual anatomical MRIs were linearly transformed to a template (MNI) brain using Fieldtrip/SPM5. Sensor level data were transformed into source space using the linear constraint minimum variance (LCMV) beamformer (Van Veen et al., 1997) on a 4-mm regular grid covering the entire brain (7% regularisation). The optimal orientation for each dipole was computed using the SVD approach. We used the AAL atlas (Automated Anatomical Labelling atlas) to parcellate the template brain into 116 anatomical areas (Tzourio-Mazoyer et al., 2002). As the superior temporal gyrus (STG) comprises a very large and functionally differentiated area in the AAL atlas, we divided it further into an anterior and posterior section (e.g., Friederici, 2002; Hickok and Poeppel, 2007). Specifically, the median of voxel positions along the horizontal plane (i.e., y-coordinates in MNI space) was used as threshold for the division between anterior and posterior parts to obtain equally-sized anterior and posterior STG partitions.
2.6. Region-specific analyses
We quantified the entrainment of rhythmic activity to speech within three auditory regions in each hemisphere (Heschl's Gyrus, anterior/posterior superior temporal gyrus [aSTG/pSTG]). For each auditory region, the bandpass-filtered MEG source-space data were Hilbert-transformed to derive the instantaneous phase for each time and grid point (for an analysis overview, see Fig. 1A).
Fig. 1.
Quantifying the dependency between auditory speech entrainment and the activity state in fronto-parietal regions. A) General analysis strategy. After preprocessing the acoustic waveform (including filtering in narrow bands, equidistant on the cochlear frequency map), the broadband speech amplitude envelope was derived. Speech and MEG signals were bandpass filtered into frequency bands and the MEG signal was projected into source space. The Hilbert transform was used to derive the instantaneous phase and power. Anatomical regions of interest (ROIs) were extracted using the AAL atlas. For each fronto-parietal ROI (coloured in blue), the oscillatory power was divided into four bins. For grid points within auditory regions (coloured in dark orange), the mutual information between the phase of the local oscillatory activity and the phase of the speech envelope was computed separately for each power state (bin) derived from each fronto-parietal ROI (ROIs closer than 3.5 cm to the respective auditory region were excluded). B) Binning procedure. Upper panel: Example of the time course of alpha power in the left supplementary motor area (inlet shows midsagittal view of left hemisphere). Middle panel: The same time course segmented into bins from low (blue) to high (red) power. MI in auditory areas was calculated separately for time points falling into each power bin, resulting in four MI values for each fronto-parietal ROI, power frequency band (delta to gamma), auditory region and auditory frequency band (delta/theta). MI-values were then regressed on a linear predictor (values 1 to 4, representing low to high power states). Lower panel: Illustration of hypothetical positive and negative relations between MI and power state (low to high).
We quantified the state of oscillatory activity in 46 frontal, central, and parietal ROIs. These included all ROIs of the AAL atlas in frontal and parietal lobes, motor regions, rolandic operculum, cingulate and angular gyri, insula, cuneus and precuneus. For these fronto-parietal ROIs, bandpass-filtered signals were Hilbert-transformed to derive the instantaneous power for each time and grid point. The power at each grid point was normalised for each frequency band by its time average, and the normalised power was then averaged across grid points to obtain a power time series for each ROI.
2.7. Speech entrainment quantified by mutual information
To quantify the statistical dependency between the speech envelope and the MEG source data, we used mutual information (MI) (Gross et al., 2013, Kayser et al., 2015). MI measures how much knowing one signal reduces the uncertainty about another signal and is expressed on a common principled scale in units of bits. MI values between two time series were calculated, using a robust bin-less approach based on the concept of statistical copulas (for details, see Ince et al., 2016, 2015; Kayser et al., 2015). When using phase as a variable, the phase was expressed as a unit magnitude complex number. The real and imaginary parts were standardised separately and combined as a two-dimensional variable for the MI calculation.
For an initial analysis, we computed within-band MI between the instantaneous phase of the source signal at each grid point and the instantaneous phase of the speech amplitude envelope, separately for each frequency band from delta to gamma. The MI was computed using a frequency-specific time-lag between the speech envelope and the MEG data. This was estimated by computing phase coherence for each participant between the acoustic amplitude envelope and the MEG time series within frequency bands for each grid point in auditory areas, using different lags from 1 to 30 sampling points (i.e. up to a lag of 200 ms), in steps of 1 sampling point. Phase coherence was averaged across grid points within auditory regions. The lag with maximum phase coherence was then chosen and averaged across participants (e.g., 140 ms for delta, and 147 ms for theta). MI was computed including all grid points for the whole-brain analysis. For subsequent analyses, MI was analysed for each grid point in auditory areas and then averaged within each auditory region.
2.8. Dependence of auditory cortical speech MI on the power in individual ROIs
To quantify whether the speech MI in auditory regions was related to the state of oscillatory power in the fronto-parietal ROIs, we binned the time series of the power in each of the 46 ROIs into four equipopulated bins representing different levels of amplitude (see Fig. 1B). For each auditory region, speech MI was then calculated separately for time points corresponding to each power bin (‘state’). Linear dependencies of MI on power states were analysed using within-subject regression analysis, with positive dependencies reflecting higher MI for higher power. Regression was performed for MI values in each of the six auditory regions in two frequency bands (delta, theta), in dependency on power in 46 fronto-parietal ROIs in seven power bands (delta to gamma). Group-level analysis was then performed on the t-statistics of the regression of betas using a cluster-based permutation approach (Maris and Oostenveld, 2007), correcting for multiple comparisons across frequency bands and fronto-parietal ROIs. Cluster statistics were derived separately for each auditory region (detailed parameters: 2000 iterations, including only bins with a two-tailed alpha of p<.025 in the cluster analysis, performing a two-tailed t-test at p<.05 on the clustered data; using the 3D-distance between ROI centres as neighbourhood information). Effect sizes for cluster-based t-statistics are reported as the summed t-value across all bins (fronto-parietal ROIs, power bands, and phase bands) within a cluster (Tsum). For the backward condition, cluster statistics yielded no comparable effects for which Tsum- or p-values could be reported. Instead, we report t-statistics with FDR-corrected p-values for these ROIs and frequency bands that were significant in the forward condition. To avoid spurious effects from signal blurring in source space or overlap between frequency bands we excluded fronto-parietal ROIs that were adjacent (<3.5 cm centre to centre) to the analysed auditory region, and we excluded within-frequency comparisons from the cluster statistics (i.e., comparison of entrainment and power state within the same band).
2.9. Directionality of MI-dependence on power
To determine whether the relation between auditory speech entrainment and fronto-parietal oscillatory power is temporally directional, we repeated the above regression analysis by systematically testing different temporal lags between the time points used to compute speech MI and oscillatory power – similar to other directional approaches such as Granger causality (Granger, 1969) or transfer entropy (Schreiber, 2000). For this analysis, time points for each ROI belonging to a cluster were again binned according to the four power states, but instead of computing speech MI at corresponding time points, it was computed at different time points before and after the sampling points of oscillatory states (shifted between -200 ms and 200 ms in steps of 1 sampling point, i.e., ~6.67 ms). Speech MI was then averaged within each ROI cluster. The regression analysis between speech MI and power states was repeated for each lag (where zero lag represents the original result). We then tested whether the time points of peak effects in the group-level regression statistics were significantly different from zero, using a jackknife test that reduces the bias in the estimate of the population value (Tukey's jackknife; see Sokal and Rohlf, 1995, pp. 820–821, for used formulas). For this procedure, the leave-one-out jackknife distribution is first z-transformed, the values then transformed into “pseudovalues”, and the statistics are based on these Jackknife estimates. Note that the jackknifed estimates of the statistics can differ from leave-one-out means, as they represent bias-free estimates. A positive peak effect indicates that speech-brain entrainment is related to the oscillatory power in preceding time bins.
2.10. Entrainment between speech and ROI clusters
For those fronto-parietal ROIs that exhibited a significant relation to auditory entrainment in our main analysis, we also computed the degree of speech entrainment of their oscillatory activity. For this, the MI was calculated between the power time course of each ROI and the delta phase of the speech envelope and then averaged within each ROI cluster. To estimate the significance of MI values, we obtained a surrogate distribution of MI values under the null-hypothesis of no systematic alignment of MEG data to speech. Practically, we implemented this by randomly shifting the speech time series (circular shift with lag drawn from a uniform distribution on [1 N] where N is the number of samples in the time series) and calculating a distribution of 1000 surrogate MI values for each auditory region or fronto-parietal ROI cluster. To account for multiple comparisons across multiple ROIs, maximum statistics were used (Holmes et al., 1996).
2.11. Phase-amplitude coupling between ROI clusters and auditory regions
Phase-amplitude coupling was calculated as the magnitude of the time-averaged complex product of fronto-parietal power and auditory phase (for each grid point in auditory regions) (Canolty et al., 2006). Coupling values were then averaged within each cluster. For statistical analysis, we derived a surrogate distribution of 1000 phase-amplitude values, using randomly shifted time series (circular shift without limitations as to the lag), and compared the actual values with the percentiles of the group-level surrogate distribution.
2.12. Independence of effects for modulation of MI in aSTG
As the speech MI within the left anterior STG was related to alpha power in central ROIs and beta power in frontal ROIs, we asked whether these two effects were related to each other. To test for an interaction between alpha and beta power on entrainment, we entered both alpha and beta power states into a 4×4 ANOVA (four alpha power bins×four beta power bins) with delta MI in aSTG as dependent variable. Significant main effects would essentially replicate results of the regression analysis, whereas a significant interaction would suggest a dependent modulation of aSTG-entrainment by central alpha and frontal beta power. As a next step, we calculated within-subject Pearson correlations and cross-correlations between average beta and alpha power time series, to determine whether both signals were directly related. The resulting mean correlation coefficients were again compared to a surrogate distribution. The time points of the peaks in the cross-correlation (with lags between ±200 ms, in steps of 1 sampling point, i.e., ~6.67 ms), were compared to zero by means of a dependent-samples t-test.
3. Results
3.1. Entrainment of auditory cortical activity to the speech envelope
We found significantly stronger speech MI for the forward condition, compared with the backward condition, in the delta (1–4 Hz) and theta (4–8 Hz) frequency bands within left and right auditory regions (group statistics, p<.05, false discovery rate [FDR] corrected, see Fig. 2). We did not observe significant differences between forward and backward conditions in higher frequency bands. Furthermore, MI values were higher for the delta compared to the theta band for all six auditory regions (all pFDR<.001, dependent-samples t-test; means across auditory regions: MIdelta=0.008, MItheta=0.003). As our main goal was to quantify whether this entrainment is modulated by the state of activity within fronto-parietal ROIs, we limited the following analysis to the speech entrainment in delta and theta bands.
Fig. 2.
Entrainment in delta and theta frequency bands. Upper panel shows raw MI values averaged across participants. Lower panel shows significant grid points in comparison with backward-condition (p<.05 dependent samples t-test, FDR corrected).
To quantify whether auditory entrainment was systematically related to changes in the power of oscillatory activity over fronto-parietal regions, we used group-level cluster-based regression to extract whether and for which auditory regions there was a significant linear dependency between speech MI and the state of oscillatory power in clusters of fronto-parietal ROIs over time within each participant (termed ‘ROI cluster’). This analysis was repeated separately for auditory delta and theta entrainment, and for seven frequency bands indexing the state of fronto-parietal activity. This analysis revealed four patterns of dependencies, which we describe in the following.
In the left anterior superior temporal gyrus (aSTG), delta MI was positively dependent on high beta power (24 – 36 Hz) in a cluster comprising bilateral superior frontal gyrus (medial orbital part), left superior frontal gyrus (orbital part), and left middle frontal gyrus (orbital part) (frontal beta effect, pcluster=.04, Tsum=10.48, Fig. 3A). Fig. 3A, middle column illustrates the increase in delta MI with increasing beta power in this cluster. There was no significant influence of beta power on delta MI in the backward speech condition (t(22)=0.19, pFDR=.39).
Fig. 3.
Dependence between auditory speech entrainment and activity states in fronto-parietal ROIs. (A)(Left column) Top-down modulation of delta MI in the left aSTG by beta power in orbito-frontal ROIs (bilateral medial orbital parts of the superior frontal gyrus, left superior orbito-frontal gyrus, left middle orbito-frontal gyrus). (Middle column) Delta MI as a function of binned beta power for the forward and backward speech conditions. Solid lines represent the average MI across participants, the shaded area the SEM across participants. The effect of beta power was significant for the story (pcluster=.04) but not the backward condition (pFDR=.39, no cluster found). (Right column) Dependence of the MI-modulation on the time-lag between power state and the time of MI calculation. The line in the upper panel shows the mean regression beta for this cluster (across participants) as a function of lag. The lower panel shows a histogram of the leave-one-out jackknife values and the associated mean (black cross) and 95% confidence interval (horizontal bar) derived from the jackknife statistics. The peak lag (at 7 ms) was significantly different from zero (pFDR<.05). (B)(Left column) Top-down modulation of delta MI in two auditory regions by alpha power in central/motor ROIs. MI in left Heschl's Gyrus was dependent on alpha power in bilateral SMA, bilateral precentral gyri, bilateral median cingulate gyri, and left postcentral gyrus. MI in left aSTG was dependent on power in the bilateral SMA, left precentral gyrus, bilateral median cingulate gyri, left postcentral gyrus, and left inferior parietal gyrus). (Middle column) The effect of alpha power was significant for the story (Heschl: pcluster<.001, aSTG: pcluster<.01) but not the backward condition (Heschl: pFDR=.39, aSTG: pFDR=.39, no cluster found). (Right column). The peak lags (at 40 ms and 20 ms, respectively) were marginally significantly different from zero in Heschl's Gyrus (pFDR=.06), but not the aSTG (pFDR=.16). Dashed lines/bars illustrate effects of aSTG. (C)(Left column) Bottom-up modulation of theta power in parietal ROIs by delta MI in the right posterior STG. This cluster comprises the bilateral cuneus, right precuneus, right superior parietal gyrus, and part of the right inferior parietal lobule. (Middle column) The effect of theta power was significant for the story (pcluster=.01) but not the backward condition (pFDR=.24, no cluster found). (Right column) The peak lag (at -60 ms) was significantly different from zero (pFDR<.001). SFG, superior frontal gyrus; FG, frontal gyrus; SMA, supplementary motor area; aSTG, anterior superior temporal gyrus.
In Heschl's Gyrus, as well as in the left anterior temporal gyrus (aSTG), delta MI was negatively dependent on alpha power (8–12 Hz). The two ROI clusters were highly overlapping. For Heschl's Gyrus, the cluster comprised bilateral supplementary motor areas, bilateral precentral gyri, bilateral median cingulate gyri, and left postcentral gyrus (central alpha effect 1, pcluster<.001, Tsum=-25.02 (Fig. 3B). For the aSTG, the cluster comprised bilateral supplementary motor areas, left precentral gyrus, bilateral median cingulate gyri, left postcentral gyrus, and left inferior parietal gyrus (central alpha effect 2, pcluster=.002, Tsum=-20.42. Delta MI in these auditory areas decreased with increasing alpha power in these ROIs. In the backward speech condition, there was no significant modulation of MI by alpha power (left Heschl's Gyrus: t(22)=-0.41, pFDR=.39; left aSTG: t(22)=0.44, pFDR=.39, Fig. 3B, middle column).
In the right posterior temporal gyrus (pSTG), delta MI was negatively dependent on theta power (4 – 8 Hz) in a cluster comprising bilateral cuneus, right precuneus, right superior parietal gyrus, and part of the right inferior frontal lobule (parietal theta effect, cluster statistics: pcluster=.01, Tsum=-16.02 (Fig. 3C). With decreasing theta power in these parietal ROIs, delta MI in right pSTG increases. Again, there was no significant MI modulation in the backward speech condition, t(22)=1. 89, pFDR=.24 (Fig. 3C, middle column).
3.2. Directionality of speech-MI modulation
To probe the directionality of the relation between auditory entrainment and fronto-parietal power, we repeated the regression analyses by varying a relative temporal lag between the time points at which ROI power and speech MI were calculated. We used Jackknife resampling to test whether the peak of the group-level MI dependence on power was significantly different from the zero lag.
The modulation of delta entrainment in left aSTG by frontal beta power peaked 7 ms after the power state and this was significantly different from zero (95% CI [4,48] ms, pFDR<.05; p-values for this analysis are FDR-corrected across comparisons for each ROI cluster). The modulations of delta entrainment in left Heschl's Gyrus and aSTG by central alpha power peaked at 40 ms and 20 ms respectively, after the power state. In Heschl's Gyrus, this peak was marginally different from zero, whereas in aSTG it was not (Heschl: 95% CI [2,91] ms, pFDR=.06; aSTG: 95% CI [-20, 111] ms, pFDR=.16). Finally, the modulation of delta entrainment in right pSTG by parietal theta power peaked at 60 ms before power computation and was significantly different from zero (95% CI [-135, -75], pFDR<.001). These results suggest that the modulation of parietal theta power follows the speech entrainment in auditory regions, while modulations of central alpha and frontal beta power precede auditory speech entrainment and therefore likely reflect top-down influences on auditory encoding.
3.3. Brain – speech entrainment in fronto-parietal ROIs
We ruled out that the dependence of speech MI on fronto-parietal activity trivially arises because fronto-parietal activity is by itself significantly entrained to the speech envelope. We calculated MI between the power in each of the fronto-parietal clusters and the delta phase of the speech amplitude envelope, which was the acoustic component driving the speech entrainment in the auditory regions. Speech MI in fronto-parietal power was not significant for any of the four clusters (frontal beta: Student's t=1.16, pFDR=.87, central alpha 1 [Heschl's Gyrus]: Student's t=0.17, pFDR=.76, central alpha 2 [aSTG]: Student's t=-0.68, pcorrected=. 51, parietal theta: Student's t =-0. 93, pFDR=.51) and the MI values were considerably lower than those for the delta phase of the activity in auditory regions (Fig. 4).
Fig. 4.
Brain-speech entrainment in fronto-parietal and auditory areas. MI between the power in fronto-parietal ROIs and the delta phase of the speech amplitude envelope (orange), and between the delta phase in auditory regions and the delta phase of the speech envelope (green). For fronto-parietal ROIs, MI values are represented as mean across participants for each individual ROI belonging to a cluster. Bars illustrate SEM across participants. Dotted line denotes significance level at 95% derived from surrogate data.
3.4. Phase-amplitude coupling between ROI clusters and auditory regions
We also ruled out that the relation between delta entrainment in auditory regions and fronto-parietal power reflects a coupling of these signals during listening to speech that is independent of the acoustic input. Previous studies have described wide-spread coupling between the oscillatory phase at lower frequencies and the power at higher frequencies within and between brain regions, termed phase-amplitude coupling (e.g., Gross et al., 2013; Jensen and Colgin, 2007). Thus, we calculated the phase-amplitude coupling between auditory delta phase and the power in each of the four clusters. For no cluster did the observed phase-amplitude coupling values (PAC) reach statistical significance: frontal beta cluster (PAC=0.013; Student's t=-1.00, pFDR=.66); two central clusters (central alpha 1: PAC=0.011; Student's t=0.88, pFDR=.81; central alpha 2: M=0.012; Student's t=0.57, pFDR=.81); and parietal theta cluster (PAC=0.015; Student's t=-0.41, pFDR=.69).
3.5. Dependent influences of alpha and beta power on aSTG entrainment
As speech MI in the left anterior superior temporal gyrus was dependent on two distinct ROI clusters (negative dependence on central alpha and positive dependence on frontal beta-power), we asked whether these effects were statistically distinct. We entered both alpha and beta power as two factors into an analysis of variance (ANOVA) with delta speech MI in aSTG as dependent variable. As expected, the analysis yielded a significant main effect of alpha power, F(3,66) = 4.74, p < .01, η2 = .02, and a significant main effect of beta power, F(3,66)=8.25, p<.001, η2=.07. Furthermore, there was a significant alpha power×beta power interaction, F(9,66)=2.14, p=.03, η2=.02. The interaction suggests that frontal beta and central alpha power co-modulate auditory entrainment in aSTG. Furthermore, within-subject correlations between the power time series of frontal beta and central alpha were significant (mean rPearson=.027, Student's t=-20.67, p<.001), and a cross-correlation showed that the correlation peaked with central alpha power following frontal beta power at a lag of 67 ms (95% CI [10,85], p=.02).
4. Discussion
The entrainment (alignment in time) of auditory cortex activity to speech is frequently used as a marker for how well the auditory brain encodes speech. Auditory entrainment has been linked with comprehension rates (e.g., Ahissar et al., 2001; Ding et al., 2014; Peelle and Davis, 2012) and is considered as a mechanistic component of sound encoding in the brain serving the segmentation and parsing of prosody, syllables or phonemes (e.g., Hyafil et al., 2015). In particular, entrainment in the theta band is often assumed to reflect the processing of the syllabic-scale information, as the syllabic rate of natural speech falls into the theta range (e.g., 6.8 Hz on average for the current speech stimulus) (Ding and Simon, 2014, Ghitza, 2012, Ghitza, 2013, Giraud and Poeppel, 2012, Hyafil et al., 2015, Peelle and Davis, 2012). In contrast, entrainment in the delta band has been proposed to reflect the processing of supra-segmental prosodic features such as acoustic stress, which are critical for the contextual processing and the parsing of speech (Ghitza, 2013, Ghitza et al., 2012, Giraud and Poeppel, 2012, Goswami and Leong, 2013, Greenberg et al., 2003). In line with this, a recent study demonstrated a direct influence of acoustic speech rhythm on auditory delta-entrainment (Kayser et al., 2015). It is noteworthy that, although entrainment is often associated with amplitude (i.e., intensity) fluctuations in the speech signal (e.g., Ahissar et al., 2001; Gross et al., 2013; Peelle and Davis, 2012), there is vast evidence that frequency (e.g., pitch) fluctuations are also tracked by the auditory system (Henry and Obleser, 2012, Henry and Obleser, 2013, Obleser et al., 2008, Obleser et al., 2012, Scott and McGettigan, 2012, Xu et al., 2005, Zeng et al., 2005). Particularly, spectral regularities in the delta band have been found to entrain neural oscillations (Henry and Obleser, 2012, Obleser et al., 2012) and could play a major role in the tracking of non-intensity based prosodic information that can contribute to speech comprehension.
Although neural entrainment has been extensively used to investigate the cortical processing of speech or other acoustic information, it is still under debate whether entrainment reflects a genuine oscillatory process (e.g., Zoefel and VanRullen, 2015) or a series of evoked potentials (Szymanski et al., 2011). Supporting the former view are, for example, findings that neural phase alignment in the delta-band can be found in the absence of acoustic fluctuations in the stimulus envelope, which is thought to reflect higher-level segmentations of speech (Ding et al., 2016; Meyer et al., 2016) or that entrainment can occur as the result of cross-modal attention (Lakatos et al., 2008, Lakatos et al., 2009).While research needs to resolve the exact neural mechanisms giving rise to rhythmic entrainment (see Kayser et al., 2015; Keitel, Quigley, & Ruhnau, 2014), it certainly serves as a powerful tool to index speech encoding (Giraud and Poeppel, 2012).
4.1. Network interactions indexed by oscillatory power
The auditory cortex functions within a large network of other temporal and fronto-parietal regions to provide the neural representations of speech. Many studies on speech networks have quantified the activity in fronto-parietal regions using the power of oscillatory activity, and have used this to attribute specific functions to these regions in speech encoding (Arnal et al., 2011, Bastiaansen and Hagoort, 2006, Bastiaansen et al., 2005, Obleser and Weisz, 2012, Obleser et al., 2012). Importantly, this often involves the implicit hypothesis that the processes indexed by fronto-parietal oscillatory power directly interact with the auditory cortical speech encoding implemented by rhythmic entrainment (Kayser et al., 2015). However, no study to date has systematically investigated whether and for which fronto-parietal brain regions the oscillatory activity directly interacts with auditory cortical speech entrainment.
We here provide a comprehensive analysis of such interactions and probe their temporal specificity to dissociate potential feed-forward from feed-back interactions. This revealed three auditory regions in which delta speech MI was systematically related to the oscillatory power in distinct clusters of fronto-parietal regions. Importantly, each of these network interactions was characterised by a distinct time scale of the relevant fronto-parietal state. Our results reveal a spatio-temporal organization whereby auditory delta-entrainment is influenced by frontal beta-activity, interacts bidirectionally with central alpha activity, and influences slow (theta) activity in parietal regions. By performing control analyses and excluding spatially proximal ROIs, we ruled out that these network interactions arise trivially from signal blurring in source space, general coupling mechanisms not related to the acoustic input, or a significant entrainment of fronto-parietal power to the speech signal itself. As we compare the auditory cortical entrainment to natural speech with a control condition of unintelligible (reversed) speech, our results do not allow conclusions about the immediate link between speech entrainment and speech comprehension, which have been investigated in many previous studies (e.g., Ding et al., 2014; Ding and Simon, 2013; Peelle et al., 2013). Rather, we here investigate the function of speech entrainment within a network context, and demonstrate how entrainment interacts with fronto-parietal networks. While future work is required to directly demonstrate the behavioural relevance of these interactions, we can speculate about hypothetical functions and their contribution to perception based on the specific oscillatory fingerprints associated with each of these.
As noted before, we used the power in different frequency bands as a marker of fronto-parietal activity. We thereby followed many previous studies that have used oscillatory power as index of possibly speech-relevant neural processes in various brain regions (e.g., Arnal and Giraud, 2012; Bastiaansen and Hagoort, 2006; Obleser and Weisz, 2012). Similarly, studies linking oscillatory activity to perception have shown that the power of pre-stimulus activity correlates with perceptual accuracy or reaction times (Arnal et al., 2015, Hanslmayr et al., 2007, Lundqvist et al., 2013, Ng et al., 2012, van Dijk et al., 2008). However, we acknowledge that many studies have also noted that the phase of oscillatory processes can correlate with perceptual outcomes or the quality of sound encoding (Henry et al., 2014, Henry and Obleser, 2012, Kayser et al., 2016, Ng et al., 2012). While it seems reasonable to repeat the present analysis using phase as a marker of the state of fronto-parietal activity, this is made difficult for several reasons. First, phase is technically well defined only when oscillatory power is sufficiently strong (Muthukumaraswamy and Singh, 2011). Second, the sign of source-localized activity is difficult to compare across regions and participants, given inherent ambiguities in the source localization process. This makes it difficult to assign an absolute interpretation to a specific phase angle in neuroimaging data, which is sometimes resolved with a within-subject standardization of phase angles for group-level analysis (Ng et al., 2013). In contrast, the linear relation between auditory entrainment and the power of oscillatory source activity described here can be easily compared and interpreted across regions and participants.
To date, only one study has provided indirect evidence for network-level influences on auditory cortical entrainment. Park et al. (2015) quantified a general measure of the functional connectivity between auditory cortex and the frontal lobe and found that the strength of top-down feedback from left precentral regions to auditory cortex correlates with the strength of entrainment across participants (Park et al., 2015). However, this study could neither implicate specific oscillatory processes in this top-down influence, nor did this study demonstrate that frontal activity directly affects the alignment of auditory activity to speech over time and within an individual participant.
4.2. Orbitofrontal beta and top-down predictions
We found evidence for a top-down modulation of delta-entrainment in the left aSTG by beta-power in predominantly left-lateralised medial orbitofrontal areas. Anatomically, orbitofrontal regions are connected with anterior superior temporal areas via a ventral stream (Gierhan, 2013). Medial frontal areas have been associated with story comprehension (Elliott et al., 2000, Maguire et al., 1999), phonological processing (Vigneau et al., 2006) and the generation of predictions (Bar, 2007, Dikker and Pylkkanen, 2013). The orbital gyrus has also been found to be involved in phrasal processing (Grodzinsky and Friederici, 2006). Beta oscillations have been suggested as a mechanism for the top-down propagation of predictions from higher to lower areas across the cortical hierarchy (Arnal and Giraud, 2012, Fontolan et al., 2014, Lewis and Bastiaansen, 2015) as well as the maintenance/change of the current mode of processing within the predictive coding framework (Bressler and Richter, 2015, Engel and Fries, 2010, Lewis et al., 2015). The involved areas, the frequency signature, and the direction of the interaction observed here are therefore consistent with a top-down scaling of auditory entrainment by predictions from frontal areas, whereby predictions facilitate the acoustic-phonological analysis within the aSTG (DeWitt and Rauschecker, 2012, Friederici, 2012). More specifically, increased beta power could indicate that the recent sensory input allowed the formation of clear predictions, while reduced beta power could indicate the lack of predictions about the expected acoustic input (Arnal and Giraud, 2012). The formation of strong predictions in prefrontal regions could then subsequently facilitate the dynamic representation of upcoming acoustic input in lower auditory regions, which is reflected by an enhancement of speech entrainment.
4.3. Auditory speech entrainment and central alpha power
Delta-entrainment in both left Heschl's Gyrus and aSTG was found to depend on alpha-power in bilateral central regions (pre- and postcentral gyri and SMA), extending to inferior parietal and median cingulate gyri. Motor areas are connected to auditory areas via the dorsal stream, and reflect articulatory mapping of heard speech (e.g., Friederici and Gierhan, 2013), and phoneme-specific activation (Pulvermüller et al., 2006). Furthermore, the SMA has been associated with temporal sensory prediction in auditory rhythm perception (Bengtsson et al., 2009, Grahn and Brett, 2007) and phoneme detection (Simon et al., 2002), while the cingulate gyrus has been associated with semantic processing (Zatorre et al., 1996) and phoneme detection (Simon et al., 2002). The alpha rhythm has a ubiquitous role in the brain as a gating mechanism ('functional inhibition', see Jensen and Mazaheri, 2010; Strauss et al., 2014) and has been found to indicate sensitivity to acoustic features in the context of speech intelligibility (Obleser and Weisz, 2012). Alpha is also one of the dominant rhythms in motor areas (e.g., Pineda, 2005). In line with this, we found that suppressed alpha-power in motor areas leads to enhanced delta-entrainment in early auditory areas. This might represent the recruitment of motor areas for a temporally precise phonological analysis in the Heschl's Gyrus and aSTG (DeWitt and Rauschecker, 2012, Friederici, 2012). While it seems unlikely that alpha power in central areas has a direct causal impact on auditory cortex, alpha may reflect the engagement of long-range connections that modulate the efficiency of information transmission along auditory pathways (Strauss et al., 2014) or which reflect the impact of task-engagement and cognitive effort on auditory cortex activity (Strauss et al., 2014, Wöstmann et al., 2015). Our data are ambivalent as to the direction of this interaction and previous work has revealed conflicting views on an automatic recruitment of motor cortex for auditory processing or the specifics of motor influences during high task demands (Alho et al., 2014, Morillon et al., 2015, Morillon et al., 2014). However, two recent studies have reported directed relations between left frontal activity and auditory cortical entrainment (Kayser et al., 2015, Park et al., 2015), cautiously supporting a top-down modulation.
4.4. Parietal theta power is modulated by speech entrainment in pSTG
The degree of delta-entrainment in the right posterior superior temporal gyrus negatively influenced theta-power in predominantly right-lateralised parietal areas, such as cuneus, precuneus, and superior/inferior parietal areas. The inferior parietal cortex is connected to the pSTG via dorsal streams (Gierhan, 2013). The right hemisphere, including parietal areas, has been associated with prosodic and intonational processing (Behrens, 1988, Behrens, 1989, Shapiro and Danly, 1985). Consistently, the right pSTG has been shown to be particularly involved in the processing of isolated prosody (Meyer et al., 2002). This right-hemisphere posterior network-level interaction might therefore represent the processing of prosodic information. Furthermore, theta-activity in the parietal cortex is usually associated with working-memory processes, where theta-power increases with working-memory load (Moran et al., 2010, Raghavachari et al., 2001, Sauseng et al., 2010). It has also been proposed that theta power changes during language processing reflect lexical retrieval processes and semantic working memory (Bastiaansen and Hagoort, 2006, Bastiaansen et al., 2005). We found that weaker auditory entrainment leads to stronger theta power in parietal regions. One plausible interpretation of the present result is that weak delta-parsing is subsequently compensated by parietal working memory processes that support linguistic inference. In other words, weak entrainment, and the likely resulting poor comprehension (e.g., Doelling et al., 2014), might necessitate stronger working memory involvement to make sense of the heard speech.
4.5. The network interactions of auditory entrainment
We did not observe any significant interactions of auditory theta band entrainment with fronto-parietal activity. During listening to the naturally-told comedy story used as stimulus here, auditory activity entrains to the speech envelope both in the delta and theta bands, but entrainment is stronger in the delta band (see also, Gross et al., 2013). One possible explanation for this delta dominance is that our acoustic stimulus is rich in prosodic features, in contrast to more neutral speech such as sequences of digits (Doelling et al., 2014, Ghitza, 2012) or individual sentences (Peelle et al., 2013). Importantly, entrainment in these frequency bands can directly dissociate in the same experimental paradigm (Ding et al., 2014, Kayser et al., 2015). Across multiple studies, it has been observed that changes in theta band entrainment correlate with changes in acoustic features such as the signal-to-noise ratio or voice vocoding (Ding et al., 2014, Ding and Simon, 2013, Peelle et al., 2013). It has therefore been tied to acoustic speech properties (Ding and Simon, 2013). On the other hand, delta-entrainment correlates more strongly with the perceived quality of speech or intelligibility measures (Ding et al., 2014, Ding and Simon, 2013) and has also been implied in attentional selection (Ding and Simon, 2013, Schroeder and Lakatos, 2009) and the formation of temporal predictions (Arnal et al., 2015). In addition, two recent studies show that auditory delta oscillations modulate the interpretation of linguistic input (Meyer et al., 2016), and can track hierarchical linguistic structures even without the presence of acoustic cues (Ding et al., 2016). Thus, there seems to be converging evidence that delta oscillations are involved in higher order cognitive functions during speech processing such as attention, semantic comprehension and lexical interpretation. These may be reasons for why long-range network interactions of auditory entrainment are more prominent for delta than for theta entrainment.
However, our results suggest a cascade of network mechanisms that control the parsing of continuous, natural speech by delta entrainment (see Fig. 5). In the left hemisphere, orbitofrontal areas influence the alignment of rhythmic auditory cortical activity to speech, possibly by providing predictive information. These predictions are also conveyed to the motor system, as shown by a significant cross-correlation between frontal beta power and central alpha. The motor system in turn interacts with delta-entrainment, possibly by exploiting the frontal predictions and utilising its ability to process rhythms in a temporally precise manner. Finally, in the right hemisphere, parietal theta processes are particularly engaged in moments when auditory delta-entrainment is weak. As we did not employ any direct experimental manipulation or the collection of behavioural measures, these interpretations necessarily remain speculative. However, our results directly place the entrainment of rhythmic auditory cortical activity in the context of fronto-parietal speech relevant networks. They provide a basis to further explore the functional and possibly mechanistic relations between auditory cortical entrainment, semantic and lexical processes at higher stages of the auditory pathways, and perception.
Fig. 5.
Summary and time course of network-level effects of delta-entrainment in auditory areas. Left hemisphere: I) Prefrontal predictions facilitate delta-speech tracking in aSTG. II) Frontal predictions are conveyed to motor areas. III) Motor areas interact with delta-entrainment in aSTG. Right hemisphere: IV) Delta-speech tracking in pSTG engages parietal areas. aSTG: anterior superior temporal gyrus, pSTG posterior superior temporal gyrus.
Funding
This work was supported by the UK Biotechnology and Biological Sciences Research Council (BBSRC, BB/L027534/1); a European Research Council grant (ERC-2014-CoG; Grant No 646657) to CK; and a Wellcome Trust grant (Joint Senior Investigator Award, No 098433) to JG.
Acknowledgements
We thank Nienke Hoogenboom for recording the data, and Christian Keitel for valuable discussion.
Contributor Information
Anne Keitel, Email: anne.keitel@glasgow.ac.uk.
Christoph Kayser, Email: christoph.kayser@glasgow.ac.uk.
References
- Ahissar E., Nagarajan S., Ahissar M., Protopapas A., Mahncke H., Merzenich M.M. Speech comprehension is correlated with temporal response patterns recorded from auditory cortex. Proc. Natl. Acad. Sci. USA. 2001;98(23):13367–13372. doi: 10.1073/pnas.201400998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aiken S.J., Picton T.W. Human cortical responses to the speech envelope. Ear Hear. 2008;29(2):139–157. doi: 10.1097/aud.0b013e31816453dc. [DOI] [PubMed] [Google Scholar]
- Alho J., Lin F.H., Sato M., Tiitinen H., Sams M., Jaaskelainen I.P. Enhanced neural synchrony between left auditory and premotor cortex is associated with successful phonetic categorization. Front. Psychol. 2014;5:394. doi: 10.3389/fpsyg.2014.00394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arnal L.H., Doelling K.B., Poeppel D. Delta-beta coupled oscillations underlie temporal prediction accuracy. Cereb. Cortex. 2015;25(9):3077–3085. doi: 10.1093/cercor/bhu103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arnal L.H., Giraud A.L. Cortical oscillations and sensory predictions. Trends Cogn. Sci. 2012;16(7):390–398. doi: 10.1016/j.tics.2012.05.003. [DOI] [PubMed] [Google Scholar]
- Arnal L.H., Wyart V., Giraud A.L. Transitions in neural oscillations reflect prediction errors generated in audiovisual speech. Nat. Neurosci. 2011;14(6):797–801. doi: 10.1038/nn.2810. [DOI] [PubMed] [Google Scholar]
- Bar M. The proactive brain: using analogies and associations to generate predictions. Trends Cogn. Sci. 2007;11(7):280–289. doi: 10.1016/j.tics.2007.05.005. [DOI] [PubMed] [Google Scholar]
- Bastiaansen M., Hagoort P. Oscillatory neuronal dynamics during language comprehension. Prog. Brain Res. 2006;159:179–196. doi: 10.1016/S0079-6123(06)59012-0. [DOI] [PubMed] [Google Scholar]
- Bastiaansen M., van der Linden M., Ter Keurs M., Dijkstra T., Hagoort P. Theta responses are involved in lexical-semantic retrieval during language processing. J. Cogn. Neurosci. 2005;17(3):530–541. doi: 10.1162/0898929053279469. [DOI] [PubMed] [Google Scholar]
- Behrens S.J. The role of the right hemisphere in the production of linguistic stress. Brain Lang. 1988;33(1):104–127. doi: 10.1016/0093-934x(88)90057-0. [DOI] [PubMed] [Google Scholar]
- Behrens S.J. Characterizing sentence intonation in a right hemisphere-damaged population. Brain Lang. 1989;37(2):181–200. doi: 10.1016/0093-934x(89)90014-x. [DOI] [PubMed] [Google Scholar]
- Bengtsson S.L., Ullen F., Ehrsson H.H., Hashimoto T., Kito T., Naito E., Forssberg H., Sadato N. Listening to rhythms activates motor and premotor cortices. Cortex. 2009;45(1):62–71. doi: 10.1016/j.cortex.2008.07.002. [DOI] [PubMed] [Google Scholar]
- Besl P.J., Mckay N.D. A method for registration of 3-D shapes. IEEE Trans. Pattern Anal. Mach. Intell. 1992;14(2):239–256. [Google Scholar]
- Bornkessel-Schlesewsky I., Schlesewsky M., Small S.L., Rauschecker J.P. Neurobiological roots of language in primate audition: common computational properties. Trends Cogn. Sci. 2015;19(3):142–150. doi: 10.1016/j.tics.2014.12.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brainard D.H. The psychophysics toolbox. Spat. Vis. 1997;10(4):433–436. [PubMed] [Google Scholar]
- Bressler S.L., Richter C.G. Interareal oscillatory synchronization in top-down neocortical processing. Curr. Opin. Neurobiol. 2015;31:62–66. doi: 10.1016/j.conb.2014.08.010. [DOI] [PubMed] [Google Scholar]
- Canolty R.T., Edwards E., Dalal S.S., Soltani M., Nagarajan S.S., Kirsch H.E., Berger M.S., Barbaro N.M., Knight R.T. High gamma power is phase-locked to theta oscillations in human neocortex. Science. 2006;313(5793):1626–1628. doi: 10.1126/science.1128115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chandrasekaran C., Trubanova A., Stillittano S., Caplier A., Ghazanfar A.A. The natural statistics of audiovisual speech. PLoS Comput. Biol. 2009;5(7):e1000436. doi: 10.1371/journal.pcbi.1000436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cogan G.B., Poeppel D. A mutual information analysis of neural coding of speech by low-frequency MEG phase information. J Neurophysiol. 2011;106(2):554–563. doi: 10.1152/jn.00075.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cotton J.C. Syllabic rate: a new concept in the study of speech rate variation. Speech Monogr. 1936;3:112–117. [Google Scholar]
- DeWitt I., Rauschecker J.P. Phoneme and word recognition in the auditory ventral stream. Proc. Natl. Acad. Sci USA. 2012;109(8):E505–E514. doi: 10.1073/pnas.1113427109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Di Liberto G.M., O'Sullivan J.A., Lalor E.C. Low-frequency cortical entrainment to speech reflects phoneme-level processing. Curr. Biol. 2015;25(19):2457–2465. doi: 10.1016/j.cub.2015.08.030. [DOI] [PubMed] [Google Scholar]
- van Dijk H., Schoffelen J.M., Oostenveld R., Jensen O. Prestimulus oscillatory activity in the alpha band predicts visual discrimination ability. J. Neurosci. 2008;28(8):1816–1823. doi: 10.1523/JNEUROSCI.1853-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dikker S., Pylkkanen L. Predicting language: meg evidence for lexical preactivation. Brain Lang. 2013;127(1):55–64. doi: 10.1016/j.bandl.2012.08.004. [DOI] [PubMed] [Google Scholar]
- Ding N., Chatterjee M., Simon J.Z. Robust cortical entrainment to the speech envelope relies on the spectro-temporal fine structure. Neuroimage. 2014;88C:41–46. doi: 10.1016/j.neuroimage.2013.10.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ding N., Melloni L., Zhang H., Tian X., Poeppel D. Cortical tracking of hierarchical linguistic structures in connected speech. Nat. Neurosci. 2016;19(1):158–164. doi: 10.1038/nn.4186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ding N., Simon J.Z. Adaptive temporal encoding leads to a background-insensitive cortical representation of speech. J. Neurosci. 2013;33(13):5728–5735. doi: 10.1523/JNEUROSCI.5297-12.2013. (doi: 10.1523/JNEUROSCI.5297-12.2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ding N., Simon J.Z. Cortical entrainment to continuous speech: functional roles and interpretations. Front Hum. Neurosci. 2014;8:311. doi: 10.3389/fnhum.2014.00311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doelling K.B., Arnal L.H., Ghitza O., Poeppel D. Acoustic landmarks drive delta-theta oscillations to enable speech comprehension by facilitating perceptual parsing. Neuroimage. 2014;85(Pt 2):761–768. doi: 10.1016/j.neuroimage.2013.06.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drullman R. Temporal envelope and fine structure cues for speech intelligibility. J. Acoust. Soc. Am. 1995;97(1):585–592. doi: 10.1121/1.413112. [DOI] [PubMed] [Google Scholar]
- Elliott R., Dolan R.J., Frith C.D. Dissociable functions in the medial and lateral orbitofrontal cortex: evidence from human neuroimaging studies. Cereb. Cortex. 2000;10(3):308–317. doi: 10.1093/cercor/10.3.308. [DOI] [PubMed] [Google Scholar]
- Engel A.K., Fries P. Beta-band oscillations – signalling the status quo? Curr. Opin. Neurobiol. 2010;20(2):156–165. doi: 10.1016/j.conb.2010.02.015. [DOI] [PubMed] [Google Scholar]
- Fontolan L., Morillon B., Liegeois-Chauvel C., Giraud A.L. The contribution of frequency-specific activity to hierarchical information processing in the human auditory cortex. Nat. Commun. 2014;5:4694. doi: 10.1038/ncomms5694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friederici A.D. Towards a neural basis of auditory sentence processing. Trends Cogn. Sci. 2002;6(2):78–84. doi: 10.1016/s1364-6613(00)01839-8. [DOI] [PubMed] [Google Scholar]
- Friederici A.D. The cortical language circuit: from auditory perception to sentence comprehension. Trends Cogn. Sci. 2012;16(5):262–268. doi: 10.1016/j.tics.2012.04.001. [DOI] [PubMed] [Google Scholar]
- Friederici A.D., Gierhan S.M. The language network. Curr. Opin. Neurobiol. 2013;23(2):250–254. doi: 10.1016/j.conb.2012.10.002. [DOI] [PubMed] [Google Scholar]
- Ghitza O. On the role of theta-driven syllabic parsing in decoding speech: intelligibility of speech with a manipulated modulation spectrum. Front. Psychol. 2012;3:238. doi: 10.3389/fpsyg.2012.00238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghitza O. The theta-syllable: a unit of speech information defined by cortical function. Front. Psychol. 2013;4:138. doi: 10.3389/fpsyg.2013.00138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghitza O., Giraud A.L., Poeppel D. Neuronal oscillations and speech perception: critical-band temporal envelopes are the essence. Front. Hum. Neurosci. 2012;6:340. doi: 10.3389/fnhum.2012.00340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghitza O., Greenberg S. On the possible role of brain rhythms in speech perception: intelligibility of time-compressed speech with periodic and aperiodic insertions of silence. Phonetica. 2009;66(1–2):113–126. doi: 10.1159/000208934. [DOI] [PubMed] [Google Scholar]
- Gierhan S.M. Connections for auditory language in the human brain. Brain Lang. 2013;127(2):205–221. doi: 10.1016/j.bandl.2012.11.002. [DOI] [PubMed] [Google Scholar]
- Giraud A.L., Poeppel D. Cortical oscillations and speech processing: emerging computational principles and operations. Nat. Neurosci. 2012;15(4):511–517. doi: 10.1038/nn.3063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goswami U., Leong V. Speech rhythm and temporal structure: converging perspectives? Lab. Phonol. 2013;4(1):67–92. [Google Scholar]
- Grahn J.A., Brett M. Rhythm and beat perception in motor areas of the brain. J. Cogn. Neurosci. 2007;19(5):893–906. doi: 10.1162/jocn.2007.19.5.893. [DOI] [PubMed] [Google Scholar]
- Granger C.W.J. Investigating causal relations by econometric models and cross-spectral methods. Econometrica. 1969;37(3):414. (&.) [Google Scholar]
- Greenberg S., Carvey H., Hitchcock L., Chang S.Y. Temporal properties of spontaneous speech – a syllable-centric perspective. J. Phon. 2003;31(3–4):465–485. [Google Scholar]
- Grodzinsky Y., Friederici A.D. Neuroimaging of syntax and syntactic processing. Curr. Opin. Neurobiol. 2006;16(2):240–246. doi: 10.1016/j.conb.2006.03.007. [DOI] [PubMed] [Google Scholar]
- Gross J., Hoogenboom N., Thut G., Schyns P., Panzeri S., Belin P., Garrod S. Speech rhythms and multiplexed oscillatory sensory coding in the human brain. PLoS Biol. 2013;11(12):e1001752. doi: 10.1371/journal.pbio.1001752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hanslmayr S., Aslan A., Staudigl T., Klimesch W., Herrmann C.S., Bauml K.H. Prestimulus oscillations predict visual perception performance between and within subjects. Neuroimage. 2007;37(4):1465–1473. doi: 10.1016/j.neuroimage.2007.07.011. [DOI] [PubMed] [Google Scholar]
- Henry M.J., Herrmann B., Obleser J. Entrained neural oscillations in multiple frequency bands comodulate behavior. Proc. Natl. Acad. Sci. USA. 2014;111(41):14935–14940. doi: 10.1073/pnas.1408741111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henry M.J., Obleser J. Frequency modulation entrains slow neural oscillations and optimizes human listening behavior. Proc. Natl. Acad. Sci. USA. 2012;109(49):20095–20100. doi: 10.1073/pnas.1213390109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henry M.J., Obleser J. Dissociable neural response signatures for slow amplitude and frequency modulation in human auditory cortex. PLoS One. 2013;8(10):e78758. doi: 10.1371/journal.pone.0078758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hickok G., Poeppel D. Opinion – the cortical organization of speech processing. Nat. Rev. Neurosci. 2007;8(5):393–402. doi: 10.1038/nrn2113. [DOI] [PubMed] [Google Scholar]
- Hipp J.F., Engel A.K., Siegel M. Oscillatory synchronization in large-scale cortical networks predicts perception. Neuron. 2011;69(2):387–396. doi: 10.1016/j.neuron.2010.12.027. [DOI] [PubMed] [Google Scholar]
- Holmes A.P., Blair R.C., Watson J.D., Ford I. Nonparametric analysis of statistic images from functional mapping experiments. J Cereb. Blood Flow. Metab. 1996;16(1):7–22. doi: 10.1097/00004647-199601000-00002. [DOI] [PubMed] [Google Scholar]
- Howard M.F., Poeppel D. Discrimination of speech stimuli based on neuronal response phase patterns depends on acoustics but not comprehension. J. Neurophysiol. 2010;104(5):2500–2511. doi: 10.1152/jn.00251.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hyafil A., Fontolan L., Kabdebon C., Gutkin B., Giraud A.L. Speech encoding by coupled cortical theta and gamma oscillations. Elife. 2015;4:e06213. doi: 10.7554/eLife.06213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ince R.A., Giordano B.L., Kayser C., Rousselet G.A., Gross J., Schyns P.G. A statistical framework for neuroimaging data analysis based on mutual information estimated via a gaussian copula. Human. Brain Mapp. 2016 doi: 10.1002/hbm.23471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ince R.A., van Rijsbergen N.J., Thut G., Rousselet G.A., Gross J., Panzeri S., Schyns P.G. Tracing the flow of perceptual features in an algorithmic brain network. Sci. Rep. 2015;5:17681. doi: 10.1038/srep17681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jensen O., Colgin L.L. Cross-frequency coupling between neuronal oscillations. Trends Cogn. Sci. 2007;11(7):267–269. doi: 10.1016/j.tics.2007.05.003. [DOI] [PubMed] [Google Scholar]
- Jensen O., Mazaheri A. Shaping functional architecture by oscillatory alpha activity: gating by inhibition. Front. Hum. Neurosci. 2010;4:186. doi: 10.3389/fnhum.2010.00186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kayser S.J., Ince R.A., Gross J., Kayser C. Irregular speech rate dissociates auditory cortical entrainment, evoked responses, and frontal alpha. J. Neurosci. 2015;35(44):14691–14701. doi: 10.1523/JNEUROSCI.2243-15.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kayser S.J., McNair S.W., Kayser C. Prestimulus influences on auditory perception from sensory representations and decision processes. Proc. Natl. Acad. Sci. USA. 2016;113(17):4842–4847. doi: 10.1073/pnas.1524087113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kelly S.P., Lalor E.C., Reilly R.B., Foxe J.J. Increases in alpha oscillatory power reflect an active retinotopic mechanism for distracter suppression during sustained visuospatial attention. J. Neurophysiol. 2006;95(6):3844–3851. doi: 10.1152/jn.01234.2005. [DOI] [PubMed] [Google Scholar]
- Keitel C., Quigley C., Ruhnau P. Stimulus-driven brain oscillations in the alpha range: entrainment of intrinsic rhythms or frequency-following response? J. Neurosci. 2014;34(31):10137–10140. doi: 10.1523/JNEUROSCI.1904-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klimesch W., Doppelmayr M., Russegger H., Pachinger T., Schwaiger J. Induced alpha band power changes in the human EEG and attention. Neurosci. Lett. 1998;244(2):73–76. doi: 10.1016/s0304-3940(98)00122-0. [DOI] [PubMed] [Google Scholar]
- Kochanski G., Grabe E., Coleman J., Rosner B. Loudness predicts prominence: fundamental frequency lends little. J. Acoust. Soc. Am. 2005;118(2):1038–1054. doi: 10.1121/1.1923349. [DOI] [PubMed] [Google Scholar]
- Lakatos P., Karmos G., Mehta A.D., Ulbert I., Schroeder C.E. Entrainment of neuronal oscillations as a mechanism of attentional selection. Science. 2008;320(5872):110–113. doi: 10.1126/science.1154735. [DOI] [PubMed] [Google Scholar]
- Lakatos P., O'Connell M.N., Barczak A., Mills A., Javitt D.C., Schroeder C.E. The leading sense: supramodal control of neurophysiological context by attention. Neuron. 2009;64(3):419–430. doi: 10.1016/j.neuron.2009.10.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- de Lange F.P., Jensen O., Bauer M., Toni I. Interactions between posterior gamma and frontal alpha/beta oscillations during imagined actions. Front. Hum. Neurosci. 2008:2. doi: 10.3389/neuro.09.007.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewis A.G., Bastiaansen M. A predictive coding framework for rapid neural dynamics during sentence-level language comprehension. Cortex. 2015;68:155–168. doi: 10.1016/j.cortex.2015.02.014. [DOI] [PubMed] [Google Scholar]
- Lewis A.G., Wang L., Bastiaansen M. Fast oscillatory dynamics during language comprehension: unification versus maintenance and prediction? Brain Lang. 2015;148:51–63. doi: 10.1016/j.bandl.2015.01.003. [DOI] [PubMed] [Google Scholar]
- Lundqvist M., Herman P., Lansner A. Effect of prestimulus alpha power, phase, and synchronization on stimulus detection rates in a biophysical attractor network model. J. Neurosci. 2013;33(29):11817–11824. doi: 10.1523/JNEUROSCI.5155-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo H., Poeppel D. Phase patterns of neuronal responses reliably discriminate speech in human auditory cortex. Neuron. 2007;54(6):1001–1010. doi: 10.1016/j.neuron.2007.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maguire E.A., Frith C.D., Morris R.G. The functional neuroanatomy of comprehension and memory: the importance of prior knowledge. Brain. 1999;122:1839–1850. doi: 10.1093/brain/122.10.1839. [DOI] [PubMed] [Google Scholar]
- Maris E., Oostenveld R. Nonparametric statistical testing of EEG- and MEG-data. J. Neurosci. Methods. 2007;164(1):177–190. doi: 10.1016/j.jneumeth.2007.03.024. [DOI] [PubMed] [Google Scholar]
- Meyer L., Henry M.J., Gaston P., Schmuck N., Friederici A.D. Linguistic bias modulates interpretation of speech via neural delta-band oscillations. Cereb. Cortex. 2016 doi: 10.1093/cercor/bhw228. [DOI] [PubMed] [Google Scholar]
- Meyer M., Alter K., Friederici A.D., Lohmann G., von Cramon D.Y. FMRI reveals brain regions mediating slow prosodic modulations in spoken sentences. Human Brain Mapp. 2002;17(2):73–88. doi: 10.1002/hbm.10042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moran R.J., Campo P., Maestu F., Reilly R.B., Dolan R.J., Strange B.A. Peak frequency in the theta and alpha bands correlates with human working memory capacity. Front. Hum. Neurosci. 2010;4:200. doi: 10.3389/fnhum.2010.00200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morillon B., Hackett T.A., Kajikawa Y., Schroeder C.E. Predictive motor control of sensory dynamics in auditory active sensing. Curr. Opin. Neurobiol. 2015;31:230–238. doi: 10.1016/j.conb.2014.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morillon B., Schroeder C.E., Wyart V. Motor contributions to the temporal precision of auditory attention. Nat. Commun. 2014;5:5255. doi: 10.1038/ncomms6255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Muthukumaraswamy S.D., Singh K.D. A cautionary note on the interpretation of phase-locking estimates with concurrent changes in power. Clin. Neurophysiol. 2011;122(11):2324–2325. doi: 10.1016/j.clinph.2011.04.003. [DOI] [PubMed] [Google Scholar]
- Neuling T., Rach S., Wagner S., Wolters C.H., Herrmann C.S. Good vibrations: oscillatory phase shapes perception. Neuroimage. 2012;63(2):771–778. doi: 10.1016/j.neuroimage.2012.07.024. [DOI] [PubMed] [Google Scholar]
- Ng B.S., Logothetis N.K., Kayser C. EEG phase patterns reflect the selectivity of neural firing. Cereb. Cortex. 2013;23(2):389–398. doi: 10.1093/cercor/bhs031. [DOI] [PubMed] [Google Scholar]
- Ng B.S., Schroeder T., Kayser C. A precluding but not ensuring role of entrained low-frequency oscillations for auditory perception. J. Neurosci. 2012;32(35):12268–12276. doi: 10.1523/JNEUROSCI.1877-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nolte G. The magnetic lead field theorem in the quasi-static approximation and its use for magnetoencephalography forward calculation in realistic volume conductors. Phys. Med. Biol. 2003;48(22):3637–3652. doi: 10.1088/0031-9155/48/22/002. [DOI] [PubMed] [Google Scholar]
- Obleser J., Eisner F., Kotz S.A. Bilateral speech comprehension reflects differential sensitivity to spectral and temporal features. J. Neurosci. 2008;28(32):8116–8123. doi: 10.1523/JNEUROSCI.1290-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Obleser J., Herrmann B., Henry M.J. Neural oscillations in speech: don't be enslaved by the envelope. Front. Hum. Neurosci. 2012;6:250. doi: 10.3389/fnhum.2012.00250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Obleser J., Weisz N. Suppressed alpha oscillations predict intelligibility of speech and its acoustic details. Cereb. Cortex. 2012;22(11):2466–2477. doi: 10.1093/cercor/bhr325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Obleser J., Wöstmann M., Hellbernd N., Wilsch A., Maess B. Adverse listening conditions and memory load drive a common alpha oscillatory network. J. Neurosci. 2012;32(36):12376–12383. doi: 10.1523/JNEUROSCI.4908-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oostenveld R., Fries P., Maris E., Schoffelen J.M. FieldTrip: open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput. Intell. Neurosci. 2011;2011:156869. doi: 10.1155/2011/156869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park H., Ince R.A., Schyns P.G., Thut G., Gross J. Frontal top-down signals increase coupling of auditory low-frequency oscillations to continuous speech in human listeners. Curr. Biol. 2015;25(12):1649–1653. doi: 10.1016/j.cub.2015.04.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peelle J.E., Davis M.H. Neural oscillations carry speech rhythm through to comprehension. Front. Psychol. 2012;3:320. doi: 10.3389/fpsyg.2012.00320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peelle J.E., Gross J., Davis M.H. Phase-locked responses to speech in human auditory cortex are enhanced during comprehension. Cereb. Cortex. 2013;23(6):1378–1387. doi: 10.1093/cercor/bhs118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pineda J.A. The functional significance of mu rhythms: translating "seeing" and "hearing" into "doing". Brain Res. Brain Res. Rev. 2005;50(1):57–68. doi: 10.1016/j.brainresrev.2005.04.005. [DOI] [PubMed] [Google Scholar]
- Poeppel D. The analysis of speech in different temporal integration windows: cerebral lateralization as 'asymmetric sampling in time'. Speech Commun. 2003;41(1):245–255. [Google Scholar]
- Poeppel D. The neuroanatomic and neurophysiological infrastructure for speech and language. Curr. Opin. Neurobiol. 2014;28:142–149. doi: 10.1016/j.conb.2014.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pulvermüller F., Huss M., Kherif F., Martin F.M.D.P., Hauk O., Shtyrov Y. Motor cortex maps articulatory features of speech sounds. Proc. Natl. Acad. Sci. USA. 2006;103(20):7865–7870. doi: 10.1073/pnas.0509989103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raghavachari S., Kahana M.J., Rizzuto D.S., Caplan J.B., Kirschen M.P., Bourgeois B., Madsen J.R., Lisman J.E. Gating of human theta oscillations by a working memory task. J. Neurosci. 2001;21(9):3175–3183. doi: 10.1523/JNEUROSCI.21-09-03175.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rauschecker J.P., Scott S.K. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat. Neurosci. 2009;12(6):718–724. doi: 10.1038/nn.2331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sauseng P., Griesmayr B., Freunberger R., Klimesch W. Control mechanisms in working memory: a possible function of EEG theta oscillations. Neurosci. Biobehav. Rev. 2010;34(7):1015–1022. doi: 10.1016/j.neubiorev.2009.12.006. [DOI] [PubMed] [Google Scholar]
- Schomers M.R., Kirilina E., Weigand A., Bajbouj M., Pulvermuller F. Causal influence of articulatory motor cortex on comprehending single spoken words: TMS evidence. Cereb. Cortex. 2015;25(10):3894–3902. doi: 10.1093/cercor/bhu274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schreiber T. Measuring information transfer. Phys. Rev. Lett. 2000;85(2):461–464. doi: 10.1103/PhysRevLett.85.461. [DOI] [PubMed] [Google Scholar]
- Schroeder C.E., Lakatos P. Low-frequency neuronal oscillations as instruments of sensory selection. Trends Neurosci. 2009;32(1):9–18. doi: 10.1016/j.tins.2008.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scott S., McGettigan C. Amplitude onsets and spectral energy in perceptual experience. Front. Psychol. 2012;3:80. doi: 10.3389/fpsyg.2012.00080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shapiro B.E., Danly M. The role of the right hemisphere in the control of speech prosody in propositional and affective contexts. Brain Lang. 1985;25(1):19–36. doi: 10.1016/0093-934x(85)90118-x. [DOI] [PubMed] [Google Scholar]
- Simon O., Mangin J.F., Cohen L., Le Bihan D., Dehaene S. Topographical layout of hand, eye, calculation, and language-related areas in the human parietal lobe. Neuron. 2002;33(3):475–487. doi: 10.1016/s0896-6273(02)00575-5. [DOI] [PubMed] [Google Scholar]
- Smalle E.H., Rogers J., Mottonen R. Dissociating contributions of the motor cortex to speech perception and response bias by using transcranial magnetic stimulation. Cereb. Cortex. 2015;25(10):3690–3698. doi: 10.1093/cercor/bhu218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith Z.M., Delgutte B., Oxenham A.J. Chimaeric sounds reveal dichotomies in auditory perception. Nature. 2002;416(6876):87–90. doi: 10.1038/416087a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sokal R.R., Rohlf F.J. WH Freeman and Company; New York: 1995. Biometry. The Principles and Practice of Statistics in Biological Research. [Google Scholar]
- Stoll F.M., Wilson C.R.E., Faraut M.C.M., Vezoli J., Knoblauch K., Procyk E. The effects of cognitive control and time on frontal beta oscillations. Cereb. Cortex. 2016;26(4):1715–1732. doi: 10.1093/cercor/bhv006. [DOI] [PubMed] [Google Scholar]
- Strauss A., Henry M.J., Scharinger M., Obleser J. Alpha phase determines successful lexical decision in noise. J. Neurosci. 2015;35(7):3256–3262. doi: 10.1523/JNEUROSCI.3357-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strauss A., Kotz S.A., Scharinger M., Obleser J. Alpha and theta brain oscillations index dissociable processes in spoken word recognition. Neuroimage. 2014;97:387–395. doi: 10.1016/j.neuroimage.2014.04.005. [DOI] [PubMed] [Google Scholar]
- Strauss A., Wöstmann M., Obleser J. Cortical alpha oscillations as a tool for auditory selective inhibition. Front. Hum. Neurosci. 2014;8:350. doi: 10.3389/fnhum.2014.00350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Szymanski F.D., Rabinowitz N.C., Magri C., Panzeri S., Schnupp J.W. The laminar and temporal structure of stimulus information in the phase of field potentials of auditory cortex. J. Neurosci. 2011;31(44):15787–15801. doi: 10.1523/JNEUROSCI.1416-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tzourio-Mazoyer N., Landeau B., Papathanassiou D., Crivello F., Etard O., Delcroix N., Mazoyer B., Joliot M. Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage. 2002;15(1):273–289. doi: 10.1006/nimg.2001.0978. [DOI] [PubMed] [Google Scholar]
- van de Vijver I., Ridderinkhof K.R., Cohen M.X. Frontal oscillatory dynamics predict feedback learning and action adjustment. J. Cogn. Neurosci. 2011;23(12):4106–4121. doi: 10.1162/jocn_a_00110. [DOI] [PubMed] [Google Scholar]
- Van Veen B.D., van Drongelen W., Yuchtman M., Suzuki A. Localization of brain electrical activity via linearly constrained minimum variance spatial filtering. IEEE Trans. Biomed. Eng. 1997;44(9):867–880. doi: 10.1109/10.623056. [DOI] [PubMed] [Google Scholar]
- Vigneau M., Beaucousin V., Herve P.Y., Duffau H., Crivello F., Houde O., Mazoyer B., Tzourio-Mazoyer N. Meta-analyzing left hemisphere language areas: phonology, semantics, and sentence processing. Neuroimage. 2006;30(4):1414–1432. doi: 10.1016/j.neuroimage.2005.11.002. [DOI] [PubMed] [Google Scholar]
- Wilson S.M., Saygin A.P., Sereno M.I., Iacoboni M. Listening to speech activates motor areas involved in speech production. Nat. Neurosci. 2004;7(7):701–702. doi: 10.1038/nn1263. [DOI] [PubMed] [Google Scholar]
- Wöstmann M., Herrmann B., Wilsch A., Obleser J. Neural alpha dynamics in younger and older listeners reflect acoustic challenges and predictive benefits. J. Neurosci. 2015;35(4):1458–1467. doi: 10.1523/JNEUROSCI.3250-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu L., Thompson C.S., Pfingst B.E. Relative contributions of spectral and temporal cues for phoneme recognition. J Acoust. Soc. Am. 2005;117(5):3255–3267. doi: 10.1121/1.1886405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zatorre R.J., Meyer E., Gjedde A., Evans A.C. PET studies of phonetic processing of speech: review, replication, and reanalysis. Cereb. Cortex. 1996;6(1):21–30. doi: 10.1093/cercor/6.1.21. [DOI] [PubMed] [Google Scholar]
- Zeng F.G., Nie K., Stickney G.S., Kong Y.Y., Vongphoe M., Bhargave A., Wei C., Cao K. Speech recognition with amplitude and frequency modulations. Proc. Natl. Acad. Sci. USA. 2005;102(7):2293–2298. doi: 10.1073/pnas.0406460102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zoefel B., VanRullen R. The role of high-level processes for oscillatory phase entrainment to speech sound. Front. Hum. Neurosci. 2015;9:651. doi: 10.3389/fnhum.2015.00651. [DOI] [PMC free article] [PubMed] [Google Scholar]