Abstract
Language tasks require the coordinated activation of multiple subnetworks—groups of related cortical interactions involved in specific components of task processing. Although electrocorticography (ECoG) has sufficient temporal and spatial resolution to capture the dynamics of event-related interactions between cortical sites, it is difficult to decompose these complex spatiotemporal patterns into functionally discrete subnetworks without explicit knowledge of each subnetwork’s timing. We hypothesized that subnetworks corresponding to distinct components of task-related processing could be identified as groups of interactions with co-varying strengths. In this study, five subjects implanted with ECoG grids over language areas performed word repetition and picture naming. We estimated the interaction strength between each pair of electrodes during each task using a time-varying dynamic Bayesian network (tvDBN) model constructed from the power of high gamma (70–110 Hz) activity, a surrogate for population firing rates. We then reduced the dimensionality of this model using principal component analysis (PCA) to identify groups of interactions with co-varying strengths, which we term functional network components (FNCs). This data-driven technique estimates both the weight of each interaction’s contribution to a particular subnetwork, and the temporal profile of each subnetwork’s activation during the task. We found FNCs with temporal and anatomical features consistent with articulatory preparation in both tasks, and with auditory and visual processing in the word repetition and picture naming tasks, respectively. These FNCs were highly consistent between subjects with similar electrode placement, and were robust enough to be characterized in single trials. Furthermore, the interaction patterns uncovered by FNC analysis correlated well with recent literature suggesting important functional-anatomical distinctions between processing external and self-produced speech. Our results demonstrate that subnetwork decomposition of event-related cortical interactions is a powerful paradigm for interpreting the rich dynamics of large-scale, distributed cortical networks during human cognitive tasks.
Keywords: ECoG, language tasks, speech processing, functional connectivity, high gamma, subnetworks
1. Introduction
Converging evidence from lesion analysis, stimulation mapping, and functional neuroimaging has shown that the topography of human language cortex is widely distributed over large-scale networks in the temporal, parietal, and frontal lobes (Broca, 1865; Ojemann et al., 1989; Indefrey and Levelt, 2000; Hickok and Poeppel, 2004; Indefrey and Levelt, 2004). Nevertheless, it is difficult to map the dynamics of interactions between these regions during specific language tasks with current methodologies. Diffusion tensor imaging (DTI) has facilitated detailed estimation of the anatomical connections between language areas (Glasser and Rilling, 2008). The presence of physical connections, however, provides limited information about how those connections are utilized over the course of a task. Functional magnetic resonance imaging (fMRI) has uncovered robust correlations between the metabolic activities of distant brain regions in the absence of a task, providing insight into the functional network architecture of the brain’s resting state (Biswal et al., 1995). However, the slow time scale of these metabolic interactions—on the order of hundreds of milliseconds—and the low temporal resolution of fMRI acquisition make it ill-suited for probing the rich dynamics of event-related cortical interactions during relatively fast activities like object recognition or speech.
Electrocorticography (ECoG) has become a powerful tool for characterizing the dynamics of cortical activation during cognitive tasks with high temporal resolution, on the order of tens of milliseconds. In particular, activity in the high gamma band (> 70 Hz) has been shown to reflect changes in cortical population firing rates (Manning et al., 2009; Ray et al., 2008; Miller, 2010), even in single trials (Flinker et al., 2010). These features make ECoG an ideal modality to capture the event-related interaction (ERI) between cortical sites during language processing. Although ERI can be characterized with ECoG using a variety of methods—for example, changes in measures of correlated activity (Friston et al., 1994; Friston, 2011), or parameters from an imposed model of system dynamics (Winterhalder et al., 2005)—these estimates are inherently high-dimensional regardless of the method used, as every site-site pair produces a new signal depicting the strength of that pair’s interaction over time; hence, a set of n ECoG recordings yields n2 − n ERI time series. When each interaction between cortical sites is considered as an independently modulating signal, this high dimensionality presents a considerable challenge for visualization and interpretation of ECoG ERI results.
Recent studies of the neurobiology of language have provided converging evidence for discrete subnetworks of cortex responsible for distinct stages of task computation (Indefrey and Levelt, 2004; Indefrey, 2011; Hickok and Poeppel, 2007). Given this framework, a method capable of parcellating the large array of ERI signals into groups of functionally related interactions would dramatically aid the interpretability of interaction data. Rather than visualizing the strength of each site-site interaction at every point in time, one may see ERI as the activity of a small number of subnetworks, each possessing two complementary representations: (1) a static spatial architecture showing which site-site interactions are implicated in that subnetwork, and (2) a temporal profile of the subnetwork’s aggregate activation during a particular task. Such a decomposition might help elucidate the functional roles of individual sites in task completion. For example, sites that participate in multiple subnetworks might be identified as “bridge” or “connector” nodes, responsible for passing information between stages of task processing (Bullmore and Sporns, 2009).
Previous studies using ERI estimates found that many of the interactions between ECoG sites tend to modulate together over the course of a task (Korzeniewska et al., 2008, 2011; Benz et al., 2012a). We therefore hypothesized that subnetworks corresponding to distinct computational stages could be identified as groups of interactions with similarly timed task-related activation, which we term “function al network components” (FNCs). Here we present a simple method for computing these FNCs based on principal component analysis, a common dimensionality reduction technique that finds weighted combinations of input signals that account for the greatest input variance. We show that this data-driven method yields subnetwork estimates that are spatially and temporally consistent with current models of the neurophysiology of human language, are consistent between individuals with similar coverage, and are robust enough to be observed in single trials.
2. Material and methods
2.1. Study participation and data collection
In this study, we examined 5 subjects with drug-resistant epilepsy undergoing ECoG monitoring for seizure localization prior to resection surgery, whose electrode placement was determined solely by clinical necessity. Subjects were selected for inclusion in this study if their implants covered perisylvian and basal temporal cortices. All subjects provided informed consent before testing began in accordance with our experimental protocol, as approved by the Johns Hopkins Institutional Review Board. The locations of these electrodes relative to the subject’s anatomy were determined by co-registering a post-operative computed tomography scan with a pre-operative magnetic resonance imaging (MRI) scan using the BioImage suite (Duncan et al., 2004; Papademetris et al., 2005). Figure 1 shows these co-registered locations on volumetric MRI reconstructions for all 5 subjects, while Table 1 summarizes the subjects’ demographic information.
Table 1.
Subject | Age (years) | Age of seizure onset (years) | Gender | Language dominance | HG-active electrodes* | Recording system |
---|---|---|---|---|---|---|
1 | 55 | 45 | Male | Left | 21 | NeuroPort |
2 | 55 | 19 | Female | Left | 34 | NeuroPort |
3 | 47 | 8 | Male | Left | 24 | Stellate |
4 | 23 | 13 | Female | Left | 32 | Stellate |
5 | 13 | 15 months | Female | Bilateral | 41 | Nihon Kohden |
i.e., the number of electrodes that showed statistically significant high gamma modulation in either task, auditory word repetition or picture naming.
2.1.1. Data acquisition
ECoG signals were recorded using standard 2.3 mm diameter platinum-iridium electrodes with 1 cm center-to-center spacing (Adtech; Racine, WI) placed subdurally. These recordings were referenced to a single intracranial electrode to minimize extracranial sources of artifacts. The raw ECoG signals were recorded using a Stellate (Stellate Systems Inc.; Montréal, Canada), Nihon Kohden (Nihon Kohden Corporation; Tokyo, Japan) or NeuroPort (BlackRock Microsystems; Salt Lake City, UT) system as indicated in Table 1, which amplified and sampled the data at 1–30 kHz and applied analog anti-aliasing filters. Channels with excessive noise were identified visually by a clinical neurophysiologist and excluded from further analysis. Recordings of non-excluded ECoG channels for all subjects were then downsampled to 1 kHz offline, and were re-referenced to their common average to remove common mode noise artifacts and avoid spatial bias from the reference electrode.
2.1.2. Experimental design
Each subject performed two language-oriented tasks: picture naming and auditory word repetition. In the picture naming task, subjects were seated approximately 1 m from an LCD monitor, which presented visual stimuli in a random sequence from a collection of 100 distinct color pictures of objects on a grey background, in two blocks of 50 trials each, using E-Prime (Psychology Software Tools; Sharpsburg, PA). The depicted objects, which were matched for usage frequency in English, spanned a broad array of object categories, including tools, food items, animals, plants, body parts, structures, building parts, common household items, clothing, vehicles, jewelry, and instruments. Each picture was presented for one second, after which the subject would respond verbally with the name of the object. The inter-stimulus interval was selected at random from 6.5–8.2 s for each trial, in order to mitigate anticipatory artifacts. All 100 trials were considered in this analysis; for subject 1, this included 8 trials without a verbal response to the stimulus, as activiton patterns during non-response trials can provide an additional facet to distinguish cue- and response-related stages of processing.
In the word repetition task, subjects were played auditory word stimuli in a random sequence over standard commercial speakers (subject 3) or insert headphones via a high-end psychoacoustics workstation (all other subjects; model Z6A, Tucker-Davis Technologies; Alachua, FL); the word stimuli were identical to those used in, e.g., (Crone et al., 2001). Subjects were instructed to repeat the presented word verbally, after which a randomly chosen delay was applied, with the total inter - stimulus interval ranging from 7 – 8 s. Similar to the visual task, the stimuli were all unique, to avoid effects of repetition priming (Rodriguez Merzagora et al., 2014), and all trials were analyzed, leaving 58 (subject 3, due to time constraints, including 6 without a verbal response) or 116 (all other subjects) trials for analysis.
To later align the ECoG recordings with experimental cues, the stimulus onset and response onset times were marked in each trial for both tasks. We detected the onset of visual stimuli during picture naming by recording the thresholded output of a photodiode mounted on the computer monitor presenting stimuli. In each trial, the photodiode detected a reduction in luminance that occurred when a small black square appeared at the bottom left corner of the screen during presentation of the visual stimulus. This signal was passed into the same amplifier system as the ECoG data for trial alignment. We similarly measured the onset of acoustic stimuli during word repetition by transmitting a trigger pulse synchronized with the onset of the auditory stimuli to the recording system. In both tasks, a microphone was placed in front of the subject to record synchronized audio of verbal responses through the recording system’s analog input; these recordings were used to manually determine the response onset time for each trial.
2.2. Data analysis
Our analysis paradigm progressed in three broad stages: first, extraction of high gamma spectral features from the recorded ECoG data; second, computation of event-related cortical interactions from these features; and finally, estimation of subnetworks from these spatiotemporal interaction patterns. An overview of this pipeline is given in Supplementary Figure 1.
2.2.1. Spectral feature estimation
The common average–referenced ECoG signals were processed with a zero - phase finite impulse response bandpass filter with cutoffs at 70 Hz and 110 Hz to isolate high gamma activity and avoid line noise harmonics at 60 Hz and 120 Hz. These data were then separated into individual trials aligned to experimental cue onset, spanning from 1 s pre- to 3 s post-stimulus. The pre - cue window, used in constructing a base line null distribution for later statistical testing, was chosen to be as long as possible while avoiding any contamination from post - response activity in the previous trial. The relatively long post - cue analysis window was chosen to capture a large proportion of subjects’ responses, the latencies of which were particularly variable in subjects 1 and 3 (see Figure 4). It is notable that, in light of the multiple hypothesis correction used in later statistical testing, lengthening the analysis window could have also lowered the ensuing critical p - value thresholds, resulting in more conservative significance criteria for all time points. All trials were initially extracted with an additional 0.5 s of buffer data on either side to prevent edge effects at later stages of processing.
We then computed time-varying spectra for each trial in 128 ms overlapping windows with 16 ms spacing using the multi-taper windowing method—which resolves the statistical inconsistency of the classical Fourier transform spectral estimator (Thomson, 1982; Babadi and Brown, 2014)—with 5 tapers, as implemented in the open source Chronux toolbox (Bokil et al., 2010). As the noise distribution of spectral power was observed to be lognormal, consistent with previous efforts (Gasser et al., 1982), these spectra were log-transformed and averaged over frequencies from 70–110 Hz to produce a single high gamma activity time series for each channel. Each channel’s data was then normalized using the mean and standard deviation of the aggregate baseline data for that channel, to ensure zero-centering and consistent scaling of the baseline activity distribution.
2.2.2. Feature subset selection
For global interaction methods like the one used in this study, inclusion of features that do not carry meaningful information can substantially degrade estimates for the rest of the network, as the method tries to fit its interaction model to noise. We therefore decided to select a subset of the available ECoG electrodes to use for event-related interaction analysis in each subject.
To select this subset, we first examined which of the recorded channels had significant changes in post-stimulus high gamma band activity. We evaluated this in each channel by performing a t-test between the distribution of high gamma activity across trials for each time bin and that channel’s aggregate baseline distribution, correcting for a familywise error rate of 0.05 using the Bonferroni procedure (i.e., thresholding at p < 0.05/T, where T is the number of time points in the analysis window). Channels with at least one 16 ms time bin showing significant modulation at this level were labeled as “high gamma–active” during that task; the interaction montage selected for each subject was the union of the sets of high gamma–active channels for the two tasks. The interaction montages for the subjects in this study had between 21 and 41 (mean 30) channels, as shown in Table 1.
2.2.3. Estimation of event-related interaction
We used the time-varying dynamic Bayesian network (tvDBN) framework (Song et al., 2009) as a model for estimating interactions between sites. This model makes the assumption that the current system state is a linear, time-varying function of the state at a previous time point, at some fixed delay d. We treated the n baseline-normalized high gamma power time series, xi[t], as our state vector:
The tvDBN framework’s model of the time series’ dynamics was then, in matrix form,
where the Gaussian ε[t] error terms encompass observation noise and nonlinear behavior. This is similar in principle to a first-order multivariate autoregressive model, where the coefficients A are allowed to vary with time.
The time-varying matrix A[t] is interpreted as the instantaneous interaction strength of the system at time t, where the coefficient Aij is the influence from the jth channel at time t − d to the ith channel at time t. In the tvDBN model, this matrix is estimated row-wise at each time point using regression. Because the number of interaction coefficients being estimated generally exceeds the number of available data points, this problem by itself would be vastly underdetermined. To alleviate this, the tvDBN model makes some assumptions about the structure of the interaction matrices A[t]. First, and most important, the model imposes continuity of the underlying interaction matrix—that is, A[t] is assumed to vary relatively “smoothly” over time. This assumption allows us to estimate A[t] using data from time points near t, by weighting the observations using a kernel function ϕ. Second, self-interactions are excluded from the model, so that the diagonal elements of A[t] are all set to zero. Finally, the model incorporates a regularization penalty to avoid overfitting, given the low-data nature of the problem. Applying these considerations, and letting denote the space of row vectors with 0 in the ith position, we arrive at the following optimization problem for estimating each row i of A[t]:
where T is the total length of the signals, λ controls the degree of regularization imposed, and ║ · ║2 indicates the L2 vector norm. This is a weighted ridge regression problem, which we solved for each row at each time point using the standard closed-form solution (Phillips, 1962; Tikhonov, 1963). The original presentation of tvDBN in (Song et al., 2009) used the L1 norm for the regularization term, allowing their implementation to directly control the degree of sparseness (i.e., the number of nonzero elements) of the resultant A[t] matrices, as estimated using LASSO (Tibshirani, 1996). We opted instead to use the L2 variant, as solving this problem is considerably more computationally efficient than LASSO. This method does not directly impose sparseness, but instead reduces the potential for overfitting by penalizing solutions with large norms, and hence encouraging most fitted coefficients to be small. Specifically, L2 regularization is equivalent to imposing a zero-mean Gaussian prior on the regressed estimates of A[t], with variance controlled by λ.
Consistent with prior studies using tvDBN in ECoG (Benz et al., 2012b), we used a regularization parameter λ of 100, and a Gaussian kernel for weighting observations:
with a kernel width parameter h of 5 and a prediction lag d of 1 sample. To improve computation time, we truncated the kernel to a total width of 4h samples, as the kernel weights become very small far from the center, and have negligible impact on the regression output.
Our selected prediction lag d of a single sample corresponds to a delay of 16 ms in the transfer of high gamma information between cortical sites. It is unlikely that this single choice reflects the relative timing of all causal influences between cortical regions, and a more comprehensive model would incorporate state information from multiple lags to maximize descriptive power; however, we opted to use the first-order tvDBN model to preserve interpretability of the regressed parameters, and to ensure stable fitting of model coefficients in single trials, as outlined in the Discussion. Also, while the tvDBN model is manifestly directional in theory, and while prior work has shown that this method produces accurate interaction estimates, we observed that these parameter choices resulted in A[t] matrices that were nearly symmetric. Although our choice of h was informed by prior successful decoding work, it is possible that this kernel was too broad, and as a result smeared the directionality of information flow. To avoid confusion and prevent misinterpretation of these results, in all depictions of tvDBN estimates in this study, we have symmetrized the interaction matrices and omitted any indication of directionality; i.e., for a tvDBN coefficient matrix A[t], the results here depict
Finally, it is important to acknowledge that in the scheme presented here, as is standard for offline analysis, both the window for spectral estimation and the kernel for interaction estimation were non-causal. Hence, while the computed activity is “centered” in time (making features like time of peak activation unbiased), the smoothing induced by the windows may have caused the computed activity to attain statistical significance up to 150 ms before any physiological activity took place, since activity on the “future” edge of the window could appear at the window’s center.
We approximated the distribution of tvDBN interaction coefficients during the baseline period (1 s prior to cue onset) as a Gaussian, and estimated the mean and variance of this aggregate baseline distribution for each channel pair. We normalized the tvDBN data for each pair relative to that pair’s baseline mean and standard deviation, to ensure zero-centering and consistent scaling of the baseline interaction distributions. We then modeled the interaction strength for each channel pair at each time point as Gaussians, and estimated the mean and variance of each of these distributions, giving us an estimate of the event-related interaction (ERI) between sites over the course of each task. To assess the statistical significance of ERI, we performed a t-test between each interaction’s pre-cue baseline distribution and the same interaction’s distribution in each time bin. To account for multiple comparisons, we determined the threshold for significant ERI by correcting for a false discovery rate (FDR) of 0.05 across time points for each site - site pair using the Benjamini-Hochberg (BH) protocol (Benjamini and Hochberg, 1995).
2.2.4. Subnetwork identification
To identify groups of interactions with similarly timed modulation, we concatenated the post-stimulus ERI data for all trials, and performed principal component analysis (PCA) on this aggregate ERI dataset (Hotelling, 1933). PCA iteratively identifies linear combinations of input signals, called principal components (PCs), that account for maximal amounts of input variance. This procedure results in a decomposition of the input signals with two complementary representations: spatial “loading” vectors, which show the contribution of each input signal to each PC; and temporal “score” time series, which show the modulation of each PC obtained by applying the loadings to the original data.
PCA, in its pursuit to explain the most variance, tends to collapse groups of signals that are highly anticorrelated by assigning the groups opposite - signed weights. This property was undesirable in the context of the present study, where the aim was to characterize groups of related interactions implicated in specific stages of task processing; indeed, any pair of processing stages with distinct temporal envelopes will naturally have activation signatures that are anticorrelated. In such a setting, PCA maximizes parsimony at the potential expense of physical meaning. To ameliorate this, we constructed two “signed PCs” from each computed PC: one from the positive weights with the negative weights set to zero, and the other from the absolute value of the negative weights with the positive weights set to zero. These signed PCs isolated the portions of the raw PC that positively co-varied. A concrete example of this behavior is shown in Figure 2 while the raw PC (orange in C) captures more variance than either of its signed constituents (green and blue), sign separation better elucidates the apparent subnetwork structure.
It is important to recognize that “signed PCs” do not possess many of the desirable linear algebraic properties of their unsigned counterparts. First, being a transformation from ℝn to ℝ2n there is no well-defined inverse mapping from signed PC–space back to interaction-space; as such, the common practice of back-transforming calculations done in PC-space must be avoided when working with signed PCs. Second, the signed PCs are not orthonormal; this, for example, makes it impossible to generalize the familiar notion of “explained variance” to signed PCs.
Although PCA can in theory extract as many PCs as there are input signals (and hence, twice as many signed PCs), in the case of ECoG interaction data, the vast majority of these PCs have an incredibly low signal-to-noise ratio, and hence do not carry discernible task-related information. To determine which signed PCs modulated significantly during the tasks, we used the same statistical paradigm described above for determining the significance of ERI, but replaced the tvDBN interaction data with the signed PC score time series. Once again, the threshold for significant modulation was determined by correcting for an FDR of 0.05 across time points for each signed PC using the BH protocol. For simplicity, we call signed PCs of ERI with significant activation during a task functional network components (FNCs). These FNCs provide estimates of the spatial (PC loadings) and temporal (PC scores) profiles of the activated subnetworks. In each of the subjects tested, we found only 3 to 7 activated FNCs per task, a dramatic reduction from the 420 to 1640 computed ERI signals.
“Mixing”—where one computed FNC had a strong spatial and temporal similarity to another FNC, or combination of FNCs—was evident in several of the examined subjects, owing to the loss of orthonormality in the signed PCs alluded to above. For clarity, we have attempted to omit duplicates of mixed FNCs. Groups exhibiting mixing were found by analyzing the Pearson’s r correlation between the weights of each pair of FNCs; pairs with r > 0.25 were considered mixed. All of the mixed FNCs fell into one of two categories: either one FNC was spatially a strict subset of another FNC, or an FNC was a combination of two other FNCs. In the former case, the “subset” FNC was not considered; in the latter, the two constituent FNCs were kept while the mixture was excluded from further analysis. An example of the combination case is shown in Figure 3; as the underlying behavior is captured by the constituent FNCs, the mixed combination does not lend much interpretational benefit. Removing mixed FNCs left the examined subjects with 3 to 4 remaining unmixed FNCs per task.
The essence of subnetwork decomposition lies in utilizing the covariance structure of ERI time series to form a more concise representation that separates the spatial and temporal facets of interaction dynamics. While PCA provides an apt first demonstration of this premise, it is important to note that there are a wide array of techniques available to supply such a decomposition, some of which may be more appropriate than PCA in certain contexts. For example, while the time-varying dynamic Bayesian network model used in this report can yield negative values, many interaction measures—e.g., spectral coherence—are constrained to be strictly non-negative. When working with such measures, the family of methods known collectively as nonnegative matrix factorization (NMF) may provide a more suitable decomposition than PCA (Lee and Seung, 1999, 2001). As its name suggests, the NMF analogues of PC loadings (spatial weights) and scores (temporal activation profiles) are guaranteed to be non-negative, obviating the need for any sign-separation procedure.
2.2.5. Identification of significantly involved interactions
To approximate the null distribution of FNC weights, we imposed the null condition—i.e., we removed the covariance between interactions’ temporal profiles—by shuffling the ERI data across time independently for each channel pair. This procedure was repeated 500 times; in each iteration, we performed PCA on the result, and added the absolute values of the loadings for the 4 highest-variance PCs to the aggregate null distribution, as all of the observed activated FNCs were derived from the top 4 PCs of ERI. This null distribution was approximated by a Gaussian, and an interaction was considered significantly involved in an FNC if the absolute value of its FNC loading was above the 95th percentile of this distribution. FNCs which were so widely distributed as to have no significantly involved interactions were excluded from further analysis.
It is important to note that the unit normalization of the coefficients for each PC places important constraints on the distribution of FNC spatial weights. In particular, FNCs involving widely distributed networks will necessarily have lower spatial weights than FNCs involving a small number of interactions. Hence, while the spatial profile of an FNC shows the relative contributions of individual site-site interactions within that particular FNC, it does not provide a means of comparing the strengths of interactions across FNCs.
3. Results
3.1. Subtask specificity of FNCs
All of the subjects in this study exhibited three or four unmixed FNCs in each task. Figure 4 summarizes the spatial and temporal profiles of all observed FNCs for each subject and task; Supplementary Figure 2 provides a more detailed view of the anatomical composition of selected FNCs from all subjects in isolation.
Several of the observed FNCs showed marked anatomical and temporal similarities across subjects. As shown in the left column of Figure 4, during the picture naming task, all subjects exhibited an FNC that included interactions within ventral occipital-temporal cortex (VOTC, including fusiform gyrus), and between VOTC and perisylvian cortices, including sensorimotor cortex (SMC) and Broca’s area (blue edges in anatomical plots, left; blue temporal plots accompanied by squares, right). As expected given VOTC’s known role in visual processing (Moore and Price, 1999; Allison et al., 1999), these FNCs were active shortly after cue presentation, becoming significant at 50–150 ms and peaking in activation at 200–600 ms. In all subjects, an additional FNC was present with interactions between posterior superior temporal gyrus (pSTG) or supramarginal gyrus (SMG) and either sensorimotor cortex, Broca’s area, or both (red; traces accompanied by triangles). These FNCs attained peak activation after the median latency of the subject’s spoken response, consistent with the involvement of pSTG and SMG, which have been shown to participate in the perception of speech (Celsis et al., 1999; Dehaene-Lambertz et al., 2005).
In auditory word repetition (right column of Figure 4), all subjects displayed an FNC consisting primarily of interactions within pSTG, with activity very early after cue onset (green; traces accompanied by circles). While these intra-pSTG FNCs attained absolute peak activation at 150–500 ms, all showed activity late in the task as well, with those from subjects 1, 3, and 4 attaining a second, much smaller peak after the median response latency. This temporal pattern—including both post-cue and post-response modulation—is again expected given pSTG’s known role in speech perception. Four of the five subjects exhibited an FNC during word repetition incorporating interactions between pSTG and Broca’s area (brown in subjects 1, 2, 3, and 5; traces accompanied by stars). These FNCs attained peak activation at an intermediate stage of the task—after the peak of that subject’s intra-pSTG FNC, but before the median response time. Subjects 2, 4, and 5 had an FNC primarily composed of interactions within sensorimotor cortex (purple; traces accompanied by crosses). These FNCs showed activity centered around the median response latency, consistent with sensorimotor involvement. An additional FNC was seen in three subjects involving interactions between SMC and Wernicke’s area (pink in subjects 1, 4, and 5; traces accompanied by diamonds). As in the case of the primarily intra-pSTG FNCs (green) described above, these SMC-Wernicke FNCs showed activation across both task epochs, with those from subjects 1 and 4 exhibiting a similar double-peaked pattern. However, unlike the intra-pSTG FNCs, which showed much stronger activation during the post-cue period, these FNCs were equally active in both epochs (subjects 1 and 4) or more active during the post-response period (subject 5).
Figure 5 provides a more detailed look at the FNCs from subject 1. In the picture naming task, subject 1 exhibited both of the common FNC spatial-temporal patterns described above: an early post-cue FNC consisting of intra-VOTC and VOTC–Broca’s area interactions (brain 1 with blue network edges in A; blue traces in B), and a later FNC incorporating interactions between sensorimotor cortex and pSTG peaking after the median verbal response time (brain 3; red). An additional FNC with primarily Broca’s area–SMC interactions (brain 2; yellow) peaked in activation at an intermediate latency. In word repetition, three common FNC patterns were evident as well: an intra-pSTG FNC with large post-cue and small post-response peaks (brain 4 with green network edges in C; green traces in B); an FNC linking pSTG and Broca’s area with intermediate timing (brain 5; brown); and a third FNC incorporating mostly SMC–pSTG interactions with post-cue and post-response activity of equal magnitude (brain 6; pink).
3.2. Robustness of FNC activation in single trials
The robust aggregate profile of FNC activation was echoed in the modulation of FNCs in single trials. In all of the tested subjects, each FNC displayed a characteristic timing profile relative to cue and/or response onset that was consistent across trials; these single-trial profiles are shown for selected FNCs from all subjects in Supplementary Figure 3. The timing of each FNC’s activation in individual trials fell into one of three broad categories: locked to cue, locked to response, or bimodal with peaks locked to both cue and response. FNCs that were structurally similar across subjects showed largely similar patterns of activation in single trials. For example, the FNCs from picture naming involving interactions with VOTC (blue in Figure 4 and Supplementary Figure 2) showed a consistent burst of activation locked to stimulus presentation (blue in Supplementary Figure 3). In contrast, FNCs from picture naming that primarily included interactions between pSTG and SMC (red in Figure 4 and Supplementary Figure 2) exhibited strong coupling in all subjects to the epoch directly after response onset, with a more or less fixed duration of activation. The two distinct timing patterns seen for FNCs from word repetition incorporating pSTG–Broca’s area interactions (brown in Supplementary Figure 3) appeared to reflect the degree to which the FNCs also incorporated Broca-SMC interactions (brown in Supplementary Figure 2).
The rasters in Figure 6, displaying the modulation of subject 1’s FNCs in single trials, give a representative demonstration of the different temporal profiles described above, and provide more detailed insight into the task-related dynamics of FNC activation. The first FNC from picture naming (blue, top-left), which involved intra-VOTC and VOTC–Broca’s area interactions (anatomy in Figure 5, top-left), was clearly time-locked to cue presentation, and stayed active for a fixed duration of ~600 ms regardless of response latency. The yellow FNC from picture naming, which spanned SMC and Broca’s area (top-middle), was also time-locked to the cue, beginning 100–200 ms after the VOTC-involved FNC became active; however, rather than having a fixed duration of activation, this FNC exhibited activity until roughly 300 ms after response onset in each trial. The red FNC from picture naming, incorporating interactions between Broca’s area, SMC, and pSTG (top-right), showed strong coupling to response onset, and remained active throughout the duration of the subject’s response. It is interesting to note that in trials where the subject did not respond (bottom of the rasters), the blue VOTC-Broca FNC showed consistent activation, the yellow SMC-Broca FNC showed discernible but diminished activation, and the red pSTG-involved FNC showed little to no activation.
During auditory word repetition, the green FNC, incorporating primarily intra-pSTG interactions (anatomy in Figure 5, bottom-left), showed a bimodal activation pattern, with peaks occurring after both cue and response onsets (Figure 6, bottom-left), the former being considerably stronger than the latter in all trials. The brown FNC, spanning pSTG and Broca’s area (bottommiddle), became active primarily in the epoch between the green intra-pSTG FNC’s activation and the subject’s response onset. The pink FNC, spanning SMC, pSTG, and Broca’s area (bottomright), showed two bursts of activity: one after the green intra-pSTG FNC’s activation (leading into the subject’s response), and one starting at response onset and continuing for ~500 ms. Interestingly, while the green intra-pSTG FNC’s bursts of activation were delayed by ~100 ms relative to the onset of the auditory cue or the subject’s spoken response, the FNCs incorporating SMC-Pstg interactions from both word repetition (pink) and picture naming (red) showed response-related activation considerably earlier, with the latter becoming active up to 200 ms pre-response in some trials.
4. Discussion
4.1. FNCs identify subnetworks for different stages of task-related processing
Our hypothesis—that functional network components identify groups of related cortical interactions involved in particular stages of task-related processing—is supported by the high degree of consistency between the spatial-temporal profiles of FNCs and current neurobiological models of speech production. As detailed in Section 3.1, during the picture naming task, all subjects exhibited an FNC consisting of interactions within ventral occipital-temporal cortex, as well as between VOTC and Broca’s area and/or sensorimotor cortex (blue/squares in Figure 4 and Supplementary Figure 2); these FNCs showed early cue-locked activation, exemplified in individual trials in Figure 6 and Supplementary Figure 3 (blue). This profile is expected given the involvement of VOTC, which is known to play a role in visual object processing (Moore and Price, 1999; Allison et al., 1999); the prominent VOTC-Broca interactions may indicate that Broca’s area incorporates the results of this processing during lexical retrieval (Flinker et al., 2015). All subjects also had an FNC during picture naming with interactions between posterior superior temporal gyrus or supramarginal gyrus and either sensorimotor cortex, Broca’s area, or both (red/triangles in Figure 4 and Supplementary Figure 2), with timing strongly aligned to verbal response onset (red in Figure 6 and Supplementary Figure 3). This alignment is consistent with pSTG’s role in auditory processing of speech (Price et al., 1992; Steinschneider et al., 2011; Leonard and Chang, 2014).
Auditory word repetition evoked its own set of robust FNC spatial-temporal profiles. The most prominent of these, seen in all subjects, primarily included interactions within pSTG (green/circles in Figure 4 and Supplementary Figure 2), with strong cue-aligned and weaker response-aligned activation (green in Figure 6 and Supplementary Figure 3). Again, this activation profile is consistent with pSTG’s likely role in processing both the external speech stimuli and the subjects’ spoken responses; the large disparity in activation observed between cue-related and response-related activity may reflect suppression of high gamma activity during self-generated speech (Flinker et al., 2010). Four subjects exhibited FNCs with pSTG–Broca’s area interactions (brown/stars in Figure 4 and Supplementary Figure 2) with intermediate timing, becoming active after the green intra-pSTG FNCs (e.g., Figure 6 and Supplementary Figure 3, brown). These FNCs correlate well with the predicted flow of information from phonological networks in pSTG and posterior middle temporal gyrus (pMTG) to networks for syllabification (Indefrey, 2011) or formation of articulatory codes (Hickok and Poeppel, 2007; Flinker et al., 2015) in Broca’s area. Three subjects had FNCs consisting primarily of intra-SMC interactions (purple/crosses in Figure 4 and Supplementary Figure 2); as expected, these FNCs show activation centered around response onset, consistent with articulatory execution.
Four subjects also exhibited FNCs including prominent SMC-pSTG and/or SMC-SMG interactions (pink/diamonds in Figure 4 and Supplementary Figure 6). These FNCs showed both post-cue and post-response activity like the green intra-pSTG FNCs, as exemplified in Figure 6 and Supplementary Figure 3 (pink); however, rather than being suppressed during the post-response period, these FNCs showed equal or greater activation for self-produced speech. This difference in subnetwork dynamics—between intra-pSTG FNCs that are preferential for the external speech cue, and FNCs with SMC–Wernicke’s area interactions with stronger activation during the subject’s own verbal response—is consistent with recent studies suggesting functional distinctions between processing external and self-generated speech (Towle et al., 2008; Chang et al., 2013; Kort et al., 2014). The inclusion of SMC-Wernicke interactions in these FNCs, as well as in the red response-aligned FNCs from picture naming, is consistent with previous studies in nonhuman primates showing motor-sensory interactions during vocalization (Eliades and Wang, 2003); these interactions may reflect auditory feedback for motor control (Houde and Chang, 2015) and/or feedforward motor inputs for auditory state prediction (Hickok, 2012). Interestingly, subject 5’s SMC-Wernicke FNCs (red and pink in Figure 4) involved distinct sites superior to those implicated in her cue-preferential intra-pSTG FNC (green); this correlates well with predictions of a spatially distinct “sensorimotor interface” for speech (Hickok et al., 2003; Hickok and Poeppel, 2007; Hickok et al., 2009).
To further test the validity of the estimated subnetworks and their interpretation, we examined the activation of FNCs during tasks for which they were not directly computed by applying the FNC loadings from one task to the event-related interaction data from the other task; the results of this procedure applied to subject 1 can be seen in the top-right and bottom-left plots of Figure 5. Remarkably, FNCs computed from one task showed robust profiles of activation when applied to the other task, providing more evidence for their roles in task processing. For example, in subject 1, the early VOTC-involved FNC from picture naming (brain 1 in A; blue traces in B) showed no significant modulation during the auditory repetition task, suggesting that that FNC’s activity was specific to visual processing. As another example, subject 1’s intra-pSTG FNC from word repetition (brain 4 in C; green traces in B) showed similar activation during the picture naming task to subject 1’s post-response FNC from picture naming (brain 3 in A; red traces in B), though the green intra-pSTG FNC attained significance and peak activation slightly later. Subject 1’s pSTG–Broca’s area FNC from word repetition (brain 5 in C; brown traces in B) showed slight intermediate activation during picture naming, perhaps reflecting access to phonology that was overshadowed in the FNC computation by the dependence of lexical retrieval on visual object processing.
Taken as a whole, these results support our original hypothesis: groups of cortico-cortical interactions with strong temporal covariance—as captured by FNC analysis—are indicative of discrete functional subnetworks, and may provide important insights into the dynamics of large-scale cortical networks during cognitive tasks.
4.2. FNCs can be estimated in a computationally efficient manner with single-trial robustness
There are a wide array of tools available for characterizing cortical interplay, spanning the gamut of complexity and interpretability (Winterhalder et al., 2005). In recent studies of interaction dynamics in ECoG, multivariate parametric methods—including partial directed coherence (Baccalá and Sameshima, 2001), the directed transfer function (Kamiński et al., 2001), and event related causality (Korzeniewska et al., 2008)—have come to the fore. The theoretical basis for these methods lies in determining the degree to which one signal “Granger-causes” another—that is, whether information about one signal in the past improves a predictive model of the other signal in the present (Granger, 1988). To this end, these methods approximate ECoG signal dynamics as a multivariate autoregressive (MVAR) model of some specified order p,
where the Ak matrices encode interactions between channels, and are used to make various inferences about the system, depending on the metric.
Cortical interactions are not static, however; hence, to be truly descriptive about interaction dynamics, a new MVAR model must be fit in each short time window. As this window is made shorter to provide higher time resolution, less data are available to inform the fitted model coefficients. As a result, for modest model order, montage size, and time resolution, the fit becomes unreliable unless the data are aggregated across trials, which is undesirable in many circumstances. As an alternative, the tvDBN model used in this study—which has shown great promise as an ECoG interaction estimator in motor tasks (Benz et al., 2012b)—ameliorates the data-volume problem through simplification: it reduces the complexity of the input data being analyzed by considering only power in the high gamma band, and reduces the complexity of the model by restricting to a first-order MVAR model, which can be robustly evaluated in single trials when subjected to regularization and smoothness conditions as described above. In this way, tvDBN can be seen as a local linear approximation of the multivariate high-gamma envelope dynamics.
PCA signal decomposition is relatively fast on modern hardware; hence, the FNC analysis method introduced here can be readily generalized to other interaction estimation methods. Combined with the relative computational ease of multi-taper spectral estimation and tvDBN interaction estimation, the entire pipeline demonstrated in this report is suitable for online implementation; indeed, even the PCA step need only be run once during a model “training” session, making online use of FNCs a simple linear weighting of interaction features. Our results have shown that FNCs demonstrate robust and informative activation in single trials. As such, FNC analysis on tvDBN ERI estimates is an ideal candidate for use in real-time applications such as control of a brain-machine interface.
4.3. Potential extensions of FNC analysis
Direct cortical stimulation in epilepsy surgery patients has shown that the topography of language cortex is highly variable across individuals, and that even within consistent language areas—e.g., pSTG and posterior inferior frontal gyrus in the dominant hemisphere—there are mosaics of cortical patches in which direct cortical stimulation interferes with language tasks (Ojemann, 2003). To minimize the cognitive side-effects of resection surgery, it is necessary to map the cortical sites crucial for language in each patient. Until recently, this was done exclusively with electrocortical stimulation mapping (ESM); however, important practical limitations of this method (Lesser et al., 1984, 1985; Blume et al., 2004; Pouratian et al., 2004; Hamberger, 2007) have motivated the development of passive ECoG recording and analysis techniques, which measure task-related high gamma responses, as a potential alternative (Lachaux et al., 2007; Miller et al., 2007; Schalk et al., 2008; Cheung and Chang, 2012; Qian et al., 2013). Although mapping of high gamma activation has shown promise, its adoption for routine clinical use has been inhibited by inconsistencies with ESM results (Wu et al., 2010; Cervenka et al., 2011; Ruescher et al., 2013; Bauer et al., 2013) and by methodological difficulties in comparing the post-operative outcomes of resections guided by ESM to those guided by activation-based mapping. Our current study was motivated by the hypothesis that the spatial-temporal profile of task-related cortical interactions provides additional insights into the the relative importance of individual sites for task processing.
FNC analysis may lend several interpretational benefits over ESM or maps of activation at individual sites. First, the participation of sites in particular FNCs provides information about which stages of task processing those sites are involved with. Given the variation in the functional anatomy of language, this information could lend more insight into the specific kinds of deficits risked by lesioning that site than the strength of the site’s task-related activation alone. Second, the location of a site within the FNC architecture may be indicative of that site’s importance in task processing relative to other activated sites. For instance, nodes with high centrality in an FNC may be seen as local “hubs” for a particular processing stage (Bullmore and Sporns, 2009; Newman et al., 2015); hence, resecting such central sites may be more likely to cause cognitive damage specific to that FNC’s processing role, while resecting sites that are more peripheral in the FNC could potentially carry a reduced risk of deficits. Similarly, sites that participate in multiple FNCs may facilitate communication between subnetworks, serving as the “bridge” or “connector” between computation stages; such sites, when resected, might also lead to a specific pattern of impairment. This sort of graph-theoretic analysis of cortical interaction data derived from ECoG has already shown great promise in the characterization of epileptogenic networks (Burns et al., 2012; Ortega et al., 2008; Wilke et al., 2011), and may prove equally informative when applied to task-related network structures like FNCs.
Network structure analysis has great potential outside of the clinic as well: a considerable amount of recent effort in neuroimaging has gone into characterizing the structure of cortical connections, in terms of both physical tracts (DTI) and correlated resting-state metabolic activity (fMRI) (van den Heuvel and Sporns, 2011; Collin et al., 2013). Indeed, the “rich club” nodes of such import to connectomics are presumed to lie at the intersection of network hubs (centers of local processing) and bridges (gateways of information flow). However, the poor time resolution of functional neuroimaging techniques hinders their utility in probing the dynamic exchanges of information that take place during a task. FNC analysis performed on ECoG high gamma activity has the potential to expand upon these noninvasive methods by not only identifying important network loci like hubs and bridges, but also by illuminating the temporal profile of their interactions over the course of a specific activity. Although beyond the scope of the present study, such a characterization of these loci would be an insightful application of the methods introduced here.
5. Conclusions
Mapping the event-related interactions between cortical sites during language tasks, such as picture naming and word repetition, holds great promise for expanding our understanding of the neurophysiology underlying these complex human behaviors. The difficulties of visualizing, analyzing, and interpreting the high volume of data inherent in these interactions can be mitigated by the theoretical framework of subnetwork decomposition, which seeks to identify groups of related interactions that can be treated as a cohesive functional unit. The approach presented in this report, functional network component analysis, is a simple yet informative first step toward this vision in mapping of the dynamics of task-related cortical interactions in the human brain with ECoG. FNCs capture both the strong interaction phenomena common to subjects with broadly similar anatomical coverage, as well as the more nuanced patterns of interactions specific to each subject’s electrode placement and cortical organization. This first demonstration suggests that FNC analysis—and, more generally, the subnetwork decomposition paradigm—may be a powerful tool for investigating the rich dynamics of large-scale, distributed cortical networks recruited during a wide range of human cognitive tasks.
Supplementary Material
Supplementary Figure 1. Overview of the signal processing pipeline used for the computation of functional network components (FNCs). Raw ECoG data was first re-referenced to eliminate common mode artifacts, filtered to isolate high gamma (70–110 Hz) activity and avoid line noise, and aligned to experimental cues. We then performed spectral estimation on the aligned data, averaging the log-power across high gamma frequencies for each time window. These spectral features were then normalized to the baseline mean and standard deviation, resulting in an estimate of the event-related modulation of high gamma activity; shown are the single-trial results (grey traces) and average across trials (white trace) for a representative electrode from subject 5 during the picture naming task. We then performed the tvDBN interaction estimation procedure on those channels that showed significant high gamma modulation during the task, and normalized the results relative to the mean and standard deviation of the baseline interaction values for each site-site pair, giving an estimate of the event-related interaction between the sites. We concatenated all trials of interaction data, and performed “signed PCA” on the result; this yielded the (spatial) loadings for each site-site interaction in each signed PC, as well as the (temporal) scores of each signed PC across trials. The figure depicts the scores in single trials (grey traces) and averaged across trials (white trace) for one signed PC from subject 5 during picture naming. We then statistically tested these signed PCs: temporally, for significant event-related modulation; and spatially, for significant participation of site-site interactions in the PC. The end results, functional network components, are characterized by the loadings of the significantly involved site-site interactions, as well as the time series of the FNC’s event-related modulation.
Supplementary Figure 2. Detailed view of the anatomical composition of all FNCs exhibiting common patterns described in the text from picture naming (left of vertical bar) and auditory word repetition (right). Symbols and colors are the same as in Figure 4; thicker edges represent stronger contributions to an FNC.
Supplementary Figure 3. Activation of the same FNCs shown in Supplementary Figure 2 in single trials, with darker colors corresponding to stronger activation. Trials are aligned to cue onset (vertical line), and have been sorted according to the subject’s response time (smaller black lines); trials without a response (present for subject 1 during picture naming and subject 3 during word repetition) are grouped at the bottom.
Highlights.
A method to identify functional subnetworks from ECoG connectivity is proposed.
Subnetworks simplify connectivity dynamics by grouping co-varying connections.
Subnetworks observed during language tasks are consistent with current models.
Acknowledgments
This study was supported by the National Institute of Neurological Disorders and Stroke (R01 NS40596, R01 NS091139).
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- Broca P. Sur le siège de la faculté du langage articulé. Bulletins de la Société d’Anthropologie de Paris. 1865;6(1):377–393. [Google Scholar]
- Ojemann G, Ojemann J, Lettich E, Berger M. Cortical language localization in left, dominant hemisphere: an electrical stimulation mapping investigation in 117 patients. Journal of neurosurgery. 1989;71(3):316–326. doi: 10.3171/jns.1989.71.3.0316. [DOI] [PubMed] [Google Scholar]
- Indefrey P, Levelt WJ. The new cognitive neurosciences. 2nd. MIT Press; 2000. The neural correlates of language production; pp. 845–865. [Google Scholar]
- Hickok G, Poeppel D. Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language. Cognition. 2004;92(1):67–99. doi: 10.1016/j.cognition.2003.10.011. [DOI] [PubMed] [Google Scholar]
- Indefrey P, Levelt WJ. The spatial and temporal signatures of word production components. Cognition. 2004;92(1):101–144. doi: 10.1016/j.cognition.2002.06.001. [DOI] [PubMed] [Google Scholar]
- Glasser MF, Rilling JK. DTI tractography of the human brain’s language pathways. Cerebral cortex. 2008;18(11):2471–2482. doi: 10.1093/cercor/bhn011. [DOI] [PubMed] [Google Scholar]
- Biswal B, Zerrin Yetkin F, Haughton VM, Hyde JS. Functional connectivity in the motor cortex of resting human brain using echo-planar mri. Magnetic resonance in medicine. 1995;34(4):537–541. doi: 10.1002/mrm.1910340409. [DOI] [PubMed] [Google Scholar]
- Manning JR, Jacobs J, Fried I, Kahana MJ. Broadband shifts in local field potential power spectra are correlated with single-neuron spiking in humans. The Journal of neuroscience. 2009;29(43):13613–13620. doi: 10.1523/JNEUROSCI.2041-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ray S, Crone NE, Niebur E, Franaszczuk PJ, Hsiao SS. Neural correlates of high-gamma oscillations (60–200 Hz) in macaque local field potentials and their potential implications in electrocorticography. The Journal of Neuroscience. 2008;28(45):11526–11536. doi: 10.1523/JNEUROSCI.2848-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miller KJ. Broadband spectral change: evidence for a macroscale correlate of population firing rate? The Journal of Neuroscience. 2010;30(19):6477–6479. doi: 10.1523/JNEUROSCI.6401-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flinker A, Chang EF, Kirsch HE, Barbaro NM, Crone NE, Knight RT. Single-trial speech suppression of auditory cortex activity in humans. The Journal of Neuroscience. 2010;30(49):16643–16650. doi: 10.1523/JNEUROSCI.1809-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston KJ, et al. Functional and effective connectivity in neuroimaging: a synthesis. Human brain mapping. 1994;2(1–2):56–78. [Google Scholar]
- Friston KJ. Functional and effective connectivity: a review. Brain connectivity. 2011;1(1):13–36. doi: 10.1089/brain.2011.0008. [DOI] [PubMed] [Google Scholar]
- Winterhalder M, Schelter B, Hesse W, Schwab K, Leistritz L, Klan D, Bauer R, Timmer J, Witte H. Comparison of linear signal processing techniques to infer directed interactions in multivariate neural systems. Signal processing. 2005;85(11):2137–2160. [Google Scholar]
- Indefrey P. The spatial and temporal signatures of word production components: a critical update. Frontiers in psychology. 2 doi: 10.3389/fpsyg.2011.00255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hickok G, Poeppel D. The cortical organization of speech processing. Nature Reviews Neuroscience. 2007;8(5):393–402. doi: 10.1038/nrn2113. [DOI] [PubMed] [Google Scholar]
- Bullmore E, Sporns O. Complex brain networks: graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience. 2009;10(3):186–198. doi: 10.1038/nrn2575. [DOI] [PubMed] [Google Scholar]
- Korzeniewska A, Crainiceanu CM, Kuś R, Franaszczuk PJ, Crone NE. Dynamics of event-related causality in brain electrical activity. Human brain mapping. 2008;29(10):1170–1192. doi: 10.1002/hbm.20458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Korzeniewska A, Franaszczuk PJ, Crainiceanu CM, Kuś R, Crone NE. Dynamics of large-scale cortical interactions at high gamma frequencies during word production: event related causality (ERC) analysis of human electrocorticography (ECoG) Neuroimage. 2011;56(4):2218–2237. doi: 10.1016/j.neuroimage.2011.03.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benz H, Collard M, Tsimpouris C, Acharya S, Crone N, Thakor N, Bezerianos A. Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE. IEEE; 2012a. Directed causality of the human electrocorticogram during dexterous movement; pp. 1872–1875. [DOI] [PubMed] [Google Scholar]
- Duncan JS, Papademetris X, Yang J, Jackowski M, Zeng X, Staib LH. Geometric strategies for neuroanatomic analysis from MRI. Neuroimage. 2004;23:S34–S45. doi: 10.1016/j.neuroimage.2004.07.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Papademetris X, Jackowski M, Rajeevan N, Constable RT, Staib L. BioImage Suite: An integrated medical image analysis suite. The Insight Journal. 2005;1:3. [PMC free article] [PubMed] [Google Scholar]
- Crone N, Hao L, Hart J, Boatman D, Lesser R, Irizarry R, Gordon B. Electrocorticographic gamma activity during word production in spoken and sign language. Neurology. 2001;57(11):2045–2053. doi: 10.1212/wnl.57.11.2045. [DOI] [PubMed] [Google Scholar]
- Rodriguez Merzagora A, Coffey TJ, Sperling MR, Sharan A, Litt B, Baltuch G, Jacobs J. Repeated stimuli elicit diminished high-gamma electrocorticographic responses. NeuroImage. 2014;85:844–852. doi: 10.1016/j.neuroimage.2013.07.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomson DJ. Spectrum estimation and harmonic analysis. Proceedings of the IEEE. 1982;70(9):1055–1096. [Google Scholar]
- Babadi B, Brown EN. A review of multitaper spectral analysis, Biomedical Engineering. IEEE Transactions on. 2014;61(5):1555–1564. doi: 10.1109/TBME.2014.2311996. [DOI] [PubMed] [Google Scholar]
- Bokil H, Andrews P, Kulkarni JE, Mehta S, Mitra PP. Chronux: a platform for analyzing neural signals. Journal of neuroscience methods. 2010;192(1):146–151. doi: 10.1016/j.jneumeth.2010.06.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gasser T, Bächer P, Möcks J. Transformations towards the normal distribution of broad band spectral parameters of the EEG. Electroencephalography and clinical neurophysiology. 1982;53(1):119–124. doi: 10.1016/0013-4694(82)90112-2. [DOI] [PubMed] [Google Scholar]
- Song L, Kolar M, Xing EP. Time-varying dynamic Bayesian networks. Advances in Neural Information Processing Systems. 2009:1732–1740. [Google Scholar]
- Phillips DL. A technique for the numerical solution of certain integral equations of the first kind. Journal of the ACM (JACM) 1962;9(1):84–97. [Google Scholar]
- Tikhonov A. Solution of incorrectly formulated problems and the regularization method. Soviet Mathematics Doklady. 1963;5:1035–1038. [Google Scholar]
- Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B (Methodological) 1996:267–288. [Google Scholar]
- Benz HL, Zhang H, Bezerianos A, Acharya S, Crone NE, Zheng X, Thakor NV. Connectivity analysis as a novel approach to motor decoding for prosthesis control. Neural Systems and Rehabilitation Engineering, IEEE Transactions on. 2012b;20(2):143–152. doi: 10.1109/TNSRE.2011.2175309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B (Methodological) 1995:289–300. [Google Scholar]
- Hotelling H. Analysis of a complex of statistical variables into principal components. Journal of educational psychology. 1933;24(6):417. [Google Scholar]
- Lee DD, Seung HS. Learning the parts of objects by non-negative matrix factorization. Nature. 1999;401(6755):788–791. doi: 10.1038/44565. [DOI] [PubMed] [Google Scholar]
- Lee DD, Seung HS. Algorithms for non-negative matrix factorization, in: Advances in neural information processing systems. 2001:556–562. [Google Scholar]
- Moore CJ, Price CJ. Three distinct ventral occipitotemporal regions for reading and object naming. Neuroimage. 1999;10(2):181–192. doi: 10.1006/nimg.1999.0450. [DOI] [PubMed] [Google Scholar]
- Allison T, Puce A, Spencer DD, McCarthy G. Electrophysiological studies of human face perception. I: Potentials generated in occipitotemporal cortex by face and non-face stimuli. Cerebral cortex. 1999;9(5):415–430. doi: 10.1093/cercor/9.5.415. [DOI] [PubMed] [Google Scholar]
- Celsis P, Boulanouar K, Doyon B, Ranjeva J, Berry I, Nespoulous J, Chollet F. Differential fMRI responses in the left posterior superior temporal gyrus and left supramarginal gyrus to habituation and change detection in syllables and tones. Neuroimage. 1999;9(1):135–144. doi: 10.1006/nimg.1998.0389. [DOI] [PubMed] [Google Scholar]
- Dehaene-Lambertz G, Pallier C, Serniclaes W, Sprenger-Charolles L, Jobert A, Dehaene S. Neural correlates of switching from auditory to speech perception. Neuroimage. 2005;24(1):21–33. doi: 10.1016/j.neuroimage.2004.09.039. [DOI] [PubMed] [Google Scholar]
- Flinker A, Korzeniewska A, Shestyuk AY, Franaszczuk PJ, Dronkers NF, Knight RT, Crone NE. Redefining the role of Broca’s area in speech. Proceedings of the National Academy of Sciences. 2015;112(9):2871–2875. doi: 10.1073/pnas.1414491112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Price C, Wise R, Ramsay S, Friston K, Howard D, Patterson K, Frackowiak R. Regional response differences within the human auditory cortex when listening to words. Neuroscience letters. 1992;146(2):179–182. doi: 10.1016/0304-3940(92)90072-f. [DOI] [PubMed] [Google Scholar]
- Steinschneider M, Nourski KV, Kawasaki H, Oya H, Brugge JF, Howard MA. Intracranial study of speech-elicited activity on the human posterolateral superior temporal gyrus. Cerebral Cortex. 2011;21(10):2332–2347. doi: 10.1093/cercor/bhr014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leonard MK, Chang EF. Dynamic speech representations in the human temporal lobe. Trends in cognitive sciences. 2014;18(9):472–479. doi: 10.1016/j.tics.2014.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Towle VL, Yoon HA, Castelle M, Edgar JC, Biassou NM, Frim DM, Spire J-P, Kohrman MH. ECoG gamma activity during a language task: differentiating expressive and receptive speech areas. Brain. 2008;131(8):2013–2027. doi: 10.1093/brain/awn147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chang EF, Niziolek CA, Knight RT, Nagarajan SS, Houde JF. Human cortical sensorimotor network underlying feedback control of vocal pitch. Proceedings of the National Academy of Sciences. 2013;110(7):2653–2658. doi: 10.1073/pnas.1216827110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kort NS, Nagarajan SS, Houde JF. A bilateral cortical network responds to pitch perturbations in speech feedback. Neuroimage. 2014;86:525–535. doi: 10.1016/j.neuroimage.2013.09.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eliades SJ, Wang X. Sensory-motor interaction in the primate auditory cortex during self-initiated vocalizations. Journal of neurophysiology. 2003;89(4):2194–2207. doi: 10.1152/jn.00627.2002. [DOI] [PubMed] [Google Scholar]
- Houde JF, Chang EF. The cortical computations underlying feedback control in vocal production. Current opinion in neurobiology. 2015;33:174–181. doi: 10.1016/j.conb.2015.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hickok G. The cortical organization of speech processing: Feedback control and predictive coding the context of a dual-stream model. Journal of communication disorders. 2012;45(6):393–402. doi: 10.1016/j.jcomdis.2012.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hickok G, Buchsbaum BR, Humphries C, Muftuler T. Auditory–motor interaction revealed by fMRI: speech, music, and working memory in area Spt. Cognitive Neuroscience, Journal of. 2003;15(5):673–682. doi: 10.1162/089892903322307393. [DOI] [PubMed] [Google Scholar]
- Hickok G, Okada K, Serences JT. Area Spt in the human planum temporale supports sensory-motor integration for speech processing. Journal of Neurophysiology. 2009;101(5):2725–2732. doi: 10.1152/jn.91099.2008. [DOI] [PubMed] [Google Scholar]
- Baccalá LA, Sameshima K. Partial directed coherence: a new concept in neural structure determination. Biological cybernetics. 2001;84(6):463–474. doi: 10.1007/PL00007990. [DOI] [PubMed] [Google Scholar]
- Kamiński M, Ding M, Truccolo WA, Bressler SL. Evaluating causal relations in neural systems: Granger causality, directed transfer function and statistical assessment of significance. Biological cybernetics. 2001;85(2):145–157. doi: 10.1007/s004220000235. [DOI] [PubMed] [Google Scholar]
- Granger CW. Some recent development in a concept of causality. Journal of econometrics. 1988;39(1):199–211. [Google Scholar]
- Ojemann GA. The neurobiology of language and verbal memory: observations from awake neurosurgery. International Journal of Psychophysiology. 2003;48(2):141–146. doi: 10.1016/s0167-8760(03)00051-5. [DOI] [PubMed] [Google Scholar]
- Lesser R, Lüders H, Klem G, Dinner D, Morris H, Hahn J. Cortical after discharge and functional response thresholds: results of extraoperative testing. Epilepsia. 1984;25(5):615–621. doi: 10.1111/j.1528-1157.1984.tb03471.x. [DOI] [PubMed] [Google Scholar]
- Lesser R, Lüders H, Klem G, Dinner D, Morris H, Hahn J. Ipsilateral trigeminal sensory responses to cortical stimulation by subdural electrodes. Neurology. 1985;35(12):1760–1760. doi: 10.1212/wnl.35.12.1760. [DOI] [PubMed] [Google Scholar]
- Blume WT, Jones DC, Pathak P. Properties of after-discharges from cortical electrical stimulation in focal epilepsies. Clinical neurophysiology. 2004;115(4):982–989. doi: 10.1016/j.clinph.2003.11.023. [DOI] [PubMed] [Google Scholar]
- Pouratian N, Cannestra AF, Bookheimer SY, Martin NA, Toga AW. Variability of intraoperative electrocortical stimulation mapping parameters across and within individuals. Journal of neurosurgery. 2004;101(3):458–466. doi: 10.3171/jns.2004.101.3.0458. [DOI] [PubMed] [Google Scholar]
- Hamberger MJ. Cortical language mapping in epilepsy: a critical review. Neuropsychology review. 2007;17(4):477–489. doi: 10.1007/s11065-007-9046-6. [DOI] [PubMed] [Google Scholar]
- Lachaux JP, Jerbi K, Bertrand O, Minotti L, Hoffmann D, Schoendorff B, Kahane P. A blueprint for real-time functional mapping via human intracranial recordings. PLoS One. 2007;2(10):e1094. doi: 10.1371/journal.pone.0001094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miller KJ, Shenoy P, Miller JW, Rao RP, Ojemann JG, et al. Real-time functional brain mapping using electrocorticography. Neuroimage. 2007;37(2):504–507. doi: 10.1016/j.neuroimage.2007.05.029. [DOI] [PubMed] [Google Scholar]
- Schalk G, Leuthardt EC, Brunner P, Ojemann JG, Gerhardt LA, Wolpaw JR. Real-time detection of event-related brain activity. Neuroimage. 2008;43(2):245–249. doi: 10.1016/j.neuroimage.2008.07.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheung C, Chang EF. Real-time, time–frequency mapping of event-related cortical activation. Journal of neural engineering. 2012;9(4):046018. doi: 10.1088/1741-2560/9/4/046018. [DOI] [PubMed] [Google Scholar]
- Qian T, Zhou W, Ling Z, Gao S, Liu H, Hong B. Fast presurgical functional mapping using task-related intracranial high gamma activity: Laboratory investigation. Journal of neurosurgery. 2013;119(1):26. doi: 10.3171/2013.2.JNS12843. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu M, Wisneski K, Schalk G, Sharma M, Roland J, Breshears J, Gaona C, Leuthardt EC. Electrocorticographic frequency alteration mapping for extraoperative localization of speech cortex. Neurosurgery. 2010;66(2):E407–E409. doi: 10.1227/01.NEU.0000345352.13696.6F. [DOI] [PubMed] [Google Scholar]
- Cervenka MC, Boatman-Reich D, Ward J, Franaszczuk PJ, Crone N. Language mapping in multilingual patients: electrocorticography and cortical stimulation during naming. Frontiers in human neuroscience. 2011;5:13. doi: 10.3389/fnhum.2011.00013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruescher J, Iljina O, Altenmüller DM, Aertsen A, Schulze-Bonhage A, Ball T. Somatotopic mapping of natural upper-and lower-extremity movements and speech production with high gamma electrocorticography. NeuroImage. 2013;81:164–177. doi: 10.1016/j.neuroimage.2013.04.102. [DOI] [PubMed] [Google Scholar]
- Bauer PR, Vansteensel MJ, Bleichner MG, Hermes D, Ferrier CH, Aarnoutse EJ, Ramsey NF. Mismatch between electrocortical stimulation and electrocorticography frequency mapping of language. Brain stimulation. 2013;6(4):524–531. doi: 10.1016/j.brs.2013.01.001. [DOI] [PubMed] [Google Scholar]
- Newman G, Fifer M, Benz H, Crone N, Thakor N. Eigenvector centrality reveals the time course of task-specific electrode connectivity in human ECoG. Neural Engineering (NER), 2015 7th International IEEE/EMBS Conference on, IEEE. 2015:336–339. [Google Scholar]
- Burns SP, Sritharan D, Jouny C, Bergey G, Crone N, Anderson WS, Sarma SV. Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE. IEEE; 2012. A network analysis of the dynamics of seizure; pp. 4684–4687. [DOI] [PubMed] [Google Scholar]
- Ortega GJ, Sola RG, Pastor J. Complex network analysis of human ECoG data. Neuroscience letters. 2008;447(2):129–133. doi: 10.1016/j.neulet.2008.09.080. [DOI] [PubMed] [Google Scholar]
- Wilke C, Worrell G, He B. Graph analysis of epileptogenic networks in human partial epilepsy. Epilepsia. 2011;52(1):84–93. doi: 10.1111/j.1528-1167.2010.02785.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van den Heuvel MP, Sporns O. Rich-club organization of the human connectome. The Journal of neuroscience. 2011;31(44):15775–15786. doi: 10.1523/JNEUROSCI.3539-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Collin G, Sporns O, Mandl RC, van den Heuvel MP. Structural and functional aspects relating to cost and benefit of rich club organization in the human cerebral cortex. Cerebral Cortex. 2013 doi: 10.1093/cercor/bht064. bht064. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary Figure 1. Overview of the signal processing pipeline used for the computation of functional network components (FNCs). Raw ECoG data was first re-referenced to eliminate common mode artifacts, filtered to isolate high gamma (70–110 Hz) activity and avoid line noise, and aligned to experimental cues. We then performed spectral estimation on the aligned data, averaging the log-power across high gamma frequencies for each time window. These spectral features were then normalized to the baseline mean and standard deviation, resulting in an estimate of the event-related modulation of high gamma activity; shown are the single-trial results (grey traces) and average across trials (white trace) for a representative electrode from subject 5 during the picture naming task. We then performed the tvDBN interaction estimation procedure on those channels that showed significant high gamma modulation during the task, and normalized the results relative to the mean and standard deviation of the baseline interaction values for each site-site pair, giving an estimate of the event-related interaction between the sites. We concatenated all trials of interaction data, and performed “signed PCA” on the result; this yielded the (spatial) loadings for each site-site interaction in each signed PC, as well as the (temporal) scores of each signed PC across trials. The figure depicts the scores in single trials (grey traces) and averaged across trials (white trace) for one signed PC from subject 5 during picture naming. We then statistically tested these signed PCs: temporally, for significant event-related modulation; and spatially, for significant participation of site-site interactions in the PC. The end results, functional network components, are characterized by the loadings of the significantly involved site-site interactions, as well as the time series of the FNC’s event-related modulation.
Supplementary Figure 2. Detailed view of the anatomical composition of all FNCs exhibiting common patterns described in the text from picture naming (left of vertical bar) and auditory word repetition (right). Symbols and colors are the same as in Figure 4; thicker edges represent stronger contributions to an FNC.
Supplementary Figure 3. Activation of the same FNCs shown in Supplementary Figure 2 in single trials, with darker colors corresponding to stronger activation. Trials are aligned to cue onset (vertical line), and have been sorted according to the subject’s response time (smaller black lines); trials without a response (present for subject 1 during picture naming and subject 3 during word repetition) are grouped at the bottom.