Abstract.
Understanding how cortical networks interact in response to task demands is important both for providing insight into the brain’s processing architecture and for managing neurological diseases and mental disorders. High-density diffuse optical tomography (HD-DOT) is a neuroimaging technique that offers the significant advantages of having a naturalistic, acoustically controllable environment and being compatible with metal implants, neither of which is possible with functional magnetic resonance imaging. We used HD-DOT to study the effective connectivity and assess the modulatory effects of speech intelligibility and syntactic complexity on functional connections within the cortical speech network. To accomplish this, we extend the use of a generalized psychophysiological interaction (PPI) analysis framework. In particular, we apply PPI methods to event-related HD-DOT recordings of cortical oxyhemoglobin activity during auditory sentence processing. We evaluate multiple approaches for selecting cortical regions of interest and for modeling interactions among these regions. Our results show that using subject-based regions has minimal effect on group-level connectivity maps. We also demonstrate that incorporating an interaction model based on estimated neural activity results in significantly stronger effective connectivity. Taken together our findings support the use of HD-DOT with PPI methods for noninvasively studying task-related modulations of functional connectivity.
Keywords: diffuse optical tomography, effective connectivity, psychophysiological interaction, speech comprehension
1. Introduction
Functional neuroimaging has provided substantial insight into how distributed dynamic neural processes are related in space and time. Collections of brain regions implicated in task-based or resting-state studies constitute functionally integrated systems of neural networks with properties that have been shown to be remarkably consistent across healthy participants.1–3 Importantly, disruptions or alterations of these functional networks have been shown to correlate with the stage of multiple neurodevelopmental and neurodegenerative conditions such as Autism and Alzheimer’s diseases.4–8 Mapping functional brain networks can potentially aid in early prediction of these conditions and could facilitate the optimization of interventions and tracking of treatment efficacy. However, technical limitations such as a fixed scanner, contraindication with metal implants, loud scanner noise, and emotional distress associated with the fear of small spaces limit the clinical and translational applications of functional magnetic resonance imaging (fMRI) for many types of diagnostic and prognostic studies.
An emerging optical neuroimaging technique, high-density diffuse optical tomography (HD-DOT), has the potential to be used as a surrogate to fMRI in many circumstances. HD-DOT provides a more naturalistic imaging setting where the acoustic environment is highly controllable. It is metal-compatible, cap-based imaging that samples the brain’s hemodynamic response with a relatively high temporal resolution [compared to time course of the hemodynamic response function (HRF)] and provides fMRI-comparable anatomical localization at the gyral level.9 To date, DOT studies have been mainly focused on investigating functional segregation (i.e., activity in specific brain regions related to a particular task)10–12 or evaluating resting state functional connectivity (i.e., correlations among intrinsic activities of remote brain regions).13,14 Excluding topographic studies using functional near-infrared spectroscopy,15,16 no previous studies have used optical neuroimaging to map effective connectivity within the brain (i.e., the influence that one brain region exerts over another during a task performance17) (Note that functional connectivity is a statistical relationship between two time series, whereas effective connectivity refers to the influence one system exerts over another, i.e., it implies a directionality to the relationship; see Ref. 18).
In the fMRI literature, psychophysiological interaction (PPI) analysis has been among the most promising techniques for examining effective connectivity.19–21 PPI analyses provide estimates of effective connectivity patterns by modeling activity in brain regions in terms of the interactions among the input from a seed brain region of interest (ROI) (physiological component) and the demand imposed by a cognitive process (psychological component). In this study, we use a generalized PPI (gPPI) technique19 to investigate functional integration during speech comprehension. We examine different strategies for selecting seeds and modeling interregional connections of HD-DOT data.
One challenge facing all effective connectivity studies is the selection of the seed ROIs. Methods for selecting seeds vary across studies, and currently there is no established standard approach for seed selection. Seeds can be defined in at least four different ways:
-
1.
Using anatomical landmarks or coordinates, as when there is a strong hypothesis about a particular anatomical region,11–13
-
2.
Using regions with the strongest task-dependent response in group results,22–24
-
3.
Using peaks from each individual’s task-dependent response, which may help account for interindividual differences in functional anatomy,25 or
- 4.
In the current study, we investigate the degree to which different seed selection methods impact estimated effective connectivity.
Because interactions in the brain occur at the neural level (rather than the hemodynamic level), an accurate model of neural interactions requires the neural signal, whereas HD-DOT provides only an indirect measure of the neural activity (as does the blood-oxygenation level-dependent signal measured with fMRI). Prior applications of PPI in fMRI data have suggested calculating interactions on the estimated neural activity based on a deconvolution approach.21 Here, we examined whether an interaction model based on deconvolution of a canonical HRF from the hemodynamic activity at the seed location can increase the statistical significance of measures of effective connectivity relative to a no deconvolution routine (where interactions are simply modeled at hemodynamic level).
To examine these methodological decisions and explore the use of PPI analyses in HD-DOT, we used data from an auditory sentence comprehension task. We selected speech comprehension because of its distributed and hierarchical organization in the brain that requires interaction among discrete cortical regions, it is a behavior that is frequently of interest in clinical populations,29–31 and it is well suited for optical neuroimaging.32 Using sentences with different levels of syntactic complexity, we examine how demands made by syntactic processes are manifested in the patterns of interregional connections. We hypothesized that processing sentences with increasing levels of linguistic challenge would result in increased effective connectivity within the speech network.
2. Materials and Methods
Herein are new analyses of data that were presented in a previous paper.12
2.1. Participants
Participants were 10 healthy right-handed native English speakers (ages 20 to 30 years, , , 6 female). All had normal hearing by self-report and no history of neurological or psychiatric disorders. All gave written informed consent under a process approved by the Human Research Protection Office by the Washington University School of Medicine.
2.2. HD-DOT Imaging System and Data Acquisition
We used a high-density, relatively large field-of-view, continuous wave, diffuse optical tomography system for data acquisition. The instrument consists of a flexible imaging cap that holds 96 source and 92 detector locations. Using fiber optic bundles, source locations are illuminated by continuous-wave light-emitting diodes. Light emitted from the head and collected by optical fibers at detector locations is detected by avalanche photodiodes (Hamamatsu C5460-01). Hemoglobin spectroscopy is enabled using two wavelengths (750 and 850 nm). The detected light signal is digitized by dedicated 24-bit analog-to-digital converters (MOTU HD192),33 which have high dynamic range () and low crosstalk (). The dynamic range allows the detection of light from multiple source-detector distances (e.g., first through fourth-nearest neighbors are separated by 13, 30, 39, and 47 mm, respectively). This array provides more than 1200 usable source-detector measurements at a 10-Hz full-field frame rate. Full details on this system have been previously described.13
Subjects were seated in an adjustable chair in a sound-isolated room facing a 19-in. LCD screen (located 1 m from subjects at approximately eye level) and two stereo speakers (each located at 1.5 m from subjects at approximately ear level). Subjects held a keyboard on their lap. The HD-DOT cap was put on the subject’s head covering occipital, temporal, and parts of parietal, motor, and frontal cortices (Fig. 1). Once the cap was placed comfortably with a good signal-to-noise ratio (the average signal for all the optodes across the cap was within two orders of magnitude and the temporal standard deviation of the signal was less than 7.5%), the exact placement of the cap with respect to the locations of anatomical landmarks on the head and face of the subject (e.g., the distance between optode location in one of the outer four corners of the imaging array and the nasion) was noted to locate the cap relative to the subject’s head, later used to generate a subject-specific finite element light model.9
2.3. Stimulus Protocol
We used 60 meaningful six-word sentences each with a subject-relative embedded clause, and reworded them to vary (1) their syntactic complexity (turning subject-relative into object-relative construction) and (2) the gender of the character performing the action: male (e.g., king and brother) or female (e.g., queen and sister).34 These manipulations resulted in 120 sentences with subject-relative center-embedded clauses and 120 sentences with object-relative center-embedded clauses, split evenly between male and female characters performing the action. The mean length of auditory stimuli was (range: 1.32 to 1.89 s). These stimuli are available in Ref. 35.
Sentences with object-relative clauses are more difficult to comprehend compared to those containing subject-relative clauses,36,37 typically resulting in longer response times,38 more errors, or equivalent performance but through increased neural activity.12 During the experiment, each of the 240 sentences was presented a single time and participants were asked to indicate the gender of the character performing the action (male or female) using a button-press response.
Additionally, we included unintelligible speech trials (noise) as a control condition. The noise stimuli consisted of one-channel noise-vocoded speech created by modulating white noise (lowpass filtered at 8 kHz) with the amplitude envelope of the sentence (lowpass filtered at 30 Hz). Noise vocoding removes much of the spectral detail from the sentence while retaining its temporal amplitude envelope.39 The code for noise vocoding (jp_vocode.m) is available in Ref. 40.
We presented stimuli using Psychophysics Toolbox 3,41 sending audio to the speakers via an external audio interface (M-Audio Fast Track Pro). We set the sound level at a comfortable listening level that did not change over the course of a session. Stimuli were presented in four separate runs, each of which contained 30 subject-relative (easy) sentences, 30 object-relative (complex) sentences, and 10 noise trials. These stimuli were presented in a pseudorandom order, with the order of conditions varied among runs but constant across subjects. Subjects were instructed to press a key with their left index finger if the person performing the action was female and a separate key with their right index finger if the person performing the action was male. A central fixation cross was displayed at the center of a gray screen; after each key press the cross was changed to an “” to inform the subject that a response was received. This sign was on the screen during part of interstimulus interval (ISI). ISIs were pseudorandomly distributed between 2 and 10 s; the subject’s reaction time at each trial was considered as a part of the ISI of that trial. If the reaction time was over the predetermined ISI for a given trial, the next stimulus was presented immediately following a subject’s key press. One second prior to the stimulus trial the “x” was changed back to a cross to prepare the subject for listening to the next stimulus. Stimuli were presented in a pseudorandom order.
Prior to the experiment, subjects were given a short practice session containing 24 trials (eight trials per condition) to explain the instructions and ensure they were performing the task correctly. None of these sentences was presented in the actual experiment. Participants’ average accuracy was greater than 97% for both subject-relative and object-relative sentences and did not differ between conditions.
2.4. Data Preprocessing
Volumetric measurements of hemodynamic activity were obtained from standard HD-DOT preprocessing of raw detector data.42 HD-DOT preprocessing includes denoising by superficial signal regression and bandpass filtering (0.02 to 0.5 Hz) of ratiometric changes in light levels, followed by reconstruction of hemodynamic-related changes in the absorption coefficients within the imaging field of view (FOV) and spectroscopy to estimate changes in the concentration of regional oxyhemoglobin and deoxyhemoglobin ( and , respectively). Hemodynamic signals were then downsampled to 1 Hz. For image reconstruction, we used individuals’ head models (derived from participants’ T1- and T2-weighted MRIs) to generate subject-specific forward light propagation models.9 For each participant, the reconstructed image was then normalized to Montreal Neurological Institute (MNI) 152 atlas space using a linear affine transformation, concurrently resampling to a voxel size of . Due to the cap fitting on a variety of head sizes and shapes, the FOV measured within each subject varied across participants. For this study, the group FOV includes only voxels sampled with acceptable sensitivity in all subjects (Fig. 1, red area on the cortex). To find these voxels, we calculated a flat field reconstruction [reconstruction of an image where the absorbance change is assumed to be equal throughout the FOV (i.e., all the voxels)43] and included only voxels whose reconstructed value was within two orders of magnitude of the maximum. The group FOV contains of head volume, covering occipital cortex and aspects of parietal, temporal, motor, and prefrontal cortices.
For the cortical surface representation of results, volumetric results are mapped onto the cortical gray matter midthickness surface of MNI152 atlas extracted using FreeSurfer software (version 5.1.0, Martinos Center for Biomedical Imaging, Massachusetts General Hospital).44
2.5. Timeseries Analysis
Here, we focused our analysis on oxyhemoglobin signal because (1) signal offers a higher contrast-to-noise ratio compared to and (2) we found that response maps based on signal are generally consistent with results.12,42 Prior to effective connectivity analysis, a standard general linear model (GLM) approach was used to localize the responses to each stimulus. Five conditions were included in the design matrix: subject-relative sentences, object-relative sentences, noise trials, left button presses, and right button presses. Auditory stimuli were modeled as events with 2 s duration and button presses as events with 0 s duration. Each event onset was convolved with a canonical HRF to produce a task-dependent reponse (TDR) model (predicted hemodynamic response). The canonical HRF was a double-gamma function with temporal properties matching the general features of the oxyhemoglobin response (averaged across all conditions and subjects) in primary auditory cortex: delay time of 2 s, time to peak of 7 s, and undershoot at 17 s. For each subject, we combined data for all four runs using a fixed effects analysis. Linear contrast maps including “sentences > baseline” and “complex sentences > easy sentences” were generated and statistical -value maps for each contrast were calculated. We assessed group-level response maps using random effects analyses of these contrast maps. Group maps were thresholded at (voxelwise) and corrected for multiple comparisons using a nonstationary cluster analysis technique at .45,46
2.6. Effective Connectivity Analysis: Generalized Psychophysiological Interaction Analysis
To obtain effective connectivity maps, we used a linear model with three types of regressors: one regressor per condition to model the response to the main effect of that condition (equivalent to the regressors used in GLM for estimating TDR maps), one regressor to model task-independent connectivity (TIC) to a selected seed region (equivalent to the resting-state connectivity), and one regressor per condition to model the task-dependent connectivity (TDC) (i.e., effective connectivity) to the seed region during that condition (the PPI term). A TDC regressor models hemodynamic activity in terms of interaction between the task and the influence of the seed location. Hence, significant effects for a TDC regressor indicate brain regions in which the influence of (connectivity to) the seed region is modulated by the task. The TDR model () was built by convolving the time trace of each stimulus onset (noise, subject-relative, and object-relative sentences) with the canonical HRF. The hemodynamic activity (ΔHbO signal) from each seed region () was used as a regressor for TIC. We used the PPI analysis method19,20 to model the TDC ()
(1) |
is the hemodynamic activity at time and location and , , and are fitting parameters for each regressor for the location , and is the residual signal at that location. Further details about how we built and are found in the following sections.
2.7. Effective Connectivity Analysis: Selecting Seeds
The first step in conducting a PPI analysis is defining one or more seed ROIs. We selected 10 nonoverlapping ROIs based on their known roles in speech processing including posterior and anterior parts of temporal cortex, dorsal and ventral prefrontal cortex, and premotor and inferior parietal cortex.12,34,47 The locations of seeds within these ROIs were specified on the basis of significantly activated (at voxelwise uncorrected) local maxima on group maps of auditory speech processing (sentences > noise contrast) and syntactic complexity processing (complex sentences > easy sentences contrast) (Fig. 2). Coordinates of these local maxima are given in Table 1. Seeds were defined as 6-mm-diameter spheres centered at these points. We refer to these seeds as canonical seeds because they are based on the group results rather than individual results.
Table 1.
Region | Center coordinates | |||
---|---|---|---|---|
Left | ||||
Dorsal premotor | 3.0 | 45.5 | ||
Dorsal middle frontal | 24.0 | 39.0 | ||
Dorsal inferior parietal | 45.0 | |||
Lateral superior temporal | ||||
Posterior middle temporal | 3.0 | |||
Anterior superior temporal | 9.0 | |||
Ventral inferior prefrontal | 39.0 | |||
Right | ||||
Lateral superior temporal | 67.5 | 0.0 | ||
Posterior middle temporal | 64.5 | 3.0 | ||
Ventral inferior prefrontal | 52.5 | 27.0 |
Variability in response maps across individuals motivated us to investigate the use of subject-specific seeds in the effective connectivity analysis. To identify these seeds, all the local peaks in individual maps were found and among them the closest peak to each canonical seed was located. If an individual participant’s peak was within 18-mm Euclidean distance of the canonical seed, the subject-specific seed was used; otherwise, the canonical seed location was used. 18-mm threshold is selected to be a couple of voxels larger than the FWHM of estimated point spread function for our imaging array (which is 48). We chose this approach to ensure that every participant had a seed region, and that there was a reasonable amount of spatial consistency across participants.
2.8. Effective Connectivity Analysis: Generating Task-Dependent Connectivity Regressors
In a PPI analysis, a task-dependent connectivity (TDC) regressor is constructed by multiplying the signal at a seed region (physiological component) by a vector representing the task condition (psychological component). In this study, we took two approaches to model the TDC regressors.
In the first approach, we modeled interactions at the hemodynamic signal level, so the TDC regressors were built by simply multiplying the signal at the seed location to the TDR model [Fig. 3(a)]
(2) |
sign denotes a componentwise multiplication.
In the second approach, we modeled the interactions in a faster frequency band (closer to a neural level) with subsequent convolution by an HRF to build TDC regressors. Because HD-DOT measures of brain activity are filtered by a slow hemodynamic response, estimating interactions at the neural level can be approached by deconvolution of HRF from hemodynamic measures, noise permitting. We used a parametric empirical Bayes formulation method21 along with our data-driven canonical HRF (as explained earlier) to estimate faster underlying neural activity. We then constructed by multiplying the estimated neural activity () into the task vector (), and then reconvolved the result with the same HRF used in the deconvolution step [Fig. 3(b)]. The noise introduced by the initial deconvolution step is thus smoothed by the final convolution. The task vector for each stimulus in this study was a vector of ones (epochs of 2 s for each stimulus trial) and zeros (when the stimulus is off).
(3) |
2.9. Effective Connectivity Analysis with a Permuted Signal
In order to verify the physiological relevance of our effective connectivity results, we performed a second set of PPI analyses with a signal for the physiological component that was not related to the current task-based neural activity. To hold all the other characteristics of a real signal constant—including spectral characteristics of noise (with different origins)—we used the seed signal recorded in another run (the stimuli order was pseudorandomly altered from run to run). We used a two sample chi-square test to determine whether the connectivity results based on permuted (physiologically irrelevant) signal follow the same distribution function as those based on real (physiologically relevant) signal or they are drawn from a different distribution function.
3. Results
To investigate response variability across subjects, we combined four runs of data collected per subject using fixed effects analyses and generated individual response maps of sentence comprehension and syntactic complexity processing. These maps were thresholded at (uncorrected) and converted to binary maps (1 if the voxel -statistic was above 1.6 and 0 otherwise). As shown in Fig. 4, the overlap between individuals’ binary maps is relatively poor at some of the canonical seed coordinates: not everyone showed a significant response at those locations. When selecting subject-specific seeds, local peaks in individuals’ maps in the vicinity of canonical seeds were identified. The average distance between these peaks and the canonical seeds, as well as the number of subjects that had a local maximum within 18 mm, are given in Table 2.
Table 2.
Region | Euclidean distance (mm) | # subjects | |
---|---|---|---|
Left | |||
Dorsal premotor | 14.1 | 8 | |
Dorsal middle frontal | 17.1 | 6 | |
Dorsal inferior parietal | 12.1 | 9 | |
Lateral superior temporal | 12.0 | 7 | |
Posterior middle temporal | 14.7 | 7 | |
Anterior superior temporal | 25.2 | 2 | |
Ventral inferior prefrontal | 22.5 | 4 | |
Right | |||
Lateral superior temporal | 15.8 | 5 | |
Posterior middle temporal | 18.4 | 5 | |
Ventral inferior prefrontal | 23.1 | 4 |
In order to assess the effect of using subject-specific seeds versus canonical seeds, we compared TIC maps generated with both sets of seeds. The TIC regressor () was built by averaging signal within the seed region (i.e., across voxels intersecting a 6-mm-radius sphere centered at the seed location). To summerize the resulting connectivity patterns, we calculated a seed-to-ROI connectivity matrix. For each TIC map, we averaged the -values from thresholded group maps within cubes of 9 mm per side centered at each canonical seed coordinate; these measures made a row in the connectivity matrix [Fig. 5(a)]. Note that these connectivity matrices are not symmetrical because they are not presenting direct seed-to-seed correlation coefficients but rather results of different gPPI analyses. The seed-to-ROI connectivity matrix with both seed sets (canonical and subject-specific) looked relatively similar (correlation coefficient ) [Fig. 5(b)]. TIC maps for left dorsal inferior parietal gyrus (dIPG) and left dorsal premotor gyrus (dpreMG) seeds that had the highest number of subject-specific seeds (9 and 8, respectively) are shown in Fig. 5(c).
Next, we deconvolved the canonical HRF from seed signals to incorporate the estimated neural signals in effective connectivity analysis and then we looked at how task demands influence the effective connectivity within the speech network. The effect of speech intelligibility on TDC patterns was investigated by comparing TDC maps for the sentence condition with those for the noise condition. Results showed that intelligibility significantly increased the effective connectivity between left frontal lobe and several cortical areas including both left and right posterior middle and lateral superior temporal gyri, and left anterior superior temporal and left dorsal inferior parietal gyri. In addition, intelligibility modulated the connections between left angular gyrus and left middle frontal gyrus (Fig. 6). Intelligibility did not have a significant effect on the connectivity between ventral inferior frontal gyri and the rest of the brain, nor did it affect the connectivity between left dpreMG with other regions in the brain.
We assessed the effect of syntactic complexity on interregional connections by statistically comparing TDC maps for the complex sentence condition with those for the easy sentence condition. These comparisons show that processing complex sentences increased bilateral connectivity between ventral parts of the prefrontal cortex and also had a strong modulatory effect on the connections between left temporal cortex and right prefrontal cortex, as well as on connections between left parietal lobe and left prefrontal cortex (Fig. 7).
As described previously, we deconvolved the canonical HRF from seed signals to estimate the underlying neural activity. Estimated neural activity carried more high frequency components than the hemodynamic activity [Fig. 8(a)], and when incorporated in PPI models changed the effective connectivity maps. In order to statistically evaluate its effect on the maps, we looked at changes in the effective connectivity measures at the seed-to-ROI level [that summarizes maps Fig. 5(a)]. Bonferroni correction was applied to address the effect of multiple comparisons (10 comparisons per map, thus a -value for significance at seed-to-ROI level was set to 0.005). We found significant increases in the measures of effective connectivity between several seeds and ROIs including left vIFG seed and left aSTG, left dIPG and right vIFG regions as well as those between right vIFG seed and left aSTG region [Fig. 8(b)]. For the rest of the seed-to-ROI comparisons, the effect of deconvolution of neural activity was not significant.
To examine the physiological relevance of the results, we used seed signals from different runs in the gPPI analysis. These resulted in lower connectivity values that followed a significantly different distribution as compared (using two sample chi-square test, ) to those of correct physiological signal (Fig. 9).
4. Discussion
In this paper, we establish the use of the gPPI analysis for HD-DOT studies of functional brain networks. We show that, as in fMRI, gPPI analyses provide useful information regarding physiologically relevant dynamic (task-dependent) connectivity among brain regions. Although several analysis approaches provided comparable results, the strongest measures of effective connectivity resulted when we included a deconvolution step in constructing the PPI model.
Our approach for estimating effective connectivity was semiexploratory in that we selected seeds based on local maxima from group maps within known speech regions. Previous neuropsychological and functional neuroimaging studies have repeatedly reported involvement of lateral portions of STG in speech perception.49–54 There is also strong support for the key role that pMTG plays in lexical-semantic processing34,51,55,56 and evidence of the role the inferior parietal cortex plays in working memory processes that are essential for speech comprehension.57 Additionally, multiple parts of the left frontal cortex including MFG, IFG, and premotor cortex have been repeatedly observed during speech comprehension,47,53,58 particularly during sentence processing.59,60 In addition to specifying canonical seeds in these ROI, we explored using subject-specific seeds that allow for individual variance in functional organization. Overall, a relatively small change in the TIC patterns was seen when subject-specific seeds were employed.
Effective connectivity analysis relies on modeling the interactions between tasks and brain activity. A problem arises when brain activity is measured in the form of hemodynamic response, which is delayed and smoothed by neurovascular coupling (e.g., an HRF). On one hand, we assume interactions are at the neural level, which suggests deconvolution might be helpful. On the other hand, as typically implemented, deconvolving is sensitive to noise and requires model estimations about the shape of the HRF. However, deconvolution is needed only to create the interaction term and can be followed by convolution using the same HRF. This two-step procedure minimizes the effect of model errors (same model for both deconvolution and convolution). In the data analyzed here, deconvolution of HRF seemed to enhance frequencies in the expected range for the neural activity in the speech network: as we expected to see (1) enhancement in the stimulus-locked components (i.e., between 0.1 and 0.5 Hz related to interstimulus-intervals of 2 to 10 s) and (2) slower frequencies in the higher order brain regions (related to integration of information over longer timescales in nonsensory regions61–63). Incorporating estimates of neural activity in the interaction model significantly strengthened the effective connectivity measures within the speech network. These included connections between left frontal lobe and several cortical areas, including both left and right posterior middle and lateral superior temporal gyri as well as left anterior superior and left dorsal inferior parietal gyri. Based on the deconvolution model, we found that speech intelligibility modulated the connections between left angular gyrus and left middle frontal gyrus. As we hypothesized, increased syntactic complexity modulated the effective connectivity among regions that we previously found to be involved in processing syntactically complex sentences.12 Overall, these results highlight the presence of hierarchical effective connectivity within a distributed speech-processing network.
Finally, we examined the biological relevance of the results by comparing the distributions of averaged connectivity values based on real signals with those based on scrambled signals. With the real signals, we found that distributions were leaning more toward positive values but with the scrambled signals distributions of positive and negative values were found to be relatively symmetric around 0 particularly in task-independent condition [Fig. 9(a)]. Interestingly, for the task-dependent condition [Fig. 9(b)], we found greater negative correlations with the scrambled signals. While distributions with real and scrambled signals were significantly different for both task-independent and task-dependent cases, this difference was less pronounced for the latter one in the positive values. That could be because in addition to physiological component [], the TDC regressor has a psychological component [ [Eq. (2)] that carries a significant amount of variance in data.
5. Conclusion
In this paper, we established a method for network-level study of brain function using HD-DOT. We incorporated this method in analysis of HD-DOT recordings of brain activity during different levels of speech processing and showed the modulatory effect of intelligibility and complexity on functional connectivity patterns within the cortical speech network. These advances facilitate studies of network-level brain processes using HD-DOT for a variety of tasks and populations.
Acknowledgments
This work was supported in part by NIH grant nos. R01EB009223, R01NS090874, R01NS046424, R21DC015884, and K01MH103594, as well as the Dana Foundation, the McDonnell Center for Systems Neuroscience #220020256, and an Autism Speaks Postdoctoral Translational Research Fellowship #7962. The funding sources had no involvement in the study design, collection, analysis, interpretation of the data, writing of the paper, or decision to submit the paper for publication.
Biographies
Mahlega S. Hassanpour, PhD, is a medical imaging scientist. She received her PhD in physics from Washington University in St. Louis and is currently a postdoctoral associate at Laureate Institute for Brain Research. During her PhD, she focused on developing DOT for neuroimaging of speech perception in people with cochlear implants. In her postdoctoral research, she develops multimodal pharmacological neuroimaging methods for studies of heart-brain connection and evaluates effects of cardiorespiratory stimulation of fMRI signals.
Adam T. Eggebrecht, PhD, is an instructor of radiology at Washington University School of Medicine in the Neurophotonics Lab of the Optical Radiology Labs. His research focuses on developing analytical and optical tools to provide portable and wearable systems for minimally constrained methods to map brain function that extend functional neuroimaging beyond current limitations. These methods allow brain imaging in populations unsuited to traditional methods due to discomfort with current technology or implanted devices who are not able to be studied with MRI due to safety concerns.
Jonathan E. Peelle, PhD, is a cognitive neuroscientist in the Department of Otolaryngology at Washington University in Saint Louis. His lab studies the neuroscience of speech comprehension, aging, and hearing impairment. He also has a courtesy appointment in the Department of Psychological and Brain Sciences, and is affiliated with the Philosophy-Neuroscience-Psychology Program, Linguistics program, and Department of Biology and Biomedical Sciences at Washington University in Saint Louis.
Joseph P. Culver, PhD, is a professor of radiology, physics, and biomedical engineering at Washington University in Saint Louis. His lab explores ways of exploiting noninvasive optical measurements for both functional- and molecular- biological imaging. Specifically, His group develops subsurface optical tomography for imaging intrinsic, hemoglobin-sensitive contrasts, and exogenous, molecularly targeted-fluorescent contrasts. His lab also develops fluorescence tomography systems and methods to image the biodistribution of molecularly targeted probes in small animal models of disease.
Disclosures
J.P.C. and Washington University have financial interests in Cephalogics LLC based on a license of related optical imaging technology by the University to Cephalogics LLC. No conflicts of interest, financial or otherwise, are declared by the authors.
References
- 1.Turkeltaub P. E., Coslett H. B., “Localization of sublexical speech perception components,” Brain Lang. 114(1), 1–15 (2010). 10.1016/j.bandl.2010.03.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Yang J., Andric M., Mathew M. M., “The neural basis of hand gesture comprehension: a meta-analysis of functional magnetic resonance imaging studies,” Neurosci. Biobehav. Rev. 57, 88–104 (2015). 10.1016/j.neubiorev.2015.08.006 [DOI] [PubMed] [Google Scholar]
- 3.Samara Z., et al. , “Human orbital and anterior medial prefrontal cortex: intrinsic connectivity parcellation and functional organization,” Brain Struct. Funct. (2017). 10.1007/s00429-017-1378-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Duong M. V. A., et al. , “Altered functional connectivity related to white matter changes inside the working memory network at the very early stage of MS,” J. Cereb. Blood Flow Metab. 25(10), 1245–1253 (2005). 10.1038/sj.jcbfm.9600122 [DOI] [PubMed] [Google Scholar]
- 5.Buckner R. L., et al. , “Cortical hubs revealed by intrinsic functional connectivity: mapping, assessment of stability, and relation to Alzheimer’s disease,” J. Neurosci. 29(6), 1860–1873 (2009). 10.1523/JNEUROSCI.5062-08.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kennedy D. P., Courchesne E., “Functional abnormalities of the default network during self- and other-reflection in autism,” Social Cognit. Affective Neurosci. 3(2), 177–190 (2008). 10.1093/scan/nsn011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Maximo J. O., Cadena E. J., Kana R. K., “The implications of brain connectivity in the neuropsychology of autism,” Neuropsychol. Rev. 24(1), 16–31 (2014). 10.1007/s11065-014-9250-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Rowe J., et al. , “Attention to action in Parkinson’s disease—impaired effective connectivity among frontal cortical regions,” Brain 125, 276–289 (2002). 10.1093/brain/awf036 [DOI] [PubMed] [Google Scholar]
- 9.Eggebrecht A. T., et al. , “A quantitative spatial comparison of high-density diffuse optical tomography and fMRI cortical mapping,” Neuroimage 61(4), 1120–1128 (2012). 10.1016/j.neuroimage.2012.01.124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Huppert T. J., Franceschini M. A., Boas D. A., “Noninvasive imaging of cerebral activation with diffuse optical tomography,” in In Vivo Optical Imaging of Brain Function, Frostig R. D., Ed., CRC Press, Boca Raton, Florida: (2009). [PubMed] [Google Scholar]
- 11.Becerra L., et al. , “Diffuse optical tomography activation in the somatosensory cortex: specific activation by painful vs. non-painful thermal stimuli,” PLoS One 4(11), e8016 (2009). 10.1371/journal.pone.0008016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hassanpour M. S., et al. , “Mapping cortical responses to speech using high-density diffuse optical tomography,” Neuroimage 117, 319–326 (2015). 10.1016/j.neuroimage.2015.05.058 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Eggebrecht A. T., et al. , “Mapping distributed brain function and networks with diffuse optical tomography,” Nat. Photonics 8(6), 448–454 (2014). 10.1038/nphoton.2014.107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chuang C. C., Sun C. W., “Gender-related effects of prefrontal cortex connectivity: a resting-state functional optical tomography study,” Biomed. Opt. Express 5(8), 2503–2516 (2014). 10.1364/BOE.5.002503 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhou G., et al. , “Development of effective connectivity during own- and other-race face processing: a granger causality analysis,” Front Hum. Neurosci. 10, 474 (2016). 10.3389/fnhum.2016.00474 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Tak S., et al. , “Dynamic causal modelling for functional near-infrared spectroscopy,” Neuroimage 111, 338–349 (2015). 10.1016/j.neuroimage.2015.02.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Friston K. J., Buchel C., “Attentional modulation of effective connectivity from V2 to V5/MT in humans,” Proc. Natl. Acad. Sci. U. S. A. 97(13), 7591–7596 (2000). 10.1073/pnas.97.13.7591 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Friston K. J., “Functional and effective connectivity: a review,” Brain Connect. 1(1), 13–36 (2011). 10.1089/brain.2011.0008 [DOI] [PubMed] [Google Scholar]
- 19.McLaren D. G., et al. , “A generalized form of context-dependent psychophysiological interactions (gPPI): a comparison to standard approaches,” Neuroimage 61(4), 1277–1286 (2012). 10.1016/j.neuroimage.2012.03.068 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Friston K. J., et al. , “Psychophysiological and modulatory interactions in neuroimaging,” Neuroimage 6(3), 218–229 (1997). 10.1006/nimg.1997.0291 [DOI] [PubMed] [Google Scholar]
- 21.Gitelman D. R., et al. , “Modeling regional and psychophysiologic interactions in fMRI: the importance of hemodynamic deconvolution,” Neuroimage 19(1), 200–207 (2003). 10.1016/S1053-8119(03)00058-2 [DOI] [PubMed] [Google Scholar]
- 22.Bitan T., et al. , “Shifts of effective connectivity within a language network during rhyming and spelling,” J. Neurosci. 25(22), 5397–5403 (2005). 10.1523/JNEUROSCI.0864-05.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Snijders T. M., Petersson K. M., Hagoort P., “Effective connectivity of cortical and subcortical regions during unification of sentence structure,” Neuroimage 52(4), 1633–1644 (2010). 10.1016/j.neuroimage.2010.05.035 [DOI] [PubMed] [Google Scholar]
- 24.Stamatakis E. A., et al. , “Cingulate control of fronto-temporal integration reflects linguistic demands: a three-way interaction in functional connectivity,” Neuroimage 28(1), 115–121 (2005). 10.1016/j.neuroimage.2005.06.012 [DOI] [PubMed] [Google Scholar]
- 25.Kreifelts B., et al. , “Audiovisual integration of emotional signals in voice and face: an event-related fMRI study,” Neuroimage 37(4), 1445–1456 (2007). 10.1016/j.neuroimage.2007.06.020 [DOI] [PubMed] [Google Scholar]
- 26.Vitali P., et al. , “Generating animal and tool names: an fMRI study of effective connectivity,” Brain Lang. 93(1), 32–45 (2005). 10.1016/j.bandl.2004.08.005 [DOI] [PubMed] [Google Scholar]
- 27.Papoutsi M., et al. , “Is left fronto-temporal connectivity essential for syntax? Effective connectivity, tractography and performance in left-hemisphere damaged patients,” Neuroimage 58(2), 656–664 (2011). 10.1016/j.neuroimage.2011.06.036 [DOI] [PubMed] [Google Scholar]
- 28.Rossi S., et al. , “Functional connectivity of dorsal and ventral frontoparietal seed regions during auditory orienting,” Brain Res. 1583, 159–168 (2014). 10.1016/j.brainres.2014.08.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bisconti S., et al. , “Functional near-infrared spectroscopy brain imaging investigation of phonological awareness and passage comprehension abilities in adult recipients of cochlear implants,” J. Speech Lang. Hear. Res. 59(2), 239–253 (2016). 10.1044/2015_JSLHR-L-14-0278 [DOI] [PubMed] [Google Scholar]
- 30.Pillay S. B., et al. , “Lesion localization of speech comprehension deficits in chronic aphasia,” Neurology 88(10), 970–975 (2017). 10.1212/WNL.0000000000003683 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Gernsbacher M. A., Morson E. M., Grace E. J., “Language and speech in autism,” Annu. Rev. Linguist. 2, 413–425 (2016). 10.1146/annurev-linguistics-030514-124824 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Peelle J. E., “Optical neuroimaging of spoken language,” Language Cognit. Neurosci. 32(7), 847–854 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zeff B. W., et al. , “Retinotopic mapping of adult human visual cortex with high-density diffuse optical tomography,” Proc. Natl. Acad. Sci. U. S. A. 104(29), 12169–12174 (2007). 10.1073/pnas.0611266104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Peelle J. E., “The hemispheric lateralization of speech processing depends on what ‘speech’ is: a hierarchical perspective,” Front Hum. Neurosci. 6, 309 (2012). 10.3389/fnhum.2012.00309 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Peelle J. E., “Six-word subject-relative and object-relative sentences,” Peelle Lab Stimuli, (2016) https://osf.io/szt2g/.
- 36.Traxler M. J., Morris R. K., Seely R. E., “Processing subject and object relative clauses: evidence from eye movements,” J. Mem. Lang. 47(1), 69–90 (2002). 10.1006/jmla.2001.2836 [DOI] [Google Scholar]
- 37.Gibson E., “Linguistic complexity: locality of syntactic dependencies,” Cognition 68(1), 1–76 (1998). 10.1016/S0010-0277(98)00034-1 [DOI] [PubMed] [Google Scholar]
- 38.Wingfield A., Peelle J. E., Grossman M., “Speech rate and syntactic complexity as multiplicative factors in speech comprehension by young and older adults,” Aging Neuropsychol. Cognit. 10, 310–322 (2003). 10.1076/anec.10.4.310.28974 [DOI] [Google Scholar]
- 39.Shannon R. V., et al. , “Speech recognition with primarily temporal cues,” Science 270(5234), 303–304 (1995). 10.1126/science.270.5234.303 [DOI] [PubMed] [Google Scholar]
- 40.Peelle J. E., “Noise vocoding (jp_vocode.m),” Misc matlab scripts, (2016) https://github.com/jpeelle/jp_matlab.
- 41.Brainard D. H., “The psychophysics toolbox,” Spat. Vision 10(4), 433–436 (1997). 10.1163/156856897X00357 [DOI] [PubMed] [Google Scholar]
- 42.Hassanpour M. S., et al. , “Statistical analysis of high density diffuse optical tomography,” Neuroimage 85(Pt 1), 104–116 (2014). 10.1016/j.neuroimage.2013.05.105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Dehghani H., et al. , “Depth sensitivity and image reconstruction analysis of dense imaging arrays for mapping brain function with diffuse optical tomography,” Appl. Opt. 48(10), D137–D143 (2009). 10.1364/AO.48.00D137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Dale A. M., Fischl B., Sereno M. I., “Cortical surface-based analysis—I. Segmentation and surface reconstruction,” Neuroimage 9(2), 179–194 (1999). 10.1006/nimg.1998.0395 [DOI] [PubMed] [Google Scholar]
- 45.Hayasaka S., et al. , “Nonstationary cluster-size inference with random field and permutation methods,” Neuroimage 22(2), 676–687 (2004). 10.1016/j.neuroimage.2004.01.041 [DOI] [PubMed] [Google Scholar]
- 46.Worsley K. J., et al. , “Applications of random field theory to functional connectivity,” Hum. Brain Mapp. 6(5–6), 364–367 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Hickok G., Poeppel D., “Opinion—the cortical organization of speech processing,” Nat. Rev. Neurosci. 8(5), 393–402 (2007). 10.1038/nrn2113 [DOI] [PubMed] [Google Scholar]
- 48.White B. R., Culver J. P., “Quantitative evaluation of high-density diffuse optical tomography: in vivo resolution and mapping performance,” J. Biomed. Opt. 15(2), 026006 (2010). 10.1117/1.3368999 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Belin P., et al. , “Voice-selective areas in human auditory cortex,” Nature 403, 309–312 (2000). 10.1038/35002078 [DOI] [PubMed] [Google Scholar]
- 50.Binder J. R., et al. , “Human temporal lobe activation by speech and nonspeech sounds,” Cereb. Cortex 10(5), 512–528 (2000). 10.1093/cercor/10.5.512 [DOI] [PubMed] [Google Scholar]
- 51.Binder J. R., et al. , “Human brain language areas identified by functional magnetic resonance imaging,” J. Neurosci. 17(1), 353–362 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Davis M. H., Johnsrude I. S., “Hierarchical processing in spoken language comprehension,” J. Neurosci. 23(8), 3423–3431 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Demonet J. F., et al. , “The anatomy of phonological and semantic processing in normal subjects,” Brain 115, 1753–1768 (1992). 10.1093/brain/115.6.1753 [DOI] [PubMed] [Google Scholar]
- 54.Scott S. K., et al. , “Identification of a pathway for intelligible speech in the left temporal lobe,” Brain 123, 2400–2406 (2000). 10.1093/brain/123.12.2400 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Rissman J., Eliassen J. C., Blumstein S. E., “An event-related fMRI investigation of implicit semantic priming,” J. Cognit. Neurosci. 15(8), 1160–1175 (2003). 10.1162/089892903322598120 [DOI] [PubMed] [Google Scholar]
- 56.Rodd J. M., Davis M. H., Johnsrude I. S., “The neural mechanisms of speech comprehension: fMRI studies of semantic ambiguity,” Cereb. Cortex 15(8), 1261–1269 (2005). 10.1093/cercor/bhi009 [DOI] [PubMed] [Google Scholar]
- 57.Chou T. L., et al. , “The role of inferior frontal gyrus and inferior parietal lobule in semantic processing of Chinese characters,” Exp. Brain Res. 198(4), 465–475 (2009). 10.1007/s00221-009-1942-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Price C. J., “A review and synthesis of the first 20 years of PET and fMRI studies of heard speech, spoken language and reading,” NeuroImage 62(2), 816–847 (2012). 10.1016/j.neuroimage.2012.04.062 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Adank P., “Design choices in imaging speech comprehension: an activation likelihood estimation (ALE) meta-analysis,” NeuroImage 63(3), 1601–1613 (2012). 10.1016/j.neuroimage.2012.07.027 [DOI] [PubMed] [Google Scholar]
- 60.Peelle J. E., et al. , “Neural processing during older adults’ comprehension of spoken sentences: age differences in resource allocation and connectivity,” Cereb. Cortex 20(4), 773–782 (2010). 10.1093/cercor/bhp142 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Baria A. T., et al. , “Linking human brain local activity fluctuations to structural and functional network architectures,” NeuroImage 73, 144–155 (2013). 10.1016/j.neuroimage.2013.01.072 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Kiebel S. J., Daunizeau J., Friston K. J., “A hierarchy of time-scales and the brain,” PLoS Comput. Biol. 4(11), e1000209 (2008). 10.1371/journal.pcbi.1000209 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Stephens G. J., Honey C. J., Hasson U., “A place for time: the spatiotemporal structure of neural dynamics during natural audition,” J. Neurophysiol. 110(9), 2019–2026 (2013). 10.1152/jn.00268.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]