Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2017 Dec 13;37(50):12187–12201. doi: 10.1523/JNEUROSCI.1436-17.2017

Extensive Tonotopic Mapping across Auditory Cortex Is Recapitulated by Spectrally Directed Attention and Systematically Related to Cortical Myeloarchitecture

Frederic K Dick 1,2,3,, Matt I Lehet 4,5, Martina F Callaghan 7, Tim A Keller 4,6, Martin I Sereno 8, Lori L Holt 4,5,
PMCID: PMC5729191  PMID: 29109238

Abstract

Auditory selective attention is vital in natural soundscapes. But it is unclear how attentional focus on the primary dimension of auditory representation—acoustic frequency—might modulate basic auditory functional topography during active listening. In contrast to visual selective attention, which is supported by motor-mediated optimization of input across saccades and pupil dilation, the primate auditory system has fewer means of differentially sampling the world. This makes spectrally-directed endogenous attention a particularly crucial aspect of auditory attention. Using a novel functional paradigm combined with quantitative MRI, we establish in male and female listeners that human frequency-band-selective attention drives activation in both myeloarchitectonically estimated auditory core, and across the majority of tonotopically mapped nonprimary auditory cortex. The attentionally driven best-frequency maps show strong concordance with sensory-driven maps in the same subjects across much of the temporal plane, with poor concordance in areas outside traditional auditory cortex. There is significantly greater activation across most of auditory cortex when best frequency is attended, versus ignored; the same regions do not show this enhancement when attending to the least-preferred frequency band. Finally, the results demonstrate that there is spatial correspondence between the degree of myelination and the strength of the tonotopic signal across a number of regions in auditory cortex. Strong frequency preferences across tonotopically mapped auditory cortex spatially correlate with R1-estimated myeloarchitecture, indicating shared functional and anatomical organization that may underlie intrinsic auditory regionalization.

SIGNIFICANCE STATEMENT Perception is an active process, especially sensitive to attentional state. Listeners direct auditory attention to track a violin's melody within an ensemble performance, or to follow a voice in a crowded cafe. Although diverse pathologies reduce quality of life by impacting such spectrally directed auditory attention, its neurobiological bases are unclear. We demonstrate that human primary and nonprimary auditory cortical activation is modulated by spectrally directed attention in a manner that recapitulates its tonotopic sensory organization. Further, the graded activation profiles evoked by single-frequency bands are correlated with attentionally driven activation when these bands are presented in complex soundscapes. Finally, we observe a strong concordance in the degree of cortical myelination and the strength of tonotopic activation across several auditory cortical regions.

Keywords: attention, auditory, cortical mapping

Introduction

Listeners shift attention across multiple simultaneously present acoustic dimensions to home in on those that are diagnostic in guiding behavior (Idemaru and Holt, 2011; Herrmann et al., 2013; Shamma and Fritz, 2014). In nonhuman animal studies, task-based spectral attention adaptively modulates auditory neurons' spectrotemporal response fields (Fritz et al., 2010). Human neuroimaging results reveal that attention to streams of high- versus low-frequency acoustic input can modulate activity in tonotopically defined regions (Paltoglou et al., 2009), as can imagery of higher versus lower frequencies (Oh et al., 2013). In and directly around Heschl's gyrus, there are strong frequency band-specific attentional effects to high and low pure-tone streams presented to opposite ears (Da Costa et al., 2013) and a shared topography of sensory and attentionally driven responses (Riecke et al., 2017). These results establish that endogenous attention directed across acoustic frequency, the primary axis of auditory representation, can modulate human cortical activity in a tonotopic manner around Heschl's gyrus. But there remain important unanswered questions about the neurobiological basis of human spectrally directed attention.

First, does the topography of attention to different frequency bands recapitulate tonotopic organization in human primary auditory cortex? Nonhuman animal physiology establishes spectrally directed attention in myeloarchitectonically and cytoarchitectonically defined primary areas in “auditory core” (Fritz et al., 2007b; Shamma and Fritz, 2014). However, although two recent neuroimaging studies have shown strong similarities between stimulus-driven and attentionally driven tonotopic organization in and directly around Heschl's gyrus (Da Costa et al., 2013; Riecke et al., 2017), it has not yet been possible to unambiguously localize this effect to human auditory core. Here, we use high-resolution quantitative MRI (Pierpaoli, 2010) to estimate myeloarchitectonically defined auditory core, and demonstrate that spectrally directed attention modulates its activation in a tonotopically organized manner.

Second, is attentionally driven tonotopic organization present outside of auditory core? In humans, Riecke et al. (2017) found no significant evidence for tonotopically organized effects of spectral attention outside of early auditory areas but did show that the information content of nonprimary cortical frequency representations was sufficient for above-chance decoding of listeners' frequency-selective attentional focus. The lack of attentionally driven tonotopic maps contrasts with the finding that most nonprimary cortical visual areas exhibit strong retinotopically specific attentional effects (Saygin and Sereno, 2008). Using intensive data collection (>7000 functional volumes per subject), we present evidence for widespread, tonotopically organized modulation by spectral attention across much of auditory cortex, with individual differences in individual participants' tonotopic maps reproduced in attentionally driven maps.

Third, what is the effect of frequency-selective attention being directed to a voxel's nonpreferred frequency band? Detailed fMRI studies of stimulus-driven frequency response functions (Schönwiesner and Zatorre, 2009; Moerel et al., 2013) have shown graded and multipeaked frequency responses across human auditory cortex. However, it is unclear whether these more complex patterns are recapitulated by attention to a given frequency band. In the context of three distinct frequency bands, Riecke et al. (2017) found that attentional filters appeared to be bandpass in and around Heschl's gyrus. Here, using a five-frequency-band paradigm, we establish that graded response profiles evoked by single-frequency bands are strongly associated with attentionally driven response profiles to those frequencies across much of auditory cortex. We also show that a systematic topography of “dis-preferred” frequency can be driven by attention, and establish the regionalization of spectral attentional effects relative to prior studies of cross-modal auditory attention (Petkov et al., 2004).

Finally, is there spatial correspondence between auditory cortical anatomy, as measured by the local change in R1-estimated myelination, and fMRI-assessed strength of relative frequency selectivity? Postmortem Gallyas staining to establish human myeloarchitecture reveals considerable variability in auditory cortical myelination that is associated with MRI signal change in the same brain (Wallace et al., 2016). Likewise, variation in cortical myelination estimated using T1-weighted/T2-weighted ratio approaches also appears to correspond spatially with some functional variation in the superior temporal lobe (Glasser et al., 2016). Here, we demonstrate that there is spatial concordance between the degree of myelination and the amplitude of the frequency-selective tonotopic signal across several regions in auditory cortex.

Materials and Methods

Experiment overview.

We used a novel paradigm in which listeners direct attention to a series of four-tone “mini-sequences” that fall within one of five possible spectral bands, without any spatial cues. The task is to monitor for temporally adjacent mini-sequence repeats within the attended band. Because this places a very high demand on encoding and integrating spectral sequences within a delimited frequency range, we expect it to be especially effective in evoking strong responses in nonprimary auditory cortical areas. The goal is to address where specifically in the auditory system spectral gain from attention is evident, and akin to long-standing work in vision (Kastner and Ungerleider, 2000), to delineate the topographic maps across which attentional modulation is apparent.

The target mini-sequences were embedded in either an informationally sparse or informationally dense acoustic scene (Fig. 1). Streams of four-tone mini-sequences were presented in either a single band (“tonotopy,” Fig. 1A), or accompanied by mini-sequences in a “distractor” frequency band, the center frequency of which varied in the frequency distance from the attended band across blocks (attention-tonotopy, [attn-tono]; Fig. 1B). A verbal cue directed listeners' attention to a specific frequency band, within which listeners monitored four-tone mini-sequences for repeats; the distractor band in attn-tono blocks also contained repeats. Using a discretized version of a phase-encoded fMRI design (Sereno et al., 1995; Rao and Talavage, 2005; Schwarzkopf et al., 2011; Herdener et al., 2013; Langers et al., 2014), the cued frequency band stepped up or down in orderly steps across the acoustic spectrum across a 64 s cycle (Figure 1C). Phase-encoded tonotopic designs benefit from the power and robustness of the “traveling wave” method for topographic cortical mapping of smoothly varying representations (Engel, 2012); the discretized (blocked) version we use here allows use of the verbal cue and has the advantage of being able to be analyzed using both Fourier and regression approaches. This allowed us to include an additional, “randomized” attn-tono condition that contributed both as a control condition in Fourier analyses and also as an additional attn-tono run in regression analyses (Fig. 1D). The tone stimuli from this condition were identical to the “stepped” attn-tono condition, but the order of the verbal cues directing listeners' attention to a specific frequency band was scrambled in their assignment to blocks. This preserved the acoustics across conditions but eliminated the consistent “stepping” of attention through the frequency spectrum in the randomized condition, thereby destroying the consistent phase lag associated with specific frequency bands that support Fourier analyses (schematized in Figure 1E; and see below). Thus, to the extent that there are attentionally driven frequency-selective maps in auditory cortex, we expect tonotopically organized attentional maps to be apparent in the stepped, but not the randomized attn-tono conditions under Fourier analyses. In contrast, regression analyses include a model of attention, allowing stepped and randomized attn-tono conditions to be pooled to investigate the impact of attention on cortical activation. Across both Fourier and regression analyses, the stepped attn-tono conditions were collapsed across runs for which the cued frequency band stepped up in frequency and those that stepped down; inclusion of each simply balanced the directional movement of attention through the acoustic spectrum across the experiment.

Figure 1.

Figure 1.

Stimuli and design overview. A, In a representative 12.8 s Tonotopy block, a neutral verbal prompt “hear” precedes 14 four-tone mini-sequences sampled around one of five center frequencies. The task is to detect the 1–3 mini-sequence repeats embedded within the block. Gray box represents a single mini-sequence repeat. B, A single Attention-tonotopy (attn-tono) block includes two simultaneous streams of mini-sequences with distinct center frequencies. Mini-sequence repeats occur in each stream. A verbal prompt (“high” or “low”) directs listeners to attend to one stream and report mini-sequence repeats in that stream while ignoring repeats in the unattended stream. Two randomly ordered orientation tones at the center frequency of each stream alert listeners to the frequency neighborhood of the upcoming streams. C, A single 64 s cycle of stepped attn-tono blocks includes five 12.8 s blocks that step up (shown), or down, in center frequency. In this single cycle, the frequency band to which attention is directed by the verbal prompt (indicated with “high”/”low” above each block) is acoustically matched with the tonotopy cycle shown in B, but there are always competing unattended mini-sequences in a distinct frequency band. D, A single 64 s cycle of randomized attn-tono blocks is acoustically identical to the stepped attn-tono cycle in C, except that half of the verbal prompts have been swapped, and therefore no longer cue attention to frequency with consistent phase lags. E, The distinction between stepped and randomized attn-tono blocks is highlighted by examining the first three (of eight) cycles of a stepped (top) versus randomized (bottom) attn-tono run. The focus of attention is color coded in the frequency band-specific manner shown in A. Top, For the stepped condition, there is a consistent relationship between the stimulus phase lag and the attended frequency across cycles within a run. Thus, for voxels that show a consistently higher response at one attended frequency band compared with all others, there will be a periodic response at 8 cycles/run at a given phase lag corresponding to the particular frequency band attended. Bottom, For the randomized condition, there is no consistent relationship, providing a control condition for Fourier analyses because frequency band-directed attention is aperiodic across a run. F, The stimulus phase lag with the highest periodic BOLD signal amplitude is determined for each voxel, mapped to a color scale, and then painted onto the cortical surface patch. BOLD signal amplitude is mapped to the color's saturation. A–D, Stimulus intensity is adjusted across the spectrum to aid visual presentation of energy across frequency bands (for details on actual intensity across frequency bands, see Materials and Methods).

In summary, in the attn-tono conditions, attention alone was available to differentially drive responses to an approximately constant acoustic input, whereas in the tonotopy condition, responses were driven by spectrally selective stimuli as well as by attention.

We analyzed mapping data using Fourier methods with individual and surface-based group analysis methods, as described previously (Sereno et al., 1995; Hagler et al., 2006, 2007). With this approach, voxels preferentially responding to a certain point in a stepped stimulus cycle have a significantly higher signal amplitude at this stimulus temporal frequency (meaning the slow frequency of the repeat of the spectral ramp) than at the average of other “noise” frequencies (Fig. 1E). Significant signal phases (a particular position in the cycle) are then mapped to a color scale to indicate the voxel's “best frequency,” and signal amplitude is mapped to the voxel's color saturation (Fig. 1F). We time-reversed runs stepping down in frequency and averaged them with runs stepping up in frequency (Sereno et al., 1995; Talavage et al., 2004; Dick et al., 2012; Ahveninen et al., 2016). Cross-subject averaging of phase-encoded mapping data was performed using a method described previously (Hagler et al., 2007) in which the real and imaginary components of the signal with respect to the stepped cycle were sampled to the cortical surface and then averaged across subjects, preserving any phase information that was coherent over subjects.

Using previously established methods (Dick et al., 2012; Sereno et al., 2013; Lutti et al., 2014; see also Glasser et al., 2016), we used high-resolution quantitative multiparameter mapping to generate maps of estimated cortical myelination based on longitudinal relaxation times (quantitative T1). Recent work by multiple laboratories supports the hypothesis that T1 relaxation is reliably associated with quantitative differences in myelination in white matter and cortex (Sereno et al., 2013; Stüber et al., 2014; Dinse et al., 2015; Callaghan et al., 2015; Tardif et al., 2015, 2016; Turner, 2015). Here, we calculated each subject's R1 (1/T1) values, where the greater the R1, the higher the inferred myelin content. These R1 values were resampled onto his or her surface at a cortical depth fraction of 0.5 and also averaged across individuals using sulcus-aligned cortical-surface-based procedures (see below for further details).

Participants.

Eight adults (aged 23–45 years, mean 28 years; 6 female) participated; none reported a history of neurological disease or communication disorders. All had some childhood and/or adult musical training (one had a music degree) and had previous experience with longer scanning sessions. While musical training seemed to facilitate learning the experimental task, subsequent behavioral studies in the laboratory have shown that musically naive subjects can also attain excellent performance with similar levels of training on this and even more demanding related tasks.

Stimuli and design.

Stimuli were created using custom code in MATLAB version 2015a (The MathWorks) and SoX version 14.4.2 (www.sourceforge.net). The basic stimulus unit was a four-tone mini-sequence (140 ms sine-wave tones, including 10 ms linear amplitude ramp), with each tone drawn with replacement from a seven-semitone, band-delimited pool centered around one of five frequencies (300, 566, 1068, 2016, and 3805 Hz; Fig. 1A). Fourteen mini-sequences formed a block (mean intersequence silent interval 240 ms, SD 10 ms). Each block contained one to three mini-sequence repeats (1:2:1 ratio of 1, 2, and 3 repeats). When there was more than one repeat per block, mini-sequence repeat pairs were separated by at least one intervening mini-sequence. Each block began with a verbal prompt—“hear”, “high”, or “low”—generated with a loudness-equalized Mac Victoria voice mean duration 506 ms (SD 36 ms), padded with silence to 800 ms total duration. This prompt was followed by 800 ms silent gap (tonotopy) or tone-cue (attn-tono), then the 14 mini-sequences (11.2 s in total), for a total block duration of 12.8 s.

The task was to detect mini-sequence repeats in the attended frequency band (i.e., a 1-back task). In the tonotopy condition, mini-sequences were confined to a single frequency band preceded by the neutral verbal prompt hear (Fig. 1A). In two of the four single-band tonotopy runs, block center frequency was stepped from low to high over a 64 s cycle with 8 cycles/run; step direction was reversed (high to low) for the other two runs. This is a “discrete” version of phase-encoded designs commonly used in visual, somatosensory, and auditory mapping studies (Engel et al., 1994; Sereno et al., 1995; Da Costa et al., 2011; Dick et al., 2012; Langers and van Dijk, 2012; Langers et al., 2014; Saenz and Langers, 2014).

The attn-tono condition had the exact mini-sequence patterns from the tonotopy blocks, but there also were simultaneous, competing mini-sequences in a distinct frequency band with a center frequency at least 14 semitones apart (Fig. 1B; 300 vs 1068 Hz; 300 vs 2016 Hz; 300 vs 3805 Hz; 566 vs 2016 Hz; 566 vs 3805 Hz; 1068 vs 3805 Hz; not all center frequencies were paired due to the 14 semitone constraint). The verbal prompt (“high” or “low”) initiating each attn-tono block signaled participants to perform the 1-back task on either the higher or lower frequency band. Immediately after the verbal prompt, a randomly ordered pair of sine-wave tones cued the center frequencies of the upcoming block (140 ms tones, including 8 ms linear on/off amplitude ramp; 26 ms intertone silence, tone pair followed by 494 ms silence, total duration 800 ms). Crucially, there were mini-sequence repeats even in the unattended band to assure that attention directed to the task was endogenously driven rather than being attracted by stimulus repetition effects (Barascud et al., 2016).

There were two attn-tono conditions: stepped and randomized. Analogous to single-band tonotopy runs, in stepped attn-tono runs, the verbally cued frequency band implicitly stepped up (2 runs) or down (2 runs) in frequency over a 64 s cycle (Fig. 1C). This cued iterative stepping through the frequency spectrum facilitates transfer of attention to each frequency band (as in traditional phase-encoded designs) and supports Fourier approaches to analysis (Fig. 1E). Each randomized attn-tono run was acoustically identical to a stepped run, but the verbal prompt was manipulated so that there was no systematic, stepped organization of attended mini-sequence center frequencies through the spectrum (Fig. 1D). For this condition, frequency bands were cued at inconsistent phase lags within the 8 cycles/run, thereby phase-canceling any periodic attentional response; this is schematized in Figure 1E. This randomized-order control is important, as there is a small (∼1 octave) overall shift in spectral mean over the course of an attn-tono stimulus cycle that is unavoidable due to the constraints on the pairing of frequency bands.

Each of the 12 9.6-min-long runs was composed of eight 64 s cycles plus 32 s silent periods at the beginning and end of each run to allow for calculation of baseline auditory activation (Klein et al., 2014).

Behavioral thresholds and training.

Participants first underwent behavioral tests of monaural pure-tone thresholds and binaural thresholds for detecting mini-sequences in quiet and in acoustic noise generated by the MRI scanner running the multiband EPI sequence. This provided a basis for adjusting center frequency amplitudes to approximate equal loudness in scanner noise. Participants also trained on the mini-sequence detection task in quiet and in acoustic scanner noise across two sessions.

Imaging data acquisition.

Structural and functional images were acquired on a 3 Tesla Siemens Verio wide-bore MRI scanner at the Scientific Imaging and Brain Research Center at Carnegie Mellon University using a phased array 32-channel head coil across three scan sessions on separate days. Stimulus presentation was under the control of a MacBookPro running PsychToolbox 3.0.12 in MATLAB, with audio output to an external AD/DA converter (Babyface, RME) connected to an amplifier (Pylepro) that delivered stimuli to participants in the scanner diotically over MRI-compatible earbuds (Sensimetrics S14). All stimuli were prefiltered to equalize sound stimuli according to the earbuds' frequency response. After participants were settled into the bore, sound volume was adjusted so that participants could comfortably hear all frequencies through scanner noise. Participants wore a fiber optic response glove (Current Designs) that communicated with a Brain Logics I/O device (Psychology Software Tools); participants used the glove to respond to mini-sequence repeats using the right index finger. During all functional scans, subjects closed their eyes to reduce the potential for stimulus-correlated eye movements.

In the initial scanning session (∼50 min), we acquired multiparameter mapping images for quantitative myelin mapping and structural identification of primary auditory cortex on an individual basis while participants watched a film. Proton density-weighted (PDw), T1-weighted (T1w), and magnetization transfer (MTw) images were acquired using an in-house 3D FLASH pulse sequence (voxel size: 0.8 × 0.8 × 0.8 mm3, matrix = 320 × 280 × 208, TR = 25.0 ms, bandwidth 488 Hz/px, excitation flip angle: 6° (PDw/MTw) or 21° (T1w), slab rotation 30°). To accelerate this high-resolution acquisition, a partial Fourier acquisition (6/8 coverage) was used in the inner phase-encoded direction (right, left) and parallel imaging was used along the outer phase-encoding direction (anteroposterior), reconstructed using the GRAPPA algorithm (acceleration factor 2, 18 integrated auto-calibration lines) as implemented on the scanner platform. Four gradient echoes were acquired for each contrast (TE = 2.5, 4.74, 6.98, 9.22 ms) after each excitation pulse and averaged to improve signal-to-noise ratio (Helms et al., 2009). Each FLASH acquisition lasted 9 min 45 s. Quantitative R1 (1/T1) maps were estimated from the PDw and T1w images according to the model developed by Helms et al. (2008), including a correction for RF transmit field inhomogeneities (Lutti et al., 2010) and imperfect spoiling (Preibisch and Deichmann, 2009). The transmit field map was calculated using a 3D EPI spin-echo/stimulated echo method (Lutti et al., 2010, 2012); FOV = 256 × 192 × 192 mm, matrix = 64 × 64 × 48, TE = 53.14 ms, TM = 47.60 ms, TR = 500 ms, bandwidth = 2298, nominal α varying from 135° to 65° in steps of 5°, acquisition time 6 min) and was corrected for off-resonance effects using a standard B0 field map (double gradient echo FLASH, 3 × 3 × 2 mm isotropic resolution, whole-brain coverage).

The final two scanning sessions acquired functional data for four runs each of the tonotopy, stepped attn-tono, and randomized attn-tono conditions. The runs were interleaved across conditions and designed to assess phase-encoded functional influences of selective attention across frequency (stepped attn-tono), the functional response to identical acoustics without systematic phase-encoded shifts of attention (randomized attn-tono), and functional responses to single-frequency bands identical to the attended bands in attn-tono, with phase-encoded steps through frequency and no distractor frequency bands (tonotopy). Across all functional runs, participants engaged in detecting repeats (1-back) of the four-tone mini-sequences. Run order was counterbalanced according to condition and whether the cycle involved steps up or down in frequency.

Functional images were acquired using a T2*-weighted EPI pulse sequence (44 oblique axial slices, in-plane resolution 3 mm × 3 mm, 3 mm slice thickness, no gap, TR = 1000 ms, TE = 41 ms, flip angle = 61°, matrix size = 64 × 64, FOV = 192 mm). All EPI functional scans were performed using 4× multiband acceleration (Feinberg et al., 2010; Feinberg and Setsompop, 2013). There were 584 repetitions acquired per run, with the first 8 images discarded to allow for longitudinal magnetization to arrive at equilibrium. Runs were pseudo-randomly ordered across participants.

Image preprocessing: cortical surface creation, and mapping of R1 values.

Each subject's cortical surface was reconstructed from a contrast-optimized synthetic FLASH volume, created with mri_synthesize in Freesurfer from scaled and truncated versions of the T1 and proton-density volumes; another MPRAGE-like synthetic image was created for use with the automated Freesurfer Talairach procedure. Both volumes were conformed to 1 mm isotropic resolution and used in a customized reconstruction pipeline version. In particular, the subject's PD volume was used to deskull the synthetic FLASH image using a “shrink-wrap” technique (Dale and Sereno, 1993). After inspection for reconstruction quality, R1 values were resampled from 50% cortical depth fraction to the subject's surface, and also morphed to the unit icosahedron for cross-subject curvature-aligned cortical-surface-based averaging (Fischl et al., 1999).

EPI processing.

Each functional image from both sessions was aligned to a reference volume from the middle of the first run using AFNI's 3dvolreg; registration and motion correction goodness were hand-checked for each run. The reference volume was aligned to the subject's cortical surface using boundary-based registration in Freesurfer (Greve and Fischl, 2009), verified using manual blink comparison, and applied to the volume-aligned EPI data for resampling. EPI data were analyzed in native space without any spatial smoothing using both Fourier and general linear methods.

Experimental design and statistical analysis.

As noted above, the fMRI experiment used a discrete version of a traditional phase-encoded design, such that both Fourier based and general linear model approaches could be used. Fourier analyses were performed in csurf (http://www.cogsci.ucsd.edu/∼sereno/.tmp/dist/csurf) with individual and group analysis methods used as previously described (Sereno et al., 1995; Sereno and Huang, 2006; Hagler et al., 2007). Functional activation amplitude was estimated as the Fourier amplitude of the periodic BOLD signal (proportional to percent response) at the frequency of the stimulus cycle (8 repetitions per run). An F statistic was calculated by comparing that amplitude to the average amplitude of other noise frequencies (Hagler et al., 2007). Periodic signal components with very low frequencies (due to slow head motion) and the second and third harmonic of the stimulus were excluded as neither signal nor noise (this is mathematically equivalent to first linearly regressing out these frequencies as nuisance variables before calculating significance). The phase of the signal, which corresponds to a particular point of the stimulus cycle, was then mapped to a color scale, and the amplitude of the signal at each vertex was mapped to color saturation (Gouraud shading within each face). Runs with downward frequency steps were time-reversed and averaged with upward-stepped scans to cancel fixed voxel-specific delays in the BOLD response.

Linear modeling was performed in FSL (Smith et al., 2004). For all runs, the motion-registered data were high-pass-filtered (100 s) and prewhitened; a hemodynamic model corresponding to each stimulated and attended (tonotopy condition) or attended (stepped, randomized attn-tono conditions) frequency band was created by convolving the 12.8 s block with a gamma function (lag 6 s, SD 3s). In a separate multiple regression, the unattended (ignored) frequency band was modeled for both stepped and randomized attn-tono conditions. The verbal cue was also modeled; all models were temporally filtered before multiple regression. Coefficients from the first-level contrasts for each of the four runs were combined in a fixed-effects analysis for each condition; data from the stepped and random block conditions were also combined in an eight-run average.

Cross-subject averaging of phase-encoded mapping data was performed using the methodology developed by Hagler and Sereno (2006) in which the real and imaginary components of the signal with respect to the stimulus ramp are averaged across subjects, preserving any phase information consistent between subjects. This was performed by projecting each participant's phase-encoded map to the FreeSurfer spherical atlas using mri_surf2surf, performing 1 step of surface-based smoothing (<1 mm FWHM in 2D), averaging across subjects at each vertex, then painting back onto a single subject's surface for viewing. For the multiple regression analyses, the same sampling process was used to sample each subject's contrast parameter estimates for cross-subject averaging and t tests.

Surface-based cluster exclusion was used to correct for multiple comparisons in the groupwise averages (surfclust and randsurfclust) (Hagler et al., 2006). The exclusion criterion (only surface clusters >78 mm2 unless otherwise noted) was determined based on the minimum estimated cortical area from iterative random sampling of cluster sizes (N = 10,000 iterations per hemisphere in randsurfclust) required to achieve a corrected α of p < 0.001 for each hemisphere, based on an initial uncorrected α of vertexwise p < 0.01.

As an alternative means of defining primary auditory cortex, we projected the Morosan et al. (2001) 3D raw probability maps provided in the AFNI (version 16.3.13) (Cox, 2012) to a FreeSurfer “fsaverage” brain registered to the Talairach target brain, resampled the data onto the cortical surface, and thresholded at p > 0.30 to create region of interest (ROI) labels. The labels were ∼2 mm FWHM (five steps) surface-smoothed with manual removal of isolated marked vertices (due to “spillover” from the 3D to 2D projection within the lateral fissure), then spherically morphed to each subject. The labels were individually inspected (and filled if there were small holes in the label); then a boundary was delineated around each label on each subject's flattened auditory cortical patch.

ROI analyses.

We quantified the similarity between frequency band response profiles driven by stimulus+attention (tonotopy) versus attention alone (attn-tono) in a “quilt” of small cortical surface-based ROIs that tiled the temporal plane. ROIs (see Fig. 5B) were created on a single subject's right and left hemisphere flattened patches by flooding all vertices within a 4 mm radius around a central selected vertex. Each of the ROIs (57 in the right hemisphere patch and 68 in the slightly larger left patch) were then spherically morphed to the other 7 subjects' flattened patches. Spurious ROI sampling on the edges of the patches was manually corrected on the original subject's inflated surface and remorphed to all other subjects. Each ROI was then projected into the registered native-space EPI volume using Freesurfer's mri_label2vol (sampled from the gray-white boundary to 0.8 of the calculated cortical depth, with fillthresh set to 0.5). For each subject, within each ROI, we calculated the average parameter estimate for each frequency band for tonotopy, and combined stepped and randomized attn-tono conditions. For each ROI, we then ran a linear model with average tonotopy parameter estimates for the 5 frequency bands predicting average attn-tono parameter estimates for the same bands, including subjects as a random factor. The resulting partial t statistic for each ROI was z-transformed and color-rendered in Figure 5B, with p value thresholds Bonferroni-corrected to p < 0.05 for the number of ROIs per hemisphere, and indicated by the white outline surrounding the set of ROIs that surpass this z-threshold.

Figure 5.

Figure 5.

Comparison of tonotopy and attn-tono maps. A, Concordance maps are rendered in heatscale on the inflated hemispheres to illustrate the similarity in best frequency between tonotopic and attn-tono maps (the latter averaged over stepped and randomized blocks). These maps were calculated in two stages. First, in each subject's native EPI space, a voxel was coded as 1 if tonotopy and attn-tono stimuli evoked the same best frequency, and otherwise coded as 0. Second, for each subject, the concordance maps were resampled to the individual's cortical surface and projected onto the unit icosahedron for cross-subject surface-based averaging, thereby creating a composite measure of agreement between tonotopy and attn-tono maps, weighted by the consistency of this agreement across subjects. The concordance maps are statistically masked with a cross-subject t map, calculated versus chance agreement (p = 0.20) with a surface cluster correction of p < 0.001 (vertexwise p < 0.01, cluster threshold surface area >74 mm2) (Hagler et al., 2006). To demonstrate the extent of tonotopically mapped cortex that is similarly mapped through spectrally directed attention, the phase-encoded tonotopy cortical patches from Fig. 2A are overlaid with the outline of the thresholded concordance map shown by the yellow dotted line. White solid outline indicates the Bonferroni-corrected ROI-wise correspondence outline from the ROI quilt in B. B, Shading in each small ROI patch represents the z score for the partial fit between tonotopy and attn-tono responses to each frequency band (with subjects as a random factor). Thin white outline indicates ROIs with significant z scores (Bonferroni-corrected p value threshold of p < 0.05).

Results

Fourier based analyses

Stimulus-driven and attentionally driven tonotopic organization in human auditory core

As a necessary first step, we characterized basic tonotopic (stimulus-driven) organization in and immediately around myelin-estimated auditory core. The group-average R1-based estimates of myelination (inflated hemispheres, Fig. 2, leftmost panel) show that the highest R1 values occur within primary somatomotor areas along the central sulcus, and in the typically keyhole-shaped presumptive “auditory core” lying along and immediately surrounding Heschl's gyrus. It is important to note that myelination varies within auditory core and that the lateral and medial borders are less sharply demarcated (for review, see Hackett et al., 2001; Dick et al., 2012). To show this variation, we plot isointensity R1 contours in the cortical flat patches in Figure 2A–C (with the curvature-based boundaries of Heschl's gyrus overlaid in dotted lines). To help identify the discontinuities in R1 that would correspond to the putative borders of auditory core, we calculated the R1 gradient along the surface (Glasser et al., 2016). Lines drawn along the peak gradient amplitude (data not shown) corresponded well with the outermost R1 isointensity contour in Figure 2 (0.66 s−1). It is important to note that the gradient at the lateral edge of presumptive core is quite shallow; and as in postmortem myelin stains, it is therefore more difficult to establish an unambiguous lateral border, as could be surmised from the greater lateral spread of the isointensity contours. The shape and size (∼1.2 cm × 2.4 cm) of presumptive auditory core in this sample also agreed with the results from Dick et al. (2012) at the same R1 threshold (with the latter average core slightly narrower; data not shown here).

Figure 2.

Figure 2.

Group activation for tonotopy and attn-tono conditions, with R1 contours showing putative auditory core. Leftmost panel, Cortical surface-based group-averaged R1, projected on the lateral inflated surface of one subject. The left hemisphere is mirror-reversed to align cortical maps for visual comparison. For tonotopic map display, a patch of cortex, including the entire temporal plane (shown in purple on the inflated surface), was cut and flattened. A–C, Enlarged region, with isocontour lines showing quantitative R1 values for the group-averaged putative auditory core and color maps showing group-averaged best frequency as a function of (A) tonotopy, (B) attn-tono (stepped), and (C) attn-tono (randomized control) conditions. Stars represent fiduciary points to assist in visual comparisons of maps across conditions. Yellow dashed lines indicate the outline of Heschl's gyrus (in A, from the individual subject whose cortical patch was used). Consistent with previous work, the tonotopic map is characterized by two pairs of three interlacing best-frequency fingers, with the high-frequency fingers (red/orange colormap) showing greatest frequency preference medially and extending laterally, where they meet interdigitated lower-frequency fingers (green/yellow colormap) extending lateral to medial, with the longest middle lower-frequency finger extending approximately halfway into auditory core. This pattern is evident in Fourier analysis-derived maps of the stepped attn-tono condition but not in the randomized control attn-tono condition, for which the attentional response was phase-cancelled. All maps are statistically masked by overall activation to tonotopy stimuli in each hemisphere (cluster-corrected p < 10−8, and gently shaded to show relative amplitude).

The group-averaged topography of preferred frequency around auditory core has a typical arrangement (Dick et al., 2012; De Martino et al., 2015b), with the core surrounded by a high-frequency “V.” Preferred frequency descends into the center of core (where R1 values are highest) before reversing and slowly ascending to mid-frequency preferred frequencies anterolaterally (and to some extent posterolaterally). Figure 3 shows tonotopic maps for each individual listener. In general, the relationship between auditory core and tonotopy group is conserved across listeners, but with some variability in the shape and extent of the isointensity R1 contours. In particular, S2, S6 (right hemisphere), S7, and S8 (right hemisphere) had irregularly shaped and “blotchy” isointensity contours. Although there is a fair degree of individual variability in results from human postmortem cytoarchitectonic and myeloarchitectonic studies of auditory core and surrounding areas (Hackett et al., 2001; Sweet et al., 2005), this was somewhat greater than expected variation given other work in our laboratory (Dick et al., 2012; Lutti et al., 2014; Carey et al., 2017). As an independent estimate of primary auditory areas, we also morphed the Morosan et al. (2001) 3D probabilistic map of primary auditory areas (TE1.0) using previously established methods (see Materials and Methods); the outlines of the morphed labels corresponding to p > 0.30 of being within TE1.0 are overlaid in black dotted lines in Figure 3.

Figure 3.

Figure 3.

Individual subjects' tonotopy and attn-tono maps. Each subject's tonotopic and attn-tono (stepped) Fourier analysis-derived maps are displayed on the same subject's flattened superior temporal cortical patch. White represents R1 isocontours around presumptive auditory core. Thick solid lines indicate the lowest valued (outermost) R1 isocontour. Thin solid lines indicate the highest (innermost) R1 isocontour. Dashed lines indicate intermediate values. R1 values differ somewhat across individuals. Dashed black lines indicate the outline of the cortical surface-morphed TE1.0 label, where the area inside the line contains vertices estimated to have a p > 0.3 probability of falling within primary auditory cortex based on the Morosan et al. (2001) postmortem probability atlas (see Materials and Methods). Activation maps are gently shaded to show changes in response amplitude but are unthresholded for comparison with individual maps from previous studies (e.g., Da Costa et al., 2011).

We then asked whether attn-tono mapping resembled the tonotopic case in and around auditory core. Here, the group-level spatial distribution of tonotopy is closely recapitulated when spectrally directed attention (stepped attn-tono condition) alone modulates activation (Fig. 2B). This holds true in and around the keyhole-shaped hyperintensity defining core, with a slight exception in the transition from higher to lower frequency preference in mid core. In contrast (and as expected), the group-level attn-tono response for the randomized control condition is much weaker (Fig. 2C), with almost no correspondence with the tonotopic map, despite being acoustically identical to stepped attn-tono but for the shuffling of the verbal prompt ordering that destroyed the consistent phase lag associated with specific frequency bands. The one potential exception is in and around posterolateral core, where there is a low-to-mid frequency progression that is similar in attn-tono and tonotopic maps, particularly in the left hemisphere. This may be due to the small (∼1 octave) overall shift in spectral mean over the course of a stimulus cycle noted in Materials and Methods.

Stimulus-driven and attentionally driven tonotopic organization outside of auditory core

In line with results from previous fMRI studies (Talavage et al., 2004; Woods et al., 2009; Humphries et al., 2010; Barton et al., 2012; Dick et al., 2012; Moerel et al., 2012; Saenz and Langers, 2014; De Martino et al., 2015b; Thomas et al., 2015; Ahveninen et al., 2016; Leaver and Rauschecker, 2016; Riecke et al., 2017), there is stimulus-driven tonotopic mapping extending well beyond auditory core, spanning the temporal plane and continuing into the superior temporal sulcus. As shown in Figure 2A, the overall arrangement is characterized by two pairs of three interlacing best-frequency “fingers,” with the high-frequency fingers (red/orange colormap) predominating medially and extending laterally, where they meet interdigitated lower-frequency fingers (green/yellow colormap) extending lateral to medial, with the longest “middle” lower-frequency finger extending approximately halfway into auditory core. Similar to tonotopy within auditory core, the overall pattern of group activation can be observed in the majority of individual subjects (Fig. 3), but there is also considerable individual variability in the complexity, topography, and extent of tonotopic and attn-tono mapping, similar to that observed in the fMRI studies cited above (as well as electrophysiological studies in a number of studies in macaque and owl monkey) (e.g., Merzenich and Brugge, 1973; Morel et al., 1993).

As can be seen in the maps in Figure 2B, the tonotopically aligned maps evoked by spectrally directed attention are also present in the majority of auditory cortex outside of auditory core. Again, the structure of the tonotopic map (as revealed by Fourier analysis) is abolished when the attentional cue is randomized, thereby eliminating any consistent relationship between attended frequency band and phase lag (Fig. 2C).

The similarity between the maps evoked by presentation of a single-frequency band (tonotopy) versus attention to one of two simultaneously presented frequency bands (stepped attn-tono) can also be seen in each individual subject (Fig. 3). As with the group-averaged data, there is a close correspondence in the progression of preferred frequencies across auditory cortex in individual subjects. The similarity between the tonotopic and attn-tono maps is particularly striking in Subjects 1, 2, 5, 6, and 7. The tonotopic organization of individual subjects demonstrated overall commonalities, but with notable differences, even between individual subjects' right and left hemispheres, particularly outside of auditory core (Humphries et al., 2010; Moerel et al., 2014; Saenz and Langers, 2014). However, individual peculiarities were replicated across tonotopic and attn-tono conditions. In some subjects, there was a surprising lack of strong tonotopic mapping (Subject 3 for which poor tonotopy may be due to greater EPI warping, and also Subject 4 for which low frequencies dominated the tonotopic maps). In summary, there was a strong correspondence between tonotopic and attn-tono maps at both the group and individual levels.

Multiple regression analyses

Winner-takes-all (WTA): maps of “stepped” versus “randomized” attention conditions, and quantitative concordance of tonotopic and attn-tono maps

In a complementary analysis, we used standard multiple regression techniques (see Materials and Methods) to estimate the BOLD response to each center frequency band when it was presented in isolation (tonotopy), versus when it was attended in the presence of a distractor band (attn-tono). This allowed us to make use of the attentionally driven signal in the randomized attn-tono condition and to combine these data with the results from the stepped attn-tono condition to increase statistical power. It also allowed us to verify that the attention effects generalize when listeners direct attention without the “crutch” of consistent stepping up or down across attended frequency bands.

The auditory cortical patches in Figure 4 show the cross-subject average WTA best frequency band (most positive-going BOLD response relative to resting baseline) maps for tonotopy and attn-tono conditions (with no shading for response amplitude). These are overlaid with the outermost R1 isocontour (dashed yellow) corresponding to auditory core. As should be expected, the topography of the WTA maps essentially recapitulates the topography revealed by the phase-encoded analyses. The same holds true of the attn-tono WTA maps from both the stepped and, importantly, the randomized block conditions (Fig. 4); this result confirms that, even without the crutch of the stepping frequency band, listeners can direct their attention to specific frequency bands.

Figure 4.

Figure 4.

Comparison of responses in regression-based WTA maps, tonotopy, and attn-tono. Color maps projected onto the right (top panels) and left (bottom panels) hemisphere cortical patches (same as patches shown in Fig. 2, purple) show the cross-subject average best frequency band (WTA) for stepped tonotopic (left) stepped attn-tono (middle), and randomized attn-tono (right) conditions. With the regression-based approach, the randomized condition is also expected to evoke strong attentionally driven tonotopic maps. Dotted yellow line indicates the outermost R1 contour (0.66 s−1) around presumptive auditory core as shown in Figure 2.

The WTA approach also allowed us to straightforwardly quantify the within-subject correspondence between voxelwise best frequency, as estimated by tonotopy and by attn-tono. Here, we coded each voxel in native space as a 1 when best frequency was identical in both conditions, and a 0 otherwise. We then resampled each subject's binary maps to their cortical surface, and then averaged across subjects to create a concordance map (Fig. 5A). These maps (statistically thresholded at vertexwise p < 0.01, with surface-cluster-corrected α of p < 0.001) show that, across subjects, there was high concordance across best frequency maps evoked by stimulus and by attention across much of the temporal plane in both hemispheres, with little concordance in nonauditory areas. The extent of attentionally driven tonotopic mapping relative to overall tonotopicity is shown in the cortical patches below each concordance map in Figure 5A. Here, the outer contour of the significant (p < 0.001 clusterwise corrected) concordance map is overlaid on the phase-averaged group tonotopy map (same as Fig. 2A). Averaging over all subjects, the majority of consistently tonotopically mapped cortex medial to the crown of the superior temporal gyrus (STG) shows preferred-frequency-aligned attn-tono maps, as does a small posterior cluster. In the left hemisphere, almost all consistently mapped tonotopic cortex also shows aligned attn-tono maps. However, it is important to note that there are considerable individual differences in regional best-frequency alignment across the tonotopy and attn-tono maps (as can be seen in Fig. 3).

Comparison of response profiles to all frequency bands across tonotopy and attn-tono

As has been shown previously (e.g., Moerel et al., 2013), hemodynamic responses to frequency in auditory cortex are not necessarily bandpass but can be more complex and multipeaked. Therefore, we also examined whether attention to a given frequency band in the presence of a distractor band recapitulates the more graded response to nonpreferred frequencies observed when that frequency band is presented in isolation. To do this, we created and surface-morphed a set of small cortical ROIs to each subject (Fig. 5B; see Materials and Methods) and quantified the similarity between the tonotopy and attn-tono response profiles in each ROI in each hemisphere by regressing the mean tonotopic parameter estimate for each frequency band against the attn-tono parameter estimate (with subjects as a random factor). We used this ROI quilt analysis (as opposed to a vertexwise one) to capture regional variation in cross-condition response profile similarity across subjects, which might be obscured by individual differences in tonotopic map topography and surface-based registration errors. This also reduced the number of statistical comparisons that must be corrected for, thus increasing power to detect effects.

The ROI analyses (Fig. 5) further support the results from the concordance maps from the WTA analyses (Fig. 4). The ROI analyses (Fig. 5B) show that individual subjects' tonotopy and attn-tono responses profiles are significantly associated across most of auditory cortex (all ROIs within the white border), with the exception of the most lateral aspects of the STG and upper bank of the superior temporal sulcus. Although there is a strong relationship between tonotopy and attn-tono response profiles of each subject within a given ROI, there is cross-subject variability in the particular shape of those response profiles, as suggested by the individual maps in Figure 3. There is a broad tendency for tonotopy/attn-tono profile similarity to be strongest posteromedially in both hemispheres, and no clear indication that profile similarity is higher in auditory core (indeed, this is not the case in the left hemisphere). As shown by the white line on the tonotopic flat maps in Figure 5A, the area showing significant response profile similarity extends over the majority of cortex showing strong tonotopic mapping with topologic similarity across subjects. The response profile similarity extends into less tonotopically consistent regions medially and posteriorly but does not include the more posterolateral tonotopically mapped regions along the crown of the STG.

Loser-takes-all (LTA): maps of dis-preferred frequency

Given the graded nature of frequency response preferences we observed, we suspected that there would be a large-scale topography associated with the minimum BOLD response across frequency, and that this topography would also be recapitulated by attention. Thus, we also performed a parallel LTA analysis, in which we coded voxels by the frequency band driving the minimum BOLD response (again relative to resting baseline) and analyzed as above. The average descriptive LTA maps show approximately opposite frequency responses compared with the WTA tonotopic maps, with higher-frequency band-preferring regions in the tonotopic map being least driven by lower-frequency bands, and vice versa (Fig. 6A). There is also some overlap in the mid-frequency-preferring regions, likely due to blurring of values when averaging subjects' integer-based maps. There is also quite close correspondence between the frequency band evoking the least response in the tonotopy (stimulus) condition and the smallest BOLD response evoked by attending to a given frequency band. The LTA concordance maps (Fig. 6B, statistical thresholding as in Fig. 5A) show that, in the right hemisphere, the alignment of tonotopic and attn-tono maps is greatest in more lateral and anterior auditory cortex, with qualitatively somewhat greater concordance more medially in the left hemisphere. The hemispheric difference and also the apparent qualitative contrast with the WTA concordance maps seen in Figures 4 and 6 are exaggerated by the clusterwise statistical thresholding combined with the overall slightly lower concordance in the LTA maps.

Figure 6.

Figure 6.

Comparison of responses in regression-based LTA maps, tonotopy, and attn-tono. A, The colormaps projected onto the same cortical patches as in Figures 2 and 4 show cross-subject group-average maps that depict the frequency band that drives the least activation compared with all other frequency bands (LTA) in tonotopy and attn-tono (stepped plus randomized blocks) conditions and in right and left hemispheres. As in Figure 4, the presumptive auditory core shown by the dashed yellow line depicting the outermost R1 contour (0.66 s−1). B, The tonotopy versus attn-tono LTA concordance map was created as in Figure 5A. The midpoint of the heatscale has been lowered slightly compared with Figure 5A, reflecting the overall somewhat lower concordance in the LTA maps compared with WTA. Dotted yellow R1 isocontour is the same as in Figure 4.

Difference in activation across auditory areas when best frequency is attended versus ignored

We also assessed the strength and consistency of BOLD-related frequency band-selective attention across subjects, and how the effect of attention varied with preferred or dis-preferred frequency. We first used a subject's native-space WTA map to establish each voxel's best frequency. Then, we assigned each voxel the parameter estimate for the difference in activation between attending to its best frequency in the presence of a distractor versus attending to the distractor and ignoring its best frequency. In other words, the value at each voxel was the estimated difference in activation between attending to, versus ignoring, its best frequency in the presence of other frequency bands. We repeated this process to estimate the parallel attention effect for each voxel's “worst” frequency (using the corresponding LTA map). We then resampled each subject's native-space attention maps to her/his cortical surface to allow for surface-based cross-subject averaging and statistical testing (all again with a vertexwise p < 0.01 threshold and surface-cluster-corrected α of p < 0.001). Figure 7 (top row) shows that, across subjects, there was significantly greater activation across most of auditory cortex when best frequency was attended versus ignored. The widespread attention effect included all of R1-estimated auditory core (outlined in green), extending from the inferior circular sulcus laterally to the upper bank of the superior temporal sulcus, and anteroposteriorly from the temporal pole onto the planum temporale. By contrast, there were relatively few regions where attention to a voxel's least-preferred frequency band evoked greater activation than when the same frequency band was the distractor. Attending to a voxel's least-preferred frequency band only significantly increased activation along the posterior lateral STG in both hemispheres, extending more medially in the left and more anteriorly in the right (Fig. 7, middle row). A direct comparison between these maps (cross-subject t test on the difference of differences, Fig. 7, bottom row) showed that there were considerable regional differences in activation between attending to a voxel's preferred versus dis-preferred frequency band. In both hemispheres, there was greater activation across most of the anterior temporal plane when attention was directed to the preferred versus dis-preferred frequency; in the right hemisphere, this effect extended throughout the temporal plane, as well as including a patch in the posterior STG. There were no regions in which the converse effect was observed (greater attend > distract activation for dis-preferred versus preferred frequency band). This shows that the frequency-selective attention-related BOLD gain is strongly modulated by frequency preference and provides some evidence for models of multiplicative, and not additive, attentional gain (but see Discussion).

Figure 7.

Figure 7.

Comparison of maps when best frequency is attended versus ignored. The heatscale (t values, thresholded as in Fig. 5A) depicts the cross-subject cortical surface-based average difference in activation when the subject-specific best frequency band of each voxel was attended versus ignored. The dotted green R1 isocontour estimating auditory core is as in Figure 4.

Relationship of tonotopic and attn-tono map strength to MR-estimated myeloarchitecture

Typically, assays of cortical myelination are used to differentiate the most highly myelinated cortical regions (e.g., auditory core, MT/V5, or V1) from adjacent regions. This is true whether cortical myelination is assessed using ex vivo “gold standard” approaches, such as Gallyas staining, or estimated through in vivo MRI T1-weighted/T2-weighted ratio, quantitative R1, or magnetization transfer measures. However, more subtle myelination changes that occur throughout cortex may spatially correspond with changes in functional characteristics (Glasser et al., 2016; Wallace et al., 2016). For instance, recent combined fMRI and high-resolution quantitative MR show that slight reductions in cortical myelination in primary somatomotor cortex reliably occur at the border between face and hand areas (Kuehn et al., 2017).

Here, we asked whether the change in the degree to which cortex showed a strong frequency band preference (i.e., the amplitude of the phase-encoded tonotopic or attn-tono signal) spatially corresponded with changes in myelination as assessed by quantitative R1 (within a 4-mm-radius disk that roved across the entire cortical surface). The cross-subject-average normalized covariance map in Figure 8A shows that there is a shared local gradient in tonotopic amplitude and R1 along the entire inferior circular sulcus and the anterior part of the STG, where tonotopic amplitude and R1 drop in tandem over a narrow band of cortex. There is also negative local spatial covariance between tonotopic amplitude and R1 within the center of auditory cortex, where tonotopic amplitude increases but R1 remains relatively stable. There is also some tonotopic/R1 spatial covariance within and around the central sulcus; these regions showed considerably less overall amplitude in tonotopic response, but one that spatially covaries with changes in R1.

Figure 8.

Figure 8.

Local normalized covariance between R1 values and tonotopic and attn-tono response amplitude. The heatscale value at each vertex represents the normalized spatial covariance within a 4 mm (2D) radius between R1 and the amplitude of the tonotopic or attn-tono signal (e.g., the amplitude of the Fourier component at the stimulus frequency of 8 cycles/run). A, The cross-subject (N = 8) cortical surface-based average normalized covariance between R1 and tonotopic amplitude. B, The R1 versus tonotopy normalized covariance in an independent cohort (N = 6), using data previously acquired with a different tonotopy protocol (bandpass-filter-swept nonlinguistic vocalizations) and on a different scanner (Siemens 3T Trio); full protocol as described by Dick et al. (2012). C, The average normalized covariance between R1 and attn-tono amplitude in the current cohort.

To test the replicability of this novel tonotopy-versus-R1 searchlight cross-correlation, we reanalyzed R1 and tonotopy data from a previous study (Dick et al., 2012) that used a different tonotopic stimulus (bandpass-filter-swept nonlinguistic vocalizations) and a slightly different multiparameter mapping protocol. Despite these methodological differences, we found a very similar pattern of tonotopic/R1 positive local spatial covariance within the circular sulcus and along the lateral STG, with negative spatial covariance again in the center of auditory cortex (Fig. 8B). The shared and relatively steep anterolateral and medial gradients in putative myelination and degree of frequency specificity, observed in two independently acquired datasets, suggest a shared functional and myeloarchitectonic border, possibly similar in character to those reported recently relating resting state, standard task activation, and T1-weighted/T2-weighted derived myelination estimates across cortex (Glasser et al., 2016; Kuehn et al., 2017).

As seen in Figure 8C, the spatial relationship between local R1 and attn-tono amplitude changes is much less clear. Here, there is a weak relationship within and around auditory cortex that is only observed within the circular sulcus (particularly in the right hemisphere). There are also stripes of spatial covariation along the banks of the central sulcus, although not closely aligned with the pattern observed with the tonotopy versus R1 covariance maps. Although very preliminary, these results suggest that changes in the degree of spectral attentional modulation in auditory cortex are not strongly linked to the underlying myeloarchitecture, and stand in contrast to the consistent spatial association in lateral and medial auditory cortex between local changes in R1 and the strength of stimulus-driven frequency response preference.

In summary, everyday listening ordinarily takes place in rich soundscapes within multiple, simultaneous sound sources contributing to the overlapping mix of sound waves that arrives at the ears. Auditory attention is crucial to sorting out the mix. Listeners direct attentional focus to a sound source, or even to specific acoustic dimensions within a single sound source, to zero in on auditory information that is diagnostic in guiding behavior.

We asked how endogenous attention directed to specific acoustic frequency bands modulates human auditory cortical activity. Using high-resolution quantitative MRI and a novel fMRI paradigm for driving sustained selective attention within specific frequency bands, we established effects of spectrally specific attention in myeloarchitectonically estimated human auditory core. These effects extend across the majority of tonotopically mapped auditory cortex and are apparent in individual listeners. Sensory-driven best-frequency tonotopic maps align with attentionally driven maps across much of the temporal plane, with poor concordance in nonauditory areas. Individual tonotopic and attn-tono maps show correlated idiosyncracies. The frequency bands that evoke the least BOLD response from input and from attention also exhibit close spatial correspondence. There is greater activation across most of auditory cortex when best frequency is attended, versus ignored. Finally, there is local spatial correspondence in multiple auditory regions between the degree of R1-estimated myelination and the strength of the frequency band-selective fMRI response for tonotopic stimuli.

Discussion

Human auditory core exhibits attentionally driven tonotopic organization

Previous findings showed similar stimulus-driven and attentionally driven frequency preference in and around Heschl's gyrus, a macroanatomical landmark associated with primary auditory areas (Da Costa et al., 2013; Riecke et al., 2017). Here, we demonstrate that, within quantitative-R1-defined primary auditory areas, the attentionally driven maps in each hemisphere are very similar to the detailed tonotopic maps in the same subjects. As shown by comparison maps across the acoustically identical stepped and randomized attn-tono conditions (Figs. 2, 4), the alignment between tonotopic and attention maps depends on allocation of attention to the cued frequency band, not perceptual interference or other stimulus-driven effects. The fact that there is considerable, high-level attentional modulation within primary auditory areas is interesting given previous results suggesting more limited attentional topographic modulation in primary auditory (Atiani et al., 2014) and visual (Saygin and Sereno, 2008) cortex, compared with more robust attentional modulation in areas immediately adjacent to primary ones.

Attentionally driven tonotopic organization extends across much of auditory cortex

We also find strong evidence for tonotopically mapped spectrally directed attention in much of auditory cortex, particularly along the lateral STG (potentially analogous to lateral auditory belt and parabelt regions in macaque) (Hackett, 2007). In addition to the concordance in and around auditory core, the most consistent group-level alignment of these maps lies lateral to auditory core, with each map characterized by three higher-to-lower best-frequency band traversals, moving from posterior to anterior approximately along the STG.

This pattern suggests a cross-species parallel to results reported in ferret (Atiani et al., 2014), where task-evoked attentional modulation of frequency-tuned neurons is particularly strong in nonprimary (dPEG) tonotopically mapped auditory areas. In this regard, the stimulus complexity, variability, and memory demands of the current task may have helped to drive attentional response in these more lateral and anterior areas. Our results are consistent with a human fMRI comparison of cross-modal attentional effects (Petkov et al., 2004), which showed greater activation in lateral auditory regions when attention was directed to a demanding auditory repetition detection task than when the same sounds were played as subjects performed a demanding visual detection task. However, our results differ from these studies to some degree in that attentionally driven tonotopic modulation in auditory core was also robust (similar to cross-modal attention studies in macaque A1) (O'Connell et al., 2014) and primary auditory areas (De Martino et al., 2015a), and did not differ significantly from that in lateral belt.

There was good correspondence between the voxelwise best frequency band for tonotopy and attn-tono in individual listeners. Like several prior studies (Humphries et al., 2010; Moerel et al., 2014; Saenz and Langers, 2014; Brewer and Barton, 2016; Leaver and Rauschecker, 2016), we observed quite substantial variation in the detailed topography of tonotopy across individuals (but compare Ahveninen et al., 2016). It is especially noteworthy that attn-tono recapitulated these topographic idiosyncrasies (as observed in the concordance analyses, Figs. 4B, 5B).

It is intriguing that there was a systematic frequency band-associated topography not only of best frequency but also of dis-preferred frequency and, also, that the frequency-selective attenuation of BOLD gain (relative to other frequencies) can be recapitulated by selective attention to that frequency band in the presence of other spectral information. One could speculate that this map structure might be a population-level reflection of an “inhibitory surround” structure observed in some electrophysiology studies (Calford and Semple, 1995; Sutter et al., 1999; but compare Wehr and Zador, 2003), with the frequency band driving the least BOLD response corresponding to the deepest trough in an asymmetric surround, an effect that could drive the very similar tonotopic and attn-tono graded frequency response preferences revealed in the multiple ROI analysis (Fig. 5B).

Here, the average frequency response profile evoked by the single-band tonotopic stimuli was recapitulated by attention to the same frequency bands in the context of distractors. Prior human neuroimaging research has been consistent with the possibility that the shape of the frequency response in and around Heschl's gyrus is attentionally modulated in a bandpass manner that relies on amplification rather than attenuation (Riecke et al., 2017). Based on results from a larger number of spectral bands, the current findings suggest that, at least at a more macroscopic scale, spectrally directed attention modulates cortical activity in a more graded fashion, with the shape of the attentional response to both preferred and less-preferred frequency bands similar to that evoked by stimulus alone, a contention supported by the alignment of the LTA tonotopic and attn-tono maps (Fig. 6). That is, the frequency band that drives the smallest fMRI response when presented alone is also the frequency band that elicits the least activation when attended in the presence of a distractor. A better understanding of the mechanisms underlying these maps will require more fine-grained characterization of frequency-directed attentional modulation, preferably at very high spectral and temporal resolution (Moerel et al., 2013, 2014; Lutti et al., 2014; Ahveninen et al., 2016) that might also help to unveil cortical-depth-specific attentional effects (De Martino et al., 2015a). In particular, it will be important to see whether different fMRI tasks, using more complex naturalistic sounds, or more or less abstract cues to frequency, can mimic the task-, valence-, and context-dependent effects observed in nonhuman animal cortical auditory receptive fields, where the character of the “contrast-enhancing” modulations differs markedly with experimental manipulation (Fritz et al., 2005, 2007a, b; David et al., 2012; Atiani et al., 2014; Kuchibhotla et al., 2017). It is worth noting that task-related modulation of frequency-selective attentional effects has long been of interest in human auditory psychophysics (Greenberg and Larkin, 1968; Scharf et al., 1987; Scharf, 1989; Moore et al., 1996; Green and McKeown, 2001).

There is correspondence between local change in R1-estimated myelination and the strength of fMRI-assessed relative frequency selectivity

We found that the change in the degree to which a small (4 mm radius) patch of cortex shows strong frequency preferences in tonotopy was positively spatially correlated with its degree of myelination as estimated by R1. The strength of the correlation was anatomically specific, marking the medial border of auditory cortex (within the circular sulcus) and revealing a potential anatomical index of “processing style” (from more to less tonotopically mapped) along anterolateral STG. We found this pattern to hold true in the data from the current study as well as in an independent cohort scanned with quite different tonotopic stimuli and with multiparameter maps acquired on a different scanner model, with different sequence settings (Fig. 8C). Although there was a relatively reliable pattern of R1-tonotopy correspondence at a group level, there was some notable individual variation in local shared R1/tonotopy gradients relative to gyral anatomy. Thus, these patterns may be more useful than curvature for establishing areal borders on an individual subject basis, particularly when there is no obvious sharp change in a single measure (for discussion, see also Glasser et al., 2016). Such work holds promise for generating novel hypotheses for more intensively characterized species (e.g., mouse, ferret, or marmoset), particularly in tandem with imaging techniques that that can cover multiple cortical areas simultaneously.

Future directions

In the current study, we limited our investigation to broadly defined auditory cortex, where there was good evidence for systematic tonotopic representation from a number of previous studies (Talavage and Edmister, 2004; Hackett, 2007; Moerel et al., 2013, 2014; Saenz and Langers, 2014; Leaver and Rauschecker, 2016). In future research, it will be informative to examine interactions with several frontal regions whose potential analogs are known to have direct feedforward and feedback connections in macaque monkeys (Romanski and Goldman-Rakic, 2002), and where in ferret there are clear modulatory influences on primary and nonprimary auditory cortex during learning (Atiani et al., 2014; Shamma and Fritz, 2014). Similar to recent work in vision (Klein et al., 2014; Puckett and DeYoe, 2015), it will also be useful to establish the shape of the attentional population receptive field, and how this varies across auditory areas and relates to stimulus-driven auditory population receptive field size (Thomas et al., 2015). Finally, following on from our own pilot work, it will be exciting to explore whether higher-level auditory regionalization may follow along some of the “fault lines” revealed by shared local tonotopic and myelin gradients, and whether or not more sophisticated and fine-grained spectral attentional manipulations may reveal a relationship between the degree of attentional malleability and underlying cortical architecture and circuitry.

Footnotes

This work was supported by the Rothberg Research Award in Human Brain Imaging of Carnegie Mellon University. M.I.L. was supported by National Institutes of Health T90DA022761 and T32GM081760. The Wellcome Centre for Human Neuroimaging is supported by core funding from Wellcome 203147/Z/16/Z. We thank Scott Kurdilla and Debbie Viszlay (Scientific Imaging and Brain Research Center, Carnegie Mellon University) for imaging support; Antoine Lutti and Nikolaus Weiskopf for physics expertise and generosity with porting the multiparameter mapping protocol to the SIBR Verio; the developers of FSL, AFNI, and FreeSurfer for scientific work and software; and Marlene Behrmann, Jenny Bizley, Maria Chait, Tim Griffiths, Jen Linden, Catherine Perrodin, Chris Petkov, Lars Riecke, Sam Schwarzkopf, Jeremy Skipper, Ediz Sohoglu, Adam Tierney, and three anonymous reviewers of a previous version of the manuscript for extremely useful suggestions and feedback.

The authors declare no competing financial interests.

References

  1. Ahveninen J, Chang WT, Huang S, Keil B, Kopco N, Rossi S, Bonmassar G, Witzel T, Polimeni JR (2016) Intracortical depth analyses of frequency-sensitive regions of human auditory cortex using 7TfMRI. Neuroimage 143:116–127. 10.1016/j.neuroimage.2016.09.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Atiani S, David SV, Elgueda D, Locastro M, Radtke-Schuller S, Shamma SA, Fritz JB (2014) Emergent selectivity for task-relevant stimuli in higher-order auditory cortex. Neuron 82:486–499. 10.1016/j.neuron.2014.02.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barascud N, Pearce MT, Griffiths TD, Friston KJ, Chait M (2016) Brain responses in humans reveal ideal observer-like sensitivity to complex acoustic patterns. Proc Natl Acad Sci U S A 113:E616–E625. 10.1073/pnas.1508523113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Barton B, Venezia JH, Saberi K, Hickok G, Brewer AA (2012) Orthogonal acoustic dimensions define auditory field maps in human cortex. Proc Natl Acad Sci U S A 109:20738–20743. 10.1073/pnas.1213381109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Brewer AA, Barton B (2016) Maps of the auditory cortex. Annu Rev Neurosci 39:385–407. 10.1146/annurev-neuro-070815-014045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Calford MB, Semple MN (1995) Monaural inhibition in cat auditory cortex. J Neurophysiol 73:1876–1891. [DOI] [PubMed] [Google Scholar]
  7. Callaghan MF, Helms G, Lutti A, Mohammadi S, Weiskopf N (2015) A general linear relaxometry model of R1 using imaging data. Magn Reson Med 73:1309–1314. 10.1002/mrm.25210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Carey D, Caprini F, Allen M, Lutti A, Weiskopf N, Rees G, Callaghan MF, Dick F (2017) Quantitative MRI provides markers of intra-, inter-regional, and age-related differences in young adult cortical microstructure. bioRxiv 139568. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Cox RW. (2012) AFNI: what a long strange trip it's been. Neuroimage 62:743–747. 10.1016/j.neuroimage.2011.08.056 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Da Costa S, van der Zwaag W, Marques JP, Frackowiak RS, Clarke S, Saenz M (2011) Human primary auditory cortex follows the shape of Heschl's gyrus. J Neurosci 31:14067–14075. 10.1523/JNEUROSCI.2000-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Da Costa S, van Der Zwaag W, Miller LM, Clarke S, Saenz M (2013) Tuning in to sound: frequency-selective attentional filter in human primary auditory cortex. J Neurosci 33:1858–1863. 10.1523/JNEUROSCI.4405-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Dale AM, Sereno MI (1993) Improved localization of cortical activity by combining EEG and MEG with MRI cortical surface reconstruction: a linear approach. J Cogn Neurosci 5:162–176. 10.1162/jocn.1993.5.2.162 [DOI] [PubMed] [Google Scholar]
  13. David SV, Fritz JB, Shamma SA (2012) Task reward structure shapes rapid receptive field plasticity in auditory cortex. Proc Natl Acad Sci U S A 109:2144–2149. 10.1073/pnas.1117717109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. De Martino F, Moerel M, Ugurbil K, Goebel R, Yacoub E, Formisano E (2015a) Frequency preference and attention effects across cortical depths in the human primary auditory cortex. Proc Natl Acad Sci U S A 112:16036–16041. 10.1073/pnas.1507552112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. De Martino F, Moerel M, Xu J, van de Moortele PF, Ugurbil K, Goebel R, Yacoub E, Formisano E (2015b) High-resolution mapping of myelo-architecture in vivo: localization of auditory areas in the human brain. Cerebral Cortex 25:3394–3405. 10.1093/cercor/bhu150 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Dick F, Tierney AT, Lutti A, Josephs O, Sereno MI, Weiskopf N (2012) In vivo functional and myelo-architectonic mapping of human primary auditory areas. J Neurosci 32:16095–16105. 10.1523/JNEUROSCI.1712-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dinse J, Härtwich N, Waehnert MD, Tardif CL, Schäfer A, Geyer S, Preim B, Turner R, Bazin PL (2015) A cytoarchitecture-driven myelin model reveals area-specific signatures in human primary and secondary areas using ultra-high resolution in-vivo brain MRI. Neuroimage 114:71–87. 10.1016/j.neuroimage.2015.04.023 [DOI] [PubMed] [Google Scholar]
  18. Engel SA. (2012) The development and use of phase-encoded functional MRI designs. Neuroimage 62:1195–1200. 10.1016/j.neuroimage.2011.09.059 [DOI] [PubMed] [Google Scholar]
  19. Engel SA, Rumelhart DE, Wandell BA, Lee AT, Glover GH, Chichilnisky EJ, Shadlen MN (1994) fMRI of human visual cortex. Nature 369:525. 10.1038/369525a0 [DOI] [PubMed] [Google Scholar]
  20. Feinberg DA, Setsompop K (2013) Ultra-fast MRI of the human brain with simultaneous multi-slice imaging. J Magn Reson 229:90–100. 10.1016/j.jmr.2013.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Feinberg DA, Moeller S, Smith SM, Auerbach E, Ramanna S, Gunther M,Glasser MF, Miller KL, Ugurbil K, Yacoub E (2010) Multiplexed echo planar imaging for sub-second whole brain fMRI and fast diffusion imaging. PLoS One 5:e15710. 10.1371/journal.pone.0015710 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Fischl B, Sereno MI, Tootell RB, Dale AM (1999) High-resolution intersubject averaging and a coordinate system for the cortical surface. Hum Brain Mapp 8:272–284. 10.1002/(SICI)1097-0193(1999)8:4%3C272::AID-HBM10%3E3.0.CO;2-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fritz JB, Elhilali M, Shamma SA (2005) Differential dynamic plasticity of A1 receptive fields during multiple spectral tasks. J Neurosci 25:7623–7635. 10.1523/JNEUROSCI.1318-05.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fritz JB, Elhilali M, David SV, Shamma SA (2007a) Does attention play a role in dynamic receptive field adaptation to changing acoustic salience in A1? Hear Res 229:186–203. 10.1016/j.heares.2007.01.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Fritz JB, Elhilali M, Shamma SA (2007b) Adaptive changes in cortical receptive fields induced by attention to complex sounds. J Neurophysiol 98:2337–2346. 10.1152/jn.00552.2007 [DOI] [PubMed] [Google Scholar]
  26. Fritz JB, David SV, Radtke-Schuller S, Yin P, Shamma SA (2010) Adaptive, behaviorally gated, persistent encoding of task-relevant auditory information in ferret frontal cortex. Nat Neurosci 13:1011–1019. 10.1038/nn.2598 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Glasser MF, Coalson TS, Robinson EC, Hacker CD, Harwell J, Yacoub E, Ugurbil K, Andersson J, Beckmann CF, Jenkinson M, Smith SM, Van Essen DC (2016) A multi-modal parcellation of human cerebral cortex. Nature 536:171–178. 10.1038/nature18933 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Green TJ, McKeown JD (2001) Capture of attention in selective frequency listening. J Exp Psychol Hum Percept Perform 27:1197–1210. 10.1037/0096-1523.27.5.1197 [DOI] [PubMed] [Google Scholar]
  29. Greenberg GZ, Larkin WD (1968) Frequency-response characteristic of auditory observers detecting signals of a single frequency in noise: the probe-signal method. J Acoust Soc Am 44:1513–1523. 10.1121/1.1911290 [DOI] [PubMed] [Google Scholar]
  30. Greve DN, Fischl B (2009) Accurate and robust brain image alignment using boundary-based registration. Neuroimage 48:63–72. 10.1016/j.neuroimage.2009.06.060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hackett TA. (2007) Organization and correspondence of the auditory cortex of humans and nonhuman primates. Evolution of the nervous system (Kaas JH, ed). Academic Press: San Diego, CA, pp 109–119. [Google Scholar]
  32. Hackett TA, Preuss TM, Kaas JH (2001) Architectonic identification of the core region in auditory cortex of macaques, chimpanzees, and humans. J Comp Neurol 441:197–222. 10.1002/cne.1407 [DOI] [PubMed] [Google Scholar]
  33. Hagler DJ Jr, Sereno MI (2006) Spatial maps in frontal and prefrontal cortex. Neuroimage 29:567–577. 10.1016/j.neuroimage.2005.08.058 [DOI] [PubMed] [Google Scholar]
  34. Hagler D Jr, Saygin AP, Sereno MI (2006) Smoothing and cluster thresholding for cortical surface-based group analysis of fMRI data. Neuroimage 33:1093–1103. 10.1016/j.neuroimage.2006.07.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Hagler DJ Jr, Riecke L, Sereno MI (2007) Parietal and superior frontal visuospatial maps activated by pointing and saccades. Neuroimage 35:1562–1577. 10.1016/j.neuroimage.2007.01.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Helms G, Dathe H, Dechent P (2008) Quantitative FLASH MRI at 3T using a rational approximation of the Ernst equation. Magn Reson Med 59:667–672. 10.1002/mrm.21542 [DOI] [PubMed] [Google Scholar]
  37. Helms G, Draganski B, Frackowiak R, Ashburner J, Weiskopf N (2009) Improved segmentation of deep brain grey matter structures using magnetization transfer (MT) parameter maps. Neuroimage 47:194–198. 10.1016/j.neuroimage.2009.03.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Herdener M, Esposito F, Scheffler K, Schneider P, Logothetis NK, Uludag K, Kayser C (2013) Spatial representations of temporal and spectral sound cues in human auditory cortex. Cortex 49:2822–2833. 10.1016/j.cortex.2013.04.003 [DOI] [PubMed] [Google Scholar]
  39. Herrmann B, Henry MJ, Obleser J (2013) Frequency-specific adaptation in human auditory cortex depends on the spectral variance in the acoustic stimulation. J Neurophysiol 109:2086–2096. 10.1152/jn.00907.2012 [DOI] [PubMed] [Google Scholar]
  40. Humphries C, Liebenthal E, Binder JR (2010) Tonotopic organization of human auditory cortex. Neuroimage 50:1202–1211. 10.1016/j.neuroimage.2010.01.046 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Idemaru K, Holt LL (2011) Word recognition reflects dimension-based statistical learning. J Exp Psychol Hum Percept Perform 37:1939–1956. 10.1037/a0025641 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Kastner S, Ungerleider LG (2000) Mechanisms of visual attention in the human cortex. Annu Rev Neurosci 23:315–341. 10.1146/annurev.neuro.23.1.315 [DOI] [PubMed] [Google Scholar]
  43. Klein BP, Harvey BM, Dumoulin SO (2014) Attraction of position preference by spatial attention throughout human visual cortex. Neuron 84:227–237. 10.1016/j.neuron.2014.08.047 [DOI] [PubMed] [Google Scholar]
  44. Kuchibhotla KV, Gill JV, Lindsay GW, Papadoyannis ES, Field RE, Sten TA, Miller KD, Froemke RC (2017) Parallel processing by cortical inhibition enables context-dependent behavior. Nat Neurosci 20:62–71. 10.1038/nn.4436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Kuehn E, Dinse J, Jakobsen E, Long X, Schäfer A, Bazin PL, Villringer A, Sereno MI, Margulies DS (2017) Body topography parcellates human sensory and motor cortex. Cereb Cortex 27:3790–3805. 10.1093/cercor/bhx026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Langers DR, van Dijk P (2012) Mapping the tonotopic organization in human auditory cortex with minimally salient acoustic stimulation. Cereb Cortex 22:2024–2038. 10.1093/cercor/bhr282 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Langers DR, Krumbholz K, Bowtell RW, Hall DA (2014) Neuroimaging paradigms for tonotopic mapping: I. The influence of sound stimulus type. Neuroimage 100:650–662. 10.1016/j.neuroimage.2014.07.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Leaver AM, Rauschecker JP (2016) Functional topography of human auditory cortex. J Neurosci 36:1416–1428. 10.1523/JNEUROSCI.0226-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Lutti A, Hutton C, Finsterbusch J, Helms G, Weiskopf N (2010) Optimization and validation of methods for mapping of the radiofrequency transmit field at 3T. Magn Reson Med 64:229–238. 10.1002/mrm.22421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Lutti A, Stadler J, Josephs O, Windischberger C, Speck O, Bernarding J, Hutton C, Weiskopf N (2012) Robust and fast whole brain mapping of the RF transmit field B1 at 7T. PLoS One 7:e32379. 10.1371/journal.pone.0032379 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Lutti A, Dick F, Sereno MI, Weiskopf N (2014) Using high-resolution quantitative mapping of R1 as an index of cortical myelination. Neuroimage 93:176–188. 10.1016/j.neuroimage.2013.06.005 [DOI] [PubMed] [Google Scholar]
  52. Merzenich MM, Brugge JF (1973) Representation of the cochlear partition of the superior temporal plane of the macaque monkey. Brain Res 50:275–296. 10.1016/0006-8993(73)90731-2 [DOI] [PubMed] [Google Scholar]
  53. Moerel M, De Martino F, Formisano E (2012) Processing of natural sounds in human auditory cortex: tonotopy, spectral tuning, and relation to voice sensitivity. J Neurosci 32:14205–14216. 10.1523/JNEUROSCI.1388-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Moerel M, De Martino F, Santoro R, Ugurbil K, Goebel R, Yacoub E, Formisano E (2013) Processing of natural sounds: characterization of multipeak spectral tuning in human auditory cortex. J Neurosci 33:11888–11898. 10.1523/JNEUROSCI.5306-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Moerel M, De Martino F, Formisano E (2014) An anatomical and functional topography of human auditory cortical areas. Front Neurosci 8:225. 10.3389/fnins.2014.00225 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Moore BC, Hafter ER, Glasberg BR (1996) The probe-signal method and auditory-filter shape: results from normal- and hearing-impaired subjects. J Acoust Soc Am 99:542–552. 10.1121/1.414512 [DOI] [PubMed] [Google Scholar]
  57. Morel A, Garraghty PE, Kaas JH (1993) Tonotopic organization, architectonic fields, and connections of auditory cortex in macaque monkeys. J Comp Neurol 335:437–459. 10.1002/cne.903350312 [DOI] [PubMed] [Google Scholar]
  58. Morosan P, Rademacher J, Schleicher A, Amunts K, Schormann T, Zilles K (2001) Human primary auditory cortex: cytoarchitectonic subdivisions and mapping into a spatial reference system. Neuroimage 13:684–701. 10.1006/nimg.2000.0715 [DOI] [PubMed] [Google Scholar]
  59. O'Connell MN, Barczak A, Schroeder CE, Lakatos P (2014) Layer specific sharpening of frequency tuning by selective attention in primary auditory cortex. J Neurosci 34:16496–16508. 10.1523/JNEUROSCI.2055-14.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Oh J, Kwon JH, Yang PS, Jeong J (2013) Auditory imagery modulates frequency-specific areas in the human auditory cortex. J Cogn Neurosci 25:175–187. 10.1162/jocn_a_00280 [DOI] [PubMed] [Google Scholar]
  61. Paltoglou AE, Sumner CJ, Hall DA (2009) Examining the role of frequency specificity in the enhancement and suppression of human cortical activity by auditory selective attention. Hear Res 257:106–118. 10.1016/j.heares.2009.08.007 [DOI] [PubMed] [Google Scholar]
  62. Petkov CI, Kang X, Alho K, Bertrand O, Yund EW, Woods DL (2004) Attentional modulation of human auditory cortex. Nat Neurosci 7:658–663. 10.1038/nn1256 [DOI] [PubMed] [Google Scholar]
  63. Pierpaoli C. (2010) Quantitative brain MRI. Top Magn Reson Imaging 21:63. 10.1097/RMR.0b013e31821e56f8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Preibisch C, Deichmann R (2009) Influence of RF spoiling on the stability and accuracy of T1 mapping based on spoiled FLASH with varying flip angles. Magn Reson Med 61:125–135. 10.1002/mrm.21776 [DOI] [PubMed] [Google Scholar]
  65. Puckett AM, DeYoe EA (2015) The attentional field revealed by single-voxel modeling of fMRI time courses. J Neurosci 35:5030–5042. 10.1523/JNEUROSCI.3754-14.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Rao AA, Talavage TM (2005) Reliability of phase-encode mapping in the presence of spatial non-stationarity of response latency. Neuroimage 28:563–578. 10.1016/j.neuroimage.2005.06.020 [DOI] [PubMed] [Google Scholar]
  67. Riecke L, Peters JC, Valente G, Kemper VG, Formisano E, Sorger B (2017) Frequency-selective attention in auditory scenes recruits frequency representations throughout human superior temporal cortex. Cereb Cortex 27:3002–3014. 10.1093/cercor/bhw160 [DOI] [PubMed] [Google Scholar]
  68. Romanski LM, Goldman-Rakic PS (2002) An auditory domain in primate prefrontal cortex. Nat Neurosci 5:15–16. 10.1038/nn781 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Saenz M, Langers DR (2014) Tonotopic mapping of human auditory cortex. Hear Res 307:42–52. 10.1016/j.heares.2013.07.016 [DOI] [PubMed] [Google Scholar]
  70. Saygin AP, Sereno MI (2008) Retinotopy and attention in human occipital, temporal, parietal, and frontal cortex. Cereb Cortex 18:2158–2168. 10.1093/cercor/bhm242 [DOI] [PubMed] [Google Scholar]
  71. Scharf B. (1989) Spectral specificity in auditory detection: The effect of listening on hearing. J Acoust Soc Jpn 10:309–317. [Google Scholar]
  72. Scharf B, Quigley S, Aoki C, Peachey N, Reeves A (1987) Focused auditory attention and frequency selectivity. Percept Psychophys 42:215–223. 10.3758/BF03203073 [DOI] [PubMed] [Google Scholar]
  73. Schönwiesner M, Zatorre RJ (2009) Spectro-temporal modulation transfer function of single voxels in the human auditory cortex measured with high-resolution fMRI. Proc Natl Acad Sci U S A 106:14611–14616. 10.1073/pnas.0907682106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Schwarzkopf DS, Song C, Rees G (2011) The surface area of human V1 predicts the subjective experience of object size. Nat Neurosci 14:28–30. 10.1038/nn.2706 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Sereno MI, Huang RS (2006) A human parietal face area contains aligned head-centered visual and tactile maps. Nat Neurosci 9:1337–1343. 10.1038/nn1777 [DOI] [PubMed] [Google Scholar]
  76. Sereno MI, Dale AM, Reppas JB, Kwong KK, Belliveau JW, Brady TJ, Rosen BR, Tootell RB (1995) Borders of multiple visual areas in humans revealed by functional magnetic resonance imaging. Science 268:889–893. 10.1126/science.7754376 [DOI] [PubMed] [Google Scholar]
  77. Sereno MI, Lutti A, Weiskopf N, Dick F (2013) Mapping the human cortical surface by combining quantitative T(1) with retinotopy. Cereb Cortex 23:2261–2268. 10.1093/cercor/bhs213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Shamma S, Fritz J (2014) Adaptive auditory computations. Curr Opin Neurobiol 25:164–168. 10.1016/j.conb.2014.01.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Smith SM, Jenkinson M, Woolrich MW, Beckmann CF, Behrens TE, Johansen-Berg H, Bannister PR, De Luca M, Drobnjak I, Flitney DE, Niazy RK, Saunders J, Vickers J, Zhang Y, De Stefano N, Brady JM, Matthews PM (2004) Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage 23:S208–S219. 10.1016/j.neuroimage.2004.07.051 [DOI] [PubMed] [Google Scholar]
  80. Stüber C, Morawski M, Schäfer A, Labadie C, Wähnert M, Leuze C, Streicher M, Barapatre N, Reimann K, Geyer S, Spemann D, Turner R (2014) Myelin and iron concentration in the human brain: a quantitative study of MRI contrast. Neuroimage 93:95–106. 10.1016/j.neuroimage.2014.02.026 [DOI] [PubMed] [Google Scholar]
  81. Sutter ML, Schreiner CE, McLean M, O'Connor KN, Loftus WC (1999) Organization of inhibitory frequency receptive fields in cat primary auditory cortex. J Neurophysiol 82:2358–2371. [DOI] [PubMed] [Google Scholar]
  82. Sweet RA, Dorph-Petersen KA, Lewis DA (2005) Mapping auditory core, lateral belt, and parabelt cortices in the human superior temporal gyrus. J Comp Neurol 491:270–289. 10.1002/cne.20702 [DOI] [PubMed] [Google Scholar]
  83. Talavage TM, Edmister WB (2004) Nonlinearity of FMRI responses in human auditory cortex. Hum Brain Mapp 22:216–228. 10.1002/hbm.20029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Talavage TM, Sereno MI, Melcher JR, Ledden PJ, Rosen BR, Dale AM (2004) Tonotopic organization in human auditory cortex revealed by progressions of frequency sensitivity. J Neurophysiol 91:1282–1296. [DOI] [PubMed] [Google Scholar]
  85. Tardif CL, Schäfer A, Waehnert M, Dinse J, Turner R, Bazin PL (2015) Multi-contrast multi-scale surface registration for improved alignment of cortical areas. Neuroimage 111:107–122. 10.1016/j.neuroimage.2015.02.005 [DOI] [PubMed] [Google Scholar]
  86. Tardif CL, Gauthier CJ, Steele CJ, Bazin PL, Schäfer A, Schaefer A, Turner R, Villringer A (2016) Advanced MRI techniques to improve our understanding of experience-induced neuroplasticity. Neuroimage 131:55–72. 10.1016/j.neuroimage.2015.08.047 [DOI] [PubMed] [Google Scholar]
  87. Thomas JM, Huber E, Stecker GC, Boynton GM, Saenz M, Fine I (2015) Population receptive field estimates of human auditory cortex. Neuroimage 105:428–439. 10.1016/j.neuroimage.2014.10.060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Turner R. (2015) Myelin imaging. In: Brain mapping: an encyclopedic reference, pp 137–142. San Diego: Academic Elsevier. [Google Scholar]
  89. Wallace MN, Cronin MJ, Bowtell RW, Scott IS, Palmer AR, Gowland PA (2016) Histological basis of laminar MRI patterns in high resolution images of fixed human auditory cortex. Front Neurosci 10:455. 10.3389/fnins.2016.00455 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Wehr M, Zador AM (2003) Balanced inhibition underlies tuning and sharpens spike timing in auditory cortex. Nature 426:442–446. 10.1038/nature02116 [DOI] [PubMed] [Google Scholar]
  91. Woods DL, Stecker GC, Rinne T, Herron TJ, Cate AD, Yund EW, Liao I, Kang X (2009) Functional maps of human auditory cortex: effects of acoustic features and attention. PLoS One 4:e5183. 10.1371/journal.pone.0005183 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES