Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Jul 5.
Published in final edited form as: Neuron. 2017 Jun 15;95(1):180–194.e5. doi: 10.1016/j.neuron.2017.05.019

A corticothalamic circuit for dynamic switching between feature detection and discrimination

Wei Guo 1,2,^, Amanda R Clause 1, Asa Barth-Maron 1, Daniel B Polley 1,3,*
PMCID: PMC5568886  NIHMSID: NIHMS877468  PMID: 28625486

Summary

Sensory processing must be sensitive enough to encode faint signals near the noise floor, but selective enough to differentiate between similar stimuli. Here we describe a layer 6 corticothalamic (L6 CT) circuit in the mouse auditory forebrain that alternately biases sound processing towards hypersensitivity and improved behavioral sound detection or dampened excitability and enhanced sound discrimination. Optogenetic activation of L6 CT neurons could increase or decrease the gain and tuning precision in the thalamus and all layers of the cortical column, depending on the timing between L6 CT activation and sensory stimulation. The direction of neural and perceptual modulation – enhanced detection at the expense of discrimination or vice versa – arose from the interaction of L6 CT neurons and sub-networks of fast-spiking inhibitory neurons that reset the phase of low-frequency cortical oscillations. These findings suggest that L6 CT neurons contribute towards resolving the competing demands of detection and discrimination.

Keywords: Layer 6, auditory cortex, auditory thalamus, perception, modulation, sensory gain, phase reset, delta rhythm, theta rhythm, oscillation, plasticity

Introduction

Environmental stimuli are transduced, amplified and spatially enhanced by low-level circuits contained within the brainstem and sensory end organs. Afferent sensory traces undergo another set of transformations upon reaching forebrain sensory areas, where they are contextualized according to internal state, recent stimulus histories, long-term sensory experience and top-down predictions of behavioral relevance (David et al., 2012; McGinley et al., 2015; Mesgarani and Chang, 2013; Polley et al., 2006; Shuler and Bear, 2006; Sohoglu and Chait, 2016). Adaptive modulation of forebrain sensory traces is accomplished through the interaction of long-range neuromodulatory inputs with local excitatory-inhibitory microcircuits (Fu et al., 2014; Kuchibhotla et al., 2016; Letzkus et al., 2011; Marlin et al., 2015; Pi et al., 2013; Pinto et al., 2013; Schneider et al., 2014; Zhou et al., 2014). The contribution of corticothalamic (CT) neurons to forebrain sensory modulation is intriguing in this respect, as CT neurons feature both a long-range feedback projection to the thalamus as well as dense local connectivity with excitatory and inhibitory neurons within the cortical column (Bortone et al., 2014; Bourassa and Deschenes, 1995; Briggs et al., 2016; Llano and Sherman, 2008; Ramon y Cajal, 1906; Winer et al., 2001; Zhang and Deschenes, 1997).

Layer 6 (L6) CTs are glutamatergic pyramidal neurons, yet their targeted activation via optogenetic strategies primarily induces a net suppression of spontaneous and sensory-evoked activity in the cortex via direct connections onto local fast-spiking (FS) inhibitory neurons (Bortone et al., 2014; Kim et al., 2014; Olsen et al., 2012). Whereas activating L6 CT neurons scales down sensory-evoked responses in most layers of the cortical column, their effect on thalamic responses are a mixture of modest facilitation and suppression (Denman and Contreras, 2015; Mease et al., 2014; Olsen et al., 2012; Temereanca and Simons, 2004). A recent study performed in the somatosensory thalamocortical slice preparation has opened a new framework for studying these circuits by showing that their mode of modulation depends entirely on timing; L6 CT neurons can dynamically mediate either synaptic suppression or enhancement, depending on their frequency and time course of activation (Crandall et al., 2015). Here, we pursue this idea in the intact animal by showing that L6 CT neurons can impose multiple forms of modulation on auditory responses in the primary auditory cortex (A1) and medial geniculate body of the thalamus (MGB), where again the sign of modulation – suppression or enhancement - depends on the timing between sensory stimuli and L6 CT spiking.

Enhancement or suppression of cortical sensory representations has an immediate and direct impact on perceptual salience, as studied behaviorally (Froemke et al., 2013; Pinto et al., 2013; Sohoglu and Chait, 2016). Studies of sensory processing in humans and non-human primates suggest that cortical networks can enhance detection, segregate stimulus sources and suppress distracting stimuli by organizing the frequency and phase of low-frequency oscillations in the cortical electrical field (Giraud and Poeppel, 2012; Schroeder and Lakatos, 2009). A stimulus falling on the high excitability phase of a cortical oscillation would recruit strong spiking in principal neurons and robust perception, whereas the same stimulus falling on the low excitability phase might fail to drive spiking activity and could go undetected (Lakatos et al., 2008; Zion Golumbic et al., 2013). A neural circuit responsible for controlling the phase of low-frequency cortical rhythms has yet to be identified.

Here, we present findings that bring these studies into alignment by showing that L6 CT neurons can both enhance and suppress activity in the cortex and thalamus of awake mice, in agreement with studies in the acute thalamocortical slice, depending on the temporal interval between their spiking and sensory-evoked responses (Crandall et al., 2015). In keeping with prior observations in visual cortex, we also find that L6 CT neurons are functionally connected with a subtype of FS interneuron that modulates sensory gain in all layers of the cortical column (Bortone et al., 2014; Olsen et al., 2012). Finally, as per recent studies in non-human primates, we show that activating and deactivating L6 CT and FS neurons generates distinct cortical rhythms that modulate the excitability of cortical sensory responses and bias perceptual processing towards modes that favor enhanced detection of faint sounds or the enhanced resolution of similar sounds (Lakatos et al., 2008). However, unlike earlier studies, we report here that the most striking effects of L6 CT neurons on thalamocortical sound processing and auditory perception are found immediately after L6 CT neurons stop spiking; concurrent presentation of sound stimuli with L6 CT activation induces a weaker additive increase in spiking with no demonstrable effect on sound detection or discrimination.

Results

Ntsr1-Cre targets a subset of L6 neurons in auditory cortex that project to the thalamus

Recent studies of corticothalamic modulation have taken advantage of the Ntsr1-Cre transgenic mouse, which labels a subpopulation of pyramidal neurons in L6 that have short-range vertical connections within the cortical column and subcortical projections that deposit axon collaterals in the thalamic reticular nucleus (TRN) en route to the dorsal thalamus (Bortone et al., 2014; Gong et al., 2007; Kim et al., 2014; Lee et al., 2012; Olsen et al., 2012). Anatomical tracer experiments in the visual and somatosensory cortex confirmed that virtually all Ntsr1-positive neurons (Ntsr1+) are L6 corticothalamic (L6 CT), and virtually all L6 CT neurons are Ntsr1+ (Bortone et al., 2014; Kim et al., 2014). To determine whether the same specificity exists in the auditory cortex, we crossed the Ntsr1-Cre line with a Cre-dependent tdTomato reporter line and injected green fluorescent microspheres into the MGB of their double transgenic offspring (Fig. 1a). After allowing the beads 1 week for retrograde transport, we immunolabeled coronal sections of the auditory cortex for the ubiquitous neuron marker, NeuN, and compared the co-localization of beads in Ntsr1+ neurons (n = 824 neurons from 8 hemispheres of 4 mice, Fig. 1B–D). We observed that approximately 35% of L6 neurons neither projected to the MGB, nor expressed Ntsr1, and were therefore most likely intracortical neurons (Fig. 1E–F, left bars). Among Ntsr1+ neurons, 97% also contained beads, indicating that nearly all Ntsr1+ neurons are CT, which is remarkable considering that the not all CT neurons would necessarily be expected to innervate the region of the lateral MGB targeted for retrobead injection (Fig. 1E). Conversely, 90% of L6 CT neurons were also Ntsr1+ (Fig. 1F). Therefore, while a small fraction of L6 CT neurons may not express Ntsr1-Cre, virtually all Ntsr1+ neurons in the auditory cortex are in L6 and are CT, confirming prior reports made in the visual and somatosensory cortex that the Ntsr1-Cre line offers a powerful approach to study L6 CT projection neurons.

Figure 1. A transgenic strategy to selectively target layer 6 auditory corticothalamic neurons (L6 CT).

Figure 1

(A) Ntsr1-Cre mice were crossed with a Cre-dependent tdTomato reporter line. Fluorescent microspheres injected into the medial geniculate body (MGB) were retrogradely transported to the cell bodies of CT neurons in the auditory cortex. (B) After allowing the thalamic beads 7 days for retrograde transport, coronal sections of auditory cortex were immunolabeled for NeuN, a ubiquitous neural marker. As expected, CT cell bodies identified with green beads were occasionally found in L5 (white arrow) but were concentrated in L6, where Ntsr1+ cell bodies (red) were abundant. Scale bar = 100 μm. (C) Magnification of area designated by white square in (B). Many L6 neurons (blue channel, top left) are Ntsr1+ (red channel, top right) and project to a MGB region near the retrobead injection zone (green channel, bottom left). L6 neurons that do not project to the MGB and do not express Ntsr1-Cre (gray arrowhead, bottom right) are interspersed among Ntsr1+ L6 CT neurons (white arrowhead). Scale bar = 10 μm. (D) We quantified the percentage of L6 neurons that were CT and/or Ntsr1+ in 824 L6 neurons from 8 hemispheres of 4 mice. (E–F) Summary histograms for the percentage of neurons that were positive or negative for beads (E) or Ntsr1 (F). Approximately 35% of L6 neurons do not express Ntsr1 and do not project to the MGB retrobead injection zone (left bars). By contrast, 97% of L6 Ntsr1+ neurons project to the MGB, confirming that Ntsr1-Cre selectively targets L6 CT neurons in the auditory cortex.

L6 CT neurons induce alternating periods of activation and suppression across the cortical column

To manipulate the activity of L6 CT neurons, we expressed channelrhodopsin (ChR2) in the auditory cortex of adult Ntsr1-Cre mice using a cre-dependent viral construct. Ntsr1+ soma were observed in L6, with intense neuropil staining in L4 of A1 and the ipsilateral MGB, and a fainter band of labeling in L1 (Fig. 2A). We recorded unit activity from all layers of the primary auditory cortex (A1) in awake, head-fixed mice while activating L6 CT neurons with blue light (n = 418 recording sites from 11 mice, Fig. 2B,C). Whereas optogenetic activation of L6 CT neurons suppresses spiking in V1 (Olsen et al., 2012), it increases firing rates in all layers of A1 (Fig. 2D, refer to supplemental table 1 and figure legends for all statistical reporting). Closer inspection revealed that firing rate elevation was observed at the onset and steady-state of laser activation, yet robust suppression of spiking was observed at a short delay following the offset of laser activation (50–100 ms post-laser offset), followed by rebound excitation at a longer delay after laser offset (150–200 ms post-laser offset) (Fig. 2E). Thus, the effect of optogenetically activating L6 CT neurons in A1 was opposite to prior reports in V1. Moreover, deactivation of L6 CT neurons upon laser offset initiated an alternating pattern of spiking suppression and facilitation that has not been described previously. None of these phenomena were observed in Ntsr1-Cre mice that were injected with a reporter virus that did not encode ChR2 (Supplemental Fig. 1).

Figure 2. Optogenetic activation of L6 CT neurons in A1 induces alternating periods of spike enhancement and suppression.

Figure 2

(A) ChR2 was expressed in L6 CT neurons by injecting a Cre-dependent viral construct into A1 of Ntsr1-Cre mice. The fluorescent mCherry reporter is visible in L6 cell bodies, dense bands of neuropil in L4 and in MGB axons. (B) Schematic of columnar recording during L6 CT activation. (C) Sound-evoked (left) and laser-evoked (right) laminar profiles of current source density (CSD) amplitude from a single A1 penetration in an awake mouse. Multiunit activity (MUA) at each location is represented by the superimposed white peristimulus time histograms (PSTH, scale bar = 100 spikes/s). White arrow indicates a brief, early sound-evoked current sink used to identify L4. (D) PSTHs represent the mean MUA in each layer. Laser power at the tip of the optic fiber ranges from 5–50 mW. Error bars represent 1 SEM. (E) The change in laser-evoked firing rates relative to baseline activity increase monotonically during the onset and sustained periods (ANOVA, F > 10, p < 1 x 10−6 for each, orange and gray arrows, respectively), decrease immediately following laser offset (F=22.99, p < 1 x 10−6, purple arrow) and increase again 100ms later (F=4.01, p < 5 x 10−5, green arrow), particularly in L2/3, L4 and L5. Error bars represent 1 SEM. A detailed breakdown of all statistical tests can be found in Supplemental Table 1.

Optogenetic activation of L6 CT neurons bi-directionally modulates sound processing in A1

As a next step, we characterized how dynamic changes in columnar excitability elicited by L6 CT activation affected sound-evoked responses and frequency tuning. We measured pure tone frequency tuning from A1 units with or without optogenetic activation of L6 CT units (Fig. 3A). Because optogenetic activation of L6 CTs induced alternating periods of enhancement and suppression, we staggered the onset of pure tone stimuli relative to laser onset at 17 discrete intervals (0 – 800 ms in 50 ms steps, Fig. 3B). As illustrated from a representative L4 unit recording, L6 CT activation induced clear modulation of both auditory responsiveness and auditory tuning (Fig. 3C). The modulation was divergent, alternating between enhancement during L6 CT activation, suppression immediately following the deactivation of L6 CT neurons and then a second enhancement at longer delays following laser offset.

Figure 3. Activating L6 CT neurons induces a tri-phasic modulation of sound-evoked A1 responses.

Figure 3

(A) Pure tone pips were presented to the left ear while recording spiking activity from the contralateral A1 with and without optogenetic activation of L6 CT neurons. Frequency response functions are illustrated with a Gaussian fit. (B) Tones were presented without L6 CT activation (50% of trials, gray) or with L6 CT activation, where the delay between the onset of tone bursts and laser pulses varied from 0 – 800 ms. (C) L6 CT activation imposed diverse forms of modulation on a representative L4 recording site. (D–E) The scaling and shifting components were computed in the same example L4 site (D) and a representative L6 site (E) by regressing the mean normalized tone-evoked firing rates measured during the tone-alone trials against the firing rates recorded during the three epochs surrounding L6 CT activation. (F–G) Mean (±1 SEM) shifting and scaling modulation were computed for each multiunit site for recordings made with a 400 ms laser pulse at 20 mW (F) or a 50 ms laser pulse set to a minimally effective laser power (G). Inset: Laser-induced tuning modulation was not observed from a separate cohort of mice injected with a control virus (Mixed design ANOVA, main effect for shifting modulation: F(2,20) = 0.24, p = 0.79; main effect for scaling modulation: F(2,20) = 2.17, p = 0.14). Statistically significant shifting and scaling modulation for all permutations of laser duration, response period and layer were determined with one-sample t-tests against a population mean of 0 (shifting modulation) or 1.0 (scaling modulation). Looking across both laser durations, we observed significant additive changes during L6 CT activation in L2/3, L5 and L6 (p < 0.05 for each) and significant divisive gain in L4 and L5 (p < 0.05 for each). In the short-delay period following L6 CT deactivation, we observed significant subtractive changes in all layers and significant divisive gain in all but L6 (p < 0.005 for each). In the long-delay period following L6 CT deactivation, we observed significant additive and multiplicative changes for L2/3, L4 and L5 (p < 0.05 for each) but no significant change in L6 tuning (p > 0.1 for each). (H) The mixture of shifting and scaling modulation created sharper frequency tuning during the short-delay period but wider frequency tuning in the long-delay period. Tuning changes in octaves (oct.) are estimated from the change in width at half-height between the tone-alone and tone + L6 CT activation for the 400 ms laser (top) and 50 ms laser (bottom) conditions. Asterisks indicate p < 0.05 with a one-sample t-test relative to a population mean of 0.

Sensory response modulation can be expressed as a linear transformation from responses without laser (roff) to responses with laser (ron) by the formula ron = a × roff + b, where additive/subtractive modulation (b > 0 or b < 0, respectively) reflect constant shifts across the entire tuning function and multiplicative/divisive gain (a > 1, a < 1, respectively) reflects scaling changes that preserve the shape of the tuning function (Seybold et al., 2015). In V1, L6 CT activation induces a purely divisive scaling change such that visual responses were suppressed without affecting orientation tuning (Olsen et al., 2012). Referring to the same representative L4 unit shown in Fig. 3C, we observed a purely additive shift during L6 CT activation, divisive gain when tones were presented at a short delay following laser offset and a multiplicative gain when tones were presented at a longer delay after laser offset (Fig. 3D). By contrast, a representative L6 unit was strongly activated by the laser (likely because it expressed ChR2) but did not show any notable changes in auditory responsiveness after laser offset (Fig. 3E). These effects were summarized by plotting the mean additive/subtractive and multiplicative/divisive modulation in Cartesian coordinates, where pure shifting changes fell on the x-axis, pure scaling changes fell along the y-axis, and mixtures of shifting and scaling modulation fell on the diagonals. For units in L6, the effect of L6 CT activation was straightforward and matched prior reports in V1: the evoked-firing rate change was purely additive during laser with comparatively modest modulation at later time periods. For units in L2/3–L5, L6 CT activation induced a tripartite modulation of auditory responses: additive scaling during L6 CT activation, divisive gain shortly after L6 CT deactivation (50 ms following laser offset) and multiplicative gain at a longer delay following L6 CT deactivation (150 ms following laser offset) (Fig. 3F).

One possibility is that these dynamics arose from an unrealistically long and intense period of L6 CT activation. We controlled for this by performing the same measurements with a shorter laser pulse set to a minimally effective amplitude (50 ms, 5 mW above threshold). The results of this stimulation paradigm were largely the same (Fig. 3G). As a negative control for non-specific effects of laser activation on A1 receptive fields, we did not observe any systematic modulation of auditory responses in Ntsr1-Cre mice that expressed only a control fluorophore in L6 CT units (n = 96 recording sites from 3 mice; Fig. 3G, inset). Importantly, the modulation of tone-evoked responses after laser offset was not purely divisive or multiplicative. The significant subtractive and additive components observed in the short- and long-delay modulation, respectively, suggested that the precision of frequency tuning was also affected. Indeed, for several cortical layers, frequency tuning was significantly narrower during the short delay period and significantly wider during the long delay period following laser offset (Fig. 3H).

Optogenetic activation of L6 CT neurons can alternately bias sound perception towards enhanced detection or discrimination

To study L6 CT modulation of sound perception, we implanted optic fibers over the left and right auditory cortex of Ntsr1-Cre mice that expressed ChR2 in A1 of both hemispheres and measured their sound detection and discrimination performance in an auditory avoidance task (n = 5 mice). In this task, mice were trained to cross from one side of a shuttlebox to the other shortly following the presentation of tone pips at the target frequency, but not to foil tones at other frequencies (Fig. 4A). The distinct forms of receptive field modulation during L6 CT activation or shortly following L6 CT deactivation inspired two hypotheses (Fig. 4B, left): i) The divisive/subtractive gain observed at a short delay following laser offset (50 ms) would suppress A1 activity, thereby impairing tone detection. However, the sharper tuning would improve frequency discrimination; ii) Conversely, the multiplicative/additive gain observed at the longer delay period following laser offset (150 ms) would enhance tone-evoked activity and improve tone detection performance. However, the loss of tuning precision would impair tone discrimination.

Figure 4. L6 CT activation can bias sound perception towards enhanced detection at the expense of discrimination, or vice versa.

Figure 4

(A) Mice were trained in an auditory avoidance task that required them to cross from one side of a shuttle box shortly following the presentation of 14 kHz tone bursts (target) but not tones of other frequencies (foils). Mice expressed ChR2 in L6 CT neurons in left and right auditory cortex and were implanted with bilateral optic fiber assemblies. (B) Schematic of A1 tuning modulation and design of behavioral optogenetics experiment. The distinct types of receptive field modulation following L6 CT deactivation were predicted to have dissociable effects on tone detection and discrimination behaviors. (C) Probability of a “Go” (i.e., crossing) response for target tones, foil tones, the laser stimulus alone and the three combined tone + laser test conditions as a function of sound level. Compared to tone-alone trials, target detection is impaired in the short-delay configuration but enhanced in the long-delay configuration (ANOVA, main effects for delay, F = 10.44, p < 0.005 for short and long conditions). (D) Probability of a Go response as a function of frequency separation between the target tone and the foil tone at a fixed sound level (40 dB SPL). Discrimination is enhanced for difficult conditions (10%) in the short delay condition but is reduced in easy conditions (20%) in the long delay condition, ANOVA, main effect for delay condition, F = 14.3, p < 0.0005 for both short and long delays). (E) Mean (±1 SEM) target detection threshold (thr.), defined as the sound level associated with a 50% probability of making a Go response on target trials. (F) Mean (±1 SEM) false alarm threshold, defined as the frequency spacing associated with a 50% probability of making a Go response on foil trials. Horizontal lines in E and F represent p < 0.05 using a paired t-test between tone-alone and the corresponding tone + laser condition, after correcting for multiple comparisons. (G) Overall sensitivity, measured with the d’ statistic, was higher on short-delay trials, but the difference is not statistically significant after correcting for multiple comparisons (paired t-test, p = 0.16). However, the d’ statistic was significantly different than zero (no separation between the hit and false positive distributions) for the short-delay period, indicated by asterisk (one-sample t-test, p < 0.001, p > 0.1 for all other conditions).

We tested these predictions by interleaving behavioral trials where 50 ms tone pips were presented 1) without L6 CT activation (tone alone), 2) concurrent with 50 ms pulses of L6 CT activation, 3) shortly after L6 CT deactivation, or 4) at a longer delay following L6 CT deactivation (Fig. 4B, right). Importantly, activating L6 CT neurons without tone presentation did not cause mice to “hear the light” and cross sides of the shuttle box, as described previously with non-specific optogenetic activation of subcortical auditory centers (Guo et al., 2015) (Fig. 4C, left). Mice performed predictably on tone-alone trials; the likelihood of detecting the target tone increased with sound level, and false alarms with these easily discriminable frequencies were rare (Fig. 4C, black). Concurrent L6 CT activation did not affect tone detection performance (Fig. 4C, orange). Compared to tone alone trials, target detection was impaired in short delay trials but significantly enhanced in long delay trials, without any non-specific effects on foil tones (Fig. 4C, purple and green).

We observed the opposite pattern of perceptual changes when mice were required to discriminate between increasingly similar tone frequencies rather than detect faint tones. In the tone alone condition, mice were unable to discriminate the target and foil frequencies when they differed by 10% or less (Fig. 4D, black). Concurrent L6 CT activation had no effect on tone discrimination (Fig. 4D, orange). In the short delay period, where divisive/subtractive modulation dominated A1 responses, accurate discrimination of targets and foils persisted at frequency differences as small as 10%, even though detection was impaired overall (Fig. 4D, purple). In the long delay period, where multiplicative/additive modulation was prevalent, discrimination was impaired for targets and foils that were separated by as much as 20%, even though detection of target tones was enhanced overall (Fig. 4D, green).

These findings demonstrate that L6 CT neurons can enhance sensory feature detection or discrimination, depending on the relative timing between their activation and sensory stimuli. Compared to tone-alone trials, detection thresholds were elevated by 11.2 ± 1.9 dB when tones were presented 50ms after L6 CT neurons were deactivated, but were improved by 11.4 ± 2.6 dB on trials where the delay was just 100ms longer (Fig. 4E). On the other hand, the threshold for mistaking the foil tone frequency for the target frequency (false alarm) was reduced to a 8.7 ± 1.17 % difference at the short delay following L6 CT deactivation but increased to 20.7 ± 1.87 % on long delay trials (Fig. 4F). Discriminability of similar tones (10% between target and foil), estimated here with the d’ statistic, was only significantly different than zero during short delay period when A1 responses were suppressed and tuning was more precise (Fig. 4G). Thus, the perceptual effects of L6 CT activation were robust (net effect on tone detectability and discriminability > 20 dB and >25%, respectively), were only observed in trials where L6 CT activation preceded sound onset, and were bi-directional, supporting either feature detection or feature discrimination.

L6 CT neurons enhance A1 sound responses by modulating a thalamic microcircuit

In addition to their local connections within A1, L6 CT neurons deposit axon collaterals in the thalamic reticular nucleus (TRN) before terminating in the ventral subdivision of the medial geniculate body of the thalamus (MGBv, Fig. 5A). The auditory sub-region of TRN is composed of GABAergic neurons that project to MGBv, whereas the mouse MGBv contains no GABAergic interneurons, but rather only glutamatergic neurons that project both to TRN and to the middle layers of A1 (Hackett et al., 2015; Ito et al., 2011; Jones, 2007) (Fig. 5B). Modulation of A1 responses could arise either through local circuit effects of L6 CT neurons within the cortical column or could be inherited from the dynamic interplay of inhibition and excitation between TRN and MGBv. To characterize whether and how receptive field modulation in A1 could be attributed to changes at the level of the thalamus, we recorded from all layers of the A1 column in head-fixed, awake mice during L6 CT activation while making simultaneous recordings from tonotopically matched regions of the MGBv (n= 106 recordings sites from 5 mice) or TRN (n= 24 recording sites from 2 mice, Fig. 5C).

Figure 5. Enhanced A1 responses at long delays following L6 CT activation can be attributed to short-term dynamics in thalamic sound processing.

Figure 5

(A) Coronal sections showing mCherry expression in auditory L6 CT neurons and medial geniculate body (MGB) axon terminals (top) as well as a more rostral section showing the L6 CT axon bundle in the internal capsule (ic) and collaterals in the thalamic reticular nucleus (TRN, bottom). (B) Schematic of procedure for simultaneous recordings of the A1 column and either MGB or TRN in awake, head-fixed mice. The ventral, medial and dorsal subdivisions of the MGB are illustrated (v, m and d, respectively). Parameters for L6 CT activation laser were a 50 ms duration laser pulse, 5 mW above threshold. (C) Frequency response areas (FRAs) from simultaneously recorded A1/MGB or A1/TRN sites. Recordings were topographically aligned such that frequency tuning was roughly matched between cortical and thalamic sites. (D) Iso-intensity frequency tuning functions from representative L4, MGB and TRN recording sites. The gray and blue functions correspond to the tone-alone and tone + laser conditions, respectively. Enhanced auditory responses are observed in both A1 and thalamus when tones and L6 CT activation are concurrent (orange). Divisive suppression is found in A1 shortly after L6 CT deactivation but not in either thalamic recording site (purple). Multiplicative enhancement is observed in A1 and MGB at long delays following L6 CT deactivation (green). TRN tuning is suppressed at this interval. (E) Mean (±1 SEM) tone-evoked firing rates normalized to the best frequency in the tone-alone condition (gray). Compared to tone-alone responses, firing rates were significantly elevated with concurrent L6 CT activation in A1, MGB and TRN units (paired t-test, p < 0.05); during the short delay period, A1 and TRN units showed significantly reduced firing rates (paired t-test, p < 0.05) while MGB units showed unchanged firing rates (p > 0.05); during the long delay period, A1 and MGB units showed significantly enhanced firing rates (p < 0.05) while TRN units showed significantly reduce firing rates (p < 0.05). (F) Mean (±1 SEM) shifting and scaling modulation computed for each multiunit site with the paired recording approach. Statistically significant shifting and scaling modulation for all permutations of laser duration, response period and brain structure was determined with one-sample t-tests against a population mean of 0 (shifting modulation) or 1.0 (scaling modulation). We observed significant additive gain during L6 CT activation in L2–5 and MGB and significant divisive gain in L2–5 (p < 0.05 for each). In the short-delay period following L6 CT deactivation, we observed significant subtractive and divisive gain only in L2–5 (p < 0.05 for both). In the long-delay period following L6 CT deactivation, we observed significant additive gain in MGB and L2–5, significant multiplicative gain in L2–5 and MGB, and significant subtractive and divisive gain in TRN (p < 0.05 for each).

We found that auditory responses were enhanced in both MGB and TRN during L6 CT activation (50ms), much as they were in A1 (Fig. 5D–F, orange). Shortly following the offset of L6 CT activation, when A1 responses were strongly suppressed, frequency tuning in MGBv and TRN was not significantly changed from the tone alone condition (Fig. 5D–F, purple). During the long delay period following laser offset, frequency tuning in MGBv showed a comparable level of multiplicative/additive enhancement as was simultaneously observed in A1 (Fig. 5D–F, green). Interestingly, we found that TRN modulation during the long delay period was inverted; whereas MGB and A1 both showed multiplicative-additive modulation 150 ms after laser offset, tuning modulation in the TRN was a mixture of divisive-subtractive.

These findings lead us to conclude that enhanced behavioral detection and A1 unit responses at the longer delay following L6 CT deactivation could be fully explained by a change in sound-evoked MGBv activity. Whereas MGBv and A1 unit responses were enhanced at a longer delay following L6 CT deactivation, the modulation of sound-evoked activity in TRN was matched in strength but was opposite in sign. This suggests that enhanced auditory responsiveness at longer delays following L6 CT deactivation could have arisen from a purely intra-thalamic circuit wherein the feedforward inhibition from TRN to MGBv is scaled down over time, as has been suggested from studies of L6 CT activation in thalamocortical slice recordings (Crandall et al., 2015). Reduced inhibition from TRN could disinhibit MGBv neurons, making them hypersensitive to auditory stimulation (Sherman and Guillery, 2002). By contrast, there was no thalamic antecedent for the robust A1 suppression observed just after L6 CT neurons were deactivated. This raises the possibility that enhanced frequency discrimination, reduced tone detection and suppressed A1 responses observed shortly following L6 CT deactivation were not mediated by the L6 feedback to the thalamus, but instead were mediated through the intracortical connections of L6 CT neurons.

L6 CT activation changes the frequency and resets the phase of low-frequency cortical rhythms

Cortical spiking activity rides on a background of slower undulations in the underlying electric field (Figure S2A–C). The cortical local field potential is generated by the flow of transmembrane currents distributed across a volume of tissue spanning hundreds of microns laterally and up to several millimeters vertically (Fig. S2D) (Kajikawa and Schroeder, 2011). Low-frequency oscillations in cortical electric fields can be studied at higher spatial resolution by measuring the second spatial derivative of the local field potential, the current source density (CSD), using linear multielectrode arrays that evenly sample cortical activity across all layers (Fig. S2E) (Kaur et al., 2005; Muller-Preuss and Mitzdorf, 1984; Kajikawa and Schroeder, 2011). The amplitude, frequency and phase of the underlying CSD are closely linked with spike probability and sensory tuning in auditory cortex (Fig. S2F) (Kayser et al., 2015; O’Connell et al., 2011). This lead us to question whether the unexplained suppression of spiking activity associated with L6 CT deactivation was linked to stereotyped changes in the underlying cortical electric field, as estimated from the translaminar CSD.

L6 CT activation induced an alternating pattern of current sinks and sources similar to the laminar signature of sound-evoked CSD signals (Fig. 2C). When studied in the time domain, L6 CT activation drove robust high-frequency oscillations across the cortical column (Fig. 6A) with a distinct high gamma peak in L5 and L6 (110 Hz with its 220 Hz harmonic) and two peaks in the high gamma range in L2/3 and L4 (40 Hz and 110 Hz) (Fig. 6B). By contrast, L6 CT activation in V1 elicits a single peak in the L6 frequency spectrum at 60 Hz (Olsen et al., 2012). Abrupt cessation of L6 CT spiking at laser offset initiated a low-frequency delta-theta rhythm (2–6 Hz) across all layers (Fig. 6C). L6 CT deactivation induced one reliable cycle of this delta-theta rhythm regardless of laser duration, with a L2/3 current source occurring 50–100 ms after laser offset and a current sink occurring 100–200 ms later (Fig. 6D). Prior work demonstrates a strong correlation between spike probability and the phase of the L2/3 low-frequency cortical phase, with low spiking probability associated with the upstroke of the current source (at 0 radians) and high spiking probability aligned to the downstroke of the current sink (π radians) (Lakatos et al., 2008, 2013). We noted that auditory suppression in the short delay period following L6 CT deactivation was aligned with the upstroke of the CSD, while the auditory enhancement observed in the long delay period was aligned with the CSD downstroke (Fig. 6E). Importantly, the particular signature of L6 CT activation (Fig. S3A), was not observed upon optogenetic activation of L5 corticofugal projection neurons (Fig. S3B), cholinergic modulatory afferents from the basal forebrain (Fig. S3C), parvalbumin-expressing FS interneurons (Fig. S3D), or sensory activation with sound alone (Fig. S3E). Therefore, the signature of strong high gamma activity followed by a low-frequency rhythm at laser offset was specific to L6 CT neurons and is not simply the product of driving a hypersynchronous response from any population of auditory cortex neurons.

Figure 6. L6 CT activation changes the frequency and resets the phase of local electric field oscillations in A1.

Figure 6

(A) The raw L6 CT-evoked CSD signal recorded across the A1 column from a single trial in an awake mouse. Optogenetic activation of L6 CT neurons induces a high-frequency oscillation while the laser is on, followed by a few cycles of a low-frequency rhythm following L6 CT deactivation. Scale bar = 0.2 s and 5 mV/mm2. (B–C) Change in CSD frequency power spectrum during laser (blue) and 0–400 ms after the laser is turned off (black) relative to pre-laser baseline. B plots the full frequency range to highlight the high-gamma peak during laser activation, whereas C plots frequencies ≤ 30 Hz to emphasize the delta-theta power after laser offset. (D) Mean (±1 SEM) L2/3 unfiltered CSD amplitude for laser durations varying from 10–400 ms. The short- and long-delay periods following L6 CT deactivation are indicated by the purple and green arrows, respectively. (E) Phase histograms at the corresponding laser duration for the short- and long-delay period. L2/3 CSD phase is consistently near zero in the short-delay period and pi in the long-delay period. (F) Normalized spontaneous firing rate in each layer as a function of the spontaneous L2/3 delta-theta CSD phase (2–6 Hz). Spontaneous firing rate was modulated across L2/3 CSD phase for all layers (ANOVA, F > 3.6 and p < 0.001), with the lowest spike rate consistently occurring at the zero phase. (G) Tone-evoked frequency tuning functions at three phases of the spontaneously occurring L2/3 delta-theta CSD: zero, π and the average of the intermediate phases, ±π/2. Tuning shape was significantly modulated by phase in L4, L5 and L6, (ANOVA, F > 6.3, p < 0.005 for all), but not in L2/3 itself (F = 1.4, p = 0.28).

We also asked whether inactivating L6 CT units with a hyperpolarizing opsin would have opposite effects to findings described with ChR2. We addressed this question by expressing the neural silencer, ArchT, in L6 CT units and performing an additional set of paired recordings from A1 and MGBv of awake mice (n = 265 units in 4 mice). Whereas activating L6 CT neurons that express ChR2 increased firing rates throughout the column (Fig. 2 and Fig. S4A, orange data points), silencing L6 CT neurons suppresses spiking throughout the column (Fig. S4B–D, orange data points). Similarly, whereas frequency tuning in A1 is strongly suppressed at a short delay following the offset of ChR2 activation (Fig. 3 and 5E, purple data points), sound-evoked responses at the same delay following the offset of ArchT inactivation was enhanced (Fig. S4E, purple data points). Finally, whereas activating L6CT neurons with ChR2 induces a robust high gamma rhythm (Fig. 6B) followed by several cycles of delta-theta at laser offset (Fig. 6B–C), silencing L6CT neurons only induces the low-frequency rhythm (Fig. S4F). Generally, the effects of briefly inhibiting L6 CT neurons with ArchT were less pronounced than driving spiking with ChR2, and the receptive field modulation was not the exact mathematical inverse at each time interval. However, the effects of adding and removing spikes in single neurons would not be expected to have exactly symmetric effects at the level of non-linear, recurrently interconnected networks like a cortical column (Phillips and Hasenstaub, 2016; Seybold et al., 2015). As a first approximation, the effects of activating and then deactivating L6 CT neurons with ChR2 was opposite to the effects of silencing and then reactivating L6 CT neurons with ArchT.

As a final proof that the findings described here were not a purely artificial byproduct of introducing a hyper-synchronized volley of spikes with ChR2, we also characterized A1 spiking modulation when delta-theta rhythms occurred spontaneously, independent of any optogenetic activation. We found that spontaneous spike probability in A1 could be modulated by as much as 30% according to the phase of naturally occurring L2/3 delta-theta rhythms (Fig. 6F). Spiking was modulated in all layers, though least in L6. As predicted, spike probability was strongly suppressed at the upstroke of the CSD (phase 0), which corresponds to the CSD phase during the short delay period following laser offset (Fig. 6E). Spontaneously arising delta-theta rhythms modulated sensory tuning as well. Tone-evoked spikes falling on the 0 phase of delta-theta rhythms were suppressed, leading to divisive gain of A1 tuning functions in L2/3, L4 and L5 that closely resembled the tuning modulation observed at the short delay following L6 CT deactivation (compare Fig. 5E, purple to Fig. 6G, purple). Importantly, neither spontaneous spiking nor frequency tuning was enhanced at the downstroke of the CSD (phase π) relative to intermediate phases (±π/2). This agrees with our prior assertion that the facilitated spiking at the longer delay following L6 CT deactivation was not generated by an intracortical circuit, but instead arose from dynamic switching of inhibition and excitation between MGB and TRN. By contrast, the circuit dynamics underlying the suppressive effects of L6 CT activation were likely to arise from within A1, as has previously been described in V1 (Bortone et al., 2014). Our findings suggest that suppressed unit responses and enhanced tone discrimination observed during the short delay period following L6 CT deactivation could be achieved by inducing a low-frequency rhythm in A1 with the low-excitability phase aligned to sensory-evoked thalamocortical inputs.

L6 CT neurons control low-frequency network oscillations by driving a sub-population of fast-spiking cortical neurons

The preceding analysis of phase-associated spiking underscored the correlation between L6 CT-evoked delta-theta rhythms, spike probability and sound-evoked responsiveness. As a next step, we undertook an analysis of spike-triggered changes in CSD phase and amplitude to identify a possible neural circuit that could reset the delta-theta rhythm. We examined the average cortical CSD amplitudes associated with spontaneously occurring spikes from 739 isolated single units in A1, MGB and TRN (Fig. 7A). For many single units, the spike-triggered CSD was essentially flat, suggesting that their spiking did not demonstrably change the spatiotemporal patterning of sinks and sources throughout the column. We refer to these units as “non-resetters” (Fig. 7B). For other units, spontaneously occurring spikes were associated with the emergence of a clear, laminar CSD pattern. For these “resetter” units, spontaneously occurring spikes evoked 1 cycle of a delta-theta rhythm with a clearly defined laminar pattern of sinks and sources (Fig. 7B and 6C). We rank-ordered the change in pre- versus post-spike vector strength to identify 184 single units that were clearly associated with a L2/3 CSD phase reset (Fig. 7D). Variability in the free-running, spontaneous L2/3 oscillations was reset to a single phase following a resetter spike, such that the short delay period fell on the zero phase (low excitability) and remained sharply aligned for approximately 1 cycle (Fig. 7E).

Figure 7. L6 CT neurons reset the phase of low-frequency cortical rhythms by driving a sub-type of cortical fast-spiking interneuron.

Figure 7

(A) Schematic diagram depicting the analysis approach for spike-triggered CSD phase and amplitude. (B) Spike-triggered CSD amplitude from two exemplar A1 single units. Spontaneous spikes are associated with a clear pattern of alternating sinks and sources across the cortical column in some single units (a “resetter” unit, bottom), but not others (a “non-resetter” unit, top). (C) L2/3 delta-theta CSD phase histograms at discrete time bins before and after spontaneous spikes from the same two single units shown in (B). The L2/3 CSD vector strength represents the phase precision over time. (D) Spontaneous spike-triggered phase changes from 723 single units recorded from the thalamus and cortex of awake mice are sorted according to the change in L2/3 CSD vector strength. Resetter units (n = 184) were operationally defined as those that increased the post-spike L2/3 CSD vector strength by 0.05 or more. (E) Spike-triggered phase histograms from all resetter and 185 non-resetter units. Each trace is the mean phase trajectory from a single unit. Resetter units reset the phase of the L2/3 CSD to π at the time of the spike. The spike-triggered phase remains well organized at the short- and long-delay intervals following spontaneous resetter spikes (purple/pi and green/0 phase, respectively). (F) Histogram of resetter occurrence as a function of brain region and spike waveform. G) Mean (±1 SEM) latency between spontaneous spike occurrence and the time of CSD reset, operationally defined as the peak negativity in the L2/3 delta-theta CSD waveform. For MGBv resetter (MGBvr, n = 52) and cortical FSr units (n=31), spontaneously occurring spikes occured significantly earlier than the L2/3 CSD reset, suggesting that they could induce the reset (paired t-tests, p < 0.05 for both). By contrast, RSr unit spikes (n=78) occurred during or just after the CSD reset event (paired t-test, p = 0.1). (H) Cartoon illustrating the 4 cell types in the putative CSD reset circuit and the cross-correlation analysis approach. (I) Mean (±1 SEM) cross-correlogram of L6 CT unit spike trains with the other resetter unit types. (J) Mean (±1 SEM) probability that a spike in each resetter type will follow a laser-evoked L6 CT spike. Horizontal lines represent significant differences between FSr evoked spike probability and other unit types (unpaired t-test, p < 0.05 after correcting for multiple comparisons). (K) FSr-spike triggered L2/3 CSD amplitude for spike events occurring spontaneously, during auditory stimulation or during optogenetic activation of L6 CT units. Note similarity of FSr-triggered CSD change and laser-evoked CSD change in FIg. 6D. (L) Mean (±1 SEM) laser-evoked firing rate in L6 CT neurons, FSr neurons and FS neurons not associated with CSD reset (FSnr). (M) L6 CT spiking ceases immediately at laser offset whereas FSr neuron spiking remains significantly higher for at least 10ms (Wilcoxon rank-sum, p < 0.05).

To learn more about the properties of resetter neurons, we analyzed their occurrence as a function of spatial position (TRN, MGB or A1 layer) and spike type (FS units, regular spiking units or directly ‘phototagged’ L6 CT units, Fig. S2C). We found that resetter neurons could have the FS waveform characteristically associated with parvalbumin-containing interneurons or could have regular-spiking waveforms (Fig. 7F and Fig. S2C). They could be found in any layer of the cortical column or in MGB. There were only two conditions that failed to reveal resetter neurons: they were never found in TRN and they were never the L6 CT neurons themselves. To distinguish between resetter neurons that might have caused CSD resets versus resetter neurons whose spikes were merely folded into an ongoing reset event, we examined the relationship between the timing of resetter neuron spikes versus the onset of the CSD reset. This analysis revealed two subpopulations of resetter neurons with spike timing that significantly lead – rather than lagged – resets in the spontaneous L2/3 CSD rhythm: resetter units in the ventral subdivision of the MGB (MGBvr) and cortical fast-spiking resetter units (CtxFSr) (Fig. 7G).

With CSD resetter neurons functionally defined, we returned to the question of how L6 CT units reset the phase of the low-frequency CSD rhythm to suppress spiking, sharpen frequency tuning and improve behavioral frequency discrimination. Although L6 CT unit spiking did not directly reset the CSD, L6 CT units could indirectly generate the delta-theta rhythm by driving MGBvr or CtxFSr units. To address this possibility, we optogenetically ‘phototagged’ L6 CT neurons and cross-correlated their spike trains with the other resetter neuron types (Fig. 7H). There was no consistent relationship between the spiking of L6 CT neurons and the MGBvr or regular spiking units in A1 (Fig. 7I). However, we found a significant correlation between L6 CT neurons and CtxFSr units. L6 CT spikes lead CtxFSr spikes by approximately 13ms, suggesting that L6 CT units could reset the CSD phase by driving CtxFSr units (Fig. 7J).

Whether spontaneously occurring, driven by sound or evoked by optogenetic activation of L6 CT neurons, CtxFSr unit spikes induced one or two cycles of a low-frequency pattern of electrical sinks and sources in L4 and L2/3 (Fig. 7K). Laser stimulation directly activated L6 CT neurons, and their activity was followed shortly thereafter by robust indirect activation of CtxFSr neurons (Fig. 7L). Interestingly, L6 CT activation had no effect on cortical FS units whose spontaneous spikes were not associated with a columnar CSD reset (CtxFSnr, Fig. 7L). During laser stimulation, when L6 CT and CtxFSr are co-active (and likely others, as well) a high gamma rhythm dominates and the 2–6Hz delta-theta rhythm is clearly defined, but relatively weak (Fig. 6B). L6 CT units cease firing immediately after the laser power falls to zero, yet CtxFSr units continue to spike for a short while thereafter (Fig. 7M). Because unopposed CtxFSr spiking induced 1–2 cycles of the delta-theta rhythm, we surmised that the low-frequency rhythm induced by the abrupt cessation of L6 CT spiking may be generated, in part, from this brief period when CtxFSr units spiking is unopposed by L6 CT spiking. Therefore, as in V1, L6 CT activation induces divisive suppression by selectively driving cortical FS units that impose strong, feedforward inhibition (Bortone et al., 2014). However, in auditory cortex, divisive suppression occurs only shortly after L6 CT units deactivate and is coincident with a low-frequency electrical rhythm that dampens spontaneous and sound-evoked spiking but enhances frequency discrimination for approximately 100 ms.

Discussion

As summarized in Figure 8, we described a L6 corticothalamic circuit (Fig. 1) that can either enhance or suppress spontaneous (Fig. 2) or sound-evoked (Fig. 3) activity, depending on the timing between A1 spikes and optogenetic activation of L6 CT neurons. We demonstrated that enhanced A1 sound-evoked responses and wider frequency tuning were correlated with improved behavioral sound detection but reduced discrimination accuracy. Conversely, suppressed A1 responses and sharper frequency tuning were associated with improved behavioral sound discrimination but elevated detection thresholds (Fig. 4). Interestingly, the strongest modulation of cortical responses and sound perception occurred just after deactivation of L6 CT units, not during their activation. We found that multiplicative-additive gain in A1 responses at a long interval following L6 CT deactivation could be inherited from a corresponding change in MGBv units, yet we did not observe a thalamic antecedent for the divisive-subtractive gain at short intervals following L6 CT deactivation (Fig. 5). To explain the suppressive modulation of auditory responses following the offset of L6 CT activation, we turned to dynamic changes in the frequency and phase of sinks and sources in the local electrical field. We found that L6 CT deactivation reset the phase of delta-theta rhythms such that the low-excitability period of the CSD was associated with suppressed sound-evoked responses, sharper frequency tuning and improved discrimination (Fig. 6). To identify the neural circuit underlying delta-theta reset, we first characterized hundreds of single neurons with spontaneous spike events associated with strong phase alignment and enhanced low-frequency CSD signal amplitude. Among our heterogeneous sample of “resetter” neurons, the cortical FS neuron was the only type that spiked before CSD reset and was strongly driven by L6 CT neurons (Fig. 7). We concluded that the divisive-subtractive gain observed shortly after L6 CT deactivation could be attributed to a delta-theta phase reset that arose from the interaction of L6 CT and cortical FS neurons (see also (Carracedo et al., 2013).

Figure 8. Summary of findings supporting a contribution of L6 CT neurons to perceptual modes of heightened detection or discrimination.

Figure 8

Unit recording traces are arranged to illustrate the main effects described in previous figures. Left column, in a baseline condition with minimal spiking activity in L6 CT and FS resetter neurons, the power of low-frequency CSD rhythms is weak and sound-evoked spiking in thalamic and cortical principal neurons is moderate. Intense firing of L6 CT neurons engages FS resetter interneurons that increase the power and reset the phase of low-frequency rhythms. Middle columns, at short delays following an intense volley of spikes in L6 CT and FS resetter neurons, the induced cortical delta-theta rhythm is at a positive, low-excitability phase and sound-evoked spikes are suppressed in A1, but in the MGBv. Right column, at longer delays following a volley of spikes in L6 CT and FS resetter neurons, the phase of the cortical delta-theta rhythm has rotated to a negative, high-excitability phase and sound-evoked spikes are greatly enhanced both in MGBv and A1. Modulating the excitability of cortical neurons has predictable effects on sensory tuning across ensembles of A1 neurons and behavioral measures of sound perception, illustrated as the resolution of two objects on a sonar display. Suppressed sound-evoked A1 activity shortly following the volley of spikes in the cortical reset network dampens excitability but reduces overlap between neighboring tuning functions, supporting enhanced discrimination of sound frequencies but reduced auditory sensitivity (middle column). Enhanced sound-evoked spiking scales up excitability and increases the overlap between neighboring tuning functions, resulting in enhanced sensitivity to sound at the expense of reduced frequency discriminability (right column).

Distinct signatures of neuromodulation in A1 as compared to other cortical areas

L6 CTs are glutamate-releasing pyramidal neurons (Bortone et al., 2014; Bourassa and Deschenes, 1995; Zhang and Deschenes, 1997). In V1, the net effect of L6 CT activation is to suppress spiking via disynaptic inhibition from GABAergic FS interneurons (Bortone et al., 2014; Olsen et al., 2012). As an exception to this rule, L6 CT neurons directly excite neurons in L5a in both V1 and barrel cortex, though again, L6 CT activation mediated a net inhibitory effect outside of L5a (Kim et al., 2014). By contrast, the net effect of activating L6 CT neurons in all layers of A1 is excitatory. This sign reversal seems unlikely to simply reflect the absence of di-synaptic inhibition evoked by L6 CT neurons, as we also observed strong activation of FS “resetting” interneurons during laser stimulation, (Fig. 7L). Apparently, in A1, the combined effect of feedforward excitatory inputs from the L6 CTs and di-synaptic inhibition from CtxFSr neurons tips towards net excitation across the column, while in V1 it tips towards net suppression. This may reflect subtle but important differences in the set point of local inhibitory networks between the two brain areas, as has also been suggested from the opposite effects of locomotion on sensory-evoked responses in A1 and V1 (Fu et al., 2014; McGinley et al., 2015; Niell and Stryker, 2010; Schneider et al., 2014; Zhou et al., 2014).

In many respects, A1 suppression at a short delay following L6 CT deactivation resembled the modulation in V1 during L6 CT activation: the modulation had a clear divisive component associated with strong activation a specialized subtype of FS interneuron (Olsen et al., 2012). In V1, L6 CT neurons suppress visual processing throughout the column by driving L6 FS interneurons with vertically oriented, translaminar axon fields (Bortone et al., 2014). In A1, the suppressive effect of L6 CT deactivation was also specific to a subtype of FS interneuron (Fig. 7L) that also changed network excitability across the entire column, though there was no indication in our data that these FS neurons were restricted to L6 (Fig. 7G).

A L6 CT circuit for resetting the phase of low-frequency rhythms and dynamically regulating stimulus salience

Expectation can rise and fall over time. Stimuli that arrive at expected intervals are more rapidly and accurately processed than stimuli that occur at unexpected intervals (Buran et al., 2014; Jaramillo and Zador, 2011; Jones et al., 2002; Nobre et al., 2007; Wright and Fitzgerald, 2004). A time-dependent modulation of neural and perceptual salience may arise from fluctuations in the underlying low-frequency cortical electric field because not only is perceptual salience modulated by oscillation phase, but phase itself can be proactively controlled through attention and expectation (Luo and Poeppel, 2007; Schroeder and Lakatos, 2009). By resetting the phase at opportune moments, attended stimulus features can benefit from enhanced processing at the high-excitability phase of an oscillation while distractors can be suppressed by aligning the timing of their occurrence to the low-excitability phase (Henry and Obleser, 2012; Lakatos et al., 2008). By resetting the phase of nested cortical oscillators, the temporal signature of a target speaker can be perceptually enhanced and neurophysiologically segregated from the temporal signature of a competing speakers’ speech (Ghitza, 2011; Giraud and Poeppel, 2012; Giraud et al., 2007; Zion Golumbic et al., 2013). Modulation through phase entrainment is not only achieved in time, but also in space; for example, attending to a stream of tones at a fixed frequency while ignoring distractor tones at another frequency sets up regional pockets of low-frequency oscillations where the high-excitability region is in-phase throughout the target region of the tonotopic map but in counter-phase in map regions that encode distractor frequencies (Lakatos et al., 2013; O’Connell et al., 2014).

While there is general agreement that the phase of ongoing delta and theta oscillations can be adaptively reset either through high-contrast bottom-up stimulus features or top-down executive control signals, the neural mechanism of phase reset is unknown. By computing the spike-triggered phase for hundreds of individual neurons, we identified a sub-network of neurons distributed throughout middle and deep layers of the A1 column and MGB that exert a strong influence over the amplitude and phase of delta-theta rhythms. This analysis suggested two overlapping networks: First, a bottom-up network involving neurons in the MGBv and cortical FS neurons that reset the phase shortly following the onset of a tone close to their preferred frequency; Second, a separate network also involving cortical FS neurons that are driven by L6 CT neurons. As an essential node in a deep-layer modulatory circuit, L6 CT neurons may be able to resolve the competing demands of detection and discrimination by coordinating their spiking at opportune moments during analysis of a sensory scene. In this regard, it is not surprising that the strongest effects of L6 CT activation on sensory processing and perception occurred after a volley of L6 CT spikes had ended, not while L6 CTs were activated. Anticipatory listening would require some form of structured neural activity to reliably precede predicted inputs. Future work will determine whether L6 CT neurons fulfill that role.

STAR Methods

Contact for Reagent and Resource Sharing

Requests for further information should be directed to and will be fulfilled by the Lead Contact, Daniel Polley (daniel_polley@meei.harvard.edu).

Experimental Model and Subject Details

All procedures were approved by the Massachusetts Eye and Ear Infirmary Animal Care and Use Committee and followed the guidelines established by the National Institutes of Health for the care and use of laboratory animals. Both male and female mice were used in this study. All mice were maintained under light (7am – 7pm) and dark (7pm – 7am) cycle conditions with ad libitum access to food and water. Animals chronically implanted with headplates were housed individually. Age-matched litter mates were randomly assigned to experimental groups.

For L6 CT experiments, we used 58 hemizygous Ntsr1-Cre transgenic mice, aged 6–12 weeks (B6.FVB(Cg)-Tg(Ntsr1-Cre)GN220Gsat/Mmcd). For comparisons of laser-evoked cortical activity from other cell types, we used an additional two PV-Cre:Ai32 mice, two ChAT-Cre:Ai32 mice, and two wild type mice expressing CamKIIα-hChR2.

Methods Details

Virus-mediated gene delivery

Mice of either sex aged 6–7 weeks were anesthetized with ketamine (100 mg/kg) and xylazine (10 mg/kg). A surgical plane of anesthesia was maintained throughout the procedure with supplements of ketamine (50 mg/kg) as needed. The animal’s body temperature was maintained near 36.5° C using a homeothermic blanket system (Fine Science Tools). The surgical area was numbed with a subcutaneous injection of lidocaine (5 mg/mL). An incision was made on the right side of the scalp to expose the skull around the caudal end of the temporal ridge, where the caudomedial end of the temporalis muscle joins to the skull. The temporal ridge provides a reliable cranial landmark for core fields of the auditory cortex. We made 2–3 burr holes along the temporal ridge, spanning a region 1.0–2.0 mm rostral to the lambdoid suture. At each burr hole, 0.3 – 0.5 μl of either AAV5-EF1α-DIO-hChR2(E123T/T159C)-mCherry, AAV5-CAG-FLEX-ArchT-tdTomato or AAV5-FLEX-tdTomato solution was injected into the cortex 450 μm below the pial surface at 0.05 – 0.1 μl/min using a motorized injector (Stoelting Co.). For animals undergoing behavioral assessments, 0.6 μl of virus solution was injected into both the left and right auditory cortex. Following the procedure, antibiotic ointment was applied to the wound margin and an analgesic was administered (Buprenex, 0.05 mg/kg). Neurophysiology and behavior experiments began 3–4 weeks following virus injection.

Preparation for awake head-fixed recordings

Mice were once again brought to a surgical plane of anesthesia, using the same protocol for general anesthesia, local anesthesia and body temperature control described above. The periosteum overlying the dorsal surface of the skull was thoroughly removed. The skull surface was prepared with 70% ethanol and etchant (C&B Metabond). A titanium head plate was then cemented to the skull, centered on Bregma. After recovery, animals were housed individually. Animals were given at least 48 hours to acclimate to the head plate before any further experiments.

Before the first recording session, animals were briefly anesthetized with isoflurane (1.5% in oxygen) while a small craniotomy (0.5 × 1.0 mm, medial-lateral × rostral-caudal) was made along the caudal end of the right temporal ridge, 1mm rostral to the lambdoid suture to expose A1. A small chamber was built around the craniotomy with UV-cured cement and filled with ointment. At the end of each recording session, the chamber was flushed, filled with fresh ointment, and sealed with UV-cured cement. The chamber was removed and rebuilt under isoflurane anesthesia before each subsequent recording session. Typically, 4–7 recording sessions were performed on each animal over the course of 1–2 weeks. For dual A1/MGB or A1/TRN recordings, a second craniotomy and chamber provided access to the MGB (1mm rostral to the lambdoid suture, 2–3 mm lateral to midline) or the TRN (2 mm rostral to the MGB craniotomy).

Neurophysiology

On the day of recording, the head was immobilized by attaching the head plate to a rigid clamp (Altechna). The body rested atop a disk, coated with a sound-attenuating polymer that was mounted on a low-friction, silent rotor. We continuously monitored the eyelid and status of the rotating disk to confirm that all recordings were made in the awake condition.

For columnar recordings, a single-shank linear silicon probe (NeuroNexus A1x16-100-177-3mm) was inserted into the auditory cortex craniotomy perpendicular to the brain surface using a micromanipulator (Narishige) and a hydraulic microdrive (FHC) with the tip of the probe positioned approximately 1.3 mm below the brain surface, such that the top 2 electrode contacts were outside the brain, the bottom 2 contacts were in the white matter or hippocampus, and the middle 11–12 contacts spanned all six layers of the auditory cortex. At the beginning of the first recording session, several penetrations were made along the caudal-rostral extent of the craniotomy to locate the high-frequency reversal of the tonotopic gradient that demarcates the rostral boundary of mouse A1 (Hackett et al., 2011). For dual recordings, a second silicon probe (NeuroNexus A1X16-50-177-5mm) was inserted into MGB or TRN using a dorsal approach. To identify the MGBv on any given day of recording, we first recorded lateral to the MGB, in the hippocampus, and then progressively marched the electrode medial in 0.1mm steps until we had at least eight contiguous channels with noise-evoked spiking activity. By validating this approach in pilot experiments through electrolytic lesion reconstructions in post-mortem tissue (data not shown), we were assured of recording from the lateral bank of the MGB, which contains the MGBv and, depending on the caudal-rostral coordinates, might also contain recording sites in the dorsal subdivision or suprageniculate nucleus (Hackett et al., 2011). MGBv recordings were limited to the most ventral recordings sites (2.6–3.0 mm below the brain surface) to exclude recordings from the dorsal subdivision or suprageniculate nucleus. For TRN recordings, units were classified as putative auditory TRN units only if they were both sound responsive and exhibited a thin spike waveform (peak-to-trough delay less than 0.4 ms, Supplemental Fig. 2c). Based on the frequency tuning of the recorded MGB or TRN units, the location of the cortical probe was positioned within the A1 tonotopic gradient to maximize the overlap between cortical and thalamic receptive fields (Fig. 5c).

Optogenetic and acoustic stimulation for neurophysiology recordings

Digital waveforms for the laser command signal and acoustic stimuli were generated with a 24-bit digital-to-analog converter (PXI, National Instruments) using scripts programmed in MATLAB (MathWorks) and LabVIEW (National Instruments). Stimuli were presented via a free-field electrostatic speaker positioned 10cm from left ear canal (Tucker-Davis Technologies). Free-field stimuli were calibrated before recording using a wide-band ultrasonic acoustic sensor (Knowles Acoustics, model SPM0204UD5). The optical signal was generated with a calibrated 473 nm diode laser for ChR2 experiments (LuxX, Omicron) or a 532 nm DPSS laser for ArchT experiments (LaserGlow), coupled to an optic fiber. The fiber tip was positioned approximately 1cm above the exposed surface of A1.

Once the silicon probe was positioned in an A1 column, we estimated the laminar position of each electrode from the CSD pattern evoked by broadband noise bursts (50 ms duration, 4 ms onset/offset cosine ramps, 1 s interstimulus interval, 70 dB SPL, 100 repetitions; see analysis of local field potential and current source density). Frequency response areas (FRAs) from all recording sites were delineated from pure tone pips (50 ms duration, 4 ms onset/offset cosine ramps, 0.5 s interstimulus interval, 4–45 kHz with 0.1 octave steps, 0–60 dB SPL with 5 dB steps, 2 repetitions of each stimulus, pseduorandomized). Based on the FRAs of all recorded units across layers, we chose a single suprathreshold sound level, normally 40–60 dB SPL, for subsequent measures of the iso-level frequency tuning function. We activated or inactivated L6 CT neurons using laser light at various intensities (400 ms duration, 2 s inter-stimulus interval, 5–50 mW at the fiber tip in 5 mW steps, 10 repetitions of each stimulus, pseudorandomized). Laser power at the fiber tip was calibrated with a power meter (Thorlabs). The effective laser power at a given point in the cortex was lower than the level calibrated at the optic fiber tip.

To investigate the modulatory effect of L6 CT neuronal activity on columnar sound processing, L6 CT neurons were activated with laser (either 400 ms or 50 ms duration, 3.5 s inter-stimulus interval, either at 20 mW or 5mW above the minimally effective laser power), while pure tone stimuli were presented alone or at various delays with respect to the onset of the laser stimulus (0–800 ms in 50 ms steps, 15 repetitions of each delay, pseudorandomized with a 2 s interval separating each trial). In a subset of experiments described in Fig. 6D–E, we varied the duration of the laser stimuli (10 – 400 ms in octave steps) while holding the intensity constant (20 mW).

Analysis of extracellular unit recordings

Raw signals were digitized at 32-bit, 24.4 kHz (RZ5 BioAmp Processor; Tucker-Davis Technologies) and stored in binary format. In order to eliminate potential movement-generated artifacts, the common mode signal (channel-averaged neural traces) was subtracted from all channels. In experiments where simultaneous recordings were made from probes in cortex and thalamus, the common mode removal was performed separately for each probe. Electrical signals were notch filtered at 60Hz, then band-pass filtered (300–3000 Hz, second order Butterworth filters), from which the multiunit activity (MUA) was extracted as negative deflections in the electrical trace with an amplitude exceeding 4 s.d. of the baseline hash. Single units were separated from the MUA using a wavelet-based spike sorting package (wave_clus). Single unit isolation was confirmed based on the inter-spike-interval histogram (less than 3% of the spikes in the 0–3 ms bins) and the consistency of the spike waveform (s.d of peak-to-trough delay of spikes within the cluster less than 0.2 ms). The average trough-to-peak delay from each single unit formed a clear bi-modal distribution (Supplemental Fig. 2), allowing us to further divide our recordings into fast-spiking and regular-spiking units (FS units < than 0.4 ms; RS units > 0.4 ms).

Frequency tuning from MUA or single unit recordings was quantified as the average evoked firing rate measured 10–60 ms following tone onset. By fitting a linear regression model between the tone-evoked firing rates in the tone-alone and the tone plus laser conditions, we could estimate the type of modulation from the slope and the y-intercept of the linear fit. The slope indicates the multiplicative gain of the change; the y-intercept indicates the baseline offset. Only the y-intercept values were used if data was not well fit by linear regression (p > 0.05). The cross-correlograms between simultaneously recorded SUs were calculated with up to 250 ms lag time. If neuron A’s activity consistently leads neuron B’s, the peak of their cross-correlogram had a positive lag time. Therefore, we computed the averaged cross-correlation between 0– 50 ms to estimate the direct excitatory drive from neuron A to neuron B.

Analysis of the local field potential and current source density

To extract local field potentials, raw signals were notch filtered at 60 Hz and down-sampled to 1000 Hz. To eliminate potential artifacts introduced by impedance mismatching across recording channels, signals were spatially smoothed along the channels with a triangle filter (5-point Hanning window). The CSD was calculated as the second spatial derivative of the local field potential signal. The frequency components of the CSD signal were analyzed by calculating the Thomson’s multitaper power spectral density estimate. For each trial, a set of three spectra were each estimated from three 400-ms windows (0–400 ms before laser, during the 400 ms laser, and 0–400 ms following laser offset). The spectral gain during laser and after laser were calculated using the before-laser spectrum as the baseline.

Noise-evoked columnar CSD patterns were used to determine the location of the A1 recording channel. Two CSD signatures were used to identify L4: A brief current sink first occurs approximately 10 ms after the noise onset, which was used to determine the lower border of L4 (Kaur et al., 2005). A tri-phasic CSD pattern (sink-source-sink from upper to lower channels) occurs between 20 ms and 50 ms, where the border between the upper sink and the source was used to define the upper boundary of L4. Normally, 2 channels were assigned to L4. Other layers were defined relative to the location of L4 (L2/3: 3 channels above L4; L5: 3 channels below L4; L6: 3 channels below L5). CSD-derived layer assignments were cross-validated against sound-evoked MUA response patterns, where L4 and L5 units responded with higher firing rates and shorter latency. CSD traces were bandpass filtered (2–6 Hz, 2nd order Butterworth filters) to obtain the activity in the delta-theta band. The temporal delay caused by filtering was identified using the cross-correlogram between the original and filtered CSDs, and corrected by time-shifting the filtered signal. The instantaneous phase and amplitude of the CSD were calculated from its analytical signal using the Hilbert transform.

To calculate spike-triggered CSD amplitude, we computed the average columnar CSD waveform 250 ms before and after a spike occurrence for a reference single unit. Spike-triggered CSD phase was computed similarly, except that phase trajectories in the delta-theta band were analyzed rather than CSD amplitude. To classify single units as resetters or non-resetters, we compiled a histogram of the spike-triggered L2/3 CSD phase trajectories 100 ms before and after a reference spike. We then calculated the vector strength for each distribution and operationally defined resetters as single units associated with an increase in vector strength ≥ 0.05 (n = 184 single units). The phase delay of any resetter neuron was defined as the lag time between the spike and the trough of the first current sink in the L2/3 CSD.

Chronic optic fiber implantation

Once mice were brought to a stable anesthetic plane, we positioned an implantable optic fiber assembly atop craniotomies made over the left and right auditory cortex (n=5 mice). The fiber tips were lowered until they rested on the brain surface before the assemblies were fixed into place with dental cement (C & B Metabond). The animals were given buprenex and antibiotic ointment post operation. At least 48 hours of recovery time were given before any experiment were performed on implanted animals.

Behavioral training and testing

Behavioral testing occurred in an acoustically transparent enclosure (20 × 15 × 30 cm, L × W × H) bisected into two virtual zones resting atop electrified flooring (8 pole scrambled shocker, Coulbourn Instruments). Mouse position was tracked with a commercial webcam. The acoustic, laser, and foot-shock signals were all generated on a National Instruments PXI system using scripts programmed in LabVIEW. Auditory stimuli were delivered through a calibrated free-field speaker positioned above the apparatus to provide a relatively homogenous sound field (Tucker-Davis Technologies). Laser stimuli were generated by a pair of calibrated diode lasers, coupled to the animal with flexible, lightweight patch cables. Mice were given at least 5 min to acclimate to the apparatus and cable tethering before each day of training or testing.

Mice were initially shaped to cross between each zone of the chamber in order to terminate a foot shock (0.1–0.5 mA, chosen to be the minimally effective intensity for each mouse). Foot shock terminated after 10s or upon crossing sides of the avoidance chamber, whichever occurred first. With conditioned crossing behavior established, mice were then trained to associate the target sound (14 kHz, 50 ms tone bursts with 4 ms onset/offset cosine ramps, repetition rate 2.5 Hz, 6 s total duration, 70 dB SPL) with onset of foot shock. Mice learned that they could avoid a foot shock by crossing sides of the chamber before the 6 s stimulus period ended. Crossing during this 6 s period was defined as a hit. Animals were trained with blocks of 10 target trials with randomized inter-trial intervals set to 40 – 50 sec.

Once the animal’s hit rate exceeded 60% in target-only blocks, the shaping procedure would switch to the foil blocks, in which the animal was presented with a train of 8 kHz tone pips that did not predict the onset of shock. Crossing behavior during foil tones was defined as false alarms. No punishment was given on false alarm trials. Similarly, animals were trained with blocks of 10 foil trials with randomized inter-trial intervals between 40 to 50 seconds. Training continued until the false alarm rate dropped to below 40%, after which the training would switch back to the target blocks. Target and foil shaping blocks would alternate whenever the animal’s performance crossed the threshold (higher than 60% hits, lower than 40% false alarms). As the animal reached this stage and its performance d’ exceeded 1.0, the animal would be trained with blocks of interleaved target and foil trials (10 targets, 10 foils) until the performance d’ plateaued. Once shaping was complete, we determined whether optogenetic activation of L6 CT neurons alone was enough to create a percept that generated a crossing response. This was achieved by delivering laser pulses with the same temporal structure as the auditory stimuli (20 trials consisting of 50 ms pulses with 4 ms onset/offset cosine ramps, repetition rate 2.5 Hz, 6 s duration, 10 mW).

The testing phase consisted of two sets of experiments. The first set of experiments investigated whether L6 CT activation influenced detection thresholds. Four conditions of the laser stimuli were used: three different onset delays (0, 100, and 200 ms for each laser-tone pair) and a control condition with no laser. Target (14 kHz) and foil (8 kHz) frequencies matched the shaping phase but tones were presented across a range of sound levels (0–60 dB SPL in 20 dB steps) and mice were allowed 10s to cross rather than 6s. The second set of behavioral experiments investigated L6 CT modulation of tone discrimination performance. These testing blocks used a single tone level (40 dB SPL), but the frequency separation between the target (14 kHz) and foil tone was decreased from 40% (8 kHz) to 20%, 10% or 5% (foil frequencies: 11.2 kHz, 12.6 kHz, and 13.3 kHz). Foot shock reinforcement was not used during test blocks to avoid learning effects. Trials were randomized across all stimulus conditions (target/foil, sound level, laser delays). Each unique permutation was presented 15 times to generate the complete set of psychometric functions.

For each animal, the discrimination index d’ at any stimulus condition was calculated as z(hit rate) – z(false alarm rate). We fitted every psychometric function with a generalized linear model with a binomially distributed outcome. The threshold for detection or false alarm was derived from the estimated model, and compared across conditions with data from all mice. From the models, sound levels associated with a 50% hit rate were defined as detection thresholds. The target/foil frequency separation associated with a 50% false alarm rate was defined as the discrimination threshold.

Anatomy

Ntsr1-Cre mice were crossed with the Cre-dependent tdTomato reporter line, Ai14 (B6;129S6-Gt(ROSA)26Sortm14(CAG-tdTomato)Hze/J; stock number 007908). Double-transgenic offspring of either sex aged 6–7 weeks were anesthetized with ketamine (100 mg/kg) and xylazine (10 mg/kg). A surgical plane of anesthesia was maintained throughout the procedure with supplements of ketamine (50 mg/kg) as needed. For both hemispheres, silicon probe recordings were made in MGB using a dorsal approach to identify the location of MGBv using a similar approach described above for unit recordings in head-fixed awake recordings. We then injected 0.3 μl of green retrobeads (LumaFluor Inc.) into the MGBv at 0.05 – 0.1 μl/min using a motorized injector (Stoelting Co.).

After allowing 7 days for retrograde transport, mice were deeply anesthetized with ketamine and prepared for transcardial perfusion with a 4% formalin solution in 0.1M phosphate buffer. The brains were extracted and post-fixed at room temperature for an additional 12 hours before transfer to 30% sucrose solution. Coronal sections of the brain (40 μm thick) were sectioned with a cryostat. Tissue was immunolabeled for NeuN (rabbit anti-NeuN, abcam) and visualized with Alexafluor 405 (goat anti-rabbit IgG, life tech) to identify neuronal population.

For each injected hemisphere, the MGBv was examined for retrobeads and only cases showing properly positioned injections were included. Two sections from the auditory cortex were selected for quantification. To quantify co-localization of beads in Ntsr1+ neurons, sections were imaged with 3D deconvolution epifluorescence microscopy (Leica) and a 200x200 μm stereotactic plane was established in L6 from the median projection of the stack, with its bottom edge 100 μm above the white matter. Only NeuN+ cells with soma completely contained inside the imaging boundary were further classified for co-localization of markers for Ntsr1 and beads. Cells with NeuN staining as well as the tdTomato fluorescent marker were classified as Ntsr1+ neurons. Cells with at least 3 pixels above threshold in the green channel that were located within the boundary of the NeuN+ somatic compartment were classified as bead+ CT neurons.

Quantification and Statistical Analysis

All statistical analysis was performed with Matlab (Mathworks). A complete reporting of all statistical tests and outcomes is provided in Table S1. Descriptive statistics are reported as mean ± SEM unless otherwise indicated. In cases where the same data sample was used for multiple comparisons, we used the Bonferroni correction to adjust for the increased probability of Type-I error. Non-parametric statistical tests were used in select cases where data samples did not meet the assumptions of parametric statistical tests. Statistical significance was defined as P < 0.05.

Data and Software Availability

Data acquisition and analysis were performed with custom scripts in Matlab (Mathworks). Wave_clus, the software package used in this study for spike sorting, can be found at http://www2.le.ac.uk/departments/engineering/research/bioengineering/neuroengineering-lab/spike-sorting. Requests for data and custom scripts used in this study can be directed to the lead author (daniel_polley@meei.harvard.edu).

Supplementary Material

1
2

Table S1. (related to Figures 18) Results for statistical tests.

Acknowledgments

We thank Ken Hancock for support with data collection software. A Tesla K20 GPU was donated to WG by the NVIDIA Corporation. This work was supported by NIH grant R21012894 (DBP) and a HHMI international student research fellowship (WG).

Footnotes

Author Contributions

WG and ABM collected data. ARC performed pilot experiments. WG analyzed the data. WG, ARC and DBP designed the experiments. WG and DBP wrote the manuscript.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Bortone DS, Olsen SR, Scanziani M. Translaminar inhibitory cells recruited by layer 6 corticothalamic neurons suppress visual cortex. Neuron. 2014;82:474–485. doi: 10.1016/j.neuron.2014.02.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bourassa J, Deschenes M. Corticothalamic projections from the primary visual cortex in rats: a single-fiber study using biocytin as an anterograde tracer. Eur J Neurosci. 1995;7:19–30. doi: 10.1016/0306-4522(95)00009-8. [DOI] [PubMed] [Google Scholar]
  3. Briggs F, Kiley CW, Callaway EM, Usrey WM. Morphological substrates for parallel streams of corticogeniculate feedback originating in both V1 and V2 of the macaque monkey. Neuron. 2016;90:388–399. doi: 10.1016/j.neuron.2016.02.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Buran BN, von Trapp G, Sanes DH. Behaviorally gated reduction of spontaneous discharge can improve detection thresholds in auditory cortex. J Neurosci. 2014;34:4076–4081. doi: 10.1523/JNEUROSCI.4825-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Carracedo LM, Kjeldsen H, Cunnington L, Jenkins A, Schofield I, Cunningham MO, Davies CH, Traub RD, Whittington M. A neocortical delta rhythm facilitates reciprocal interlaminar interactions via nested theta rhythms. J Neurosci. 2013;33:10750–10761. doi: 10.1523/JNEUROSCI.0735-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Chance F, Abbott L. Divisive inhibition in recurrent networks. Network. 2000;11:119–129. [PubMed] [Google Scholar]
  7. Crandall SR, Cruikshank SJ, Connors BW, Crandall SR, Cruikshank SJ, Connors BW. A corticothalamic switch: controlling the thalamus with dynamic synapses. Neuron. 2015;86:768–782. doi: 10.1016/j.neuron.2015.03.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. David SV, Fritz JB, Shamma SA. Task reward structure shapes rapid receptive field plasticity in auditory cortex. Proc Natl Acad Sci. 2012;109:2144–2149. doi: 10.1073/pnas.1117717109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Denman DJ, Contreras D. Complex effects on in vivo visual responses by specific projections from mouse cortical layer 6 to dorsal lateral geniculate nucleus. J Neurosci. 2015;35:9265–9280. doi: 10.1523/JNEUROSCI.0027-15.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Froemke RC, Carcea I, Barker AJ, Yuan K, Seybold BA, Martins ARO, Zaika N, Bernstein H, Wachs M, Levis PA, et al. Long-term modification of cortical synapses improves sensory perception. Nat Neurosci. 2013;16:79–88. doi: 10.1038/nn.3274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Fu Y, Tucciarone JM, Espinosa JS, Sheng N, Darcy DP, Nicoll Ra, Huang ZJ, Stryker MP. A cortical circuit for gain control by behavioral state. Cell. 2014;156:1139–1152. doi: 10.1016/j.cell.2014.01.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Ghitza O. Linking speech perception and neurophysiology: Speech decoding guided by cascaded oscillators locked to the input rhythm. Front Psychol. 2011;2:1–13. doi: 10.3389/fpsyg.2011.00130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Giraud A-L, Poeppel D. Cortical oscillations and speech processing: emerging computational principles and operations. Nat Neurosci. 2012;15:511–517. doi: 10.1038/nn.3063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Giraud AL, Kleinschmidt A, Poeppel D, Lund TE, Frackowiak RSJ, Laufs H. Endogenous cortical rhythms determine cerebral specialization for speech perception and production. Neuron. 2007;56:1127–1134. doi: 10.1016/j.neuron.2007.09.038. [DOI] [PubMed] [Google Scholar]
  15. Gong S, Doughty M, Harbaugh CR, Cummins A, Hatten ME, Heintz N, Gerfen CR. Targeting Cre recombinase to specific neuron populations with bacterial artificial chromosome constructs. J Neurosci. 2007;27:9817–9823. doi: 10.1523/JNEUROSCI.2707-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Guo W, Hight AE, Chen JX, Klapoetke NC, Hancock KE, Shinn-Cunningham BG, Boyden ES, Lee DJ, Polley DB. Hearing the light: neural and perceptual encoding of optogenetic stimulation in the central auditory pathway. Sci Rep. 2015;5:10319. doi: 10.1038/srep10319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hackett Ta, Clause AR, Takahata T, Hackett NJ, Polley DB. Differential maturation of vesicular glutamate and GABA transporter expression in the mouse auditory forebrain during the first weeks of hearing. Brain Struct Funct. 2015;221:2619–2673. doi: 10.1007/s00429-015-1062-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Henry MJ, Obleser J. Frequency modulation entrains slow neural oscillations and optimizes human listening behavior. Proc Natl Acad Sci U S A. 2012;109:2009–2100. doi: 10.1073/pnas.1213390109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ito T, Bishop DC, Oliver DL. Expression of glutamate and inhibitory amino acid vesicular transporters in the rodent auditory brainstem. J Comp Neurol. 2011;519:316–340. doi: 10.1002/cne.22521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Jaramillo S, Zador AM. The auditory cortex mediates the perceptual effects of acoustic temporal expectation. Nat Neurosci. 2011;14:246–251. doi: 10.1038/nn.2688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Jones EG. The Thalamus. Cambridge University Press; 2007. [Google Scholar]
  22. Jones MR, Moynihan H, Mackenzie N, Puente J. Temporal aspects of stimulus-driven attending in dynamic arrays. Am Psychol Soc. 2002;13:313–319. doi: 10.1111/1467-9280.00458. [DOI] [PubMed] [Google Scholar]
  23. Kajikawa Y, Schroeder CE. How Local Is the Local Field Potential? Neuron. 2011;72:847–858. doi: 10.1016/j.neuron.2011.09.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kaur S, Rose HJ, Lazar R, Liang K, Metherate R. Spectral integration in primary auditory cortex: Laminar processing of afferent input, in vivo and in vitro. Neuroscience. 2005;134:1033–1045. doi: 10.1016/j.neuroscience.2005.04.052. [DOI] [PubMed] [Google Scholar]
  25. Kayser C, Wilson C, Safaai H, Sakata S, Panzeri S. Rhythmic auditory cortex activity at multiple timescales shapes stimulus-response gain and background firing. J Neurosci. 2015;35:7750–7762. doi: 10.1523/JNEUROSCI.0268-15.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kim J, Matney CJ, Blankenship A, Hestrin S, Brown SP. Layer 6 corticothalamic neurons activate a cortical output layer, layer 5a. J Neurosci. 2014;34:9656–9664. doi: 10.1523/JNEUROSCI.1325-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kuchibhotla KV, Gill JV, Lindsay GW, Papadoyannis ES, Field RE, Sten TAH, Miller KD, Froemke RC. Parallel processing by cortical inhibition enables context-dependent behavior. Nat Neurosci. 2016;20:1–14. doi: 10.1038/nn.4436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Lakatos P, Karmos G, Mehta AD, Ulbert I, Schroeder CE. Entrainment of neuronal oscillations as a mechanism of attentional selection. Science. 2008;320:110–113. doi: 10.1126/science.1154735. [DOI] [PubMed] [Google Scholar]
  29. Lakatos P, Musacchia G, O’Connel MN, Falchier AY, Javitt DC, Schroeder CE. The spectrotemporal filter mechanism of auditory selective attention. Neuron. 2013;77:750–761. doi: 10.1016/j.neuron.2012.11.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Lee CC, Lam YW, Sherman SM. Intracortical convergence of layer 6 neurons. Neuroreport. 2012;23:736–740. doi: 10.1097/WNR.0b013e328356c1aa. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Letzkus JJ, Wolff SBE, Meyer EMM, Tovote P, Courtin J, Herry C, Luthi A. A disinhibitory microcircuit for associative fear learning in the auditory cortex. Nature. 2011;480:331–335. doi: 10.1038/nature10674. [DOI] [PubMed] [Google Scholar]
  32. Llano DA, Sherman SM. Evidence for nonreciprocal organization of the mouse auditory thalamocortical-corticothalamic projection systems. J Comp Neurol. 2008;507:1209–1227. doi: 10.1002/cne.21602. [DOI] [PubMed] [Google Scholar]
  33. Luo H, Poeppel D. Phase patterns of neuronal responses reliably discriminate speech in human auditory cortex. Neuron. 2007;54:1001–1010. doi: 10.1016/j.neuron.2007.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Marlin BJ, Mitre M, D’amour JA, Chao MV, Froemke RC. Oxytocin enables maternal behaviour by balancing cortical inhibition. Nature. 2015;520:499–504. doi: 10.1038/nature14402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. McGinley MJ, David SV, McCormick DA. Cortical membrane potential signature of optimal states for sensory signal detection. Neuron. 2015;87:179–192. doi: 10.1016/j.neuron.2015.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Mease RA, Krieger P, Groh A. Cortical control of adaptation and sensory relay mode in the thalamus. Proc Natl Acad Sci U S A. 2014;111:6798–6803. doi: 10.1073/pnas.1318665111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Mesgarani N, Chang EF. Selective cortical representation of attended speaker in multi-talker speech perception. Nature. 2013;485:233–236. doi: 10.1038/nature11020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Muller-Preuss P, Mitzdorf U. Functional anatomy of the inferior colliculus and the auditory cortex: current source density analyses of click-evoked potentials. Hear Res. 1984;16:133–142. doi: 10.1016/0378-5955(84)90003-0. [DOI] [PubMed] [Google Scholar]
  39. Niell CM, Stryker MP. Modulation of visual responses by behavioral state in mouse visual cortex. Neuron. 2010;65:472–479. doi: 10.1016/j.neuron.2010.01.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Nobre A, Correa A, Coull J. The hazards of time. Curr Opin Neurobiol. 2007;17:465–470. doi: 10.1016/j.conb.2007.07.006. [DOI] [PubMed] [Google Scholar]
  41. O’Connell MN, Falchier A, McGinnis T, Schroeder CE, Lakatos P. Dual mechanism of neuronal ensemble inhibition in primary auditory cortex. Neuron. 2011;69:805–817. doi: 10.1016/j.neuron.2011.01.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. O’Connell MN, Barczak A, Schroeder CE, Lakatos P. Layer specific sharpening of frequency tuning by selective attention in primary auditory cortex. J Neurosci. 2014;34:16496–16508. doi: 10.1523/JNEUROSCI.2055-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Olsen SR, Bortone DS, Adesnik H, Scanziani M. Gain control by layer six in cortical circuits of vision. Nature. 2012;482:47–52. doi: 10.1038/nature10835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Phillips EAK, Hasenstaub AR. Asymmetric effects of activating and inactivating cortical interneurons. eLife. 2016;5:e18383. doi: 10.7554/eLife.18383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Pi HHJ, Hangya B, Kvitsiani D, Sanders JJI, Huang ZJ, Kepecs A. Cortical interneurons that specialize in disinhibitory control. Nature. 2013;503:521–524. doi: 10.1038/nature12676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Pinto L, Goard MJ, Estandian D, Xu M, Kwan AC, Lee S-H, Harrison TC, Feng G, Dan Y. Fast modulation of visual perception by basal forebrain cholinergic neurons. Nat Neurosci. 2013;16:1857–1863. doi: 10.1038/nn.3552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Polley DB, Steinberg EE, Merzenich MM. Perceptual learning directs auditory cortical map reorganization through top-down influences. J Neurosci. 2006;26:4970–4982. doi: 10.1523/JNEUROSCI.3771-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Ramon y Cajal S. The structure and connexions of neurons. Nobel Lect Physiol or Med. 1906;1901–1921:220–253. [Google Scholar]
  49. Schneider DM, Nelson A, Mooney R. A synaptic and circuit basis for corollary discharge in the auditory cortex. Nature. 2014;513:189–194. doi: 10.1038/nature13724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Schroeder CE, Lakatos P. Low-frequency neuronal oscillations as instruments of sensory selection. Trends Neurosci. 2009;32:9–18. doi: 10.1016/j.tins.2008.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Seybold BA, Phillips EAK, Schreiner CE, Hasenstaub AR. Inhibitory actions unified by network integration. Neuron. 2015;87:1181–1192. doi: 10.1016/j.neuron.2015.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Sherman SM, Guillery RW. The role of the thalamus in the flow of information to the cortex. Philos Trans R Soc London. 2002;357:1695–1708. doi: 10.1098/rstb.2002.1161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Shuler MGH, Bear MF. Reward timing in the primary visual cortex. Science. 2006;311:1606–1609. doi: 10.1126/science.1123513. [DOI] [PubMed] [Google Scholar]
  54. Sohoglu E, Chait M. Detecting and representing predictable structure during auditory scene analysis. eLife. 2016;5:1–17. doi: 10.7554/eLife.19113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Temereanca S, Simons DJ. Functional Topography of Corticothalamic Feedback Enhances Thalamic Spatial Response Tuning in the Somatosensory Whisker/Barrel System. Neuron. 2004;41:639–651. doi: 10.1016/s0896-6273(04)00046-7. [DOI] [PubMed] [Google Scholar]
  56. Winer JA, Diehl J, Larue DT. Projections of auditory cortex to the medial geniculate body of the cat. J Comp Neurol. 2001;430:27–55. [PubMed] [Google Scholar]
  57. Wright Ba, Fitzgerald MB. The time course of attention in a simple auditory detection task. Percept Psychophys. 2004;66:508–516. doi: 10.3758/bf03194897. [DOI] [PubMed] [Google Scholar]
  58. Zhang ZW, Deschenes M. Intracortical axonal projections of lamina VI cells of the primary somatosensory cortex in the rat: a single-cell labeling study. J Neurosci. 1997;17:6365–6379. doi: 10.1523/JNEUROSCI.17-16-06365.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Zhou M, Liang F, Xiong XR, Li L, Li H, Xiao Z, Tao HW, Zhang LI. Scaling down of balanced excitation and inhibition by active behavioral states in auditory cortex. Nat Neurosci. 2014;17:841–850. doi: 10.1038/nn.3701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Zion Golumbic EM, Ding N, Bickel S, Lakatos P, Schevon Ca, McKhann GM, Goodman RR, Emerson R, Mehta AD, Simon JZ, et al. Mechanisms underlying selective neuronal tracking of attended speech at a “cocktail party. Neuron. 2013;77:980–991. doi: 10.1016/j.neuron.2012.12.037. [DOI] [PMC free article] [PubMed] [Google Scholar]

STAR Methods References

  1. Anderson LA, Linden JF. Physiological differences between histologically defined subdivisions in the mouse auditory thalamus. Hear Res. 2011;274:48–60. doi: 10.1016/j.heares.2010.12.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Hackett TA, Rinaldi Barkat T, O’Brien BMJ, Hensch TK, Polley DB. Linking topography to tonotopy in the mouse auditory thalamocortical circuit. J Neurosci. 2011;31:2983–2995. doi: 10.1523/JNEUROSCI.5333-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Kaur S, Rose HJ, Lazar R, Liang K, Metherate R. Spectral integration in primary auditory cortex: Laminar processing of afferent input, in vivo and in vitro. Neuroscience. 2005;134:1033–1045. doi: 10.1016/j.neuroscience.2005.04.052. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2

Table S1. (related to Figures 18) Results for statistical tests.

RESOURCES