Abstract
Sounds can arise from the environment and also predictably from many of our own movements, such as vocalizing, walking, or playing music. The capacity to anticipate and discriminate these movement-related (reafferent) sounds from environmental sounds is critical to normal hearing1,2, yet the neural circuits that learn to anticipate the often arbitrary and changeable sounds resulting from our movements remain largely unknown. Here we developed an acoustic virtual reality (aVR) system in which a mouse learned to associate a novel sound with its locomotor movements, allowing us to identify the neural circuit mechanisms that learn to suppress reafferent sounds and probe the behavioral consequences of this predictable sensorimotor experience. We found that aVR experience gradually and selectively suppresses auditory cortical responses to the reafferent frequency, in part by strengthening motor cortical activation of auditory cortical inhibitory neurons that respond to the reafferent tone. This plasticity is behaviorally adaptive, as aVR-experienced mice showed an enhanced ability to detect non-reafferent tones during movement. Together, these findings describe a dynamic sensory filter involving motor cortical inputs to the auditory cortex that can be shaped by experience to selectively suppress predictable acoustic consequences of movement.
Auditory activity in the brains of humans and other mammals is suppressed during a wide variety of movements, including vocalization and locomotion1,3–9. Although the stereotyped and often simple acoustic consequences (i.e., auditory reafference) of rhythmic movements like licking or chewing can be suppressed by brainstem mechanisms8, a more flexible form of movement-related suppression is thought to arise independently in the auditory cortex through mechanisms that are not well understood3,6,10–13. To begin to identify these cortical mechanisms, we developed an acoustic virtual reality (aVR) system where head-fixed mice on a treadmill heard a series of tones presented at a rate proportional to the mouse’s running speed, simulating an experimentally adjustable yet predictable form of auditory reafference associated with locomotion (Fig. 1A,B; Extended Data Fig. 1A-D; Supplementary Video 1). Following ~1 week of aVR experience, auditory cortical responses to the reafferent tone were nearly abolished during locomotion while responses to frequencies one or more octaves distant from the reafferent frequency showed a more modest amount of suppression similar to that seen during locomotion in aVR-naïve mice 3,5 (Fig. 1C-F). Notably, this differential suppression only manifested during movement: in resting mice, a similar fraction of auditory cortical neurons responded to reafferent and non-reafferent tones and did so with equivalent firing rates (Fig. 1E,F and Extended Data Fig. 2A-D).
Figure 1. Locomotion-related suppression is specific for the frequency of self-generated sounds.
A. Mice were acclimated to closed-loop acoustic virtual reality (aVR) system.
B. Example trace of locomotion (black trace), timing of individual aVR tone pips (red ticks), and instantaneous rate of aVR tone pips (red trace).
C. Extracellular recordings were made from auditory cortical neurons during running and resting. The mouse brain in this figure has been reproduced with permission from27.
D. During electrophysiology, (a) mice first heard the reafferent tone frequency that they expected treadmill running to produce (e.g. 4 kHz). During phase (b), mice heard tones that were yoked in time to their running speed but with random frequency. During phase (c), mice heard tones of random frequency while at rest.
E. Action potential responses from example neuron while at rest (black, phase (c) from panel B) and while running (red, phase (b) from panel B) following playback of the expected reafferent frequency (4 kHz) as well as tone frequencies 1 octave away.
F. Population PSTHs showing neural responses to reafferent (left) and non-reafferent (right) frequencies during running (red) and resting (black) conditions. Responses to non-reafferent sounds are averaged across all 5 non-reafferent frequencies. During running, responses to the expected reafferent frequency were suppressed relative to non-reafferent frequencies (N = 11 mice, n = 317 neurons, p=1.1×10−18).
G. Locomotion-related suppression for an example auditory cortical neuron.
H. Average locomotion-related suppression (black, mean +/− SE) of auditory cortical neurons, centered on the expected reafferent frequency heard by each mouse (N = 11 mice). Blue area shows 95% confidence bounds.
I. Extracellular recordings were made from the auditory thalamus during running and resting following 6–9 days of aVR acclimation.
J. Locomotion-related suppression (black, mean +/− SE) of auditory thalamic neurons (N = 5 mice, n = 109 neurons) is not specific to the expected reafferent frequency.
Statistical details in Methods.
To further characterize the movement-dependent filter formed by aVR experience, we compared the frequency tuning curves of auditory cortical neurons at running and at rest and calculated a locomotion-related gain function for each neuron (Fig. 1G). Regardless of a neuron’s best frequency, locomotion-related suppression was greater for the reafferent tone than for tones one or more octaves away, with tones one half octave from the reafferent frequency showing intermediate levels of suppression during locomotion (Fig. 1H; Extended Data Fig. 2E-H, Extended Data Fig. 3A). The width of this “notch” filter may be constrained by the tuning widths of auditory cortical inhibitory neurons14 and more broadly conforms to the idea that excitatory neurons in sensory cortex receive input from local inhibitory neurons tuned to both similar and dissimilar stimulus features15,16. Movement-related suppression in auditory thalamic neurons remained flat across sound frequencies (Fig. 1I,J), as previously observed in aVR-naive mice3,8,17, and reafferent tones were more strongly suppressed in infragranular than supragranular auditory cortex (Extended Data Fig. 3B-D), indicating that circuits local to the auditory cortex are a likely source of the reafferent notch filter that arises following aVR experience.
The formation of this auditory cortical filter required a predictable and relatively prolonged association of movement with an ensuing sound. Mice acclimated for ~1 week on a treadmill in which fixed-frequency tones were presented only at rest did not display enhanced cortical suppression at the training frequency during rest or running (Extended Data Fig. 4A,B). Furthermore, mice acclimated for ~1 week on a treadmill in which tones were presented at a fixed tempo during locomotion regardless of running speed (i.e., “metronome”-experienced mice) showed no enhanced locomotion-related auditory cortical suppression at the training frequency (Extended Data Fig. 4C,D). Moreover, in aVR-experienced mice, enhanced suppression of the reafferent tone was evident as soon as they began to move on the treadmill, whereas stimulus-specific adaptation emerged more slowly and only after several tone presentations (i.e., measured at the non-reafferent frequency; Extended Data Fig. 4E,F). Finally, one hour of aVR experience (~1000–3000 reafferent tones) was inadequate to induce any enhancement of locomotion-related suppression at the reafferent frequency (Extended Data Fig. 4G).
To determine more precisely the time course over which this notch filter arises, we used 2-photon calcium methods to longitudinally image Layer II/III excitatory neurons in the auditory cortex of mice across their first 5 days of aVR experience (Fig. 2A,B). Across days, tuning curves measured during rest were relatively stable (r = 0.53), but at the population level, locomotion-related suppression became progressively more specific for the reafferent frequency (Fig. 2C). The emergence of frequency-specific suppression during locomotion involved both increased suppression of the reafferent frequency and decreased suppression of non-reafferent frequencies (Fig. 2D-F). Tracking the activity of a subset of the same neurons (n = 241 from 2 mice) across consecutive days of aVR experience revealed that their tuning curves measured during locomotion – but not during rest – changed only at the reafferent frequency, with individual neurons becoming less responsive across consecutive days (Fig. 2G-I).
Figure 2. Reafferent suppression arises in parallel with sensory-motor experience.
A. Injection of AAV encoding Cre-dependent GCaMP6f into auditory cortex of CaMK2::Cre mouse followed by calcium imaging of L2/3 excitatory neurons during aVR experience over several days.
B. Maximum projection of an example field of L2/3 excitatory neurons in auditory cortex from a mouse acclimated to aVR experience producing 32 kHz tones (left panel). Color represents the best frequency of every neuron. White box shows region in panel G.
C. Population-averaged calcium transients evoked by the expected reafferent frequency (left) and a non-reafferent frequency (right, 2 octaves away) during rest (black) and running (red) following 1, 3 and 5 days of aVR experience. Shaded areas show mean +/− SE.
D. Magnitude of locomotion-related suppression (α) for the expected reafferent frequency, (N = 4 mice).
E. Magnitude of locomotion-related suppression (β) for non-reafferent frequencies (−2 octaves).
F. Selectivity of locomotion-related suppression for the expected reafferent frequency [(α - β) / (α + β)]. Values greater than zero indicate stronger locomotion-related suppression at the reafferent frequency; values less than zero indicate stronger locomotion-related suppression at non-reafferent frequencies.
G. The maximum projection on subsequent days, zoomed in on the region outlined in white panel B.
H. Average calcium traces from two example neurons shown in panel G during days 4 and 5 of aVR experience. Responses for neuron 1 are to 32 kHz tones (the expected reafferent frequency) and responses for neuron 2 are to 2 kHz tones (a non-reafferent frequency). Black traces are responses during rest and red traces are responses during running.
I. In 2 mice, 241 sound-responsive neurons were monitored across subsequent days of aVR experience. Change in tuning curves across subsequent days (black = rest, red = running). Shaded regions are 95% confidence bounds.
Statistical details in Methods.
Auditory cortical interneurons integrate auditory inputs with locomotor-related signals from the secondary motor cortex (M2)3,18,19 and form inhibitory synapses on both similarly and dissimilarly tuned excitatory cells, affording a substrate on which aVR experience could act to sculpt the movement-dependent notch filter described here3,18 (Fig. 3A). To explore this possibility, we expressed Channelrhodopsin (ChR2) in genetically identified inhibitory neurons (PV+, SST+, or VGAT+ neurons, see Methods), acclimated mice to ~1 week of aVR experience, then recorded the action potential activity of photo-identified auditory cortical inhibitory cells (Fig. 3B and Extended Data Fig. 5A,B). During running, the spontaneous firing rates in only those inhibitory neurons responsive to the reafferent frequency increased, while their responses to reafferent and non-reafferent tones modestly decreased (Fig. 3C,D). In contrast, the spontaneous firing rates of putative excitatory neurons slightly decreased during locomotion (Extended Data Fig. 5C). Moreover, the magnitude of the running-related increase in an inhibitory neuron’s spontaneous firing rate scaled with the strength of its response to the reafferent frequency (Extended Data Fig. 5D,E). Finally, locomotion-related suppression at the reafferent frequency comprised both divisive and subtractive components, consistent with the involvement of both PV+ and SST+ interneurons20 (Extended Data Fig. 5F). Therefore, aVR experience selectively enhances the movement-dependent recruitment of inhibitory interneurons responsive to the reafferent frequency.
Figure 3. Reafferent-tuned inhibitory neurons increase their activity during locomotion and receive enhanced motor cortical input.
A. Proposed model of experience-dependent strengthening of M2 inputs onto inhibitory neurons tuned to the reafferent frequency in the auditory cortex (blue circle). Synapse strength is proportional to size.
B. Extracellular recordings from photo-identified inhibitory neurons (pi-INs) in the auditory cortex.
C. Locomotion velocity as a function of time (top). Spontaneous activity of simultaneously recorded pi-INs (bottom) that were responsive to the reafferent frequency (black) or were not, but were responsive to other frequencies (orange).
D. PV+, SST+ and VGAT+ pi-INs that are responsive to the reafferent frequency increase their firing rate significantly during locomotion, unlike pi-INs that are not responsive to the expected reafferent frequency (* p<0.05, Wilcoxon signed rank test).
E. Extracellular recordings from pi-INs in the auditory cortex. A bipolar microstimulating electrode was implanted in M2.
F. Example of M2 stimulation-evoked action potentials recorded from a VGAT+ pi-INs in auditory cortex.
G. pi-INs that were more strongly driven by the reafferent frequency were more strongly recruited by electrical stimulation in M2. Solid lines and shadows show linear regression and 95% confidence bounds, respectively.
H. Changes in spontaneous firing rate during locomotion relative to rest for pi-INs (PV+ and SST+) were significantly correlated with the magnitude of the same neurons’ responses to M2 electrical stimulation.
I. Effect size of the correlations shown in G and for four other conditions. All pi-INs: pi-INs from PV+, SST+ and VGAT+ mice. Non-reaff: same neurons shown in panel G (VGAT+) but for responses to a non-reafferent tone. Naïve: pi-INs from VGAT+ mice that were acclimated to a quiet treadmill. put-ENs: putative excitatory neurons in auditory cortex of VGAT::ChR2 mice acclimated to aVR. Error bars show 95% confidence bounds.
Statistical details in Methods
One idea is that aVR experience strengthens the connections between M2 and auditory cortical interneurons that respond to the reafferent frequency, resulting in their selective recruitment during locomotion. To test this idea, we calculated frequency-tuning curves of photo-identified PV+, SST+, and VGAT+ inhibitory neurons in the auditory cortex of aVR-experienced mice. We then applied brief current pulses in M2 and measured resultant action potential activity in these identified interneurons (Fig. 3E,F). These measurements revealed that auditory cortical inhibitory neurons that strongly responded to the reafferent frequency were driven more strongly by M2 stimulation than inhibitory neurons that responded only weakly or not at all to the reafferent frequency (Fig. 3G). Moreover, the PV+ and SST+ inhibitory neurons most strongly activated by M2 stimulation in the resting mouse showed the greatest increases in spontaneous firing rates when the mouse was running (Fig. 3H, Extended Data Fig. 5G). Using a similar approach, we also detected a similar but weaker correlation for auditory cortical excitatory neurons in aVR-experienced mice (Fig. 3I and Extended Data Fig. 5H,J). These differential effects of M2 stimulation on auditory cortical excitatory and inhibitory neurons are consistent with the observation that M2 synapses excite both cell types but exert a primarily suppressive effect on auditory cortical activity through feedforward inhibition18,19. Finally, in aVR-naïve mice, there was only a weak correlation between tone-evoked and M2-stimulation-evoked responses in auditory cortical interneurons (Extended Data Fig. 5I,J).
To begin to explore how movement-related signals in the auditory cortex influence auditory perception, we trained mice to detect tones while running and resting and confirmed that this tone detection task required auditory cortical activity (Fig. 4A,B; Supplementary Video 2; Extended Data Fig. 6A). This assay revealed that mice were significantly worse at detecting tones at intermediate intensities (10–40 dB) while running than at rest, despite displaying equal levels of motivation in these two states (Fig. 4C,D; Extended Data Fig. 6B). Furthermore, optogenetically activating inhibitory interneurons or M2 axon terminals in the auditory cortex degraded performance in resting mice at these intermediate sound intensities, indicating that the same circuit elements that are influenced by aVR experience modulate hearing in a movement-dependent manner (Fig. 4E-H; Extended Data Fig. 6C-F). In contrast, optogenetically activating inhibitory interneurons in visual cortex, illuminating GFP-expressing M2 terminals in the auditory cortex, or simply illuminating the intact skull in resting mice did not diminish their ability to detect tones (Extended Data Fig. 6G-I).
Figure 4. Tone detection behavior is compromised by locomotion, is auditory-cortex dependent, and adapts following aVR experience.
A. Mice were trained to lick a response port upon hearing a tone while resting or running on a treadmill.
B. Tones were presented with random inter-tone intervals following a period without spontaneous licking. Mice were rewarded for licking within 1 sec following tone presentation.
C. Average psychometric functions (N = 19 mice) showing detection rates as a function of tone intensity while mice were resting (black) and running (red). (red asterisk, p<0.005).
D. Behavioral threshold (intensity at 50% performance) during resting and running for each mouse (black circles; red + denotes mean thresholds). Dashed line shows unity. (N = 19 mice, p=0.009, paired t-test).
E. Optogenetic activation of inhibitory neurons in auditory cortex during tone detection.
F. Tone detection performance (N = 4 mice) during rest (black), running (red) and rest with optogenetic activation of auditory cortical inhibitory neurons (blue). Mice were worse at detecting tones during optogenetic trials compared to non-optogenetic trials at rest, (blue asterisk p<0.05) and during running compared to rest. (red asterisk p<0.05).
G. Optogenetic activation of ChR2+ M2 terminals in auditory cortex during tone detection.
H. Tone detection performance (N = 4 mice) during rest (black), running (red) and rest with optogenetic activation of M2 terminals in auditory cortex (blue). Mice were worse at detecting tones during optogenetic trials compared to non-optogenetic trials at rest, (blue asterisk, p<0.05) and during running compared to rest (red asterisk, p<0.05).
I. Tone detection performance (N = 10 mice) during rest and running for naïve mice performing a 2-frequency detection task. (red asterisk p<0.05 )
J. As in panel (I), but following aVR experience with Tone A. (red asterisk, p<0.05).
Statistical details in Methods and Supplementary Table 1.
A remaining issue is whether the auditory cortical filter formed by aVR experience improves the mouse’s ability to detect non-reafferent tones during movement, as hypothesized for brain mechanisms that suppress predictable sensory reafference2,21. We trained mice to detect two different tones (Tone A and Tone B, separated by 2 octaves), which were randomly interleaved from one trial to the next. After several days of training, mice showed a similar deficit in detecting both tones during locomotion, similar to mice trained on a single tone (Fig. 4I). Mice then received ~1 week of aVR experience in which only one of the tones (Tone A) was used as a reafferent training stimulus. Following aVR experience, mice no longer showed a locomotion-related deficit in detecting the non-reafferent tone (Tone B), even though they continued to show a movement-related deficit in detecting the reafferent tone (Tone A) (Fig. 4I,J; Extended Data Fig. 6J). Therefore, aVR experience improves the mouse’s ability to detect unpredicted sounds during movement (Extended Data Fig. 6K) in addition to selectively suppressing auditory cortical responses to predictable reafferent sounds.
The present findings establish that temporally coupled locomotor – auditory experience results in the formation of a movement-dependent filter that suppresses auditory cortical responses to predictable self-generated sounds. A plausible idea is that coincident motor and auditory activity during sound-generating movements strengthens M2 synapses onto PV+ and SST+ interneurons, or onto neurons interposed between M2 and these auditory cortical interneurons, leading to enhanced movement-related suppression of auditory cortical responses to the reafferent tone22,23 and an enhanced ability to detect non-reafferent tones during movement24,25. Notably, aVR experience also reduced locomotion-dependent suppression at non-reafferent frequencies in layer 2/3 of the auditory cortex, providing an auditory cortical correlate of this adaptive perceptual change. The involvement of M2 in this form of auditory cortical suppression is reminiscent of the motor cortical-mediated suppression of responses to predictable stimuli in mouse primary visual cortex26, consistent with a generalized predictive cortical mechanism. Ultimately, the motor – auditory cortical circuit characterized here can flexibly encode the relationship between a movement and the sound it produces, helping to maintain sensitivity to novel sounds in the environment while also monitoring the acoustic consequences of sound-generating movements.
Methods
Surgical procedures
All experimental protocols were approved by Duke University Institutional Animal Care and Use Committee. Male and female mice were purchased from Jackson Labs and were housed and bred in an onsite vivarium. We used mice that were 2 – 4 months old for our experiments. During all experiments, mice were kept on a reverse day-night cycle (12 h day, 12 h night).
For all surgical procedures, mice were anesthetized under isoflurane (1–2% in O2) and placed in a stereotaxic holder (Leica), skin was removed over the top of the head, and a titanium headpost was attached to the skull using a transparent adhesive (Metabond). Prior to electrophysiology experiments and following aVR acclimation, male and female mice (C57, VGAT::ChR2, PV::Cre, SST::Cre) craniotomies were made to expose auditory cortex, auditory thalamus, and/or motor cortex to allow electrophysiology or electrical stimulation. A small craniotomy was made over the right sensory cortex and a silver pellet was positioned atop the cortical surface and cemented in place (Metabond) for use as a ground electrode. Exposed craniotomies were covered with a silicone elastomer (Kwik-Sil) and the mouse was allowed to recover in its home cage.
Prior to calcium imaging experiments, an AAV encoding Cre-dependent GCaMP6f was injected into the auditory cortex of CaMKII::Cre mice (see Viral injections), a tattoo was placed on the surface of the skull to mark the injection site, and a custom Y-shaped titanium headpost was attached to the skull with Metabond. Exposed skull on the top and side was also covered in Metabond. Mice were allowed to recover in their home cage for three weeks, after which time mice were again anesthetized and a rectangular craniotomy was made over the original injection site. A stack of 2 laminated glass coverslips was placed over the craniotomy and sealed with Metabond. Mice were returned to their home cage and allowed to recover for 1 to 2 days.
For simultaneous psychophysics and pharmacological manipulations or optogenetic (M2 terminals or VGAT+ neurons) stimulation experiments in the auditory cortex, a custom Y-shaped titanium headpost was attached to the skull with Metabond and marks were bilaterally placed on the surface of the skull over the auditory cortex. For experiments in which M2 axon terminals were activated optogenetically, an AAV encoding either Channelrhodopsin or eGFP was injected bilaterally into M2 at the same time as the headpost implant (see Viral Injections). Once mice were proficient with the task, they were anesthetized again under isoflurane and craniotomies were opened bilaterally over the auditory cortex, which was confirmed by performed recordings (Carbostar-1, Kation Scientific) at multiple locations within the craniotomy in anaesthetized mice to ensure auditory responses to 8kHz tones (1 to 2 sec inter-tone interval). For pharmacological experiments, exposed craniotomies were covered with a silicone elastomer (Kwik-Sil) and mice were allowed to recover in their home cage for a day before behavioral testing. For optogenetic stimulation experiments, circular glass coverslips (3mm diameter) were implanted bilaterally over the auditory cortex and were sealed in place with Metabond. For simultaneous psychophysics and optogenetic manipulation experiments in the visual cortex (VGAT+), circular glass coverslips were bilaterally implanted over visual cortex (identified using stereotaxic coordinates) at the same time as an annular headpost was attached to the skull. Mice were returned to their home cage and allowed to recover for 1 to 2 days after implantation.
Viral injections
For expression of calcium indicators, the skull over the auditory cortex of male and female CaMKII-Cre mice was exposed and two small craniotomies were made over the auditory cortical surface (estimated using stereotaxic coordinates) separated by approximately 300 um along the rostral-caudal dimension. A pipette was backfilled with AAV.1.hSyn.FLEX.GCaMP6f, angled at 30 degrees relative to vertical, and lowered into the auditory cortex. Approximately 150–200 nL of virus was pressure injected (Nanoject) into the center of auditory cortex over the course of 15 minutes, repeated for each craniotomy. For expression of Channelrhodopsin or eGFP, the skull over M2 of C57, or auditory cortex for PV::Cre and SST::Cre male and female mice, was exposed. A single craniotomy was made over M2 or auditory cortex (stereotaxic coordinates), and approximately 300nL of AAV.1.hSyn.ChR2.EYFP.WPRE (M2), AAV.1.CB7.eGFP.WPRE (M2) or AAV.1.EF1α.DIO.ChR2.EYFP.WPRE (A1) was pressure injected over the course of 20 minutes. Following injections, craniotomies were filled with melted bone wax and the injection sites were covered in Metabond.
Acoustic virtual reality
We designed a custom acoustic virtual reality (aVR) system for yoking a series of fixed-frequency tone pips (25 msec with 5 msec cosine ramp onset and offset) to a mouse’s running speed. To create the aVR system, we built a non-motorized treadmill from a 6-inch Plexiglas disk (Delvies Plastic) that was coated with a thin silicone sheet (Durometer, Marian Chicago) mounted to the post of a rotary encoder (U.S. Digital). Output from the rotary encoder was monitored with a data acquisition card (National Instruments) connected to a computer (Dell) running custom Matlab software (Mathworks, PsychToolBox) and sampled at ~30 Hz. The computer was also connected to a sound card (RME Fireface UCX), the output of which was routed to an ultrasonic speaker (Tucker Davis Technologies) located lateral to the mouse, 6 inches from the mouse’s right ear. We recorded the noise produced by the mouse’s footsteps on the treadmill and the sound of the rotating treadmill itself during running by placing an ultrasonic microphone close (~1 cm) to the mouse’s ear. We measured <1dB increase (estimated by taking the rms value of 5 sec segments of recordings) in the noise produced when the mouse was running on the treadmill compared to rest.
To calculate the inter-tone-interval, we computed a filtered version of the mouse’s velocity, which was the median of the last 5 velocity samples. Upon every sampling period, the desired inter-tone-interval was updated to be proportional to the reciprocal of the current median-filtered velocity, scaled such that the rate of tone presentations closely matched the foot step rate, which was calculated from videos of mice running on the treadmill at various speeds. At speeds greater than 30 cm/sec, the inter-tone-interval saturated at 100 msec to ensure spacing between tones. For the anti-coupled version of aVR, sounds were not presented while mice were running but the total number of tones that should have been presented, and the calculated intervals between them, were stored in memory. During rest, tones were played back to the mouse with inter-tone-intervals drawn from the intervals that the mouse would have heard while running until the number of resting tones equaled the number of tones that mouse should have heard while running. For the metronome aVR, sounds were presented only during running and at a fixed rate (~2/sec) that was non-modulated by running speed.
Mice were held in place using two clamps (Altos Photonics) that secured the arms of the headpost (see Surgical procedures). On their first day of treadmill experience, the aVR system was turned off and mice ran and rested for 2 hours without hearing any tones. Beginning on the second day (referred to as Day 1 of aVR experience), tones of a fixed frequency were yoked to the mouse’s velocity as described above. For each mouse, the tone frequency was fixed during the first 6–9 days of aVR experience at 2, 4, 8, 16, 32, or 64 kHz. Mice were placed on the treadmill for ~2 hours per day and were free to transition between periods of running and rest, which typically occurred several times during each 2-hour aVR acclimation session.
Electrophysiology and aVR
Following 6–9 days of aVR acclimation, mice were positioned atop the treadmill and a 32-channel electrode (Neuronexus, 4 × 8 configuration) was implanted into the auditory cortex or auditory thalamus. The electrode was connected to a digitizing headstage (Intan) and electrode signals were acquired, monitored in real-time, and stored for subsequent offline analysis (OpenEphys). The electrode was allowed to settle for ~30 minutes, during which time mice ran on the treadmill and heard tones of the reafferent frequency to which they had been acclimated. Following this initial 30 minutes, the frequency of the running-related tones was switched from a fixed frequency to a pseudo-random distribution comprising 2, 4, 8, 16, 32 and 64 kHz tones. In a subset of mice (N=4) we also included tones spaced one-half octave higher and lower than the reafferent frequency. Random-frequency tones were presented with inter-tone-intervals dictated by running speed. Random-frequency tones yoked to running continued until the mouse heard 50 to 100 tones of each frequency (n = 7 mice, 2 to 15 minutes) or for 30 minutes (n = 4 mice). After this time, tones with random frequency were presented during a period of rest with inter-tone-intervals drawn from the distribution that the mouse had recently heard while running. Electrode signals were filtered (300 to 5000 Hz) and action potentials from individual neurons were sorted offline for each electrode independently based on visualization of the action potential waveform and principal component analysis of the waveform using custom Matlab software (PostHawk, D.M.S.).
Tone-evoked action potential responses were measured for each neuron at each tone frequency, independently for tones presented during running and during rest. To calculate population peri-stimulus-time histograms (PSTHs), the tone-evoked response of every neuron that was responsive to a particular frequency (see Statistical Methods) was averaged independently for running and resting conditions. For each neuron we measured the response strength to each tone frequency [RS(f)] as the firing rate following tone presentation minus the baseline firing rate. To calculate locomotion-related gain, we took the ratio of RS(f) measured during rest and running [gain(f) = RS(f)run / RS(f)rest]. To average the locomotion-related gain functions across mice acclimated to aVR producing reafferent tones of different frequencies, we aligned the gain function for each neuron to the reafferent frequency experienced by the mouse from which the neuron was recorded.
Recordings from photo-identified inhibitory neurons (pi-INs) were made in VGAT::ChR2 mice (Jackson labs) and in PV::Cre and SST:Cre mice (Jackson labs) injected with Cre-dependent ChR2 (see Viral Injections). During electrophysiology, a multi-electrode array was implanted in the auditory cortex and an optical fiber coupled to a blue laser (420nm, Shanghai) was directed at the auditory cortical surface. Action potential responses were analyzed in response to a series of 30 laser pulses (100 msec each, separated by 1 second, laser power: 15 – 30mW). pi-INs were identified based on short-latency, high reliability responses to optical stimulation (See Extended Data Fig. 5). Neurons that were not classified as pi-INs in VGAT::ChR2 mice were classified as putative excitatory neurons (put-ENs). Following aVR experience, we presented tones while recording from pi-INs during rest to measure their frequency tuning curves. The best frequency of each neuron was computed as the frequency that drove the strongest response and was restricted to neurons that were significantly driven by tones of at least one frequency. We recorded the spontaneous activity of pi-INs as mice transitioned between periods of running and rest.
To measure the strength of the functional connection between M2 and auditory cortex neurons, we implanted a bipolar stimulating electrode into M2 and we electrically stimulated within M2 (100 usec, 300 uA) every 2 seconds while recording from pi-INs and put-ENs in auditory cortex. Response strength to M2 stimulation was calculated as the difference between the firing rate in a baseline window immediately preceding electrical stimulation (100 msec window) and in a brief window 13 to 40 msec following electrical stimulation. We performed a bootstrap analysis 1000 times to compute the 95% confidence bounds of the regression lines. Effect size was computed as Cohen’s f2, which is defined as R2/(1-R2), where R is the Pearson correlation.
To measure locomotion-related suppression across cortical layers, in a subset of mice (N = 3) we implanted the electrode perpendicular to the surface of the auditory cortical surface. To estimate the cortical layer in which each neuron on the 4×8 electrode resided, we calculated the current-source density (CSD) triggered off of tone playback. First, we created a two-dimensional map of local-field potential (LFP, bandpass filtered to include 0.1 to 70 Hz) activity by averaging across electrode shanks that were at the same depth, and plotting the averaged activity at each depth as a function of time relative to tone onset. We then computed the CSD as the second spatial derivative of the depth-specific LFP signal. We estimated which electrodes were in cortical layer 4 based on transitions between sources and sinks in the CSD and based on latency of tone-evoked responses. We then separated our electrodes into those residing in infragranular (deeper than layer 4) and supragranular (superficial to layer 4) layers of the cortex. We subsequently analyzed the tone-evoked LFP signal to measure locomotion-related suppression28.
Calcium imaging and aVR
Three weeks following viral infection with GCaMP6f and one day following the implantation of a cranial window, mice were positioned atop the treadmill and under a resonant scanning two-photon microscope (Neurolabware) with a mode-locked titanium sapphire laser (Mai Tai DeepSee) at 920 nm (laser power levels: 50–130mW). The microscope objective (16× 0.8 NA water immersion, Nikon) was angled at ~35 degrees such that imaging was performed perpendicular to the auditory cortical surface while mice were positioned on the treadmill in a normal, upright position. Prior to aVR acclimation, we monitored the locomotion-related suppression of aVR-naïve mice independently at each frequency. We then chose as the reafferent frequency on subsequent days the sound frequency that had the least amount of movement-related suppression for neurons in our field of view. By using this approach, we did not bias ourselves toward sound frequencies that were already strongly suppressed. On each imaging day, mice ran on the aVR treadmill and heard fixed-frequency tones yoked to their running speed for 2 hours, during which time images were not acquired. Following each 2 hour aVR session, tones of random frequency (2 to 64 kHz) were presented during running (with timing yoked to the mouse’s running speed) and rest (with timing chosen from the running inter-tone-intervals) during image acquisition at 15.5 Hz until mice heard at least 50 tones of each frequency, typically lasting for approximately 5 to 10 minutes.
GCaMP6f fluorescence images were registered to correct for movement artifact in the horizontal plane. Regions of interest (ROIs) were selected using a semi-automated identification method based on nearby correlated pixel activity (Scanbox) and by manually tracing around individual cell bodies. The calcium trace for each ROI was calculated as the mean fluorescence signal averaged across all pixels comprising the ROI minus the mean fluorescence signal in an annular region surrounding each ROI (neuropil). ΔF/F was computed as (F(t) – F0)/F0, where F(t) was the raw calcium snippet surrounding a single tone presentation and F0 was the mean baseline fluorescence of each snippet during the 0.5 sec preceding tone stimulation. For each neuron, we first measured whether the ΔF/F was significantly elevated following tone presentation independently at each frequency (p<0.005, t-test).
Population-level analysis:
The calcium traces of all ROIs within an imaging session that were responsive to a particular frequency were averaged together independently for tones presented during running and during rest. Across days, the population responsive to a particular frequency did not necessarily comprise the exact same neurons. To compute the reafferent suppression index (RSI), we first calculated the difference between the tone-evoked calcium traces measured during rest and during running. From the differences measured at the reafferent frequency (alpha) and a frequency 2 octaves higher or lower (beta), we calculated the RSI as (alpha – beta)/(alpha + beta). We then tracked RSI as a function of day following the onset of aVR experience.
Single-neuron analysis:
Using anatomical coordinates and cell-body morphology, we tracked 577 neurons (from 2 mice) across subsequent pairs of days. Of these 577 neurons, 241 had significant tone-evoked responses to one or more tones on one or both days. For each neuron, we estimated tuning curves during rest and during locomotion on each of the two days. We then subtracted the Day1 tuning curve from the Day 2 tuning curve, independently for tuning curves measured during rest and locomotion, to determine how much the tuning curve of individual neurons changed across subsequent days of aVR experience. We averaged the change in resting and change in running tuning curves across all 241 tone-responsive neurons.
Single-frequency tone detection task
Mice were acclimated to the treadmill for 2 to 3 days then water restricted for 24 hours prior to training. To train mice to lick in response to tone presentations, tones (25ms long, 70 and 80 dB SPL) of a fixed frequency (8 kHz) were presented with variable inter-tone-intervals (10 to 15 sec) followed by a water reward 1 sec later. Licking behavior was measured with a custom-built infrared detector located between the mouse’s mouth and the water delivery spout and was sampled with a data acquisition card (NI) connected to a computer (Dell) running custom software (Matlab). Within 3 to 4 days of training, mice learned to associate the tone pips with rewards. Following this initial training phase, tones were presented at variable inter-tone intervals (8 to 15s) and mice were required to lick within a 1 sec window following tone presentation to receive a reward. To ensure that mice did not lick continuously (to maximize rewards), the inter-tone-interval was reset if the mouse licked during a varying window (5 to 8 sec) prior to tone presentation. Tones of lower intensities (0 to 60 dB, in steps of 10 dB) were gradually introduced over the subsequent 5 days.
During testing, tones were presented using a block design where each block consisted of all tone intensities (0 to 80dB in steps of 10dB) presented in random order. To ensure that the mice were engaged in the task, blocks during which the mouse did not lick were removed from analysis. Most mice spontaneously transitioned between periods of running and rest but a subset of mice tended to either rest or run continuously throughout each session. For these mice, tones were only presented during the less explored behavior, and this transient adjustment of the task criteria was sufficient to alter a mouse’s behavior such that they began to spontaneously transition between periods of running and resting. Mice performed an average of 215 trials per day, consisting of, on average, 170 trials during rest and 45 trials during running. To create psychometric functions for each mouse, trials were separated based on whether mice were running or resting at the time of tone presentation and trials were pooled across all testing days (12.4 +/− 5.4 days). 0 dB trials were used to estimate the rate of spontaneous licking and the false alarm rate. For each intensity, hit rate was calculated as the number of correct detections divided by the total number of tone presentations at each intensity. Behavioral threshold was calculated by estimating the intensity at which performance was 50% in each condition. This value was determined by linear interpolation of surrounding intensity values. To account for differences in thresholds due to difference in false alarm rates between conditions, we removed the fraction of correct trials that could be accounted for by false alarms for each condition29. Not applying this correction, and estimating thresholds on our raw data did not change our results (data not shown). Following training each day, mice received 1.5mL of water. Each mouse was weighed daily to ensure that its weight did not fall below 80% of their pre-water-restriction weight.
Two-frequency tone detection task
Training and testing methods were similar to the single-frequency version of the task, but using tones of two frequencies (4 and 16 kHz). During behavioral testing, trials were presented in a block design in which each block consisted of each frequency-intensity combination (4 and 16khz; 0, 30, 40, 50 and 60dB). Following an initial phase of behavioral testing, the lickometer and water spout were removed from the treadmill and mice received aVR experience with one of the two testing frequencies for 7 days, 2 hours per day. Five mice each received aVR experience with 4kHz and 16kHz; results were consistent regardless of the aVR frequency used (data not shown). After 7 days of aVR experience, behavioral testing was performed again for 4 to 8 days. At the beginning of each day, mice received 30 minutes of experience with aVR, during which time they ran and heard the reafferent frequency they were acclimated to over the previous 7 days. This brief re-exposure to aVR was followed by 30 minutes in their home cage without any aVR or behavioral testing. At the end of this 1-hour period, we reintroduced them to the behavioral testing chamber where mice performed the behavioral task.
Optogenetic stimulation during psychophysics
Mice were first trained on the single-tone detection task as described above using a subset of intensities (0, 30, 40 and 60dB). During behavioral testing, a blue (420 nm) laser (Shanghai) was coupled to a pair of optical fibers (Doric optical splitter), which were positioned bilaterally over the auditory cortex cranial windows. Laser pulses (25 Hz, 50% duty cycle, 15–30mW) were presented on 50% of sound trials. Laser stimulation began 200 msec prior to tone presentation and continued for 1.2 sec, which covered the entire response window. To accurately estimate the mouse’s chance performance on optogenetic trials and to deter the mouse from using light as a cue, 50% of stimuli were laser only trials, and a short air puff (100ms, Picospritzer II) was directed toward the mouse’s face as negative reinforcement on trials where the animal licked in response to laser only trials.
Pharmacological manipulation during psychophysics
Mice were first trained on the single-tone detection task as described above. Following initial training, saline or muscimol (2ug/ul, 150nL) was bilaterally pressure injected using a Nanoject system into the auditory cortex on alternate days. Mice were allowed to return to their home cage for 30 minutes after injection. At the end of this period, the mice were reintroduced to the testing chamber where they performed the behavioral task.
Statistical methods
Paired and unpaired two-sided statistical tests were performed with the non-parametric Wilcoxon Sign Rank test and Wilcoxon Rank Sum test, respectively, unless otherwise stated. All error bars are standard error unless otherwise stated. Bootstrap analyses with 1000 repetitions were used to measure confidence intervals for linear regressions (Fig. 3G,H; Extended Data Fig. 5D,F-I). Shuffled analyses with 1000 repetitions were used to estimate null distributions (Fig. 1H,J). Repeated measures 2-way ANOVAS followed by post-hoc Tukey tests (where required) were used to compare psychometric curves (Fig. 4C,F,H,I,J; Extended Data Fig. 6E-K). ANOVA p-values were corrected using the Holm-Bonferroni method to account for multiple comparisons (Fig. 4F,H,I,J; Extended Data Fig. 6J,K). In all figures and captions, “n” is the number of neurons in the data sample, “N” is the number of animals contributing data points to the distributions. No statistical methods were used to predetermine sample sizes, but our sample sizes were similar to those reported in previous publications in the field. Details regarding sample sizes, p-values and statistical tests for individual figures panels are detailed below.
Fig. 1F. Of 317 neurons (n= 11 mice), 120 were significantly responsive to the reafferent frequency and 248 were responsive to at least one non-reafferent frequency, calculated by comparing baseline firing rates during the 100 msec preceding tone onset to the 100 msec following tone onset (p<0.005, paired t-test). During rest, response strength (driven minus baseline firing rates) to reafferent and non-reafferent tones were not significantly different (11.2+/−22.3 vs. 13.1+/−24.1 action potentials/sec, n = 120 and 248, p=0.46). During running, response strength to the reafferent tone was significantly less than the response strength to non-reafferent tones (−0.1+/−19.9 vs.6.3+−23.3 action potentials/sec n = 120 and 248, p=0.001). For a paired, within-neuron comparison, we restricted our analysis to neurons that responded to the reafferent frequency and at least one other frequency (n = 115) and we averaged each neuron’s response to all non-reafferent frequencies to which it responded significantly. During running, responses to the reafferent frequency were weaker than to non-reafferent frequencies (p=1.1×10−18, Wilcoxon signed-rank test). As a population, responses to the reafferent frequency during running were not significantly different from 0 (p=0.12).
Fig. 1H. We aligned the gain function for each neuron to the reafferent frequency experienced by the mouse from which the neuron was recorded. The number of neurons responsive to each tone frequency (−3, −2, −1, 0, 1, 2, and 3 octaves, relative to the reafferent frequency) were 70, 92, 153, 120, 110, 95, and 67, respectively. To determine whether the notch at the reafferent frequency was significant relative to what would be expected by chance, we shuffled the data by randomly assigning to each neuron a reafferent frequency rather than using the actual frequency experienced by the mouse from which the neuron was recorded. This shuffling was performed 1000 times and the 95% confidence bounds of the distribution were computed.
Fig. 1J. As in 1H. 109 neurons were recorded from 5 mice. The number of neurons responsive to each tone frequency (−2, −1, 0, 1, and 2 octaves, relative to the reafferent frequency) were 24, 39, 33, 24, and 20, respectively. As in 1H, we computed confidence bounds by randomly assigning to each neuron a reafferent frequency rather than using the actual frequency experienced by the mouse from which the neuron was recorded.
Fig. 2C. Solid lines show the mean calcium response to tones averaged across all neurons that were responsive to that tone during rest. The population size that was responsive to each tone on each day is noted below each set of red and black traces. Vertical line shows dF/F.
Fig. 2I. For 241 neurons, the tuning curves estimated across subsequent days were subtracted, and this was done independently for curves measured during rest (black) and running (red). At each frequency (and independently for each movement condition), we performed a bootstrap analysis, randomly sampling from our distribution (with replacement), repeated 1000 times. The shaded areas show the 95% confidence bounds for these distributions. The only significant change in tuning was at 32 kHz (the reafferent frequency) and only during locomotion (red).
Fig. 3D. Data are from pi-INs recorded from 2 PV::Cre mice (n = 36), 2 SST::Cre mice (n = 21), and 3 VGAT::ChR2 mice (n = 44). Magenta, green and black asterisks indicate p=0.03, p=0.04, and p=0.02, respectively. Analyzing all interneurons together, p=0.01. (Sign Rank and Rank Sum tests for paired and unpaired tests, respectively).
Fig. 3G. Data are from pi-INs recorded from 2 PV::Cre mice (n = 54), 2 SST::Cre mice (n = 32), and 5 VGAT::ChR2 mice (n = 75). Solid lines are linear regression and shaded lines show 95% confidence bounds from a bootstrap analysis repeated 1000 times.
Fig. 3H. Data are from pi-INs recorded from 2 PV::Cre mice (n = 33) and 2 SST::Cre mice (n = 26). PV+ and SST+ neurons were pooled together for the regression analysis. Solid lines are linear regression and shaded lines show 95% confidence bounds from a bootstrap analysis repeated 1000 times.
Fig. 3I. Data are from regressions shown in panel G and Extended Data Fig. 5G-I. Effect sizes for PV, VGAT, SST, and All pi-INs are significantly larger than effect sizes for Non-reafferent and Naïve conditions (p<0.01, Wilcoxon). Effect sizes for PV, VGAT and All pi-INs are significantly larger than effect size for put-ENs (p<0.01, Wilcoxon). Bar height determined by linear fit of raw data; error bars show SE of linear fits from 1000 repetitions of bootstrap analysis.
Fig. 4C. Resting and running performance curves are mean and SE for N = 19 mice. Lines are sigmoid fits to the data points. Comparisons with repeated measures 2-way ANOVAs at non-zero intensities (factors: intensity x behavioral state, p(interaction) = 0.0003, F(7, 126) = 4.22), followed by post hoc Tukey test. red asterisks, p<0.005.
Fig. 4D. Each point contains the behavioral threshold (intensity at 50% performance) during rest and running for a single mouse. Comparisons with two-sided paired t-test. p = 0.009.
Fig. 4F. Data points show mean and SE for N = 4 mice. Rest vs. optogenetic activation of inhibitory interneurons in the auditory cortex: Comparisons with repeated measures 2-way ANOVAs at non-zero intensities. (factors: intensity x laser state, p(interaction) = 0.01, F (2, 6) = 14.27, post hoc Tukey test, blue asterisk p<0.05). Rest vs. running: Comparisons with repeated measures 2-way ANOVAs at non-zero intensities. (factors: intensity x behavioral state, p(interaction) = 0.03, F (2, 6) = 6.60, post hoc Tukey test, red asterisk p<0.05). ANOVA p values were corrected using the Holm Bonferroni method.
Fig. 4H. Data points show mean and SE for N = 4 mice. Rest vs. optogenetic activation of M2 axon terminals in the auditory cortex: Comparisons with repeated measures 2-way ANOVAs at non-zero intensities. (factors: intensity x laser state, p(interaction) = 0.04, F (2, 6) = 5.84, post hoc Tukey test, blue asterisk, p<0.05). Rest vs. running: Comparisons with repeated measures 2-way ANOVAs at non-zero intensities. (factors: intensity x behavioral state, p(interaction) = 0.04, F (2, 6) = 8.29, post hoc Tukey test, red asterisk, p<0.05). ANOVA p values were corrected using the Holm Bonferroni method.
Fig. 4I. Data points show mean and SE for N = 10 mice during rest and running. Comparisons in each subpanel with repeated measures 2-way ANOVAs at non-zero intensities (factors: intensity x behavioral state. For Tone A: p(interaction) = 0.0002, F(3, 27) = 11.56. For Tone B: p(interaction) = 0.04, F(3, 27) = 3.85), followed by post hoc Tukey test ANOVA p values corrected using the Holm Bonferroni method. red asterisks, p < 0. 05
Fig. 4J. Data points show mean and SE for N = 10 mice during rest and running. Comparisons with repeated measures 2-way ANOVAs at non-zero intensities (factors: intensity x behavioral state. For Tone A: p(interaction)=0.01, F(3, 27) = 5.62. For Tone B: p(interaction) = 0.05, F(3, 27) = 2.90), followed by post hoc Tukey test. ANOVA p values were corrected using the Holm Bonferroni method. red asterisks, p < 0.05.
Extended Data
Extended Data Figure 1. aVR experience is coupled to locomotion.
A. Heat map showing the rate of tone presentation as a function of instantaneous stepping rate with a single paw, measured via simultaneous videography. Data points show mean tone rate and stepping rate 1 Hz (1/s) bins. Red dashed line shows linear regression through all data points. Reafferent tones during aVR experience were strongly correlated to instantaneous paw stepping rate (0.78). Data are from 3716 steps recorded from 1804 seconds of video from 2 mice.
B. The average tone presentation rate during aVR experience closely matches the average stepping rate measured either with a single paw or two paws. Dots are median and error bars are standard deviation.
C. Cumulative distance run by 11 mice over 6 to 9 days of aVR experience. Each line is for a different mouse color-coded by the reafferent frequency to which the mouse was acclimated.
D. Cumulative number of tones heard by same 11 mice as in panel C.
Extended Data Figure 2. aVR experience alters locomotion-related suppression at the level of individual neurons.
A. Fraction of neurons with elevated firing rates (magenta) and suppressed firing rates (cyan) in response to tones during rest. A roughly equal number of neurons were excited by the reafferent frequency as were excited by other frequencies.
B. Fraction of rest-responsive neurons with elevated firing rates (magenta) and suppressed firing rates (cyan) in response to tones of varying frequency during running. Nearly 50% of neurons were responsive to non-reafferent frequencies during running, whereas fewer than 25% were responsive to the reafferent frequency.
C. Heat map showing response strength (tone-evoked rate – baseline rate) for neurons responsive to the expected reafferent frequency (left) and another frequency (+2 octaves) during rest. Neurons ordered by magnitude of response independently for each heat map.
D. Response strengths of same neurons in C but during running. Neurons are re-sorted by magnitude of response. 23% of neurons retain their response to the reafferent frequency during running, consistent with a sparse representation of expected reafferent sounds.
E. Two alternative models for how locomotion-related suppression could change following aVR experience. In each model, the black curves show frequency tuning curves of three neurons during rest, red curves during running, and the green dashed line indicates the reafferent frequency. Across-neuron model: Locomotion-related suppression is uniform across frequencies within a neuron but is strongest for neurons that are strongly responsive to the expected reafferent frequency. Within-neuron model: Suppression in non-uniform at the single neuron level and regardless of how strongly the neuron responds to the expected reafferent frequency, suppression is always strongest at the reafferent frequency.
F. Five example neurons’ tuning curves measured during rest (black) and running (red). Each neuron’s best frequency (BF) is shown by the blue triangle, and the reafferent frequency to which each mouse was acclimated is shown by the green dashed line. In all five neurons, locomotion-related suppression was strong at the reafferent frequency relative to other frequencies, regardless of the neurons best frequency.
G. Neurons were sorted by their best frequency, measured relative to the reafferent frequency that each mouse experienced. Locomotion-related suppression at the expected reafferent frequency (green) and averaged across all non-reafferent frequencies (black). Regardless of a neuron’s best frequency, suppression was always strongest at the reafferent frequency, supporting the within-neuron model in panel E. Shaded regions show 95% confidence bounds estimated with a bootstrap analysis repeated 1000 times.
H. Probability of observing a minima in the gain function of individual neurons at each frequency, measured relative to the reafferent frequency. A significant number of neurons had minima in their gain functions at the expected reafferent frequency, further supporting the within-neuron model in panel E. Shaded region shows a null distribution, which we estimated by randomly assigning to each neuron a reafferent frequency rather than using the actual frequency experienced by the mouse from which the neuron was recorded. This shuffling was performed 1000 times and the 95% confidence bounds of the distribution were computed. Error bars show the 95% confidence bounds estimated from a bootstrap analysis repeated 1000 times.
Extended Data Figure 3. Specificity of suppression following aVR experience.
A. Locomotion-related gain tested at half-octave spacing from the reafferent frequency. Neuronal responses to frequencies one-half octave from the reafferent frequency were suppressed at an intermediate level. Data are mean+/−SE.
B. Current-source density triggered off of tone-onset for electrode recordings made perpendicular to the auditory cortical surface. Black dashed line demarcates putative supragranular (SG) and infragranular (IG) layers of cortex. Electrode 1 is the most superficial; electrode spacing is 100um.
C. Example tone-evoked local field potential (LFP) traces from an SG electrode (left) and an IG electrode (right) in response to the expected reafferent frequency (left) and a non-reafferent frequency (right). Locomotion-related suppression of LFP responses was stronger for the reafferent frequency compared to non-reafferent frequencies. Data are mean+/−SE.
D. The difference in LFP during rest relative to running as a function of electrode location (1 is the most superficial; electrode spacing is 100 um). Positive values indicate greater suppression during locomotion.
Extended Data Figure 4. Frequency-specific locomotion-related suppression requires several days of coupled sensory-motor experience.
A. Example sensory-motor experience during anti-coupled aVR experience. Mice did not hear tones while running, but tones were played back during subsequent resting periods with inter-tone intervals drawn from the intervals that mice should have heard while running.
B. Population PSTHs for the expected frequency (left) and for non-reafferent frequencies (right) during rest (black) and running (red) following anti-coupled aVR. Anti-copuled aVR experience does not lead to changes in auditory responsiveness during running or rest. Shaded region shows mean+/−SE. p = 0.57, two-sided Wilcoxon rank sum test.
C. Example sensory-motor experience during metronome aVR experience. Tones were presented during running at a fixed rate (2/sec) but the tone rate was not modulated by running speed.
D. Population PSTHs for the expected frequency (left) and for non-reafferent frequencies (right) during rest (black) and running (red) following metronome aVR. Metronome aVR experience does not lead to changes in auditory responsiveness during running or rest. Shaded region shows mean+/−SE. p = 0.57, two-sided Wilcoxon rank sum test.
E. Mice were acclimated to aVR for 7 days. On the day of electrophysiology, we altered on each locomotor bout the sound produced by the treadmill to be either expected (blue) or a non-reafferent frequency (2 octaves away, red). We then analyzed responses to each sound frequency during rest (R) and to the first 5 tones heard at the beginning of each bout of locomotion (L1 – L5). (i) Tone-evoked responses (population PSTHs) to the reafferent (blue) and a non-reafferent sound (red) during rest. (ii) Tone-evoked responses during locomotion to the first 5 tones in a series of the expected reafferent frequency. (iii) Tone-evoked responses during locomotion to the first 5 tones heard in a series of non-reafferent tones.
F. Firing rates to the reafferent (blue) and non-reafferent (red) reafferent sounds during rest (R) and during the first 5 tones heard during locomotion (L1 – L5). Responses to the first tone heard during locomotion were significantly suppressed only if that tone matched the expected reafferent frequency (blue asterisk, p = 0.002, two-sided Wilcoxon signed rank test). Black asterisks indicate significant differences between firing rates to the reafferent and non-reafferent reafferent sounds (L1, p=0.002; L2, p=0.03; L3, p=0.007, two-sided Wilcoxon rank sum test). Sample size: N = 4 mice, n = 74 neurons. Red n.s. indicates that evoked responses to the first tone heard during a bout of running are not significantly different from that evoked during rest for non-reafferent tones (p=0.4, two-sided Wilcoxon signed rank test).
G. Population PSTHs for the expected frequency (left) and for non-reafferent frequencies (right) during rest (black) and running (red). Data were collected from 3 mice after each mouse’s first experience hearing fixed-frequency reafferent tones for 1 hour, during which time mice heard 927, 3167 and 1069 reafferent tones at 16 kHz, 2 kHz and 16 kHz, respectively. This experience was insufficient to shift the locomotion-related suppression toward the reafferent frequency. Shaded region shows mean+/−SE. p = 0.47, two-sided Wilcoxon rank sum test.
Extended Data Figure 5. Characterizing photo-identified inhibitory neurons in auditory cortex.
A. Voltage trace of a photo-identified neuron recorded from a VGAT::ChR2 mouse in response to a 100 msec pulse of blue light targeted to the cortical surface. Inset shows example waveforms belonging to the sorted unit (black) and belonging to the noise cluster (magenta), showing good electrophysiological isolation.
B. Rasters showing response of the same neuron to 30 pulses of blue light (100 msec each).
C. Tone-evoked responses of auditory cortical inhibitory neurons (VGAT+) during rest (black) and locomotion (red) in response to reafferent (left) and non-reafferent (right) frequencies. Responses are suppressed during locomotion, but suppression is not specific to the reafferent frequency. Shaded region shows mean+/−SE. p = 0.36, two-sided Wilcoxon rank sum test.
D. pi-INs (VGAT+) that were more strongly driven by the reafferent frequency were more strongly recruited during running. Black line and shaded area show linear regression and 95% confidence bounds from a bootstrap analysis repeated 1000 times , respectively. The p-value represents the probability that the slope of the regression line includes zero, estimated from the bootstrap analysis.
E. Spontaneous firing rate during rest and locomotion for 93 putative excitatory neurons (non-photo-identified in VGAT::ChR2 mice). Filled circle shows mean. Firing rates were significantly lower during running relative to rest (two-sided Wilcoxon signed rank test).
F. Tone-evoked responses during running and rest for the reafferent frequency (blue) and non-reafferent frequencies (+/− 2 octaves, red). Dots are responses of individual neurons (n = 317), lines are linear regression, and shaded regions are 95% confidence bounds from bootstrap analysis repeated 1000 times. Suppression to non-reafferent sounds is best fit as a gain model (slope = 0.47+/−0.05; offset = −0.19+/−0.70), whereas suppression of expected reafferent tones has a stronger gain component (i.e. shallower slope, two-sided Wilcoxon rank sum test, p = 3.3×10−317) and an offset term that is significantly different from zero (slope = 0.27+/−0.4; offset = −3.55+/−0.58. two-sided signed rank test, p = 3.3×10−165). Inset shows a zoom in of the regression lines near the origin. These data suggest that suppression of expected reafferent sounds involves both divisive and subtractive forms of inhibition.
G. Responses to a non-reafferent tone in VGAT+ pi-INs recorded from aVR-acclimated mice were weakly correlated with responses to electrical stimulation in M2 (n = 75 neurons from 5 mice). These data indicate that the strong relationship between tone-evoked responses and M2 stimulation responses in auditory cortical pi-INs is distinct to the reafferent frequency. Black line and shaded area show linear regression and 95% confidence bounds from a bootstrap analysis repeated 1000 times, respectively.
H. Responses to the expected reafferent tone in put-ENs recorded from aVR-acclimated mice were correlated with responses to electrical stimulation in M2 (n = 181 neurons from 5 VGAT::ChR2 mice). These effect size for put-ENs is significantly weaker than for pi-INs. Black line and shaded area show linear regression and 95% confidence bounds from a bootstrap analysis repeated 1000 times, respectively.
I. Responses to a non-reafferent tone in VGAT+ pi-INs recorded from naive mice were weakly correlated with responses to electrical stimulation in M2 (n = 41 neurons from 2 mice). Black line and shaded area show linear regression and 95% confidence bounds from a bootstrap analysis repeated 1000 times, respectively.
J. Slope of the linear fit for the relationship shown in Fig. 3I. Error bars show 95% confidence bounds from a bootstrap analysis. Data are from regressions shown in panel Fig. 3G and Extended Data Fig. 5G-I. Slope of linear fit for PV, VGAT, SST, and All pi-INs are significantly larger than slope of linear fits for Non-reafferent and Naïve conditions (p<0.01, Wilcoxon). Bar height determined by linear fit of raw data; error bars show SE of linear fits from 1000 repetitions of bootstrap analysis.
Extended Data Figure 6. Tone detection behavior is compromised by locomotion, auditory-cortex dependent, and adapts following VR experience.
A. Data points show mean and SE detection rates for N = 4 mice as a function of tone intensity for trials performed during rest with infusion of either saline (black) or muscimol (magenta) into the auditory cortex.
B. Difference in performance as a function of intensity for each mouse (gray dots). Large connected dots show mean difference in performance and colored dots indicate intensities where performance was significantly different (p<0.05) across conditions (N = 19 mice, RM 2-way ANOVA followed by post hoc Tukey test).
C. Tone-evoked responses from putative excitatory neurons recorded from VGAT::ChR2 mouse without (black) and with (blue) simultaneous blue laser stimulation. Optogenetic activation of inhibitory neurons decreases the spontaneous and tone-evoked firing rates of excitatory neurons. (p<0.05, two-sided paired t-test)
D. Tone-evoked firing rates during rest are weaker during optogenetic activation of inhibitory interneurons. Dashed line is unity.
E. Tone detection performance during rest (black) and rest with optogenetic activation of auditory cortical inhibitory neurons (blue). Mice were worse at detecting tones on optogenetic trials (RM 2-way ANOVA, Factors: intensity x laser state, p(intensity x laser state) =0.0028, F (2, 10) = 11.23, post hoc Tukey test at individual intensities, blue asterisk, p<0.05 on laser trials) compared to rest.
F. Tone detection performance during rest (black) and rest with optogenetic activation of M2 terminals in auditory cortex (blue). Four of these mice were presented with 8kHz tones and the remaining two were presented 4kHz tones. Mice were worse at detecting tones on optogenetic trials regardless of the tone frequency. (Statistics similar to panel E, p(intensity x laser state) = 0.01, F (2, 10) = 6.66, blue asterisk p<0.05 on laser trials)
G. Average psychometric functions (N = 3mice) showing detection rates as a function of tone intensity for trials performed during rest when visual cortex was inhibited. (RM 2-way ANOVA, p(intensity x laser state) = 0.33, F (2, 4) = 1.47)
H. Average psychometric functions (N = 2 mice) showing detection rates as a function of tone intensity for trials performed during rest (black) and during rest with laser stimulation (blue) by mice were injected with an AAV encoding eGFP in M2. These controls show that laser stimulation of auditory cortex in the absence of ChR2 does not influence behavior.
I. Average psychometric functions (N = 8 mice) showing detection rates as a function of tone intensity for trials performed during rest (black) and during rest with laser stimulation (blue) when the optical fiber was placed over intact skull near, but not directly over auditory cortex. Five of 8 mice were injected with an AAV encoding ChR2 into M2, of which 3 were presented with 8kHz tones and 2 with 4kHz tones. The other three were VGAT ChR2 mice presented with 8kHz tones. These controls show that sham laser stimulation (which is visible to the mouse) alone, improves behavior. (RM 2-way ANOVA, factors: intensity x laser state, p(interaction) =0.0066, F(2, 14) = 7.35, post hoc Tukey tests, blue asterisk, p<0.05)
J. Difference in hit rates in response to Tone A relative to Tone B during rest before (Pre) and after (Post) aVR experience with Tone A. ). Lines represent mean difference and shaded regions show SE for N = 10 mice. There is no difference in rest performance before and after aVR experience. (RM 2-way ANOVA in each panel, factors: intensity x time of testing, p(time of testing) = 0.46, F(1, 9) = 0.61)
K. Difference in hit rates in response to Tone A relative to Tone B during running before (Pre) and after (Post) aVR experience with Tone A. Lines represent mean difference and shaded regions show SE for N = 10 mice. Mice are significantly better at detecting Tone B compared to Tone A after aVR experience, indicating that this is a movement-specific change. (RM 2-way ANOVA in each panel, factors: intensity x time of testing, p(time of testing) = 0.04, F(1, 9) = 8.07, red asterisk, p<0.05, p values in J and K corrected using the Holm-Bonferroni method.)
Supplementary Material
Acknowledgements
We thank Dr. Katie Tschida, Dr. Masashi Tanaka, and Dr. Dale Purves for their thoughtful and valuable comments on this manuscript and members of the Mooney lab for discussions regarding experimental design and data analysis. We thank Dr. John Pearson for his comments regarding statistical analyses. We thank Michael Booze for his expert animal care and technical support. This research was supported by an HHMI fellowship of the Helen Hay Whitney Foundation and a Career Award at the Scientific Interface from the Burroughs Wellcome Fund (D.M.S.), the Holland-Trice Graduate Fellowship in Brain Sciences (J.S.), and NIH grant 5 R01 DC013826 (R. M.).
Footnotes
The authors declare no competing financial interests.
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
- 1.Schneider David M., Nelson Anders, and Mooney Richard. “A synaptic and circuit basis for corollary discharge in the auditory cortex.” Nature 5137517 (2014): 189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Kuchibhotla Kishore V., et al. “Parallel processing by cortical inhibition enables context-dependent behavior.” Nature neuroscience 201 (2017): 62–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Zhou Mu, et al. “Scaling down of balanced excitation and inhibition by active behavioral states in auditory cortex.” Nature neuroscience 176 (2014): 841–850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rummell Brian P., Klee Jan L., and Sigurdsson Torfi. “Attenuation of responses to self-generated sounds in auditory cortical neurons.” Journal of Neuroscience 3647 (2016): 12010–12026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Flinker Adeen, et al. “Single-trial speech suppression of auditory cortex activity in humans.” Journal of Neuroscience 3049 (2010): 16643–16650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Eliades Steven J., and Wang Xiaoqin. “Sensory-motor interaction in the primate auditory cortex during self-initiated vocalizations.” Journal of neurophysiology 894 (2003): 2194–2207. [DOI] [PubMed] [Google Scholar]
- 7.Singla Shobhit, et al. “A cerebellum-like circuit in the auditory system cancels responses to self-generated sounds.” Nature Neuroscience 207 (2017): 943–950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Curio Gabriel, et al. “Speaking modifies voice‐evoked activity in the human auditory cortex.” Human brain mapping 94 (2000): 183–191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Keller Georg B., and Hahnloser Richard HR. “Neural processing of auditory feedback during vocal practice in a songbird.” Nature 4577226 (2009): 187. [DOI] [PubMed] [Google Scholar]
- 10.Eliades Steven J., and Wang Xiaoqin. “Neural substrates of vocalization feedback monitoring in primate auditory cortex.” Nature 4537198 (2008): 1102. [DOI] [PubMed] [Google Scholar]
- 11.Houde John F., and Jordan Michael I. “Sensorimotor adaptation in speech production.” Science 2795354 (1998): 1213–1216. [DOI] [PubMed] [Google Scholar]
- 12.Mifsud Nathan G., and Whitford Thomas J. “Sensory attenuation of self-initiated sounds maps onto habitual associations between motor action and sound.” Neuropsychologia (2017). [DOI] [PubMed] [Google Scholar]
- 13.Moore Alexandra, and Wehr Michael, “Parvalbumin-expressing inhibitory interneurons in auditory cortex are well-tuned for frequency.” Journal of neuroscience 3334 (2013): 13713–13723 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Fino Elodie and Yuste Rafael. “Dense inhibitory connectivity in neocortex.” Neuron 696 (2011): 1188–1203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Znamenskiy Petr, et al. “Functional selectivity and specific connectivity of inhibitory neurons in primary visual cortex.” bioRxiv (2018): 10.1101/294835 [DOI] [Google Scholar]
- 16.Williamson Ross S., et al. “Locomotion and task demands differentially modulate thalamic audiovisual processing during active search.” Current Biology 2514 (2015): 1885–1891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Nelson Anders, et al. “A circuit for motor cortical modulation of auditory cortical activity.” Journal of Neuroscience 3336 (2013): 14342–14353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Nelson Anders, and Mooney Richard. “The basal forebrain and motor cortex provide convergent yet distinct movement-related inputs to the auditory cortex.” Neuron 903 (2016): 635–648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Wilson Nathan R., et al. “Division and subtraction by distinct cortical inhibitory networks in vivo.” Nature 488 (2012): 343–348 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Leinweber Marcus, et al. “A sensorimotor circuit in mouse cortex for visual flow predictions.” Neuron 956 (2017): 1420–1432. [DOI] [PubMed] [Google Scholar]
- 21.Wolpert Daniel M., Ghahramani Zoubin, and Jordan Michael I. “An internal model for sensorimotor integration.” Science (1995): 1880–1882. [DOI] [PubMed] [Google Scholar]
- 22.Keller Georg B., Bonhoeffer Tobias, and Hübener Mark. “Sensorimotor mismatch signals in primary visual cortex of the behaving mouse.” Neuron 745 (2012): 809–815. [DOI] [PubMed] [Google Scholar]
- 23.Froemke Robert C., Merzenich Michael M., and Schreiner Christoph E. “A synaptic memory trace for cortical receptive field plasticity.” Nature 4507168 (2007): 425. [DOI] [PubMed] [Google Scholar]
- 24.Froemke Robert C., et al. “Long-term modification of cortical synapses improves sensory perception.” Nature neuroscience 161 (2013): 79–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.McGinley Matthew J., David Stephen V., and McCormick David A. “Cortical membrane potential signature of optimal states for sensory signal detection.” Neuron 871 (2015): 179–192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Weiss Carmen, Herwig Arvid, and Schütz-Bosbach Simone “The self in action effects: Selective attenuation of self-generated sounds.” Cognition 1212 (2011): 207–218. [DOI] [PubMed] [Google Scholar]
- 27.Franklin KB, and Paxinos George. The mouse brain in stereotaxic coordinates, compact The coronal plates and diagrams. Amsterdam: Elsevier Academic Press, 2008. [Google Scholar]
- 28.Rummell Brian P., Klee Jan L., and Sigurdsson Torfi “Attenuation of responses to self-generated sounds in auditory cortical neurons.” Journal of Neuroscience 3647 (2016): 12010–12026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Glickfeld Lindsey.L., Histed Mark.H., and Maunsell John. H. R. “Mouse primary visual cortex is used to detect both orientation and contrast changes.” Journal of Neuroscience 3350 (2013): 19416–19422. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.