Summary
Perceptual decision-making tasks are essential to many fields of neuroscience. Current protocols generally reward deprived animals with water. However, balancing animals’ deprivation level with their well-being is challenging, and trial number is limited by satiation. Here, we present electrical stimulation of the medial forebrain bundle (MFB) as an alternative that avoids deprivation while yielding stable motivation for thousands of trials. Using licking or lever press as a report, MFB animals learnt auditory discrimination tasks at similar speed to water-deprived mice. Moreover, they more reliably reached higher accuracy in harder tasks, performing up to 4,500 trials per session without loss of motivation. MFB stimulation did not impact the underlying sensory behavior since psychometric parameters and response times are preserved. MFB mice lacked signs of metabolic or behavioral stress compared with water-deprived mice. Overall, MFB stimulation is a highly promising tool for task learning because it enhances task performance while avoiding deprivation.
Keywords: behavior, perceptual decision-making, audition, perception, motivation, Go/No-go, satiation, reinforcement learning, mouse, medial forebrain bundle, licking, lever pressing
Graphical abstract

Highlights
-
•
Medial forebrain bundle (MFB) stimulation can be used as a reward in mouse behavior studies
-
•
Training with MFB stimulation improves performance on perceptual discrimination tasks
-
•
MFB-stimulated mice perform thousands of trials per session without motivation loss
-
•
MFB stimulation reduces signs of stress compared with deprivation protocols
Motivation
Training animals on decision-making tasks to measure their perceptual abilities and match this to ongoing neural activity is an essential part of integrative neuroscience. This can be achieved by depriving animals to motivate them to respond for a reward such as water or food. This limits sessions to a few hundred trials per day before animals reach satiation and requires carefully balancing deprivation with animal well-being. Therefore, we developed intracranial stimulation of the MFB as a reward because this does not lead to satiation and avoids deprivation.
Verdier et al. show that MFB stimulation can be used as a reward to train mice on perceptual discrimination tasks without deprivation. With this approach, mice displayed better performance, which they maintained for up to 4,500 trials per session.
Introduction
Probing the sensory abilities and decision-making processes of animals in order to model these faculties and link them to neural activity is central to systems neuroscience. This relies on training animals to perform a range of instrumental tasks. In mice, the most common protocols deprive animals of water to motivate them to lick for small water rewards. Although this approach has enabled multiple successful studies, it suffers from certain experimental drawbacks. First, water deprivation requires careful and time-consuming monitoring of water consumption in order to avoid discomfort to animals while maintaining their motivation. Despite well-established procedures, motivation levels can be hard to control depending on the individual animal, the housing conditions, or the experimental environment, leading to variable results. Finally, once animals reach satiation, they stop performing, thus limiting the number of trials per session. This can impede the possibility of probing fine psychophysics1 and curtail analysis through lack of data, particularly the ability to link noisy neural data to behavior and perception.
Here we propose a novel learning paradigm that addresses these issues by using intracranial stimulation of the medial forebrain bundle (MFB) as a reward. After the remarkable observation that stimulation in the hypothalamic region caused rats to return to the location where they had received the stimulation,2 a flurry of research identified the MFB as the region evoking maximal self-stimulation without adverse effects.3 This fiber bundle courses via the lateral hypothalamus between multiple regions of the brain’s reward system, most notably the ventral tegmental area and the nucleus accumbens (NA). This self-stimulating effect has mainly been used in order to study the neural underpinnings of motivation itself, but some studies have used this rewarding effect to perform spatial conditioning4,5,6 or to study reward valuation.7 This suggested to us that this method could provide a reward system without the aforementioned drawbacks of satiation or prior deprivation.
Here we demonstrate that MFB stimulation can be used in head-fixed mice to flexibly train animals on auditory discrimination tasks with high and stable performance over thousands of trials per session. We compare performance and learning with that of mice trained on the same tasks using water deprivation and find evidence for similar learning rates as well as more reliable learning on hard tasks. Moreover, we show that MFB stimulation reduces physiological and behavioral stress in comparison to water deprivation. Therefore, training with MFB stimulation can contribute to reducing the impact of experiments on animal welfare, when compatible with the underlying scientific question. Overall, this technique provides an easy-to-implement alternative to water deprivation with demonstrable experimental advantages.
Results
MFB-rewarded mice can learn auditory discrimination tasks
The MFB is a long fiber tract of around 400 μm in diameter that can be targeted with standard stereotaxic procedures for stimulation with a bipolar electrode (Figure S1). The implantation surgery is short (∼1 h) (Figure 1A), and the small size of the electrode allows easy combination of MFB stimulation with head posts, large cranial windows (Figure 1B), and/or electrophysiology. In some mice (n = 16), we tested the rewarding effect of MFB stimulation using a conditioned object preference assay (Figures S2G–S2I and Video S1) in which stimulation was manually elicited when mice contacted a cotton swab in the cage. All mice showing conditioned preference for the cotton swab were classified as having a “functional MFB,” defined as successful acquisition of the report behavior for the perceptual task (defined in STAR Methods, Figure S2). This first step allows an efficient triage of animals after surgery. Post-hoc histology showed that the absence of conditioning was due to large electrode targeting errors (>400 μm, Figure S1). However, small targeting errors (<400 μm) had little impact on the rewarding effect of the stimulation or on the success of the MFB-driven discrimination task (Figure S1). This was an important factor in the high success rate of our implantation surgeries. Among all implanted mice in our lab, 87% (82/94) experienced functional MFB-based reinforcement (Figures S2A and S2D). A detailed breakdown of mice included in all experimental groups is given in Table S1.
Figure 1.
Protocol for perceptual discrimination learning with MFB reward
(A) Medial forebrain bundle (MFB) implantation site of bipolar electrode fixed with dental cement to skull.
(B) Photo of head-fixed mouse showing head post and MFB electrode (left) and mouse with 5-mm cranial window over the auditory cortex, head-post, and MFB electrode (right), illustrating that the small MFB electrode can be combined with recording methods.
(C) Structure of the Go/No-go step of the discrimination task.
(D) Scheme of head-fixed mouse during task: pressing the lever with the forepaw after Go sound elicits stimulation via the implanted electrode.
(E) Calibration curve for an example mouse. Measured probability of pressing the lever during 1.5-s response window after Go sound for MFB stimulation of different voltages (black dots) and sigmoid curve fit to data (gray line). Arrow indicates voltage used for learning.
(F) Traces of lever press for 30 Go and 30 No-go trials from an example mouse after training (top) and the mean lever press trace for all trials (bottom). For each Go trial, the time of reward (50 ms after lever press) is shown by a green dot, and for each No-go trial, trials punished by a time out are indicated by a red dot. The distribution of reward timings is shown in green.
(G–I) As in (D)–(F) for licking.
In this study, we established that MFB stimulation can be used as a reinforcement signal for sound discrimination learning (Figure 1C) using two types of reporting motor actions often used in sensory discrimination tasks: the pressing of a lever or the licking of a spout (Figures 1D–1I). The first step of the training (Free reward) was to elicit MFB stimulation as soon as mice spontaneously licked or pressed the lever in order to associate report behavior to reward (Figures S2B and S2E). The second training step (Go training) was to condition reward on licking/pressing during a 1.5-s window following sound presentation to associate the stimulus and the reward (Figures S2C and S2F). Licking report behavior was acquired on the first day, whereas lever pressing required several sessions since it is a complex motor behavior8 (Figures S2A–S2F). It is important to note that mice spontaneously licked for MFB reward without any prior water deprivation to motivate licking. Voltage was initially set at 3 V. However, to optimize learning, once the basic behavioral report had been acquired, we calibrated the minimal voltage that elicited a robust response, and we used this calibrated level during subsequent training (Figures 1E and 1H). This avoided possible side effects such as excess movement or impulsivity while ensuring a robust reward.
Mice were then trained on the Go/No-go step of the task (Figures 1F and 1I). They had both to respond to the Go sound and to refrain from pressing following a No-go sound. False alarms (FAs) were punished by a 5–7 s timeout. Each trial was separated by a random interval during which mice had to restrain from pressing to initiate the next trial. (See Videos S2 and S3 for mice performing licking or lever pressing report). These steps, along with the detail of the auditory tasks discussed below, are shown in Figure S3. In order to facilitate the use of this protocol, we supply on our Github PAGE (https://github.com/AntoninVerdier/MFBasic) designs for the 3D models as well as instructions and code to implement a simple version of the task.
Fast and reliable learning with MFB reward
In order to assess learning using the MFB procedure, we trained mice on three auditory discrimination tasks (full protocol presented in Figure S3). To provide a reference for comparison, we also trained mice on similar tasks using a classical water deprivation protocol. Water-deprived mice were trained to lick for a reward. MFB-deprived mice were trained either to lick or to press a lever to get the reward. Stimulus contingencies, training steps, experimenters, and training apparatus were otherwise identical. In all tasks the valence of the sounds was counterbalanced across mice.
We first trained the three groups on a classical auditory frequency discrimination task (Pure tone vs. Pure tone: PTvsPT). Mice were trained to discriminate between two pure tones of low and high frequency (6 and 16 kHz respectively). The task was considered to be learnt when accuracy was above 80% over a full session. No difference in the speed of learning was observed between the three groups, either in terms of the number of trials (Figures 2A and S4) or the number of sessions (Figure 2D).
Figure 2.
MFB-rewarded mice learn tasks at equal rates to water-deprived mice and reach higher peak accuracy on hard tasks
(A–C) Learning curves for water-deprived (blue) and MFB-rewarded (red and yellow) mice performing the pure tone (PTvsPT) discrimination task ([A] n = 7, 8, 8), the pure tone psychometric task (PTvsPT-PS) requiring classification of high and low pure tones ([B], n = 7, 8, 8), and the pure tone vs. frequency modulated (PTvsFM) discrimination task ([C], n = 7, 4). Shaded areas indicate SEM. See Figure S4 for independent evolution of Go vs. No-go accuracy.
(D–F) Number of days necessary to reach criterion accuracy for water-deprived and MFB-rewarded mice for the three tasks as in (A)–(C) (Wilcoxon rank-sum test, water vs. MFB lever p values: 0.74, 0.019, 0.29, rank-sum statistic: 52.5, 79, 48; water vs. MFB lick p values: 0.37, 0.004, rank-sum statistic: 48,75; n = 7, 8, 8; 7, 8, 9; 7, 4). The vertical bar in (F) indicates that variability was higher for PTvsFM task for water-deprived mice (Levene test, p value: 0.0156, Levene statistic: 8.8). For water-deprived mice that never reached criterion in the PTvsFM task, the maximum day of training is shown as a cross.
(G) Average psychometric function for pure tone frequency classification, drawn from the best 800 consecutive trials of each mouse for the water-deprived and MFB-rewarded mice (n = 7, 8, 8). Shaded areas indicate SEM.
(H) Accuracy of the best session for water-deprived and MFB-rewarded mice for the PTvsFM discrimination task. (Wilcoxon rank-sum test, p value: 0.02, rank-sum statistic: 30, n = 7, 4).
See also Video S3.
Following learning on this first task, mice were introduced to a psychometric task (Pure tone vs. Pure Tone Psychometry: PTvsPT-PS) in which they were required to categorize 16 logarithmically scaled sounds into low and high frequency categories (threshold frequency: 9,800 Hz). As in the previous task, the learning rate did not differ between groups in the number of trials to reach the criterion. However, the number of experimental sessions necessary was higher for the water-deprived group (Figures 2B and 2E) given that MFB-rewarded mice perform more trials per session (see below). All groups reached 90% accuracy and yielded the same psychometric curves (Figure 2G). MFB, therefore, did not alter the measured acoustic abilities of mice that can be probed independently of the reward type once the task has been learnt.
Finally, we checked that the MFB reward could also support learning in a harder task. We trained a sub-group of four of the MFB lever press mice and the group of seven water-deprived mice to discriminate between a pure tone and a frequency sweep with identical start frequency (Pure tone vs. Frequency Modulation: PTvsFM). This task is known to be more demanding9 since mice cannot make a decision based on the salient sound onset but must integrate the sound over time in order to reach a decision. All MFB-rewarded mice learnt the task within eight training sessions whereas water-deprived mice showed significantly more variable learning rates, and some animals never reached the learning criterion even after 30 training sessions (Figures 2C, 2F, and S4). Moreover, peak accuracy was significantly higher in MFB-rewarded mice (Figure 2H). Despite the small sample size and the difference in report behavior (lever vs licking, see discussion), this suggests that MFB stimulation may provide more stable motivation in increasingly complex tasks, enabling faster acquisition at the cohort level.
These results indicate that the MFB stimulation technique allows for reliable learning in a range of auditory tasks with different behavioral reports.
MFB yields stable performance for thousands of trials per session
When training relies on water rewards, satiation limits the number of trials performed in a single day or behavioral session. In our experiments, we observed a systematic decline in Go response accuracy during the session, which is likely due to satiation (Figures 3A and 3B). We therefore generally limited animals to 320 trials per day and never went above 640 trials. More generally, current reports indicate that the number of trials performed in a single session by water-deprived mice compatible with sustained motivation can vary between a few hundreds of trials and a maximum of 1,000 trials in some cases depending on the laboratory and on protocol parameters (see literature search in Table 1).
Figure 3.
MFB-rewarded mice perform thousands of trials per session
(A) Example sessions accuracy on Go (green) and No-go (red) trials for a water-deprived and an MFB lever-pressing mouse in the PTvsPT task. Go accuracy drops for the water-deprived mouse after 240 trials, whereas the MFB-rewarded mouse is still performing above the criterion after 2,000 trials.
(B and C) Average of Go (green) and No-go (red) accuracies for water-deprived (B) and MFB lever-pressing (C) mice in the PTvsPT task. For MFB mice, in order to cumulate sessions of varying duration (600–2,000 trials), time was normalized between 0 and 1 (n = 7, 8). Shaded areas indicate SEM.
(D) Trial count of several sessions in the PTvsPT-PS task (>80% accuracy) with varying trial duration. The black line corresponds to the maximum possible number of trials mice could reach given the session duration (supposing no punishment intervals or delayed trials by licking outside of trials). Shaded areas provide the range of trial numbers found with other strategies in the literature: optogenetic stimulation of dopaminergic fibers (DA),10 water deprivation,11 and water deprivation by adding citric acid to drinking water.12
(E) Accuracy over 200 trials for different trial speeds for MFB-rewarded mice performing the PTvsPT task by lever pressing or licking. Note that 500-ms inter-trial intervals provide the same accuracy as 5000-ms-spaced trials (n = 4, 6).
(F) Trial count for 4-h session with 2-s ITI for MFB-rewarded mice performing the PTvsPT task by lever pressing or licking. Each point is one session (lever: n = 8 sessions from n = 6 mice, lick: n = 9 sessions from n = 6 mice).
Table 1.
Comparison of reward systems for training in the mouse with number of trials per session
| Article | Reward | Action | Sensory modality | Task | Trials per session |
|---|---|---|---|---|---|
| Jaramillo et al.13 | Water | nose poke in freely moving mouse | auditory | TWFC with change in stimulus/action contingency | median: 600 (300–1,000) |
| Guo et al., 201414 | water with sucrose | licking in head-fixed mouse | somatosensory | TWFC | 400–600 |
| Goltstein et al., 201815 | soymilk (food deprivation) | licking in head-fixed mouse | visual | TWFC | 250 |
| Urai et al., 202112 | citric acid water | wheel movement in freely moving mouse | visual | TWFC | 750 |
| Berditchevskaia et al., 201616 | water reward and air puff punishment | licking in head-fixed mouse | visual | Go/No-go | 100–300 |
| Poort et al.17 | Water | licking in head-fixed mouse | visual | Go/No-go | mean: 199 (31–342) |
| The International Brain Laboratory et al., 202012 | Water | wheel movement in head-fixed mouse | visual | TWFC | median: 700, up to 1,000 |
| Birtalan et al.18 | water (fully automated) | nose poke in freely moving mouse | visual | five-choice serial reaction time task | mean: 753 |
| Burgess et al., 201710 | optogenetic stimulation of dopaminergic neurons in VTA | wheel movement in head-fixed mouse | visual | TWFC | median: 800, likely up to 1,000 |
| This study | MFB | licking/lever pressing in head-fixed mouse | auditory | Go/No-go | mean: 3,800 |
TWFC, Two way forced choice; VTA, ventral tegmental area
Using MFB reward, we observed during training that mice did not show signs of an irreversible drop in motivation (Figures 3A, 3C, and 3D). The number of trials was not limited by the animal’s motivation but by experimental time constraints. For example, in the psychometric task (PTvsPT-PS) and for sessions ranging between 1 h 30 min and 3 h after learning (accuracy >80%), the number of trials scaled with the session duration, without loss of accuracy and reached up to 2,500 trials. The number of trials per session can be further increased by reducing the inter-trial interval. Varying this parameter in trained MFB-rewarded mice during the PTvsPT task, we observed that mice can maintain high accuracy even when performing up to 30 trials/minute, with an average trial window of 1,500 ms and average total inter-trial interval (randomized no press window + inter-trial interval [ITI] durations, see STAR Methods) of 500 ms (Figure 3E). This is likely enabled by the high and stable motivation of the mice but also the lack of time required for reward consumption (i.e., licking and drinking of water). Using these optimized conditions, we evaluated the maximum number of trials that could be obtained in a 4-h duration using a fast 2-s ITI to provide a benchmark for the novel MFB approach. We found that mice performed on average 3,800 trials, with some mice achieving more than 4,500 trials, a far larger number than typically reported in the literature (Figure 3D and Table 1). Mice maintained high and consistent accuracy throughout these sessions (91% ± 3% lever press, 93% ± 6% licking) (Figure 3F, example sessions Figure S5).
These results show that MFB allows us to train mice to perform very large numbers of trials with high performance. This provides opportunities to accelerate psychometric measurements by probing, within a single session, many repeats of multiple stimulus properties or eventually to decrease the number of training days by increasing the number of trials per day.
MFB provides strong motivation without excess movement, impulsivity, or signs of stress
The repeated use of intracranial stimulation could lead to detrimental effects on behavior. To address this possibility, we first assessed whether the MFB led to agitated behavior during the task, which could result from excessive activation of the dopaminergic system. We quantified the level of body movement with a piezoelectric sensor placed under the animal as a proxy for agitation, which can be driven by both the motor response and the reward. We contrasted hit and FA trials during which the mouse executed the same motor action to measure reward-induced movement. In all cases, the response led to an increase in movement that however subsided more quickly during the trial in MFB-trained mice, likely because they did not initiate long lick trains for water consumption (Figure 4A). Both water and MFB reward led to an increase in movement due to reward. This increase was however much smaller than that induced simply by the animal’s response itself and of comparable amplitude for MFB- and water-rewarded mice (Figure 4A).
Figure 4.
MFB-stimulated mice show no signs of increased agitation or impulsivity
(A) Movement level for hit and false alarm (FA) trials during the psychometrics task. The difference between the two curves is indicative of the movement induced by the reward since hit trials contain movement induced by the response (licking/lever press) and reward (water delivery or MFB stimulation), whereas FA trials only contain movement induced by the response. The movement level for each mouse is normalized by the peak of FA movement, and therefore the absolute levels of movement between MFB and water cannot be compared (see STAR Methods for details, n = 4, 8, 8, error bars are SEM).
(B) Average accuracy on Go and No-go trials in psychometrics task during the three best sessions for water-deprived (left) and MFB-rewarded lever-pressing and licking mice. Note that all groups are biased toward Go-responding, although the effect is weaker in lever-pressing MFB mice (Wilcoxon signed rank test, p values = 0.04, 0.19, 0.02; signed rank statistic: 26, 28, 34; n = 7, 8, 8).
(C) Ratio of Go to No-Go accuracy showing that all groups have similar levels of impulsivity based on this measure. (Wilcoxon rank-sum test, p values: 0.12, 0.77, 0.19; rank-sum statistic: 70, 59, 55; n = 7, 8, 8).
(D) Average wait period between trials for water-deprived and MFB-rewarded mice that captures the tendency of mice to lick/lever press during the wait period. MFB-rewarded mice are significantly more restrained during the wait period (Wilcoxon rank-sum test, p values: 6.2 E-4, 1.5 E-4, 0.003; rank-sum statistic: 98, 100, 41; n = 7, 8, 8).
To evaluate whether MFB reward was associated with more impulsive behavior, we measured two classical indicators of impulsivity in the Go/No-go paradigm19: FA probability and premature responses during the ITI. As expected, mice showed worse accuracy on No-go trials than on Go trials, although this difference was not significant in MFB lever-pressing animals (Figures 4B and 4C). No significant difference between groups was observed for Go/No-go accuracy ratios (Figure 4C). However, irrespective of whether the report was licking or lever pressing, inter-trial wait periods were shorter in MFB-rewarded mice than in the water deprivation task (Figure 4D). This indicates a reduction of pre-trial impulsivity because in our protocol the inter-trial wait period is prolonged when impulsive licks or lever press are detected (see STAR Methods).
We also investigated signs of physiological or behavioral stress. Mice in the water deprivation protocol underwent an initial and sustained weight loss as well as a progressive loss of weight every week before the weekend when they had 1 day of ad libitum access to water (Figures 5A and 5B). MFB-rewarded mice however showed a trend toward increasing weight as expected by aging mice and stable weight during the week (Figures 5A and 5B). We also assessed activity levels in the home cage during the night using a running wheel. Running duration was similar in MFB-rewarded mice and in control animals left in their home cages without any training. Water-deprived mice spent less time active, as previously shown (Figures 5C and 5D).15 To directly measure stress levels, we performed an open field test prior to any behavioral training and 12 weeks later. In both groups, we found a reduction in exploratory behavior after training. However, reductions both in distance traveled and entries in the field center were significantly stronger in water-deprived mice compared with MFB-rewarded mice (Figures 5E and 5F). The behavioral change common to both groups may be due to the aging of mice,20 the test-rest effect,21 or the repeated schedule of head fixation and training, although the literature suggests that head fixation itself does not modify behavior in the open field.22
Figure 5.
Improved mouse well-being using MFB stimulation relative to water deprivation protocol
(A) Weight evolution starting from the week prior to any behavioral experiments (isolated point) and across the first 5 weeks during which mice were trained five days a week. (n = 7, 8 mice, error bars are SEM). In this panel and following, only data from MFB lever-pressing mice were collected.
(B) Weight evolution averaged across days of the week (n = 7, 8 mice, error bars are SEM).
(C) Top: temporal distribution of periods of activity during the lights-off period for each session for a cage containing two animals (black bar indicates a period during which wheel running was detected). Bottom: mean activity throughout the lights-off period for MFB and water-deprived mice (blue and red) with the mean activity of control mice (gray) with no interventions overlayed.
(D) Percent of time spent active throughout the whole night for MFB and water-deprived animals as well as control animals with no interventions (Wilcoxon rank-sum test, p values: 0.007, 0.02; rank-sum statistic: 73, 58; n = 7, 7, 6 sessions).
(E) Superposed trajectories for mice during the open field session.
(F) Ratio of total distance traveled (left) and number of entries into the center of the field (right) before and after training during 5-min exploration of the open field (Wilcoxon rank-sum test, p values: 0.039, 0.028; rank-sum statistic: 83, 89; n = 7, 8 mice).
Overall, we have shown that the use of the MFB stimulation does not induce any motor agitation or any increase in impulsive behavior relative to the water deprivation approach. Moreover, it attenuates the impact of learning protocols on markers of stress relative to water deprivation. The protocol is therefore compatible with the experimental demands of well-controlled and calm behavior and can contribute to increased animal well-being.
Response latency and discrimination times in MFB-stimulated mice
We finally assessed whether the MFB reward task reproduces subtler aspects of behavior such as response times. In a recent report, it was shown that discrimination time is longer for the difficult PTvsFM task than for the easier PTvsPT task.9 We applied the same methodology to our data to estimate the earliest time at which the Go and the No-go sound could be significantly distinguished based on the animal’s response (Figure 6A illustrates the method). Consistent with previous observations established using water deprivation, we found longer discrimination times for the PTvsFM task for both MFB reward and water deprivation (Figures 6B and 6D). Moreover, there was a good quantitative match between the two methods despite using differences in reward and motor response (median for MFB and water: PTvsPT, 240/220ms; PTvsFM, 490/580ms).
Figure 6.
Similar task dependance of discrimination times for MFB and water-reward protocols
(A) Average lever-press probability for PTvsPT and PTvsFM tasks across all Go and No-go trials for MFB-rewarded mice. Arrow shows the estimated discrimination time defined as the earliest time at which p value of a t test for the difference between the Go and No-go traces (bottom panel) is below 0.01. Shaded areas are SEM.
(B) Discrimination time as defined in (A) for each animal. (n = 8, 4 Wilcoxon rank-sum test, p value = 0.03; rank-sum statistic: 23).
(C) Average lick probability for PTvsPT and PTvsFM tasks across all Go and No-go trials for water-deprived mice.
(D) Discrimination time, defined as earliest time where Go and No-go lick signals are significantly different, for the PTvsPT and PTvsFM tasks (n = 7, 4; Wilcoxon rank-sum test, p value = 0.001; rank-sum statistic: 23; only animals that reached criterion accuracy for PTvsFM task are included).
We also found in MFB-rewarded mice that response times were significantly shorter for FA No-go trials than for Go trials (Figures S6A and S6B) in accordance with the literature.16,23,24 We reasoned that the difference was driven by early, non-discriminative responses perhaps associated with an excessive motivation to perform reward collection. If this is the case, we would expect behavioral epochs with short lever press latencies on Go trials to be associated with low discrimination accuracy (i.e., high FA rates). Indeed, we found that FA rate and lever press latencies in Go trials were positively correlated (Figures S6C–S6E).
These results exemplify how our MFB paradigm can be easily transferred to the study of perceptual discrimination since it reproduces known results from the literature including subtler aspects of behavior such as temporal dynamics. Therefore, our direct stimulation of the reward system appears to leave intact many of the core mechanisms underlying perception and decision-making, allowing for their continued study with this new approach.
Discussion
By implementing stimulation of the MFB as a reward in mice, we have shown in various auditory behavioral tasks that this approach provides an efficient and high-yield protocol for learning without changing psychometric measures (Figure 2G) or response dynamics (Figure 6). MFB stimulation allows us to train animals for thousands of trials per session without loss of motivation, which can reduce the number of training days, especially for more difficult tasks (Figure 2). MFB stimulation removes the need to adjust deprivation levels to control motivation. This more easily yields a fast and reliable learning outcome across animals and high steady-state performance level as we observed when comparing our MFB stimulation and water deprivation protocols in a difficult task (Figure 2). MFB-rewarded mice also showed reduced impulsivity and signs of stress compared with their water-deprived counterparts, which have previously been reported as having modified behavior.25,26
The strongest advantage of MFB stimulation is the lack of satiation, providing high and stable motivation throughout long sessions. This increase in trial counts per session is an asset for psychometric measurements to increase the number of stimulus variations tested, and also for protocols involving the study of neural activity. Indeed, it allows us to collect data from the same neurons while providing the freedom to examine neural responses to variations in stimuli or task contingency. It is therefore important to stress that MFB stimulation is compatible with recordings of neural data as shown in the literature.4,5 Moreover, we observed no stimulation-induced agitation, which is important when combined with imaging techniques that require field-of-view stability.
MFB stimulation provides a novel methodology to train mice in conditions that are conducive to their well-being, when compatible with the underlying scientific question. In particular, the approach refines current protocols by minimizing the impact of the experiment on mice’s daily activities since MFB mice are not deprived in any way. The drawback of the MFB protocol is that it requires the implantation of an extra electrode during head-post surgery. In behavioral experiments that do not require surgeries, the potential stress of this additional procedure may counterbalance the benefits.
There are limitations in the comparisons we made between MFB stimulation and water-deprived protocols. Although the two tasks were operated by the same experimenters, we cannot ensure that all aspects of the protocols were similarly optimized. In MFB-rewarded mice, we have tested two motor reports (lever pressing and licking) and only one for water-deprived mice. Lever pressing requires motor learning8 contrary to licking. This requirement slowed learning of sound-response association phase (Go training) for lever pressing compared with licking, and it may have also an impact on learning rates in the discrimination task. Another difference between the protocols is that the water-deprived animals discriminated Go from No-go stimuli by exceeding a threshold on the number of licks, whereas MFB-rewarded mice were trained to produce an all or none response (one lever press/one lick). This is also likely to impact learning speed. With the criterion used for water-deprived mice being more permissive with respect to impulsivity, it is likely that water-deprived mice were given an easier protocol than MFB mice. We therefore believe that the conclusion that MFB stimulation provides a more stable setting enabling faster acquisition at the cohort level in difficult tasks remains valid in practice, despite implementation discrepancies.
A potential issue with direct stimulation of the reward system is that it is a less ethologically relevant motivation than water or food and therefore could distort both behavioral and neural activity. It is important to bear in mind that states of deprivation, although ethologically relevant, are not “neutral” states. They induce metabolic changes,26 most obvious in weight loss and behavioral stress, as we measured (Figures 5E and 5F), and brain-wide changes in neural activity.25 A recent study showed that food deprivation induces energy-saving changes in neural coding that lead to a slight impairment of visual discrimination both at the neuronal and behavioral performance level.27 These findings are likely also relevant for water restriction since it leads to reduced food intake. In this study, we have shown that psychometric performance (Figure 2) and response timing (Figures 6 and S6) are very similar between water and MFB reward. Therefore, for the majority of studies focused on sensory or motor behaviors and not motivation or reward per se, MFB stimulation is unlikely to modify the results of interest and provides a way to avoid the side effects of deprived states.
MFB stimulation evokes dopamine release in the NA28 and pharmacologically increasing or reducing dopamine levels increases or reduces MFB stimulation efficacy respectively.29 Although the MFB contains dopaminergic fibers essential for instrumental behavior,30 it has been shown that the physiology of MFB self-stimulation is incompatible with a direct recruitment of these fibers to drive reward.31 Instead, the rewarding level of stimulation is best predicted by the level of direct activation of another set of currently undefined, descending, myelinated fibers in the MFB. It is hypothesized that these fibers indirectly drive dopamine release.31
Fine comparisons of optogenetic dopaminergic release with MFB stimulation highlight differences between the two paradigms,32 and large forebrain lesions including the NA leave MFB self-stimulation intact.30 The MFB also houses glutamatergic, monoaminergic, and GABAergic fibers that extend to around 50 different brain areas that are both antidromically and orthodromically activated upon stimulation.31 Therefore, a complex array of neural systems beyond the NA participate in the effect of MFB stimulation, and they remain to be dissected.33,34
MFB stimulation is clearly distinguished from other rewards by the absence of satiety and the remarkable stability of its rewarding valence. Factors that lead to satiation for other rewards (e.g., sucrose accumulation in the gut for food reward) do not impact MFB stimulation efficacy.35 The MFB reward can lead to seemingly “pathological” behaviors with large stimulation intensity, during which animals prefer to self-stimulate for MFB reward over all other choices, leading to maladaptive behaviors such as self-starvation (preference of MFB stimulation over food).36,37,38 Importantly, these behaviors only occur with ad libitum access to strong MFB stimulation. We therefore did not witness any such behaviors in MFB-rewarded mice trained in our lab. Calibration of voltage (Figures 1D and 1G) to use the minimal efficient voltage is likely beneficial to reduce such negative consequences. In line with this, parametric studies that evaluate the impact of different factors on response-intensity curves similar to those used in our calibration protocol show that MFB reward in fact can compete with food reward and the rewarding effect of both MFB and food can summate.39 These results suggest that the MFB stimulation and more ethological rewards are evaluated along shared dimensions.
The commonalities between MFB stimulation and other rewards are supported by neuronal recordings. First, during self-stimulation the evoked dopamine release is reduced when stimulation is expected.40,41 Therefore dopamine release is maximal when reward is unexpected in accordance with reward prediction error as found for other rewards.42 Second, neurons in the hypothalamus respond with similar changes in firing pattern to MFB stimulation and to glucose.43 Third, changes in brain reward circuitry such as chronic food restriction or leptin administration impact both MFB stimulation and food intake.44
However, MFB stimulation has a wider range of effects than ethological rewards. Stimulation that is not contingent on reward invigorates responding,45 and post-training simulation can enhance memory.46 Although these extra effects may positively impact learning in sensory stimulation tasks, experimenters should bear in mind that transitioning from water or food reward to MFB stimulation is therefore not just a change in reward paradigm but may have broader impacts.
Several other studies have proposed refinements to the water deprivation protocol. Adding sucrose to water can increase the number of trials,14 and replacing the animal’s drinking water with ad libitum access to citric acid water allows for a less impactful but still efficient version of the deprivation protocol.12,47 However, even with these improvements, mice still undergo weight loss due to deprivation, and deprivation methods so far do not allow experimenters to obtain more than a thousand trials per session (Figure 3D and Table 1). Recently, Burgess et al.10 demonstrated that optogenetic activation of dopaminergic neurons in the ventral tegmental area could be used as a very efficient reward system. This approach showed similar advantages to ours in terms of accelerating learning and enhancing trial yield. However, electrical MFB stimulation is more flexible and easier to implement. First, optogenetic stimulation requires the use of a specific CRE line, making it incompatible with studies in WT or other CRE lines of interest to the researcher. Second, whereas MFB implantation is achieved with one brief surgery, the optogenetic approach requires injecting a viral vector and waiting for expression (2–3 weeks) before finally implanting the optic fiber in a second surgery.
In conclusion, MFB stimulation adds to the toolbox of methods available to train rodents depending on the demands of the scientific question. It is an easy-to-implement alternative to deprivation, and the strong improvement in trial yield that it provides should enable and accelerate novel experiments that require learning fine perceptual measurements, or complex task rules, or that aim at linking single-trial behavior to neural data.
Limitations of the study
Here we explored only a few specific tasks among the many available. Further work would be required to expand the technique to different sensory modalities and task rules. It is likely that MFB will further generalize given that it has already been shown to work well in freely moving animals both with nose poke and with spatial conditioning.4,5,6 Finally, further optimization of MFB stimulation parameters (delay, duration, etc.) could lead to improved results.
STAR★Methods
Key resources table
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Experimental models: Organisms/strains | ||
| Mouse: C57BL/6J, 2 months, male | Janvier Labs | C57BL/6JRj |
| Software and algorithms | ||
| MFBasic | This paper | https://doi.org/10.5281/zenodo.7298882 |
| Other | ||
| Stereotaxic device | Neurostar | Stereodrive 018.117 |
| Isoflurane vaporizer and anesthesia System |
Minerve | TEC 850 - Compact anesthetic module |
| Stereo microscopes | Leica | Leica M80 |
| Skull drill | Meisinger | 1RF HP 330 104 001 001 - 007 |
| Homeothermic system | Stoeling | Rodent Warmer X1 |
| Autoclave | Matachana | Serie 2000 |
| BNC to electrode wire | Custom | See STAR Methods |
| Surgical scissors | Fine Science Tools | 14090-11 |
| Surgical tweezers | Fine Science Tools | 11254-20 |
| Aluminum base plate | Thorlabs | MB2020/M |
| Aluminum tubes for head bar holder | Thorlabs | TR50/M |
| Scalpel handle | Fine Science Tools | 10,003-12 |
| Adjustable arm | RS Online - Pro | Cat#213-0670 |
| Flat ended syringue - 18G | Gima | Cat#23750 with manual sanding after |
| Acquisition card | National Instruments | Thor labs |
| External USB Battery | iPossible | B095K2H48N – Amazon |
| Pulse Pal Stimulator | Sanworks | Cat#1102 |
| 3D printer | Ultimaker | Ultimaker 3 |
| PLA material | Ultimaker | PLA Ultimaker Noir |
| Vetbond | Phymep | Vetbond |
| Superbond | Phymep | Superbond |
| Cement | Lang | Ortho-Jet |
| Bipolar electrode | Plastics One | MS303T/2 |
| Scalpel blade | Fine Science Tools | 10,010-00 |
Resource availability
Lead contact
Further information and requests should be addressed to and will be fulfilled by the lead contact, Sophie Bagur (sophie.bagur@pasteur.fr).
Materials availability
This study did not generate new unique reagents.
Experimental model and subject details
7 to 9-week old C57Bl6 male mice (Janvier Labs) were used. Mice were group-housed in an animal facility (08:00–20:00 light) in cages of 3–5 individuals with access to enrichment (running wheel, cotton bedding and wooden logs).
All experiments were performed in accordance with the French Ethical Committee (Direction Générale de la Recherche et de l'Innovation) and European legislation (2010/63/EU). Procedures were approved by the French Ministry of Education in Research after consultation with the ethical committee #59 (authorization number APAFIS#28757-2020122111048873).
Method details
Surgical procedures
Surgeries were performed in order to equip all mice with a metal head-plate for fixation. Implantations of a bipolar stimulation electrode (60-μm-diameter twisted stainless steel, PlasticsOne) in the medial forebrain bundle were performed for the MFB group. Mice were injected with buprenorphine (Vétergesic, 0,05-0,1 mg/kg) 30min prior to anesthesia induction with isoflurane (3%) and then placed on a thermal blanket during the whole procedure with isoflurane maintained at 1–1.5%. The eyes were protected with Ocrygel (TVM Lab). Lidocaine was injected under the skin of the skull 5 min prior to incision.
The MFB electrode was implanted using stereotaxic coordinates (AP -1.4, ML +1.2, DV +4.8) (Figure S1 for histological verification). The electrode was then fixed along with the headplate to the skull using dental cement (Ortho-Jet, Lang).
After surgery, mice received a subcutaneous injection of glucose (G30) and metacam (1 mg/kg). Mice were subsequently housed for one week with metacam delivered via drinking water or dietgel (ClearH20) without any manipulation. All animals were co-housed before and after surgery without any degradation to the implants.
6 mice were housed in identical conditions to provide a control group for night time activity recordings. These mice were left in their cages during the whole period prior to activity recordings which were performed at the same age as for traine mice.
Behavioral setup
Behavioral experiments were monitored and controlled using homemade software (Elphy, G. Sadoc, UNIC, France for water deprivation and MATLAB homemade GUI for MFB) coupled to a National Instruments card (PCIe-6351). Sounds were amplified (SA1 Stereo power amp, Tucker-Davis Technologies) and delivered through high frequency loudspeakers (MF1-S, Tucker-Davis Technologies) in a pseudo-random sequence. Mice were placed on a 3D printed contention tube with two piezo-electric sensors inserted at the base to allow monitoring of movement.
Water deprivation
Water delivery (5μL/reward) was controlled with a solenoid valve (LVM10R1-6B-1-Q, SMC). A voltage of 5V was applied through an electric circuit joining the lick tube and aluminum foil on which the mouse was sitting, so that lick events could be monitored by measuring the voltage through a series resistor in this circuit.
MFB stimulation
MFB stimulation was delivered via a pulse train generator (PulsePal V2, Sanworks) that produced 2ms biphasic pulses at 50Hz for 100ms. Lever group: The stimulation was delivered when the mouse pressed the lever, raising the opposite end and interrupting an infrared beam (ZHITING, infrared motor speed meter, 3.3V). The lever was weighted to fall back into its initial position as soon as the mouse removed its paw. It was designed using Autodesk Fusion 360 and produced with an Ultimaker-3 3D printer. Licking group: MFB-rewarded mice were using the same setup as water-deprived mice.
Behavioral procedures
Figure S3 provides an overview of the behavioral protocols.
Sounds and task structure
Mice were trained on three different tasks. In each case, sounds of 500ms were presented at 70dB with a sample rate of 192kHz. The identity of Go and No-go sounds was counterbalanced across mice.
Frequency discrimination task (PTvsPT): Discrimination between a pure tone at high frequency (16kHz) and a pure tone at low frequency (6kHz). Frequency psychometrics (PTvsPT PS): Discrimination between a group of 8 high frequency pure tones (10–16kHz) and a group of 8 low frequency pure tones (6–9.5kHz) in which the 16 tones were equally spaced on a logarithmic scale. Pure tone vs frequency-modulated sound (PTvsFM): Discrimination between a pure tone at 6kHz and either a frequency sweep rising from 6 to 16kHz or a 2-step rise consisting of 250ms of 6kHz and 250ms of 16kHz.
MFB stimulation
Mice performed behavior five days per week (Monday to Friday). During the entire behavioral training period, food and water were available ad libitum. Animal weight was monitored daily.
Habituation: During four days, mice were habituated to head fixation by progressively increasing the duration from 30min to 2 h without reward or sound stimulation. Free reward (Lever press): During 1 or 2 days mice received systematic reward when they pressed the lever with a 2s minimal inter reward interval. In order to encourage lever pressing, the experimenter actively shaped the mouse’s behavior for 10–50 trials by placing its paw on the lever with a small plastic stick. During this initial phase, before calibration, stimulation started at 3V and was reduced by the experimenter if it elicited a motor reaction. Free reward (licking): During a short 30 min session, mice received a systematic reward when they licked the spout with a 500ms inter reward interval. In order to encourage licking as the mice are not water-deprived, the spout is placed very close to the mouse’s mouth. During this initial phase, before calibration, stimulation started at 3V and was reduced by the experimenter if it elicited a motor reaction. Go training: Once animals were spontaneously pressing or licking, go training was conducted. Go trials were presented with 80% probability, while the remaining trials were blank trials (no stimulus). A trial consisted of a random inter-trial interval (ITI) between 1 and 2 s to avoid prediction of stimulus appearance, a random ‘no press’/’no lick' period between 3 and 5 s and a fixed response window of 1.5 s. For the lever group, any lever press longer than 50ms was counted as a lever press. For the lick group, a single lick was registered as a response. Lever press/lick during the response window on a Go trial was scored as a ‘hit’ and triggered an immediate MFB stimulation. No lever press/lick was scored as a ‘miss’ and the next trial immediately followed. Calibration: Once reliable report to the sound was established (>80% hits for the Go stimulus), the voltage of the stimulation was systematically reduced from the voltage used during go training down to 0 in 0.5V increments in order to obtain a calibration curve. During all following experiments, the minimal voltage that elicited >90% of maximum pressing/lickins was used. We found that this voltage could be adjusted by 0.5V to either motivate (increase) the mouse during harder tasks or to slow unsolicited response (decrease) and that this could facilitate learning. Calibration was performed using the Go sound from the PTvsPT protocol since this was learnt first. For mice using licking report, this calibration can also be performed before Go sound training by simply measuring the number of spontaneous licks at different voltage levels. Go/No-go training: After Go training, the second sound (No-go) was introduced. During presentation of the No-go sound, the absence of lever press/lick was scored as a ‘correct rejection’ (CR) and the next trial immediately followed. Any report during No-go trials was scored as a ‘false alarm’ (FA), no stimulation was given, and the animal was punished with a random time-out period between 5 and 7 s. Each session contained 50% probability for each trial type. Psychometric measurement: Once mice reached 80% accuracy on the PTvsPT task, the number of sounds increased to 16 (8 high frequency sounds and 8 low frequency sounds) in order to establish psychometric curves for frequency discrimination (PTvsPT - PS task).
During all phases of the protocol, the session duration was 1h30-3h and was only limited by time constraints on the use of the apparatus.
Criteria defining mice with functional MFB stimulation (licking): Given that mice rapidly and spontaneously licked to receive MFB stimulation we evaluated implantation success during the “Free reward” phase (Figure S2B). Mice were counted as positively responding if within a maximum of two 30min sessions they reached licking levels >0.25licks/s.
Criteria defining mice with functional MFB stimulation (lever pressing): Given that lever pressing is a complex motor behavior that takes a few days to master and that conditioning can be confused with a simple increase in movement during which the mouse randomly activates the lever, we evaluated implantation success during the “Go training” phase (Figure S2F). Mice were counted as positively responding if they reached >80% accuracy in this training phase within a maximum of 10 days. Mice that showed no progress during more than 5 days were counted as non-functional without performing the full 10 days of training.
Impact of inter-trial interval: We measured whether MFB-rewarded mice could maintain their performance at high levels even with a short inter-trial interval. We performed in succession on the same day 4 bouts of 200 Go/No-go trials, decreasing the average total inter-trial interval (random ITI + random no press window) from an average of 5s down to 0.5s.
Maximal number of trials measurement: To provide a benchmark for the number of trials obtainable in a single session we performed on the PTvsPT task a 4 h session with a 2s ITI. Mice were checked at regular intervals during the session for signs of stress and offered water after 2h.
Coton interaction assay: Before beginning training on the task, the efficacy of MFB stimulation can be rapidly tested using an object interaction assay in order to eliminate non-responding mice. We performed stimulation at 3V. We performed this in freely moving mice in a neutral cage with a cotton swab. After an initial 5 min habituation phase, mice undergo 5 min conditioning during which when they touch the swab they receive stimulation (Figure S2G). During conditioning the preference is generally obvious by eye (Video S1). To quantify the impact of the stimulation we manually timed the time spent interacting with the swab during the habituation and stimulation phase (Figures S2H and S2I). Mice which fail to increase their interaction time with the swab can be tested at a higher voltage (we have tried up to 6V with no adverse reactions) and if no clear preference is manifest then the stimulation is counted as non-efficient.
Water deprivation
Mice performed behavior five days per week (Monday to Friday). Water deprivation began after habituation to head fixation. Before starting the training procedure, mice were water restricted for two consecutive days. Afterward, water restriction (0.8mL/day) was interleaved with a 16 h ad libitum supply overnight every Friday. During the entire behavioral training period, food was available ad libitum. Animal weight was monitored daily.
Habituation: Prior to deprivation, during four days, mice were habituated to head fixation by progressively increasing the duration from 30min to 2 h without reward or sound stimulation. Free lick: During the first day of training, mice received water by licking the lick port without any sound. Go training: After this period, Go training was conducted where Go trials were presented with 80% probability, while the remaining trials were blank trials (no stimulus). A trial consisted of a random inter-trial interval (ITI) between 3 and 0 s to avoid prediction of stimulus appearance, a random ‘no lick’ period between 1.5 and 3.5 s and a fixed response window of 1.5s. Licking during the response window on a Go trial above lick threshold (3–5 consecutive licks) was scored as a ‘hit’ and triggered immediate water delivery. Licking below the threshold was scored as a ‘miss’ and the next trial immediately followed. At the beginning of each session, 10 trials with ‘free rewards’ were given independently of the licking to motivate the mouse. Go/No-go training: When animals reached more than 80% ‘hits’ for the Go stimulus, the second sound (No-go) was introduced, and the lick count threshold was set between 5 and 8 licks (this parameter was adjusted at the beginning of training for each mouse). During presentation of the No-go sound, licking below threshold was scored as a ‘correct rejection’ (CR) and the next trial immediately followed, licking above threshold on No-go trials was scored as a ‘false alarm’ (FA), no water reward was given, and the animal was punished with a random time out period between 5 and 7 s. Each session contained 40% probability for the Go and No-go stimuli and 20% blank trials. Psychometric measurement: Once mice reached 80% accuracy on the PTvsPT task, the number of sounds increased to 16 (8 high frequency sounds and 8 low frequency sounds) in order to establish psychometric curves for frequency discrimination (PTvsPT - PS task).
During all phases of the protocol, the session was continued until animals had received a maximum of 0.8mL or until they ceased licking in response to sound. This occurs within 300–600 trials. Throughout the protocol, we performed 320 trials per day except toward the end of training on Thursday or Friday when mice sometimes performed up to 640 trials. Mice that did not attain 0.8mL during the session received the missing amount directly in their cage.
Open field
Before the start of learning and then again after 3 months of training, mice were placed at the center of a brightly lit 50cmx50xm open field for 3 min. The surface and walls were cleaned between each mouse. The animal’s position was tracked using an online homemade tracking program (MATLAB, Mathworks).
Home cage activity
In order to measure mice’s activity level in their homecage, we monitored the turning of a running wheel from 8 pm to 8 am, the dark period which was undisturbed by behavioral experiments. The measurement was made by counting infrared beam breaks due to the passage of an opaque section of the wheel. We counted as “active” any period of 5s during which the wheel made one full turn.
Histology: In order to extract the brain for histology, mice were deeply anesthetized using a ketamine-medetomidine mixture. Once all reflexes had disappeared we performed an electrolytic lesion using a 20s of continuous 5V stimulation. Mice were then perfused intracardially with 4% buffered paraformaldehyde fixative. The brains were carefully dissected and left in paraformaldehyde overnight and then sliced into 100um sections using a vibratome.
Quantification and statistical analysis
Learning and performance analysis
Learning curves were obtained by calculating the fraction of correct responses over blocks of 200 trials. Accuracy over one session was calculated as (hits + correct rejections)/total trials. The learning rate is quantified by the number of training days to reach accuracy greater than or equal to 80%. Note that for all evaluations of learning time we only took into account valid trials left after excluding all periods during which the mouse failed to respond for more than 15 trials.
Response time analysis
Response times are defined as the time to first lever press/first lick. Discrimination time was defined as the earliest 10ms time bin at which a significant difference between lever pressing/licking signal for Go and No-go trials was found. We used the non-parametric Wilcoxon rank-sum test to obtain the p value for the null hypothesis that the Go and No-go distributions were identical for each time bin and then applied the Benjamini-Hochberg correction for multiple testing over all time bins. In order to avoid biassing statistical significance due to differing number of trials, we performed multiple random subsampling of datasets to restrict to 100 trials and took the mean reaction time over all shuffles.
Piezoelectric signal
The acquired piezoelectric signal was filtered to remove 50Hz contamination. Changes in the animal’s position within the tube can create baseline shifts in the signal. We therefore measured the amount of movement using the absolute value of the first-order temporal derivative. The absolute values of this signal cannot be compared between MFB-rewarded and water-deprived mice since water-deprived mice were placed on aluminum foil in order to detect licking and the deformations of the foil attenuate the values readout by the sensor.
Statistics
For each statistical analysis provided in the manuscript, the Kolmogorov–Smirnov normality test was first performed on the data. If the data failed to meet the normality criterion, statistics relied on non-parametric tests. We therefore represent the median and quartiles of data in boxplots in all figures, in accordance with the use of non-parametric tests. Ranksum and signed rank: we report the signed rank statistic if the number of replicates is too weak to provide the normal Z statistic.
Acknowledgments
We thank Léa Vinel for assistance with behavioral experiments.
This work was funded by the European Research Council (ERC CoG 770841), the Fondation Pour l'Audition (FPA IDA02), and S.B. was funded by the Fondation pour la Recherche Médicale (SPF202005011970, FRM). This project has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No 964568 (Hearlight). We acknowledge the support of the Fondation Pour l'Audition to the Institut de l’Audition.
Author contributions
S.B. contributed the original idea of the project and designed research with A.V. and B.B. S.B. and A.V. performed experiments and analyzed the data. Mouse training was co-supervised by S.B. and A.V. and performed by N.D. and D.G. A.A. designed the lever-pressing detection and helped with experiments. S.B. and A.V. analyzed the data and wrote the paper with comments from B.B. B.B. provided funding and equipment for task implementation.
Declaration of interests
None of the authors have any competing interests to declare.
Published: December 2, 2022
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.crmeth.2022.100355.
Supplemental information
Data and code availability
-
•
Data: All data in this paper will be made available by the lead contact upon request.
-
•
Code: In order to facilitate other laboratories trying out the new methodology, we have made available on a Github PAGE (https://github.com/AntoninVerdier/MFBasic) the 3D printer files for contention tube and lever, simplified Arduino code to implement a minimalist Go/No-go task with MFB stimulation and a detailed description of all the hardware needed and how to set it up. An archival DOI is listed in the key resources table.
-
•
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.
References
- 1.Carandini M., Churchland A.K. Probing perceptual decisions in rodents. Nat. Neurosci. 2013;16:824–831. doi: 10.1038/nn.3410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Olds J., Milner P. Positive reinforcement produced by electrical stimulation of septal area and other regions of rat brain. J. Comp. Physiol. Psychol. 1954;47:419–427. doi: 10.1037/h0058775. [DOI] [PubMed] [Google Scholar]
- 3.Olds M.E., Olds J. Approach-avoidance analysis of rat diencephalon. J. Comp. Neurol. 1963;120:259–295. doi: 10.1002/cne.901200206. [DOI] [PubMed] [Google Scholar]
- 4.Naudé J., Tolu S., Dongelmans M., Torquet N., Valverde S., Rodriguez G., Pons S., Maskos U., Mourot A., Marti F., Faure P. Nicotinic receptors in the ventral tegmental area promote uncertainty-seeking. Nat. Neurosci. 2016;19:471–478. doi: 10.1038/nn.4223. [DOI] [PubMed] [Google Scholar]
- 5.de Lavilléon G., Lacroix M.M., Rondi-Reig L., Benchenane K. Explicit memory creation during sleep demonstrates a causal role of place cells in navigation. Nat. Neurosci. 2015;18:493–495. doi: 10.1038/nn.3970. [DOI] [PubMed] [Google Scholar]
- 6.Kobayashi T., Nishijo H., Fukuda M., Bures J., Ono T. Task-dependent representations in rat hippocampal place neurons. J. Neurophysiol. 1997;78:597–613. doi: 10.1152/jn.1997.78.2.597. [DOI] [PubMed] [Google Scholar]
- 7.Negus S.S., Miller L.L. Intracranial self-stimulation to evaluate abuse potential of drugs. Pharmacol. Rev. 2014;66:869–917. doi: 10.1124/pr.112.007419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Peters A.J., Chen S.X., Komiyama T. Emergence of reproducible spatiotemporal activity during motor learning. Nature. 2014;510:263–267. doi: 10.1038/nature13235. [DOI] [PubMed] [Google Scholar]
- 9.Ceballo S., Piwkowska Z., Bourg J., Daret A., Bathellier B. Targeted cortical manipulation of auditory perception. Neuron. 2019;104:1168–1179.e5. doi: 10.1016/j.neuron.2019.09.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Burgess C.P., Lak A., Steinmetz N.A., Zatka-Haas P., Bai Reddy C., Jacobs E.A.K., Linden J.F., Paton J.J., Ranson A., Schröder S., et al. High-yield methods for accurate two-alternative visual psychophysics in head-fixed mice. Cell Rep. 2017;20:2513–2524. doi: 10.1016/j.celrep.2017.08.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Aguillon-Rodriguez V., Angelaki D.E., Bayer H.M., Bonacchi N., Carandini M., Cazettes F., et al. The International Brain Laboratory Standardized and reproducible measurement of decision-making in mice. Neuroscience. 2021;10:e63711. doi: 10.1101/2020.01.17.909838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.The International Brain Laboratory. Urai A.E., Aguillon-Rodriguez V., Laranjeira I.C., Cazettes F., Mainen Z.F., Churchland A.K. Citric acid water as an alternative to water restriction for high-yield mouse behavior. eNeuro. 2021;8 doi: 10.1523/ENEURO.0230-20.2020. ENEURO.0230-20.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Jaramillo S., Zador A.M. Mice and rats achieve similar levels of performance in an adaptive decision-making task. Front Syst Neurosci. 2014;8 doi: 10.3389/fnsys.2014.00173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Guo Z.V., Hires S.A., Li N., O’Connor D.H., Komiyama T., Ophir E., Huber D., Bonardi C., Morandell K., Gutnisky D., et al. Procedures for behavioral experiments in head-fixed mice. PLoS One. 2014;9 doi: 10.1371/journal.pone.0088678. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Goltstein P.M., Reinert S., Glas A., Bonhoeffer T., Hübener M. Food and water restriction lead to differential learning behaviors in a head-fixed two-choice visual discrimination task for mice. PLoS One. 2018;13 doi: 10.1371/journal.pone.0204066. e0204066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Berditchevskaia A., Cazé R.D., Schultz S.R. Performance in a GO/NOGO perceptual task reflects a balance between impulsive and instrumental components of behaviour. Sci. Rep. 2016;6:27389. doi: 10.1038/srep27389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Poort J., Khan A.G., Pachitariu M., Nemri A., Orsolic I., Krupic J., et al. Learning Enhances Sensory and Multiple Non-sensory Representations in Primary Visual Cortex. Neuron. 2015;86:1478–1490. doi: 10.1016/j.neuron.2015.05.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Birtalan E., Bánhidi A., Sanders J.I., Balázsfi D., Hangya B. Efficient training of mice on the 5-choice serial reaction time task in an automated rodent training system. Sci Rep. 2021;10:22362. doi: 10.1038/s41598-020-79290-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Dent C.L., Isles A.R. An overview of measuring impulsive behavior in mice. Curr. Protoc. Mouse Biol. 2014;4:35–45. doi: 10.1002/9780470942390.mo140015. [DOI] [PubMed] [Google Scholar]
- 20.Shoji H., Takao K., Hattori S., Miyakawa T. Age-related changes in behavior in C57BL/6J mice from young adulthood to middle age. Mol. Brain. 2016;9:11. doi: 10.1186/s13041-016-0191-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Valek L., Tegeder I. Failure of diphtheria toxin model to induce Parkinson-like behavior in mice. Int. J. Mol. Sci. 2021;22:9496. doi: 10.3390/ijms22179496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Juczewski K., Koussa J.A., Kesner A.J., Lee J.O., Lovinger D.M. Stress and behavioral correlates in the head-fixed method: stress measurements, habituation dynamics, locomotion, and motor-skill learning in mice. Sci. Rep. 2020;10:12245. doi: 10.1038/s41598-020-69132-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Niv Y., Daw N.D., Joel D., Dayan P. Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology. 2007;191:507–520. doi: 10.1007/s00213-006-0502-4. [DOI] [PubMed] [Google Scholar]
- 24.You W.-K., Mysore S.P. Dynamics of visual perceptual decision-making in freely behaving mice. Neuroscience. 2020;9 doi: 10.1101/2020.02.20.958652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Allen W.E., Chen M.Z., Pichamoorthy N., Tien R.H., Pachitariu M., Luo L., Deisseroth K. Thirst regulates motivated behavior through modulation of brainwide neural population dynamics. Science. 2019;364:253. doi: 10.1126/science.aav3932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Bekkevold C.M., Robertson K.L., Reinhard M.K., Battles A.H., Rowland N.E. Dehydration parameters and standards for laboratory mice. J. Am. Assoc. Lab. Anim. Sci. 2013;52:233–239. [PMC free article] [PubMed] [Google Scholar]
- 27.Padamsey Z., Katsanevaki D., Dupuy N., Rochefort N.L. Neocortex saves energy by reducing coding precision during food scarcity. Neuron. 2022;110:280–296.e10. doi: 10.1016/j.neuron.2021.10.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Michael A.C., Ikeda M., Justice J.B. Mechanisms contributing to the recovery of striatal releasable dopamine following MFB stimulation. Brain Res. 1987;421:325–335. doi: 10.1016/0006-8993(87)91302-3. [DOI] [PubMed] [Google Scholar]
- 29.Gallistel C.R., Karras D. Pimozide and amphetamine have opposing effects on the reward summation function. Pharmacol. Biochem. Behav. 1984;20:73–77. doi: 10.1016/0091-3057(84)90104-7. [DOI] [PubMed] [Google Scholar]
- 30.Huston J.P., Borbély A.A. Operant conditioning in forebrain ablated rats by use of rewarding hypothalamic stimulation. Brain Res. 1973;50:467–472. doi: 10.1016/0006-8993(73)90753-1. [DOI] [PubMed] [Google Scholar]
- 31.Nieuwenhuys R., Geeraedts L.M., Veening J.G. The medial forebrain bundle of the rat. I. General introduction. J. Comp. Neurol. 1982;206:49–81. doi: 10.1002/cne.902060106. [DOI] [PubMed] [Google Scholar]
- 32.Trujillo-Pisanty I., Conover K., Solis P., Palacios D., Shizgal P. Dopamine neurons do not constitute an obligatory stage in the final common path for the evaluation and pursuit of brain stimulation reward. PLoS One. 2020;15 doi: 10.1371/journal.pone.0226722. e0226722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ikemoto S. Brain reward circuitry beyond the mesolimbic dopamine system: a neurobiological theory. Neurosci. Biobehav. Rev. 2010;35:129–150. doi: 10.1016/j.neubiorev.2010.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Pallikaras V., Shizgal P. The convergence model of brain reward circuitry: implications for relief of treatment-resistant depression by deep-brain stimulation of the medial forebrain bundle. Front. Behav. Neurosci. 2022;16 doi: 10.3389/fnbeh.2022.851067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Shizgal P., Conover K. On the neural computation of utility. Curr. Dir. Psychol. Sci. 1996;5:37–43. doi: 10.1111/1467-8721.ep10772715. [DOI] [Google Scholar]
- 36.Endröczi E., Lissák K. Behavioral reactions evoked by electrical stimulation in the medial forebrain bundle region. Physiol. Behav. 1966;1:223–227. doi: 10.1016/0031-9384(66)90008-4. [DOI] [Google Scholar]
- 37.Routtenberg A., Lindy J. Effects of the availability of rewarding septal and hypothalamic stimulation on bar pressing for food under conditions of deprivation. J. Comp. Physiol. Psychol. 1965;60:158–161. doi: 10.1037/h0022365. [DOI] [PubMed] [Google Scholar]
- 38.Carlisle H.J., Snyder E. The interaction of hypothalamic self-stimulation and temperature regulation. Experientia. 1970;26:1092–1093. doi: 10.1007/BF02112693. [DOI] [PubMed] [Google Scholar]
- 39.Conover K.L., Shizgal P. Competition and summation between rewarding effects of sucrose and lateral hypothalamic stimulation in the rat. Behav. Neurosci. 1994;108:537–548. doi: 10.1037/0735-7044.108.3.537. [DOI] [PubMed] [Google Scholar]
- 40.Kilpatrick M.R., Rooney M.B., Michael D.J., Wightman R.M. Extracellular dopamine dynamics in rat caudate–putamen during experimenter-delivered and intracranial self-stimulation. Neuroscience. 2000;96:697–706. doi: 10.1016/S0306-4522(99)00578-3. [DOI] [PubMed] [Google Scholar]
- 41.Garris P.A., Kilpatrick M., Bunin M.A., Michael D., Walker Q.D., Wightman R.M. Dissociation of dopamine release in the nucleus accumbens from intracranial self-stimulation. Nature. 1999;398:67–69. doi: 10.1038/18019. [DOI] [PubMed] [Google Scholar]
- 42.Schultz W. Predictive reward signal of dopamine neurons. J. Neurophysiol. 1998;80:1–27. doi: 10.1152/jn.1998.80.1.1. [DOI] [PubMed] [Google Scholar]
- 43.Ono T., Nakamura K., Nishijo H., Fukuda M. Hypothalamic neuron involvement in integration of reward, aversion, and cue signals. J. Neurophysiol. 1986;56:63–79. doi: 10.1152/jn.1986.56.1.63. [DOI] [PubMed] [Google Scholar]
- 44.Fulton S., Woodside B., Shizgal P. Modulation of brain reward circuitry by leptin. Science. 2000;287:125–128. doi: 10.1126/science.287.5450.125. [DOI] [PubMed] [Google Scholar]
- 45.Gallistel C.R. Motivating effects in self-stimulation. J. Comp. Physiol. Psychol. 1966;62:95–101. doi: 10.1037/h0023472. [DOI] [PubMed] [Google Scholar]
- 46.Huguet G., Kádár E., Serrano N., Tapias-Espinosa C., García-Brito S., Morgado-Bernal I., Aldavert-Vera L., Segura-Torres P. Rewarding deep brain stimulation at the medial forebrain bundle favours avoidance conditioned response in a remote memory test, hinders extinction and increases neurogenesis. Behav. Brain Res. 2020;378:112308. doi: 10.1016/j.bbr.2019.112308. [DOI] [PubMed] [Google Scholar]
- 47.Reinagel P. Training rats using water rewards without water restriction. Front. Behav. Neurosci. 2018;12:84. doi: 10.3389/fnbeh.2018.00084. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
-
•
Data: All data in this paper will be made available by the lead contact upon request.
-
•
Code: In order to facilitate other laboratories trying out the new methodology, we have made available on a Github PAGE (https://github.com/AntoninVerdier/MFBasic) the 3D printer files for contention tube and lever, simplified Arduino code to implement a minimalist Go/No-go task with MFB stimulation and a detailed description of all the hardware needed and how to set it up. An archival DOI is listed in the key resources table.
-
•
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.






