Abstract
Working memory (WM) is a critical cognitive function allowing recent information to be temporarily held in mind to inform future action. This process depends on coordination between key subregions in prefrontal cortex (PFC) and other connected brain areas. However, few studies have examined the degree of functional specialization between these subregions throughout the phases of WM using electrophysiological recordings in freely-moving animals, particularly mice. To this end, we recorded single-units in three neighboring medial PFC (mPFC) subregions in mouse – supplementary motor area (MOs), dorsomedial PFC (dmPFC), and ventromedial (vmPFC) – during a freely-behaving non-match-to-position WM task. We found divergent patterns of task-related activity across the phases of WM. The MOs is most active around task phase transitions and encodes the starting sample location most selectively. Dorsomedial PFC contains a more stable population code, including persistent sample-location-specific firing during a five second delay period. Finally, the vmPFC responds most strongly to reward-related information during the choice phase. Our results reveal anatomically and temporally segregated computation of WM task information in mPFC and motivate more precise consideration of the dynamic neural activity required for WM.
Introduction
Working memory (WM) is a fundamental cognitive function allowing prior sensorimotor and rule information to be held in mind, manipulated, and protected from interference for future use1. This process relies on a distributed hierarchy of brain networks performing varying degrees of top-down and bottom-up operations2,3. The prefrontal cortex (PFC) is positioned at the top of this hierarchy, exerting the highest-order influence over WM via extensive reciprocal connections with cortical4 and subcortical structures5. It is thought to be responsible for orchestrating several key aspects of WM, including actively directing and maintaining attention toward salient features of a context, selecting strategies to accomplish goals based on contextual needs, and monitoring the outcome of enacted motor plans to change strategies if necessary6,7. Difficulty with any of these functions is a prevalent symptom across many human neurological and psychiatric disorders, and usually coincides with aberrant activation of PFC-containing networks during WM8,9. A deeper examination of PFC activity throughout the different phases of WM is critical to better understanding the potential mechanisms underlying diverse types of WM dysfunction.
Over the past decade, mice have become a standard model organism in PFC research due to rapid development of genetic tools permitting more precise targeting of cell-types and brain-wide circuits10. Extensive connectomic and genomic mapping has established that mouse PFC can be divided into subregions based on local and long-range projection patterns11,12 and cytoarchitecture13,14. However, attempts at segregating mouse PFC into subregions based on functional processing of WM task features has yielded surprisingly inconsistent findings15. A potential explanation for this variability comes from recent work showing that neural activity subserving goal-directed actions is spread across many brain areas, and the primary locus of control can shift dynamically depending on contextual needs and temporal progress through a task16–22. Thus, trying to localize multi-faceted mental processes, like WM, onto isolated mouse PFC subregions may be an unreliable approach. Instead, experiments in mice should focus on characterizing a range of WM-related computations in multiple PFC subregions over the entire time course of a single behavioral paradigm15.
Most mouse studies probing PFC neural circuit contributions to spatial WM have concentrated on single subregions within the same task23–26, and reports using electrophysiological recordings from multiple subregions are not accompanied by a detailed comparison between them19,27–29. Moreover, most multi-regional WM data have been collected from head-fixed mice30–33. Critically, more comprehensive analysis describing how distinct PFC areas selectively contribute to WM task variables across time in freely-moving mice is needed, as head-fixed experiments may engage different brain networks than more naturalistic behaviors34,35. To this end, we recorded single-units in three adjacent mouse PFC subregions agreeing with modern PFC parcellation schemes15: the supplementary motor cortex (MOs), the dorsomedial PFC (dmPFC), and the ventromedial PFC (vmPFC). Activity in these subpopulations was tracked in real time as the mice performed a freely-behaving delayed-non-match-to-position WM task 36. We asked how each subregion represented the retrospective sample location and other task-related information such as prospective choices and reward across the encoding, maintenance, retrieval, and outcome phases of WM37. Our results indicate that the dmPFC population stably codes for the retrospective sample location throughout all phases in single behavioral trials, while MOs prioritizes contextual transitions and vmPFC is most sensitive to choice and outcome variables.
Results
Behavior and electrode implantation
To examine the relationship between spatial WM and PFC subregional activity, mice were water restricted to ~90% of their initial body weight and trained on a freely-moving, delayed-non-match-to-position (DNMTP) WM task (Fig. 1A). Critically, this task was designed so that mice could not know the exact choice port they would need to visit until the end of the delay period. During training, mice that made the correct non-match choice on ≥70% of trials over three consecutive days were implanted with a custom-built, 28-wire, advanceable bundle of microelectrodes into one of three separate PFC subregions (see Methods for details): the supplementary motor area (MOs, blue, n = 4), dorsomedial PFC (dmPFC, green n = 6), or ventromedial PFC (vmPFC, pink, n = 6) (Fig. 1B). We gathered at least three daily sessions of simultaneous behavioral and electrophysiological data per mouse, advancing the electrodes ventrally into the PFC by ~60 μm after each session, so that new neurons were recorded the following day (white dots in Fig. 1B depict the final electrode bundle locations for each mouse). Single-units were isolated offline using Kilosort3, aligned to important DNMTP task events, and organized into pseudopopulations by combining the neurons recorded over all sessions within each subregion. The total neuron counts for each pseudopopulation were 304 in the MOs, 354 in the dmPFC, and 330 in the vmPFC. A one-way ANOVA revealed no significant main effect of implant location on DNMTP performance (F(2,70) = 2.71, p = 0.073, Fig. 1C).
MOs is more active and has more sample port-selective neurons during the sample/encoding phase of WM
We tracked subregional neural firing activity throughout all key phases of the DNMTP task. Each pseudopopulation contained neurons exhibiting task-related changes in firing rate (FR) that consistently appeared within a brief time window around the sample phase nose poke across many trials (example neuron in Fig. 2A). Heat maps of the FR were created by first Z-scoring the spike counts in 100 ms bins for all neurons in separate pseudopopulations, followed by sorting them from highest to lowest peak Z-scored activity around the sample nose poke (Fig. 2B). Quantification of mean Z-scored pseudopopulation FR at each time point revealed that the MOs significantly increased its firing rate compared to the other two implant locations in a 200-300 millisecond window before the sample poke (Fig. 2C, for details on statistics see Methods). There were no significant differences between dmPFC and vmPFC at any time points. We also quantified the percentage of neurons in each pseudopopulation that either increased or decreased their firing rate by 0.5 Z-units around the sample (Fig. 2D). In correspondence with the above findings, the MOs had the largest proportions of both increasing and decreasing units compared to the other regions around the sample poke, while vmPFC contained the lowest. In both figure panels, and throughout the rest of the paper, time bins with significant pairwise differences (p < .05, see Methods) between two mPFC subregions can be visualized in the figures as straight lines containing their two respective colors.
We next looked at how selective the pseudopopulations were for the sample port location in the same period around the sample poke (examples of selective MOs neurons in Fig. 2E). Using a permutation testing method (see Methods for details), we found that the MOs also contained the most neurons that could significantly differentiate sample port location based on their FR (Fig. 2F). This was followed by dmPFC and lastly by the vmPFC. These results suggest a functional gradient, which is strongest in MOs and weakest in vmPFC, in the extent to which these different subregions encode not only the beginning of the sample phase but also the sample port location. The stability of sample location selectivity was subsequently measured using cross-temporal linear support vector machine (SVM) decoding analysis (Fig. 2G, see Methods for details). In the MOs, the predictive ability of the decoder was nearly 100% effective within 100-200 ms of the training time bin, but this effectiveness rapidly decreased to chance levels in a few hundred milliseconds. In contrast, models trained on timepoints 0.5s before the sample poke were able to predict sample location for up to one second after the sample poke with ≥70% accuracy in both the dmPFC and vmPFC, but not in the MOs, indicating a more temporally stable population code in the former two regions.
Retrospective sample information is stably maintained in the dmPFC throughout the entire five second delay
The approaches taken above were next applied to the delay phase of the task. MOs Z-scored population activity around the delay poke was elevated above the other subregions to a degree comparable to the sample phase (Fig. 3C). This poke-related difference in pseudopopulation FR between subregions did not persist into the five second delay holding period. Similar to its activity in the sample phase, the vmPFC contained fewer neurons appreciably changing their activity throughout the delay (Fig. 3D). Combined with the observation that the vmPFC also had a notably small number of selective neurons (Fig. 3F), we conclude that this subregion plays a minimal role during the delay period in our task.
The most striking finding during the delay phase was a time-dependent transition in the subregion that encoded the retrospective sample location most prominently. For the 200 ms around when the mice poked the back delay port, the MOs had the highest proportion of selective neurons by a small but significant margin over the dmPFC. Slightly less than one second into the delay hold, the dmPFC became the only subregion to contain any neurons with retrospective sample port selectivity throughout the rest of the holding period (Fig. 3E,F). This pattern of sample selectivity throughout the delay holding period suggests that there may be a subset of dmPFC neurons that maintain a persistently higher firing rate on either left or right sample trials. However, alternative WM mechanisms have been theorized which rely on more dynamic representations involving chains of multiple different neurons becoming transiently selective at different points in time, leading to an unstable population code38. Using the same cross-temporal SVM analysis as before, we were able to infer that the pattern of activity was likely due to a persistent mechanism. Decoding accuracy remained above 70% from the beginning to the end of the delay holding period in the dmPFC suggesting that a stable population code was present during this time (Fig. 3G). Conversely, the MOs and vmPFC decoded at chance levels throughout most of the delay.
Retrospective sample port information is most strongly encoded by the dmPFC before and after the choice poke
Reminiscent of nose pokes in the previous phases, the MOs was again the most active subregion directly around the choice poke (Fig. 4C,D). We also recorded a large amount of activity starting ~1.5 seconds before the choice poke which appeared to be highest in the vmPFC, although this difference was not statistically significant in this time frame (Fig. 4C). Contrary to the previous two task phases, the selectivity in this pre-poke period and the period directly around the poke was most represented in the dmPFC instead of the MOs (Fig. 4E,F). This significance continued in dmPFC until about one second after the choice, once again implying that the dmPFC stably represents the location of the sample port that was visited earlier in the trial. Consistent with this idea, the cross-temporal SVM uncovered a more stable code both before and after the choice poke in dmPFC compared to other areas (Fig. 4G).
MOs most strongly differentiates task phase
Although selective coding of retrospective sample port identity is necessary for successful DNMTP performance, it is not the only task parameter that mice must constantly monitor to make the correct non-match choice and maximize reward. A mouse must also be aware of several other aspects of the task including tracking which phase of the task it is in so that the correct motor strategy can be enacted at specific points in time, upcoming choice port location and the location of the choice it eventually makes, and what the outcome of the choice was (correct or incorrect) so that the mouse can update its strategy on the next trial if necessary. Any or all of these factors may interact at any given point throughout the time course of a single trial, so we wanted to analyze each factor with respect to the other factors.
For this, we used a general linear model (GLM) to predict how much the firing rate of each neuron in 200 ms bins around pokes depended on the four following predictor variables: poke context (sample, delay, or choice poke), sample port location (left or right), choice port location (left, center, or right), and outcome (correct or incorrect). Shuffled coefficient of partial determination analysis (CPD, see Methods for details) unveiled the percent of total firing rate variability that was explained by each of these variables. For this first GLM analysis, we were mainly interested in how closely the PFC monitored task phase information around pokes. The task phase the mouse was transitioning into around a poke accounted for significantly more total neural variability in the MOs as compared to dmPFC and vmPFC (Fig. 5A). The dmPFC furthermore encoded this variable to a larger degree than vmPFC. Poke context accounted for an overwhelming percentage of all explainable variability in each region (Fig. 5B), confirming that information about task progression is a crucial part of PFC computation regardless of the subregion. In lieu of these results, we assessed how well the population activity in separate subregions could decode the poke context around poke events. We implemented a similar SVM approach as the one described in previous sections, but we applied a multiclass coding scheme instead of a binary one since the model now had three possible choices (sample, delay, or choice) to differentiate. This method was able to decode poke context with 100% accuracy in all subregions (Fig. 5C).
GLM exposes distinct subregional processing of different task variables
In order to investigate the contribution of DNMTP task variables over the same wider time frame used in the first several figures, we needed to remove the poke context task variable due to its dependence on a prohibitively small window around pokes. Therefore, three predictor variables remained for this GLM over time: sample port location, choice port location, and outcome. With a similar CPD approach as above, we calculated the percent of neurons in each region that significantly (according to a shuffled CPD control analysis, see Methods for details) encoded the specified task variables in each 200 ms bin. This produced analogous results to our retrospective sample location selectivity investigation using permutation testing in Figs. 2, 3, and 4, further strengthening these findings (Fig. 6A). The time-based GLM also established that neither upcoming choice port identity nor trial outcome were encoded in the sample or delay phases of the task (Fig. 6B,C). Importantly, we only observed a considerable number of choice-port-selective neurons around the choice poke itself, with the vmPFC displaying the largest percentage (Fig. 6B). Likewise, only after the choice was made were we able to find significant subregional differences in the number of neurons encoding outcome (Fig. 6C). This was a compelling affirmation that the mice were not choosing an incorrect prospective motor plan or specific choice location before they needed to make the actual choice. Surprisingly, the MOs was the region with the most neurons encoding the outcome variable, possibly due to motor activity changing drastically on correct vs incorrect trials as mice were either consuming a water reward by licking (correct), or removing their nose from the port when no water was dispensed (incorrect).
GLM beta weights for retrospective sample location selectivity confirm its representational stability throughout trials
Since sample location was the only task feature significantly encoded by all subregions throughout all phases of WM, we wanted to take a closer look at the temporal dynamics of this information in each area. We reasoned that knowledge about the way in which retrospective representations evolve is critical to fully understand the circuit mechanisms underlying WM, as most definitions of WM rely on the ease with which animals can maintain previously encoded information over time. A recurring theme in our data is that this sample location information seems to be strongly reactivated very close to poke events. We therefore investigated the extent to which the sample selectivity of neurons in each region was reactivated around these pokes. The percentages of neural selectivity that was significant across multiple pokes was quantified in the Venn diagrams in Fig. 7A MOs and dmPFC both had a large number of neurons selective for the sample location across multiple task phases. Interestingly, there were few vmPFC neurons that were sample-selective over multiple pokes.
The maximum GLM beta weights from these selective neurons were identified around pokes (600 ms before and after poke). We sorted these beta weights by amplitude for better visualization, and plotted examples of these histograms for MOs and vmPFC in Fig. 7B. In these examples, negative beta weights represented a neuron with left sample selectivity, while positive signified right selectivity, and higher amplitude meant the neuron had a higher average difference in firing rate between left and right trials. To check temporal stability of these beta weights, we ran simple Pearson correlations between beta weights from significantly selective neurons at two different time points - for example, sample poke (light bars) versus delay poke (dark bars) as shown in Fig. 7B. The r values from these correlations are graphed as bars in Fig. 7C for every subregion across every time-based comparison. Briefly, we found that the sample-selective subpopulations of MOs and dmPFC neurons have similar beta weight distributions when comparing sample and delay pokes, while the vmPFC subpopulation is negatively correlated. The dmPFC also had similar selectivity across all adjacent time comparisons, adding more evidence that it is uses the most stable coding of all three subregions. Interestingly, none of the areas were stable from sample to choice pokes, which potentially arises from the fact that these events are separated in time by a cognitively demanding delay phase, which may largely reorganize neural activity.
vmPFC Z-scored population activity displays the largest change in reward outcome-related firing rate
Up to this point, we had not detected any substantial contributions from the vmPFC to DNMTP WM performance. However, this region is known to have the densest reciprocal connections with the ventral tegmental area and amygdala out of all PFC subareas39,40, hinting at potential involvement in more valence or reward-based information processing. As a result, we subtracted the mean Z-scored firing rate of all recorded neurons on correct trials from that on incorrect trials (subsampling for the lower count of incorrect trials), which gave rise to more pronounced differences in pre-choice and post-outcome population activity in the vmPFC compared to MOs and dmPFC (Fig. 8A, positive values indicate higher FR on incorrect trials). Not surprisingly, the vmPFC was also different relative to MOs and dmPFC in terms of the percentages of neurons responding by either increasing or decreasing their firing rate based on the reward outcome (Fig. 8B).
Discussion
In this paper, we delineate how the neural populations in adjacent mouse mPFC subregions processed information about WM task-related variables over time. The first key finding was that each subregion exhibited characteristic and differentiated activity during the task relative to the other subregions. The second key finding was that much of the WM task-related activity appeared unrelated to storing WM information (i.e., representations of the previously visited sample port locations). Rather, it reflected changes in activity related to poke context (sample, delay, or choice poke), and rewards. The third key finding was that WM of the sample location was stored briefly (~1 second) in the MOs and dmPFC and in a sustained manner only in the dmPFC, with minimal retrospective representations detected in vmPFC. The final key finding was that vmPFC predominantly represented choice and outcome-related activity relative to MOs and dmPFC.
Of note, there is a dearth of comparative analysis using single neuron and population activity of neighboring mPFC subregions during WM, particularly in rodents. Activity across mPFC subregions has been examined in rats performing DNMTP tasks41,42 and a more complex match to place task43. In these studies, single neurons responded prominently to reward locations and task phase, but in contrast to our findings in this study, no sustained neural activity related to retrospective sample port location was noted during the delay period. Expanding on these studies, our work implements complementary computational analyses to further explore the disparate functional roles and temporal dynamics/stability of subregional mPFC neural populations during WM. In addition, we included another brain region in our experiments, the MOs, after recent evidence that it supervises more abstract functions than basic motor control and should be considered a part of the mPFC44.
In our DNMTP task, the MOs behaves in a manner consistent with the theory proposed by Barthas and Kwan (2017) which posits its involvement in context-dependent selection of motor plans45,46 and the online monitoring of sequential sensorimotor tasks47. Activity in our recorded MOs neurons peaks in a tight window around all port pokes. Importantly, population-level differences in this transient activity can simultaneously differentiate which part of the task the poke is occurring in alongside distinguishing the left versus right sample port location. These transient poke-centered phenomena may represent close monitoring of the phase of the DNMTP task the animal is in at fixed intervals, while also providing information at each stage about the spatial rule of the current trial, facilitating ongoing motor plan selection and timing. Consistent with the notion that MOs is involved in context-dependent selection of motor plans, it has been reported that MOs can relay its contextual/motor information to many areas of the neocortex it connects with, including sensory and primary motor cortex48. Future timed inactivation studies may be useful in exploring these ideas further in WM tasks.
With respect to retrospective sample location selectivity, our analyses uncovered strong sample port representations in both MOs and dmPFC during all task periods. The retrospective sample location was detectable using single neuron and population analyses, including using an SVM to decode the sample port location and developing a GLM to show that this port location information is significantly encoded in MOs and dmPFC neurons well after the sample phase. As mentioned above, the retrospective sample location-related activity in MOs manifests only briefly around pokes (~1 sec), and these recurring transient patterns of selectivity do not remain stable throughout a trial. We speculate that MOs may not contain the molecular or circuit architecture necessary to maintain persistent activity in a group of neurons49. In contrast, subsets of dmPFC neurons are selective for the retrospective sample port at the sample, delay and choice pokes in similar patterns, with a smaller group (~10%) exhibiting sustained retrospective sample port selectivity throughout the delay. These findings are consistent with the notion that there are overlapping ensembles of WM neurons operating with different dynamics in different PFC subregions on different timescales. They also suggest that briefly active WM ensembles may be more common than sustained ensembles. It will be useful to explore these ideas further in primates and rodents using different WM tasks with a range of delay lengths.
Perhaps the most intriguing finding in our data is the aforementioned stability of dmPFC retrospective sample port selectivity of a group of neurons throughout the course of behavioral trials, especially during the majority of the delay holding period when neither of the other subregions display sample location-related activity. A substantial body of evidence has set the precedent for the existence of persistently selective delay representations in various primate brain areas including dorsolateral prefrontal cortex50,51, and in head-fixed mice. The latter work was done in the nearby anterior lateral motor cortex (ALM) during a delayed motor response task showing sample-selective preparatory activity for an upcoming left or right lick30–32,52,53. This task differs from ours in that the head-fixed animals know the exact location of the correct choice throughout the delay period, allowing them to make a precise motor plan which may be represented as persistent activity. In our task, the prospective location that will be rewarded cannot be anticipated. Additionally, the delay lengths used in the reports cited above are relatively short (1-2 seconds) compared to ours at five seconds. We believe that this longer interval is crucial to unraveling the dynamics of WM, as we observed that most neurons coding for the retrospective sample location in MOs and dmPFC around the delay poke are not sustained beyond 1-2 sec into the delay, with a small group of neurons (~10%) in dmPFC exhibiting sustained WM activity. Other studies in freely-moving rats also found non-sustained sample-selective delay activity in the frontal orienting field during a WM motor planning task45, and in prelimbic cortex in a delayed alternation task54.
An often-overlooked aspect of WM is the need to maintain some signal which updates strategies based on feedback from reward outcomes, so that behavior can be adjusted in the immediate future if results don’t match expectations. Our findings in vmPFC align with the possibility that this region is predominantly involved processing choices and outcomes. This could be communicated by different release patterns of dopamine in this region 55, or changes in firing from glutamatergic amygdala inputs 56. Overall, our results point toward a dynamic flow of information from MOs to vmPFC as the mice progress through the task. Similar dynamics across distant brain regions have been characterized before in humans 22, monkeys 57, and mice 30, but they remain poorly characterized both in primates and mice in PFC subregions involved in WM.
One caveat of this study is that we did not record from these subregions simultaneously, due to the technical difficulty of probing multiple sites along the curvature of the cortex. Future studies would benefit from using more advanced electrophysiological setups (e.g., Neuropixels 2.0 probes58) to record all subregions at once. We also did not collect video data to accompany these recordings. In future studies, video would be useful for connecting specific movements to associated neural activity patterns59. Furthermore, with the way our task was designed, we cannot definitively determine if the mice are actively remembering the retrospective port location or prospectively planning a left or right turn away from the sample port during the delay period60, although both of these scenarios require a WM mechanism. However, our GLM data showing no neurons in any region that were selective for the upcoming choice port lends support to the idea that the sustained representation we describe is most likely a retrospective one that aligns well with stored WM information very closely related to the sample port visited earlier in the trial.
One ongoing challenge in neuroscience is the mapping of rodent mPFC subregions to the corresponding areas from primate PFC. Although this can be done according to common afferent and efferent projections, molecular expression, and functions61–64, there is often a lack of consensus regarding the extent of similarity/homology. Another challenge has been the difficulty in eliciting persistent WM activity in rodent PFC to be able to study the underlying mechanisms in mice. By identifying such activity, as well other types of WM activity, this study provides a foundation to perform such studies using tools that are uniquely available in mice. Our work here also paves the way for more detailed analysis of these networks and a more nuanced and dynamic view of how neighboring brain regions process information in complex and complementary ways as mice progress through a complicated behavior that requires WM and other functions.
Methods
Animals.
All animal experiments performed in this study were approved by the Veterans Affairs Portland Health Care System Institutional Animal Care and Use Committee. Nine female and seven male mice, bred on a C57BL/6J background, were housed in the Veterans Affairs Portland Health Care System Veterinary Medical Unit on a reverse 12-hour light cycle with lights turning off at 8:00 AM (PST), and on at 8:00 PM (PST). All mice were group-housed before electrode implantation, after which they were single-housed until the completion of the experiment to prevent them from damaging each other's implants. Mice had ad libitum access to water (unless restricted for experimentation) and PMI PicoLab 5L0D Laboratory Rodent Diet (LabDiet, Inc., St. Louis, MO, USA). Mice were housed in rooms with constant temperature (22-26 °C dry bulb) and humidity (30-70%) monitoring.
One day prior to the start of initial behavioral training, mice were water restricted to 85-90% of their initial body weights so that they were motivated to seek out water rewards. This weight-based water restriction continued for the duration of active behavioral experimentation (but not during recovery from surgery), after which they were promptly returned to ad libitum water until they were sacrificed for implant location confirmation. The long-term water restriction protocol consisted of giving the mice about one gram of a 98%-water, gelatinous hydrocolloid mixture (HydroGel®, ClearH20, Westbrook, ME, USA) daily, after behavioral tasks, to keep the mice at a constant water-motivated weight. Furthermore, during the one-week recovery from surgery, mice were supplemented with an electrolyte-based recovery diet (DietGel® Recovery, ClearH20, Westbrook, ME, USA).
Behavioral setup and training.
On the first day of delayed-non-match-to-position (DNMTP) behavioral training, water-restricted mice were acclimated to the behavioral chamber (Fig. 1A, Bpod, Sanworks LLC, Rochester, NY, USA). The main hardware components of this chamber included a state machine control board and illuminable, photo-gated, water-dispensing ports. Integration of this system with MATLAB software (MathWorks, Natick, MA, USA) and our electrophysiological recording system allowed for precise closed-loop control over specific DNMTP task parameters via custom MATLAB scripts. These parameters included the timing of task phases, lighting of ports, delivery of water rewards or signaling of incorrect behavior with negative reinforcers, and online synchronization with electrophysiological data for accurate timestamping of neural firing and important behavioral events.
After a 15-minute habituation session on day one, mice were taught on day two that only lit ports could dispense water rewards. To do this, we randomly lit and pre-baited one of the four (three “front” and one “back”) ports so that water was available as soon as the port light turned on. This allowed mice to initially learn the simple Pavlovian association that water was available from lit, but not dark, ports. Once the mice were familiar with this association (usually after one 15-minute session), we changed to a slightly more instrumental design where water was not dispensed until the mice poked their nose in the lit port and broke the plane of the infrared photogate, prompting them to learn that their active engagement with the port was required for water to be dispensed.
The next step in training was a modified version of the final DNMTP task with intertrial intervals (ITIs), sample phase, delay phase, and choice phase. After a five second ITI, either the left or right front port had a 50% chance to randomly light up on a given trial (sample phase), and the mouse was required to poke its nose in the lit port to get a small water reward (3 μL) and activate the back delay port. The mouse then turned around to poke in the back delay port, which dispensed a one μL reward immediately after the poke. Poking in any of the dark ports during the sample or delay phases led to a punishment consisting of illumination of a bright house light and a behavioral timeout for 15 seconds, after which the exact same trial restarted. Successful progression through the delay led to the choice phase, where the previous sample port lit up along with one of the other two front ports (randomly, 50% chance). The mouse had to poke the lit port that it did not previously visit during the sample phase to complete a correct non-match choice and get a larger seven μL reward. Consumption of this reward typically took about two to three seconds, after which the mice left the port. Leaving the port after water consumption initiated a five second ITI period in which the mouse could not enter any other ports, or the ITI timer would reset. An incorrect choice similarly led to an illumination of a bright house light and a fifteen second timeout, but in this case a new trial with new port locations started afterwards.
Once mice achieved 70% non-match performance on this training task with no delay, we removed the sample phase reward and implemented a slow walk-up of delay length in each session to help the mice get to the final delay period of five seconds. The mice had to hold their nose in the back port until the delay timer ended to get the small one μL reward and enter the choice phase. After each successful delay hold, the timer went up from zero by 0.10 seconds until a five second delay was reached. We let the mice do this until they reliably got to 5 seconds for the delay period. The final version of the task had the mice starting the delay at zero seconds and walking up by one second per trial to the final five second delay. These first five trials were removed from analysis. The final task sessions, in which we also recorded brain activity, usually lasted around one hour and the mice completed anywhere from 48 to 141 trials, depending on motivation for water based on hydration status. Once mice performed at >70% for three consecutive sessions they were implanted with electrodes.
Electrode implantations and single-unit recordings.
Custom 28-channel implantable electrode bundles were constructed in-lab. The process consisted of threading 32-channel electrode interface boards (EIB-36-Narrow-PTB, Neuralynx, Inc., Bozeman, MT, USA) with 12 μm diameter tungsten wire (California Fine Wire Company, Grover Beach, CA, USA), and securing the wires in the board with gold pins (Neuralynx, Inc., Bozeman, MT, USA). Four slightly larger diameter local field potential wires were implanted in various mPFC-connected brain regions, although none of the data collected with these wires was used for analysis. Silver ground and reference wires were also soldered onto the EIB. The apparatus was built on a custom 3D-printed scaffold (Grey V4 resin, Formlabs, Somerville, MA, USA), with holes for drivable screws (McMaster Carr, Elmhurst, IL, USA) that allowed for advancement of the electrodes after every recording session. The wires were affixed to the EIBs using dental cement (UNIFAST Trad, GC America Inc., Alsip, IL, USA) to protect them from damage.
For implantation, mice were lightly anesthetized with 3% vaporized isoflurane (Covetrus, Dublin, OH, USA) and transferred to a stereotaxic surgery apparatus (David Kopf Instruments, Tujunga, CA, USA) where they were kept at ~1% isoflurane for the remainder of the surgery. Body temperature was monitored with a Physitemp TCAT-2LV temperature controller system, which held animals between 36 and 37 °C. Prior to initial incision, they were injected with carprofen and dexamethasone for pain management, along with a topical application of lidocaine to the skull and surrounding skin. Electrode bundles were implanted on the left side of the skull at the following coordinates (from bregma): supplementary motor area (MOs): +1.80 mm anterior, −1.50 mm lateral left, −1.00 mm ventral to brain surface; dorsomedial prefrontal cortex (dmPFC): +1.80 mm anterior, −0.40 mm lateral left, −0.50 mm ventral to brain surface; ventromedial prefrontal cortex (vmPFC): +1.80 mm anterior, −0.40 mm lateral left, −1.70 mm ventral to brain surface. A larger diameter reference wire was implanted in the left striatum: +0.50 mm anterior, −1.60 mm lateral left, −2.50 mm ventral to brain surface. A ground screw was also placed in the skull over the right cerebellum. The full setup was secured to the skull using the same dental cement mentioned previously. This included threading screws into skull-secured acrylic cuffs so that they could be advanced and drive the electrodes deeper into the brain.
Mice were allowed to recover for at least one week with daily health monitoring before returning to water restriction and behavioral testing. Electrical recordings during behavior began after two to three re-habituation sessions while plugged into the electrophysiological tether and commutator (Doric, Quebec, Canada). Data was collected with a CerePlex Direct neural acquisition system connected via a 32-channel CerePlex μ (mu) headstage (Blackrock Neurotech, Salt Lake City, UT, USA) to the implanted EIB. Unfiltered data was sampled at 30 kHz throughout an entire behavioral session. Single-units were isolated offline using Kilosort3 and timestamped to behavioral events. Since the geometry of our bundle was unknown, we used a random linear arrangement for our probe configuration parameter. We also turned off the option for registration and drift correction so that Kilosort would not try to re-order the channel map and match templates based on the random arrangement of our bundle. Units considered “good” by the Kilosort algorithm (consistently similar waveform shape and clean autocorrelations) were then manually curated and thrown out if their amplitudes or template presence drifted significantly over the course of the recording, or if cross-correlations with other units determined that they were duplicates. In the second case, the highest amplitude duplicate was kept, and the rest removed.
Analyses and statistics
The following analyses were completed using custom MATLAB (R2022a) scripts, and the use of specific built-in MATLAB functions is noted when appropriate. These analyses were performed with the goal of testing for differences between these subregional pseudopopulations.
Z-scoring and comparison of Z-scored firing rate across subregions.
After spike sorting, we created three pseudopopulations by combining all neurons within each of the PFC subregions across all recording sessions. The first analysis involved Z-scoring the firing rate of every neuron around key behavioral events. This was done by summing the spikes in every 100 ms time bin five seconds before and after the sample, delay, or choice pokes for every correct trial (one hundred total bins for each poke). We then normalized each trial of each neuron across the time bins to create time-based Z-scores. These Z-scores were averaged across all correct trials in that neuron's session to get the mean Z-score for every neuron around important DNMTP task events, and this result is depicted in the heatmaps seen in Figs. 2B, 3B and 4B. We could then take the mean across all neurons in each pseudopopulation to see how the regions differed in terms of simple firing rate changes over time (Figs. 2C, 3C and 4C).
To evaluate if the pseudopopulation Z-scored firing rate differed between subregions, we ran a separate one-way ANOVA (MATLAB function anovan) at every relevant time bin shown in the figures (this number was different between task phases). After uncorrected p-values for each ANOVA were found, we adjusted them for false discovery rate (FDR) using the Benjamini-Hochberg method 65. Each time point that still had a corrected p-value of < .05 was taken, and unpaired t-tests in that bin were conducted on each combination of subregional comparisons. The p-values from these multiple comparisons were then Bonferroni post-hoc corrected, and only comparisons with adjusted p-values still below .05 were considered significantly different at the corresponding timepoint. The above ANOVA strategy was also used to calculate significant subregional differences in the changes in Z-scored firing rate on incorrect vs correct trials in Fig. 8, except that that incorrect scores were subtracted from correct scores before analysis.
We took a related approach to analyze the percentages of neurons in each region that exhibited an increase or decrease by 0.5 Z-units (Figs. 2D, 3D and 4D). However, instead of running ANOVAs on each time point, we ran a X2 test for homogeneity of proportions across the three groups (MATLAB function crosstab). Like the ANOVA approach, we also adjusted p-values for false discovery rate over time using the Benjamini-Hochberg method. If adjusted p-value was still less than .05 for a time bin, we ran separate pairwise X2 tests for each combination of subregions and corrected for these multiple comparisons using the Bonferroni-Holm method 66. Any adjusted p-value below .05 after these conservative corrections was considered to represent a significant difference in the proportion of neurons either increasing or decreasing their firing rate between two groups at that time point. Moreover, this X2 strategy was similarly employed in Fig. 8, to study the differences in percentages of neurons that increase or decrease their firing rates in response to incorrect trials.
Determining retrospective sample location selectivity using permutation testing.
Retrospective sample location selectivity was analyzed in several ways throughout the paper. In Figs. 2F, 3F and 4F, we used a permutation testing method to compare the raw spike counts in 100 ms time bins between left and right sample location trials (correct trials only). Since sessions rarely had an equal number of correct left and right sample trials, for each session we randomly subsampled trials from the greater of the two to match the number of trials from the lesser. Next, we randomly sampled two trials from a combined subpopulation of left and right samples, such that the spike counts for these random two trials could be from two left trials, two right trials, or a left and a right trial. We took the difference between two randomly sampled trials 1000 times and created a shuffled distribution of differences. We then calculated the true mean difference between all left and right trials, compared that to the shuffled distribution, and counted the number of shuffled differences that were greater than or less than the true mean difference. Neurons were considered to be significantly selective for a sample location if < 25 out of 1000 shuffled differences were greater in magnitude than the actual difference (approximating a p-value of < .025). This was done for every time point around key poke events.
Support vector machines (SVMs) for cross-temporal decoding of sample location and prediction of poke context.
To track the stability of location selectivity across time in each region, we trained linear SVMs (MATLAB fitcsvm) to decode retrospective sample location every 100 ms using binned raw spike counts from each neuron in a pseudopopulation on all correct trials. We then assessed how well each time point’s trained model could predict sample location based on the pseudopopulation activity at all time points (including the one it was trained on). A stable population representation would display above-chance predictive accuracy (> 50%) at time points far from the training time. A leave-two-out cross-validation scheme was applied to protect against overfitting the model. This consisted of holding out one trial from both left and right location samples and testing how the model trained on the remaining trials predicted the identity of these held out trials. Importantly, we subsampled all left and right trial counts to 12, which was the lowest left or right sample location trial count across all sessions, although most sessions had much more than 12. Our leave-two-out strategy was therefore repeated 12 times, with each value from this subsampling appearing once without replacement in the testing set. The prediction accuracy of the model for each subsample was calculated as the number of correct classifications of all held out trial combinations, out of 24. The overall subsampling procedure was repeated 20 times, for a total of 480 model predictions to calculate prediction accuracy for each cross-temporal comparison.
A second SVM was used in Fig. 5 to classify the poke context (sample, delay, or choice poke) in a brief window around the three poke events across correct trials. Because there were three contexts, a multiclass SVM was needed. We used the fitcecoc MATLAB function with a ‘one-vs-one’ coding design and 5-fold cross validation per comparison. We also ran a shuffled version of this model to confirm that the model was not overfit, and that chance level was ~33%.
General linear modeling of multiple task parameters over time.
We used a general linear model (GLM, MATLAB function fitlm) to study how neurons in each region encoded multiple DNMTP task variables simultaneously around all task pokes. For Fig. 5, the GLM predictor variables included poke context (is the poke occurring during the sample, delay, or choice phase), sample location identity (right or left), choice location identity (left, center, or right), and outcome (correct or incorrect). This particular analysis with poke context (Fig. 5) was done in a brief time frame around poke events, because mice took inconsistent amounts of time to progress to different pokes, and we wanted to make sure the task phases in question were temporally well isolated. We took a similar GLM approach to Akam et al. (2021), in which a GLM was run on every time bin for every neuron centered around poke events, which initially included the contributions of all predictor variables (known as the “full” model). In other words, our GLM is attempting to predict the firing rate of each neuron at every categorical level (e.g. sample, delay, or choice poke) of our predictor variables at all time points in question.
The sum of squares error (SSE) of this full model represents the amount of residual variance in a neuron's firing rate that cannot be explained by changes in the predictor variables. The full model should account for more firing rate variability and therefore have a relatively low SSE since it contains more relevant predictor variables. To uncover the extent to which pseudopopulations encoded singular task variables, we found the coefficient of partial determination (CPD) for each individual variable. This can be calculated by removing the singular task variables from the full model and running the GLM again on the reduced model. Since the reduced model should have a higher error term than the full, subtracting the full model from each reduced model should approximate the contribution of each removed variable to the full model’s firing rate prediction. The resulting number is the CPD for that singular variable, and it represents the percentage of total firing rate variability for a given neuron explained by that variable. A CPD was generated for each neuron at each time point, and the mean CPD for each pseudopopulation was reported over time and analyzed for significance with the same ANOVA approach as mentioned in Z-scoring and comparison of Z-scored firing rate across subregions. Using CPDs from all predictor variables, we could then compare the contributions of each variable to the explainable (not total) firing rate variance in each region.
For Fig 6., significantly strong CPDs (now measured across all relevant time points) for each variable were determined by comparing the true CPD value to a shuffled distribution of CPDs generated from randomly shuffling the relationship between firing rate and predictor variables 1000 times. As a very conservative cutoff, CPDs were only considered significant if none of the shuffled CPD values were higher than the unshuffled one, and we calculated the proportion of neurons at every time point that fulfilled this criterion. We then could determine differences in proportions of complex DNMTP task variable encoding between regions using the same X2 approach employed at the end of the section Z-scoring and comparison of Z-scored firing rate across subregions. We then took these significant neurons and followed their sample location selectivity throughout the entire behavioral trial. Venn diagrams in Fig. 7 depict the percent of neurons with significant CPDs in each region taken from 3 bins (600 ms) on either side the labeled poke.
Among the most informative GLM outputs are beta-weights representing the strength and direction of the relationship between the firing rate of every neuron and the levels of each predictor variable. As another approach to visualize the stability of retrospective sample port location encoding, we recovered the maximum beta-weight amplitudes for every neuron with significant selectivity in a 600 ms time window around pokes. In this case, the beta-weights of significant neurons could be positive or negative with respect to the sample port location predictor variable, with negative signifying neurons that fired more on the left side of the box and vice versa. We quantified how these beta-weights changed over time using Pearson correlations (r, MATLAB function corr) comparing the significant beta-weight population vector at one point to the beta-weight population of those same neurons in the future. The significance levels of these correlations were also reported in Fig 7C and represent the confidence that the reported correlations are different from zero.
Acknowledgements:
We would like to thank Bakr Alkarawi for his help assembling the electrode implants used in this study. This work was financially supported by the Portland VA Research Foundation (PVARF), the OHSU Physician-Scientist Program to A.A., and the National Institute on Alcohol Abuse and Alcoholism (T32 AA007468 to A.S.).
Footnotes
Competing interest: None of the authors have any competing interest to declare.
References
- 1.Miller E. K., Lundqvist M. & Bastos A. M. Working Memory 2.0. Neuron 100, 463–475 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Christophel T. B., Klink P. C., Spitzer B., Roelfsema P. R. & Haynes J.-D. The Distributed Nature of Working Memory. Trends Cogn Sci 21, 111–124 (2017). [DOI] [PubMed] [Google Scholar]
- 3.Eriksson J., Vogel E. K., Lansner A., Bergström F. & Nyberg L. Neurocognitive architecture of working memory. Neuron 88, 33 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Harris J. A. et al. Hierarchical organization of cortical and thalamic connectivity. Nature 575, 195–202 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Cruz K. G. et al. Cortical-subcortical interactions in goal-directed behavior. Physiological Reviews 103, 347–389 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Miller E. K. & Cohen J. D. An Integrative Theory of Prefrontal Cortex Function. Annual Review of Neuroscience 24, 167–202 (2001). [DOI] [PubMed] [Google Scholar]
- 7.Funahashi S. Working Memory in the Prefrontal Cortex. Brain Sciences 7, 49 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chai W. J., Abd Hamid A. I. & Abdullah J. M. Working Memory From the Psychological and Neurosciences Perspectives: A Review. Front Psychol 9, 401 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Xu P., Chen A., Li Y., Xing X. & Lu H. Medial prefrontal cortex in neurological diseases. Physiological Genomics 51, 432–442 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Laubach M., Amarante L. M., Swanson K. & White S. R. What, If Anything, Is Rodent Prefrontal Cortex? eNeuro 5, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ährlund-Richter S. et al. A whole-brain atlas of monosynaptic input targeting four different cell types in the medial prefrontal cortex of the mouse. Nat Neurosci 22, 657–668 (2019). [DOI] [PubMed] [Google Scholar]
- 12.Gao L. et al. Single-neuron projectome of mouse prefrontal cortex. Nat Neurosci 25, 515–529 (2022). [DOI] [PubMed] [Google Scholar]
- 13.Ortiz C. et al. Molecular atlas of the adult mouse brain. Science Advances 6, eabb3446 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Bhattacherjee A. et al. Spatial transcriptomics reveals the distinct organization of mouse prefrontal cortex and neuronal subtypes regulating chronic pain. Nat Neurosci 26, 1880–1893 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Le Merre P., Ährlund-Richter S. & Carlén M. The mouse prefrontal cortex: Unity in diversity. Neuron 109, 1925–1944 (2021). [DOI] [PubMed] [Google Scholar]
- 16.Gilad A., Gallero-Salas Y., Groos D. & Helmchen F. Behavioral Strategy Determines Frontal or Posterior Location of Short-Term Memory in Neocortex. Neuron 99, 814–828.e7 (2018). [DOI] [PubMed] [Google Scholar]
- 17.Pinto L. et al. Task-Dependent Changes in the Large-Scale Dynamics and Necessity of Cortical Regions. Neuron 104, 810–824.e9 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Pinto L., Tank D. W. & Brody C. D. Multiple timescales of sensory-evidence accumulation across the dorsal cortex. eLife 11, e70263 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Steinmetz N. A., Zatka-Haas P., Carandini M. & Harris K. D. Distributed coding of choice, action and engagement across the mouse brain. Nature 576, 266–273 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Aoi M. C., Mante V. & Pillow J. W. Prefrontal cortex exhibits multidimensional dynamic encoding during decision-making. Nat Neurosci 23, 1410–1420 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Koay S. A., Charles A. S., Thiberge S. Y., Brody C. D. & Tank D. W. Sequential and efficient neural-population coding of complex task information. Neuron 110, 328–349.e11 (2022). [DOI] [PubMed] [Google Scholar]
- 22.Wang X.-J. Theory of the Multiregional Neocortex: Large-Scale Neural Dynamics and Distributed Cognition. Annual Review of Neuroscience 45, 533–560 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bolkan S. S. et al. Thalamic projections sustain prefrontal activity during working memory maintenance. Nat Neurosci 20, 987–996 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Abbas A. I. et al. Somatostatin Interneurons Facilitate Hippocampal-Prefrontal Synchrony and Prefrontal Spatial Encoding. Neuron 100, 926–939.e3 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wu Z. et al. Context-Dependent Decision Making in a Premotor Circuit. Neuron 106, 316–328.e6 (2020). [DOI] [PubMed] [Google Scholar]
- 26.Vogel P., Hahn J., Duvarci S. & Sigurdsson T. Prefrontal pyramidal neurons are critical for all phases of working memory. Cell Reports 39, (2022). [DOI] [PubMed] [Google Scholar]
- 27.Allen W. E. et al. Thirst regulates motivated behavior through modulation of brainwide neural population dynamics. Science 364, eaav3932 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Voitov I. & Mrsic-Flogel T. D. Cortical feedback loops bind distributed representations of working memory. Nature 1–9 (2022) doi: 10.1038/s41586-022-05014-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Chen S. et al. Brain-wide neural activity underlying memory-guided movement. Cell 187, 676–691.e16 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Guo Z. V. et al. Flow of Cortical Activity Underlying a Tactile Decision in Mice. Neuron 81, 179–194 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Guo Z. V. et al. Maintenance of persistent activity in a frontal thalamocortical loop. Nature 545, 181–186 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Li N., Chen T.-W., Guo Z. V., Gerfen C. R. & Svoboda K. A motor cortex circuit for motor planning and movement. Nature 519, 51–56 (2015). [DOI] [PubMed] [Google Scholar]
- 33.Chen G., Kang B., Lindsey J., Druckmann S. & Li N. Modularity and robustness of frontal cortical networks. Cell 184, 3717–3730.e24 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Whishaw I. Q. et al. Organization of the reach and grasp in head-fixed vs freely-moving mice provides support for multiple motor channel theory of neocortical organization. Exp Brain Res 235, 1919–1932 (2017). [DOI] [PubMed] [Google Scholar]
- 35.Bjerre A.-S. & Palmer L. M. Probing Cortical Activity During Head-Fixed Behavior. Front. Mol. Neurosci. 13, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Teutsch J. & Kätzel D. Operant Assessment of DMTP Spatial Working Memory in Mice. Front. Behav. Neurosci. 13, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Nee D. E. et al. A Meta-analysis of Executive Components of Working Memory. Cerebral Cortex 23, 264–282 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Spaak E., Watanabe K., Funahashi S. & Stokes M. G. Stable and Dynamic Coding for Working Memory in Primate Prefrontal Cortex. J Neurosci 37, 6503–6516 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Oh S. W. et al. A mesoscale connectome of the mouse brain. Nature 508, 207–214 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Anastasiades P. G. & Carter A. G. Circuit organization of the rodent medial prefrontal cortex. Trends in Neurosciences 44, 550–563 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Onos K. D. et al. Prefrontal Neurons Encode Actions and Outcomes in Conjunction with Spatial Location in Rats Performing a Dynamic Delayed Non-Match to Position Task. PLoS One 11, e0149019 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Francoeur M. J. & Mair R. G. Representation of actions and outcomes in medial prefrontal cortex during delayed conditional decision-making: Population analyses of single neuron activity. Brain and Neuroscience Advances 2, 2398212818773865 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Böhm C. & Lee A. K. Canonical goal-selective representations are absent from prefrontal cortex in a spatial working memory task requiring behavioral flexibility. eLife 9, e63035 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Barthas F. & Kwan A. C. Secondary Motor Cortex: Where ‘Sensory’ Meets ‘Motor’ in the Rodent Frontal Cortex. Trends in Neurosciences 40, 181–193 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Erlich J. C., Bialek M. & Brody C. D. A cortical substrate for memory-guided orienting in the rat. Neuron 72, 330–343 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sul J. H., Jo S., Lee D. & Jung M. W. Role of rodent secondary motor cortex in value-based action selection. Nat Neurosci 14, 1202–1208 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ostlund S. B., Winterbauer N. E. & Balleine B. W. Evidence of Action Sequence Chunking in Goal-Directed Instrumental Conditioning and Its Dependence on the Dorsomedial Prefrontal Cortex. J. Neurosci. 29, 8280–8287 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Ebbesen C. L. et al. More than Just a “Motor”: Recent Surprises from the Frontal Cortex. J. Neurosci. 38, 9402–9413 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Masse N. Y., Yang G. R., Song H. F., Wang X.-J. & Freedman D. J. Circuit mechanisms for the maintenance and manipulation of information in working memory. Nat Neurosci 22, 1159–1167 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Funahashi S., Bruce C. J. & Goldman-Rakic P. S. Mnemonic coding of visual space in the monkey’s dorsolateral prefrontal cortex. J Neurophysiol 61, 331–349 (1989). [DOI] [PubMed] [Google Scholar]
- 51.Fuster J. M. & Alexander G. E. Neuron activity related to short-term memory. Science 173, 652–654 (1971). [DOI] [PubMed] [Google Scholar]
- 52.Chen T.-W., Li N., Daie K. & Svoboda K. A Map of Anticipatory Activity in Mouse Motor Cortex. Neuron 94, 866–879.e4 (2017). [DOI] [PubMed] [Google Scholar]
- 53.Chia X. W., Tan J. K., Ang L. F., Kamigaki T. & Makino H. Emergence of cortical network motifs for short-term memory during learning. Nat Commun 14, 6869 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Yang S.-T., Shi Y., Wang Q., Peng J.-Y. & Li B.-M. Neuronal representation of working memory in the medial prefrontal cortex of rats. Molecular Brain 7, 61 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Ellwood I. T. et al. Tonic or Phasic Stimulation of Dopaminergic Projections to Prefrontal Cortex Causes Mice to Maintain or Deviate from Previously Learned Behavioral Strategies. J. Neurosci. 37, 8315–8329 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Keefer S. E. & Petrovich G. D. The basolateral amygdala-medial prefrontal cortex circuitry regulates behavioral flexibility during appetitive reversal learning. Behav Neurosci 134, 34–44 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Siegel M., Buschman T. J. & Miller E. K. Cortical information flow during flexible sensorimotor decisions. Science 348, 1352–1355 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Steinmetz N. A. et al. Neuropixels 2.0: A miniaturized high-density probe for stable, long-term brain recordings. Science 372, eabf4588 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Musall S., Kaufman M. T., Juavinett A. L., Gluf S. & Churchland A. K. Single-trial neural dynamics are dominated by richly varied movements. Nat Neurosci 22, 1677–1686 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Myroshnychenko M., Seamans J. K., Phillips A. G. & Lapish C. C. Temporal Dynamics of Hippocampal and Medial Prefrontal Cortex Interactions During the Delay Period of a Working Memory-Guided Foraging Task. Cereb Cortex 27, 5331–5342 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Uylings H. B. M., Groenewegen H. J. & Kolb B. Do rats have a prefrontal cortex? Behavioural Brain Research 146, 3–17 (2003). [DOI] [PubMed] [Google Scholar]
- 62.Narayanan N. S., Cavanagh J. F., Frank M. J. & Laubach M. Common medial frontal mechanisms of adaptive control in humans and rodents. Nat Neurosci 16, 1888–1895 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Vogt B. A. & Paxinos G. Cytoarchitecture of mouse and rat cingulate cortex with human homologies. Brain Struct Funct 219, 185–192 (2014). [DOI] [PubMed] [Google Scholar]
- 64.Tsutsui K.-I., Oyama K., Nakamura S. & Iijima T. Comparative Overview of Visuospatial Working Memory in Monkeys and Rats. Frontiers in Systems Neuroscience 10, (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Benjamini Y. & Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society. Series B (Methodological) 57, 289–300 (1995). [Google Scholar]
- 66.Holm S. A Simple Sequentially Rejective Multiple Test Procedure. Scandinavian Journal of Statistics 6, 65–70 (1979). [Google Scholar]
- 67.Akam T. et al. The Anterior Cingulate Cortex Predicts Future States to Mediate Model-Based Action Selection. Neuron 109, 149–163.e7 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]