Abstract
Reinforcement learning and goal-seeking behavior are thought to be mediated by midbrain dopamine neurons. However, little is known about neural substrates of curiosity and exploratory behavior, which occur in the absence of clear goal or reward. This is despite behavioral scientists having long suggested that curiosity and exploratory behaviors are regulated by an innate drive. We refer to such behavior as information-seeking behavior and propose 1) key neural substrates and 2) the concept of environment prediction error as a framework to understand information-seeking processes. The cognitive aspect of information-seeking behavior, including the perception of salience and uncertainty, involves, in part, the pathways from the posterior hypothalamic supramammillary region to the hippocampal formation. The vigor of such behavior is modulated by the following: supramammillary glutamatergic neurons; their projections to medial septal glutamatergic neurons; and the projections of medial septal glutamatergic neurons to ventral tegmental dopaminergic neurons. Phasic responses of dopaminergic neurons are characterized as signaling potentially important stimuli rather than rewards. This paper describes how novel stimuli and uncertainty trigger seeking motivation and how these neural substrates modulate information-seeking behavior.
Keywords: Music perception, Risk taking, Reward prediction error, Expectancy, Latent learning, Stimulus-stimulus learning
Anticipation of pleasure is, in itself, a very considerable pleasure. DAVID HUME, A Treatise of Human Nature.
A mind is fundamentally an anticipator, an expectation-generator. It mines the present for clues, which it refines with the help of the materials it has saved from the past, turning them into anticipations of the future.
DANIEL DENNETT, Kinds of Minds.
1. Introduction
The ability to anticipate future events provides survival advantage. Human and non-human animals who can anticipate the future can prepare themselves for opportunities and dangers. The anticipation of future events depends on remembering the past and perceiving the present. Thus, animals need to acquire environmental information and store it for future use. To this end, an animal’s active interaction with its environment that leads to information acquisition would help the animal generate the best possible prediction of future events. We discuss the existence of a motivation system that coordinates cognition and behavior to acquire information and suggest neural substrates of these processes. In addition, we discuss how midbrain dopamine neurons are involved in acquiring environmental information. Our ideas are summarized in Fig. 1, which is explained over multiple, upcoming sections.
Fig. 1.

A conceptual model for seeking behavior process. A. The diagram shows a highly simplified model describing relationships among key concepts related to seeking motivation (see Sections 1 and 2). Although it is not straightforward to characterize a neurobiology-based model in terms of the concepts used in machine or reinforcement learning, reinforcement learning concepts (Sutton and Barto, 1998) are shown in italic in the proposed model, to clarify the relationships among key concepts: agent, the learner and decision maker; environment, things that interact with the agent; state, anything that the agent have observed (could include the Internal Models); policy, the rule that the agent employ selective action options; reward, experienced goal (i.e., optimal level of detected EPE); value, desired goal (i.e., optimal level of anticipated EPE); action, selected behavioral response. The goal of the model is to maximize knowledge about the environment. The process executes as follows: (1) The agent or animal perceives the environment (state). (2) The perceived information is compared against the prediction generated by internal models. (3) The comparison results in the detection of environment prediction errors (EPEs). Detected EPEs are integrated into the models, to update knowledge, which generate new predictions. (4) New predictions are subjectively assessed for their likelihood, generating future, anticipated EPEs (i.e., uncertainty). (5) Detected and anticipated EPEs are used to produce reward and value, respectively, as shown in (B; Although reward and value are distinguished in the model due to being differentially derived, both are motivators. See subsection 1.1). Reward/value activates the information-seeking process, depending on the supramammillo-septal pathway (SuM-MS). (6) The information-seeking process modulates the activity of the goal-seeking process, depending on the meso-limbic pathway (VTA-VStr). (7) The executive system produces actions based on perceptual and cognitive inputs, motivation inputs, and policy, which partly comes from the internal models. This process is repeated until the animal gets “bored” caused by low or high EPEs (B) or until seeking behavior is interrupted by other needs, including rewardsclassic, threats, fatigue, etc. In addition, the diagram explains how intracranial self-stimulation (ICSS) is produced. Artificial activation of certain neural elements of seeking motivation (green arrows) activates the executive system, which instigates seeking response, and if the seeking response is contingently paired with such activation, animals display ICSS. B. EPE level has an inverted U-curve relationship with seeking motivation level. The information-seeking process may be affected by broad range of EPEs (1), while the goal-seeking process may be responsive to a narrow EPE range (2).
1.1. Reward and behavior
Behavior is motivated by reward. This term, reward, is defined as having behavioral effect without referring to any conscious experience as follows: A reward is a thing, experience, or brain manipulation that instigates and reinforces approach behavior. Therefore, if a stimulus or event instigates and reinforces approach behavior, then it is a reward.
Accordingly, rewards can be any of a variety of things, including food, water, novel stimuli, abused substances, and intracranial manipulations. We distinguish classic rewards (rewardsclassic) such as food and water from non-conventional rewards which are discussed below. Rewardsclassic such as food and water are regulated homeostatically. Homeostatic needs instigate animals to seek rewardsclassic or conditioned stimuli that have been associated with rewardsclassic (CSreward). This results in CSreward acquiring motivational properties of rewardsclassic and thereby becoming important guides for seeking behaviors. Such behavior persists until homeostatic needs are met, and therefore, are typically labeled as goal-directed behavior or goal-seeking behavior. In addition, certain species-specific stimuli, including reproduction-related stimuli and other social stimuli, are homeostatically regulated (Matthews and Tye, 2019). For example, social play behavior in juvenile rats serves as a reward and is regulated by age and exposure amount (Ikemoto and Panksepp, 1992; Panksepp, 1981b; Panksepp and Beatty, 1980; Trezza et al., 2011). Therefore, they can instigate goal-seeking behavior.
The other class of rewards is environmental stimuli that do not have a direct link with biological needs but can be characterized as salient or novel stimuli. Although many such stimuli can instigate and reinforce approach behavior, their effects as such are transient and not as persistent as those of rewardsclassic and CSreward. In other words, they are not considered to be behavioral goals, but instigators of curiosity and exploratory behavior. Therefore, we refer to the behavioral interaction with such stimuli as information-seeking behavior.
Note that artificial rewards such as abused substances (e.g., cocaine and heroin) and brain stimulation rewards (e.g., intracranial self-stimulation, see subsection 2.7) are not classified as rewardsclassic because no evidence indicates that they are regulated by homeostatic or specialized, evolutionarily selected mechanisms. However, they can elicit powerful reinforcing effects such that animals and humans seek them despite negative consequences (Koob and Volkow, 2010; Olds, 1958; Routtenberg and Lindy, 1965). The reason for this, at least in part, is that they can activate midbrain dopamine neurons (Koob and Volkow, 2010; Wise and Bozarth, 1987), which play a critical role in goal-seeking behavior (Fig. 1A; see Section 7).
In sum, sensory stimuli associated with rewardsclassic can act as persistent goals and support goal-seeking behavior. Alternately, other environmental stimuli can serve as rewards whose effects are transient and inconsistent, and support information-seeking behavior.
1.2. Information-seeking behavior
Curiosity and exploratory behavior are suggested to be regulated by a drive or motivation system (Dashiell, 1925; Nissen, 1930), and it is reinforcing to engage in the action to satisfy or activate that motivation (Berlyne, 1950; Montgomery, 1954; Woodworth, 1958). For example, rodents learn operant responding for the opportunity to explore an environment that contains no rewardsclassic (Montgomery, 1954; Myers and Miller, 1954). Similarly, monkeys learn operant responding for the opportunity to view their surroundings from an enclosed chamber (Butler, 1953; Butler and Harlow, 1954). The existence of such motivation is further supported by the phenomenon called latent learning (Blodgett, 1929; Tolman, 1948). For example, Tolman and his colleagues famously demonstrated that rats left in a maze with multiple paths and corners in the absence of any rewardclassic still learn about maze environments. This knowledge is demonstrated by efficient seeking behavior when rewardsclassic are later introduced in the maze (Tolman, 1948). These behavioral observations support motivational processes that instigate and reinforce information-seeking behavior in the absence of rewardsclassic. We now refer to the processes as seeking motivation, and we suggest that this motivation also plays an important role in the seeking for rewardsclassic. Note that the term motivation is used here in two ways: (1) a coordinator of multiple structural and sub-functional activities for specified function and (2) an invigorator that increases occurrences or effort for specified function.
In the ensuing sections, we will provide further evidence and elaborate on how this motivation system coordinates neural substrates to regulate seeking behavior for environmental information (i.e., information-seeking behavior) and rewardsclassic (i.e., goal-seeking behavior). Other groups have also begun investigating how the brain regulates seeking behavior that is not necessarily linked directly with rewardsclassic (Ahmadlou et al., 2021; Bromberg-Martin and Hikosaka, 2009; Daddaoua et al., 2016; Gottlieb et al., 2014; Gruber and Ranganath, 2019; Marvin et al., 2020; Monosov, 2020). This review also discusses how the proposed mechanisms of seeking motivation relate to the midbrain dopamine (DA) system. DA neurons are known to display phasic activities that are characterized as reward prediction errors (RPEs), the difference between actual rewardclassic and predictions of the time and magnitude of rewardclassic, to teach the animal relationships between the environment, behavior, rewardclassic, and punishment and to shape goal-seeking behavior (Schultz et al., 1997). Note that in addition to reinforcement learning, phasic signals can produce motivation. Phasic DA activity is correlated with reward-seeking responses (Phillips et al., 2003; Roitman et al., 2004; Satoh et al., 2003), and phasic stimulation of DA neurons or medial prefrontal cortex neurons (which in turn activate midbrain DA neurons (Beier et al., 2015)) is found to produce invigorating effects on reward-seeking responses (Hamid et al., 2016; Fig. 6 of Ilango et al., 2014a; Fig. 2f of Yang et al., 2022).
Fig. 6.

Key structures of the network involved in seeking motivation. The red lines indicate reciprocal neural connectivity, while the black lines indicate predominantly unidirectional connectivity. In particular, the interaction with the hippocampus is suggested to play a key role in information-seeking behavior. Note that this diagram by no means represent all important structures that participate in information-seeking behavior. Abbreviations: LS, lateral septal nuclei; MS, medial septal nucleus; PAG, periaqueductal gray; PFC, prefrontal cortex; SuM, supramammillary region; VStr, ventral striatum; VTA, ventral tegmental area; ZI, zona incerta.
Fig. 2.

Cognitive and behavioral components of seeking motivation. A. The neural substrates mediating the cognitive component of seeking motivation consists of (1) SuMn-to-Hipp, (2) SuMnGlu-to-MS, and (3) MS-to-Hipp pathways. In addition, the MS receives input from the Hipp, and the SuM receives input from the MS. B. The neural substrates mediating the behavioral component of seeking motivation consists of (1) SuMnGlu-to-MS, (2) MSnGlu-to-VTA, and (3) VTAnDA-to-VStr pathways. Abbreviations: DA, dopaminergic; GABA, gamma-aminobutyric acidergic; Glu, glutamatergic; Hipp, hippocampal formation; LS, lateral septal nuclei; MS, medial septal nucleus; SuM, supramammillary region; VStr, ventral striatum; VTA, ventral tegmental area.
2. Environment and seeking motivation
2.1. Environment prediction error
To facilitate mechanistic discussions of information-seeking behavior, we propose the term environment prediction errors (EPEs). EPEs allow the animal to learn about the environment and motivate the animal to seek environmental information (Fig. 1). First, EPEs are used to update internal models of the environment. Internal models of the environment are developed over time by acquiring previously perceived information and, in turn, generate predictions about the environment. The prediction is compared against perceived environment, and the discrepancy between the prediction and the perception concerning the environment is an EPE. Second, although it may not be intuitive, EPEs will invigorate the animal to seek more information (see subsection 2.3 for evidence). Detected EPEs are associated with such perceptions as novelty and salience. Novel and salient stimuli are worthy of attention and investigation because such perceptions indicate that their features are not clearly represented in the internal models.
An important, related term is uncertainty. In terms of EPE, uncertainty is a subjective confidence level about the prediction that the internal model has generated and not yet been evaluated, thereby the anticipation of EPEs. Like detected EPEs (i.e., novelty and salience), anticipated EPEs (i.e., uncertainty) instigate attention and investigation to learn more about the environment.
In summary, both detection and anticipation of EPEs depend on internal models, which have been developed through prior environmental interactions with sensation and perception, learning and memory, and integration of such processes. Because acquired information should make it more effective in predicting the future for obtaining rewardsclassic and avoiding dangers, the brain must be organized in such a way to detect and anticipate EPEs and then to instigate information-seeking behavior to update internal models.
2.2. Seeking motivation
Seeking motivation coordinates cognitive and behavioral processes to produce adaptive seeking behavior. For one, it increases the cognitive capacity for attending and perceiving environmental information, storing it for future use, and recalling it. Thus, seeking motivation has a positive feedback relation with cognition (Fig. 1A). In addition, seeking motivation invigorates approach behavioral responses toward the environment. Note that approach behavior includes active avoidance behavior, which is a type of approach behavior - approach toward safety (see subsection 7.2). For example, hearing fire alarm in a high-rise building instigates seeking behavior for a staircase. This seeking behavior is active in the same way that, for example, the sight of a novel object in a familiar environment instigates seeking behavior toward the object. Such actions are associated with such feelings as hope, desire, and energy. Thus, active avoidance behavior should be distinguished from passive avoidance (i.e., freezing) behavior. The detection of a predator in proximity will elicit passive avoidance behavior, which is associated with fear, anxiety, and aversion. Negative affective states at extreme levels elicit escape behavior, which occurs in the absence of seeking motivation. Thus, EPEs can be positive and instigate active seeking behavior, or negative and elicit passive avoidance or escape behaviors. The present paper focuses on seeking motivation that instigates approach, seeking behavior
2.3. Relationship between seeking motivation and learning
Positive EPEs activate seeking motivation that instigates and reinforce approach behavior. Let us explain this idea using the example of seeking behavior for light illumination in rodents. Rodents are attracted to brief illumination of light, which has not been associated with any rewardclassic: When a rat or mouse is placed in a chamber equipped with a lever switch and a lightbulb, the animal explores about the chamber environment including the lever. The animal incidentally depresses the lever and produces a brief light illumination, an event that leads to learning to respond on the lever that produces the light illumination (Keller et al., 2014; Kish, 1966; Stewart and Hurwitz, 1958). Accordingly, novel environment instigates exploratory behavior, and salient light illumination further activates seeking motivation, which in turn instigates seeking behavior for the stimulus. A seeking response that has resulted in a light illumination is reinforced, and rodents learn to make the response again and again as long as the stimulus produces an EPE. As rodents repeatedly experience the stimulus, EPEs will decrease to the point that the exposure no longer activates seeking motivation or behavior. Such explanation can be applied to other observations: Rodents are attracted more to novel contexts and stimuli than familiar ones, as shown by context- and object-choice tasks (Bardo et al., 1989; Ennaceur, 2010); monkeys spend more time gazing at novel stimuli than familiar stimuli (Daddaoua et al., 2016; Ghazizadeh et al., 2016).
Like salient and novel stimuli, uncertainty, i.e., anticipated EPEs, can instigate and reinforce active seeking behavior. Hungry rodents interact more with cues signaling uncertain deliveries of food reward over cues signaling certain outcomes (Anselme et al., 2013). Additionally, monkeys spend more time gazing at stimuli predicting uncertain rewards over certain rewards (Daddaoua et al., 2016; Ghazizadeh, Griggs, and Hikosaka, 2016). Given choices, monkeys often prefer choosing uncertain choices over certain choices associated with rewardsclassic (Monosov et al., 2015), even if selecting uncertain choices results in fewer overall rewardsclassic (McCoy and Platt, 2005; Monosov et al., 2015; O’Neill and Schultz, 2010). Likewise, some people enjoy gambling such as playing cards and betting on horse races. Such activities may be instigated and reinforced by uncertainty, i.e., the anticipation of EPEs associated with such activities. In summary, these observations, involving novelty and uncertainty, are consistent with the idea that the detection and anticipation of EPEs can elicit seeking motivation, which instigates and reinforces seeking behavior.
2.4. Insights from music perception
Research on music perception provides important insights into the relationship between EPEs and seeking motivation. First, research suggests that the intrinsic reward value of music is tied to the anticipation of upcoming chords and chord sequences (Huron, 2006; Salimpoor et al., 2015). This idea is consistent with our presently proposed perspective, which explains music reward as follows: The anticipation and detection of EPEs concerning chords and chord sequences stimulate and reinforce listening (Fig. 1A).
In addition, research on music perception found that upcoming chords and chord sequences that are easy and difficult to predict have little music value (Cheung et al., 2019). In other words, low and high levels of EPEs concerning chords and chord sequences are less effective than the mid-range level of EPEs in stimulating and reinforcing listening. Fig. 1B depicts an inverted U-shaped relationship between EPEs and seeking motivation. Interestingly, in certain circumstances the value of stimuli or contexts that are complex and, thus produce large EPEs, can increase information values after repeated exposure (Berlyne, 1970). Moreover, the complexity of stimuli exposed immediately earlier can affect the seeking value of the next stimuli that the animal encounters (Berlyne and Crozier, 1971). This may mean that the cognitive capacity to handle information has a limit and that highly complex stimuli overload the system, leading to little information value towards an understanding of the environment. Repeated exposure to the environment producing large EPEs perhaps decrease EPEs, resulting in increased seeking motivation for better understanding of the environment.
Our discussion above explains the latent learning phenomenon that we previously mentioned. The anticipation and detection of EPEs activate seeking motivation, which instigates and reinforces exploratory behavior in novel environments, and this process results in learning about environmental stimuli and their relationships in the absence of rewardclassic.
2.5. Seeking motivation can persist in familiar environments
Here we elaborate on the role of seeking motivation in familiar environments where seeking behavior may be regulated by the passage of time. The detection of EPEs instigates seeking motivation to explore environments or objects, and through this activity, animals learn about the environment. However, seeking motivation may not diminish in the environment where the animal has repeatedly interacted. Many environments are complex, and their conditions are dynamic, not static; they constantly change. Even in a relatively static laboratory environment, exploratory behavior does not disappear. For example, when rats are placed in a 40 cm × 40 cm chamber for 30 min daily, rats display robust exploratory activities on day 1, and activities become shorter and less robust over the next several days. However, exploratory behavior never disappears over the ensuing days; rats display exploratory behavior at the beginning of each daily session (Ikemoto, 2002). The absence from the environment for a few hours revives seeking motivation. Therefore, a seemingly simple environment appears to offer a rich medium of EPEs.
2.6. Seeking motivation in relation to arousal and stress
Seeking motivation could be described as arousal. The term “arousal” has been used to describe states that coordinate central and peripheral activities in the face of stressful conditions, including actual and foreseen challenges. Stress is often discussed, categorically, as the condition that leads to unwanted physiological states (e.g., high blood pressure, muscle tension, and weakened immune system), negative emotion (e.g., anxiety and depression), and substance use disorders (e. g., alcohol abuse and excessive tobacco smoking). In the present paper, we view stress as existing on a continuum that can be influenced by the perception of novelty and uncertainty. Take, for example, hiking on a trail in the woods. Just the physical act of hiking itself induces some level of stress for a person. If the hiking trail is novel to the hiker, the activity imposes an additional level of stress, accompanied by a higher level of seeking motivation. Furthermore, an encounter with a wild bear while hiking on the trail would be considered extremely stressful and could elicit fight-or-flight behavior, which does not involve seeking motivation. The continuum perspective seems to provide a comprehensive view for understanding behavior. Thus, manageable stressors, those located away from the extremes of this continuum, are healthy conditions that drive active seeking motivation.
The concept of arousal is too broad to discuss seeking processes, as the arousal concept is often used in describing the wakefulness state in contrast to the sleep state and the state associated with extreme stress. For example, how does arousal influence feeding or grooming behaviors? What do animals do when arousal is completely diminished? As you will see below, the seeking motivation concept provides more specific answers to these questions. Therefore, our discussion centers on the concept of seeking motivation, a specific form of arousal that may be alternatively labeled as seeking arousal.
2.7. Artificial, intracranial manipulations and seeking motivation
The neural substrates of seeking motivation can be exogenously activated by intracranial manipulations. Particularly, the procedures referred to as intracranial self-administration (ICSA) and intracranial self-stimulation (ICSS) are useful in demonstrating that stimulated neural elements are involved in seeking-behavior processes. In ICSA, animals learn to produce seeking responses to administer neuroactive chemicals intracranially into discrete brain regions (Ikemoto and Wise, 2004), and in ICSS, animals similarly learn to respond for intracranial electrical currents delivered into discrete brain sites (Milner, 1989) or photostimulation that can activate or inhibit specific neural populations with optogenetic procedures (Ilango et al., 2014b). These phenomena can be viewed as the products of activating information-seeking process and then goal-seeking process, or goal-seeking process alone (Fig. 1A).
When Olds and Milner (1954) initially discovered the ICSS phenomenon, such stimulation was thought to induce “pleasure” (Olds, 1956) because of its capacity to reinforce behavior. However, additional observations led to an alternative view. The same manipulations that support ICSS or ICSA instigate exploratory behavioral activities, including increased forward locomotion and sniffing (Ikemoto and Panksepp, 1994; Panksepp, 1981a). In addition, such manipulations augment the seeking responses reinforced by other rewards (Gallistel, 1969; Ikemoto, 2010). Moreover, abused substances, especially psychomotor stimulants, produce not only reinforcing effects, but also behavioral activation effects, and these effects are attributed to their capacity to activate midbrain DA neurons (Wise and Bozarth, 1987). These observations suggest that the manipulations supporting ICSS or ICSA activate neural networks regulating seeking behaviors (Ikemoto, 2010; Ikemoto and Panksepp, 1999; Panksepp, 1982). Therefore, the present paper considers the ICSS and ICSA as the phenomena that arise from the activation of seeking motivation, which instigates and reinforces seeking behavior.
2.8. Proposed neural substrates
We have discussed that seeking motivation coordinates and regulates (A) cognitive processes for attention, perception, acquisition, storage, and consolidation concerning environmental information and (B) seeking behavior. We propose two sets of neural pathways that play a fundamental role in seeking motivation (Fig. 2), and central to each pathway is the supramammillary region (SuM). We argue that function (A) depends on (1) SuM neurons (SuMn) projecting to the hippocampal formation (Hipp) (SuMn-to-Hipp) and (2) SuMn projecting to the medial septal area (MS) and (3) then to the Hipp (SuMn-to-MS-to-Hipp) (Fig. 2A). In addition, function (B) depends on (1) glutamatergic (Glu) neurons in the SuM projecting to the MS (SuMnGlu-to-MS), (2) glutamatergic MS neurons projecting to the ventral tegmental area (VTA) (MSnGlu-to-VTA), and (3) VTA DA neurons projecting to the ventral striatum (VStr) (VTAnDA-to-VStr) (Fig. 2B). Note that we do not claim that these pathways are exclusively involved in said functions.
3. Supramammillary circuits in seeking motivation
The SuM is a posterior hypothalamic structure located just dorsal to the mammillary body (MB) and anterior to the VTA (Fig. 3). The SuM extensively projects to the septohippocampal system (Vertes, 1992) (Fig. 4). The extensive septohippocampal projections of the SuM underscore the importance of the SuM in regulating the septohippocampal system.
Fig. 3.

The supramammillo-septal projection. The experiment described in (A) produced results (B, C) suggesting that the SuM-to-MS pathway is primarily glutamatergic. A. The AAV-retro-Nuc-flox(mCherry)-eGFP retrogradely labels Cre-containing soma with eGFP while Cre-negative soma with mCherry. vGlut2-Cre mice received vector injections into the MS. B. SuM neurons are labeled with eGFP, but not mCherry. C. The same injection produced both eGFP and mCherry labeling at the injection sites and in the medial horizontal limb of the DB, which projects to the MS. Abbreviations: cc, corpus collosum; fr, fasciculus retroflexus; LS, lateral septal nuclei; MM, medial mammillary nucleus; ml, medial lemniscus; MS, medial septal nucleus; PH, posterior hypothalamic nucleus; SHi, septohippocampal nucleus; SuM, supramammillary region; VStr, ventral striatum. Scale bars = 200 μm. Figure is adapted from Kesner et al. (2021).
Fig. 4.

The hippocampal formation and connectivity. Abbreviations: EC, entorhinal cortex; DG, dentate gyrus; MS-DB, medial septal nucleus-diagonal band of Broca; Sub, subiculum; SuM, supramammillary region.
We serendipitously discovered the role of the SuM in reward-seeking behavior. While investigating the functional heterogeneity of VTAnDA in seeking behavior using ICSA procedures, we found that rats self-administer infusions of AMPA directly into the SuM (Ikemoto et al., 2004). This structure had received little attention with respect to reward or reward-seeking behavior, although a study in the 1950 s showed that electrical stimulation of this structure supports ICSS in rats (Olds and Olds, 1958). Our group found that intra-SuM infusions of other excitatory pharmacological agents reinforce behavior, including the GABAA receptor antagonist picrotoxin (Ikemoto, 2005), and nicotine (Ikemoto et al., 2006) – a key substance contained in tobacco that is reinforcing and widely abused. The latter finding implicates the SuM in nicotine addiction. We also found that DA receptor blockade readily attenuates self-administration of intra-SuM AMPA and that intra-SuM AMPA increases extracellular concentrations of DA in the VStr (Ikemoto et al., 2004). We initially had no explanation for how the stimulation of SuM AMPA receptors resulted in increased VStr DA release since we were unaware of the structural relationship between SuM neurons (SuMn) and VTAnDA-to-VStr at that time. These initial findings led us to investigate the circuit mechanisms through which the stimulation of SuMn reinforces seeking behavior.
Summarized below are the findings from our recent study (Kesner et al., 2021) supporting that the SuMnGlu-to-MSnGlu-to-VTAnDA circuit participates in seeking motivation (Fig. 2B):
Optogenetic photostimulation of SuMnGlu, but not GABA or DA neurons, expressing channelrhodopsin-2 (ChR2), an excitatory opsin, reinforces seeking behavior.
Stimulation of SuMnGlu-to-MS reinforces seeking behavior.
Rats learn to self-administer AMPA along the midline of the septal area, but not the diagonal band of Broca (DB), suggesting that the excitation of neurons in the MS or its vicinity reinforces seeking behavior.
Importantly, noncontingent injections of intra-septal AMPA increase seeking behavior reinforced by the 1-s presentation of a salient visual stimulus (bright light), suggesting that increased Glu transmission in the MS increases information-seeking behavior.
Stimulation of MSnGlu, but not GABA or cholinergic neurons, reinforces seeking behavior.
SuMnGlu monosynaptically excite MSnGlu.
Stimulation of SuMnGlu-to-MS increases the signals of GCaMP (a genetically encoded protein that increases fluorescence upon Ca2+binding) expressed selectively in VTAnDA, suggesting that SuMnGlu-to-MS modulates VTAnDA.
Stimulation of MSnGlu or MSnGlu-to-VTA increases the signals of VStr dLight (a protein that fluoresces upon binding DA), indicating that MSnGlu-to-VTA modulates DA neuron activity.
Pretreatment with DA receptor antagonists decreases seeking behavior reinforced by intra-septal AMPA injections, or by the stimulation of SuMnGlu-to-MS.
MSnGlu project to the VTA (Fuhrmann et al., 2015) and form synaptic contacts with VTAn. The excitation of MSnGlu-to-VTA reinforces seeking behavior. Moreover, the levels of seeking behavior reinforced by stimulation of MSnGlu-to-VTA positively correlate with VStr dLight signals driven by such stimulation.
These results suggest that SuMnGlu-to-MS modulate MSnGlu-to-VTA, which in turn modulate VTAnDA-to-VStr and reinforce seeking behavior. Therefore, we suggest that the SuMnGlu-to-MSnGlu-to-VTAnDA circuit participates in regulating seeking motivation. Importantly, we argue that this circuit is particularly important in coordinating both information-seeking and goal-seeking behaviors.
Note that rodents engage in compulsive-seeking behavior reinforced by brain stimulation, i.e., ICSS, in a stable environment for hours. This observation suggests that ICSS does not depend on acquiring new information, as indicated in Fig. 1A. Therefore, we suggest that the two subsystems can work independently while working together in seeking behavior as (A) a cognitive subsystem and (B) a behavioral subsystem, and that the activation of the motivational subsystem is sufficient to support ICSS without EPEs generated by the cognitive subsystem (Figs. 1 and 2). Below we expand upon the idea that the cognitive aspect of information-seeking behavior is importantly assisted by the hippocampal formation, while the behavioral aspect involves the midbrain DA system, each with functional connectivity to the supramammillo-septal circuit.
4. Hippocampal formation and seeking behavior
The hippocampal formation is important for processing environmental information for perception, memory, and prediction (Zeidman and Maguire, 2016). It receives highly processed, multimodal sensory information from cortical regions, including sensory information from olfactory, visual, and auditory cortices, goal-related information from the prefrontal cortex (PFC), and emotion-related information from the amygdala (Amaral and Lavenex, 2007). Thus, it appears to be suited for integrating and comparing incoming information. In general, cortical input reaches the hippocampal formation through the entorhinal cortex, then is further processed by the dentate gyrus, CA3, CA1, subiculum, and back to the entorhinal cortex, in this sequence (Fig. 4) (Amaral and Lavenex, 2007). Although the intrinsic hippocampal circuitry is largely unidirectional, it contains both serial and parallel projections. Nearly all regions of the hippocampal formation receive input from the MS-DB and the SuM (Fig. 4).
Such organization allows the hippocampal formation to integrate multimodal information (Behrens et al., 2018; Gray and McNaughton, 2003; Whishaw and Wallace, 2003) for cognitive mapping (O’Keefe and Nadel, 1978), episodic memory (what, where, and when information for encoding, storing, consolidating, and recalling) (Eichenbaum, 2017; Rolls, 2013), and the detection of novelty (Kumaran and Maguire, 2007, 2009). In addition, it is important in imagining the future, which involves recalling past events and remapping and realigning information for predicting future outcomes (Cheung et al., 2019; Julian and Doeller, 2021; Okuda et al., 2003; Rigoli et al., 2019; Schacter et al., 2012). Such processes must be critical for the generation of EPEs. Thus, the hippocampal formation is an important cognitive component that guides information- and goal-seeking behaviors.
Note that EPEs occur at the levels of synapses, microcircuits, and macrocircuits and that the hippocampal formation is important in high-order EPEs as its connectivity suggests. High-order EPEs depend on the integration of information coming from multiple brain regions. It is particularly important for so-called associative novelty. O’Keefe and Nadel (1978) offered a memorable example of associative memory: “My wife was found in my bed with my best friend.” Novelty does not lie in wife, bed, or best friend, but the combination of the three.
Seeking-related operations of the hippocampal formation are reflected by the occurrence of high-frequency hippocampal theta oscillations (HTO), which are prominent during environmental interaction but not consummatory behaviors (Buzsaki, 2002; Whishaw and Vanderwolf, 1973). HTO reflect essential hippocampal operations of acquiring environmental information for cognitive maps and episodic memory (Buzsaki and Moser, 2013; Eichenbaum et al., 1999; McNaughton and Corr, 2018; Squire, 1992). HTO and related functions are modulated by both MS-DB and SuM, as discussed below.
5. MS-DB and seeking behavior
5.1. MS-DB neurons respond to saliency and uncertainty
Accumulating evidence suggests that MS-DB neurons play important roles in the cognitive aspect of information-seeking behavior. First, MS-DB neurons critically modulate HTO, as the disruption of MS-DB activity results in diminishing HTO (Winson, 1978). Consistently, the disruption of MS-DB produces deficits in spatial navigation and spatial and episodic memory (Givens and Olton, 1994; Hagan et al., 1988; M’Harzi and Jarrard, 1992; Okada and Okaichi, 2010). Second, MS-DB neurons play an important role in encoding environmental salience. MS-DB neurons respond to salient sensory stimuli in rodents, including visual, auditory, and somatosensory modalities (Hayat and Feldman, 1974; Segal, 1974; Zhang et al., 2018b). Particularly, MSnGlu respond to loud noise (Zhang et al., 2018b). Similarly, MS-DB neurons respond to salient place contexts associated with rewardclassic in monkeys (Kita et al., 1995; Nishijo et al., 1997). Third, MS-DB neurons respond to uncertainty. Some septal neurons of monkeys increase firing rates in response to conditioned stimuli (CS), signaling uncertain reward outcomes (Monosov and Hikosaka, 2013; Monosov et al., 2015). Specifically, a group of neurons located in the anteromediodorsal part of the septal area display increased activity when reward outcomes are uncertain but not when reward outcomes are certain (Monosov and Hikosaka, 2013). Another population in the MS-DB responds more diversely to stimuli indicating reward uncertainty and certain and uncertain aversive outcomes (Monosov et al., 2015). Therefore, considerable evidence supports the idea that MS-DB neurons are important in the cognitive aspect of information-seeking behavior, including the detection and anticipation of EPEs.
5.2. MSn coordinate behavior and cognition in information seeking
As discussed above, HTO are an observable metric reflecting essential activities in seeking behavior in rodents and, thus, are useful windows into the mechanisms underlying seeking motivation. Another observable variable useful in understanding possible neural mechanisms for seeking motivation is locomotor activity, which enables the host to travel in space for environmental information. In fact, the activities of both locomotor activity and HTO are coordinated; MSnGlu modulate both locomotor activity and HTO. First, locomotor speed positively correlates with both the frequency and power of HTO, and HTO increase just before the onset of locomotion (Bender et al., 2015; Green and Arduini, 1954; McFarland et al., 1975; Morris and Hagan, 1983; Vandecasteele et al., 2014; Vanderwolf, 1969; Whishaw and Vanderwolf, 1973). Second, MSnGlu activity increases just before the onset of locomotion and positively correlates with locomotor speed (Fuhrmann et al., 2015). Third, selective optogenetic stimulation of MSnGlu at theta frequencies is sufficient to induce HTO, initiate locomotor activity, and increase locomotor speed in a frequency-dependent manner (Fuhrmann et al., 2015). MSnGlu coordinate locomotion and HTO by regulating the activities of cholinergic and GABA neurons within the MS (Robinson et al., 2016). Together, these findings suggest that these MS-DB cholinergic, GABA, and Glu neurons form a set of local circuits to coordinate HTO and locomotion.
Note that locomotor activity and HTO appear to be modulated by different MSn projections. The projections from the MS to the hippocampus modulate HTO, thereby information integration, as discussed above. In particular, MSnGlu send speed-related information to the medial entorhinal cortex of the hippocampal formation (Justus et al., 2017). For the regulation of movements, MS projections to sub-cortical areas appear to be important. The stimulation of MSnGlu projecting to the preoptic area, but not to the hippocampus, increases locomotor activity (Zhang et al., 2018a). In addition, the MSnGlu-to-VTAnDA circuit likely contributes to locomotor activity as this circuit modulates the activity of VTAnDA-to-VStr (Kesner et al., 2021), a well-established circuit for controlling locomotor activity and motivation (Ikemoto, 2007; Ikemoto and Panksepp, 1999; Wise, 2004).
Such coordinating role of MS is consistent with the aforementioned finding that the administration of AMPA into the MS increases seeking responses reinforced by the presentation of salient visual stimuli in rats (Kesner et al., 2021).
6. SuMn and seeking behavior
SuMn are important in arousal, salience, and hippocampal-dependent information processing. Synthesis of these functions is characterized best in seeking motivation. In particular, SuMn coordinates the activities of other structures that modulate behavior and cognition during information seeking. The SuM interacts closely with the septohippocampal complex in this role.
6.1. SuMnGlu modulate HTO via the MS-DB
Although SuMn lesions disrupt HTO only in limited conditions (McNaughton et al., 1995; Pan and McNaughton, 2004; Thinschmidt et al., 1995), the SuM is involved in generating HTO. SuMn transform tonic signals from brainstem regions to a rhythmic signal; this signal is delivered to the MS, which, in turn, relays these signals to the hippocampus (Bland and Oddie, 1998; Kirk, 1998; Kocsis and Vertes, 1994; Pan and McNaughton, 2004). First, inactivation of regions rostral to the SuM modifies the amplitude but not the frequency of reticular-elicited HTO, while inactivation of regions caudal to the SuM affects both frequency and amplitude (Kirk and McNaughton, 1993). Second, the stimulation of SuMn drives HTO (Bland and Oddie, 1998; Kirk, 1998; Pan and McNaughton, 2004; Pedersen et al., 2017; Vertes and Kocsis, 1997), while the inhibition of SuMn decreases the frequency of HTO in freely moving rats (Pedersen et al., 2017; Saji et al., 2000). Moreover, lesions of SuMn produce behavioral deficits similar to those observed from hippocampal lesions (Pan and McNaughton, 2002).
6.2. SuMnGlu modulate vigilance states and locomotor activity
Chemogenetic stimulation of SuMnGlu increases wake time and HTO, whereas chemogenetic inhibition of SuMnGlu decreases wake time and the HTO during rapid eye movement (REM) sleep (Pedersen et al., 2017). These effects on vigilance states, i.e., wakefulness and REM sleep (a sleep state where brain activity is similar to active wakefulness) are also consistent with the seeking motivation hypothesis. A lack of seeking motivation results in diminished interest in interacting with the environment, which promotes resting behaviors, including sleep. By contrast, increased seeking motivation is accompanied by increased environmental activities, which prolong wakefulness. In addition, SuMn inhibition induced by microinjections of GABA receptor agonists decreases locomotor activity (Kesner et al., 2021; Ma and Leung, 2007). Conversely, SuMn activation induced by local injections of the GABAA receptor antagonist picrotoxin robustly increases locomotor activity (Shin and Ikemoto, 2010). A more recent study showed that the firing rates of SuMN are highly correlated with locomotor speed in mice and that optogenetic excitation and inhibition of SuMN initiate and diminish locomotor activity, respectively (Farrell et al., 2021).
6.3. The activation of SuMn is associated with novelty, uncertainty, and seeking behavior
Research involving c-Fos as a marker for neural activation suggests that SuMn are activated by novel stimuli and uncertainty. c-Fos is strongly induced in SuMn by exposure to the following conditions: novel environments (Wirtshafter et al., 1998); taste cues associated with sickness (Yasoshima et al., 2005); contexts paired with a foot shock (Beck and Fibiger, 1995); odors associated with predators (Day et al., 2004); swim and restraint stress (Cullinan et al., 1995); contexts that allow hungry rats to anticipate food (Le May et al., 2019); and appetitive tasks that require spatial working memory (Vann et al., 2000). As such conditions demand attention and, possibly, actions for further information, these findings are consistent with the idea that SuMn regulate information-seeking processes.
In addition, a link has been established between SuM c-Fos and seeking behavior. Lateral hypothalamic stimulation (Arvanitogiannis et al., 1997) and intra-VTA carbachol injections (Ikemoto et al., 2003) induce c-Fos in SuMn. These manipulations reinforce seeking behavior (Arvanitogiannis et al., 1997; Ikemoto and Wise, 2002) and instigate sniffing and locomotor activity (Ikemoto and Panksepp, 1994; Ikemoto et al., 2003; Miliaressis and LeMoal, 1976), which may reflect seeking motivation. Similarly, intra-SuM picrotoxin injections induce c-Fos, instigate locomotor activity, and reinforce seeking behavior (Ikemoto, 2005; Shin and Ikemoto, 2010).
While SuMn play an important role in hippocampal functions via the MS-DB, a recent report demonstrates that SuMn modulate novelty detection through direct hippocampal projections (Chen et al., 2020). In this study, SuMn projecting to the dentate gyrus increase activity in response to contextual novelty more than social novelty, while SuMn projecting to the CA2 increase activity in response to social novelty more than contextual novelty. Moreover, optogenetic manipulations of SuM-to-dentate gyrus and -CA2 projections alter behavioral responses to contextual and social novelties, respectively (Chen et al., 2020). The results of the study have two notable implications. First, while the unidirectional intrinsic circuitry (Fig. 4) suggests sequential processing of information arriving at the entorhinal cortex, the hippocampal formation integrates qualitatively distinct information depending on the region. Second, different SuM-to-hippocampal pathways are recruited depending on the type of novel stimuli. It is important to examine whether similar functional distinction exists in SuM-to-MS-DB pathways, because, as discussed above, MS-DB-to-hippocampus, -to-preoptic area, and -to-VTA are involved in HTO, locomotor activity, and seeking behavior, respectively.
6.4. Unlike VTAnDA, SuMn do not selectively respond to rewardclassic or CSreward
DA neuron activity is known to indicate RPEs, and this property of DA neurons plays a critical role in reinforcement learning (Steinberg et al., 2013). Because the stimulation of SuMn reinforces seeking behavior, it is of interest to determine whether SuMn display RPEs or similarly encoded responses. We investigated whether SuMn selectively respond to rewardclassic and CSreward, using electrophysiology recording procedures in freely-moving mice (Kesner et al., 2021). Mice were trained to lever-press for a sucrose reward (Fig. 5A). After a lever press, mice were presented with one of two tones: one tone signaled the availability of the sucrose solution (CSreward), while the other had no programmed consequence (CSno-reward). The results were surprising, as we expected to observe a significant proportion of SuM neurons responding to CSreward: Only a small population of SuMn differentially respond between CSreward and CSno-reward (Fig. 5B).
Fig. 5.

SuM neurons do not respond selectively between CSreward (CS+) and CSno-reward (CS-), but decrease activity during consummatory behavior. A. Diagram showing operant sucrose-seeking procedure with cue discrimination. B-D. Heatmaps of normalized firing rates of SuM neurons during the sucrose-seeking procedure. SuM neuron activities are presented with the reference point (time 0) set at the onset of CS+ or CS- events (B), nose poke entry (C), and nose poke withdrawing (D). Neurons are arranged in the same sequence from top to bottom across all heatmaps. This figure is adapted from Kesner et al. (2021).
Similarly, another study found that SuMn do not display predictive activity related to decision making for rewardclassic (Ito et al., 2018). Rats trained to run on a T-maze had to choose between two arms alternatively for a rewardclassic. SuM activity did not display any information concerning the prediction of choice between the two arms. However, two other structures, the medial prefrontal cortex (mPFC) and reuniens thalamic nucleus, did display such information. Interestingly, inhibition of SuMn decreased the coherence of activity between these two structures in relation to HTO. Given that SuMn directly project to these structures (Vertes, 1992), this study suggests that during environmental interaction, SuMn do not selectively respond to rewardclassic but that SuMn coordinate other brain structures’ activity with HTO.
The results of the Ito et al. (2018) and the Kesner et al. (2021) studies are consistent with the c-Fos findings that SuMn do not selectively respond to rewardclassic-related stimuli but more broadly to novelty/salience and uncertainty (i.e., the detection and anticipation of EPEs). These discoveries are consistent with the striking single-unit recording observation that SuMn uniformly decrease activity during rewardclassic intake (Fig. 5C,D) (Kesner et al., 2021) when the host pays little attention to the environment.
6.5. The inhibition of SuMn disrupts seeking behavior, but not rewardclassic-taking behavior
Additional results from the Kesner et al. (2021) study support a role for SuMn during seeking behavior. Mice were trained on the same behavioral procedure used for the electrophysiological experiment discussed above. Before testing, mice received an injection of a mixture of the GABAA and GABAB receptor agonists, muscimol and baclofen, into the SuM, to inhibit SuMn. This manipulation disrupted the approach toward the sucrose spout, discrimination between CSreward and CSno-reward, and decreased rewards earned and locomotor activity. Although these behavioral deficits are consistent with induced drowsiness or the loss of appetite, the results of our follow-up experiment refute these explanations.
When animals were tested on sucrose consumption that did not depend on instrumental responding, again they displayed decreased locomotor activity, but not the amount of sucrose consumed during the 30-min test. A close examination of the consummatory behavior revealed that when mice received intra-SuM muscimol and baclofen injections, they consumed more sucrose during the first 10 min than when they received saline injections. Because the muscimol/baclofen injection was administered directly into the SuM, the inhibitory action must have been effective immediately, and the increase in sucrose consumption at the beginning of the session argues strongly against the drowsiness or the appetite loss hypotheses. Instead, it is important to consider rodents’ exploratory behavior in incompletely familiar environments. As alluded to above (subsection 2.5), even though rodents explored the test chamber many times in prior days, upon introduction to the test chamber for the day, rodents engage in exploratory behavior about the chamber for the first few minutes before settling down. Therefore, the inhibition of SuMn must have selectively decreased seeking motivation, which interferes with consummatory motivation; the attenuated seeking motivation may have increased the tendency to engage in sucrose-taking behavior over exploratory behavior. These results support that SuM inhibition decreases seeking motivation.
7. VTAnDA and seeking behavior
Midbrain DA neurons play a critical role during voluntary behavior, i.e., cortex-driven approach and avoidance, by modulating thalamocortico-basal ganglia processes (Alexander et al., 1986; Haber, 2016; Ikemoto et al., 2015). Particularly, VTAnDA-to-VStr are known to modulate stimulus-stimulus and stimulus-response parings and vigor of goal-seeking behavior (Ikemoto, 2007; Ikemoto and Panksepp, 1999). Below, we first discuss the heterogenous roles of midbrain DA neurons and their role in salient and novel stimuli. Then we propose a view that VTA DA neurons, which are known to respond to rewardclassic and CSreward and indicate RPEs, are better characterized to respond to any type of reward (i.e., rewardsuniversal) and indicate EPEs (Fig. 1).
7.1. Heterogenous responses of midbrain DA neurons
Midbrain DA neurons have been extensively investigated during reinforcement learning, a process where animals use RPEs to acquire rewardclassic or avoid punishment (Schultz et al., 1997). However, midbrain DA neurons respond to the environment in a heterogeneous manner. We briefly describe the heterogenous responses of DA neurons. First, some DA neurons respond to movements (Barter et al., 2015; Coddington and Dudman, 2018; Dodson et al., 2016; Howe and Dombeck, 2016; Wang and Tsien, 2011a). Second, a recent study suggests that the heterogeneity of DA neurons emerges as tasks get more complex (Engelhard et al., 2019). Important determining factors considering DA neuron heterogeneity include location and connectivity (Parker et al., 2016). Noxious stimuli consistently excite DA neurons located in the dorsolateral part of the substantia nigra pars compacta, which projects to the dorsolateral striatum or the tail of the striatum (Lerner et al., 2015; Matsumoto and Hikosaka, 2009; Menegas et al., 2017; Schultz and Romo, 1987). DA neurons located outside the dorsolateral part respond variably to aversive stimuli (Brischoux et al., 2009; Coizet et al., 2006; Gore et al., 2014; Guarraci and Kapp, 1999; Jo et al., 2018; Mantz et al., 1989; Wang and Tsien, 2011b).
VTAnDA respond to both reward and aversive stimuli (Brischoux et al., 2009; Guarraci and Kapp, 1999; Jo et al., 2018; Wang and Tsien, 2011b). These differential responses are explained, in part, by their projection sites. Aversive stimuli excite 65% of VTAnDA projecting to the mPFC (Mantz et al., 1989). In addition to mPFC projecting neurons, VTAnDA projecting to the ventromedial and ventrolateral subregions of the nucleus accumbens respond to aversive stimuli (Badrinarayan et al., 2012; Yuan, 2019). However, the same subregions respond to reward as well. Therefore, DA neurons projecting these subregions appear to be involved in salience rather than aversion. This idea based on terminal DA release data should be interpreted cautiously because of possible dissociation between DA release in the terminal regions and cell body firing (Mohebi et al., 2019).
7.2. What do phasic responses of DA neurons mean?: Beyond rewardclassic prediction error
Particularly relevant findings with respect to the present thesis are the following three types of DA neuron phasic responses: First, DA neurons respond to novel sensory stimuli that have not been paired with rewardclassic (Horvitz, 2000; Redgrave and Gurney, 2006; Schultz, 1998; Wang and Tsien, 2011b). Second, DA neurons increase firing rates upon stimuli predicting advanced information about future rewardclassic (Bromberg-Martin and Hikosaka, 2009). That is, DA neurons respond to mere information about potential rewardclassic. These stimuli are salient because of their association with CSreward and CSno-reward. Third, DA neurons respond to uncertain reward deliveries. DA neurons display preferred responses toward CS signaling riskier delivery over safer delivery of rewardclassic (Stauffer et al., 2014). Moreover, DA neurons display ramping activity during CS that signal uncertain delivery of rewardclassic until the potential time of rewardclassic. This ramping activity is absent for a CS signaling either 100% or 0% delivery of rewardclassic (Fiorillo et al., 2003). Note that the ramping activity is not considered a phasic response, but rather a tonic activity change. In sum, DA neurons respond to not only rewardclassic and CSreward, but also to other rewards, including novel sensory stimuli, mere information, and uncertainty.
In addition, DA neurons can respond to information concerning non-rewardclassic. Active avoidance behavior is suggested to be a form of approach behavior – approach toward “safety”, and mediated by, in part, DA neurons: Increased DA activity instigates and reinforces active avoidance, while decreased DA activity diminishes active avoidance (Ikemoto and Panksepp, 1999). Recent evidence provides stronger support for this view. The presentation of CS paired with a foot shock (CSaversive) instigates freezing behavior, i.e., a passive avoidance. However, repeated trials may lead to the emergence of active avoidance behavior. This transformation in behavior coincides with the change in DA release in the accumbens core. When animals display passive avoidance behavior (i.e., freezing), core DA levels do not increase upon CSaversive; when the same animals later display active avoidance behavior, core DA levels increase upon CSaversive (Gentry et al., 2016; Oleson et al., 2012). Thus, these studies suggest that DA neurons are involved in transforming passive avoidance behavior into active avoidance behavior and support the view that DA neurons respond to information concerning non-rewardclassic, to regulate active seeking behavior. Note that because these studies measured DA release in the accumbens core, it is not clear whether the presentation of CSaversive alters DA neuron firing rates as avoidance behavior is transformed from passive to active.
7.3. DA neurons display prediction errors of salient stimuli
We suggest that phasic responses of DA neurons are better characterized as prediction errors of rewardsuniversal, which includes rewardsclassic, CSreward, and novel or salient stimuli associated with information seeking motivation. Indeed, phasic DA signal as generalized prediction error has been suggested (Gardner et al., 2018), and internal models such as that described in Fig. 1 have been proposed to explain such activity (Suri, 2001). Let us first discuss how arbitrary the concept of rewardsclassic is in terms of RPE. It is difficult to determine the exact moment of the receipt of rewardsclassic and to distinguish CSreward from actual rewardsclassic (Wise, 2002). For example, when do animals detect a food reward? When they see it, touch it, taste it, or finally digest it? The recognition of a rewardclassic appears to consist of a series of stimulus-stimulus sequences with adaptive responding during each step: Sight instigates reaching out for it; touch instigates initial ingestion; and taste instigates chewing and swallowing it, which gradually provides energy as it gets digested. Therefore, defining the absolute moment of the receipt of rewardsclassic may be arbitrary. An important implication of this analysis is that RPEs indicate errors for the magnitude and timing of rewardsuniversal. Speaking generally, DA neurons respond when a potentially important stimulus (i.e., rewardsuniversal) occurs unpredictably, and if something else can predict the unpredicted important stimulus, then DA neurons begin responding to that thing.
This function of DA neurons is sufficient to explain the biphasic DA-neuron response, which consists of a fast component signaling mere saliency of sensory stimuli and a slow component associated with rewardclassic (Schultz, 2016). Although the two components of DA signals have been suggested to be functionally distinct (Schultz, 2016), both components appear to signal the same, rewardsuniversal, which could guide goal-seeking behavior. Accordingly, the first component indicates DA neurons’ capacity to respond to salient stimuli, i.e., rewardsuniversal that have not been predicted or identified in reference to internal models; the second component indicates CSreward, i.e., rewardsuniversal that are known as the predictors of potentially important stimuli by internal models, but not predicted. The biphasic DA neuron response is explained by computational time required for perception. The perception of single-modality sensory stimuli requires fewer neural relays and is thereby processed faster than CSreward. For example, novel visual stimuli may evoke excitation of DA neurons through signals from the superior colliculus (Redgrave and Gurney, 2006), which receives visual information directly from the retina (Sefton and Harvey, 2004). On the other hand, conditioned visual stimuli, by definition, depend on the retrieval of prior experience, which is represented by bits and pieces of memory stored in multiple cortical regions (Bar, 2004); therefore, such retrieval and synthesis delay the perception of CSreward. Although novel sensory stimuli can affect postsynaptic neurons more quickly than CSreward, there is no evidence that the apparent differences in timing are interpreted differently by postsynaptic target neurons. Instead, the first component of the biphasic response may have the same function as the second. As mentioned above, phasic DA responses can be triggered by the presentation of novel sensory stimuli, including visual stimuli. Salient visual stimuli, which are not associated with rewardclassic, can instigate and reinforce seeking behavior, a process that depends on VTAnDA-to-VStr (Shin et al., 2010). Therefore, post-synaptic cells may not be affected differently by the two components of the biphasic response, and this analysis supports the idea that phasic responses of DA neurons are better characterized as prediction errors of rewardsuniversal, which include novel stimuli and rewardsclassic.
7.4. DA neurons modulate behavioral responses instigated by novelty and uncertainty
The aforementioned findings are largely correlational in nature, showing strong relationships between phasic increase in DA neuron activity and the occurrences of salient stimuli and uncertainty, although evidence for the latter is thin at this time (Fiorillo et al., 2003; Stauffer et al., 2014). However, accumulating evidence suggests causal role of DA neurons in modulating seeking behavior instigated by novel stimuli and uncertainty. Pharmacological manipulations used to increase DA activity enhance seeking responses for CSreward by increasing the salience of CSreward (Berridge and Robinson, 1998), and such action of DA takes place in the VStr (Ikemoto and Panksepp, 1999). Notably, VStr DA modulates seeking motivation for salient stimuli. As alluded above, focal injections of amphetamine, which stimulates the release and blocks the uptake of dopamine, into the VStr increase DA concentration and increase seeking responses for unconditioned, salient visual stimuli in a DA receptor type 1 (D1R)- and type 2 (D2R)-dependent manner (Shin et al., 2010). Consistently, systemic injections of the D2R antagonist haloperidol decrease preference for novel stimuli over familiar ones in rodents (Bardo et al., 1989). Moreover, the novelty-seeking personality trait correlates with VTA/substantia nigra activity in response to novel stimuli in humans (Krebs et al., 2009).
In addition, DA modulates seeking behavior under uncertainty. Administration of DA receptor agonists and antagonists can respectively decrease and increase seeking behavior under uncertainty. For one, the administration of pramipexole, a non-ergot D2R agonist used to treat Parkinson’s disease, increases pathological gambling behavior (i.e., a seeking behavior under uncertainty) in a treatment-dependent manner (Dodd et al., 2005; Driver-Dunckley et al., 2003). Similarly, the D2R agonist cabergoline decreases the subjective cost of responding to uncertain choices in healthy humans (Le Heron et al., 2020).
Uncertainty-driven behaviors observed during pharmacological manipulation of DA activity may be controlled by the striatum, particularly the VStr, and the PFC. Rodent research suggests that VStr DA activity appears to modulate seeking behavior under uncertainty (Piantadosi et al., 2021). Typical procedures compare two choices between a small, certain rewardclassic and a large, uncertain rewardclassic. Intra-VStr injections of D2R agonists, which inhibit D2R-expressing neurons, increase choices for large, uncertain reward outcomes (Zalocusky et al., 2016). Conversely, intra-VStr D2R antagonists, which activate D2R-expressing neurons, decrease large, uncertain choices (Cocker et al., 2012). Notably, one study demonstrated similar behavioral effects with pharmacological manipulations of VStr D1Rs, but not D2Rs (Stopper et al., 2013). Moreover, phasic activation of VStr D2R-expressing neurons immediately after choices decreases subsequent choices for large, uncertain reward outcomes (Zalocusky et al., 2016). By contrast, lesions or pharmacological inhibition of VStr neurons decrease responses for choices linked to large, uncertain outcomes (Cardinal and Howes, 2005; Floresco et al., 2018; Mai et al., 2015; Stopper and Floresco, 2011). In humans, polymorphisms within the D2R gene predict striatal D2R density and are associated with reinforcement learning using punishment, or “No-Go learning.” Individuals with two copies of the D2R T-allele have greater striatal D2 receptor density and a greater tendency to avoid choices linked with negative consequences (Frank et al., 2009, 2007). Moreover, individuals with two copies of the DARPP-32 T-allele, which affects D1R-mediated striatal synaptic plasticity, have a greater tendency to choose responses associated with positive consequences (Frank et al., 2009). Thus, while VStr DA plays an important role in behavior under uncertainty, the specific VStr subregions and the respective roles of D1Rs and D2Rs during behavior under uncertainty have not been established. Importantly, early evidence indicates that VStr DA signals interact with cortical inputs arriving from the PFC, amygdala, and hippocampus (Piantadosi et al., 2021).
Increasing evidence suggests that the PFC is involved in decision-making based on uncertainty levels in humans. Polymorphisms within the Catechol-O-methyltransferase gene affect DA levels in the PFC, which are important for decision-making based on whether other choices might produce better outcomes than the status quo. Carriers of a particular allele have a greater propensity for explorative decisions (Blanco et al., 2015; Frank et al., 2009, 2007; Kayser et al., 2015). Indeed, modeling PFC control of behavioral functions suggests its involvement in learning and forecasting the probable outcomes of actions (Alexander and Brown, 2011).
In summary, while DA responses to CSreward have been suggested to functionally differ from DA responses to novel stimuli, postsynaptic target neurons may not differentiate these two stimuli. Thus, in addition to rewardsclassic and CSreward, DA neurons play an important role in seeking behavior instigated by novel stimuli and uncertainty.
8. Differential roles of VTAnDA and SuMnGlu in seeking behavior
Let us clarify the different roles of VTAnDA and SuMnGlu. VTAnDA participate in producing seeking behavior toward goals by interacting with the thalamo-cortico-basal ganglia system (Ikemoto et al., 2015). VTAnDA appear to be essential in developing stimulus-stimulus and stimulus-response associations concerning goal-seeking and invigorating goal-seeking behavior (Ikemoto, 2007); thus, VTAnDA are overwhelmingly involved in seeking rewardsuniversal, toward which the animal should learn to approach. While little is known about the role of SuMnGlu in seeking behavior, the available evidence (Section 6) suggests that SuMnGlu modulate general environmental interaction instigated by novelty and uncertainty. In other words, SuMnGlu are responsible for coordinating cognitive and behavioral systems for environmental interaction. In this regard, SuMnGlu may be important in both active seeking and passive avoidance behavior and may respond to a broad range of EPEs compared to VTAnDA (Fig. 1B). As discussed in Section 3, the stimulation of the SuMnGlu-to-MSnGlu pathway instigates active seeking behavior, and we consider the MSnGlu-to-VTA pathway as an interface between SuMnGlu and VTAnDA that shapes information-seeking behavior into goal-seeking behavior where the goal is information. By contrast, stimulation of the SuMnGlu-to-paraventricular thalamic nucleus pathway may be involved in avoidance behavior as it elicits aversion (Kesner et al., 2021). Such an idea needs to be examined by future research.
9. Implications
9.1. Neural network of active seeking behavior
The aforementioned pathways involving SuMn-to-Hipp, SuMnGlu-to-MS-to-Hipp, and SuMnGlu-to-MSnGlu-to-VTAnDA imply that the flow of signals among these structures are hierarchical and linear; however, adaptive behavior most likely depends on dynamic interactions between many brain structures. For example, in addition to midbrain DA neurons and septal neurons (Monosov and Hikosaka, 2013; Monosov et al., 2015), evidence indicates that neurons in other brain regions also respond to uncertain rewards, including the anterior and posterior cingulate cortex, orbital frontal cortex, striatum, zona incertus, and lateral periaqueductal gray (McCoy and Platt, 2005; O’Neill and Schultz, 2010; White et al., 2019). Therefore, an extended network is most likely responsible for controlling seeking behavior.
In particular, a recent study identified that the mPFC-to-zona incerta (ZI)-to-periaqueductal gray (PAG) circuit modulates novelty-seeking behavior (Ahmadlou et al., 2021). In addition, the study’s findings suggest that there may be at least two levels of information-seeking behavior. The activation and deactivation of the mPFC-to-ZI pathway or ZI-to-PAG pathway respectively increase and decrease high, but not low, level information-seeking behavior, and the level of information-seeking behavior is correlated with the activation level of the neurons of these pathways (Ahmadlou et al., 2021). Note that the SuM-to-MS-to-VTA may be involved in both levels of information-seeking behavior since the inhibition of the neurons of these pathways disrupts mere locomotor activity. However, we suspect that the SuM-to-MS-to-VTA pathway belongs to the same seeking-behavior network as the mPFC-to-ZI-to-PAG pathway. First, the ZI projects to the SuM and VTA (Fig. 6). Second, the PAG projects to the SuM and VTA. Third, the mPFC is reciprocally linked with the SuM, VTA, and hippocampal formation (Fig. 6). It is of interest to investigate how these regions interact during seeking behavior.
9.2. Nicotine addiction
As mentioned in the introduction, micro-infusions of nicotine into the SuM reinforce behavior (Ikemoto et al., 2006). This finding raises the question of whether the SuMnGlu-to-MSnGlu-to-VTAnDA-to-VStr circuit is involved in nicotine reward and addiction. In addition, it will be important to determine whether nicotinic acetylcholine transmission within the SuM is involved in regulating behavioral responses to salient stimuli and uncertainty. To this end, nicotinic acetylcholine receptors containing the β2 subunit have been implicated in controlling behavior under uncertainty (Addicott et al., 2013; Naude et al., 2016) and, importantly, the β2 subunit is expressed in the SuM (Wada et al., 1989). Moreover, nicotine self-administration in rats diminishes with the removal of salient stimuli accompanied by nicotine infusions (Caggiula et al., 2001). In other words, the co-presentation of salient stimuli is essential in maintaining nicotine self-administration in rats. Such an observation begs the question of whether the SuMnGlu-to-MSnGlu--to-VTAnDA-to-VStr circuit can provide a mechanistic explanation for how salient stimuli promotes nicotine self-administration.
9.3. Psychopathology
The SuMnGlu-to-MSnGlu may be a critical subcortical mechanism for curiosity, which we believe is an active information-seeking behavior. In this light, dysregulation of the SuMnGlu-to-MSnGlu-to-VTAnDA-to-VStr circuit may lead to psychopathological conditions: Hyperactivity of this circuit may contribute to pathological behaviors under uncertainty, such as uncontrolled gambling. Conversely, hypoactivity of this circuit may lead to the motivational disorder known as abulia and apathy, characterized as the lack of motivation for voluntary behavior or active information-seeking behavior.
10. Coda
We have proposed two sets of neural pathways focusing on the SuM, arguing that SuMn-to-Hipp and SuMnGlu-to-MS-to-Hipp pathways are important in the cognitive aspect of information-seeking behavior, including the detection and anticipation of EPEs, and that the SuMnGlu-to-MSnGlu-to-VTAnDA pathway plays an important role in instigating and reinforcing seeking behavior. It is important to investigate how these pathways participate in different aspects of seeking behavior and motivation. We believe that this perspective paper provides a useful framework for future research on seeking motivation.
Acknowledgment
The US National Institutes of Health (NIH) Center on Compulsive Behaviors provided fellowships to A.J.K. and C.B.C. In addition, A.J.K. was supported by the Intramural Research Programs of the NIH National Institute of Alcohol Abuse and Alcoholism. The Intramural Research Programs of the NIH National Institute on Drug Abuse (DA000550, DA000575) supported C.B.C. and S.I. We would like to thank anonymous reviewers for their valuable comments on the previous versions of the manuscript.
Abbreviations:
- CSreward
conditioned stimulus associated with reward
- CSno-eward
conditioned stimulus associated with no programmed consequence
- CSaversive
conditioned stimulus associated with aversive stimulus
- ChR2
channelrhodopsin-2
- D1R
dopaminergic receptor type 1
- D2R
dopaminergic receptor type2
- DA
dopamine
- DB
diagonal band of Broca
- EPE
environment prediction error
- Glu
glutamatergic
- Hipp
hippocampal formation
- HTO
hippocampal theta oscillations
- ICSS
intracranial self-stimulation
- MS
medial septal area
- PAG
periaqueductal gray
- REM
rapid eye movement
- rewardclassic
classic reward
- rewarduniversal
any type of reward
- RPE
reward prediction error
- SuM
supramammillary region
- SuMN
supramammillary neurons
- VTANDA
ventral tegmental doopaminergic neurons
- VStr
ventral striatum
- ZI
zona incerta
References
- Addicott MA, Pearson JM, Wilson J, Platt ML, McClernon FJ, 2013. Smoking and the bandit: a preliminary study of smoker and nonsmoker differences in exploratory behavior measured with a multiarmed bandit task. Exp. Clin. Psychopharmacol. 21, 66–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ahmadlou M, Houba JHW, van Vierbergen JFM, Giannouli M, Gimenez G-A, van Weeghel C, Darbanfouladi M, Shirazi MY, Dziubek J, Kacem M, de Winter F, Heimel JA, 2021. A cell type–specific cortico-subcortical brain circuit for investigatory and novelty-seeking behavior. Science 372, eabe9681. [DOI] [PubMed] [Google Scholar]
- Alexander GE, DeLong MR, Strick PL, 1986. Parallel organization of functionally segregated circuits linking basal ganglia and cortex. Annu. Rev. Neurosci. 9, 357–381. [DOI] [PubMed] [Google Scholar]
- Alexander WH, Brown JW, 2011. Medial prefrontal cortex as an action-outcome predictor. Nat. Neurosci. 14, 1338–1344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amaral DG, Lavenex P, 2007. Hippocampal neuroanatomy. In: Andersen P, Morris RGM, Amaral DG, Bliss T, O’Keefe J (Eds.), The Hippocampus Book. Oxford University Press, Oxford, pp. 37–114. [Google Scholar]
- Anselme P, Robinson MJ, Berridge KC, 2013. Reward uncertainty enhances incentive salience attribution as sign-tracking. Behav. Brain Res. 238, 53–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arvanitogiannis A, Flores C, Shizgal P, 1997. Fos-like immunoreactivity in the caudal diencephalon and brainstem following lateral hypothalamic self-stimulation. Behav. Brain Res. 88, 275–279. [DOI] [PubMed] [Google Scholar]
- Badrinarayan A, Wescott SA, Vander Weele CM, Saunders BT, Couturier BE, Maren S, Aragona BJ, 2012. versive stimuli differentially modulate real-time dopamine transmission dynamics within the nucleus accumbens core and shell. J. Neurosci. 32, 15779–15790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bar M, 2004. Visual objects in context. Nat. Rev. Neurosci. 5, 617–629. [DOI] [PubMed] [Google Scholar]
- Bardo MT, Neisewander JL, Pierce RC, 1989. Novelty-induced place preference behavior in rats: effects of opiate and dopaminergic drugs. Pharmacol. Biochem Behav. 32, 683–689. [DOI] [PubMed] [Google Scholar]
- Barter JW, Li S, Lu D, Bartholomew RA, Rossi MA, Shoemaker CT, Salas-Meza D, Gaidis E, Yin HH, 2015. Beyond reward prediction errors: the role of dopamine in movement kinematics. Front. Integr. Neurosci. 9, 39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beck CHM, Fibiger HC, 1995. Conditioned fear-induced changes in behavior and in the expression of the immediate-early gene C-Fos – with and without Diazepam pretreatment. J. Neurosci. 15, 709–720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Behrens TEJ, Muller TH, Whittington JCR, Mark S, Baram AB, Stachenfeld KL, Kurth-Nelson Z, 2018. What is a cognitive map? Organizing knowledge for flexible behavior. Neuron 100, 490–509. [DOI] [PubMed] [Google Scholar]
- Beier KT, Steinberg EE, DeLoach KE, Xie S, Miyamichi K, Schwarz L, Gao XJ, Kremer EJ, Malenka RC, Luo L, 2015. Circuit architecture of VTA dopamine neurons revealed by systematic input-output mapping. Cell 162, 622–634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bender F, Gorbati M, Cadavieco MC, Denisova N, Gao X, Holman C, Korotkova T, Ponomarenko A, 2015. Theta oscillations regulate the speed of locomotion via a hippocampus to lateral septum pathway. Nat. Commun. 6, 8521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berlyne DE, 1950. Novelty and curiosity as determinants of exploratory behavior. Br. J. Psychol. 41, 68–80. [Google Scholar]
- Berlyne DE, 1970. Novelty, complexity, and hedonic value. Percept. Psychophys. 8, 279–286. [Google Scholar]
- Berlyne DE, Crozier JB, 1971. Effects of complexity and prechoice stimulation on exploratory choice. Percept. Psychophys. 10, 242–246. [Google Scholar]
- Berridge KC, Robinson TE, 1998. What is the role of dopamine in reward: Hedonic impact, reward learning, or incentive salience? Brain Res. Rev. 28, 309–369. [DOI] [PubMed] [Google Scholar]
- Blanco NJ, Love BC, Cooper JA, McGeary JE, Knopik VS, Maddox WT, 2015. A frontal dopamine system for reflective exploratory behavior. Neurobiol. Learn Mem. 123, 84–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bland BH, Oddie SD, 1998. Anatomical, electrophysiological and pharmacological studies of ascending brainstem hippocampal synchronizing pathways. Neurosci. Biobehav Rev. 22, 259–273. [DOI] [PubMed] [Google Scholar]
- Blodgett HC, 1929. The effect of the introduction of reward upon the maze performance of rats. University of California Publications in Psychology 4, 113–134. [Google Scholar]
- Brischoux F, Chakraborty S, Brierley DI, Ungless MA, 2009. Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli. Proc. Natl. Acad. Sci. USA 106, 4894–4899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bromberg-Martin ES, Hikosaka O, 2009. Midbrain dopamine neurons signal preference for advance information about upcoming rewards. Neuron 63, 119–126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Butler RA, 1953. Discrimination learning by rhesus monkeys to visual-exploration motivation. J. Comp. Physiol. Psychol. 46, 95–98. [DOI] [PubMed] [Google Scholar]
- Butler RA, Harlow HF, 1954. Persistence of visual exploration in monkeys. J. Comp. Physiol. Psychol. 47, 258–263. [DOI] [PubMed] [Google Scholar]
- Buzsaki G, 2002. Theta oscillations in the hippocampus. Neuron 33, 325–340. [DOI] [PubMed] [Google Scholar]
- Buzsaki G, Moser EI, 2013. Memory, navigation and theta rhythm in the hippocampal-entorhinal system. Nat. Neurosci. 16, 130–138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caggiula AR, Donny EC, White AR, Chaudhri N, Booth S, Gharib MA, Hoffman A, Perkins KA, Sved AF, 2001. Cue dependency of nicotine self-administration and smoking. Pharmacol. Biochem Behav. 70, 515–530. [DOI] [PubMed] [Google Scholar]
- Cardinal RN, Howes NJ, 2005. Effects of lesions of the nucleus accumbens core on choice between small certain rewards and large uncertain rewards in rats. BMC Neurosci. 6, 37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen S, He L, Huang AJY, Boehringer R, Robert V, Wintzer ME, Polygalov D, Weitemier AZ, Tao Y, Gu M, Middleton SJ, Namiki K, Hama H, Therreau L, Chevaleyre V, Hioki H, Miyawaki A, Piskorowski RA, McHugh TJ, 2020. A hypothalamic novelty signal modulates hippocampal memory. Nature 586, 270–274. [DOI] [PubMed] [Google Scholar]
- Cheung VKM, Harrison PMC, Meyer L, Pearce MT, Haynes JD, Koelsch S, 2019. Uncertainty and surprise jointly predict musical pleasure and amygdala, hippocampus, and auditory cortex activity. Curr. Biol. 29, 4084–4092 (e4084). [DOI] [PubMed] [Google Scholar]
- Cocker PJ, Dinelle K, Kornelson R, Sossi V, Winstanley CA, 2012. Irrational choice under uncertainty correlates with lower striatal D(2/3) receptor binding in rats. J. Neurosci. 32, 15450–15457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coddington LT, Dudman JT, 2018. The timing of action determines reward prediction signals in identified midbrain dopamine neurons. Nat. Neurosci. 21, 1563–1573. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coizet V, Dommett EJ, Redgrave P, Overton PG, 2006. Nociceptive responses of midbrain dopaminergic neurones are modulated by the superior colliculus in the rat. Neuroscience 139, 1479–1493. [DOI] [PubMed] [Google Scholar]
- Cullinan WE, Herman JP, Battaglia DF, Akil H, Watson SJ, 1995. Pattern and time course of immediate early gene expression in rat brain following acute stress. Neuroscience 64, 477–505. [DOI] [PubMed] [Google Scholar]
- Daddaoua N, Lopes M, Gottlieb J, 2016. Intrinsically motivated oculomotor exploration guided by uncertainty reduction and conditioned reinforcement in non-human primates. Sci. Rep. 6, 20202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dashiell JF, 1925. A quantitative demonstration of animal drive. J. Comp. Psychol. 5, 205–208. [Google Scholar]
- Day HE, Masini CV, Campeau S, 2004. The pattern of brain c-fos mRNA induced by a component of fox odor, 2,5-dihydro-2,4,5-trimethylthiazoline (TMT), in rats, suggests both systemic and processive stress characteristics. Brain Res. 1025, 139–151. [DOI] [PubMed] [Google Scholar]
- Dodd ML, Klos KJ, Bower JH, Geda YE, Josephs KA, Ahlskog JE, 2005. Pathological gambling caused by drugs used to treat Parkinson disease. Arch. Neurol. 62, 1377–1381. [DOI] [PubMed] [Google Scholar]
- Dodson PD, Dreyer JK, Jennings KA, Syed ECJ, Wade-Martins R, Cragg SJ, Bolam JP, Magill PJ, 2016. Representation of spontaneous movement by dopaminergic neurons is cell-type selective and disrupted in parkinsonism. Proc. Natl. Acad. Sci. USA 113, E2180–E2188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Driver-Dunckley E, Samanta J, Stacy M, 2003. Pathological gambling associated with dopamine agonist therapy in Parkinson’s disease. Neurology 61, 422–423. [DOI] [PubMed] [Google Scholar]
- Eichenbaum H, 2017. Prefrontal-hippocampal interactions in episodic memory. Nat. Rev. Neurosci. 18, 547–558. [DOI] [PubMed] [Google Scholar]
- Eichenbaum H, Dudchenko P, Wood E, Shapiro M, Tanila H, 1999. The hippocampus, memory, and place cells: is it spatial memory or a memory space? Neuron 23, 209–226. [DOI] [PubMed] [Google Scholar]
- Engelhard B, Finkelstein J, Cox J, Fleming W, Jang HJ, Ornelas S, Koay SA, Thiberge SY, Daw ND, Tank DW, Witten IB, 2019. Specialized coding of sensory, motor and cognitive variables in VTA dopamine neurons. Nature 570, 509–513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ennaceur A, 2010. One-trial object recognition in rats and mice: methodological and theoretical issues. Behav. Brain Res. 215, 244–254. [DOI] [PubMed] [Google Scholar]
- Farrell JS, Lovett-Barron M, Klein PM, Sparks FT, Gschwind T, Ortiz AL, Ahanonu B, Bradbury S, Terada S, Oijala M, Hwaun E, Dudok B, Szabo G, Schnitzer MJ, Deisseroth K, Losonczy A, Soltesz I, 2021. Supramammillary regulation of locomotion and hippocampal activity. Science 374, 1492–1496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiorillo CD, Tobler PN, Schultz W, 2003. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 299, 1898–1902. [DOI] [PubMed] [Google Scholar]
- Floresco SB, Montes DR, Tse MMT, van Holstein M, 2018. Differential contributions of nucleus accumbens subregions to cue-guided risk/reward decision making and implementation of conditional rules. J. Neurosci. 38, 1901–1914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frank MJ, Doll BB, Oas-Terpstra J, Moreno F, 2009. Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation. Nat. Neurosci. 12, 1062–1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frank MJ, Moustafa AA, Haughey HM, Curran T, Hutchison KE, 2007. Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning. Proc. Natl. Acad. Sci. USA 104, 16311–16316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fuhrmann F, Justus D, Sosulina L, Kaneko H, Beutel T, Friedrichs D, Schoch S, Schwarz MK, Fuhrmann M, Remy S, 2015. Locomotion, theta oscillations, and the speed-correlated firing of hippocampal neurons are controlled by a medial septal glutamatergic circuit. Neuron 86, 1253–1264. [DOI] [PubMed] [Google Scholar]
- Gallistel CR, 1969. The incentive of brain-stimulation reward. J. Comp. Physiol. Psychol. 69, 713–721. [DOI] [PubMed] [Google Scholar]
- Gardner MPH, Schoenbaum G, Gershman SJ, 2018. Rethinking dopamine as generalized prediction error. Proc. Biol. Sci. 285, 285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gentry RN, Lee B, Roesch MR, 2016. Phasic dopamine release in the rat nucleus accumbens predicts approach and avoidance performance. Nat. Commun. 7, 13154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghazizadeh A, Griggs W, Hikosaka O, 2016. Ecological origins of object salience: reward, uncertainty, aversiveness, and novelty. Front Neurosci. 10, 378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Givens B, Olton DS, 1994. Local modulation of basal forebrain: effects on working and reference memory. J. Neurosci. 14, 3578–3587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gore BB, Soden ME, Zweifel LS, 2014. Visualization of plasticity in fear-evoked calcium signals in midbrain dopamine neurons. Learn Mem. 21, 575–579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gottlieb J, Hayhoe M, Hikosaka O, Rangel A, 2014. Attention, reward, and information seeking. J. Neurosci. 34, 15497–15504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gray JA, McNaughton N, 2003. The Neuropsychology of Anxiety: An Enquiry into the Functions of the Septo-hippocampal System. Oxford University Press, Oxford. [Google Scholar]
- Green JD, Arduini AA, 1954. Hippocampal electrical activity in arousal. J. Neurophysiol. 17, 533–557. [DOI] [PubMed] [Google Scholar]
- Gruber MJ, Ranganath C, 2019. How curiosity enhances hippocampus-dependent memory: the Prediction, Appraisal, Curiosity, and Exploration (PACE) Framework. Trends Cogn. Sci. 23, 1014–1025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guarraci FA, Kapp BS, 1999. An electrophysiological characterization of ventral tegmental area dopaminergic neurons during differential pavlovian fear conditioning in the awake rabbit. Behav. Brain Res. 99, 169–179. [DOI] [PubMed] [Google Scholar]
- Haber SN, 2016. Corticostriatal circuitry. In: Pfaff DW, Volkow ND (Eds.), Neuroscience in the 21st Century. Springer, New York, pp. 1–21. [Google Scholar]
- Hagan JJ, Salamone JD, Simpson J, Iversen SD, Morris RGM, 1988. Place navigation in rats is impaired by lesions of medial septum and diagonal band but not nucleus basalis magnocellularis. Behav. Brain Res. 27, 9–20. [DOI] [PubMed] [Google Scholar]
- Hamid AA, Pettibone JR, Mabrouk OS, Hetrick VL, Schmidt R, Vander Weele CM, Kennedy RT, Aragona BJ, Berke JD, 2016. Mesolimbic dopamine signals the value of work. Nat. Neurosci. 19, 117–126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hayat A, Feldman S, 1974. Effects of sensory stimuli on single cell activity in the septum of the cat. Exp. Neurol. 43, 298–313. [DOI] [PubMed] [Google Scholar]
- Horvitz JC, 2000. Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events. Neuroscience 96, 651–656. [DOI] [PubMed] [Google Scholar]
- Howe MW, Dombeck DA, 2016. Rapid signalling in distinct dopaminergic axons during locomotion and reward. Nature 535, 505–510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huron D, 2006. Sweet anticipation. Music and the Psychology of Expectation. MIT Press, Cambridge, MA. [Google Scholar]
- Ikemoto S, 2002. Ventral striatal anatomy of locomotor activity induced by cocaine, Damphetamine, dopamine and D1/D2 agonists. Neuroscience 113, 939–955. [DOI] [PubMed] [Google Scholar]
- Ikemoto S, 2005. The supramammillary nucleus mediates primary reinforcement via GABA A receptors. Neuropsychopharmacology 30, 1088–1095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikemoto S, 2007. Dopamine reward circuitry: two projection systems from the ventral midbrain to the nucleus accumbens-olfactory tubercle complex. Brain Res. Rev. 56, 27–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikemoto S, 2010. Brain reward circuitry beyond the mesolimbic dopamine system: a neurobiological theory. Neurosci. Biobehav. Rev. 35, 129–150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikemoto S, Panksepp J, 1992. The effects of early social isolation on the motivation for social play in juvenile rats. Dev. Psychobiol. 25, 261–274. [DOI] [PubMed] [Google Scholar]
- Ikemoto S, Panksepp J, 1994. The relationship between self-stimulation and sniffing in rats: does a common brain system mediate these behaviors? Behav. Brain Res. 61, 143–162. [DOI] [PubMed] [Google Scholar]
- Ikemoto S, Panksepp J, 1999. The role of nucleus accumbens dopamine in motivated behavior: a unifying interpretation with special reference to reward-seeking. Brain Res. Rev. 31, 6–41. [DOI] [PubMed] [Google Scholar]
- Ikemoto S, Wise RA, 2002. Rewarding effects of the cholinergic agents carbachol and neostigmine in the posterior ventral tegmental area. J. Neurosci. 22, 9895–9904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikemoto S, Wise RA, 2004. Mapping of chemical trigger zones for reward. Neuropharmacology 47, 190–201. [DOI] [PubMed] [Google Scholar]
- Ikemoto S, Witkin BM, Morales M, 2003. Rewarding injections of the cholinergic agonist carbachol into the ventral tegmental area induce locomotion and c-Fos expression in the retrosplenial area and supramammillary nucleus. Brain Res. 969, 78–87. [DOI] [PubMed] [Google Scholar]
- Ikemoto S, Qin M, Liu ZH, 2006. Primary reinforcing effects of nicotine are triggered from multiple regions both inside and outside the ventral tegmental area. J. Neurosci. 26, 723–730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikemoto S, Yang C, Tan A, 2015. Basal ganglia circuit loops, dopamine and motivation: a review and enquiry. Behav. Brain Res. 290, 17–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikemoto S, Witkin BM, Zangen A, Wise RA, 2004. Rewarding effects of AMPA administration into the supramammillary or posterior hypothalamic nuclei but not the ventral tegmental area. J. Neurosci. 24, 5758–5765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ilango A, Kesner AJ, Broker CJ, Wang DV, Ikemoto S, 2014a. Phasic excitation of ventral tegmental dopamine neurons potentiates the initiation of conditioned approach behavior: parametric and reinforcement-schedule analyses. Front. Behav. Neurosci. 8, 155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ilango A, Kesner AJ, Keller KL, Stuber GD, Bonci A, Ikemoto S, 2014b. Similar roles of substantia nigra and ventral tegmental dopamine neurons in reward and aversion. J. Neurosci. 34, 817–822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ito HT, Moser EI, Moser MB, 2018. Supramammillary nucleus modulates spike-time coordination in the prefrontal-thalamo-hippocampal circuit during navigation. Neuron 99, 576–587 e575. [DOI] [PubMed] [Google Scholar]
- Jo YS, Heymann G, Zweifel LS, 2018. Dopamine neurons reflect the uncertainty in fear generalization. Neuron 100, 916–925 e913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Julian JB, Doeller CF, 2021. Remapping and realignment in the human hippocampal formation predict context-dependent spatial behavior. Nat. Neurosci. 24, 863–872. [DOI] [PubMed] [Google Scholar]
- Justus D, Dalugge D, Bothe S, Fuhrmann F, Hannes C, Kaneko H, Friedrichs D, Sosulina L, Schwarz I, Elliott DA, Schoch S, Bradke F, Schwarz MK, Remy S, 2017. Glutamatergic synaptic integration of locomotion speed via septoentorhinal projections. Nat. Neurosci. 20, 16–19. [DOI] [PubMed] [Google Scholar]
- Kayser AS, Mitchell JM, Weinstein D, Frank MJ, 2015. Dopamine, locus of control, and the exploration-exploitation tradeoff. Neuropsychopharmacology 40, 454–462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keller KL, Vollrath-Smith FR, Jafari M, Ikemoto S, 2014. Synergistic interaction between caloric restriction and amphetamine in food-unrelated approach behavior of rats. Psychopharmacology 231, 825–840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kesner AJ, Shin R, Calva CB, Don RF, Junn S, Potter CT, Ramsey LA, Abou-Elnaga AF, Cover CG, Wang DV, Lu H, Yang Y, Ikemoto S, 2021. Supramammillary neurons projecting to the septum regulate dopamine and motivation for environmental interaction in mice. Nat. Commun. 12, 2811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirk IJ, 1998. Frequency modulation of hippocampal theta by the supramammillary nucleus, and other hypothalamo-hippocampal interactions: mechanisms and functional implications. Neurosci. Biobehav Rev. 22, 291–302. [DOI] [PubMed] [Google Scholar]
- Kirk IJ, McNaughton N, 1993. Mapping the differential effects of procaine on frequency and amplitude of reticularly elicited hippocampal rhythmical slow activity. Hippocampus 3, 517–525. [DOI] [PubMed] [Google Scholar]
- Kish GB, 1966. Studies of sensory reinforcement. In: Honig WK (Ed.), Operant Behavior: Areas of Research and Application. Appleton-Century-Crofts, Prentice-Hall, Englewood Cliffs, NJ, pp. 109–159. [Google Scholar]
- Kita T, Nishijo H, Eifuku S, Terasawa K, Ono T, 1995. Place and contingency differential responses of monkey septal neurons during conditional place-object discrimination. J. Neurosci. 15, 1683–1703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kocsis B, Vertes RP, 1994. Characterization of neurons of the supramammillary nucleus and mammillary body that discharge rhythmically with the hippocampal theta-rhythm in the rat. J. Neurosci. 14, 7040–7052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koob GF, Volkow ND, 2010. Neurocircuitry of addiction. Neuropsychopharmacology 35, 217–238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krebs RM, Schott BH, Duzel E, 2009. Personality traits are differentially associated with patterns of reward and novelty processing in the human substantia nigra/ ventral tegmental area. Biol. Psychiatry 65, 103–110. [DOI] [PubMed] [Google Scholar]
- Kumaran D, Maguire EA, 2007. Which computational mechanisms operate in the hippocampus during novelty detection? Hippocampus 17, 735–748. [DOI] [PubMed] [Google Scholar]
- Kumaran D, Maguire EA, 2009. Novelty signals: a window into hippocampal information processing. Trends Cogn. Sci. 13, 47–54. [DOI] [PubMed] [Google Scholar]
- Le Heron C, Kolling N, Plant O, Kienast A, Janska R, Ang YS, Fallon S, Husain M, Apps MAJ, 2020. Dopamine modulates dynamic decision-making during foraging. J. Neurosci. 40, 5273–5282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Le May MV, Hume C, Sabatier N, Schele E, Bake T, Bergstrom U, Menzies J, Dickson SL, 2019. Activation of the rat hypothalamic supramammillary nucleus by food anticipation, food restriction or ghrelin administration. J. Neuroendocr. 31, e12676. [DOI] [PubMed] [Google Scholar]
- Lerner TN, Shilyansky C, Davidson TJ, Evans KE, Beier KT, Zalocusky KA, Crow AK, Malenka RC, Luo L, Tomer R, Deisseroth K, 2015. Intact-brain analyses reveal distinct information carried by SNc dopamine subcircuits. Cell 162, 635–647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- M’Harzi M, Jarrard LE, 1992. Effects of medial and lateral septal lesions on acquisition of a place and cue radial maze task. Behav. Brain Res. 49, 159–165. [DOI] [PubMed] [Google Scholar]
- Ma J, Leung LS, 2007. The supramammillo-septal-hippocampal pathway mediates sensorimotor gating impairment and hyperlocomotion induced by MK-801 and ketamine in rats. Psychopharmacology 191, 961–974. [DOI] [PubMed] [Google Scholar]
- Mai B, Sommer S, Hauber W, 2015. Dopamine D1/D2 receptor activity in the nucleus accumbens core but not in the nucleus accumbens shell and orbitofrontal cortex modulates risk-based decision making. Int. J. Neuropsychopharmacol. 18, pyv043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mantz J, Thierry AM, Glowinski J, 1989. Effect of noxious tail pinch on the discharge rate of mesocortical and mesolimbic dopamine neurons: selective activation of the mesocortical system. Brain Res. 476, 377–381. [DOI] [PubMed] [Google Scholar]
- Marvin CB, Tedeschi E, Shohamy D, 2020. Curiosity as the impulse to know: common behavioral and neural mechanisms underlying curiosity and impulsivity. Curr. Opin. Behav. Sci. 35, 92–98. [Google Scholar]
- Matsumoto M, Hikosaka O, 2009. Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature 459, 837–841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matthews GA, Tye KM, 2019. Neural mechanisms of social homeostasis. Ann. N. Y. Acad. Sci. 1457, 5–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCoy AN, Platt ML, 2005. Risk-sensitive neurons in macaque posterior cingulate cortex. Nat. Neurosci. 8, 1220–1227. [DOI] [PubMed] [Google Scholar]
- McFarland WL, Teitelbaum H, Hedges EK, 1975. Relationship between hippocampal theta activity and running speed in the rat. J. Comp. Physiol. Psychol. 88, 324–328. [DOI] [PubMed] [Google Scholar]
- McNaughton N, Corr PJ, 2018. Survival circuits and risk assessment. Curr. Opin. Behav. Sci. 24, 14–20. [Google Scholar]
- McNaughton N, Logan B, Panickar KS, Kirk IJ, Pan WX, Brown NT, Heenan A, 1995. Contribution of synapses in the medial supramammillary nucleus to the frequency of hippocampal theta rhythm in freely moving rats. Hippocampus 5, 534–545. [DOI] [PubMed] [Google Scholar]
- Menegas W, Babayan BM, Uchida N, Watabe-Uchida M, 2017. Opposite initialization to novel cues in dopamine signaling in ventral and posterior striatum in mice. Elife 6, e21886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miliaressis E, LeMoal M, 1976. Stimulation of the medial forebrain bundle: Behavioral dissociation of its rewarding and activating effects. Neurosci. Lett. 2, 295–300. [DOI] [PubMed] [Google Scholar]
- Milner PM, 1989. The discovery of self-stimulation and other stories. Neurosci. Biobehav. Rev. 13, 61–67. [DOI] [PubMed] [Google Scholar]
- Mohebi A, Pettibone JR, Hamid AA, Wong JT, Vinson LT, Patriarchi T, Tian L, Kennedy RT, Berke JD, 2019. Dissociable dopamine dynamics for learning and motivation. Nature 570, 65–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monosov IE, 2020. How outcome uncertainty mediates attention, learning, and decision-making. Trends Neurosci. 43, 795–809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monosov IE, Hikosaka O, 2013. Selective and graded coding of reward uncertainty by neurons in the primate anterodorsal septal region. Nat. Neurosci. 16, 756–762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monosov IE, Leopold DA, Hikosaka O, 2015. Neurons in the primate medial basal forebrain signal combined information about reward uncertainty, value, and punishment anticipation. J. Neurosci. 35, 7443–7459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Montgomery KC, 1954. The role of the exploratory drive in learning. J. Comp. Physiol. Psychol. 47, 60–64. [DOI] [PubMed] [Google Scholar]
- Morris R, Hagan J, 1983. Hippocampal electrical activity and ballistic movement. Neurobiol. Hippocampus 321–331. [Google Scholar]
- Myers AK, Miller NE, 1954. Failure to find a learned drive based on hunger; evidence for learning motivated by" exploration. J. Comp. Physiol. Psychol. 47, 428–436. [DOI] [PubMed] [Google Scholar]
- Naude J, Tolu S, Dongelmans M, Torquet N, Valverde S, Rodriguez G, Pons S, Maskos U, Mourot A, Marti F, Faure P, 2016. Nicotinic receptors in the ventral tegmental area promote uncertainty-seeking. Nat. Neurosci. 19, 471–478. [DOI] [PubMed] [Google Scholar]
- Nishijo H, Kita T, Tamura R, Eifuku S, Terasawa K, Ono T, 1997. Motivation-related neuronal activity in the object discrimination task in monkey septal nuclei. Hippocampus 7, 536–548. [DOI] [PubMed] [Google Scholar]
- Nissen HW, 1930. A study of exploratory behavior in the white rat by means of the obstruction method. Pedagog. Semin. J. Genet. Psychol. 37, 361–376. [Google Scholar]
- O’Keefe J, Nadel L, 1978. The Hippocampus as a Cognitive Map. Clarendon Press, Oxford. [Google Scholar]
- Okada K, Okaichi H, 2010. Functional cooperation between the hippocampal subregions and the medial septum in unreinforced and reinforced spatial memory tasks. Behav. Brain Res. 209, 295–304. [DOI] [PubMed] [Google Scholar]
- Okuda J, Fujii T, Ohtake H, Tsukiura T, Tanji K, Suzuki K, Kawashima R, Fukuda H, Itoh M, Yamadori A, 2003. Thinking of the future and past: the roles of the frontal pole and the medial temporal lobes. Neuroimage 19, 1369–1380. [DOI] [PubMed] [Google Scholar]
- Olds J, 1956. Pleasure centers in the brain. Sci. Am. 195, 105–116. [Google Scholar]
- Olds J, 1958. Self-stimualtion experiment and differentiated reward systems. In: Jasper HH, Proctor LD, Knighton RS, Noshay WC, Costello RT (Eds.), Reticular Formation of the Brain. Little Brown and Company, Boston, pp. 671–683. [Google Scholar]
- Olds J, Milner P, 1954. Positive reinforcement produced by electrical stimulation of septal area and other regions of rat brain. J. Comp. Physiol. Psychol. 47, 419–427. [DOI] [PubMed] [Google Scholar]
- Olds J, Olds ME, 1958. Positive reinforcement produced by stimulating hypothalamus with iproniazid and other compounds. Science 127, 1175–1176. [DOI] [PubMed] [Google Scholar]
- Oleson EB, Gentry RN, Chioma VC, Cheer JF, 2012. Subsecond dopamine release in the nucleus accumbens predicts conditioned punishment and its successful avoidance. J. Neurosci. 32, 14804–14808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Neill M, Schultz W, 2010. Coding of reward risk by orbitofrontal neurons is mostly distinct from coding of reward value. Neuron 68, 789–800. [DOI] [PubMed] [Google Scholar]
- Pan WX, McNaughton N, 2002. The role of the medial supramammillary nucleus in the control of hippocampal theta activity and behaviour in rats. Eur. J. Neurosci. 16, 1797–1809. [DOI] [PubMed] [Google Scholar]
- Pan WX, McNaughton N, 2004. The supramammillary area: its organization, functions and relationship to the hippocampus. Prog. Neurobiol. 74, 127–166. [DOI] [PubMed] [Google Scholar]
- Panksepp J, 1981a. Hypothalamic integration of behavior: Rewards, punishments, and related psychobiological process. Handb. Hypothal. 3, 289–487. [Google Scholar]
- Panksepp J, 1981b. The ontogeny of play in rats. Dev. Psychobiol. 14, 327–332. [DOI] [PubMed] [Google Scholar]
- Panksepp J, 1982. Toward a general psychobiological theory of emotions. Behav. Brain Sci. 5, 407–467. [Google Scholar]
- Panksepp J, Beatty WW, 1980. Social deprivation and play in rats. Behav. Neural Biol. 30, 197–206. [DOI] [PubMed] [Google Scholar]
- Parker NF, Cameron CM, Taliaferro JP, Lee J, Choi JY, Davidson TJ, Daw ND, Witten IB, 2016. Reward and choice encoding in terminals of midbrain dopamine neurons depends on striatal target. Nat. Neurosci. 19, 845–854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pedersen NP, Ferrari L, Venner A, Wang JL, Abbott SBG, Vujovic N, Arrigoni E, Saper CB, Fuller PM, 2017. Supramammillary glutamate neurons are a key node of the arousal system. Nat. Commun. 8, 1405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Phillips PEM, Stuber GD, Helen MLAV, Wightman RM, Carelli RM, 2003. Subsecond dopamine release promotes cocaine seeking. Nature 422, 614–618. [DOI] [PubMed] [Google Scholar]
- Piantadosi PT, Halladay LR, Radke AK, Holmes A, 2021. Advances in understanding meso-cortico-limbic-striatal systems mediating risky reward seeking. J. Neurochem. 157, 1547–1571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Redgrave P, Gurney K, 2006. The short-latency dopamine signal: a role in discovering novel actions? Nat. Rev. Neurosci. 7, 967–975. [DOI] [PubMed] [Google Scholar]
- Rigoli F, Michely J, Friston KJ, Dolan RJ, 2019. The role of the hippocampus in weighting expectations during inference under uncertainty. Cortex 115, 1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson J, Manseau F, Ducharme G, Amilhon B, Vigneault E, El Mestikawy S, Williams S, 2016. Optogenetic activation of septal glutamatergic neurons drive hippocampal theta rhythms. J. Neurosci. 36, 3016–3023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roitman MF, Stuber GD, Phillips PEM, Wightman RM, Carelli RM, 2004. Dopamine operates as a subsecond modulator of food seeking. J. Neurosci. 24, 1265–1271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rolls ET, 2013. The mechanisms for pattern completion and pattern separation in the hippocampus. Front. Syst. Neurosci. 7, 74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Routtenberg A, Lindy J, 1965. Effects of the availability of rewarding septal and hypothalamic stimulation on bar pressing for food under conditions of deprivation. J. Comp. Physiol. Psychol. 60, 158–161. [DOI] [PubMed] [Google Scholar]
- Saji M, Kobayashi S, Ohno K, Sekino Y, 2000. Interruption of supramammillohippocampal afferents prevents the genesis and spread of limbic seizures in the hippocampus via a disinhibition mechanism. Neuroscience 97, 437–445. [DOI] [PubMed] [Google Scholar]
- Salimpoor VN, Zald DH, Zatorre RJ, Dagher A, McIntosh AR, 2015. Predictions and the brain: how musical sounds become rewarding. Trends Cogn. Sci. 19, 86–91. [DOI] [PubMed] [Google Scholar]
- Satoh T, Nakai S, Sato T, Kimura M, 2003. Correlated coding of motivation and outcome of decision by dopamine neurons. J. Neurosci. 23, 9913–9923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schacter DL, Addis DR, Hassabis D, Martin VC, Spreng RN, Szpunar KK, 2012. The future of memory: remembering, imagining, and the brain. Neuron 76, 677–694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schultz W, 1998. Predictive reward signal of dopamine neurons. J. Neurophysiol. 80, 1–27. [DOI] [PubMed] [Google Scholar]
- Schultz W, 2016. Dopamine reward prediction-error signalling: a two-component response. Nat. Rev. Neurosci. 17, 183–195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schultz W, Romo R, 1987. Responses of nigrostriatal dopamine neurons to high-intensity somatosensory stimulation in the anesthetized monkey. J. Neurophysiol. 57, 201–217. [DOI] [PubMed] [Google Scholar]
- Schultz W, Dayan P, Montague PR, 1997. A neural substrate of prediction and reward. Science 275, 1593–1599. [DOI] [PubMed] [Google Scholar]
- Sefton AJ, Harvey A, 2004. Visual system. In: Paxinos G (Ed.), The Rat Nervous System. Academic Press, pp. 1083–1165. [Google Scholar]
- Segal M, 1974. Convergence of sensory input on units in the hippocampal system of the rat. J. Comp. Physiol. Psychol. 87, 91–99. [DOI] [PubMed] [Google Scholar]
- Shin R, Ikemoto S, 2010. Administration of the GABAA receptor antagonist picrotoxin into rat supramammillary nucleus induces c-Fos in reward-related brain structures. Supramammillary picrotoxin and c-Fos expression. BMC Neurosci. 11, 101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shin R, Cao J, Webb SM, Ikemoto S, 2010. Amphetamine administration into the ventral striatum facilitates behavioral interaction with unconditioned visual signals in rats. PLoS One 5, e8741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Squire LR, 1992. Memory and the hippocampus: a synthesis from findings with rats, monkeys, and humans. Psychol. Rev. 99, 195–231. [DOI] [PubMed] [Google Scholar]
- Stauffer WR, Lak A, Schultz W, 2014. Dopamine reward prediction error responses reflect marginal utility. Curr. Biol. 24, 2491–2500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steinberg EE, Keiflin R, Boivin JR, Witten IB, Deisseroth K, Janak PH, 2013. A causal link between prediction errors, dopamine neurons and learning. Nat. Neurosci. 16, 966–973. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stewart J, Hurwitz HMB, 1958. Studies in light-reinforced behaviour III: the effects of continuous, zero and fixed-ratio reinforcement. Q J. Exp. Psychol. 10, 56–61. [Google Scholar]
- Stopper CM, Floresco SB, 2011. Contributions of the nucleus accumbens and its subregions to different aspects of risk-based decision making. Cogn. Affect Behav. Neurosci. 11, 97–112. [DOI] [PubMed] [Google Scholar]
- Stopper CM, Khayambashi S, Floresco SB, 2013. Receptor-specific modulation of risk-based decision making by nucleus accumbens dopamine. Neuropsychopharmacology 38, 715–728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suri RE, 2001. Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model. Exp. Brain Res. 140, 234–240. [DOI] [PubMed] [Google Scholar]
- Sutton RS, Barto AG, 1998. Reinforcement Learning. The MIT Press, Cambridge, MA. [Google Scholar]
- Thinschmidt JS, Kinney GG, Kocsis B, 1995. The supramammillary nucleus: Is it necessary for the mediation of hippocampal theta rhythm? Neuroscience 67, 301–312. [DOI] [PubMed] [Google Scholar]
- Tolman EC, 1948. Cognitive maps in rats and men. Psychol. Rev. 55, 189–208. [DOI] [PubMed] [Google Scholar]
- Trezza V, Campolongo P, Vanderschuren LJ, 2011. Evaluating the rewarding nature of social interactions in laboratory animals. Dev. Cogn. Neurosci. 1, 444–458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vandecasteele M, Varga V, Berenyi A, Papp E, Bartho P, Venance L, Freund TF, Buzsaki G, 2014. Optogenetic activation of septal cholinergic neurons suppresses sharp wave ripples and enhances theta oscillations in the hippocampus. Proc. Natl. Acad. Sci. USA 111, 13535–13540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vanderwolf CH, 1969. Hippocampal electrical activity and voluntary movement in the rat. Electroencephalogr. Clin. Neurophysiol. 26, 407–418. [DOI] [PubMed] [Google Scholar]
- Vann SD, Brown MW, Aggleton JP, 2000. Fos expression in the rostral thalamic nuclei and associated cortical regions in response to different spatial memory tests. Neuroscience 101, 983–991. [DOI] [PubMed] [Google Scholar]
- Vertes RP, 1992. PHA-L analysis of projections from the supramammillary nucleus in the rat. J. Comp. Neurol. 326, 595–622. [DOI] [PubMed] [Google Scholar]
- Vertes RP, Kocsis B, 1997. Brainstem-diencephalo-septohippocampal systems controlling the theta rhythm of the hippocampus. Neuroscience 81, 893–926. [DOI] [PubMed] [Google Scholar]
- Wada E, Wada K, Boulter J, Deneris E, Heinemann S, Patrick J, Swanson LW, 1989. Distribution of alpha 2, alpha 3, alpha 4, and beta 2 neuronal nicotinic receptor subunit mRNAs in the central nervous system: a hybridization histochemical study in the rat. J. Comp. Neurol. 284, 314–335. [DOI] [PubMed] [Google Scholar]
- Wang DV, Tsien JZ, 2011a. Conjunctive processing of locomotor signals by the ventral tegmental area neuronal population. PLoS ONE 6, e16528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang DV, Tsien JZ, 2011b. Convergent processing of both positive and negative motivational signals by the VTA dopamine neuronal populations. PLoS One 6, e17047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Whishaw IQ, Vanderwolf CH, 1973. Hippocampal EEG and behavior: changes in amplitude and frequency of RSA (theta rhythm) associated with spontaneous and learned movement patterns in rats and cats. Behav. Biol. 8, 461–484. [DOI] [PubMed] [Google Scholar]
- Whishaw IQ, Wallace DG, 2003. On the origins of autobiographical memory. Behav. Brain Res. 138, 113–119. [DOI] [PubMed] [Google Scholar]
- White JK, Bromberg-Martin ES, Heilbronner SR, Zhang K, Pai J, Haber SN, Monosov IE, 2019. A neural network for information seeking. Nat. Commun. 10, 5168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Winson J, 1978. Loss of hippocampal theta rhythm results in spatial memory deficit in the rat. Science 201, 160–163. [DOI] [PubMed] [Google Scholar]
- Wirtshafter D, Stratford TR, Shim I, 1998. Placement in a novel environment induces Fos-like immunoreactivity in supramammillary cells projecting to the hippocampus and midbrain. Brain Res. 789, 331–334. [DOI] [PubMed] [Google Scholar]
- Wise RA, 2002. Brain reward circuitry: insights from unsensed incentives. Neuron 36, 229–240. [DOI] [PubMed] [Google Scholar]
- Wise RA, 2004. Dopamine, learning and motivation. Nat. Rev. Neurosci. 5, 483–494. [DOI] [PubMed] [Google Scholar]
- Wise RA, Bozarth MA, 1987. A psychomotor stimulant theory of addiction. Psychol. Rev. 94, 469–492. [PubMed] [Google Scholar]
- Woodworth RS, 1958. Dynamic Psychology. Henry Holt and Company, New York. [Google Scholar]
- Yang C, Hu Y, Talishinsky AD, Potter CT, Calva CB, Ramsey LA, Kesner AJ, Don RF, Junn S, Tan A, Pierce AF, Nicolas C, Arima Y, Lee S-C, Su C, Coudriet JM, Mejia-Aponte CA, Wang DV, Lu H, Yang Y, Ikemoto S, 2022. Medial prefrontal cortex and anteromedial thalamus interaction regulates goal-directed behavior and dopaminergic neuron activity. Nat. Commun. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yasoshima Y, Scott TR, Yamamoto T, 2005. Involvement of the supramammillary nucleus in aversive conditioning. Behav. Neurosci. 119, 1290–1297. [DOI] [PubMed] [Google Scholar]
- Yuan L, et al. , 2019. Topography of reward and aversion encoding in the mesolimbic dopaminergic system. J. Neurosci. 39 (33), 6472–6481. https://doi.org/10.1523/JNEUROSCI.0271-19.2019. https://www.jneurosci.org/content/jneuro/39/33/6472.full.pdf. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zalocusky KA, Ramakrishnan C, Lerner TN, Davidson TJ, Knutson B, Deisseroth K, 2016. Nucleus accumbens D2R cells signal prior outcomes and control risky decision-making. Nature 531, 642–646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zeidman P, Maguire EA, 2016. Anterior hippocampus: the anatomy of perception, imagination and episodic memory. Nat. Rev. Neurosci. 17, 173–182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang GW, Shen L, Zhong W, Xiong Y, Zhang LI, Tao HW, 2018a. Transforming sensory cues into aversive emotion via septal-habenular pathway. Neuron 99, 1016–1028 e1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang GW, Sun WJ, Zingg B, Shen L, He J, Xiong Y, Tao HW, Zhang LI, 2018b. A non-canonical reticular-limbic central auditory pathway via medial septum contributes to fear conditioning. Neuron 97, 406–417 e404. [DOI] [PMC free article] [PubMed] [Google Scholar]
