Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Mar 1.
Published in final edited form as: J Neural Transm (Vienna). 2017 Mar 21;125(3):501–513. doi: 10.1007/s00702-017-1713-z

Roles of centromedian parafascicular nuclei of thalamus and cholinergic interneurons in the dorsal striatum in associative learning of environmental events

Ko Yamanaka 1,2,3, Yukiko Hori 2,4, Takafumi Minamimoto 2,4, Hiroshi Yamada 2,5, Naoyuki Matsumoto 2,6, Kazuki Enomoto 1,2, Toshihiko Aosaki 7, Ann M Graybiel 8, Minoru Kimura 1,2,*
PMCID: PMC5608633  NIHMSID: NIHMS865382  PMID: 28324169

Abstract

The thalamus provides a massive input to the striatum, but despite accumulating evidence, the functions of this system remain unclear. It is known, however, that the centromedian (CM) and parafascicular (Pf) nuclei of the thalamus can strongly influence particular striatal neuron subtypes, notably including the cholinergic interneurons of the striatum (CINs), key regulators of striatal function. Here we highlight the thalamostriatal system through the CM-Pf to striatal CINs. We consider how, by virtue of the direct synaptic connections of the CM and Pf, their neural activity contributes to the activity of CINs and striatal projection neurons (SPNs). CM-Pf neurons are strongly activated at sudden changes in behavioral context, such as switches in action-outcome contingency or sequence of behavioral requirements, suggesting that their activity may represent change of context operationalized as associability. Striatal CINs, on the other hand, acquire and loose responses to external events associated with particular contexts. In light of this physiological evidence, we propose a hypothesis of the CM-Pf - CINs system, suggesting that it augments associative learning by generating an associability signal and promotes reinforcement learning guided by reward prediction error signals from dopamine-containing neurons. We discuss neuronal circuit and synaptic organizations based on in vivo/in vitro studies that we suppose to underlie our hypothesis. Possible implications of CM-Pf - CINs dysfunction (or degeneration) in brain diseases are also discussed by focusing on Parkinson’s disease.

Keywords: thalamostriatal projection, CM-Pf, dorsal striatum, cholinergic interneurons, surprise, non-human primates

Introduction

When faced with uncertain environments, humans and other animals exploit knowledge gained from past experience and process currently available cues with heightened attention to determine whether the cues around them predict desirable or undesirable outcomes. Unexpectedly occurred environmental events and reward often causes a “surprise”, i.e., an unexpectedness of the observation based on experience, which helps in making inferences about current state and promotes learning to predict the future by the acquisition of associations strengthened by reinforcement (Roesch et al., 2012, McGuire et al., 2014). As such cues become more reliable predictors through learning, vigilant processing and learning decline. Theories of animal learning and reinforcement learning propose that unexpected reward produces errors of prediction and promotes learning to reduce the prediction error related to reward (‘reward prediction error’ or RPE) and ultimately guides behavior to maximize total expected rewards (Rescorla and Wagner, 1972, Sutton and Barto, 1998). Many experimental studies have delimited neural activity patterns in basal ganglia circuits to support this theory of reinforcement learning: the encoding of errors of reward prediction by midbrain dopamine neurons (Montague et al., 1996, Schultz, 1998b, Satoh et al., 2003, Bayer and Glimcher, 2005, Enomoto et al., 2011), the dopamine-dependent plasticity of corticostriatal synaptic transmission (Reynolds et al., 2001, Calabresi et al., 2007, Shen et al., 2008), and the encoding of reward value of actions in projection neurons in the striatum (SPNs) (Samejima et al., 2005, Lau and Glimcher, 2008). In the reinforcement learning model of the basal ganglia, RPE signals from dopamine-containing neurons directly promote learning by facilitating or suppressing specific action- and cognition-related corticostriatal signals to adjust future predictions and to select behavior (Houk et al., 1995, Montague et al., 1996, Doya, 2000).

Environmental cues also guide us making inferences about the current situation under uncertain circumstances, and promote learning to predict the future. In this mode of learning, unexpected events produce a surprise and enhance processing of the event and thus help to change the association of such events with the current situation or with future rewarding and aversive events by changing the associative strength of events, as part of associative learning about the event (Mackintosh, 1975, Pearce and Hall, 1980). Thus, surprise drives the modulation of the associative strength of the event, which can then influence behavioral learning in terms of rate of learning (Schultz and Dickinson, 2000). Large errors will result in a boost of the associative learning, whereas small errors will delay learning. Studies of neuronal activity of behaving animal and fMRI studies of human brain suggested parietal cortex, lateral prefrontal cortex, and amygdala are involved in attentional processes for decision making (Glascher et al., 2010, Roesch et al., 2010, Roesch et al., 2012), and in interaction between reinforcement learning and attention (Leong et al., 2017). However, available knowledge is still limiting our understanding of the neuronal circuits, encoded signal contents and synaptic organizations responsible for surprise and learning.

Objective of this review is to provide an overview of functional properties of neurons in the centromedian parafascicular (CM-Pf) complex of the thalamus and of the cholinergic interneurons of the striatum (CINs) and to note the potential of the intralaminar thalamostriatal projections as part of a mechanism related to learn and unlearn association of environmental events with action and outcome based on surprise. We first summarize evidence that the neuronal signals generated by neurons of the CM-Pf complex in non-human primate subjects performing tasks include responses to surprise evoked by the unexpected appearance of events in the environment. Surprise, produced by changes in action-outcome contingency and sequencing of task requirements, is critical for such responses. Thus, the activity of the CM-Pf neurons seems to represent processes previously described as attention to specific cue or “associability of cue”, which is equal to the weighted average of prediction errors generated across the past few trials (Mackintosh, 1975, Pearce and Hall, 1980). We next describe the evolution of CIN responses during the acquisition of associations between click sounds and reward in the putamen and caudate nucleus, which are the main targets of thalamostriatal projections from CM-Pf. We consider potential functions of CM-Pf inputs in shaping the activity of striatal CINs and SPNs during behavioral tasks that require a switch in behavioral context. Finally, we propose a hypothetical model in which a surprise elicited by unexpected external events activates brainstem-thalamocortical circuits for attention, CM-Pf circuits for generating associability signals and midbrain dopamine neurons for generating RPEs. We suggest that associability signals of the CM-Pf play roles in weighting external inputs to reduce trial-and-error learning costs depending on their expected associative strength with outcome, whereas RPEs of dopamine neurons promote reinforcement learning of SPNs (as well as CINs and other striatal interneurons) to modulate corticostriatal signals influencing action and cognition signals to update prediction. Thus, surprise promotes two kinds of learning in the striatum, depending on discrete teaching signals, thereby enhancing prediction about the future and guidance of behavior to specific goals. Dysfunction of the system may be involved in a range of basal ganglia disorders affecting movement and cognition.

Neuronal signals in the thalamic CM-Pf nuclei

Neurons in the CM-Pf complex (Fig. 1a) are characterized by their strong responsiveness to multimodal sensory stimuli and the conditional nature of these responses. For example, click sounds occurring near a monkey sitting in a primate chair evoke phasic activation of CM-Pf neurons at short latencies (around 25 ms, SLF neurons) or at long latencies (around 230 ms, LLF neurons) (Fig. 1b) (Matsumoto et al., 2001, Minamimoto and Kimura, 2002). In monkeys, the SLF neurons are predominantly located in the Pf, which is a main origin of fiber projections to the caudate nucleus and the rostral putamen, whereas LLF neurons are predominantly in the CM, which is main origin of the thalamostriatal projection to the putamen (Matsumoto et al., 2001, Smith et al., 2004, Glimcher and Lau, 2005). Robust responses of both SLF and LLF neurons occur when the clicks appear at varied time intervals or in combination with other stimuli such as beep sounds and LED lights. But the responses gradually decline if the click repeatedly appears at fixed time interval with no other stimuli (Fig. 1c). Similarly, robust responses are evoked by handclaps and knocks on the door or walls of the room in which the monkey is sitting. In many instances, the neurons respond to such alerting stimuli only for the first few times, and the stimuli gradually became ineffective in evoking responses if they repeatedly appear. Thus the basic responsiveness of these intralaminar neurons seems to be a surprise or prediction error, which can drive learning (Rescorla and Wagner, 1972, Pearce and Hall, 1980, Sutton, 1988). Midbrain dopamine neurons encode RPEs (Schultz, 1998b, Satoh et al., 2003, Nakahara et al., 2004, Bayer and Glimcher, 2005). The activity of CM-Pf neurons, however, is different from that of dopamine neurons, because it is similar whether or not the unexpected stimuli are associated with reward (Fig. 1b). In a visually cued target detection task, SLF neurons, for example, exhibit responses with much larger magnitude to a visual target appearing on the side opposite to that of the cued location (i.e., invalid) than to the cued target (valid) (Fig. 1d, e). The invalid target works against prediction as evidenced by longer reaction time of arm movement after invalid target than after valid target (Minamimoto and Kimura, 2002).

Fig. 1.

Fig. 1

CM-Pf of primate posterior intralaminar nuclei and their neuronal activity during behavioral tasks. (A) Nissl-stained frontal section at the level of CM-Pf. CL: centralis lateralis; MD: mediodorsal nucleus; PFdl: dorsolateral part of Pf, FR: fasciculus retroflexus. Asterisk indicates electrolytic lesion mark made by passing DC current (0.2 mA for 10 min) through a recording elgiloy microelectrode. Scale bar: 2 mm. (B) Ensemble average responses of SLF and LLF neurons in the CM-Pf to Clicks presented to monkeys sitting in a primate chair with or without following water reward. (C) Responses of LLF neurons to Click cues in a 75-trial-long block in which beeps, clicks, and LED flashes appeared in a random order with equal probabilities and at equal interval (left). The same neurons were then examined in a 25-trial-long block in which only clicks were presented (right). (D) Experimental paradigm. Illustration of the valid cue condition, in which the cue and the target appear at the same location (left), and the invalid cue condition, in which the cue and the target appear at different locations (right). (E) Average responses of SLF neurons to a contralateral validly cued target (thin line) as compared with those to a contralateral invalidly cued target (thick line). Figures are modified from Matsumoto et al. (2001) and Minamimoto and Kimura (2002).

Anatomical and physiological evidence is in accord with neuronal circuits of CM-Pf being important for associative learning of events and contexts: the CM-Pf receives information about environmental events coming through multimodal sensory channels, is sensitive to unexpectedness and surprise, and has no specific sensitivity to reward. In contrast to the rostral part of the intralaminar thalamic nuclei that relay multimodal sensory arousal to widespread cortical areas (Mennemeier et al. 1997; Pare´ et al. 1988; Steriade et al. 1994, 1997), the CM and Pf, making up the caudal intralaminar thalamic nuclei, are the main source of the massive thalamostriatal projection (Fig. 1a) (Macchi et al., 1984, Ragsdale and Graybiel, 1991, Groenewegen and Berendse, 1994, Haber and McFarland, 2001, Smith et al., 2004).

Activation at surprise produced by unexpected change of behavioral context as a basic characteristic of CM-Pf neurons

Surprise produced by unexpected events activates multiple discrete neuronal circuits. For example, surprise produces selective concentration of neuronal processing of the event and neglect of other perceivable information, referred to as the allocation of limited processing resources (Pashler, 1998). Surprise produces errors of outcome prediction as well as processing change relative to behavioral context in order to modulate the associative strength of the context with outcome. A strong surprise produced by a large error of prior prediction helps in updating prediction of the future. Neural signals related, respectively, to predicted and unpredicted sensory events can be enhanced or diminished depending on the error of prediction. Certainly, during performance of visually instructed left/right button press task for asymmetric reward by monkeys (Fig. 2a; Hold-GO reward task), SLF neurons exhibit robust phasic responses to GO signals immediately after a switch of action-outcome contingency, and these responses decline within a few trials (Fig. 2b left panel and c). Neuronal activation becomes maximal when the largest change is required in the associative strength of the currently appearing instruction signals (Cue/GO) with outcome. This activation is not likely an encoding of prediction errors of outcome, because the outcome-signaling beep sound that occurs after behavioral responses does not evoke neuronal activation during the first few trials immediately after switch of action-outcome contingency. In addition, instructions associated with large and small reward evoke similar responses (Fig. 2b). Further, selective activation of the neurons also occurs when a sequence of behavioral requirements is suddenly changed. If the Hold-GO reward task (Fig. 2a) is switched to the Hold-Cue-GO reward task (Fig. 2d), the outcome-instructing Cue comes instead of the GO signal that was the correct sequence in the last trials. The GO signal instructs, and is associated with, action and its outcome (behavioral context) during the Hold-GO task, whereas the Cue signal plays this role during the Hold-Cue-GO task. Thus, the alternation of tasks induces a switch in the sequence of task events. At this time, neuronal responses to the Cue become maximal and quickly return to baseline levels (Fig. 2e, f). Thus, the neuronal activity may represent “associability of cue” (Mackintosh, 1975, Pearce and Hall, 1980). It is compatible with the notion of weighting of incoming evidence to reduce costs of trial-and-error search depending on their expected associative strength with outcome (Zhang and Sejnowski, 1999, Butts, 2003, Lacey et al., 2007). In studies of neuronal activity of rats performing behavioral tasks, the basolateral amygdala was suggested to encode a surprise at the change of attention produced by unexpected delivery as well as omission of reward (unsigned error, in contrast to signed reward prediction error posited as the signal from dopamine-containing neurons) (Roesch et al., 2010, Roesch et al., 2012).

Fig. 2.

Fig. 2

CM-Pf neurons of SLF type report a “Surprise” induced by an unexpected change in behavioral context. (A) Monkeys depressed either left or right target guided by instruction (green LED) after holding a center (Hold) button for large or small reward. Because the action-outcome contingency was fixed during a block of 60–80 trials, monkeys reacted faster to a large-reward GO signal than to a small-reward GO signal. The contingency was then switched unpredictably after the 60–80 trial block. (B) Pseudocolor plot showing normalized discharge rates for 10 SLF neurons from 4 monkeys during 20 trials before and after switching action-outcome contingency. (C) Spike density histogram (upper panel) and reaction times (z-scores) of GO responses (lower panel) before and after contingency switch. Red asterisks indicate significant change in firing rate and reaction time following the contingency switch (P < 0.05, Wilcoxon signed rank test). (D) Unpredictable switches between the Cue-GO task and the Hold-Cue-GO task. Monkeys depressed an illuminated target immediately after center hold (Hold-GO task) or after a Cue instructing direction and outcome of forthcoming GO signal (Hold-Cue-GO task). (E, F) Pseudocolor plot showing normalized discharge rates for the SLF neurons (E) and population responses to GO and Cue signals before and after switches from Hold-GO task to Hold-Cue-GO task (F). Red asterisks indicate significant difference in firing rate between time before and after the task switch (P < 0.05, Wilcoxon signed rank test). From Yamanaka et al. (2016)

LLF neurons, another class of neurons in the CM-Pf, also encode expected and unexpected transitions of behavioral context, but in a different manner from that of the SLF neurons. During performance of an asymmetrically rewarded GO/NOGO task, LLF neurons were selectively activated when a small-reward action was requested, when a large-reward option had been expected based on an implicitly developed prediction (Minamimoto et al., 2005).

Representation of the associability of cue, as an average of prediction errors generated across the past few trials, is especially suitable for modulation of behavioral responses by CM-Pf neurons in a context-dependent mode. In support of these possibilities, unexpected external events suppress preexisting expectation and response bias (counteracting pre-action bias), whereas the expected demands boost prior expectation and behavioral reaction (Minamimoto et al., 2009, Minamimoto et al., 2014). On the other hand, it is natural to question whether CM-Pf neuron activity reflects mechanisms similar to some more general forms of attention, such as those discussed by Glascher et al. (2010), in which the surprise in facing the unexpected stimulus-action-outcome sequence results in state prediction errors to guide the next action. Further, decision-making in multidimensional environments is facilitated by a bidirectional interaction between attention and trial-and-error reinforcement learning processes (Leong et al., 2017). It is difficult definitively to rule out these alternatives, and some theories of associative learning do use an attention signal to control the assignment of salience to particular stimuli for learning (Pearce and Hall, 1980).

The representation of associability of context by CM-Pf neurons suggested here might not necessarily be a major component, but rather, a specific subcomponent, of a general surprise signal in the brain. However, our proposal about the mechanisms underlying associative learning of behavioral contexts, giving strong weight to the CM-Pf thalamus, is novel and is bolstered by multiple lines of evidence (Fig. 2). Unexpected delivery or omission of reward at the end of a trial may also produce signals of a general surprise and RPE encoded by midbrain dopamine neurons (Schultz, 1998a, Satoh et al., 2003, Nakahara et al., 2004, Bayer and Glimcher, 2005). The event associability signals of the CM-Pf emerge at a time when behavioral learning is about to occur (Fig. 2c). A question, however, is how the associability signals are used in the target brain region for learning prompted by the changes in task context.

Associative learning of events with behavioral context by cholinergic interneurons in the striatum

Thalamostriatal fibers make direct synaptic contacts with SPNs (Sadikot et al., 1992, Lacey et al., 2007, Ellender et al., 2013, Matamales et al., 2016) and with CINs (Lapper et al., 1992, Sidibe and Smith, 1999, Ding et al., 2010, Bradfield et al., 2013). CINs and fast-spiking neurons make up a major class of interneurons in the striatum. Though CINs represent a small population of striatal neurons (less than 3%), their broad arborizations and tiled distribution (Bolam et al., 1984, Wilson et al., 1990, Aosaki et al., 1995) provide dense cholinergic innervation throughout the striatum. Their cell bodies lie mainly in the matrix compartment, many near striosomal borders, and their fine axons innervate both the striosome and matrix compartments (Graybiel et al., 1986, Crittenden and Graybiel, 2016). It should be noted that the thalamostriatal projection, as estimated by molecular markers of neuronal activity, has a stronger impact on CINs than on SPNs (Bradfield et al., 2013). Studies of in vitro slice preparations point to the importance of the Pf-CIN synapses in regulating processing in the striatal circuits by both nicotinic and muscarinic acetylcholine receptors differentially distributed on different elements of striatal circuits and their afferents (Ding et al., 2010, Threlfell et al., 2012). Inactivating neuronal activity in the macaque CM-Pf complex by local infusion of the GABAA receptor agonist, muscimol, almost completely abolished the responsiveness to behaviorally salient stimuli of the tonically active neurons (TANs), which are though largely to correspond to CINs (Matsumoto et al., 2001). Furthermore, inactivation of Pf has been shown to block an increase of acetylcholine release in the dorsomedial striatum during reversal learning, and to lead to deterioration in the performance of reversal learning (Brown et al., 2010, Bradfield et al., 2013). Selective removal of excitatory input from Pf to CINs in the posterior dorsomedial striatum is reported to reduce firing rates of CINs and to produce deficits in reward-oriented learning especially after changes in action-outcome contingency (Kato et al., 2011, Bradfield et al., 2013). Intriguingly, immunotoxin-mediated elimination of CINs (Kobayashi et al., 1995) from the dorsomedial striatum (DMS) or gene-specific silencing of M4 muscarinic receptor enhances reversal learning (Okada et al., 2014) and impairment of attentional set-shifting (Aoki et al., 2015), suggesting engagement of CINs and M4 receptors in the striatum to acquisition, extinction and relearning of reward contingency (Kato et al., 2011, Okada et al., 2014, Aoki et al., 2015). Thus, the thalamostriatal projections from CM-Pf seem critical for learning associations of behavioral context with outcome at least partly by directly affecting CIN-mediated processing in the striatum under conditions that require a switch in behavioral context.

In the non-human primate striatum, TANs and phasically active neurons are regarded as corresponding to CINs and SPNs (Aosaki et al., 1995, Raz et al., 1996, Shimo and Hikosaka, 2001, Apicella, 2007). The identification of TANs is more difficult in the rodent striatum than in the primate striatum. Electrophysiological, morphological, juxtacellular recording-staining and optogenetic confirmation suggested that the population of neurons identified as TANs is likely to have a large component of CINs (Bennett and Wilson, 1999, Inokawa et al., 2010, Bradfield et al., 2013, Atallah et al., 2014). A biochemical marker particularly sensitive to CIN activity was reported (Brown et al., 2010). The firing of TANs switches from a tonic mode at about 3–8 spikes/s to a transient pause for about 300 ms followed by rebound facilitation after sensory cues for motivationally salient events, such as reward and aversion (Kimura et al., 1984, Apicella et al., 1991, Morris et al., 2004, Yamada et al., 2004, Atallah et al., 2014). The pause is sometimes preceded by brief activation. Electrophysiological studies showed that the pause-rebound facilitation with or without a preceding small activation is generated by extrinsic synaptic drive and/or slowing of intrinsic pace-making (Maurice et al., 2004, Wilson, 2005, Ding et al., 2006, Ding et al., 2010, Goldberg and Reynolds, 2011). Evidence in slice experiments suggests that the CINs can control single spikes of SPNs, that this modulation occurs in striosomes as well as the matrix compartment of the striatum, and that this modulation is vulnerable to control by exogenous sources—thus echoing the highly temporally controlled spike activity of the CINs themselves (Graybiel et al., 1994, Crittenden et al., 2017).

The above mentioned knowledge is a basis of our hypothesis that stimulus associability signals derived from the CM-Pf play indispensable roles in associative learning about the cue that occurs within the striatum, with CINs working as key players in this thalamostriatal mechanism. The activity of CINs has been examined during behavioral learning (Aosaki et al., 1994, Blazquez et al., 2002, Atallah et al., 2014). Aosaki and others directly assessed the associative learning part of the model by recording activity of TANs of monkeys undergoing behavioral conditioning during which particular sensory events (click) became associated with reward (Fig. 3) (Aosaki et al., 1994). Through the conditioning for up to 3 to 4 weeks, population responses gradually developed in both putamen and caudate nucleus, suggesting that TANs acquired new association of click with reward and maintained for weeks as a population. On the other hand, when individual TANs were recorded for extended times during conditioning, a given TAN could acquire the responsiveness in at least as short a time as 10 min. Similarly, individual TANs could lose their responses to the conditioned stimulus after about the same duration of extinction training (Aosaki et al., 1994). Further, when the monkey was trained to receive reward in relation to a new conditioning stimulus, TANs were capable of switching their sensory responses to the new stimulus.

Fig. 3.

Fig. 3

Changes in TAN responses to a click cue before, during and after click-reward associative learning in two macaque monkeys. Almost no licks were triggered by the click at the start of training, but by the fifth day, more than 90 % of licks were triggered and the values were nearly 100% for the reminder of the 3-week training period. (A) Among a total of 858 TANs recorded, only a small number (17%) of TANs responded to the click without following reward before conditioning (top). After conditioning, more than two-thirds of TANs responded to the reward-predictive click with a pause and rebound facilitation of sustained firing. In some TANs, the pause was preceded by an initial facilitation (not shown). (B) Ensemble average histograms showing responses of TANs to clicks (upper) recorded in the putamen (left) and caudate nucleus (right) before and during conditioning as indicated at left. Lower histograms show average orofacial muscle activity as a measure of licking. (C) Changes in percentages of TANs in the putamen (filled circles) and caudate nucleus (open circles) responsive to the clicks during training. From Aosaki et al. 1994.

Figure 4 summarizes our hypothesis by highlighting sharp contrast in temporal dynamics of responses to conditioning stimulus between CM-Pf neurons and striatal TANs after an unpredictable change of various behavioral contexts. CM-Pf neurons are selectively activated for very short time (a few trials for 10 to 20 s) instantly after switch of behavioral contexts supporting online associability of cue (Fig. 4a). On the contrary, striatal TANs learn and unlearn association through much longer time range (5 to 15 min of trials) (Fig. 4b). The development of response plasticity is tightly correlated with the probability that a given stimulus would evoke a behavioral response (Blazquez et al., 2002). Indeed, TANs (including optogenetically identified CINs) in the rat ventromedial striatum during a reward-based T-maze learning task showed a striking modulation of activity at the switch of reinforcement contingency (auditory- to tactile-instruction versions) and the reduced activity during overtraining (Atallah et al., 2014). In the same auditory and tactile cuing experiment, by contrast, SPNs were nearly insensitive to contingency switches and gradually lost outcome signaling while maintaining reports at trial start and goal approach, as though more related to stable aspects of task structure (Atallah et al, 2014). The high sensitivity of TANs to switches of action-outcome contingency is further supported by findings that removal of the excitatory input from Pf to the CINs in rat posterior DMS resulted in deficit in goal-directed learning after changes in the action-outcome contingency (Bradfield et al., 2013). Toxicogenetic ablation of CINs displayed behavioral deficits which were same pattern observed upon thalamostriatal denervation supporting the role of the thalamostriatal projection in updating action outcome contingencies (Matamales et al., 2016).

Fig. 4.

Fig. 4

Contrasting temporal dynamics of responses after unpredictable switches of action-outcome association and sequence of task events in CM-Pf neurons (a few trials, 10 to 20 s) and striatal TAN (5 to 15 min of trials). (A) Pseudocolor plot showing normalized discharge rates for CM-Pf neurons before and after switching action-outcome contingency (upper panel) and switches from Hold-GO task to Hold-Cue-GO task (same record as Fig. 2B and E). (B) Time course of acquisition and extinction of conditioned response of an individual TAN. At the start of the recording period (left panel), the TAN recorded showed no response to the new stimulus, but it gradually acquired the response, as shown by records at 5 and 15 min. By contrast, the same TAN began the recording period showing a response to the old clicks. The records at 5 and 10 min of recording show the gradual loss of the response of the TAN as the extinction trials continued. From Aosaki et al. (1994). (C) Anatomical relations of primate CM-Pf and striatum (putamen, Put. and caudate nucleus, Cd N.) highlighting transmission of surprise signals from CM-Pf to the striatum where TANs learn and relearn associations of stimuli with outcome. Dopamine-containing innervation of the striatum is drawn as a reference.

It was previously proposed that TANs preferentially encode prediction errors to situational events (Macchi et al., 1984, Apicella et al., 2011) and unpredictably appeared task events (Apicella, 2007, Yamada et al., 2007). But the CM-Pf activity related to stimulus associability with reward as well as no-reward appears to inform TANs that “it is the time for a change” and to induce TAN activity at initial phases of acquisition, extinction and relearning of stimulus outcome associations. The role of the CM-PF - TAN system in weighing incoming evidence may participate in update of beliefs and adjusting behavior. In this sense, it may be reasonable to suppose that stimulus associability-based learning and relearning as instantiated by TANs modulates the learning rates of dopamine-dependent reinforcement learning in the striatum. This idea is consistent with the marked effects of Pf thalamic input on dopamine release in the local striatal regions by either direct action on dopaminergic axons or by way of CINs (Rice et al., 2011, Threlfell et al., 2012). By the same token, aberrations in this process could yield repetitive, overly focused and inflexible behaviors (Alexander et al., 1990, Graybiel, 2008, Crittenden et al., 2017).

Although the synaptic organization of CIN circuits in the striatum underlying behavioral learning, extinction and relearning still remains unclear, recent evidence suggests linking cholinergic signaling to corticostriatal synaptic plasticity through muscarinic receptors. Specifically, in direct pathway SPNs (dSPNs), cholinergic signaling through M4 receptors promotes long-term depression (LTD) of corticostriatal glutamatergic synapses (Girasole and Nelson, 2015, Shen et al., 2015). The M4 receptor-mediated LDT of dSPNs suppressed regulator of G protein signaling type 4 (RGS4) activity and blocked D1 dopamine receptor-dependent long-term potentiation (LTP) (Shen et al., 2015). As described above, selective elimination of CINs and silencing M4 receptors enhanced reversal learning and extinction learning (Okada et al., 2014). These results suggest cholinergic signaling through M4 but not M1 receptors on dSPNs is deeply involved in LTD of corticostriatal synaptic transmission for stabilizing learned reward contingency (Okada et al., 2014), and for replacing with new one through extinction and reversal learning if it is necessary. Furthermore, bidirectional plasticity, not only LTD but also LTP, is induced in dSPNs as well as in indirect pathway SPNs (iSPNs) (Shen et al., 2008, Girasole and Nelson, 2015), while cholinergic control of corticostriatal LTD selectively occurs in dSPNs (Shen et al., 2015). On the other hand, dSPNs and iSPNs are similarly innervated by cortical and thalamic afferents (Fujiyama et al., 2006, Doig et al., 2010, Doig et al., 2014, Huerta-Ocampo et al., 2014, Smith et al., 2014, Matamales et al., 2016, Parker et al., 2016). Thalamic afferents from centrolateral nucleus, rostral intralaminar nucleus, make large-amplitude responses through AMPA receptors, whereas those from Pf, caudal intralaminar nucleus, produce small-amplitude responses and LTD in dSPNs and iSPNs through NMDA receptors (Parker et al., 2016).

Roles of CM-Pf nuclei of the thalamus and cholinergic interneurons of the dorsal striatum in learning and enduring representations of events dependent on behavioral context

We suggested above that the contributions of CINs to the generation of conditional striatal outputs through SPNs and striatal interneuronal networks promote reinforcement learning guided by reward prediction error signals from dopamine neurons. Indeed, outcome-associated instructions in a GO/NOGO task in monkeys evoke pause-rebound facilitation responses of TANs depending on how the instructions are associated with outcome (Hori et al., unpublished data). Encoding trial-unique action-outcome contingencies may help in processing action signals in a conditioned manner. Indeed, the action signals encoded in most of SPNs (75%) are specific to combination of action with outcome (Hori et al., 2009). In addition, initial and peak activations of the SPNs occurred exactly at the pause response of TANs suggesting potential synaptic coupling between TANs and SPNs.

Accumulating evidence suggests the existence of synaptic interactions among CINs, various types of GABAergic interneurons and direct and indirect pathway SPNs. In the striatum, muscarinic receptors do not directly couple to endogenous ion channels but instead indirectly alter excitability through second messenger signaling cascades (Bernard et al., 1992, Calabresi et al., 1998, Shen et al., 2007, Goldberg et al., 2012). The sustained firing of CINs at a rate of 3–8 spikes/s under both in vitro and in vivo preparation and broad axonal arborizations (Bolam et al., 1984, Perez-Rosello et al., 2005) have been thought to create a background tone of acetylcholine that tonically drives muscarinic receptor activation. Some actions of acetylcholine on SPNs are mediated through G-protein-coupled muscarinic receptors and were thought to enhance corticostriatal synaptic transmission (Akins et al., 1990, Galarraga et al., 1999, Shen et al., 2005). GABAergic interneurons receive nicotinic excitation and provide potent inhibition to SPNs (Galarraga et al., 1999, Faust et al., 2015, Mamaligas and Ford, 2016, Ztaou et al., 2016), whereas their suppression largely reduces CIN-evoked inhibition of SPNs (Faust et al., 2015, Faust et al., 2016). This indicates that the majority of CIN-evoked inhibition of SPNs is derived from GABAergic interneurons, although a recent study has reported that the firing of individual CINs resulted in monosynaptic spontaneous inhibitory post-synaptic currents in SPNs through muscarinic receptors (Mamaligas and Ford, 2016, Crittenden et al., 2017). On the other hand, Smith and his colleagues conducted a quantitative ultrastructural analysis of the GABAergic and glutamatergic innervation of CINs in the postcommissural putamen of rhesus monkeys. They showed that 60% of all synaptic inputs to CINs originate from GABAergic terminals, whereas 21% are from putatively glutamatergic terminals that establish asymmetric synapses, and 19% from other (non-GABAergic) sources of symmetric synapses (Gonzales et al., 2013). Thus, CINs receive prominent GABAergic inputs from multiple origins, including a significant component from axon collaterals of direct and indirect pathway SPNs (Akins et al., 1990, Gonzales et al., 2013). Further studies are necessary to understand local circuit mechanisms underlying conditional nature of striatal outputs through dSPNs and iSPNs.

Direct-indirect pathways and striosome-matrix systems are basic processing circuits in the basal ganglia, receiving glutamatergic signals from the cerebral cortex and the thalamus and dopaminergic signals from the midbrain (Graybiel and Ragsdale, 1978, Gerfen, 1984, Albin et al., 1989, Crittenden and Graybiel, 2011, Atallah et al., 2014, Friedman et al., 2015). The projections from CM-Pf neurons reach both dSPNs and iSPNs (Vandermaelen and Kitai, 1980, Ding et al., 2010, Smith et al., 2014) as well as CINs (Ding et al., 2010, Bradfield et al., 2013). They may exert discrete impacts on SPNs of the two systems through CINs: transient presynaptic suppression of cortical input followed by prolonged enhancement of responsiveness of iSPNs suggesting involvement of Pf inputs in attentional shift (Ding et al., 2010). Thalamostriatal projections originating from subnuclei of the intralaminar nuclei have differential impacts: neurons in the rodent centrolateral nucleus produce larger amplitude EPSPs as efficient drivers for SPNs, especially dSPNs, than Pf neurons do (Matamales et al., 2016). Evidence further suggests that identified striosomal SPNs, not only SPNs of the cholinergic-rich matrix compartment, respond to input from the CINs (Crittenden et al., 2017). Neuronal processes of coticostriatal synaptic plasticity through dSPNs and iSPNs in concert with mesostriatal dopamine system and CM-Pf - CIN system are obviously the basis of behavioral learning and relearning. But recent studies suggest that they are much more sophisticated than previously considered. For instance, control of plasticity in corticostriatal signal transmission may not be restricted in D1- and D2-receptor-mediated segregation in different SPN classes: D1-receptor-mediated LTP in dSPNs (Reynolds et al., 2001, Calabresi et al., 2007) and D2-receptor-mediated LTD in iSPNs (Gerdeman et al., 2002, Kreitzer and Malenka, 2007). Both dSPNs and iSPNs may participate in corticostriatal synaptic plasticity in bidirectional (LTP and LTD) manner mediated not solely by D1, D2 receptors but also M4 and A2A receptors (Shen et al., 2008, Girasole and Nelson, 2015, Shen et al., 2015). Thus, although there are obviously many interacting circuits yet to be identified, currently available knowledge is compatible with our hypothetical model that the CINs and their inputs originating from CM-Pf play a critical role in associative learning, extinction and relearning based on salient stimulus.

Possible future research directions are to identify and manipulate signal contents and their plastic changes during associative learning and reversal learning initiated by salient stimulus in individual cell types composing corticostriatal, CM-Pf - CIN and mesostriatal dopamine systems. To this end, cutting-edge technologies including circuit-specific transgenic animals, optogenetics, chemogenetics and endoscopic imaging of activity of hundreds of neurons simultaneously are available in the rodent studies and becoming possible tools in the studies of non-human primates.

Dysfunction of the thalamostriatal system and disease states

CM-Pf neurons die in both animal models of Parkinson’s disease (PD) (Villalba et al., 2014) and in PD patients (Henderson et al., 2000, Henderson et al., 2005, Halliday et al., 2011). Reversal learning in PD patients is particularly impaired as well (Peterson et al., 2009, Buelow et al., 2015). Cell type-specific recording, stimulation and inhibition in PD model animals demonstrated critical involvement of thalamostiatal system. Corticostriatal afferents have received the most attention in PD research, and dysregulation of synaptic plasticity at these inputs is thought to contribute to imbalanced basal ganglia circuit function (Kreitzer and Malenka, 2008, Surmeier et al., 2009). In contrast, fewer studies have examined the thalamostriatal system in PD (Galvan and Smith, 2011, Smith et al., 2014). Recently, Parker et al. found, by using optical stimulation of cortical and thalamic inputs and recording from dSPNs and iSPNs, that thalamostriatal synapses are a site of maladaptive changes in PD mice. After dopamine depletion, relative strength of thalamic, not cortical, inputs reversed: from direct pathway biasing under intact striatum to indirect pathway biasing under PD state. Chemogenetic or optogenetic inhibition of thalamic inputs to striatum restored motor function (Parker et al., 2016). Thus, although our understanding about causal relationship of the circuit and synaptic mechanisms of thalamostriatal system to neurological disorders remains still limited, literatures suggest significant involvement of thalamostriatal system in normal and pathological function of the basal ganglia.

Conclusions

We highlight the thalamostriatal projections from the CM-Pf to CINs in the striatum and describe their substantial involvement in associative learning. We suggest that this thalamostriatal circuit encodes associability signals that promote learning independent of positive or negative reinforcement. We propose a hypothesis about associative learning of environmental events through the CM-Pf - CIN system. Neurons in the CM-Pf are strongly activated at changes in behavioral situations such as altered action-outcome contingencies and transitions in behavioral requirements, suggesting the encoding of the associability of behavioral context, an average of prediction errors generated across the past few trials. We discuss neuronal circuit and synaptic organizations based on in vivo/in vitro studies mostly using rodent subjects that we consider to underlie our hypothesis of associative learning and unlearning of CINs and reinforcement learning by dSPNs and iSPNs that is augmented by online associability signals from the CM-Pf. We examine in this light possible implications of CM-Pf - CIN dysfunction or degeneration of these thalamostriatal circuits in neurologic disorders by focusing on Parkinson’s disease.

Acknowledgments

We thank M. Haruno and Y. Sakai, Y. Kubota for critical reading and advice on the manuscript, R. Sakane, M. Funami and I. Kawashima for technical assistance. This study was supported by Grant-in-Aid for Scientific Research 23120010, 26290009, and 15K14320 to M.K., and for Young Scientists (B) 20700293 to Y.H., 24700425 to K.Y., by the Development of Biomarker Candidates for Social Behavior carried out under the Strategic Research Program for Brain Sciences from the Ministry of Education, Culture, Sports, Science and Technology of Japan (M.K.), and by National Institutes for Health grant R01 NS025529 to A.M.G.

References

  1. Akins PT, Surmeier DJ, Kitai ST. M1 muscarinic acetylcholine receptor in cultured rat neostriatum regulates phosphoinositide hydrolysis. Journal of neurochemistry. 1990;54:266–273. doi: 10.1111/j.1471-4159.1990.tb13310.x. [DOI] [PubMed] [Google Scholar]
  2. Albin RL, Young AB, Penney JB. The functional anatomy of basal ganglia disorders. Trends Neurosci. 1989;12:366–375. doi: 10.1016/0166-2236(89)90074-x. [DOI] [PubMed] [Google Scholar]
  3. Alexander GE, Crutcher MD, DeLong MR. Basal ganglia-thalamocortical circuits: parallel substrates for motor, oculomotor, “prefrontal” and “limbic” functions. Prog Brain Res. 1990;85:119–146. [PubMed] [Google Scholar]
  4. Aoki S, Liu AW, Zucca A, Zucca S, Wickens JR. Role of Striatal Cholinergic Interneurons in Set-Shifting in the Rat. J Neurosci. 2015;35:9424–9431. doi: 10.1523/JNEUROSCI.0490-15.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Aosaki T, Kimura M, Graybiel AM. Temporal and spatial characteristics of tonically active neurons of the primate’s striatum. J Neurophysiol. 1995;73:1234–1252. doi: 10.1152/jn.1995.73.3.1234. [DOI] [PubMed] [Google Scholar]
  6. Aosaki T, Tsubokawa H, Ishida A, Watanabe K, Graybiel AM, Kimura M. Responses of tonically active neurons in the primate’s striatum undergo systematic changes during behavioral sensorimotor conditioning. J Neurosci. 1994;14:3969–3984. doi: 10.1523/JNEUROSCI.14-06-03969.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Apicella P. Leading tonically active neurons of the striatum from reward detection to context recognition. Trends Neurosci. 2007;30:299–306. doi: 10.1016/j.tins.2007.03.011. [DOI] [PubMed] [Google Scholar]
  8. Apicella P, Ravel S, Deffains M, Legallet E. The role of striatal tonically active neurons in reward prediction error signaling during instrumental task performance. J Neurosci. 2011;31:1507–1515. doi: 10.1523/JNEUROSCI.4880-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Apicella P, Scarnati E, Schultz W. Tonically discharging neurons of monkey striatum respond to preparatory and rewarding stimuli. Exp Brain Res. 1991;84:672–675. doi: 10.1007/BF00230981. [DOI] [PubMed] [Google Scholar]
  10. Atallah HE, McCool AD, Howe MW, Graybiel AM. Neurons in the ventral striatum exhibit cell-type-specific representations of outcome during learning. Neuron. 2014;82:1145–1156. doi: 10.1016/j.neuron.2014.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bayer HM, Glimcher PW. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron. 2005;47:129–141. doi: 10.1016/j.neuron.2005.05.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bennett BD, Wilson CJ. Spontaneous activity of neostriatal cholinergic interneurons in vitro. J Neurosci. 1999;19:5586–5596. doi: 10.1523/JNEUROSCI.19-13-05586.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bernard V, Normand E, Bloch B. Phenotypical characterization of the rat striatal neurons expressing muscarinic receptor genes. J Neurosci. 1992;12:3591–3600. doi: 10.1523/JNEUROSCI.12-09-03591.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Blazquez PM, Fujii N, Kojima J, Graybiel AM. A network representation of response probability in the striatum. Neuron. 2002;33:973–982. doi: 10.1016/s0896-6273(02)00627-x. [DOI] [PubMed] [Google Scholar]
  15. Bolam JP, Wainer BH, Smith AD. Characterization of cholinergic neurons in the rat neostriatum. A combination of choline acetyltransferase immunocytochemistry, Golgi-impregnation and electron microscopy. Neuroscience. 1984;12:711–718. doi: 10.1016/0306-4522(84)90165-9. [DOI] [PubMed] [Google Scholar]
  16. Bradfield LA, Bertran-Gonzalez J, Chieng B, Balleine BW. The thalamostriatal pathway and cholinergic control of goal-directed action: interlacing new with existing learning in the striatum. Neuron. 2013;79:153–166. doi: 10.1016/j.neuron.2013.04.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Brown HD, Baker PM, Ragozzino ME. The parafascicular thalamic nucleus concomitantly influences behavioral flexibility and dorsomedial striatal acetylcholine output in rats. J Neurosci. 2010;30:14390–14398. doi: 10.1523/JNEUROSCI.2167-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Buelow MT, Amick MM, Queller S, Stout JC, Friedman JH, Grace J. Feasibility of use of probabilistic reversal learning and serial reaction time tasks in clinical trials of Parkinson’s disease. Parkinsonism & related disorders. 2015;21:894–898. doi: 10.1016/j.parkreldis.2015.05.019. [DOI] [PubMed] [Google Scholar]
  19. Butts DA. How much information is associated with a particular stimulus? Network: Computational Neural systems. 2003;14:177–187. [PubMed] [Google Scholar]
  20. Calabresi P, Centonze D, Pisani A, Sancesario G, North RA, Bernardi G. Muscarinic IPSPs in rat striatal cholinergic interneurones. J Physiol. 1998;510(Pt 2):421–427. doi: 10.1111/j.1469-7793.1998.421bk.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Calabresi P, Picconi B, Tozzi A, Di Filippo M. Dopamine-mediated regulation of corticostriatal synaptic plasticity. Trends Neurosci. 2007;30:211. doi: 10.1016/j.tins.2007.03.001. [DOI] [PubMed] [Google Scholar]
  22. Crittenden JR, Graybiel AM. Basal Ganglia disorders associated with imbalances in the striatal striosome and matrix compartments. Front Neuroanat. 2011;5:59. doi: 10.3389/fnana.2011.00059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Crittenden JR, Graybiel AM. Disease-associated changes in the striosome and matrix compartments of the dorsal striatum. In: Steiner H, Tseng KY, editors. Handbook of Basal Ganglia Structure and Function. Amsterdam: Elsevier; 2016. pp. 801–821. [Google Scholar]
  24. Crittenden JR, Lacey CJ, Feng-Ju Weng E, Garrison CA, Lin Y, Graybiel AM. Striatal cholinergic interneurons modulate spike-timing in striosomes and matrix by an amphetamine-sensitive mechanism. Front Neuroanat. 2017;11:20. doi: 10.3389/fnana.2017.00020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Ding J, Guzman JN, Tkatch T, Chen S, Goldberg JA, Ebert PJ, Levitt P, Wilson CJ, Hamm HE, Surmeier DJ. RGS4-dependent attenuation of M4 autoreceptor function in striatal cholinergic interneurons following dopamine depletion. Nat Neurosci. 2006;9:832–842. doi: 10.1038/nn1700. [DOI] [PubMed] [Google Scholar]
  26. Ding JB, Guzman JN, Peterson JD, Goldberg JA, Surmeier DJ. Thalamic gating of corticostriatal signaling by cholinergic interneurons. Neuron. 2010;67:294–307. doi: 10.1016/j.neuron.2010.06.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Doig NM, Magill PJ, Apicella P, Bolam JP, Sharott A. Cortical and thalamic excitation mediate the multiphasic responses of striatal cholinergic interneurons to motivationally salient stimuli. J Neurosci. 2014;34:3101–3117. doi: 10.1523/JNEUROSCI.4627-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Doig NM, Moss J, Bolam JP. Cortical and thalamic innervation of direct and indirect pathway medium-sized spiny neurons in mouse striatum. J Neurosci. 2010;30:14610–14618. doi: 10.1523/JNEUROSCI.1623-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Doya K. Reinforcement learning in continuous time and space. Neural Comput. 2000;12:219–245. doi: 10.1162/089976600300015961. [DOI] [PubMed] [Google Scholar]
  30. Ellender TJ, Harwood J, Kosillo P, Capogna M, Bolam JP. Heterogeneous properties of central lateral and parafascicular thalamic synapses in the striatum. J Physiol. 2013;591:257–272. doi: 10.1113/jphysiol.2012.245233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Enomoto K, Matsumoto N, Nakai S, Satoh T, Sato TK, Ueda Y, Inokawa H, Haruno M, Kimura M. Dopamine neurons learn to encode the long-term value of multiple future rewards. Proc Natl Acad Sci U S A. 2011;108:15462–15467. doi: 10.1073/pnas.1014457108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Faust TW, Assous M, Shah F, Tepper JM, Koos T. Novel fast adapting interneurons mediate cholinergic-induced fast GABAA inhibitory postsynaptic currents in striatal spiny neurons. Eur J Neurosci. 2015;42:1764–1774. doi: 10.1111/ejn.12915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Faust TW, Assous M, Tepper JM, Koos T. Neostriatal GABAergic Interneurons Mediate Cholinergic Inhibition of Spiny Projection Neurons. J Neurosci. 2016;36:9505–9511. doi: 10.1523/JNEUROSCI.0466-16.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Friedman A, Homma D, Gibb LG, Amemori K, Rubin SJ, Hood AS, Riad MH, Graybiel AM. A corticostriatal path targeting striosomes controls decision-making under conflict. Cell. 2015;161:1320–1333. doi: 10.1016/j.cell.2015.04.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Fujiyama F, Unzai T, Nakamura K, Nomura S, Kaneko T. Difference in organization of corticostriatal and thalamostriatal synapses between patch and matrix compartments of rat neostriatum. Eur J Neurosci. 2006;24:2813–2824. doi: 10.1111/j.1460-9568.2006.05177.x. [DOI] [PubMed] [Google Scholar]
  36. Galarraga E, Hernandez-Lopez S, Reyes A, Miranda I, Bermudez-Rattoni F, Vilchis C, Bargas J. Cholinergic modulation of neostriatal output: a functional antagonism between different types of muscarinic receptors. J Neurosci. 1999;19:3629–3638. doi: 10.1523/JNEUROSCI.19-09-03629.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Galvan A, Smith Y. The primate thalamostriatal systems: Anatomical organization, functional roles and possible involvement in Parkinson’s disease. Basal ganglia. 2011;1:179–189. doi: 10.1016/j.baga.2011.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Gerdeman GL, Ronesi J, Lovinger DM. Postsynaptic endocannabinoid release is critical to long-term depression in the striatum. Nat Neurosci. 2002;5:446–451. doi: 10.1038/nn832. [DOI] [PubMed] [Google Scholar]
  39. Gerfen CR. The neostriatal mosaic: compartmentalization of corticostriatal input and striatonigral output systems. Nature. 1984;311:461–464. doi: 10.1038/311461a0. [DOI] [PubMed] [Google Scholar]
  40. Girasole AE, Nelson AB. Bridging the Gap: Muscarinic M4 Receptors Promote Striatal Plasticity in Health and Disease. Neuron. 2015;88:621–623. doi: 10.1016/j.neuron.2015.11.007. [DOI] [PubMed] [Google Scholar]
  41. Glascher J, Daw N, Dayan P, O'Doherty JP. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron. 2010;66:585–595. doi: 10.1016/j.neuron.2010.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Glimcher PW, Lau B. Rethinking the thalamus. Nat Neurosci. 2005;8:983–984. doi: 10.1038/nn0805-983. [DOI] [PubMed] [Google Scholar]
  43. Goldberg JA, Ding JB, Surmeier DJ. Muscarinic modulation of striatal function and circuitry. Handbook of experimental pharmacology. 2012:223–241. doi: 10.1007/978-3-642-23274-9_10. [DOI] [PubMed] [Google Scholar]
  44. Goldberg JA, Reynolds JN. Spontaneous firing and evoked pauses in the tonically active cholinergic interneurons of the striatum. Neuroscience. 2011;198:27–43. doi: 10.1016/j.neuroscience.2011.08.067. [DOI] [PubMed] [Google Scholar]
  45. Gonzales KK, Pare JF, Wichmann T, Smith Y. GABAergic inputs from direct and indirect striatal projection neurons onto cholinergic interneurons in the primate putamen. J Comp Neurol. 2013;521:2502–2522. doi: 10.1002/cne.23295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Graybiel AM. Habits, rituals, and the evaluative brain. Annu Rev Neurosci. 2008;31:359–387. doi: 10.1146/annurev.neuro.29.051605.112851. [DOI] [PubMed] [Google Scholar]
  47. Graybiel AM, Aosaki T, Flaherty AW, Kimura M. The basal ganglia and adaptive motor control. Science. 1994;265:1826–1831. doi: 10.1126/science.8091209. [DOI] [PubMed] [Google Scholar]
  48. Graybiel AM, Baughman RW, Eckenstein F. Cholinergic neuropil of the striatum observes striosomal boundaries. Nature. 1986;323:625–627. doi: 10.1038/323625a0. [DOI] [PubMed] [Google Scholar]
  49. Graybiel AM, Ragsdale CW., Jr Histochemically distinct compartments in the striatum of human, monkeys, and cat demonstrated by acetylthiocholinesterase staining. Proc Natl Acad Sci U S A. 1978;75:5723–5726. doi: 10.1073/pnas.75.11.5723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Groenewegen HJ, Berendse HW. The specificity of the ‘nonspecific’ midline and intralaminar thalamic nuclei. Trends Neurosci. 1994;17:52–57. doi: 10.1016/0166-2236(94)90074-4. [DOI] [PubMed] [Google Scholar]
  51. Haber S, McFarland NR. The place of the thalamus in frontal cortical-basal ganglia circuits. The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry. 2001;7:315–324. doi: 10.1177/107385840100700408. [DOI] [PubMed] [Google Scholar]
  52. Halliday G, Lees A, Stern M. Milestones in Parkinson’s disease–clinical and pathologic features. Movement disorders: official journal of the Movement Disorder Society. 2011;26:1015–1021. doi: 10.1002/mds.23669. [DOI] [PubMed] [Google Scholar]
  53. Henderson JM, Carpenter K, Cartwright H, Halliday GM. Loss of thalamic intralaminar nuclei in progressive supranuclear palsy and Parkinson’s disease: clinical and therapeutic implications. Brain. 2000;123(Pt 7):1410–1421. doi: 10.1093/brain/123.7.1410. [DOI] [PubMed] [Google Scholar]
  54. Henderson JM, Schleimer SB, Allbutt H, Dabholkar V, Abela D, Jovic J, Quinlivan M. Behavioural effects of parafascicular thalamic lesions in an animal model of parkinsonism. Behav Brain Res. 2005;162:222–232. doi: 10.1016/j.bbr.2005.03.017. [DOI] [PubMed] [Google Scholar]
  55. Hori Y, Minamimoto T, Kimura M. Neuronal encoding of reward value and direction of actions in the primate putamen. J Neurophysiol. 2009;102:3530–3543. doi: 10.1152/jn.00104.2009. [DOI] [PubMed] [Google Scholar]
  56. Houk JC, Adams JL, Barto AG. A model of how the basal ganglia generate and use neural signals that predict reinforcement. In: Houk JC, et al., editors. Models of Information Processing in the Basal Ganglia. Cambridge: The MIT Press; 1995. pp. 249–270. [Google Scholar]
  57. Huerta-Ocampo I, Mena-Segovia J, Bolam JP. Convergence of cortical and thalamic input to direct and indirect pathway medium spiny neurons in the striatum. Brain structure & function. 2014;219:1787–1800. doi: 10.1007/s00429-013-0601-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Inokawa H, Yamada H, Matsumoto N, Muranishi M, Kimura M. Juxtacellular labeling of tonically active neurons and phasically active neurons in the rat striatum. Neuroscience. 2010;168:395–404. doi: 10.1016/j.neuroscience.2010.03.062. [DOI] [PubMed] [Google Scholar]
  59. Kato S, Kuramochi M, Kobayashi K, Fukabori R, Okada K, Uchigashima M, Watanabe M, Tsutsui Y. Selective neural pathway targeting reveals key roles of thalamostriatal projection in the control of visual discrimination. J Neurosci. 2011;31:17169–17179. doi: 10.1523/JNEUROSCI.4005-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Kimura M, Rajkowski J, Evarts E. Tonically discharging putamen neurons exhibit set-dependent responses. Proc Natl Acad Sci U S A. 1984;81:4998–5001. doi: 10.1073/pnas.81.15.4998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Kobayashi K, Morita S, Sawada H, Mizuguchi T, Yamada K, Nagatsu I, Fujita K, Kreitman RJ, Pastan I, Nagatsu T. Immunotoxin-mediated conditional disruption of specific neurons in transgenic mice. Proc Natl Acad Sci U S A. 1995;92:1132–1136. doi: 10.1073/pnas.92.4.1132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Kreitzer AC, Malenka RC. Endocannabinoid-mediated rescue of striatal LTD and motor deficits in Parkinson’s disease models. Nature. 2007;445:643–647. doi: 10.1038/nature05506. [DOI] [PubMed] [Google Scholar]
  63. Kreitzer AC, Malenka RC. Striatal Plasticity and Basal Ganglia Circuit Function. Neuron. 2008;60:543–554. doi: 10.1016/j.neuron.2008.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Lacey CJ, Bolam JP, Magill PJ. Novel and distinct operational principles of intralaminar thalamic neurons and their striatal projections. J Neurosci. 2007;27:4374–4384. doi: 10.1523/JNEUROSCI.5519-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Lapper SR, Smith Y, Sadikot AF, Parent A, Bolam JP. Cortical input to parvalbumin-immunoreactive neurones in the putamen of the squirrel monkey. Brain Res. 1992;580:215–224. doi: 10.1016/0006-8993(92)90947-8. [DOI] [PubMed] [Google Scholar]
  66. Lau B, Glimcher PW. Value representations in the primate caudate nucleus during matching behavior. Neuron. 2008;58:451–463. doi: 10.1016/j.neuron.2008.02.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Leong YC, Radulescu A, Daniel R, DeWoskin V, Niv Y. Dynamic Interaction between Reinforcement Learning and Attention in Multidimensional Environments. Neuron. 2017;93:451–463. doi: 10.1016/j.neuron.2016.12.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Macchi G, Bentivoglio M, Molinari M, Minciacchi D. The thalamo-caudate versus thalamo-cortical projections as studied in the cat with fluorescent retrograde double labeling. Exp Brain Res. 1984;54:225–239. doi: 10.1007/BF00236222. [DOI] [PubMed] [Google Scholar]
  69. Mackintosh NJ. A theory of attention: variations in the associability of stimuli with reinforcement. Psychol Rev. 1975;82:276–298. [Google Scholar]
  70. Mamaligas AA, Ford CP. Spontaneous Synaptic Activation of Muscarinic Receptors by Striatal Cholinergic Neuron Firing. Neuron. 2016;91:574–586. doi: 10.1016/j.neuron.2016.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Matamales M, Skrbis Z, Hatch RJ, Balleine BW, Gotz J, Bertran-Gonzalez J. Aging-Related Dysfunction of Striatal Cholinergic Interneurons Produces Conflict in Action Selection. Neuron. 2016;90:362–373. doi: 10.1016/j.neuron.2016.03.006. [DOI] [PubMed] [Google Scholar]
  72. Matsumoto N, Minamimoto T, Graybiel AM, Kimura M. Neurons in the thalamic CM-Pf complex supply striatal neurons with information about behaviorally significant sensory events. J Neurophysiol. 2001;85:960–976. doi: 10.1152/jn.2001.85.2.960. [DOI] [PubMed] [Google Scholar]
  73. Maurice N, Mercer J, Chan CS, Hernandez-Lopez S, Held J, Tkatch T, Surmeier DJ. D2 dopamine receptor-mediated modulation of voltage-dependent Na+ channels reduces autonomous activity in striatal cholinergic interneurons. J Neurosci. 2004;24:10289–10301. doi: 10.1523/JNEUROSCI.2155-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. McGuire JT, Nassar MR, Gold JI, Kable JW. Functionally dissociable influences on learning rate in a dynamic environment. Neuron. 2014;84:870–881. doi: 10.1016/j.neuron.2014.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Minamimoto T, Hori Y, Kimura M. Complementary process to response bias in the centromedian nucleus of the thalamus. Science. 2005;308:1798–1801. doi: 10.1126/science.1109154. [DOI] [PubMed] [Google Scholar]
  76. Minamimoto T, Hori Y, Kimura M. Roles of the thalamic CM-PF complex-Basal ganglia circuit in externally driven rebias of action. Brain Res Bull. 2009;78:75–79. doi: 10.1016/j.brainresbull.2008.08.013. [DOI] [PubMed] [Google Scholar]
  77. Minamimoto T, Hori Y, Yamanaka K, Kimura M. Neural signal for counteracting pre-action bias in the centromedian thalamic nucleus. Frontiers in systems neuroscience. 2014;8:3. doi: 10.3389/fnsys.2014.00003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Minamimoto T, Kimura M. Participation of the thalamic CM-Pf complex in attentional orienting. J Neurophysiol. 2002;87:3090–3101. doi: 10.1152/jn.2002.87.6.3090. [DOI] [PubMed] [Google Scholar]
  79. Montague PR, Dayan P, Sejnowski TJ. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci. 1996;16:1936–1947. doi: 10.1523/JNEUROSCI.16-05-01936.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Morris G, Arkadir D, Nevet A, Vaadia E, Bergman H. Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons. Neuron. 2004;43:133–143. doi: 10.1016/j.neuron.2004.06.012. [DOI] [PubMed] [Google Scholar]
  81. Nakahara H, Itoh H, Kawagoe R, Takikawa Y, Hikosaka O. Dopamine neurons can represent context-dependent prediction error. Neuron. 2004;41:269–280. doi: 10.1016/s0896-6273(03)00869-9. [DOI] [PubMed] [Google Scholar]
  82. Okada K, Nishizawa K, Fukabori R, Kai N, Shiota A, Ueda M, Tsutsui Y, Sakata S, Matsushita N, Kobayashi K. Enhanced flexibility of place discrimination learning by targeting striatal cholinergic interneurons. Nature communications. 2014;5:3778. doi: 10.1038/ncomms4778. [DOI] [PubMed] [Google Scholar]
  83. Parker PR, Lalive AL, Kreitzer AC. Pathway-Specific Remodeling of Thalamostriatal Synapses in Parkinsonian Mice. Neuron. 2016;89:734–740. doi: 10.1016/j.neuron.2015.12.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Pashler H. The psychology of attention. Cambridge, Massachusetts: The MIT Press; 1998. p. 494. [Google Scholar]
  85. Pearce JM, Hall G. A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychol Rev. 1980;87:532–552. [PubMed] [Google Scholar]
  86. Perez-Rosello T, Figueroa A, Salgado H, Vilchis C, Tecuapetla F, Guzman JN, Galarraga E, Bargas J. Cholinergic control of firing pattern and neurotransmission in rat neostriatal projection neurons: role of CaV2.1 and CaV2.2 Ca2+ channels. J Neurophysiol. 2005;93:2507–2519. doi: 10.1152/jn.00853.2004. [DOI] [PubMed] [Google Scholar]
  87. Peterson DA, Elliott C, Song DD, Makeig S, Sejnowski TJ, Poizner H. Probabilistic reversal learning is impaired in Parkinson’s disease. Neuroscience. 2009;163:1092–1101. doi: 10.1016/j.neuroscience.2009.07.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Ragsdale CW, Jr, Graybiel AM. Compartmental organization of the thalamostriatal connection in the cat. J Comp Neurol. 1991;311:134–167. doi: 10.1002/cne.903110110. [DOI] [PubMed] [Google Scholar]
  89. Raz A, Feingold A, Zelanskaya V, Vaadia E, Bergman H. Neuronal synchronization of tonically active neurons in the striatum of normal and parkinsonian primates. J Neurophysiol. 1996;76:2083–2088. doi: 10.1152/jn.1996.76.3.2083. [DOI] [PubMed] [Google Scholar]
  90. Rescorla RA, Wagner AR. Current research and theory. In: Black AH, Prokasy WF, editors. Classical Conditioning II. New York: Appleton Century Crofts; 1972. pp. 64–99. [Google Scholar]
  91. Reynolds JN, Hyland BI, Wickens JR. A cellular mechanism of reward-related learning. Nature. 2001;413:67–70. doi: 10.1038/35092560. [DOI] [PubMed] [Google Scholar]
  92. Rice ME, Patel JC, Cragg SJ. Dopamine release in the basal ganglia. Neuroscience. 2011;198:112–137. doi: 10.1016/j.neuroscience.2011.08.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Roesch MR, Calu DJ, Esber GR, Schoenbaum G. Neural correlates of variations in event processing during learning in basolateral amygdala. J Neurosci. 2010;30:2464–2471. doi: 10.1523/JNEUROSCI.5781-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Roesch MR, Esber GR, Li J, Daw ND, Schoenbaum G. Surprise! Neural correlates of Pearce-Hall and Rescorla-Wagner coexist within the brain. Eur J Neurosci. 2012;35:1190–1200. doi: 10.1111/j.1460-9568.2011.07986.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Sadikot AF, Parent A, Francois C. Efferent connections of the centromedian and parafascicular thalamic nuclei in the squirrel monkey: a PHA-L study of subcortical projections. The Journal of comparative neurology. 1992;315:137–159. doi: 10.1002/cne.903150203. [DOI] [PubMed] [Google Scholar]
  96. Samejima K, Ueda Y, Doya K, Kimura M. Representation of action-specific reward values in the striatum. Science. 2005;310:1337–1340. doi: 10.1126/science.1115270. [DOI] [PubMed] [Google Scholar]
  97. Satoh T, Nakai S, Sato T, Kimura M. Correlated coding of motivation and outcome of decision by dopamine neurons. J Neurosci. 2003;23:9913–9923. doi: 10.1523/JNEUROSCI.23-30-09913.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Schultz W. The phasic reward signal of primate dopamine neurons. Adv Pharmacol. 1998a;42:686–690. doi: 10.1016/s1054-3589(08)60841-8. [DOI] [PubMed] [Google Scholar]
  99. Schultz W. Predictive reward signal of dopamine neurons. J Neurophysiol. 1998b;80:1–27. doi: 10.1152/jn.1998.80.1.1. [DOI] [PubMed] [Google Scholar]
  100. Schultz W, Dickinson A. Neuronal coding of prediction errors. Annu Rev Neurosci. 2000;23:473–500. doi: 10.1146/annurev.neuro.23.1.473. [DOI] [PubMed] [Google Scholar]
  101. Shen W, Flajolet M, Greengard P, Surmeier DJ. Dichotomous dopaminergic control of striatal synaptic plasticity. Science. 2008;321:848–851. doi: 10.1126/science.1160575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Shen W, Hamilton SE, Nathanson NM, Surmeier DJ. Cholinergic suppression of KCNQ channel currents enhances excitability of striatal medium spiny neurons. J Neurosci. 2005;25:7449–7458. doi: 10.1523/JNEUROSCI.1381-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Shen W, Plotkin JL, Francardo V, Ko WK, Xie Z, Li Q, Fieblinger T, Wess J, Neubig RR, Lindsley CW, Conn PJ, Greengard P, Bezard E, Cenci MA, Surmeier DJ. M4 Muscarinic Receptor Signaling Ameliorates Striatal Plasticity Deficits in Models of L-DOPA-Induced Dyskinesia. Neuron. 2015;88:762–773. doi: 10.1016/j.neuron.2015.10.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Shen W, Tian X, Day M, Ulrich S, Tkatch T, Nathanson NM, Surmeier DJ. Cholinergic modulation of Kir2 channels selectively elevates dendritic excitability in striatopallidal neurons. Nat Neurosci. 2007;10:1458–1466. doi: 10.1038/nn1972. [DOI] [PubMed] [Google Scholar]
  105. Shimo Y, Hikosaka O. Role of tonically active neurons in primate caudate in reward-oriented saccadic eye movement. J Neurosci. 2001;21:7804–7814. doi: 10.1523/JNEUROSCI.21-19-07804.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Sidibe M, Smith Y. Thalamic inputs to striatal interneurons in monkeys: synaptic organization and co-localization of calcium binding proteins. Neuroscience. 1999;89:1189–1208. doi: 10.1016/s0306-4522(98)00367-4. [DOI] [PubMed] [Google Scholar]
  107. Smith Y, Galvan A, Ellender TJ, Doig N, Villalba RM, Huerta-Ocampo I, Wichmann T, Bolam JP. The thalamostriatal system in normal and diseased states. Frontiers in systems neuroscience. 2014;8:5. doi: 10.3389/fnsys.2014.00005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Smith Y, Raju DV, Pare JF, Sidibe M. The thalamostriatal system: a highly specific network of the basal ganglia circuitry. Trends Neurosci. 2004;27:520–527. doi: 10.1016/j.tins.2004.07.004. [DOI] [PubMed] [Google Scholar]
  109. Surmeier DJ, Plotkin J, Shen W. Dopamine and synaptic plasticity in dorsal striatal circuits controlling action selection. Curr Opin Neurobiol. 2009;19:621–628. doi: 10.1016/j.conb.2009.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Sutton RS. Learning to predict by the method of temporal differences. Mach Learn. 1988;3:9–44. [Google Scholar]
  111. Sutton RS, Barto AG. Reinforcement Learning. Cambridge: The MIT press; 1998. [Google Scholar]
  112. Threlfell S, Lalic T, Platt NJ, Jennings KA, Deisseroth K, Cragg SJ. Striatal dopamine release is triggered by synchronized activity in cholinergic interneurons. Neuron. 2012;75:58–64. doi: 10.1016/j.neuron.2012.04.038. [DOI] [PubMed] [Google Scholar]
  113. Vandermaelen CP, Kitai ST. Intracellular analysis of synaptic potentials in rat neostriatum following stimulation of the cerebral cortex, thalamus, and substantia nigra. Brain Res Bull. 1980;5:725–733. doi: 10.1016/0361-9230(80)90212-9. [DOI] [PubMed] [Google Scholar]
  114. Villalba RM, Wichmann T, Smith Y. Neuronal loss in the caudal intralaminar thalamic nuclei in a primate model of Parkinson’s disease. Brain structure & function. 2014;219:381–394. doi: 10.1007/s00429-013-0507-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  115. Wilson CJ. The mechanism of intrinsic amplification of hyperpolarizations and spontaneous bursting in striatal cholinergic interneurons. Neuron. 2005;45:575–585. doi: 10.1016/j.neuron.2004.12.053. [DOI] [PubMed] [Google Scholar]
  116. Wilson CJ, Chang HT, Kitai ST. Firing patterns and synaptic potentials of identified giant aspiny interneurons in the rat neostriatum. J Neurosci. 1990;10:508–519. doi: 10.1523/JNEUROSCI.10-02-00508.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Yamada H, Matsumoto N, Kimura M. Tonically active neurons in the primate caudate nucleus and putamen differentially encode instructed motivational outcomes of action. J Neurosci. 2004;24:3500–3510. doi: 10.1523/JNEUROSCI.0068-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Yamada H, Matsumoto N, Kimura M. History- and current instruction-based coding of forthcoming behavioral outcomes in the striatum. J Neurophysiol. 2007;98:3557–3567. doi: 10.1152/jn.00779.2007. [DOI] [PubMed] [Google Scholar]
  119. Zhang K, Sejnowski TJ. Neuronal tuning: To sharpen or to broaden? Neural computation. 1999;11:75–84. doi: 10.1162/089976699300016809. [DOI] [PubMed] [Google Scholar]
  120. Ztaou S, Maurice N, Camon J, Guiraudie-Capraz G, Kerkerian-Le Goff L, Beurrier C, Liberge M, Amalric M. Involvement of Striatal Cholinergic Interneurons and M1 and M4 Muscarinic Receptors in Motor Symptoms of Parkinson’s Disease. J Neurosci. 2016;36:9161–9172. doi: 10.1523/JNEUROSCI.0873-16.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES