Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Dec 1.
Published in final edited form as: Curr Opin Neurobiol. 2014 Jul 23;0:142–147. doi: 10.1016/j.conb.2014.07.011

Two tales of how expectation of reward modulates behavior

Long Ding 1, David J Perkel 2
PMCID: PMC4254302  NIHMSID: NIHMS612677  PMID: 25062505

Abstract

Expectation of reward modulates many types of behaviors. Here we highlight two lines of research on reward-modulated perceptual decision making in primates and social context-modulated singing in songbirds, respectively. These two seemingly distinct behaviors are both known to involve cortico-basal ganglia-thalamic circuits. The underlying computations may be conceptualized using a simple, common framework. We summarize and compare our current knowledge of the two fields to motivate new experiments for each field, with the goal of finding general principles for how the brain implements reward-modulated behavior.

Introduction

In goal-directed behaviors, expectations of reinforcement are an integral part of the definition of goals. For example, for animals foraging for food, it is often beneficial to have some knowledge of how much food reward may be expected of currently available food sources and then act accordingly. For animals trying to attract a mate, it is likely beneficial to appreciate what the potential mates prefer, which may be thought of as an expectation of successful mate attraction given certain attributes of the mate-attracting behavior. In this review, we highlight two lines of research that deal with seemingly distinct types of goal-directed behaviors: reward-modulated perceptual decision making in primates and social context-dependent singing behavior in songbirds. We suggest that, despite the apparent differences in species, behavioral modality and the general premise of goals, these two lines of research share many similarities and could inform each other to advance our understanding of the neural basis of how expectation of reinforcement modulates behavior in general.

In the following sections, we first describe the behaviors of interest. We then introduce a common framework to describe the general computations that are needed for these two types of behaviors. Next we describe the cortico-basal ganglia-thalamo-cortical circuits that are (likely) involved in mediating these behaviors, drawing evidence from fMRI studies in humans and neurophysiological studies in monkeys and songbirds. Finally, we put forth some open questions for future studies, guided by a comparison of existing knowledge between the two lines of research.

The behaviors and general framework

Reward-modulated perceptual decision making

To illustrate some key properties of reward-modulated perceptual decision behavior, we focus on examples of adding explicit reward expectation manipulations to a well-known perceptual decision task, the visual motion direction discrimination task [1,2]. On a basic motion discrimination task, a subject sees a field of moving dots on a screen and needs to report the global motion of these dots. On each trial, there are two possible directions (i.e., two possible choices) and a proportion of the dots move in one direction while the other dots move in random directions. This proportion specifies the motion stimulus strength and influences the difficulty of the perceptual judgment: if the motion strength is high, i.e., most dots are moving coherently in one direction with only a few dots moving randomly, it is easy to tell that the global motion is in that direction; conversely, if the motion strength is low, i.e., only a few dots are moving coherently in one direction in a field of randomly moving dots, it is hard to determine the true global motion. Another factor influencing the task difficulty is the viewing time: when the viewing time is controlled by the experimenter, the longer the viewing time, the more likely the subject makes a correct decision; when the viewing time is controlled by the subject, trials with lower motion strength tend to have longer viewing time (i.e., longer response time).

With the basic task design, reward expectation is not explicitly manipulated, except that all correct choices earn a fixed reward and error choices are not rewarded and/or incur a fixed penalty (e.g., time out). One way to explicitly manipulate reward expectation is thus to make the reward size associated with the two choices unequal. In humans and monkeys, this kind of manipulation has robust effects on behavioral performance [3-8]. These effects include: 1) subjects are more likely to pick the choice associated with the larger reward; such a choice bias is more prominent on trials with low motion strength, while minimal on trials with very high motion strength; 2) when subjects control the viewing time, for the same motion strength, response time is shorter for the choice associated with the larger reward; such response time bias also depends on motion strength, with the largest bias occurring on trials with low motion strength.

To understand performance on this task, consider a simple framework illustrated in Fig 1A. At the onset of the decision process, the brain combines sensory information related to the moving dots and contextual information related to the reward contingency, and produces a categorical “left” or “right” choice. Using concepts from the perception literature, a perceptual categorization process can be generally thought of as having two components: sensory representation, reflecting the conversion of sensory inputs into task-relevant evidence, and readout, reflecting the conversion of the task-relevant evidence into a categorical decision [9]. A priori, the contextual information may influence both components. As an example of reward modulation of sensory representation, when the “left” choice is paired with the larger reward, there may appear to be more leftward-moving dots than what are actually present, biasing the evidence toward the “left” choice. As an example of reward modulation of readout, in the same left-choice-with-larger-reward scenario, the subject may require observing a total of 100 rightward-moving dots before making a “right” choice, but only 20 leftward-moving dots to make a “left” choice, thereby reading out the same evidence with a leftward bias. Based on model fitting using “accumulate-to-bound” models, reward-biased readout better recapitulates observed behavioral effects on reward-biased motion discrimination tasks [3-6] and this seems to be a common finding for many unequal-payoff perceptual decision tasks [7,10-13]. During decision formation, the brain may also generate predictions of how much reward is expected. A comparison between these predictions and actual reward received can then instruct changes of the categorical decision-formation process to improve future decisions.

Figure 1.

Figure 1

A common conceptual framework for reward-modulated perceptual decision making (A) and social context-modulated vocal behavior (B). Blue text indicates the conceptual functions (identical for the two panels) and black text provides behavior-specific information.

Social context-dependent singing

Another example in which behavior is altered depending on the context, perhaps including the expectation of reward, is in avian song. Songbirds use song for courtship, territory defense and individual recognition. In zebra finches, a young male learns a tutor song through a trial-and-error process [14], in which the young bird’s own songs are compared against a memorized template [15]. This learning occurs in an unsupervised manner; no external rewards are needed to guide progress or indicate success. Over a period of ~3 months, songs show gradual improvement until a good match is achieved [14]. During this period, song variability also decreases such that the adult song becomes acoustically stereotyped, save for a small amount of residual variability.

This residual variability is modulated by social context, potentially in anticipation of social reward. When a male sings as part of female-directed courtship (“directed song”), trial-to-trial song variability drops to about half of the value measured when the bird sings alone (“undirected song”) [16-18]. Females care about this distinction, spending more time near a speaker playing the mate’s directed song than one playing undirected song [19]. One practical advantage of this behavior is that it is natural and, unlike the primate decision task, does not require extensive training.

We can examine song variability in the same conceptual framework as described above for primate perceptual decisions. Consider a simple framework illustrated in Fig 1B. We make two assumptions: the presence of a female indicates the possibility of a social reward; and production of more stereotyped songs, compared to more variable songs, incurs some cost or reduced benefit to the male bird. This cost may include effort spent in maintaining tighter control of motor production and loss of opportunity for practicing to improve his songs in future. At a given time, the male bird makes a decision, based on sensory information (whether a female bird is present) and expectation of reinforcement (expected cost of performing low-variability songs and probability of attracting that particular female bird). This combination of information is fed into the song system controlling song variability. At the end of a performance, the male bird may either gain a mating opportunity or fail to do so. A failure may help generate a prediction error to guide either the decision process and/or motivate more practicing to improve song stereotypy. To date, songbird researchers have made a binary distinction between directed and undirected song. There has been no exploration into whether song variability is graded or rather switched in a binary fashion from high to low and back. Similarly, unlike studies in primate decision making, there has been no exploration into how song quality varies with manipulations of the probabilities or amounts of reward.

Cortico-basal ganglia-thalamo-cortical loops

Reward-modulated perceptual decision making

The neuronal mechanisms underlying reward-modulated perceptual decisions are only beginning to be explored. In the oculomotor basal ganglia loop (Fig 2A), every major station tested on the basic motion discrimination task with saccade output, (i.e., the frontal eye field (FEF), lateral intraparietal area (LIP), the caudate nucleus and the superior colliculus (SC)), contains neurons with decision-related activity [20-25]. The LIP and caudate nucleus, in particular, play a causal role in perceptual decision-making on the motion discrimination task [26**,27]. On value-based decision tasks that do not manipulate perceptual demand, these same regions display various kinds of reward modulation of visual cue-evoked activity (for a severely truncated list of examples, see [28-31] for LIP, FEF, SC and the caudate nucleus). Thus the oculomotor basal ganglia loop seems a solid starting point to search for neural mechanisms that mediate reward-modulated perceptual decisions. Indeed, theoretical studies have demonstrated that the cortico-basal ganglia circuit can make optimal perceptual decisions under different reinforcement contexts [32,33]; in human subjects performing unequal-reward perceptual tasks, the fronto-parietal network, which includes FEF and LIP, shows activation patterns suggestive of incorporation of reward and sensory information [4,13]; in monkeys performing unequal-reward motion discrimination tasks, LIP neurons show reward-modulated activity that is consistent with changes in readout for decision formation [5**], FEF neurons encode an error-predicting signal that may contribute to decision evaluation [34], and dopamine neurons in the SNc carry reward-predictive activity that may be used as an instructive signal for improving future decisions [35**]. Functional roles of other cortical and subcortical regions remain to be determined.

Figure 2.

Figure 2

Simplified diagram of candidate brain regions for reward-modulated perceptual decision-making (A) and social context-modulated vocal behavior (B). Abbreviations: DA: dopamine; DLM: medial portion of the dorsolateral nucleus of the thalamus; FEF: frontal eye field; GPe: external segment of the globus pallidus; GPi: internal segment of the globus pallidus; HVC: used as a proper name; LIP: lateral intraparietal area; LMAN: lateral magnocellular nucleus of the anterior nidopallium; MD: mediodorsal nucleus of thalamus; RA: robust nucleus of the arcopallium; SC: superior colliculus; SNc: substantia nigra pars compact; SNr: substantia nigra pars reticulata; STN: subthalamic nucleus; VTA: ventral tegmental area.

Social context-dependent singing

The neuronal mechanisms underlying social context-dependent singing behavior have been much more extensively studied. In the schematic diagram of the song system (Fig. 2B), the pathway from HVC, through Area X, DLM and LMAN to RA is considered homologous to the mammalian cortex-basal ganglia-thalamic pathway [36-40]. Several lines of evidence suggest that song variability arises in this pathway. First, whether songs are variable or not, HVC activity is precisely timed with low variability, while RA and LMAN activity show considerable modulation of variability consistent with song variability [41,42**]. Second, silencing or lesioning LMAN dramatically reduces song variability [16,17,43], suggesting that the basal ganglia pathway is causally involved in song variability modulation. Third, in anesthetized animals, single-shock stimuli of HVC give rise to variable responses in DLM and LMAN [44*], suggesting that the basal ganglia pathway is capable of generating neural variability observed in LMAN.

Within the basal ganglia pathway, the social-context modulation of song variability likely depends on dopaminergic modulation of Area × circuitry. Activity of midbrain neurons changes with social context [45,46] and dopamine levels in the extracellular space of Area × are higher during directed song to a female bird, when a social reward might be expected, than during undirected song, when a social reward is not expected immediately [47]. Dopamine influences multiple properties of Area × circuitry, including Area × spiny neuron electrophysiology [48,49], the spontaneous firing rate of the pallidal neurons in Area × and evoked responses in pallidal and DLM neurons by HVC electrical stimulation [44*]. Although Area × may not be the only source of song variability (zebra finches sing with normal song variability after recovering from Area × lesions [50**]), two recent studies provided direct evidence for a causal role of dopaminergic modulation of Area × circuitry in social-context modulation of song variability: infusion of a D1 receptor antagonist into Area × prevents the reduction in variability observed during directed song [44*,51] and activity of Area × GPi-like neurons remains more variable during undirected songs than during directed songs, even after LMAN lesions [42].

Open questions

As may already be apparent from the brief review of our current knowledge of decision making in primates and singing in birds, the former line of research could benefit from more mechanistic investigations, while the latter may be at a stage where an intuitive and predictive computational framework would be valuable to guide future research. For example, in the oculomotor system diagram in Fig 2A, only three regions have been examined so far to determine how single neurons behave during reward-biased perceptual decision tasks, in clear contrast to the wealth of knowledge in the song system, in which we know how social context modulates neural activity at most major stations. In particular, specific computational roles have been proposed for the core basal ganglia structures [32,33**,52-54]. Determining which of these roles are actually in play will be important for our understanding of the neural mechanisms implementing the computations needed for the observed behavior. In addition, past studies have focused on average firing rate of neurons. Is the variability of neural activity also actively modulated according to the reward context in perceptual decision making?

In songbirds, we have made the assumption that song stereotypy has a cost, given that male birds do not seem to sing highly stereotyped songs without a social need. Is there a way to quantify such a cost relative to the social reward expectation? In zebra finches, in which most of the cited work was conducted, the song repertoire does not normally change in adult birds. Could it be that one consequence of practicing is to reduce the cost of song stereotypy, in addition to improving the acoustic quality of songs? Another untested issue is whether song variability is graded or under binary control. Careful analysis of the transition from the initial minutes when the male sings directed song to a newly-introduced female to later stages when he apparently loses interest and sings undirected song, could help clarify this issue. At a more general level, how do the dopamine neurons acquire social context modulation and how do they respond to a successful or courtship performance? As it seems that the same neural structures are involved in both social context-dependent modulation of song variability and vocal learning in young birds, how much do the detailed neural mechanisms overlap between these behaviors? For example, dopamine neurons receive indirect disinhibitory input from Area × via the ventral pallidum, which confers on them selective auditory responses for the bird’s own song [55]. Such auditory selectivity suggests that dopamine neurons may be able to evaluate a bird’s own song relative to the memorized tutor song and provide an internal “reward” signal during song practice. If so, how is this kind of signal multiplexed with social context information?

We have attempted to unify thinking about these two behaviors by placing them into a similar conceptual framework. It is possible that their parallels will strengthen, and this juxtaposition will benefit work on both systems. It is also possible that, as additional data become available, fundamental differences will emerge between them. In either case, we feel that attempting to unify conceptual models of brain function is a valuable goal, provided that these attempts lead to experimentally testable predictions.

Highlights.

  • A common framework for reward-modulated perception and vocal behavior

  • Both behaviors involve the cortico-basal ganglia pathway

  • Reward modulates the readout process in perceptual decision making

  • Social context modulates neural variability via dopamine-dependent mechanisms

Acknowledgements

We thank Joshua I. Gold for valuable comments on the manuscript. This work was supported by the National Eye Institute grant R01 EY022411 (L.D.) and National Institute of Mental Health grant R01 MH066128 (D.P.).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

No conflicts of interest to declare.

References

  • [1].Britten KH, Shadlen MN, Newsome WT, Movshon JA. The analysis of visual motion: a comparison of neuronal and psychophysical performance. J Neurosci. 1992;12:4745–4765. doi: 10.1523/JNEUROSCI.12-12-04745.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Morgan MJ, Ward R. Conditions for motion flow in dynamic visual noise. Vision Res. 1980;20:431–435. doi: 10.1016/0042-6989(80)90033-4. [DOI] [PubMed] [Google Scholar]
  • [3].Feng S, Holmes P, Rorie A, Newsome WT. Can monkeys choose optimally when faced with noisy stimuli and unequal rewards? PLoS Comput Biol. 2009;5:e1000284. doi: 10.1371/journal.pcbi.1000284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Mulder MJ, Wagenmakers EJ, Ratcliff R, Boekel W, Forstmann BU. Bias in the brain: a diffusion model analysis of prior probability and potential payoff. J Neurosci. 2012;32:2335–2343. doi: 10.1523/JNEUROSCI.4156-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5**].Rorie AE, Gao J, McClelland JL, Newsome WT. Integration of sensory and reward information during perceptual decision-making in lateral intraparietal cortex (LIP) of the macaque monkey. PLoS One. 2010;5:e9308. doi: 10.1371/journal.pone.0009308. The authors demonstrate how LIP neurons respond to reward expectation on a visual motion discrimination task and, combining with systematic model-based analyses, show that the reward modulation is consistent with changes in signals related to the readout, instead of sensory representation, in the decision process.
  • [6].Voss A, Rothermund K, Voss J. Interpreting the parameters of the diffusion model: an empirical validation. Mem Cognit. 2004;32:1206–1220. doi: 10.3758/bf03196893. [DOI] [PubMed] [Google Scholar]
  • [7].Wagenmakers EJ, Ratcliff R, Gomez P, McKoon G. A Diffusion Model Account of Criterion Shifts in the Lexical Decision Task. J Mem Lang. 2008;58:140–159. doi: 10.1016/j.jml.2007.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Teichert T, Ferrera VP. Suboptimal integration of reward magnitude and prior reward likelihood in categorical decisions by monkeys. Front Neurosci. 2010;4:186. doi: 10.3389/fnins.2010.00186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Gold JI, Ding L. How mechanisms of perceptual decision-making affect the psychometric function. Prog Neurobiol. 2013;103:98–114. doi: 10.1016/j.pneurobio.2012.05.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Diederich A. A further test of sequential-sampling models that account for payoff effects on response bias in perceptual decision tasks. Perception & Psychophysics. 2008;70:229–256. doi: 10.3758/pp.70.2.229. [DOI] [PubMed] [Google Scholar]
  • [11].Diederich A, Busemeyer JR. Modeling the effects of payoff on response bias in a perceptual discrimination task: bound-change, drift-rate-change, or two-stage-processing hypothesis. Percept Psychophys. 2006;68:194–207. doi: 10.3758/bf03193669. [DOI] [PubMed] [Google Scholar]
  • [12].Gao JA, Tortell R, McClelland JL. Dynamic Integration of Reward and Stimulus Information in Perceptual Decision-Making. PLoS One. 2011;6 doi: 10.1371/journal.pone.0016749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Summerfield C, Koechlin E. Economic value biases uncertain perceptual choices in the parietal and prefrontal cortices. Frontiers in Human Neuroscience. 2010;4 doi: 10.3389/fnhum.2010.00208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Deregnaucourt S, Mitra PP, Feher O, Pytte C, Tchernichovski O. How sleep affects the developmental learning of bird song. Nature. 2005;433:710–716. doi: 10.1038/nature03275. [DOI] [PubMed] [Google Scholar]
  • [15].Konishi M. The role of auditory feedback in the control of vocalization in the white-crowned sparrow. Z Tierpsychol. 1965;22:770–783. [PubMed] [Google Scholar]
  • [16].Kao MH, Brainard MS. Lesions of an avian basal ganglia circuit prevent context-dependent changes to song variability. J Neurophysiol. 2006;96:1441–1455. doi: 10.1152/jn.01138.2005. [DOI] [PubMed] [Google Scholar]
  • [17].Kao MH, Doupe AJ, Brainard MS. Contributions of an avian basal ganglia-forebrain circuit to real-time modulation of song. Nature. 2005;433:638–643. doi: 10.1038/nature03127. [DOI] [PubMed] [Google Scholar]
  • [18].Kao MH, Wright BD, Doupe AJ. Neurons in a forebrain nucleus required for vocal plasticity rapidly switch between precise firing and variable bursting depending on social context. J Neurosci. 2008;28:13232–13247. doi: 10.1523/JNEUROSCI.2250-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Woolley SC, Doupe AJ. Social context-induced song variation affects female behavior and gene expression. PLoS Biol. 2008;6:e62. doi: 10.1371/journal.pbio.0060062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [20].Ding L, Gold JI. Caudate encodes multiple computations for perceptual decisions. J Neurosci. 2010;30:15747–15759. doi: 10.1523/JNEUROSCI.2894-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [21].Ding L, Gold JI. Neural correlates of perceptual decision making before, during, and after decision commitment in monkey frontal eye field. Cereb Cortex. 2012;22:1052–1067. doi: 10.1093/cercor/bhr178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Horwitz GD, Newsome WT. Separate signals for target selection and movement specification in the superior colliculus. Science. 1999;284:1158–1161. doi: 10.1126/science.284.5417.1158. [DOI] [PubMed] [Google Scholar]
  • [23].Kim JN, Shadlen MN. Neural correlates of a decision in the dorsolateral prefrontal cortex of the macaque. Nat Neurosci. 1999;2:176–185. doi: 10.1038/5739. [DOI] [PubMed] [Google Scholar]
  • [24].Roitman JD, Shadlen MN. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. J Neurosci. 2002;22:9475–9489. doi: 10.1523/JNEUROSCI.22-21-09475.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Shadlen MN, Newsome WT. Motion perception: seeing and deciding. Proc Natl Acad Sci U S A. 1996;93:628–633. doi: 10.1073/pnas.93.2.628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26**].Ding L, Gold JI. Separate, causal roles of the caudate in saccadic choice and execution in a perceptual decision task. Neuron. 2012;75:865–874. doi: 10.1016/j.neuron.2012.07.021. The authors demonstrate that disruption of caudate nucleus activity biases monkeys’ choice behavior on a visual motion discrimination task. This is the first study to demonstrate a causal relationship between the caudate nucleus and perceptual decision making.
  • [27].Hanks TD, Ditterich J, Shadlen MN. Microstimulation of macaque area LIP affects decision-making in a motion discrimination task. Nat Neurosci. 2006;9:682–689. doi: 10.1038/nn1683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Ding L, Hikosaka O. Comparison of reward modulation in the frontal eye field and caudate of the macaque. J Neurosci. 2006;26:6695–6703. doi: 10.1523/JNEUROSCI.0836-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Ikeda T, Hikosaka O. Reward-dependent gain and bias of visual responses in primate superior colliculus. Neuron. 2003;39:693–700. doi: 10.1016/s0896-6273(03)00464-1. [DOI] [PubMed] [Google Scholar]
  • [30].Kawagoe R, Takikawa Y, Hikosaka O. Expectation of reward modulates cognitive signals in the basal ganglia. Nat. Neurosci. 1998;1:411–416. doi: 10.1038/1625. [DOI] [PubMed] [Google Scholar]
  • [31].Platt ML, Glimcher PW. Neural correlates of decision variables in parietal cortex. Nature. 1999;400:233–238. doi: 10.1038/22268. [DOI] [PubMed] [Google Scholar]
  • [32].Hsiao PY, Lo CC. A plastic corticostriatal circuit model of adaptation in perceptual decision making. Front Comput Neurosci. 2013;7:178. doi: 10.3389/fncom.2013.00178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [33*].Rao RP. Decision making under uncertainty: a neural model based on partially observable markov decision processes. Front Comput Neurosci. 2010;4:146. doi: 10.3389/fncom.2010.00146. This paper shows that a basal ganglia circuit-based model, by learning the mapping between uncertain belief about sensory inputs and reward expectation, can perform on the basic visual motion discrimination task and adapt to different reward context.
  • [34].Teichert T, Yu D, Ferrera VP. Performance monitoring in monkey frontal eye field. J Neurosci. 2014;34:1657–1671. doi: 10.1523/JNEUROSCI.3694-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35**].Nomoto K, Schultz W, Watanabe T, Sakagami M. Temporally extended dopamine responses to perceptually demanding reward-predictive stimuli. J Neurosci. 2010;30:10692–10702. doi: 10.1523/JNEUROSCI.4828-09.2010. The authors demonstrate reward-predictive dopamine responses on a visual motion discrimination task with unequal reward association.
  • [36].Farries MA, Ding L, Perkel DJ. Evidence for “direct” and “indirect” pathways through the song system basal ganglia. J Comp Neurol. 2005;484:93–104. doi: 10.1002/cne.20464. [DOI] [PubMed] [Google Scholar]
  • [37].Farries MA, Perkel DJ. A telencephalic nucleus essential for song learning contains neurons with physiological characteristics of both striatum and globus pallidus. J Neurosci. 2002;22:3776–3787. doi: 10.1523/JNEUROSCI.22-09-03776.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Goldberg JH, Adler A, Bergman H, Fee MS. Singing-related neural activity distinguishes two putative pallidal cell types in the songbird basal ganglia: comparison to the primate internal and external pallidal segments. J Neurosci. 2010;30:7088–7098. doi: 10.1523/JNEUROSCI.0168-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [39].Luo M, Perkel DJ. A GABAergic, strongly inhibitory projection to a thalamic nucleus in the zebra finch song system. J Neurosci. 1999;19:6700–6711. doi: 10.1523/JNEUROSCI.19-15-06700.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [40].Luo M, Perkel DJ. Intrinsic and synaptic properties of neurons in an avian thalamic nucleus during song learning. J Neurophysiol. 2002;88:1903–1914. doi: 10.1152/jn.2002.88.4.1903. [DOI] [PubMed] [Google Scholar]
  • [41].Kozhevnikov AA, Fee MS. Singing-related activity of identified HVC neurons in the zebra finch. J Neurophysiol. 2007;97:4271–4283. doi: 10.1152/jn.00952.2006. [DOI] [PubMed] [Google Scholar]
  • [42**].Woolley SC, Rajan R, Joshua M, Doupe AJ. Emergence of context-dependent variability across a basal ganglia network. Neuron. 2014;82:208–223. doi: 10.1016/j.neuron.2014.01.039. The authors show that trial-to-trial variability in song-related neural activity in finches emerges in the basal ganglia structure Area X from stereotyped inputs from the cortex-like input structure HVC.
  • [43].Olveczky BP, Andalman AS, Fee MS. Vocal experimentation in the juvenile songbird requires a basal ganglia circuit. PLoS Biol. 2005;3:e153. doi: 10.1371/journal.pbio.0030153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [44**].Leblois A, Wendel BJ, Perkel DJ. Striatal dopamine modulates basal ganglia output and regulates social context-dependent behavioral variability through D1 receptors. J Neurosci. 2010;30:5730–5743. doi: 10.1523/JNEUROSCI.5974-09.2010. This study show that D1 dopamine receptor activation is critical for the reduction in trial-to-trial variability of syllable acoustics when a male finch sings to a female (directed song).
  • [45].Hara E, Kubikova L, Hessler NA, Jarvis ED. Role of the midbrain dopaminergic system in modulation of vocal brain activation by social context. Eur J Neurosci. 2007;25:3406–3416. doi: 10.1111/j.1460-9568.2007.05600.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Yanagihara S, Hessler NA. Modulation of singing-related activity in the songbird ventral tegmental area by social context. Eur J Neurosci. 2006;24:3619–3627. doi: 10.1111/j.1460-9568.2006.05228.x. [DOI] [PubMed] [Google Scholar]
  • [47].Sasaki A, Sotnikova TD, Gainetdinov RR, Jarvis ED. Social context-dependent singing-regulated dopamine. J Neurosci. 2006;26:9010–9014. doi: 10.1523/JNEUROSCI.1335-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [48].Ding L, Perkel DJ. Dopamine modulates excitability of spiny neurons in the avian basal ganglia. J Neurosci. 2002;22:5210–5218. doi: 10.1523/JNEUROSCI.22-12-05210.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [49].Ding L, Perkel DJ, Farries MA. Presynaptic depression of glutamatergic synaptic transmission by D1-like dopamine receptor activation in the avian basal ganglia. J Neurosci. 2003;23:6086–6095. doi: 10.1523/JNEUROSCI.23-14-06086.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [50**].Ali F, Otchy TM, Pehlevan C, Fantana AL, Burak Y, Olveczky BP. The basal ganglia is necessary for learning spectral, but not temporal, features of birdsong. Neuron. 2013;80:494–506. doi: 10.1016/j.neuron.2013.07.049. The authors find that lesions of LMAN affect aversive conditioning of song acoustics but not of song tempo.
  • [51].Leblois A, Perkel DJ. Striatal dopamine modulates song spectral but not temporal features through D1 receptors. Eur J Neurosci. 2012;35:1771–1781. doi: 10.1111/j.1460-9568.2012.08095.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [52].Bogacz R, Larsen T. Integration of Reinforcement Learning and Optimal Decision-Making Theories of the Basal Ganglia. Neural Computation. 2011;23:817–851. doi: 10.1162/NECO_a_00103. [DOI] [PubMed] [Google Scholar]
  • [53].Ding L, Gold JI. The basal ganglia’s contributions to perceptual decision making. Neuron. 2013;79:640–649. doi: 10.1016/j.neuron.2013.07.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [54].Lo CC, Wang XJ. Cortico-basal ganglia circuit mechanism for a decision threshold in reaction time tasks. Nat Neurosci. 2006;9:956–963. doi: 10.1038/nn1722. [DOI] [PubMed] [Google Scholar]
  • [55].Gale SD, Perkel DJ. A basal ganglia pathway drives selective auditory responses in songbird dopaminergic neurons via disinhibition. J Neurosci. 2010;30:1027–1037. doi: 10.1523/JNEUROSCI.3585-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES