Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2014 Nov 5;369(1655):20130489. doi: 10.1098/rstb.2013.0489

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© 2014 The Author(s) Published by the Royal Society. All rights reserved.

PMC Copyright notice

Figure 2. — Single-phase models have a problem with correctly assigning credit during reinforcement learning. (a) Activity is only in the MSNs of the direct pathway. Activity levels are designated by the size of the yellow star. Activity in the striatum influences the final action selection by premotor cortex, which in this instance chooses R. In this case, dopamine-dependent reinforcement works correctly to strengthen synapses onto the R MSN in the direct pathway. (b) Both direct and indirect pathways are active in this example, and both ‘vote’ on actions. The final choice is R because votes for L in the direct pathway are balanced by votes against L in the indirect pathway. If R is reinforced, the resulting dopamine release will lead to enhancement of the active synapses onto the L MSNs, which have the largest eligibility trace in the direct pathway. This is problematic because it will not lead to the desired change in behaviour. (Online version in colour.)