Skip to main content
. Author manuscript; available in PMC: 2018 Aug 22.
Published in final edited form as: Neural Comput. 2013 Sep 18;25(12):3093–3112. doi: 10.1162/NECO_a_00522

Figure 1:

Figure 1:

Model description. (A) Decision-making network. Each circle represents a population of neurons. As the targets appear, the input population is activated in the same way on each trial. The input is fed through plastic synapses into two excitatory populations representing the two possible choices. These two populations compete through an inhibitory population and work as a winner-take-all network. The plastic synapses are modified depending on the activity of the pre- and, postsynaptic neurons and on the outcome of the choice (reward or no reward). (B) Learning rule in rewarded trials in which the population representing the left target is activated. The synapses to the chosen target are potentiated with a learning rate αr, and those to the other target are depressed with a learning rate γα7.. (C) Same as in panel B but in unrewarded trials. The synapses to the chosen target are depressed αn, and those to the other target are potentiated γαη.