2012 Feb 15;2:2. doi: 10.1186/2190-8567-2-2

Copyright ©2012 Schiess et al.; licensee Springer

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Fig. 3. Balanced cell reinforcement (bCR, Equation 26) compared to zone reinforcement (ZR). (A) Average performance of bCR (green) and ZR (red) on the same task as in panel 6A. (B) Performance of bCR (green) and ZR (red) when learning stimulus-response associations for four different patterns; the x-axis is logarithmic. The inset shows the distribution of NMDA-spike durations after learning the task with bCR. The performance values in the figure are averages over 40 runs, and error bars show 1 SEM. (C) Development of the average reward signal $R(Z)$ for bCR (green) and ZR (red) when the task is to spike at the midpoint of the single input pattern. Here $R(Z) = -\frac{2}{nT} \sum_{i=1}^{n} | t_{i}^{sp} - t^{targ} |$, where $t_{i}^{sp} \in Z$, $i = 1, \dots, n$, is the $i$th of the $n$ output spike times, $t^{targ} = 250$ ms is the target spike time, and $T = 500$ ms is the pattern duration; if there was no output spike within $[0, T)$, we added one at $T$, yielding $R(Z) = -1$. (D) Spike raster plot of the output spike times $Z$ under bCR, with the corresponding $R(Z)$ shown in C. With ZR, the distribution of spike times after 3000 trials roughly corresponds to the one for bCR after 160 trials (vertical line at ∗), where the two performances coincide (see ∗ and black lines in C). The mean and standard deviation of the spike times at the end of the learning process, averaged across the last 300 trials, were $251 \pm 45$ ms for bCR and $256 \pm 121$ ms for ZR.
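The reward in panel C can be checked numerically. The following is a minimal sketch of the caption's formula for $R(Z)$, not the authors' code: the function name `spike_timing_reward` and its parameter names are hypothetical, and discarding spikes outside $[0, T)$ is an assumption based on the caption's wording.

```python
def spike_timing_reward(spike_times_ms, t_target_ms=250.0, pattern_duration_ms=500.0):
    """Return R(Z) = -2/(n*T) * sum_i |t_i - t_targ| for one trial.

    Hypothetical helper illustrating the caption's formula; the defaults
    are the values stated there (t_targ = 250 ms, T = 500 ms).
    """
    T = pattern_duration_ms
    # Assumption: only spikes inside the pattern window [0, T) count.
    spikes = [t for t in spike_times_ms if 0.0 <= t < T]
    if not spikes:
        # Per the caption: with no output spike, one is added at T.
        spikes = [T]
    n = len(spikes)
    total_deviation = sum(abs(t - t_target_ms) for t in spikes)
    return -2.0 * total_deviation / (n * T)


if __name__ == "__main__":
    print(spike_timing_reward([250.0]))         # spike exactly on target
    print(spike_timing_reward([200.0, 300.0]))  # two spikes, 50 ms off each
    print(spike_timing_reward([]))              # no spike in the window
```

With these defaults the reward lies in $[-1, 0]$: a single spike exactly at the target gives 0, and a trial with no output spike gives $-1$, because the substituted spike at $T$ lies $T/2$ from the target.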