Abstract
Animals must often make opposing responses to similar complex stimuli. Multiple sensory inputs from such stimuli combine to produce stimulus-specific patterns of neural activity. It is the differences between these activity patterns, even when small, that provide the basis for any differences in behavioral response. In the present study, we investigate three tasks with differing degrees of overlap in the inputs, each with just two response possibilities. We simulate behavioral output via winner-takes-all activity in one of two pools of neurons forming a biologically based decision-making layer. The decision-making layer receives inputs either in a direct stimulus-dependent manner or via an intervening recurrent network of neurons that form the associative layer, whose activity helps distinguish the stimuli of each task. We show that synaptic facilitation of synapses to the decision-making layer improves performance in these tasks, robustly increasing accuracy and speed of responses across multiple configurations of network inputs. Conversely, we find that synaptic depression worsens performance. In a linearly nonseparable task with exclusive-or logic, the benefit of synaptic facilitation lies in its superlinear transmission: effective synaptic strength increases with presynaptic firing rate, which enhances the already present superlinearity of presynaptic firing rate as a function of stimulus-dependent input. In linearly separable single-stimulus discrimination tasks, we find that facilitating synapses are always beneficial because synaptic facilitation always enhances any differences between inputs. Thus we predict that for optimal decision-making accuracy and speed, synapses from sensory or associative areas to decision-making or premotor areas should be facilitating.
Keywords: facilitation, depression, stimulus discrimination, choice behavior, exclusive-or logic
a fundamental role of an animal's brain is to allow the animal to adapt to an ever-changing environment such that the animal's response, or behavior, leads to most reward or least harm given a particular confluence of environmental stimuli. Subtle differences in stimuli can require opposite behaviors. For example, those characteristics differentiating a friend from a foe or a stronger opponent from a weaker one could appear minor amidst the general overlap of features pertaining to a fellow member of the same species. However, the animal must learn to discriminate differences in such salient features to produce opposite responses, even if much of the sensory information (and neural activity within the brain) is similar. Thus how appropriate responses, or decisions that lead to those responses, are learned and produced from patterns of overlapping and similar neural activity is a matter of intense investigation, both experimentally (Curtis and Lee 2010; Glimcher and Rustichini 2004; Gold and Shadlen 2007, Harris et al. 2008; Histed et al. 2009; Jay 2003; Kennerley and Walton 2011; Takeda et al. 2005) and theoretically (Eckhoff et al. 2008; Feng et al. 2009; Rigotti et al. 2010a, 2010b; Sakai et al. 2006; Sakai and Miyashita 1991; Shen et al. 2008; Soltani and Wang 2008, 2010; Wang 2008).
In our studies, we focus on the inputs to a binary winner-takes-all (WTA) decision-making circuit (Wang 2002). The WTA network serves as a model for perceptual decision-making and the animal's behavioral choice in a two-alternative forced-choice task (Wang 2002; Wong and Wang 2006). Response of the WTA network depends on bias in the input to its two decision-making pools. In our model, input synapses to the decision-making neurons are modified by a dopamine (DA)-modulated Hebbian reward-based plasticity rule (Reynolds and Wickens 2002; Soltani and Wang 2006) that, in principle, can lead the decision-making pools to produce responses that maximize reward, given sufficient information about the stimuli within the neural activity of its input cells. For the networks of spiking neurons in our study, the cells providing input to the decision-making circuit reside in a randomly recurrently connected associative layer. Within such a layer, stimulus-selective cells can arise, even when the inputs are broadly overlapping (Bourjaily and Miller 2011a; Rigotti et al. 2010b). A general linear readout, as used in support vector machines, could produce appropriate responses to information within the high-dimensional space of associative layer neural firing rates via a trained or fitted set of synaptic weights (Burges 1998). However, the requirement that such weights are produced by biologically realistic learning rules generally constrains the readout process such that information can be contained within neurons in the associative layer, but firing of those neurons cannot produce a reliable response. In this report, we suggest that the use of dynamic synapses between the associative layer and decision-making layer can enhance the readout process beyond that achievable by a general linear classifier.
Multiple decision-making studies have focused on the formation and function of the internal dynamics of the decision-making networks (Albantakis and Deco 2011; Cisek et al. 2009; Daw et al. 2006; Lee and Seo 2007; Rolls et al. 2010; Theodoni et al. 2011; Wang 2002; Wong and Wang 2006; Xie et al. 2002). In computational studies of decision-making, the inputs have been primarily via static synapses (i.e., synapses without short-term synaptic plasticity), whose postsynaptic output increases approximately linearly with presynaptic firing rate (until saturation at a rate proportional to the inverse of the postsynaptic receptor's time constant) (Wang 2002; Wong and Wang 2006). However, numerous experimental studies across different cortical regions (Abbott et al. 1997; Chance et al. 1998; Markram et al. 1998b; Nelson and Turrigiano 1998; Wang et al. 2006) have shown that synaptic strength is dynamic on the timescale of interspike intervals and that synaptic transmission increases nonlinearly with presynaptic firing rate (Chapman et al. 2000; Grillner et al. 2005; Tsodyks et al. 1998). Computational studies have shown a number of functions for both facilitating and depressing synapses, such as persistent neural activity (Barak and Tsodyks 2007; Conboy et al. 2010; Markram 1997; Martinez-Garcia et al. 2011), filtering neural signals (Farajidavar et al. 2008; Lindner et al. 2009), and gain control (Abbott et al. 1997), among others.
Some of the most difficult behavioral tasks are linearly nonseparable (Grand and Honey 2008; Harris 2006; Harris et al. 2008; Melchers et al. 2008; Sanderson et al. 2006); that is, the choice of response cannot be determined by an appropriately weighted combination of individual stimuli (Minsky and Papert 1987). Behavioral tasks that are linearly nonseparable include those that employ exclusive-or (XOR) logic, such as biconditional discrimination (Melchers et al. 2008; Sanderson et al. 2006) and negative patterning (Harris et al. 2008), inference tasks such as associative transitive inference (Bunsey and Eichenbaum 1993), certain paired association tasks (Takeda et al. 2005), and transverse patterning (Sanderson et al. 2006). Although these tasks vary in design and sensory modality, they all require animals to produce stimulus pair-selective responses to solve the task. In biconditional discrimination, as in other linearly nonseparable tasks, successful decision-making requires responses selective to stimulus pairs (e.g., A + B vs. C + B vs. C + D) (Bourjaily and Miller 2011a), with different responses to the most similar pairs. Thus, if the animal only learns a preferred response to one member of the pair (e.g., stimulus B from stimulus pair A + B), then if it responds correctly to stimulus pair A + B, it would respond incorrectly to stimulus pair C + B. Our results for the biconditional discrimination task can be applied to the other linearly nonseparable associative learning tasks mentioned above.
In artificial and/or feedforward neural networks with nonlinear behavior, XOR has been solved using methods such as selective suppression of feedback (Hasselmo and Cekic 1996) and associative learning (Alkon et al. 1990) of input correlations in conjunction with stochastic gradient ascent reinforcement learning (Iannella and Back 2001; Park et al. 2000). At the synaptic level, XOR has been solved using stochastic gradient ascent learning, where stochastic synapses and/or irregular spiking provided requisite noise for stochastic gradient ascent learning to a readout cell (Seung 2003; Xie and Seung 2004).
A critical property of dynamic synapses is that they lead to nonlinear synaptic transmission (Dayan and Abbott 2001; Fiete et al. 2004; Markram et al. 1998a, 1998b), which could enhance the circuit's ability to produce solutions to linearly nonseparable tasks. Here, we address how short-term plasticity in afferent synapses to a decision-making network (Wang 2002; Wong and Wang 2006) with a biophysically based reinforcement learning rule (Jay 2003; Reynolds and Wickens 2002; Shen et al. 2008) can affect performance and reaction times in both a linearly nonseparable task and simpler tasks. Although individual synapses are known to have both facilitating and depressing dynamics (Grillner et al. 2005; Maass and Zador 1999; Markram et al. 1998b), to investigate the specific role of each, we look at either purely facilitating or purely depressing synapses (Chapman et al. 2000; Dayan and Abbott 2001; Grillner et al. 2005; Markram et al. 1998a) compared with static synapses in each network and task.
For the overwhelming majority of network types trained on linearly nonseparable tasks and for all tests of linearly separable tasks, we find that facilitation's superlinear synaptic transmission improves performance. Specifically, facilitation improves decision-making accuracy and speed of response, whereas depression's sublinear synaptic transmission worsens both relative to static synapses. The beneficial role of facilitation suggests that synapses from sensory and associative areas of the brain to decision-making regions such as prefrontal cortex and premotor areas should be facilitating.
MATERIALS AND METHODS
Simplified Firing Rate Model
We produced a simplified firing rate model network to demonstrate clearly the interaction between the following three factors: nonlinearity of associative layer responses, dynamics due to short-term plasticity at the synapse, and the details of the reward-based learning rule. The simplified model was written in Matlab (The MathWorks, Natick, MA), and the code is available online (http://people.brandeis.edu/∼pmiller/code.html).
As in the full network (see below), four stimulus pairings were used (A + B, A + D, C + B, or C + D). We assume four types of neural response in the associative layer (AB, AD, CB, or CD), corresponding to groups of cells whose preferred stimulus is one of the four stimulus pairings (see Table 1). In the associative layer, each cell group i is represented by a single firing rate, which evolves during each stimulus pair presentation s according to
where τ = 5 ms and the applied current Ii,sApp = 40 for the group's preferred stimulus pair, Ii,sApp = 20 for the opposite (nonoverlapping) stimulus pair, and Ii,sApp = 0 for each of the two intermediate stimulus pairs, as given in Table 1.
Table 1.
Firing rate response of linear neurons to paired inputs
| Stimulus Pair Input, Hz |
||||
|---|---|---|---|---|
| Cell Group's Preferred Stimulus | A + B | A + D | C + B | C + D |
| AB | 40 | 20 | 20 | 0 |
| AD | 20 | 40 | 0 | 20 |
| CB | 20 | 0 | 40 | 20 |
| CD | 0 | 20 | 20 | 40 |
The steady-state firing rate, f(IApp), in response to an applied current, IApp, can take on three forms. The response function may be linear, flin(IApp) = IApp; sublinear, fsub(IApp) = IApp/(1 + αIApp); or superlinear, fsupra(IApp) = IApp·(1 + αIApp). We set the degree of nonlinearity as α = 0.1 in the results (presented in Fig. 3).
Fig. 3.
Interaction of firing rate and synaptic nonlinearity in a firing rate model. A and B: fraction of reward accumulated over all trials (blue = 0; green = 0.5; red = 1). C and D: asymmetry in the synaptic strengths from associative layer excitatory cells to decision layer pools (blue = asymmetry of −1, i.e., input to the incongruous pool; green = asymmetry of 0, i.e., equal input to both pools; red = asymmetry of 1, i.e., input to the congruous pool). In A and C, the presynaptic rate dependence of the reinforcement learning rule matches the synaptic input to the decision layer cells. In B and D, the reinforcement learning rule producing the maximum reward for each combination of firing rate and synaptic nonlinearity is used. Stat, static; Dep, depression; Fac, facilitation; Sublin, sublinear; Supralin, supralinear; f-l, firing rate-linearity relationship.
The four associative layer cell groups i provide synaptic input to each of two decision layer cell groups, j = 1, 2, whose firing rates evolve according to
where IjIn is input from the associative layer, IjRec is recurrent self-excitation and recurrent cross-inhibition from within the decision layer, and IjNoise is uncorrelated noise input in the decision layer to produce stochastic responses. The total input IjIn to a decision layer cell group j depends on the synaptic strength, WijAD, which is modified through reward-based plasticity and the rate-dependent synaptic transmission, si(t), from presynaptic associative layer cell group i: IjIn(t) = ∑i WijADsi(t).
For static (i.e., linear) synapses, sistat(t) = ri(t); for depressing (i.e., sublinear) synapses, sidep(t) = ri(t)/[1 + αri(t)]; and for facilitating (i.e., superlinear) synapses, sifac(t) = ri(t)⋅[1 + αri(t)]. Note that the above formulas for the simplified rate model include only the steady-state change in amplitude wrought by the short-term plasticity, to demonstrate its effect on decision-making; they do not account for the transient dynamics that occur when firing rates change (Abbott et al. 1997). We modeled the complete dynamics in the full spiking-neuron network model [see Supplemental Table S1, where simulation details are presented in a form suggested by Nordlie et al. (2009)]. (Supplemental data for this article is available online at the Journal of Neurophysiology website.)
The synaptic strengths from the four associative layer groups to the two decision layer groups were initially equal, set to Wij(0) = 10/〈 si(∞)〉, where si(∞) is the steady-state synaptic input of cell group i and the average (in angle brackets) was over the four associative layer cell groups. Thus the initial mean input to the decision layer before training was held constant when we varied the nonlinearity in the associative layer or the short-term dynamics of the afferent synapses. The synaptic strengths were updated by ΔWij (defined below) on a trial-by-trial basis in a reward-dependent manner to the “winning” decision layer pool with the higher firing rate after a 500-ms stimulus presentation. If the winning decision layer pool was the designated one for obtaining reward (group 1 for stimulus pair A + B or C + D and group 2 for stimulus pair A + D or C + B), then ΔWij = ΔW+(Ai − 〈Ai〉); otherwise, ΔWij = −ΔW−(Ai − 〈Ai〉). Ai depended on the firing rate of the associative layer cell and could be linear, Ai = ri(tmax); sublinear, Ai = ri(t)/[1 + αri(tmax)]; or superlinear, Ai = ri(t)·[1 + αri(tmax)]. Thus we present our results in a 3 × 3 × 3 matrix form, because they depend on the linearity, sublinearity, or superlinearity of neural response, synaptic dynamics, and the presynaptic rate dependence of the reinforcement rule. All other terms in the decision layer were kept constant across these 27 combinations of network.
The recurrent current to the decision layer cell groups (j = 1, 2) is given by IjRec(t) = Wsymrj(t) − WXr2−j(t) with Wsym = 0.8 and WX = 0.6. The noise current to the decision layer cell groups is given by IjNoise(t) = ση(t), where σ = 0.2 and η(t) is a white-noise process with unit variance. The firing rate curve of the decision layer neurons is sigmoidal, fdec(I) = rmax/{1 + exp [(ITh − I)/Iσ]} with rmax = 50 Hz, ITh = 50, and Iσ = 8.
We simulated 10 independent sessions of 1,000 consecutive trials with each linearity-type configuration and calculated the fraction of correct responses across these trials. We also calculated the net asymmetry, SW, of the final weight matrix produced by the learning rule as a number that can vary from −1 to +1:
A weight asymmetry of SW = −1 means that associative layer cell groups selectively potentiate to decision layer cell groups that produce the incongruous response, i.e., opposite to the one desired for their preferred cue.
Two-stimulus discrimination task.
To assess the effect of dynamic synapses on the inputs to a decision-making network in a two-stimulus discrimination task, we used the rate model, as described above. In this task, we used two distinct associative layer pools, each connected to a separate decision-making pool. Inputs to the two associative layer pools differed (by 5% of the mean in the results presented) to simulate concurrent visual input of different strengths in opposing directions (Wang 2002), as used in visual motion coherence discrimination tasks (Shadlen and Newsome 1996). To compensate for the change of effective synaptic strength inherently produced by dynamic synapses, we evaluated discrimination accuracy, namely, the probability that the “winning” decision-making pool was the one with greater stipulated input, as a function of baseline synaptic strength of afferent synapses to the decision-making circuit.
Spiking Network Simulations
Task logic and inputs.
In the biconditional discrimination task (Sanderson et al. 2006), four stimulus pairings were activated, one per trial (A + B, C + D, A + D, or C + B), with each stimulus producing inputs as Poisson spike trains. If either both A and B were activated or neither was activated (i.e., C + D), the decision was to release the lever. If either A or B but not both were present (i.e., A + D or C + B), the opposite decision was made, namely, to hold the lever until the end of the trial. In the stimulus-response association task, single stimuli were presented and the decision was to release for inputs A or D and to hold for inputs B or C.
We produced networks with different degrees of input correlations (as shown in Fig. 4A) by altering the number of independent inputs per stimulus as 2, 4, 6, 10, or 20 to excitatory and inhibitory cells in the associative layer (see Fig. 4B). Each input comprised a train of independent Poisson spikes with a mean firing rate defined by r̄ = 480 Hz/(no. of inputs per stimulus). For example, in a network with 20 inputs, any stimulus produced 20 independent Poisson spike trains at 24 Hz, with each input projecting to independent sets of cells within the associative layer, whereas with 2 inputs per stimulus, the 2 trains of 240-Hz Poisson spikes (a firing rate much higher than that produced by an individual cell) could be considered as 20 independent Poisson spike trains of 24 Hz grouped into 2 sets of 10: while the receiving cells between sets were uncorrelated, the receiving cells within such a set of 10 would be identical. Thus, as the number of inputs per stimulus decreased, the correlation in connectivity from afferent cells increased (see Fig. 4A).
Fig. 4.
Inputs and architecture for the full spiking network with decision-making accuracy. In this study, we have examined the robustness of our findings across networks with varying input correlation and input probability, thus creating multiple network conditions. In A, we provide a general description of our inputs and show an example of input configurations from a single stimulus-responsive input population for 2 different input correlations (see materials and methods, Spiking Network Simulations). Each input population is represented by 20 cells, which output independent Poisson spike trains but correlated connections. The total number of independent sets of connections ranges from 2 to 20. Top: with 2 independent inputs per stimulus population and low input connection probability (1/5 is shown), associative layer cells receive initially sparse selective inputs. Bottom: with 10 independent inputs per stimulus population and low input connection probability (1/5 is shown), associative layer cells receive more uniform, less correlated inputs. Poisson input groups randomly project to a sparse-random recurrent network of excitatory (red) and inhibitory (blue) cells. Input projection probability ranges from 1/20 (sparse) to 1/2 (dense), with input connections selected independently between each set of independent input cells per input population. B: the network consists of an input population of Poisson independent inputs per stimulus that project randomly (represented by dashed lines) to a random recurrent network of excitatory (red) and inhibitory (blue) cells called the associative layer. Excitatory-to-excitatory connections (arrows) and inhibitory-to-excitatory connections (colored circles) are probabilistic and plastic. All-to-all inhibitory-to-inhibitory synapses are also present but not plastic. Triplet spike-timing-dependent plasticity (STDP) occurs at excitatory-to-excitatory and input-to-excitatory synapses, whereas long-term potentiation of inhibition (LTPi) occurs at inhibitory-to-excitatory synapses. Homeostasis by multiplicative synaptic scaling is present on all plastic synapses within the associative layer. Inhibition is feedforward only (i.e., the network does not include recurrent excitatory-to-inhibitory synapses) (Bourjaily and Miller 2011a). C: excitatory cells from the associative layer project all-to-all with dynamic synapses (red arrowheads), initially with equal synaptic strength to excitatory cells in both the hold and release pools of the decision-making network, which also receive an urgency-gating signal (Cisek et al. 2009), modeled as a linear ramping conductance to ensure decisions are made each trial. The decision-making network consists of 2 excitatory pools with strong recurrent connections, which compete via cross-inhibition. Strong self-recurrent excitation ensures bistability for each pool, while the cross-inhibition generates winner-take-all (WTA) dynamics such that only 1 population can be active following the stimulus, resulting in 1 decision (Wang 2002). Whether the motor output (based on the decision of hold vs. release) is correct or incorrect for the corresponding cue (e.g., A + B), according to the rules of the behavioral tasks, determines the presence or absence of dopamine (DA) at the input synapses (dashed lines, double arrowhead). D–G: biconditional discrimination decision-making network accuracy depends on afferent short-term plasticity. Each matrix contains the results for 25 networks trained with LTPi and triplet STDP, with 5 levels of input correlation (x-axis) and 5 levels of sparseness (y-axis) in 1 of 3 conditions. The color bar in D represents stimulus pair selectivity at the population level with a range of 0–2. For decision-making networks in E–G, the color bar represents percent correct with a range of 50–100% correct. D: trained network stimulus pair selectivity. E: depressing synapses produce 2 reliable networks via sublinear synaptic transmission. F: static synapses generate 6 reliable networks due to associative learning generating selectivity within the associative layer (Bourjaily and Miller 2011a). G: networks with facilitating synapses generate 12 reliable networks via superlinear synaptic transmission. These results demonstrate the beneficial role of facilitation for reliable decision-making. [Panels A–C reprinted from Bourjaily and Miller (2011b).]
Input sparseness was defined via the probability of any input group projecting to any given cell. As input connection probability increased, sparseness decreased. We used the following five values for input connection probability: 1/2, 1/3, 1/5, 1/10, and 1/20 (1/5 in the default network). Five levels of input sparseness, combined with five different degrees of input correlations, led to 25 variant networks in each regime.
Associative layer connectivity.
Both input and recurrent connectivity in the initial network were sparse and random. We simulated 400 neurons with an excitatory-to-inhibitory ratio of 4:1 (Abeles 1991). Excitatory-to-excitatory connections were sparse and random. Inhibition was feedforward only. Inhibitory-to-inhibitory connections were all-to-all. Finally, inhibitory-to-excitatory synapses connected randomly. Connections to the decision layer were initially all-to-all from excitatory neurons with a uniform strength. We used a stronger initial constant input weight for the sparse networks (1/10, 1/20), because their initial low average activity was too small to drive the decision-making layer otherwise (Bourjaily and Miller 2011a).
Decision layer connectivity.
The decision-making network, based on Wang (2002), comprised two 200-cell excitatory and 50-cell inhibitory pools. Connections within each pool were all-to-all. Strong recurrent excitation within each pool and cross-inhibition from each inhibitory pool to the opposing excitatory pool together generated WTA activity so that only one pool was stably active, modeling the decision. The decision-making network received an “urgency-gating” input, which ramped up linearly from cue onset to offset, ensuring that a decision was made each trial (Cisek et al. 2009).
Neuron and synapse properties.
We used leaky integrate-and-fire neurons (LIF) (Tuckwell 1988), defined according to the equations in Supplemental Table S1D, with parameters defined in Supplemental Table S2, D1 and D2. The spiking threshold potential was dynamic, because it depolarized with increased firing and decreased exponentially to the base value as the firing rate decreased. After a spike, there was a dynamic refractory conductance, gref(t).
We included voltage noise by a Gaussian distribution of zero mean with unit variance and amplitude σV in the associative layer. We also modeled noise as independent excitatory AMPA and inhibitory GABAA conductance variables, drawn from a uniform distribution [0 σS] (see Supplemental Table S2 for parameters).
Synaptic conductances were modeled as instantaneous steps after a spike followed by an exponential decay (described in Supplemental Table S1). Recurrent excitatory currents were modeled by AMPA and voltage-dependent NMDA receptors. Inhibitory currents were modeled by GABAA receptors. NMDA receptors were also defined by the voltage term (Compte et al. 2000; Jahr and Stevens 1990). Total conductance was the weighted sum of all inputs (see Supplemental Table S1). Initial values of weights depended on synapse type as given in Supplemental Table S2 and were modified according to the plasticity rules described below and in Supplemental Table S1.
Short-term synaptic plasticity.
Short-term facilitation and depression were modeled as instantaneous steps after a presynaptic spike followed by an exponential decay (Dayan and Abbott 2001), described by the equations below, which mimic in vitro measurements (Markram and Tsodyks 1996; Markram et al. 1998a). The total conductance of the presynaptic synapse to the decision-making network was a multiplication of the facilitation or depression variable (defined below) by the normal conductance [e.g., s(t)AMPAfac = sAMPA(t) × F(t), where F(t) is the facilitation variable].
Dynamics of the facilitation or depression variables are given in Supplemental Table S1. The facilitation variable represents empirically the gradual decay of calcium concentration and calcium-dependent receptor activation that follows calcium influx at the axon terminal following each action potential. The depression variable represents empirically the fraction of vesicles available for release, which decreases following each presynaptic spike but recovers to its base level with a time constant dependent on vesicle recycling.
Long-term plasticity.
For all plastic connections, changes in synaptic strength per trial were limited to a maximum of 50% per trial to constrain the model to biologically observed change from in vitro and slice procedures (Bi and Poo 1998; Maffei et al. 2006), while across all trials, synaptic strength was hard bounded between 0 and 20× WEE (associative excitatory-to-excitatory base synaptic weight), 20× WEI (associative inhibitory-to-excitatory base synaptic weight), and 10× Input synE (excitatory input synaptic weight), the initial mean synaptic weights given in Supplemental Table S2.
We modeled long-term potentiation of inhibition (LTPi) from recent experimental work by Maffei et al. (2006) and as in our prior article (Bourjaily and Miller 2011a). LTPi occurred when an inhibitory cell spiked and its postsynaptic excitatory cell was depolarized (explicitly modeled with a voltage threshold) but silent, as described in Supplemental Table S1. If the excitatory cell was co-active (i.e., spiking), synapse strength underwent no change. We referred to this as a “veto” effect in our model of LTPi. Any excitatory spike within a uniform temporal window about the inhibitory spike, the LTPi window, resulted in a veto. For each inhibitory spike (“non-vetoed”), the synapse was potentiated. LTPi has been reported experimentally as a mechanism for increasing (but not decreasing) the strength of inhibitory synapses in cortex (Maffei et al. 2006). We used homeostasis by multiplicative postsynaptic scaling (Turrigiano et al. 1998) at the inhibitory-to-excitatory synapses to compensate for LTPi's inability to depress synapses. Hard upper bounds on inhibitory synaptic strength limited further potentiation of the inhibitory synapses of the most inhibited cells.
Triplet spike-timing-dependent plasticity (STDP) was modeled according to the rule published by Pfister and Gerstner (2006) using the parameters cited from the full model “all-to-all” cortical parameter sets (see Supplemental Table S2). The weight changes are detailed in Supplemental Table S1.
Synaptic stability was maintained by multiplicative postsynaptic scaling (Turrigiano et al. 1998) that was updated each trial. The change in synaptic strength was proportional to the difference between the mean rate, r̄, and a goal rate, rgoal, with a rate constant, εH, shared by all synapses. The goal rates, rgE and rgIE, were heterogeneous about their means with an added 5-Hz random spread from a uniform distribution.
DA-modulated Hebbian reward rule and reaction times.
Reward plasticity updated the associative layer excitatory to decision layer excitatory synapses according to Supplemental Table S1. If the decision matched the instructed cue, then reward was delivered (e.g., for stimulus pair A + B, a “release” response resulted in reward = 1, producing LTP when pre- and postsynaptic cells were active, whereas a “hold” response to A + B resulted in reward = 0, and the sign of the change to the synapse was negative, i.e., LTD between coactive cells). We used this rule because it has been suggested by the DA dependence of correlational synaptic plasticity in vitro (Jay 2003; Reynolds and Wickens 2002; Shen et al. 2008) and has been applied successfully by others in computational studies (Soltani and Wang 2006, 2010) for biologically based reinforcement learning. If the associative layer contains sufficient numbers of stimulus pair-selective cells, then this rule will maximize reward and generate selective potentiation from excitatory associative layer to excitatory decision layer cells necessary for reliable decision-making.
Decisions and reaction times were calculated as follows. The reaction time for a decision was made when the mean activity of one excitatory pool (e.g., release) in a 20-ms time bin was at least 20 Hz greater than that of the opposite excitatory pool (e.g., hold). We averaged reaction times across the final 80 trials and then across the 4 random instantiations for each of the 25 networks. For a reward of 1, we required the “correct” excitatory pool to maintain at least 20 Hz greater activity than that of the opposite pool in the final time bin of the stimulus presentation. Repeating these calculations with alternate bin sizes of 10, 50, and 100 ms did not produce significant differences in reaction times.
Single-Stimulus and Stimulus Pair Selectivity Metric
Single-stimulus selectivity defined each neuron's selectivity for one stimulus (e.g., A) over the other three stimuli (e.g., B, C, and D). Likewise, stimulus pair selectivity defined each neuron's selectivity for one stimulus pair (e.g., A + B) over the other three stimulus pairs (e.g., A + D, C + B, and C + D). We defined selectivity, Si, of each excitatory neuron i as its maximum firing rate minus its mean response across all four stimuli, normalized by the mean response: Si = (rimax − 〈ri〉)/〈ri〉, where rimax was its maximal response and 〈ri〉 was its mean response.
Task Performance Metrics, Statistics, and Curve Fitting
We addressed how short-term plasticity affected a network's accuracy at biconditional discrimination, a nonlinear XOR operation. Linear operations can produce up to 75% correct responses; thus we designated networks with response accuracy in the range of 76–84% as borderline networks, whereas those with >85% correct were designated as reliable. Final decision-making accuracy (% correct) was calculated from the mean of the final 80 trials.
We computed statistical significance between networks with different synaptic dynamics using a two-sample t-test. Specifically, we compared 4 instantiations of each of the 25 network input configurations. We also combined the network input configurations and instantiations to strengthen statistical power (4 instantiations, 25 network input configurations, 100 networks total) before statistical comparison by paired t-test for an overall global change in accuracy.
To assess whether cells preferentially strengthened connections to the congruous decision-making pool, for each cell we calculated a response bias as the mean firing rate to stimuli requiring a response of “release” minus that to stimuli requiring a response of “hold” (i.e., rrelease − rhold) and a connection bias as the mean synaptic weight to cells in the release pool minus that to cells in the hold pool (i.e., Wrelease − Whold).
Numerical Procedures
Simulations were run for at least 400 trials using the Euler-Maruyama method of numerical integration with a time step dt = 0.02 ms. To ensure network stability, we ran key networks up to 2,000 trials with sustained results. For all configurations, we repeated with four random instantiations of initial network structure, cell/synapse heterogeneity, and noise. Robustness was further ensured by simulating key networks using 10 distinct random instantiations. Simulations were written in C++ and run on Intel Xeon machines. Matlab r2011a was used for rate models, as well as all data analysis and visualization.
RESULTS
Biconditional Discrimination: Firing Rate Model
Necessity for nonlinearity of inputs to the decision-making network.
In a task such as biconditional discrimination, one expects to find in a successful final network that those associative layer cells most selective to a particular stimulus pair connect preferentially to congruous decision-making neurons, namely, those whose activity generates the correct response to that stimulus pair (Fig. 1B). That is, associative layer neurons most responsive to stimulus pairs A + B or C + D should connect to the release pool of neurons, whereas those associative layer neurons most responsive to stimulus pairs A + D or C + B should connect more strongly to decision layer neurons in the hold pool. However, even with this appropriate connectivity, the output from these cells to the decision layer neurons must be a nonlinear function of the associative layer inputs for the network to produce a correct response in a biconditional discrimination task.
Fig. 1.
Synaptic nonlinearity can combine with asymmetric connectivity to produce above-chance performance in biconditional discrimination. Simplified networks are shown with linear input-output associative layer neurons. A: the initial pretrained linear network has all possible connections, so excitatory associative layer cells do not preferentially project to either decision-making pool. B: after training with synaptic plasticity, linear excitatory associative layer neurons project selectively to either the release or hold decision-making pool. For the example A + B trial, each input (A, B) fires at 20 Hz, generating a 40-Hz AB cell that projects to release but also is matched by the combined output of 2 cells, AD and CB, that each fire at 20 Hz and project to hold. Thus the total synaptic transmission from information-containing cells in the associative layer is equal to the 2 decision-making pools, leading to chance performance. C: synaptic facilitation produces superlinear synaptic transmission, which generates nonlinear input to the 2 decision-making pools, leading to the ability to form reliable decisions. (Arrow thickness represents strength of synaptic transmission.) D: synaptic depression with the same connectivity as static or facilitating synapses causes the opposite pool to win, because the congruous pool is driven by associative layer neurons firing at a higher rate but with lower synaptic transmission, which is sublinear as a function of firing rate or depressing synapses. E: however, with synaptic depression, many neurons can potentiate to the incongruous decision-making pool (e.g., A + B → hold). Given sufficient numbers of such oppositely connected neurons from the associative layer to the decision-making circuit, networks with depressing synapses can drive reliable decisions, because lower synaptic transmission from the cells with the highest firing rates to the incongruous pool allows the congruous pool to win and generate selective potentiation over trials. DA, dopamine; WTA, winner take all.
If the associative layer neurons operate in a linear regime, the response of a cell to its preferred stimulus pair (e.g., its firing rate rAB to stimulus pair A + B, see Table 1) is equal to the sum of two responses to nonpreferred stimulus pairs (e.g., its firing rate rAD to A + D plus firing rate rCB to C + B, see Table 1) (Fig. 1B). The corollary is that on a trial, for example, A + B, AB-selective cells that drive activity in the release pool of the decision-making layer are opposed and matched by equal drive from CB-selective and AD-selective cells, each firing at one-half the rate, promoting activity in the hold pool (Fig. 2A).
Fig. 2.
Impact of nonlinear synaptic transmission as a function of presynaptic activity in a biconditional discrimination task. A: synaptic transmission rates as a function of mean firing rates are shown as solid curves. Static synapses produce linear synaptic transmission (blue line). This is problematic for biconditional discrimination, because on an A + B trial the activity of the most responsive cell (e.g., AB; black marker on blue line) is always equal to the combined activity from 2 cells receiving input from either A or B, but not both (e.g., AD and CB; blue circle at double the value of the blue cross). Thus these synapses produce equal input to the 2 decision-making pools, as shown in B. Facilitating synapses produce superlinear synaptic transmission: r〈PrelF〉 = r(P0F + fFrτF)/(1 + fFrτF) (Dayan and Abbott 2001), which results in the synaptic transmission rates produced by the maximal AB cell (black marker on green line) being greater than the sum of the synaptic transmission rates from the AD and CB cells (green circle at double the value of the green cross). Depressing synapses produce sublinear synaptic transmission rates: r〈PrelD〉 = rP0D/[1 + (1 − Dfrac)rτD], which results in the synaptic transmission rates produced by the maximal AB cell (black marker on red line) being less than the sum of the synaptic transmission rates from the AD and CB cells (red circle at double the value of the red cross). B–D: linearly nonseparable tasks [e.g., exclusive-or (XOR)] such as biconditional discrimination can be aided by the appropriate type of efferent synapse type (blue, static; red, depressing; or green, facilitating) depending on the nonlinearity of the input-output function in the operating regime of neurons in the associative layer. Colored lines represent constant synaptic output as a function of the firing rate of 2 cells, 1 receiving A and B inputs (y-axis) and the other receiving C and D inputs (x-axis). The mean rate of the cells (across all 4 input combinations) is r0. B: neurons with linear responses and static synapses cannot solve linearly nonseparable tasks, because no straight line can separate AB/CD from AD/CB, but the nonlinear synaptic transmission of facilitation or depression can separate the responses to solve the task. C: if neurons in the associative layer have a superlinear response, as produced by the spiking threshold, then responses to the optimal stimulus pair are more than double the response to a single component of the pair. In this case, static synapses are sufficient, but the superlinear synaptic transmission of facilitation enhances separability. Depression, however, with its sublinear synaptic transmission, compromises the threshold nonlinearity, reducing or eliminating any separability achieved. D: if neurons are in a saturating regime, for example, if firing near their maximal rates, the separability achieved by static synapses with linear output is enhanced by depressing synapses but is compromised by facilitating synapses.
However, nonlinearity in the synaptic output as a function of presynaptic firing rate as produced by dynamic synapses (Fig. 2A) can produce preferential input to one of the decision-making pools and improve performance. Short-term facilitation produces superlinear synaptic transmission rates, whereas depression produces sublinear synaptic transmission rates. Because the facilitation function of the rates is superlinear, the synaptic transmission rates to the decision-making network for AB neurons during trials A + B is more than double the sum of the synaptic transmission rates from AD and CB neurons firing at one-half the rate [e.g., F(rAB) > F(rAD) + F(rCB)] (Fig. 2A). Thus, on each trial in the presence of facilitation, the maximally driven cells will reliably drive the correct decision-making pool (Fig. 1C), producing improved task performance (Fig. 3A).
Short-term depression produces the opposite result of synaptic facilitation such that a maximally driven cell produces less synaptic transmission than the sum of that produced by the two secondarily driven cells, which fire at one-half the rate [e.g., D(rAB) < D(rAD) + D(rCB)] (Fig. 2A). This leads to poor decisions if the associative layer neurons become connected to the decision layer neurons via dopaminergic learning (i.e., reward-based plasticity) as shown in Fig. 1, A and D. Intriguingly, if the dopaminergic learning could lead to the connectivity shown in Fig. 1E, with associative layer cells more strongly connected to the incongruous decision-making pool, then the nonlinearity of synaptic depression would be sufficient to solve the task given linear associative layer responses. Indeed, Fig. 3, A and C (left columns, middle rows) demonstrates such a result in the linear associative layer network.
In Fig. 2, B–D, we motivate an expected interaction between any nonlinearity of the firing rate responses of associative layer neurons with nonlinearity in their synaptic transmission to the decision layer neurons. We assume four types of associative layer neurons, identified by their preferred stimulus pair. Thus the network's response to any input combination can be summarized by the firing rates of these four types of cells and expressed as a single point in a 4-dimensional (4-D) space of firing rates. For visualization, we project the 4-D space into 2-D, plotting the rate of the AB-responsive cells against the rate of the CD-responsive cells to each of the four possible stimulus pairs. Figure 2B is an example of such a plot and summarizes the results of Fig. 2A. In particular, the four responses fall on a straight line (the response to A + D is equal to the response to C + B in this projection, so only 3 circles are visible), indicating the linear dependence for all linearly responsive neurons:
for all cells i whose firing rate increases linearly with inputs A, B, C, and D. Equivalently, the combination of firing rates of the four cell types is linearly dependent:
for each stimulus pair X = A, C; Y = B, D. Thus, if synaptic transmission is simply proportional to presynaptic firing rate (dashed blue line denotes constant synaptic transmission), then the stimulus pairs cannot generate opposite responses. However, since the sign of
is positive for the pairs X + Y =A + B and X + Y = C + D and opposite for the pairs X + Y = A + D and X + Y = C + B, synaptic transmission through facilitating synapses does separate these stimulus pairs (green line). Similarly, for depressing synapses,
although the sign of the result is opposite, so the points fall on the opposite side of the line of constant synaptic transmission (red curve) compared with facilitation (green curve).
In Fig. 3, the left columns demonstrate how either facilitating or depressing synaptic transmission can indeed produce high mean reward accumulation from neurons with linear responses. The differences in signs of the nonlinearity lead depressing synapses to connect preferentially to the incongruous decision-making pool (Fig. 3C, left column, middle row), as expected.
We should note that the schematic diagrams of Fig. 2, B–D, have two shortcomings. First, they are a 2-D projection of what is a 4-D space. However, the symmetry in our simplified model means the arguments are still valid; in particular, when the 4 points are linearly dependent, they fall on a 3-D hyperplane of 4-D space but are seen to fall on a line in this 2-D projection. Similarly, when a line can correctly separate the points in the 2-D projection, then any 3-D hyperplane that intersects the line will separate them in the full 4-D space.
The second shortcoming lies in the fact that just because the points can be correctly separated, meaning a set of synaptic weights can be chosen to correctly bias the decision-making pools, it does not mean that all reinforcement learning rules will lead to such a solution. Indeed, Fig. 3 demonstrates a dependence on the reinforcement learning rule. In biological terms, if the DA-modulated reward-based learning rule is purely postsynaptic, then any dependence on presynaptic firing rate must be through synaptic transmission. Thus a depressing synapse would lead to the presynaptic rate dependence of the reinforcement rule being sublinear, whereas for a facilitating synapse it would be superlinear, and for a static synapse it would be linear. Figure 3, A and C, show the results with such a reinforcement rule that matches the synaptic transmission. Alternatively, if we select the reinforcement rule (either linear, sublinear, or superlinear in presynaptic rate) that maximizes reward, we obtain the results from Fig. 3, B and D. The key features of the results do in fact match our insight from nonlinear synaptic transmission rates from Fig. 2, B–D. In particular, for a sublinear (saturating) neural response, the sublinear, depressing synapses produce greatest reward, whereas for superlinear (threshold) neural responses, the superlinear, facilitating synapses produce greatest reward. If the neural response is sublinear, or in some cases, if synapses are depressing, then a negative asymmetry arises in the synaptic connections, meaning that neurons connect preferentially to the incongruous decision-making pool, as in Fig. 3C (middle row).
Biconditional Discrimination: Spiking Network Simulations
Static synapses.
Static synapses from excitatory cells in the associative layer (Fig. 4B) to the decision-making layer (Fig. 4C) served as a control to assess the effect of facilitating or depressing synapses. In the biconditional discrimination task, one untrained associative layer with sparse input (1/20) could produce reliably correct decisions (at least 85% correct) by training only the excitatory synapses from the associative layer to the decision-making layer's excitatory cells via a DA-modulated reward-based Hebbian learning rule (5 other untrained networks produced borderline reliability of 76–84% correct). However, the addition of correlation-based synaptic plasticity (LTPi and triplet STDP), at inputs to and at recurrent synapses within the associative layer, increased stimulus pair selectivity and led to greater reward compared with the untrained associative layer (both with reward-based plasticity) (Bourjaily and Miller 2011a). Multiple trained associative layers (24%, 6 of 25) generated reliably correct decision-making with additional borderline networks (20%, 5 of 25) (Fig. 4F).
Facilitating synapses.
Because facilitating synapses produce superlinear synaptic transmission rates (Fig. 2A), we expected that facilitating synapses would improve task performance relative to static synapses by enhancing the superlinearity already present in the responses of associative layer cells. Indeed, the addition of facilitation increased the number of randomly connected initial networks that produced reliably correct decisions (8%, 2 of 25) as well as the number of borderline networks (20%, 5 of 25). Furthermore, nearly one-half of the trained associative networks (44%, 12 of 25) generated reliable decisions, whereas 20% were borderline (5 of 25) (Fig. 4G). Of the 12 reliable networks, 6 were significantly improved (P < 0.01) over networks using static synapses. Overall, 11 of 25 trained facilitating networks were significantly improved over those using static synapses (P < 0.05), and across all trained networks and input configurations, there was a significant improvement of ∼7% in decision-making accuracy (P < 0.001) in trained networks with facilitating synapses compared with those with static synapses to the decision-making layer.
To test whether facilitation simply increased accuracy only because it increased the strength of input from the associative layer to the decision layer, we varied initial synaptic strength. We observed the same number of reliable networks across a range of −25% to +25% of the default strength and a decrease of 2–3 reliable networks at ±50% of the default value. The loss of reliability arose because either no decision-making pool responded (too little drive) or both pools responded (too much drive), whereas networks with static synapses always produced a single active decision-making pool; indeed, their synaptic input strengths had been optimized to avoid such extremes. Thus the cause of increased performance was not simply a matter of linearly scaling the synaptic input strength.
The addition of facilitating synapses reduced the minimal amount of stimulus pair selectivity necessary to generate reliable decisions to a value of 0.88, compared with the value of 1.23 with static synapses. Furthermore, facilitating synapses reduced the mean time to reach a decision across all networks, on average 52 ms, compared with a value of 118 ms for static synapses, with the difference being significant for all 25 input configurations (P < 0.01 by 2-sample t-test in each case). Thus, for the biconditional discrimination task, facilitation enhanced both speed and accuracy of decisions.
Depressing synapses.
Depressing synapses produce sublinear synaptic transmission rates, counter to the observed superlinear activity responses in the associative network, so we expected they would worsen performance. Indeed, no untrained random associative networks with depressing synapses produced reliable decisions. Furthermore, only two of the trained associative networks (8%, 2 of 25) generated reliable decisions, of which one was significantly worse than the equivalent network with static synapses (P < 0.05), as well as a small number of networks with borderline accuracy (25%, 7 of 25) (Fig. 4E). Eight of 25 trained networks with depressing synapses were significantly worse than those with static synapses (P < 0.05), and there was a corresponding significant decrement in average decision-making accuracy of ∼13% (P < 0.001). The reduction in accuracy corresponded to a need for greater stimulus pair selectivity, a minimal value of 1.35, for reliable responses. Reaction times were also significantly longer for networks with depressing synapses (mean 158 ms) compared with static synapses, (mean 118 ms), with 17 of 25 networks producing significantly slower responses (P < 0.01). Thus depressing synapses worsened performance by both increasing reaction time and decreasing accuracy.
Selective potentiation of afferent synapses to the decision-making network.
We quantified selective potentiation across networks by the strength of afferent synapses to the congruous decision-making pool, that is, the decision-making pool, producing reward for the preferred stimuli of cells (e.g., the strength of synapse from an AB-tuned cell to “release”), versus that to the incongruous decision-making pool (e.g., the strength of synapse from an AB-tuned cell to “hold”). Compared with control networks, those with facilitating synapses produced an increase in the number of neurons projecting to the congruous decision-making pool and a corresponding decrease in the percentage of synapses to the incongruous decision-making pool. For networks with depressing synapses, we found the opposite to be the case and indeed found many cells to be connected preferentially to the incongruous decision-making pool (cf. Figs. 1E and 3C).
Stimulus-Response Association: Spiking Network Simulations
We simulated a more common task, stimulus-response association, to assess the value of nonlinear synaptic transmission when the task is linearly separable. Only a single stimulus was shown per trial and required to drive a decision. Because single-stimulus selectivity is inherently greater than stimulus pair selectivity, many untrained random associative layers with static synapses could produce reliable decision-making responses (84%, 21 of 25), a number that did not change upon training the associative layer with LTPi and triplet STDP. However, facilitating synapses caused all but one of the trained networks (96%, 24 of 25) to produce reliable decision-making, without affecting the number of reliable untrained networks. In the same task, depressing synapses worsened accuracy, with 76% (19 of 25 networks) producing reliable decision-making from both trained and untrained associative layers. Thus again, we found facilitation to be beneficial and depression to be detrimental to performance.
Two-Stimulus Discrimination Task: Firing Rate Model
Models of decision-making have mostly been based on a task requiring a comparison of the magnitude of two distinct stimuli (Ratcliff 1978; Usher and McClelland 2001; Wang 2002), such as whether coherence of rightward or leftward motion is greater in a pattern of randomly flickering dots (Shadlen and Newsome 1996). Thus we assessed how accuracy in such a task was affected by dynamic synapses at the inputs to the decision-making circuit. Figure 5A indicates that facilitating synapses boosted decision-making accuracy, whereas depressing synapses reduced accuracy. We found this to be true both when all of the noise in the system was in the inputs (using a deterministic decision-making circuit) as well as when the decision-making circuit itself was noisy. The deterministic effect was a straightforward consequence of the nonlinearity of synaptic transmission of dynamic synapses as a function of firing rate: facilitation enhanced whereas depression decreased the difference in postsynaptic conductance resulting from a difference in presynaptic firing rates (Dayan and Abbott 2001).
Fig. 5.
Facilitating synapses enhance accuracy in a 2-stimulus discrimination task. A–C: inputs differing by 5% of their mean are transmitted to separate decision-making pools through firing rate model cell groups, which respond either linearly (A), superlinearly (B), or sublinearly (C). Baseline synaptic strength of the inputs to the decision-making circuit is varied and decision-making accuracy is measured for input synapses that are static (blue), facilitating (green), or depressing (red). Performance peaks for a given total input that produces activation of just 1 of the 2 decision-making pools (no pools are active if input is too weak, whereas both are active if input is too strong). D: signal-to-noise ratio (SNR) of the postsynaptic conductance resulting from a Poisson input train is enhanced through facilitating synapses (green) and reduced through depressing synapses (red) compared with static synapses (blue). Solid lines indicate simulated data; dashed lines represent the analytic formula, assuming the synaptic time constant is much shorter than interspike intervals, an approximation that fails at high firing rates (see Appendix). Only the steady-state responses are measured and compared. Signal is produced by a difference in the presynaptic firing rate equal to 5% of the mean rate.
Dynamic synapses increase noise in the postsynaptic conductance, because they add variability to the amplitude of conductance pulses in addition to the variability in timing of spikes (see Lindner et al. 2009 and Appendix). However, for the typical parameters used, and even if all noise was at the level of inputs, via a combination of Poisson spike timing and fluctuating input firing rates, facilitating synapses led to higher decision-making accuracy (Fig. 5A) by increasing the signal-to-noise ratio (Fig. 5B). The benefit of facilitating synapses was even more striking if noise was greater within the decision-making circuitry than at the inputs, because in that case the deterministic enhancement of the signal by synaptic facilitation dominated its effect (data not shown). For all conditions with which we have studied the two-stimulus discrimination task, depressing input synapses reduced accuracy, as expected, because they reduced the downstream effect of any difference between input signals (Dayan and Abbott 2001) as well as enhancing noise (Lindner et al. 2009).
DISCUSSION
Short-term plasticity is a well-established feature of synapses in cortical circuits (Hempel et al. 2000; Markram et al. 1998b; Tsodyks and Markram 1997; Wang et al., 2006). Although presynaptic frequency-dependent dynamic changes in effective synaptic strength may be an inevitable consequence of the biological mechanisms underlying synaptic transmission, theoretical studies have shown that these dynamics, on a short timescale, can be extremely beneficial for multiple cortical functions (Maass and Zador 1999): filtering inputs in the frequency domain (Farajidavar et al. 2008; Matveev and Wang 2000; Natschlager et al. 2001; Tsodyks et al. 1998), dynamically controlling gain to enhance sensitivity to changes in inputs (Abbott et al. 1997), producing a temporal lag allowing for motion direction selectivity (Chance et al. 1998), providing a necessary slow time constant for memory maintenance (Barak and Tsodyks 2007; Martinez-Garcia et al. 2011; Mongillo et al. 2008), and producing an optimal estimator of presynaptic membrane potential dynamics (Pfister et al. 2010), among others. Here, we show how facilitating synapses can produce improved classifiers based on a nonlinear combination of presynaptic firing rates. Specifically, facilitation improves task performance by selectively enhancing synaptic transmission to a decision-making circuit from those input cells most responsive to a given stimulus. On the contrary, depressing synapses worsen performance by relatively enhancing synaptic transmission from less responsive cells. The effects of facilitation and depression depended monotonically on the time constant and amplitude of short-term plasticity. Enhancing the amplitude of facilitation and depression boosted their respective positive and negative changes in task performance more so than increasing the time constants.
Biconditional discrimination employs XOR logic, making it linearly nonseparable; synaptic facilitation provides nonlinearity necessary to solve such a task, even when associative layer neural responses are a linear combination of their inputs. In principle, any nonlinearity should allow a network to produce suitable performance in an XOR-like/linearly nonseparable task; thus we assessed whether depressing synapses could also be beneficial given the proper conditions. Indeed, for linearly responsive neurons, depressing synapses lead to good performance if associative layer neurons connect most strongly to the decision-making pool that produces the incongruous response to that of their preferred stimulus (e.g., AB associative neurons to “hold” decision pool neurons) (Figs. 1E and 5). Consistent with this expectation based on linear neurons, in simulated networks of spiking neurons trained with depressing synapses, we found an excess of these “incongruous” connected neurons. However, performance in these simulated networks proved to be worse with depressing synapses, because the saturating effect of synaptic depression on neural output opposed the superlinearity of the firing rate curve of spiking neurons in the associative layer. Thus, although synaptic facilitation enhanced the existing nonlinearity in the response of a biophysically based neural circuit, enhancing performance, synaptic depression reduced the nonlinearity, producing worse performance.
In addition to testing the afferent synapses to the decision-making network from the associative layer, we also incorporated short-term plasticity at the associative layer synapses plastic for triplet STDP and LTPi, independently/in turn. However, we did not find that the addition of short-term plasticity at these locations to and within the associative layer led to any appreciable difference in decision-making performance compared with static synapses as when applied to afferent synapses to the decision-making network, as presented here.
Many decision-making studies have looked at single-stimulus discrimination tasks based on motion coherence (Mensh et al. 2004; Wang 2002), natural images (Seung 2003; Theodoni et al. 2,011), and orientation (Suh et al. 2003), among others. Thus we included a stimulus-response association task and a discrimination task, to generalize our findings. Although responding selectively to distinct single stimuli is much easier than to stimulus pairs, facilitation and depression of inputs to the decision-making circuit still improved and worsened performance, respectively, in such tasks. Similarly, in a model of a discrimination task based on motion coherence (Shadlen and Newsome 1996), facilitation of synaptic input to the decision-making circuit enhanced discriminability, even when all of the noise in the system was in the stimulus and so transmitted through the same facilitating synapses, whereas depression of input synapses decreased discriminability (Fig. 5).
In summary, we find for three distinct types of tasks that networks with facilitating afferent synapses to a biologically based decision-making circuit produce responses that are both significantly more accurate (i.e., more often correct) and significantly quicker than those with either static or depressing synapses. In linearly nonseparable tasks, the superlinear synaptic transmission of facilitating synapses (Dayan and Abbott 2001) is advantageous because it boosts the inherent superlinearity in neural firing rates (Anderson et al. 2000; Carandini and Ferster 2000; Miller and Troyer 2002). In linearly separable tasks, facilitation can boost the signal-to-noise ratio by increasing the signal without producing a concomitant increase in the input noise (Lindner et al. 2009). Thus we expect synapses from associative areas to decision-making areas to be facilitating rather than depressing, as has been shown in hippocampal projections to prefrontal cortex (Mulder et al. 1997) and internal prefrontal connections (Berger et al. 2009; Hempel et al. 2000; Wang et al. 2006). We suggest that the optimal dynamics of a synapse may reflect not only the statistics of the presynaptic membrane potential (Pfister et al. 2010) but also the type of information it needs to transmit, given the computational function of the circuit.
GRANTS
We are grateful for financial support for this work from National Institute of Deafness and Other Communicative Diseases Award DC009945, the Department of Biology, Brandeis University, and from a National Science Foundation Integrative Graduate Education and Research Traineeship Award to the Neuroscience Graduate Program of Brandeis University.
DISCLOSURES
No conflicts of interest, financial or otherwise, are declared by the authors.
ENDNOTE
At the request of the authors, readers are herein alerted to the fact that additional materials related to this manuscript may be found at the institutional website of one of the authors, which at the time of publication they indicate is: http://people.brandeis.edu/∼pmiller/code.html. These materials are not a part of this manuscript, and have not undergone peer review by the American Physiological Society (APS). APS and the journal editors take no responsibility for these materials, for the website address, or for any links to or from it.
AUTHOR CONTRIBUTIONS
Author contributions: M.A.B. and P.M. conception and design of research; M.A.B. and P.M. performed experiments; M.A.B. and P.M. analyzed data; M.A.B. and P.M. interpreted results of experiments; M.A.B. and P.M. prepared figures; M.A.B. and P.M. drafted manuscript; M.A.B. and P.M. edited and revised manuscript; M.A.B. and P.M. approved final version of manuscript.
Supplementary Material
APPENDIX: SIGNAL-TO-NOISE RATIO FOR THE DIFFERENCE IN SYNAPTIC INPUT BETWEEN TWO POISSON SPIKE TRAINS
A decision-making circuit produces a decision based on the difference between two sets of synaptic inputs. We define a signal-to-noise ratio (SNR) for the difference between two synaptic conductance inputs, g1 and g2, from two presynaptic cells with respective rates r1 and r2, as
| (A1) |
where 〈g1〉 and 〈g2〉 are the expected means of the two inputs with standard deviations σ1 and σ2, respectively. Note that any static scaling of the conductance of synapses will scale the means and standard deviations equally, not affecting the SNR.
However, the nonlinear rate-based filtering of a facilitating synapse increases the mean conductance of one set of inputs more than it does the other. In particular, if r1 > r2, then synaptic conductances are scaled, respectively, by the facilitation variables F1 and F2, with F1 > F2. Yet, because spike trains are not regular, the dynamics of facilitating synapses produces spike-to-spike variability in the postsynaptic conductance, which adds to the variability in postsynaptic input. Thus we must ask whether the boost to the signal produced by facilitating synapses is greater than the boost to the postsynaptic noise. We follow the methods of Lindner et al. (2009) below, to produce the formulas for mean and variance of synaptic transmission at steady state, via the synapses of our model.
Postsynaptic responses are defined by the conductance g(t), which increases to g0F(t−)D(t−) immediately following a presynaptic spike at time t and then decays exponentially according to
| (A2) |
We assume the time constant τs is significantly shorter than the interspike interval. For AMPA receptor-mediated synapses, τs = 2 ms, so this condition is met for most realistic presynaptic spike trains. Note that F(t−) = 1 for static and depressing synapses, whereas D(t−) = 1 for static and facilitating synapses. In the following, we reproduce the analysis for facilitating synapses and later state the corresponding results from the equivalent derivation for depressing synapses.
For a spike train of average firing rate r, the mean postsynaptic conductance is
| (A3) |
where 〈F−〉 = 〈F(t−)〉 is the expected mean facilitation variable immediately prior to a spike. The difference in mean conductance of two inputs to a decision-making circuit (the numerator in Eq. A1) is
| (A4) |
where we assume Δr ≪ r̄ and r1 = r̄ + Δr/2, r2 = r̄ − Δr/2, 〈F1〉 = F̄ + 〈ΔF〉/2, and 〈F2〉 = F̄ − 〈ΔF〉/2.
The variance in postsynaptic conductance is calculated as
| (A5) |
where σF2 is the variance in F. Thus the variance in conductance can be calculated as
| (A6) |
where we have assumed rτs ≪ 1. It can be seen for synaptic facilitation that F > 1, so the conductance variance, and hence noise in the postsynaptic cell, is higher because of both a multiplicative increase in the conductance (first term) and, through σF2, the variance in F. For the input to a decision-making circuit, the variance (the noise term) in the denominator of Eq. A1 is increased as
| (A7) |
Combining Eqs. A4 and A7 with Eq. A1 leads to the SNR for dynamic synapses,
| (A8) |
which requires us to calculate the mean and variance of the facilitation variable, F.
We note that F decays to 1 between spikes with a time constant τF as
| (A9) |
whereas immediately following a spike, it increases from its prior value, F−, to F+ according to
| (A10) |
where fF is the facilitation factor indicating the amount of facilitation and p0 is the initial, baseline vesicle release probability in the absence of spikes. The term in brackets prevents the release probability increasing above unity so that the maximum value of facilitation, Fmax, is 1/p0. The postsynaptic conductance produced by each spike is proportional to F−, the value of F immediately before the spike, whose mean and variance we now calculate.
For a Poisson train, the expected value of F− is given by (see Dayan and Abbott 2001)
| (A11) |
Thus, for the two input trains to a decision-making circuit, the difference in facilitation factor becomes
| (A12) |
From calculations similar to those leading to Eq. A9 (see Dayan and Abbott 2001 and also Lindner et al. 2009), the variance in F− is given by
| (A13) |
and thus the standard deviation in F− can be written as
| (A14) |
which becomes much smaller than the increase in synaptic response (proportional to 〈F−〉) if rτF is large compared with 1. Because the time constant for facilitation is on the order of hundreds of milliseconds, for typical firing rates needed to drive a decision-making circuit, the added variability in synaptic response due to variation in excitatory postsynaptic potential (EPSP) amplitudes is small compared with the boost in EPSP amplitude.
Substituting Eqs. A11, A12, and A14 into Eq. A8 and assuming Δr ≪ r̄ leads to the SNR for the difference in Poisson spike trains with facilitating synapses as
| (A15) |
Dashed curves in Fig. 5D are plotted using Eq. A15 with fF = 0.4 and fF = 0 for facilitating and static synapses, respectively, demonstrating a boost to SNR by synaptic facilitation.
Depressing synapses, with a fraction Dfrac of vesicles released per spike and recovery with time constant τD, lead to similar results, but the equivalent of 〈ΔF〉 is negative,
| (A16) |
so depressing synapses reduce the signal as well as increasing the noise. Thus, for depressing synapses, the SNR is always reduced by an amount that can be calculated as
| (A17) |
to give
| (A18) |
which is shown as the dashed red curve in Fig. 5D, with Dfrac = 0.5 and τD = 200 ms.
REFERENCES
- Abbott LF, Varela JA, Sen K, Nelson SB. Synaptic depression and cortical gain control. Science 275: 220–224, 1997 [DOI] [PubMed] [Google Scholar]
- Abeles M. Corticonics: Neural Circuits of the Cerebral Cortex. Cambridge, UK: Cambridge University Press, 1991 [Google Scholar]
- Albantakis L, Deco G. Changes of mind in an attractor network of decision-making. PLoS Comput Biol 7: e1002086, 2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Alkon DL, Blackwell KT, Barbour GS, Rigler AK, Vogl TP. Pattern-recognition by an artificial network derived from biologic neuronal systems. Biol Cybern 62: 363–376, 1990 [DOI] [PubMed] [Google Scholar]
- Anderson JS, Lampl I, Gillespie DC, Ferster D. The contribution of noise to contrast invariance of orientation tuning in cat visual cortex. Science 290: 1968–1972, 2000 [DOI] [PubMed] [Google Scholar]
- Barak O, Tsodyks M. Persistent activity in neural networks with dynamic synapses. PLoS Comput Biol 3: e35, 2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berger TK, Perin R, Silberberg G, Markram H. Frequency-dependent disynaptic inhibition in the pyramidal network: a ubiquitous pathway in the developing rat neocortex. J Physiol 587: 5411–5425, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bi GQ, Poo MM. Synaptic modifications in cultured hippocampal neurons: dependence on spike timing, synaptic strength, and postsynaptic cell type. J Neurosci 18: 10464–10472, 1998 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bourjaily MA, Miller P. Synaptic plasticity and connectivity requirements to produce stimulus-pair specific responses in recurrent networks of spiking neurons. PLoS Comput Biol 7: e1001091, 2011a [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bourjaily MA, Miller P. Excitatory, inhibitory, and structural plasticity produce correlated connectivity in random networks trained to solve paired-stimulus tasks. Front Comput Biol 5: 37, 2011b [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bunsey M, Eichenbaum H. Critical role of the parahippocampal region for paired-associate learning in rats. Behav Neurosci 107: 740–747, 1993 [DOI] [PubMed] [Google Scholar]
- Burges CJ. A tutorial on support vector machines for pattern recognition. Data Min Knowl Discov 2: 121–167, 1998 [Google Scholar]
- Carandini M, Ferster D. Membrane potential and firing rate in cat primary visual cortex. J Neurosci 20: 470–484, 2000 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chance FS, Nelson SB, Abbott LF. Synaptic depression and the temporal response characteristics of V1 cells. J Neurosci 18: 4785–4799, 1998 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chapman RS, Seung HK, Schwartz SE, Bird EK. Predicting language production in children and adolescents with Down syndrome: the role of comprehension. J Speech Lang Hear Res 43: 340–350, 2000 [DOI] [PubMed] [Google Scholar]
- Cisek P, Puskas GA, El-Murr S. Decisions in changing conditions: the urgency-gating model. J Neurosci 29: 11560–11571, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Compte A, Brunel N, Goldman-Rakic PS, Wang XJ. Synaptic mechanisms and network dynamics underlying spatial working memory in a cortical network model. Cereb Cortex 10: 910–923, 2000 [DOI] [PubMed] [Google Scholar]
- Conboy L, Bisaz R, Markram K, Sandi C. Role of NCAM in emotion and learning. Adv Exp Med Biol 663: 271–296, 2010 [DOI] [PubMed] [Google Scholar]
- Curtis CE, Lee D. Beyond working memory: the role of persistent activity in decision making. Trends Cogn Sci 14: 216–222, 2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daw ND, O'Doherty JP, Dayan P, Seymour B, Dolan RJ. Cortical substrates for exploratory decisions in humans. Nature 441: 876–879, 2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dayan P, Abbott LF. Theoretical Neuroscience. Cambridge, MA: MIT Press, 2001 [Google Scholar]
- Eckhoff P, Holmes P, Law C, Connolly PM, Gold JI. On diffusion processes with variable drift rates as models for decision making during learning. New J Phys 10: nihpa49499, 2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farajidavar A, Saeb S, Behbehani K. Incorporating synaptic time-dependent plasticity and dynamic synapse into a computational model of wind-up. Neural Netw 21: 241–249, 2008 [DOI] [PubMed] [Google Scholar]
- Feng S, Holmes P, Rorie A, Newsome WT. Can monkeys choose optimally when faced with noisy stimuli, and unequal rewards? PLoS Comput Biol 5: e1000284, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiete IR, Hahnloser RH, Fee MS, Seung HS. Temporal sparseness of the premotor drive is important for rapid learning in a neural network model of birdsong. J Neurophysiol 92: 2274–2282, 2004 [DOI] [PubMed] [Google Scholar]
- Glimcher PW, Rustichini A. Neuroeconomics: the consilience of brain, and decision. Science 306: 447–452, 2004 [DOI] [PubMed] [Google Scholar]
- Gold JI, Shadlen MN. The neural basis of decision making. Annu Rev Neurosci 30: 535–574, 2007 [DOI] [PubMed] [Google Scholar]
- Grand C, Honey RC. Solving XOR. J Exp Psychol Anim Behav Process 34: 486–493, 2008 [DOI] [PubMed] [Google Scholar]
- Grillner S, Markram H, De Schutter E, Silberberg G, LeBeau FE. Microcircuits in action—from CPGs to neocortex. Trends Neurosci 28: 525–533, 2005 [DOI] [PubMed] [Google Scholar]
- Harris JA. Elemental representations of stimuli in associative learning. Psychol Rev 113: 584–605, 2006 [DOI] [PubMed] [Google Scholar]
- Harris JA, Livesey EJ, Gharaei S, Westbrook RF. Negative patterning is easier than a biconditional discrimination. J Exp Psychol Anim Behav Process 34: 494–500, 2008 [DOI] [PubMed] [Google Scholar]
- Hasselmo ME, Cekic M. Suppression of synaptic transmission may allow combination of associative feedback and self-organizing feedforward connections in the neocortex. Behav Brain Res 79: 153–161, 1996 [DOI] [PubMed] [Google Scholar]
- Hempel CM, Hartman KH, Wang XJ, Turrigiano GG, Nelson SB. Multiple forms of short-term plasticity at excitatory synapses in rat medial prefrontal cortex. J Neurophysiol 83: 3031–3041, 2000 [DOI] [PubMed] [Google Scholar]
- Histed MH, Pasupathy A, Miller EK. Learning substrates in the primate prefrontal cortex, and striatum: sustained activity related to successful actions. Neuron 63: 244–253, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iannella N, Back AD. A spiking neural network architecture for nonlinear function approximation. Neural Netw 14: 933–939, 2001 [DOI] [PubMed] [Google Scholar]
- Jahr CE, Stevens CF. Voltage dependence of NMDA-activated macroscopic conductances predicted by single-channel kinetics. J Neurosci 10: 3178–3182, 1990 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jay TM. Dopamine: a potential substrate for synaptic plasticity and memory mechanisms. Prog Neurobiol 69: 375–390, 2003 [DOI] [PubMed] [Google Scholar]
- Kennerley SW, Walton ME. Decision making, and reward in frontal cortex: complementary evidence from neurophysiological and neuropsychological studies. Behav Neurosci 125: 297–317, 2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee D, Seo H. Mechanisms of reinforcement learning and decision making in the primate dorsolateral prefrontal cortex. Ann NY Acad Sci 1104: 108–122, 2007 [DOI] [PubMed] [Google Scholar]
- Lindner B, Gangloff D, Longtin A, Lewis JE. Broadband coding with dynamic synapses. J Neurosci 29: 2076–2088, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maass W, Zador AM. Dynamic stochastic synapses as computational units. Neural Comput 11: 903–917, 1999 [DOI] [PubMed] [Google Scholar]
- Maffei A, Nataraj K, Nelson SB, Turrigiano GG. Potentiation of cortical inhibition by visual deprivation. Nature 443: 81–84, 2006 [DOI] [PubMed] [Google Scholar]
- Markram H. A network of tufted layer 5 pyramidal neurons. Cereb Cortex 7: 523–533, 1997 [DOI] [PubMed] [Google Scholar]
- Markram H, Gupta A, Uziel A, Wang Y, Tsodyks M. Information processing with frequency-dependent synaptic connections. Neurobiol Learn Mem 70: 101–112, 1998a [DOI] [PubMed] [Google Scholar]
- Markram H, Tsodyks M. Redistribution of synaptic efficacy between neocortical pyramidal neurons. Nature 382: 807–810, 1996 [DOI] [PubMed] [Google Scholar]
- Markram H, Wang Y, Tsodyks M. Differential signaling via the same axon of neocortical pyramidal neurons. Proc Natl Acad Sci USA 95: 5323–5328, 1998b [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martinez-Garcia M, Rolls ET, Deco G, Romo R. Neural and computational mechanisms of postponed decisions. Proc Natl Acad Sci USA 108: 11626–11631, 2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matveev V, Wang XJ. Differential short-term synaptic plasticity and transmission of complex spike trains: to depress or to facilitate? Cereb Cortex 10: 1143–1153, 2000 [DOI] [PubMed] [Google Scholar]
- Melchers KG, Shanks DR, Lachnit H. Stimulus coding in human associative learning: flexible representations of parts and wholes. Behav Processes 77: 413–427, 2008 [DOI] [PubMed] [Google Scholar]
- Mensh BD, Aksay E, Lee DD, Seung HS, Tank DW. Spontaneous eye movements in goldfish: oculomotor integrator performance, plasticity, and dependence on visual feedback. Vision Res 44: 711–726, 2004 [DOI] [PubMed] [Google Scholar]
- Miller KD, Troyer TW. Neural noise can explain expansive, power-law nonlinearities in neural response functions. J Neurophysiol 87: 653–659, 2002 [DOI] [PubMed] [Google Scholar]
- Minsky M, Papert S. Perceptrons: An Introduction to Computational Geometry. Cambridge, MA: MIT Press, 1987 [Google Scholar]
- Mongillo G, Barak O, Tsodyks M. Synaptic theory of working memory. Science 319: 1543–1546, 2008 [DOI] [PubMed] [Google Scholar]
- Mulder AB, Arts MP, Lopes da Silva FH. Short- and long-term plasticity of the hippocampus to nucleus accumbens and prefrontal cortex pathways in the rat, in vivo. Eur J Neurosci 9: 1603–1611, 1997 [DOI] [PubMed] [Google Scholar]
- Natschlager T, Maass W, Zador A. Efficient temporal processing with biologically realistic dynamic synapses. Network 12: 75–87, 2001 [PubMed] [Google Scholar]
- Nelson SB, Turrigiano GG. Synaptic depression: a key player in the cortical balancing act. Nat Neurosci 1: 539–541, 1998 [DOI] [PubMed] [Google Scholar]
- Nordlie E, Gewaltig MO, Plesser HE. Towards reproducible descriptions of neuronal network models. PLoS Comput Biol 5: e1000456, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park H, Amari SI, Fukumizu K. Adaptive natural gradient learning algorithms for various stochastic models. Neural Netw 13: 755–764, 2000 [DOI] [PubMed] [Google Scholar]
- Pfister JP, Dayan P, Lengyel M. Synapses with short-term plasticity are optimal estimators of presynaptic membrane potentials. Nat Neurosci 13: 1271–1275, 2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pfister JP, Gerstner W. Triplets of spikes in a model of spike timing-dependent plasticity. J Neurosci 26: 9673–9682, 2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ratcliff R. A theory of memory retrieval. Psychol Rev 85: 59–108, 1978 [Google Scholar]
- Reynolds JN, Wickens JR. Dopamine-dependent plasticity of corticostriatal synapses. Neural Netw 15: 507–521, 2002 [DOI] [PubMed] [Google Scholar]
- Rigotti M, Ben Dayan Rubin D, Morrison SE, Salzman CD, Fusi S. Attractor concretion as a mechanism for the formation of context representations. Neuroimage 52: 833–847, 2010a [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rigotti M, Rubin DB, Wang XJ, Fusi S. Internal representation of task rules by recurrent dynamics: the importance of the diversity of neural responses. Front Comput Neurosci 4: 24, 2010b [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rolls ET, Grabenhorst F, Deco G. Decision-making, errors, and confidence in the brain. J Neurophysiol 104: 2359–2374, 2010 [DOI] [PubMed] [Google Scholar]
- Sakai K, Miyashita Y. Neural organization for the long-term memory of paired associates. Nature 354: 152–155, 1991 [DOI] [PubMed] [Google Scholar]
- Sakai Y, Okamoto H, Fukai T. Computational algorithms and neuronal network models underlying decision processes. Neural Netw 19: 1091–1105, 2006 [DOI] [PubMed] [Google Scholar]
- Sanderson DJ, Pearce JM, Kyd RJ, Aggleton JP. The importance of the rat hippocampus for learning the structure of visual arrays. Eur J Neurosci 24: 1781–1788, 2006 [DOI] [PubMed] [Google Scholar]
- Seung HS. Learning in spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron 40: 1063–1073, 2003 [DOI] [PubMed] [Google Scholar]
- Shadlen MN, Newsome WT. Motion perception: seeing and deciding. Proc Natl Acad Sci USA 23: 628–633, 1996 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen W, Flajolet M, Greengard P, Surmeier DJ. Dichotomous dopaminergic control of striatal synaptic plasticity. Science 321: 848–851, 2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soltani A, Wang XJ. A biophysically-based neural model of matching law behavior: melioration by stochastic synapses. J Neurosci 26: 3731–3744, 2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soltani A, Wang XJ. From biophysics to cognition: reward-dependent adaptive choice behavior. Curr Opin Neurobiol 18: 209–216, 2008 [DOI] [PubMed] [Google Scholar]
- Soltani A, Wang XJ. Synaptic computation underlying probabilistic inference. Nat Neurosci 13: 112–119, 2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suh YG, Lee YS, Min KH, Park OH, Seung HS, Kim HD, Park HG, Choi J, Lee J, Kang SW, Oh UT, Koo JY, Joo YH, Kim SY, Kim JK, Park YH. Novel non-vanilloid VR1 antagonist of high analgesic effects, and its structural requirement for VR1 antagonistic effects. Bioorg Med Chem Lett 13: 4389–4393, 2003 [DOI] [PubMed] [Google Scholar]
- Takeda M, Naya Y, Fujimichi R, Takeuchi D, Miyashita Y. Active maintenance of associative mnemonic signal in monkey inferior temporal cortex. Neuron 48: 839–848, 2005 [DOI] [PubMed] [Google Scholar]
- Theodoni P, Kovacs G, Greenlee MW, Deco G. Neuronal adaptation effects in decision making. J Neurosci 31: 234–246, 2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tsodyks M, Pawelzik K, Markram H. Neural networks with dynamic synapses. Neural Comput 10: 821–835, 1998 [DOI] [PubMed] [Google Scholar]
- Tsodyks MV, Markram H. The neural code between neocortical pyramidal neurons depends on neurotransmitter release probability. Proc Natl Acad Sci USA 94: 719–723, 1997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tuckwell HC. Introduction to Theoretical Neurobiology. Cambridge, UK: Cambridge Univ. Press, 1988 [Google Scholar]
- Turrigiano GG, Leslie KR, Desai NS, Rutherford LC, Nelson SB. Activity-dependent scaling of quantal amplitude in neocortical neurons. Nature 391: 892–896, 1998 [DOI] [PubMed] [Google Scholar]
- Usher M, McClelland JL. The time course of perceptual choice: the leaky, competing accumulator model. Psychol Rev 108: 550–592, 2001 [DOI] [PubMed] [Google Scholar]
- Wang XJ. Probabilistic decision making by slow reverberation in cortical circuits. Neuron 36: 955–968, 2002 [DOI] [PubMed] [Google Scholar]
- Wang XJ. Decision making in recurrent neuronal circuits. Neuron 60: 215–234, 2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y, Markram H, Goodman PH, Berger TK, Ma J, Goldman-Rakic PS. Heterogeneity in the pyramidal network of the medial prefrontal cortex. Nat Neurosci 9: 534–542, 2006 [DOI] [PubMed] [Google Scholar]
- Wong KF, Wang XJ. A recurrent network mechanism of time integration in perceptual decisions. J Neurosci 26: 1314–1328, 2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xie X, Hahnloser RH, Seung HS. Double-ring network model of the head-direction system. Phys Rev E Stat Nonlin Soft Matter Phys 66: 041902, 2002 [DOI] [PubMed] [Google Scholar]
- Xie X, Seung HS. Learning in neural networks by reinforcement of irregular spiking. Phys Rev E Stat Nonlin Soft Matter Phys 69: 041909, 2004 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





