Abstract
Several integrate-to-threshold models with differing temporal integration mechanisms have been proposed to describe the accumulation of sensory evidence to a prescribed level prior to motor response in perceptual decision-making tasks. An experiment and simulation studies have shown that the introduction of time-varying perturbations during integration may distinguish among some of these models. Here, we present computer simulations and mathematical proofs that provide more rigorous comparisons among one-dimensional stochastic differential equation models. Using two perturbation protocols and focusing on the resulting changes in the means and standard deviations of decision times, we show that, for high signal-to-noise ratios, drift-diffusion models with constant and time-varying drift rates can be distinguished from Ornstein-Uhlenbeck processes, but not necessarily from each other. The protocols can also distinguish stable from unstable Ornstein-Uhlenbeck processes, and we show that a nonlinear integrator can be distinguished from these linear models by changes in standard deviations. The protocols can be implemented in behavioral experiments.
Keywords: computational modeling, decision making, first passage time, neural integrator, pulse perturbation, reaction time
1 Introduction
Reaction time tasks have long been used to study human decision making (Luce, 1986). One paradigm requires subjects to detect or discriminate sensory signals by making a voluntary motor response. Response time distributions obtained in such perceptual-motor tasks allow inference of cognitive information processing (Posner, 1978). With the advent of methods for recording neural activity in awake behaving animals, such tasks have been adopted by neurophysiologists, especially in studies on monkeys (Schall, 2003; Gold and Shadlen, 2007), and neuronal firing rates in several brain areas have been shown to correlate with motor responses. The lateral intraparietal area, frontal eye fields, and superior colliculus (Gold and Shadlen, 2007) all exhibit activities that ramp up over time toward a fixed level before a decision is signaled (e.g. by a saccadic eye movement in the direction of the recorded response field) (Hanes and Schall, 1996; Roitman and Shadlen, 2002; Churchland et al., 2008). The slopes of the ramps not only correlate with task difficulty (the harder the task, the lower the slope), but also with response time (higher slopes precede faster responses). These areas may therefore provide neural substrates for integrating sensory information toward a decision criterion before a perceptual decision is made.
Various integrate-to-threshold models have been proposed to describe both response times and neurobiological mechanisms (Smith and Ratcliff, 2004), including drift-diffusion models (Ratcliff, 1978; Mazurek et al., 2003; Smith and Ratcliff, 2004; Ditterich, 2006a; Simen et al., 2006; Ratcliff and McKoon, 2008) and attractor neural networks with mutual inhibition (Brown and Holmes, 2001; Usher and McClelland, 2001; Wang, 2002; Wong and Wang, 2006; Lo and Wang, 2006). All share a common mechanism: accumulation or integration of sensory inputs toward a prescribed threshold, the first crossing of which determines the decision and response time. These models may nonetheless be distinguished by the details of the integration process: drift-diffusion models typically accumulate evidence at a constant rate like a biased random walk, while attractor networks have unstable or stable steady-states that can, respectively, accelerate or decelerate the integration process (Usher and McClelland, 2001; Wong and Wang, 2006).
Often only behavioral data are collected for human subjects, and fits based on response times and choice accuracies are sometimes unable to distinguish among competing models (Ratcliff et al., 1999; Ratcliff, 2006). Moreover, while cellular recordings in awake behaving animals offer direct insights into the integration process, fitting of both behavioral and neural data may still not suffice, especially when models incorporate multiple features and depend on multiple parameters. For example, a drift-diffusion model with a time-dependent ‘urgency’ signal (Ditterich, 2006a; Churchland et al., 2008) may be difficult to separate from a recurrent network model with strong self-excitation (Wang, 2002; Wong and Wang, 2006). Thus far, few principled attempts have been made to tease apart different integration mechanisms.
Subthreshold electrical microstimulation of neural activities in behaving animals may provide more conclusive tests (Cohen and Newsome, 2004). In Ditterch et al. (2003), microstimulation of sensory neurons was shown to affect the speed of the decision. More interestingly, in Hanks et al. (2006), stimulation of cells with choice targets in the recorded neuronal response fields speeded decisions, but also reduced decision speeds when choice target directions were opposite to the recorded response fields, thus providing evidence of mutual inhibition. Perturbations need not be invasive, and so can be used in human studies: e.g., Huk and Shadlen (2005) employed a brief motion pulse in the background of the primary visual stimulus; although it neither determined the choice nor influenced rewards, the pulse had a significant effect on response times.
Earlier modeling efforts (Huk and Shadlen, 2005; Wong et al., 2007; Wong and Huk, 2008) have addressed the data of Huk and Shadlen (2005), but they employed many parameters and direct comparisons were not made between models. Here, we conduct a more rigorous study using simpler models, and seek more objective comparisons among them. We approximate four neural firing rate models as linear, scalar, stochastic differential equations (SDE), and we ask how their predicted response times are affected by short piecewise-constant perturbations with varying onset times and amplitudes. By comparing changes in means and standard deviations of response times, we demonstrate that the perturbations suffice to distinguish among the models. Finally, we show that a nonlinear integrator model, qualitatively similar to that of Wong and Wang (2006), behaves much like one of the linear models.
2 Methods
2.1 Reduction to one-dimensional linear integrate-to-threshold models
Two-alternative forced-choice decision processes can, in essence, be modeled by two populations of excitatory neurons each endowed with self-excitatory connections and mutual inhibition via a shared inhibitory population. Each of the three populations can then be represented in a coarse-grained firing rate model by a single unit whose state describes the population-averaged activity rj(t) of the corresponding neuronal pool (Wilson and Cowan, 1972, 1973). Here we review the further reduction of a firing rate model, under suitable hypotheses, to a one-dimensional dynamical system. For additional details, see Brown et al. (2005); Bogacz et al. (2006).
Consider first the deterministic equations describing the firing rates of two excitatory (E) populations and a common inhibitory (I) population:
(1) |
Here τE and τI are the synaptic time constants for the E and I units, and r1, r2 and rI are respectively the activity of neural units 1, 2 and I. The excitatory populations 1 and 2 are selective to stimuli 1 and 2, respectively. FE and FI are their input-output functions, and the overall input to each unit, with i = E or I, is Ii = Irecurrent,i + Istimulus,i. The decision time is the first passage time from stimulus onset to the first of r1 or r2 reaching a prescribed decision threshold, which thereby signals choice 1 or 2. Since non-decision latencies (e.g. signal transduction and motor preparation) are usually assumed to be independent of stimulus strength Istimulus, we shall model the response time as the decision time plus a constant latency. Hence-forth we use the phrases ‘decision time’, ‘reaction time’ and ‘response time’ interchangeably.
If the decision dynamics passes near a saddle point and moves along its unstable manifold (Brown and Holmes, 2001; Bogacz et al., 2006), the functions FE, FI may be linearized such that Eq. 1 simplifies to:
(2) |
Here α, β, γ, are the recurrent synaptic coupling strengths for self-excitation, inhibitory-to-excitatory, and excitatory-to-inhibitory connections, and I0,E (I0,I) is the constant background input to all the excitatory (inhibitory) cells from outside the local circuit. Unlike Wang (2002), we exclude excitatory connections between r1 and r2 and self-inhibitory connections, retaining only essential features. If τI ≪ τE, we can further assume that the relatively fast dynamics of rI equilibrates rapidly, such that rI ≈ γ(r1+r2)+I0,I, and Eq. 2 becomes
(3) |
(4) |
Defining a new variable X ≡ r1-r2 and subtracting Eq. 4 from 3, we obtain
where k ≡ (α - 1)/τE contains the excitatory coupling strength and leak, and (I1 - I2)/τE is proportional to the difference in inputs. The background inputs I0,E and I0,I and coupling strengths β and γ cancel out.
Generalizing the inputs I1, I2 to be time-varying and including additive noise, the reduced dynamics is described by a one-dimensional SDE of the form
(5) |
where b(Xt, t) ≡ kXt + (I1(t) - I2(t))/τE and σ is the standard deviation of the noise, which is assumed to be a Wiener process with increments dWt drawn from a normal distribution with zero mean and unit variance. Eq. 5 provides a general description for noisy accumulator models, as illustrated schematically in Fig. 1A.
2.2 Linear integrate-to-threshold models
We are now in a position to introduce four linear instances of Eq. 5. When k < 0 (e.g. self-excitation α is relatively weak), we have a classical Ornstein-Uhlenbeck (OU) process (Uhlenbeck and Ornstein, 1930; Wang and Uhlenbeck, 1945). If the decision threshold does not intervene, solutions eventually decelerate and approach a stable steady-state Xsteady-state = (I1(t)-I2(t))/(|k|τE). We denote this stable OU process SOU; in contrast, if k > 0 (stronger self-excitation), sample paths accelerate away from a fixed point in an unstable OU process, denoted UOU. SOU and UOU provide reduced descriptions of the leaky competing accumulator of Usher and McClelland (2001) that respectively produce recency and primacy effects.
In the special case of α = 1 (i.e. k = 0) and constant stimuli I1, I2, Eq. 5 becomes a pure drift-diffusion (CD) equation (Ratcliff, 1978)
(6) |
in which the constant drift rate b = b0 ≡ (I1 - I2)/τE is proportional to the difference in stimuli. This CD model is similar to, but differs in detail from, the balance between leakage and inhibition in Bogacz et al. (2006).)
In some variants of the CD model, drift rates can vary with time. This could be due to dynamic stimuli, to some form of ‘urgency’ induced by the task design (Ditterich, 2006a,b; Churchland et al., 2008), to fluctuating attention (Smith et al., 2004) or to other ‘top-down’ effects (Liu et al., 2008). This time-dependent (TD) diffusion model is described by:
(7) |
We focus on the simple case b(t) = b0t, since similar results obtain for other functions of time, but in Appendix B, we treat a modified TD model with time-varying perturbation amplitude.
2.3 A nonlinear integrate-to-threshold model
The final model to be considered is qualitatively similar to that of Wang (2002); Wong and Wang (2006). It captures stochastic dynamics in the neighborhood of a subcritical pitchfork bifurcation (Guckenheimer and Holmes, 1983; Brown and Holmes, 2001; Strogatz, 2001) that occurs as the stimulus input level is varied (Roxin and Ledberg, 2008; Wong and Wang, 2006):
(8) |
Here b(t) ≡ b0 (constant) and ε represent the biased and non-biased stimulus inputs respectively (Fig. 1B). τX controls the overall temporal dynamics. Eq. 8 may be derived by normal form theory (Guckenheimer and Holmes, 1983; Roxin and Ledberg, 2008): see Fig. 1B for an illustration of the branches of stable and unstable fixed points in the noise-free limit. If |ε| and |b| are sufficiently small, this model without the term approaches that of Wang (2002); Wong and Wang (2006); Roxin and Ledberg (2008). We include to prevent unrealistic runaway activity, and, for simplicity, first assume a symmetrical system with b0 = 0, and then consider biased stimuli |b0| > 0. This model is denoted SPB.
2.4 Perturbation protocols
2.4.1 Single pulse perturbation with varying onset time
We consider additive perturbations b1(t), under which Eq. 5 becomes dXt = [b(Xt, t) + b1(t)]dt + σdWt, where b1(t) is applied from time T to T + ΔT and b1(t) ≡ 0 otherwise. We employ two piecewise-constant forms, the first being the step function used by Huk and Shadlen (2005):
(9) |
in which the amplitude p can be positive or negative, assisting or opposing the unperturbed drift rate b0 due to the ‘primary’ stimulus. We fix the duration ΔT at 10% of the unperturbed mean first passage time τ0/10, apply b1 at different onset times T and ask how the mean and standard deviation of the decision time change.
2.4.2 Zero-effect pulse-antipulse perturbation
The second protocol uses a double pulse of the form
(10) |
where λ is the relative height of the first pulse to the second. The second opposite-signed pulse attempts to reduce or cancel the expected change due to the first, and if the overall perturbation b1 leaves the mean first passage time unchanged, we call it a zero-effect perturbation or ZEP. This protocol, a variant of the paired pulse suggested in Wong et al. (2007), is only applied early in the integration process to avoid interference from the decision threshold. We shall seek critical values of λ for which ZEPs occur for each model. Figs. 1C and D illustrate both protocols.
2.5 Simulations and parameter values
Except for the SPB model, we first consider integration to a single decision threshold in the direction of the drift. This reduces the number of parameters, simplifies mathematical analyses, and helps isolate key effects. It applies to easy tasks in which drift rates are (relatively) high and errors rare. We subsequently relax this condition in our simulations. For the SPB model we first set I1 = I2, corresponding to difficult tasks and high error rates, and employ two thresholds. We then consider I1 ≠ I2 and a single threshold, representing easy tasks.
Signal-to-noise ratios were chosen such that the variance in first passage (decision) times is significant, but not so great that an unreasonably large number of trials is needed to average out the noise, and we selected perturbation amplitudes, durations and onset times such that both protocols cause small but significant effects (e.g. maximum changes of ~ 10% in mean decision times). In particular, durations were an order of magnitude smaller than mean decision times τ0, as in Huk and Shadlen (2005). For the first protocol, onset times T were varied from stimulus onset at t = 0 until there were no significant effects on mean decision time. In the second protocol we require T ≪ τ0. We set X0 = 0 at t = 0 to represent unbiased initial conditions, and the remaining parameters are chosen to ensure ‘realistic’ behavior (e.g. the stable fixed point is above threshold for SOU; the unstable fixed point is below X0 = 0 for UOU). Table 1 lists parameter values.
Table 1.
Model | b(Xt, t) | Parameters | ζ | ΔT | p |
---|---|---|---|---|---|
CD | b0 + b1 | b0 = 5, σ = 2.449 | 20 | 0.4 | 5 |
TD | b0t + b1 | b0 = 5, σ = 2.828 | 20 | 0.1 | 4 |
SOU | kXt + b0 + b1 | k = -1, b0 = 8, σ = 1.414 | 7 | 0.4 | 2 |
UOU | kXt + b0 + b1 | k = 0.2, b0 = 5, σ = 1.414 | 20 | 1 | 2 |
SPB | ε = -0.3, 0.05, b0 = 0 or 0.004, σ = 0.01, τX = 20 | ±0.75 | 5 | 0.005 or 0.0008 (b0 = 0.004) |
Drift-diffusion model with constant drift rate (CD), and with time-varying drift rate (TD); SOU (UOU) stable (unstable) Ornstein-Uhlenbeck processes; SPB: nonlinear model. See Eqs. 5-8 and protocols 9 and 10 for details. Here ζ is the distance from starting point X = 0 to decision threshold, and for SPB ε = -0.3 before stimulus appears at t = 0 and ε = 0.05 for t ≥ 0. Perturbation durations ΔT and amplitudes p are values used in the first perturbation protocol. Parameters for the linear models with smaller signal-to-noise ratios are specified in Section 3.3.
We use a forward Euler-Maruyama scheme (Higham, 2001), integrating a sample path until it hits the prescribed decision threshold and recording the corresponding first passage time, at which the trial ends and the next begins. After collecting an appropriately large ensemble, we extract the first and second moments of the first passage time from the samples. The simulation is run once for each one of the different perturbations. For the models CD, TD, SOU and UOU, the step size is 10-3 of a time unit and the sample size is N = 106, so that errors in the moments are of order . Since changes in moments due to the perturbations are of order 10-2, these choices of time step and sample size reliably capture the effects of perturbation. The mean exit time for the unperturbed SPB model is ~ 80, so that a time step of 0.01 and a sample size is 106 suffices. We ran noisy simulations of this model with ε = -0.3 for t < 0 to represent pre-stimulus activity, and then switched to ε = 0.05 for t ≥ 0 so that X = 0 undergoes a pitchfork bifurcation and becomes unstable, thus forcing a choice (see Fig. 1B. Magnitudes of ε and σ for SPB are much smaller than the corresponding b0 and σ for the linear models because the term accelerates solutions toward the thresholds).
The parameters are not independent, and their number can be reduced by one in all models by dividing Eq. 5 by b0, and rescaling σ → σ/b0, p → p/b0, ζ → ζ/b0, k → k/b0 and the dynamical variable Xt → Xt/b0. The resulting CD and TD models are described by 2 parameters while the SOU and UOU models require 3 and the SPB model 4. Two additional parameters describe the amplitude p and duration ΔT of the perturbation.
We focus on qualitative patterns of changes in response times and, to make unbiased comparisons, we adopt a dimensionless measure of the changes in means and standard deviations by normalizing the relative changes with respect to their unperturbed values.
3 Results
3.1 Single pulse perturbations of linear models
Using a single pulse perturbation with varying onset time T, we estimated changes in mean decision time for the four linear integrate-to-threshold models as described above, obtaining the results shown in Fig. 2. Imposed early during the integration process, a brief pulse in the same (opposite) direction as the drift rate significantly decreases (increases) mean decision times in all four models, as shown by the dashed (solid) curves. These effects fade as T increases due to a thresholding effect: for later perturbations, more trials have already crossed threshold, and so there are relatively fewer trials that are perturbed than unperturbed. Thus, when averaging over all trials, the influence of perturbations is progressively reduced. Note that unlike the work of Huk and Shadlen (2005); Wong et al. (2007), trials that have crossed threshold before and during perturbation are not excluded in the averaging process. This helps to reduce noise in the data, especially with late perturbation onset times.
In Appendix A, we prove that this basic pattern - positive pulses advance mean decision times and negative pulses retard them - must hold provided that the perturbation occurs sufficiently early. The proof applies to a broad class of nonlinear systems including the SPB model of Eq. 8, provided that ε > 0 and thresholds lie inside the region in which drift magnitude increases with X (i.e., well below the activation levels of the stable fixed points).
Note that the drift-diffusion models (CD, TD) and the OU models (SOU, UOU) respond distinctly to early perturbations: the former exhibiting almost constant changes in mean decision times (panels A and B), and the latter respectively showing increasing and decreasing effects (panels C and D). Moreover, due to thresholding, SOU is alone in having an onset time for which the perturbative effect is maximized.
We next investigate how standard deviations are affected by perturbation onset time: see Fig 3. Richer patterns arise than those of Fig. 2. Both CD and TD exhibit similar near-constant changes for early onsets, but the directions of changes for TD are opposite to those for CD, and the curves cross before reaching maxima. Standard deviations for SOU have a similar pattern to its mean, but UOU exhibits initial decreases followed by increases to a peak, unlike its mean. Optimal onset times exist for which the four standard deviations are maximally affected, and all four cases show distinct patterns.
In the TD model considered above we assumed that the perturbation is not affected by the time-dependent gain b(t). This may not hold if the perturbation enters via the same sensory pathway as the stimuli. In Appendix B we consider a case in which perturbation and drift are affected in the same way. We show that this modified TD model differs qualitatively only for early perturbation onset, for which it yields reduced effects.
3.2 Zero-effect perturbations of linear models
We now consider the zero-effect protocol of Eq. 10, appealing to the fact that, in the noise-free case, the first passage time is a monotone function of the ratio λ of the amplitudes of the opposing pulses. The critical λ = λ* that determines a ZEP is therefore given by the unique zero of
(11) |
where τ(λ) describes the functional dependence of passage time on λ. Mean first passage times appear to remain monotone with respect to λ when noise is included. Asymptotic methods can be used to approximate passage time moments for small perturbations (Lindner, 2004), but the lack of compact formulae for passage time distributions (PDFs) in most cases, including OU processes, and the need for double integrations (Lindner, 2004) makes this approach generally intractable.
However, ZEPs may be analytically approximated for early onset times T with T’ := T + ΔT ≪ τ0, such that the probability of threshold crossing prior to T’ is negligible. In this case a sufficient condition is that the PDFs of the perturbed and unperturbed processes are identical at T’, since for t > T’, both processes are governed by the same drift and noise. They are therefore indistinguishable in the limit T, ΔT → 0. Before deriving explicit expressions for λ* under this assumption we present the results of numerical simulations with an intuitive explanation.
Since the unperturbed SDE (5) is independent of the current state Xt in the CD and TD cases, we expect that antisymmetric pulses with λ* = 1, in which the pulses precisely cancel, will produce ZEPs. In contrast, responses to inputs decay with time for SOU, implying that the first pulse must be larger than the second for their net effect to cancel at perturbation o set t = T’ (λ* > 1). For UOU the reverse should hold (λ* < 1). Fig. 4 confirms that this is the case. Table 2 lists the parameters used in the simulations; note that the early onset time conditions are only weakly satisfied (T’ ranges from 25 - 40% of τ0).
Table 2.
Model | τ0 | std(τ0) | T | ΔT | p | λ* (thy.) | λ* (sim.) |
---|---|---|---|---|---|---|---|
CD | 4.005 | 0.960 | 0.5 | 0.5 | 5 | 1 | 1.0013 |
TD | 3.145 | 0.420 | 0.5 | 0.5 | 5 | 1 | 0.9978 |
SOU | 1.831 | 0.374 | 0.1 | 0.4 | 2 | 1.2214 | 1.2251 |
UOU | 2.954 | 0.142 | 0.2 | 1.0 | 2 | 0.9048 | 0.9050 |
τ0 and std(τ0) are the rstand second moments of the passage time without perturbation, T and ΔT are the onset time and total duration of the perturbation; p is the amplitude of the second pulse and λ* (thy.) and λ* (sim.) are the ZEP pulse ratios predicted by the theory and obtained by averaging p < 0 and p > 0 results of Fig. 4.
To derive explicit approximations for λ*, we use a comparison method similar to that of Appendix A. Let Xt denote the unperturbed process and Zt the perturbed process. If X0 = Z0 at t = 0, then Xt = ZT at the onset time T. When the perturbation ends at t = T’ the solution of the unperturbed SDE of Eq. 5 with constant drift rate b0 is
(12) |
while the perturbed system satisfies
(13) |
These expressions well approximate the true activity levels for models CD, SOU and UOU with absorbing thresholds only when both processes have low probability of hitting the threshold during the interval [0, T’], but this condition holds for T’/τ0 ≪ 1, since the integrated drift term is almost zero at early times. Eqs. 12 and 13 differ in the term , which enters ZT’ due to the action of the perturbation b1 during the interval [T, T’]. The ZEP condition for arbitrary perturbations is therefore
(14) |
which for the piecewise constant pulses of Eq. 10 implies that:
(15) |
Hence for k > 0 (UOU), λ* < 1, while for k < 0 (SOU), λ* > 1. In the special case of CD, k = 0 and λ* = 1.
For TD the drift term is time-dependent but the linear term kXt is absent and so the corresponding solutions of the SDEs are
and
In this case, a ZEP must satisfy
(16) |
which yields λ* = 1 for the pulses of Eq. 10, as for CD. This result holds for all time-dependent drift rates, including those used in Smith et al. (2004); Ditterich (2006b); Churchland et al. (2008).
Summarizing, ZEPs occur for the CD and TD models when the opposing pulse amplitudes are equal (λ* = 1), but for SOU and UOU λ* = e-kΔT/2 is larger and smaller than one, respectively. The ratios λ* predicted by Eq. 15 for the parameters of Table 2 are given in the final column of the Table. In all four cases they agree well with the zero crossings of the mean first passage times in Fig. 4, in spite of the fact that T’/τ0 ≈ 0.25 - 0.4 is not very small.
3.3 Signal-to-noise ratio can influence perturbation effects
To what extent do the results of Section 3.1 hold when signal-to-noise ratios are reduced and a second threshold is added to track errors? To investigate this question we select new parameter values according to the criteria of Section 2.5.
For the CD model, we increase σ from 2.449 to 2.828 and set thresholds at ζ = ±5 instead of ±20, yielding an error rate of 5%. This introduces more complex behavior in which correct and error trials respond to the perturbations in different manners, as shown in Fig. 5A. The combined averages (as computed in Huk and Shadlen (2005) and Wong et al. (2007), see black solid and dashed curves) preserve some features of previous results: for early perturbations the direction of changes in mean RT agrees with Fig. 2A, but the approximately-constant change in mean decision time early in the trial is replaced by a monotonic decline. Increased noise yields earlier threshold crossings, advancing the thresholding effect and masking the signatures of Fig. 2A. Moreover, the change in mean error RTs can reverse sign for late perturbation onsets. We checked this seemingly counterintuitive phenomenon numerically by fixing the random generating seed for noise in a sample trial, finding that perturbation of long RT trials can change an impending error to a correct choice (data not shown).
Increasing the noise level to σ = 7.071 with thresholds ζ = ±20 produces an error rate of 10% in the TD model and changes in mean RTs for correct and error choices similar to those of CD. However, averaged over correct and error choices, overall changes in mean RT exhibit a pattern similar to that of Fig. 2B (see black curves in Fig. 5B). In this respect TD maintains its signature more robustly with increasing error rate than CD, although the effects are progressively masked.
For the SOU model, we increase σ to 6.325, yielding an error rate of 17%. The optimal perturbation onset time for changes in mean RT averaged over correct and error trials is masked (compare Fig. 2C with black curves in Fig. 5C). This error rate is relatively high, but we have checked that for error rates as small as 3% SOU model features can also be masked, indicating that, like the CD model, SOU is sensitive to noise.
For the UOU model, more parameters must be adjusted to obtain reasonable RT changes. Here we set σ = 2, ζ = ±10, k = 0.02 and b0 = 0.5, yielding a high error rate of 23.5%, but similar trends occur for error rates from 0.1% to 35%. Fig. 5D shows that these are comparable to the previous UOU results with high signal-to-noise ratio. Overall, we are unable to distinguish between noise-masking effects and intrinsic features of the UOU integrator.
Using the same parameters, we find that changes in standard deviations for all the linear models show optimal perturbation onset times (Fig. 6). The CD, SOU and UOU models are mutually indistinguishable (cf. Figs. 6A, C and D), but TD exhibits a crossing effect (Fig. 6B) similar to Fig. 3B.
3.4 Perturbations of the nonlinear model
We next apply both perturbation protocols to the nonlinear SPB model. The mean and standard deviation of decision times for the unperturbed system are 80 and 22 time units, respectively. The parameters used for the first protocol are given in Table 1, while for the ZEP we set p = 0.005, T = 30 and T = 5, consistent with application early in the accumulation process. Figs. 7A and B show that the means and standard deviations of decision times behave like those for UOU (cf. panels D of Figs. 2 and 3), decreasing monotonically and approaching zero as onset times increase. ZEPs occur in the pulse ratio range 0.8 < λ* < 1 for both mean and standard deviation (Figs. 7C and D). Assuming a linear change in mean in Fig. 7C, we find that λ* ≈ 0.87.
To compare with the behavior of the linear systems in the single threshold case, we now modify the SPB model so that it makes as few errors as they do. This is done by changing b0 from 0 to 0.004, which breaks the symmetry of the subcritical pitchfork bifurcation (Fig. 1B) so that decisions favor one choice, representing easy tasks (cf. Roxin and Ledberg (2008)); all other parameters remain the same. Fig. 8 shows that the basic behavior of Figs. 7C and D is preserved: effects on both mean and standard deviation of RTs decrease monotonically as perturbation onset time increases. While the effects on mean passage times of SPB are much like those for UOU, the fact that standard deviations of all four linear models exhibit maxima (Fig. 3) potentially allows us to distinguish SPB from them. The effects of ZEPs in this biased regime is similar to that shown in Figs. 7C and D (data not shown).
4 Discussion
In this work, we have investigated how several simple stochastic integrate-to-threshold decision-making models respond to short perturbations. We focus on two-alternative forced-choice reaction time tasks for which general firing rate models with three coupled neural populations can be reduced to one-dimensional integrate-to-threshold systems. These reduced systems, which may be pure drift-diffusion or Ornstein-Uhlenbeck processes, can be compared analytically and in simulations that require few parameters.
We examined two perturbation protocols, the first being a brief pulse with variable onset time, and the second a pulse-antipulse pair whose amplitude ratio can be adjusted to produce minimal effects (the zero-effect perturbation, ZEP). The simulations of §3 (Figs. 2 and 3) show that the changes in means and standard deviations of decision times allow both perturbation protocols to distinguish between drift-diffusion models and stable and unstable Ornstein-Uhlenbeck processes, provided that perturbations are delivered sufficiently early in the integration process and the signal-to-noise ratio is not too small. Changes in standard deviations can also assist in distinguishing drift-diffusion models with constant drift from those with time-dependent drift rates (cf. Figs. 3A and B), although differences are small and may not be detectable experimentally. (As Ditterich (2006a) notes, entire reaction time distributions can further assist in this.) For early perturbations the ZEP conditions can be approximated analytically, making predictions that are confirmed by computer simulations (Fig. 4). While general analytical expressions for systems with finite noise appear elusive, in Appendix A we prove inequalities for more general nonlinear systems that partially explain our simulation results.
Although we focus on linear models, our methods extend to other systems, and we include results for a nonlinear model that capture the dynamics near a subcritical pitchfork bifurcation which is typical of reduced population models (Wang, 2002; Wong and Wang, 2006; Roxin and Ledberg, 2008). As the results of Appendix A suggest, it behaves in a manner similar to the unstable OU model under both perturbation protocols, although perturbative effects on standard deviations may allow distinctions to be made (Figs. 7-8). We have also confirmed that the TD model’s behavior remains similar with other increasing drift rates (e.g. b(t) ~ b0t2; simulation results not shown).
These results reinforce the simulations and claims in Wong et al. (2007) that time-varying perturbations can in principle reveal integration mechanisms. However, some parameter ranges frustrate clear-cut comparisons among models. The first occurs for low signal-to-noise ratios and high error rates (cf. Figs 5-6), and so can be mitigated by using perturbations with more easily discriminated stimuli. Secondly, if there is a ‘dead time’ between stimulus onset and the start of evidence integration, the CD and TD models can display similar signatures to the SOU model, even with high signal-to-noise ratios.
Since perturbations can be delivered through the senses as well as by direct electrical stimulation, the method can be noninvasive and is therefore appropriate for human subjects. Furthermore, unlike fitting neuronal firing rates, both perturbation protocols can distinguish linear and nonlinear UOU-type models (Wang, 2002; Wong and Wang, 2006; Roxin and Ledberg, 2008; Wong and Huk, 2008) from drift-diffusion models with an ‘urgent’ time-dependent increase in drift rate (Smith et al., 2004; Ditterich, 2006a; Churchland et al., 2008). However, care is required in designing such experiments. Although a high signal-to-noise ratio can in principle help identify integrators, the reaction times can become too short, masking the signature behaviors of the integrators with the ‘thresholding effect’. A possible solution could be to prolong the response deadline in relatively simple reaction time tasks and analyze only data from long RT trials.
Perturbation inputs to decision-making circuits may originate in a single sensory pathway or in other brain areas, as in multisensory and ‘top-down’ inputs (e.g. due to attention (Smith et al., 2004; Liu et al., 2008)) which could modulate a decision throughout a sensorimotor pathway. The general notion of perturbation developed in the reduced model context extends to such modulatory inputs. The methods may also generalize to other tasks that recruit neural integrators (Goldman et al., 2008), such as interval timing (Shea-Brown et al., 2006).
Acknowledgments
We thank C. D. Brody for suggesting a variant of the TD model, M. Usher and an anonymous reviewer for helpful suggestions that improved the paper, and Nengli Lim for pointing out the extension to noisy systems in Proposition 3 of Appendix A. XZ was supported in part by ONR grant N00014-01-1-0674, and KFWL and PH by PHS grants MH58480 and MH62196 and AFOSR grant FA9550-07-1-0537. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the Air Force Research Laboratory or the U.S. Government.
Appendix A: Rigorous estimates on first passage times
In this appendix we analyze short single pulse perturbations, First in the noiseless limit, and then with additive noise. The inequalities proved here suggest explanations for the monotonicity of mean first passage times of SOU and UOU processes with respect to early onset times (cf. Figs. 2C and D); moreover, they apply to more general, nonlinear one-dimensional systems such as SPB with thresholds |ζ| sufficiently close to Xt = 0.
The noiseless limit
As noise amplitude tends to zero, the SDE Eq. 5 becomes an ordinary differential equation (ODE):
(17) |
Consider two compact perturbations p and with the same amplitude profile g(t) and duration ΔT, applied at different onset times T1 and :
(18) |
(here 1[T1,T2] denotes the indicator function that takes the value 1 for T1 < t < T2 and zero otherwise). We choose and such that p and do not overlap and that neither perturbed solution reaches the threshold X = ζ before . We shall compare solutions Yt and of the corresponding perturbed ODEs,
(19) |
started at the same initial condition , to show how the signs of f’ and g determine their threshold passage times τ and . We allow smooth nonlinear functions f, but assume that f(X) > 0 for all 0 ≤ X < ζ and that g does not change sign. The fact that solutions of scalar ODEs cannot cross each other is a key tool.
Lemma 1
Let Xt(x) denote the solution of Eq. 17 with initial value x. Then Xt(x1) < Xt(x2) for any t > 0 if and only if x1 < x2.
Proof
This follows from the uniqueness of solutions of ODEs and the ordering of points in a one-dimensional phase space.
Since neither perturbation acts after , and neither solution has reached threshold at , Lemma 1 implies that if and only if . We now prove the main result summarized in Table 3, which is a corollary of Proposition 2.
Table 3.
f’ ≥ L > 0 | f’ ≤ -L < 0 | |
---|---|---|
g > 0 | ||
g < 0 |
Proposition 2
Assume that the function g(t) in Eqs. 18 is strictly positive for t ∈ (0, ΔT). Then for some constant L (a) If f’ ≥ L > 0, then for all ; (b) If f’ ≤ -L < 0, then for all .
Proof of part (a)
We need only establish the inequalities at , since no perturbations occur after this time. We compare solutions of Eqs. 19 using the fact that f(X) > 0 and f’(X) ≥ L > 0 for X ≥ 0 implies that . Letting we have
(20) |
Since the solutions coincide until the first perturbation occurs at t = T1, u(T1) = 0, and integrating inequality 20 over the interval [T1, T2], on which p(t) = g(t - T1) and , yields:
(21) |
Both p(t) and on , and integration of 20 and use of 21 gives
(22) |
Finally, using the fact that p(t) ≡ 0 on , we integrate 20 again to obtain
(23) |
where we also use inequality 22 and the fact that the additive perturbations of Eq. 18 are identical over the time intervals [T1, T2] and to rewrite the integral term. Hence and .
Proof of part (b)
We again compare solutions of Eqs. 19 but the direction of the inequalities is now reversed, so that we are in essence using Gronwall’s inequality (Guckenheimer and Holmes, 1983). In this case Eq. 20 becomes
(24) |
inequality 22 becomes
(25) |
and the final inequality reads
(26) |
This implies that , as claimed. Proofs are similar for g < 0.
Extension to noisy systems
The conclusions of Proposition 2 and Table 3 may be extended to apply to expected passage times for SDEs with additive noise. Specifically, replacing Eqns. 19 by
(27a) |
(27b) |
we have the following result:
Proposition 3
Assume that the function g(t) in Eqs. 18 is strictly positive for t ∈ (0, ΔT). Then for some constant L (a) If f’ ≥ L > 0, then ; (b) If f’ ≤ -L < 0, then , where τ and denote the first passage times for Eqns. 27a and 27b respectively.
Proof
We first compare solutions Yt and obtained from Eqns. 27a and 27b under the same sample noise path with increments . Since the stochastic integral is almost surely continuous (Feller, 1957), and the integrated noise terms cancel, the comparison of solutions using proceeds as in the deterministic case above, and we conclude that for g > 0 and f’ ≥ L > 0, and for g > 0 and f’ ≤ -L < 0. Finally, averaging each process over an ensemble of sample paths, we may conclude that the mean first passage times of the two processes satisfy similar inequalities to those in the top row of Table 3: and respectively. A similar argument applies to the case g < 0.
Appendix B: A TD model with time-varying perturbation amplitude
In the body of the paper, perturbations of the TD model were assumed unaffected by the time-dependent gain. Here we extend the simulations to a simple case in which perturbation and stimulus (drift) share the same overall linear increase in amplitude. Specifically, we assume the dynamics to be governed by
(28) |
Otherwise, we change only the perturbation amplitude, taking p = ±2 instead of 4. Fig. 9A shows that the changes in mean and standard deviations of exit time for this model remain similar to the previous TD model, but with smaller perturbative effects near the beginning of the trial (quite similar to a SOU model) due to the monotonic time-dependence of perturbation amplitude. However, although the previous pattern of mean exit times no longer applies, the behavior remains qualitatively dissimilar to the CD, UOU and SPB models.
References
- Bogacz R, Brown E, Moehlis J, Holmes P, Cohen JD. The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks. Psychol. Rev. 2006;113:700–765. doi: 10.1037/0033-295X.113.4.700. [DOI] [PubMed] [Google Scholar]
- Brown E, Gao J, Holmes P, Bogacz R, Cohen JD. Simple neural networks that optimize decisions. Int. J. Bifurcation and Chaos. 2005;15:803–826. M. G. [Google Scholar]
- Brown E, Holmes P. Modeling a simple choice task: stochastic dynamics of mutually inhibitory neural groups. Stochastics and Dynamics. 2001;1:159–191. [Google Scholar]
- Churchland AK, Kiani R, Shadlen MN. Decision-making with multiple alternatives. Nat. Neurosci. 2008;11:693–702. doi: 10.1038/nn.2123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohen MR, Newsome WT. What electrical microstimulation has revealed about the neural basis of cognition. Curr. Opin. Neurobiol. 2004;14:169–177. doi: 10.1016/j.conb.2004.03.016. [DOI] [PubMed] [Google Scholar]
- Ditterch J, Mazurek ME, Shadlen MN. Microstimulation of visual cortex affects the speed of perceptual decisions. Nat. Neurosci. 2003;6:891–898. doi: 10.1038/nn1094. [DOI] [PubMed] [Google Scholar]
- Ditterich J. Evidence for time-variant decision making. Eur. J. Neurosci. 2006a;24:3628–3641. doi: 10.1111/j.1460-9568.2006.05221.x. [DOI] [PubMed] [Google Scholar]
- Ditterich J. Stochastic models of decisions about motion direction: behavior and physiology. Neural Netw. 2006b;19:981–1012. doi: 10.1016/j.neunet.2006.05.042. [DOI] [PubMed] [Google Scholar]
- Feller W. An Introduction to Probability Theory and its Applications. Second Edition Vol. 2. Wiley; New York: 1957. [Google Scholar]
- Gold JI, Shadlen MN. The neural basis of decision making. Annu. Rev. Neurosci. 2007;30:535–574. doi: 10.1146/annurev.neuro.29.051605.113038. [DOI] [PubMed] [Google Scholar]
- Goldman M, Compte A, Wang X-J. Theoretical and computational neuroscience: Neural integrators: recurrent mechanisms and models. In: Squire L, Albright T, Bloom F, Gage F, Spitzer N, editors. New Encyclopedia of Neuroscience. MacMillan Reference Ltd.; 2008. In press. [Google Scholar]
- Guckenheimer J, Holmes P. Nonlinear Oscillations, Dynamical Systems and Bifurcations of Vector Fields. Springer-Verlag; New York: 1983. [Google Scholar]
- Hanes DP, Schall JD. Neural control of voluntary movement initiation. Science. 1996;274:427–430. doi: 10.1126/science.274.5286.427. [DOI] [PubMed] [Google Scholar]
- Hanks TD, Ditterich J, Shadlen MN. Microstimulation of macaque area LIP affects decision-making in a motion discrimination task. Nat. Neurosci. 2006;9:682–689. doi: 10.1038/nn1683. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Higham DJ. An algorithmic introduction to numerical simulation of stochastic differential equations. SIAM Rev. 2001;43:525–546. [Google Scholar]
- Huk AC, Shadlen MN. A neural integrator underlying perceptual decision-making in macaque parietal cortex. J. Neurosci. 2005;25:10420–10436. doi: 10.1523/JNEUROSCI.4684-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindner B. Moments of the first passage time under external driving. J. Stat. Phys. 2004;117:703–737. [Google Scholar]
- Liu YS, Holmes P, Cohen JD. A neural network model of the eriksen task: Reduction, analysis, and data fitting. Neural Comput. 2008;20:345–373. doi: 10.1162/neco.2007.08-06-313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lo C-C, Wang X-J. Cortico-basal ganglia circuit mechanism for a decision threshold in reaction time tasks. Nat. Neurosci. 2006;9:956–963. doi: 10.1038/nn1722. [DOI] [PubMed] [Google Scholar]
- Luce RD. Response times: Their Role in Inferring Elementary Mental Organization. Oxford Univ. Press; New York: 1986. Oxford Psychology Series, 8. [Google Scholar]
- Mazurek ME, Roitman JD, Ditterich J, Shadlen MN. A role for neural integrators in perceptual decision making. Cereb Cortex. 2003;13:1257–1269. doi: 10.1093/cercor/bhg097. [DOI] [PubMed] [Google Scholar]
- Posner MI. Chronometric Explorations of Mind. Erlbaum; Hillsdale, N.J.: 1978. [Google Scholar]
- Ratcliff R. A theory of memory retrieval. Psychol. Rev. 1978;85:59–108. [Google Scholar]
- Ratcliff R. Modeling response signal and response time data. Cognit. Psychol. 2006;53:195–237. doi: 10.1016/j.cogpsych.2005.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ratcliff R, McKoon G. The diffusion decision model: Theory and data for two-choice decision tasks. Neural Comput. 2008;20:873–922. doi: 10.1162/neco.2008.12-06-420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ratcliff R, Zandt TV, McKoon G. Connectionist and diffusion models of reaction time. Psychol. Rev. 1999;106:261–300. doi: 10.1037/0033-295x.106.2.261. [DOI] [PubMed] [Google Scholar]
- Roitman JD, Shadlen MN. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. J. Neurosci. 2002;22:9475–9489. doi: 10.1523/JNEUROSCI.22-21-09475.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roxin A, Ledberg A. Neurobiological models of two-choice decision making can be reduced to a one-dimensional nonlinear diffusion equation. PLoS Comput. Biol. 2008;4:e1000046. doi: 10.1371/journal.pcbi.1000046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schall JD. Neural correlates of decision processes: neural and mental chronometry. Curr. Opin. Neurobiol. 2003;13:182–186. doi: 10.1016/s0959-4388(03)00039-4. [DOI] [PubMed] [Google Scholar]
- Shea-Brown E, Rinzel J, Rakitin BC, Malapani C. A firing rate model of parkinsonian deficits in interval timing. Brain Res. 2006;1070:189–201. doi: 10.1016/j.brainres.2005.10.070. [DOI] [PubMed] [Google Scholar]
- Simen P, Cohen JD, Holmes P. Rapid decision threshold modulation by reward rate in a neural network. Neural Netw. 2006;19:1013–1026. doi: 10.1016/j.neunet.2006.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith PL, Ratcliff R. Psychology and neurobiology of simple decisions. Trends in Neurosci. 2004;27:161–168. doi: 10.1016/j.tins.2004.01.006. [DOI] [PubMed] [Google Scholar]
- Smith PL, Ratcliff R, Wolfgang BJ. Attention orienting and the time course of perceptual decisions: response time distributions with masked and unmasked displays. Vision Res. 2004;44:1297–1320. doi: 10.1016/j.visres.2004.01.002. [DOI] [PubMed] [Google Scholar]
- Strogatz SH. Nonlinear dynamics and chaos: with applications to physics, biology, chemistry and engineering. Perseus Books Group; Cambridge MA: 2001. [Google Scholar]
- Uhlenbeck GE, Ornstein LS. On the theory of Brownian motion. Phys. Rev. 1930;36:823–841. [Google Scholar]
- Usher M, McClelland JL. On the time course of perceptual choice: The leaky competing accumulator model. Psychol Rev. 2001;108:550–592. doi: 10.1037/0033-295x.108.3.550. [DOI] [PubMed] [Google Scholar]
- Wang MC, Uhlenbeck GE. On the theory of Brownian motion II. Rev. Mod. Phys. 1945;17:323–342. [Google Scholar]
- Wang X-J. Probabilitic decision making by slow reverberation in cortical circuits. Neuron. 2002;36:955–968. doi: 10.1016/s0896-6273(02)01092-9. [DOI] [PubMed] [Google Scholar]
- Wilson H, Cowan J. Excitatory and inhibitory interactions in localized populations of model neurons. Biophys. J. 1972;12:1–24. doi: 10.1016/S0006-3495(72)86068-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilson H, Cowan J. A mathematical theory of the functional dynamics of cortical and thalamic nervous tissue. Kybernetik. 1973;13:55–80. doi: 10.1007/BF00288786. [DOI] [PubMed] [Google Scholar]
- Wong K-F, Huk AC. Temporal dynamics underlying perceptual decision making: Insights from the interplay between an attractor model and parietal neurophysiology. Front. Neurosci. 2008;2:245–254. doi: 10.3389/neuro.01.028.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wong K-F, Huk AC, Shadlen MN, Wang X-J. Neural circuit dynamics underlying accumulation of time-varying evidence during perceptual decision-making. Front. Comput. Neurosci. 2007;1:6. doi: 10.3389/neuro.10.006.2007. doi: 10.3389/neuro.10/006.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wong K-F, Wang X-J. A recurrent network mechanism of time integration in perceptual decisions. J. Neurosci. 2006;26:1314–1328. doi: 10.1523/JNEUROSCI.3733-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]