Abstract
Decisions about noisy stimuli require evidence integration over time. Traditionally, evidence integration and decision making are described as a one-stage process: a decision is made when evidence for the presence of a stimulus crosses a threshold. Here, we show that one-stage models cannot explain psychophysical experiments on feature fusion, where two visual stimuli are presented in rapid succession. Paradoxically, the second stimulus biases decisions more strongly than the first one, contrary to predictions of one-stage models and intuition. We present a two-stage model where sensory information is integrated and buffered before it is fed into a drift diffusion process. The model is tested in a series of psychophysical experiments and explains both accuracy and reaction time distributions.
Author Summary
In models of decision making, evidence is accumulated until it crosses a threshold. The amount of evidence is directly related to the strength of the sensory input for the decision alternatives. Such one-stage models predict that if two stimulus alternatives are presented in succession, the stimulus alternative presented first dominates the decision, as the accumulated evidence will reach the threshold for this alternative first. Here, we show that for short stimulus durations decision making is not dominated by the first, but by the second stimulus. This result cannot be explained by classical one-stage decision models. We present a two-stage model where sensory input is first integrated before its outcome is fed into a classical decision process.
Introduction
Decision making is of crucial interest in many disciplines such as psychology [1], [2], neuroscience [3]–[5], economics [6], [7], and machine learning [8]. Binary decision theories relate to situations where an observer (or machine) is confronted with one of two possible noisy stimuli ‘A’ and ‘B’. A decision has to be made whether ‘A’ or ‘B’ is present. For example, human readers have to decide whether a handwritten character is an or a ; a trader has to decide whether to sell or to keep; a monkey has to decide whether dots on a screen are moving to the left or to the right [9]. While engineering and economical decision theories focus on how to compute optimal decisions [6], [7], [10], psychology and neuroscience investigate the actual decision making process in humans and animals [9], [11]–[14].
Decision making is usually assumed to be a one-stage process where evidence integration and decision making are identical (but see [15], [16]). In a standard accumulator model each bit of evidence is integrated and a decision is reached once the accumulated evidence for one of the two response alternatives crosses a threshold [13], [14], [17]–[33]. If the evidence itself is noisy, then the accumulation of evidence for each of the two stimulus alternatives leads to a diffusion-like process. For example, in the well-known random motion paradigm [9], moving dots appear at random moments in time, so that evidence for leftward or rightward moments arrives probabilistically and the accumulator is expected to evolve along a stochastic path that can be approximated by a drift-diffusion process. This is in good accordance with experimental studies where neurons in the macaque lateral intraparietal cortex (LIP) increase firing rates along a noisy trajectory up to the moment of decision [9], [32]–[34]. Since evidence is very noisy in this case, and arrives slowly over time, the decision process is rather slow [9]. Most experimental [9], [11], [12] and theoretical work on decision making [5]–[8] focuses on paradigms where noisy stimuli are presented for long durations, e.g. until a response is elicited (for exceptions see [31], [32]).
In other paradigms, where stimuli are less noisy, decisions can be extremely fast. For example, humans only need a fraction of a second to recognize objects such as animals in a picture [35]. This astonishing speed is also evident in sports such as table tennis or soccer requiring rapid reactions to moving balls. In these examples, the brain has to decide rapidly upon visual information available for only a hundred milliseconds or less. Note that even in these scenarios where stimuli are of high contrast (“low noise”), the responses of the observers can still be “noisy”.
Here, we first show psychophysically that one-stage models of the noisy accumulator or drift-diffusion type cannot explain the results of feature fusion experiments where two stimulus alternatives are presented in rapid succession for durations in the range of 20–160 ms. Second, we propose, instead, a two-stage model, where evidence integration is separated from a noisy drift-diffusion decision making process. Our results reveal additional aspects of the dynamics of decision making that are hidden in standard experimental paradigms where only one stimulus alternative is presented per trial.
Results
In our psychophysical experiments we worked with visual stimuli comprising two vertical bars with a small horizontal offset either to the left or to the right (Vernier stimulus, Figure 1). The contrast of the bars and the horizontal offset was chosen such that, after flashing the stimulus for 10 ms, human observers can reliably identify (accuracy above 90 percent correct) whether the lower vertical bar is offset to the left or right with respect to the upper vertical bar.
Next, we presented a sequence of two Vernier stimuli. As a reference we performed an experiment where the presentation of a first Vernier stimulus ‘A’ for 10 ms was separated from the presentation of a second Vernier stimulus ‘B’ by an interstimulus interval (ISI; blank screen) of variable duration. If the ISI was 50 ms, observers easily distinguished the two stimuli and could report, with an accuracy of above 90 percent correct, whether the first or the second Vernier stimulus was offset to the right. If the interstimulus interval was shorter, the accuracy dropped (Figure 1). The high precision of the subjects in spite of the short stimulus duration suggests that – in contrast to e.g. traditional random dot stimuli – the stimulus is highly informative with relatively little stimulus noise.
If the ISI is 0 ms, i.e. the two verniers are presented in immediate succession, feature fusion occurs [36]. Observers perceive only one single vernier with a smaller offset because the vernier offsets integrate and partially cancel each other out (Figure 2A; [37], [38]). Our feature fusion experiments with Vernier stimuli are analogous to classic feature fusion experiments with color. For example, observers perceive one single yellow disk when a red disk is rapidly followed by a green disk [39].
Dominance of the second stimulus
In experiment one, vernier stimulus ‘A’, offset either to the left or right, was immediately followed by a second vernier stimulus ‘B’ with opposite offset direction (right or left, respectively). The durations and of both verniers were equal, i.e. , but varied from 10 to 80 ms, each. Vernier stimulus ‘B’ dominates the percept the stronger the longer both vernier stimuli ‘A’ and ‘B’ are presented (Figure 2D). For example, when the two vernier stimuli are presented for 20 ms each, observers report a percept corresponding to stimulus ‘B’ in 60% of the trials, while ‘A’ is reported in only 40% of the trials. When the two stimuli are presented for 40 ms each, observers report a percept corresponding to stimulus ‘B’ in 67% of the trials, while ‘A’ reported in only 33% of the trials.
We wondered whether the dominance of the second stimulus could be explained by classical noisy accumulator models, also called Drift-Diffusion models. In the standard, one-stage Drift-Diffusion Model [20], [22], [23], [27], evidence for ‘A’ or ‘B’ translates directly into the drift rate (upward for ‘A’, downward for ‘B’) of a decision variable (Figures 2B, C). As usually, we added noise to the drift process leading to a random walk of the trajectory. The noise accounts for both noisiness of the evidence itself (an important aspect in the moving-dot paradigm [9], [33], [34], [40]) and internal noise in the brain. After presentation of both stimuli, the drift goes back to zero. A decision is made when hits the upper (for ‘A’) or lower bound (for ‘B’).
In this one-stage model, dominance of stimulus ‘A’ is the stronger the longer the presentation times of ‘A’ and ‘B’, and respectively. This is in striking contrast to the experimental results. We found that the qualitative nature of the results is independent of the specific choice of parameters of the one-stage drift diffusion model: for all tested parameters, the dominance of the second stimulus decreased with increasing duration (whereas the dominance of the second stimulus increased in the experiments). Whereas, for certain, fixed stimulus durations, we could achieve dominance of the second stimulus with specifically optimized parameters, we could never achieve dominance of the second stimulus for the entire range of stimulus durations with one set of parameters.
We explored whether minor modifications of the one-stage drift-diffusion model can explain the dominance of the second vernier. For example, we replaced the noisy accumulator by a noisy leaky accumulator. However, this did not change the results qualitatively. We then tested a very basic two stage model. During stimulus presentation, the stimulus served as the drift in a noisy leaky integrator model. After stimulus termination, the leak was artificially set to zero and the integration continued as a free, unbiased noisy diffusion process. In other words, the result of the leaky evidence integration served as initial condition for the leak-free diffusion process. While qualitatively such a drift-diffusion model explains the dominance results well (Supporting Figure S3B), we suggest an alternative model, which accounts very well for both the dominance and the reaction time distributions.
In this two-stage model, the evidence integration enters the second stage as a drift rate rather than as a bias in the initial condition. (a) During stage one, evidence integration is leaky and dominated by the intrinsic noise of the stimulus. The variable of noisy evidence integration is . (b) Stage two starts after a fixed time after stimulus onset and ends when a second variable hits the upper or lower decision threshold. (c) The variable of the leaky integrator of stage one sets the drift in the (leak-free) drift-diffusion model of stage two.
The combination of (b) and (c) implies that, for long stimuli, stage two is a drift-diffusion model with time-dependent drift set by the momentary value of the integration variable of stage one. In case that the total duration of the stimulus is shorter than the time needed to reach the decision threshold in stage two, the value of the leaky integrator of stage one at the end of the stimulus is written into a buffer and this buffered value serves, during the remaining time, as the (constant) drift for the diffusion process in stage two until a decision is reached. In the limit that stimuli are shorter than , stage two has therefore a constant drift. In the limit that stimuli are presented for times much longer than (so that is negligibly short compared to the stimulation time), our two-stage model becomes equivalent to a standard one-stage drift-diffusion model with a time-dependent drift that is given by the low-pass filtered version of the input signal. However, for very short stimuli, the prediction of our two-stage model is remarkably different from that of a standard one-stage model – and these ultra-short stimuli are at the center of our study.
The results on stimulus dominance during the feature fusion paradigm with two short Verniers can indeed be explained by the two-stage model (Figures 2E, F). Since our stimuli are comparatively strong (over 90 percent accuracy for stimli presented separately), we consider the limit where the evidence integration in stage one is noise-free. Hence, in the first integration stage, evidence for stimulus ‘A’ and ‘B’ is simply accumulated in a noiseless forgetful (leaky) integrator (see also [30]). The time scale of forgetting is related to the time over which an ideal observer expects stimuli to remain constant (see Materials and Methods). The second phase, the decision stage starts at a fixed time and consists of a standard drift-diffusion model without leak (Figure 2F, bottom panel). For a sequence of two short stimuli, the stimulation ends before so that at the termination of the second stimulus (), the output of the evidence integration is written into a buffer and fed later from the buffer as a constant drift rate into stage two. The two-stage model captures the dominance of the second vernier very well (Figure 2G).
Reaction times
The critical test for models of decision making is to account for reaction time distributions rather than accuracy [20]. We therefore wondered whether the two-stage model captures the reaction time distributions in the fusion experiments. In experiment two, stimulus ‘A’ (the first vernier stimulus) was presented for a duration , immediately followed by stimulus ‘B’ (a vernier with opposite offset) of duration with (Figure 3A). Parameters of the two-stage model were adapted individually for each observer and kept fixed across all stimulus conditions. The dominance of the first vernier stimulus increased when increased (Figure 3B). Reaction times for strongly biased situations (e.g. where the first vernier stimulus is much longer than the second one or vice versa) are faster (75% of decisions made before 560 ms) than those in conditions with dominance around 50% (75% of decisions made before 610 ms) leading to an inverted-U-shaped curve of the reaction time quantiles (Figure 3C). The same pattern is observed when responses for the first and second vernier stimulus are analyzed separately (Figure 3D).
Median response times varied strongly across the 13 observers (Figure 4A). We separated the observers into a group of fast responders (median reaction time <500 ms) and one of slow responders (median reaction time >500 ms). While the reaction times of both groups show an inverted U-shape function, the qualitative picture is different between slow and fast responders. If the first vernier stimulus is presented for a short time only, fast responders are particularly fast whereas slow responders are particularly slow. The two-stage model qualitatively reproduces this behavior (Figures 4B,C, Supporting Figure S1).
Evidence integration in stage one as drift in stage two
For each stimulus condition, the outcome of the leaky integration in the first stage serves as a drift of the leakfree drift-diffusion model during the second stage of the two-stage model. For short stimuli, like the ones considered so far, where the stimulus ends before the integration, the result of stage one is written into a buffer and used as a constant value of the drift in the decision stage. In other words, the evidence at stimulus termination serves as drift value, rather than as an initial condition of stage two.
As an alternative, we have also analyzed a drift-diffusion model where the drift was taken as a free parameter, optimized for each stimulus condition independently so as to optimally predict the distribution of reaction times. The drift predicted from this model (which has more degrees of freedom) is statistically not different (as determined by a two way repeated measures analysis of variance) from our two-stage model where the drift is not a free parameter but the result of stage one. This finding suggests that the simple preprocessing by leaky integration correctly determines the drift rate (Figure 4D).
Further above we had reported that a qualitative fit of the dominance was possible by a noisy leaky integrator, if the leak was set to zero at the end of the stimulus. In such a model, the result of the leaky evidence integration serves as the initial value for a free diffusion process. The results of Figure 4D, however, indicate that the result of the leaky integration in the first stage should be used as the drift, and not as an initital condition for the diffusion in stage two. The results of Figure 4D can therefore be considered as a strong argument in favor of the two-stage model. In the following, we consider other aspects of the two-stage model.
Writing into the buffer
If the writing into the buffer is triggered at stimulus termination, as assumed in the two-stage model, the question arises why the switch from ‘B’ to the background, but not that from stimulus ‘A’ to ‘B’, triggers the transition from stage one to stage two in the two-stage model. We suggest that the large change from a vernier stimulus to background is “interpreted” as stimulus termination because there is a strong neural off-transient for a change from ‘A’ to a blank screen, whereas there are no on- and off-transients for a change from ‘A’ to ‘B’, respectively [41]. This is well in accordance with a Bayesian approach (see Supporting Figure S2) suggesting that feature integration should terminate when it becomes unlikely that the momentary stimulus is a continuation of the previous stimulus. The readout in the two-stage model should therefore start when a novelty value of the momentary stimulus crosses a predetermined threshold (cf. Supporting Text S1).
We tested this prediction by the psychophysical experiment in Figure 1, where the first vernier stimulus was followed by a blank background (interstimulus interval; ISI) before the second vernier was presented. With an ISI of 20 ms, the two vernier stimuli, presented for 10 ms each, became individually discriminable. Observers could tell whether the first stimulus was offset to the left or to the right by motion cues [41], [42]. However, for a sequence of ‘A’ immediately followed by ‘B’ with , verniers are not individually visible even though the total duration is 40 ms as in the sequence with the 20 ms ISI. This suggests that in the condition with the 20 ms ISI the termination signal of the first vernier stimulus stopped evidence integration and wrote the result into a buffer, for later use in stage two, whereas evidence was integrated across the two vernier stimuli in the experiment without the blank, before the final result was written into a buffer.
Start of the drift-diffusion process
In our experiments with ultra-short stimuli, the time where the read-out from the buffer starts, occurs after stimulus termination (and is included in the non-decisional time . We also tested a model where the decision process was triggered at stimulus termination, i.e., at the same moment when the result of evidence integration is written into the buffer (i.e. ). Such a model predicts that reaction times increase with total stimulus duration (data not shown), which disagrees with our observation that, for a given level of dominance, the mean reaction times remain largely constant for total stimulus durations of 20 ms, 40 ms, and 80 ms (Supporting Figure S3 D).
Two-stage model in the case of longer stimuli
For most of the stimuli considered so far, the total stimulus duration was below 40 ms. In this case, the two stages of the model are sequential and do not overlap. However, for longer stimuli, evidence integration of stage one is not finished at the moment of when the diffusive decision process in stage two is started.
Indeed, a model with fixed drift in stage two works well for stimuli up to a total duration of 80 ms, but breaks down at 160 ms (data not shown). However, our two stage model assumes that as soon as stimuli extend beyond , the momentary value of the evidence integration stage is written into the buffer and immediately used as drift in the diffusion process of stage two. The drift is updated continuously so that the diffusion process becomes time-varying. The fact that a constant drift in stage two fails when the stimulus extends over 160 ms indicates that the parameter of our model is much shorter than 160 ms. We tested this by fitting for individual subjects such that the mean square error in the dominance was minimized across all stimulus durations, including the 160 ms conditions. The optimal values for were indeed smaller than 160 ms (, , ).
In the model, we explored the situation that the first stimulus becomes much longer than . Obviously, if the first stimulus is made very long, our two-stage model then predicts that the first stimulus dominates.
Discussion
Most models of decision making do not account for the timing of stimuli. Likewise, most experiments use long stimulus durations in the range of several hundreds of milliseconds to seconds [9], [31], [32] or constant stimuli [20], [22], [25]. However, decision making has to occur in many situations in less than 100 ms, for example, when driving a car or playing soccer. Here, we have shown that rapid decision processes show very different characteristics than decision processes on longer time scales. For example for short stimulus durations, later presented stimuli dominate over stimuli presented earlier. We propose that these processes are also present in longer lasting decision processes, but are hidden and barely measureable.
To study the dynamics of rapid decision processes, we used a feature fusion paradigm. This paradigm relies on the well known effect of visual integration masking [43], [44] and follows partly Bloch's law [45]. The results of our experiments are in agreement with earlier results on feature fusion [38] and backward masking experiments [43], but do not agree with the traditional one-stage models of decision making in which evidence is integrated until a decision boundary is reached. The results of our experiment rather support a two-stage model in which evidence integration is separate from the actual decision process. This model is fundamentally different from classical drift-diffusion models [20], [22], [23], [27], race models [18], [19], [25], [32], attractor models [10], [30], [31], one-stage models with pre-processing [28], and “parallel” two-stage models [21], [46]. All these models predict the first stimulus to dominate when in contrast to the fusion results.
The two-stage model
In our model, we assumed several components which are worth discussion each. First, evidence integration in stage one must be leaky. It is the leak that explains why, when the first and second vernier stimulus are of the same duration (), the second vernier stimulus dominates (experiment one). The leak in our model arises naturally from a Bayesian approach and can be traced back to the fact that stimuli are expected to change in natural environments. Similar to our Bayesian novelty detection approach (cf. Supporting Text S1), the leaky evidence integration can also be derived in the framework of Kalman filters [47]–[50].
Second, the accumulated evidence must transferred at an appropriate moment and written into a temporal buffer. Such a buffer is necessary since decisions often occur a considerable time after the stimulus has disappeared. We suggest that the precise moment of transfer is set by a novelty score monitored during evidence integration (see Supporting Text S1). Such a novelty signal and subsequent buffering explains why the two vernier stimuli are perceived individually, if the stimuli are separated by a blank screen (ISI), but fused into a single percept in the absence of the blank. In this sense, feature fusion can be interpreted as a failure to detect the onset of a new stimulus because the new evidence is not sufficiently different to raise a ‘novelty signal’. In contrast, the switch from stimulus to background creates a sufficiently strong transient to stop the feature integration process (Supporting Figure S2).
Third, the noisy decision process is triggered at a fixed time after stimulus onset. If the decision process were triggered at , reaction times would increase with stimulus duration. This is, however, not the case (Supporting Figure S3D). From the fact that our model assumes a fixed start time of the second stage, it necessarily follows that we have to distinguish two different situations: If the total stimulus duration is shorter than , we need to bridge the time between the end of the stimulus and decision by storing the intermediate result of evidence integration into a buffer. This value is then used in stage two as a fixed mean drift rate. If the total stimulus duration is longer than , the result of stage one is used online as a time-dependent drift for all times until the end of the stimulus (at which point it is again ‘frozen’ and transferred into the drift-buffer.).
Our two stage model is similar to previous two stage models in which sensory processing, e.g. motion processing or contrast detection, precedes a decision making stage (e.g. [15], [16], [46]). In our model, the sensory integration stage is leaky to account for the dominance of the second vernier.
Leakage
Our two-stage model comprises a leaky integration stage followed by a drift-diffusion stage. The question arises whether or not a one-stage model with leak in the drift-diffusion process can explain the results. However, this is not the case because in such a model always the first stimulus dominates because the leak pushes the decision variable towards the starting point and not across it (Supporting Figure S3).
Another way to integrate the leak into a one-stage model is to directly transform the input by a leaky integrator (like our stage one) and to use the outcome of the leaky integrator as a time-variant drift in stage-two (, ). However, using stage one only for pre-processing will not change the pattern of results [28]. In such models, the decision variable also moves towards the decision bound for stimulus ‘A’ before dropping back to chance level. Therefore, these models also show a dominance of the first stimulus.
Window of integration
The novel features of the two-stage models are observable well only for stimuli in the range of up to about 100 ms. This duration is in line with the duration of visual integration found in other studies [51]–[53]. One of the paradoxical aspects of our model is that the second stage starts at a fixed time . Obviously, if the duration of a stimulus extends beyond , then the stage of evidence integration and that of stochastic decision making (stage two) will overlap and the separation into two distinct phases disappears (see Supporting Text S1). Therefore it is not surprising that for longer stimulus durations standard one-stage models work well [27], [29], [54].
External and internal noise
In our model, a deterministic filter (leaky integrator) is applied in stage one to a step-like input, representing a noiseless stimulus. This is the limiting case where the stimulus is considered to be of high contrast. In a more realistic scenario the stimulus itself is noisy. The stochasticity of stimuli leads, after stage one, to a noisy result of evidence integration, which is written into the buffer and then used as drift for stage two. This noisy result is modeled by the variance of the drift constant of stage two. It is therefore tempting to relate the stochasticity of drift constants to sensory or physical noise. The stochasticity of stage two may be related to internal noise in the brain [32], [55]. What is the advantage of adding a separate noisy decision process? It is well known that human observers can manipulate the speed-accuracy trade-off according to instruction or reward scheme by a change in strategy corresponding to a shift of the initial condition, , or the decision thresholds in the drift-diffusion process [13], [54].
Accumulators in decision making and motor preparation
Neurons in the superior colliculus [56], the LIP [9], [32], [33], the pre-motor cortex [57], [58], and the dorsoventral lateral prefrontal cortex [11], [12], [59] were shown to be involved in decision making. The firing rate of these neurons increases as long as stimuli are displayed. This ramping activity may relate either to evidence accumulation (“stage one”) or to decision making (“stage two”). Future experiments with feature fusion stimuli may be used to decide between these two alternatives.
Summary
In summary, it is often (intuitively) assumed that visual input directly translates into decisions. A stimulus presented first should drive decisions stronger and faster than a later stimulus (first in, first out). This is obviously correct when the two stimuli are long, because a decision may be reached even before the second stimulus can influence decision masking. In this case, we can assume that evidence integration and decision making are the same. However, for short stimuli this is not the case. Evidence integration and decision making can only be disentangled, when the two stimulus alternatives are presented within one trial (feature fusion) but not when only one stimulus is presented per trial, as it is usually. The distinction between evidence integration and decision making is described well by our two-stage model, where rapid stimuli are integrated and buffered before the decision process starts.
Materials and Methods
Ethics statement
All participants signed informed written consent. The study was approved by the Commission cantonale (VD) d'éthique de la recherche sur l'être humain (Lausanne, Switzerland) and conducted according to the principles expressed in the Declaration of Helsinki.
Observers
A total of 24 observers (8 female, aged 21–32 years) signed informed written consent. Participants had normal or corrected-to-normal visual acuity as measured by the Freiburg visual acuity test [60]. All but two observers (the first and second author) were naive to the purpose of the study. Naive observers were paid students from local universities.
Setup
Stimuli were presented on a Tektronix 608 X-Y display or a HP 1332A X-Y display. Both X-Y displays were equipped with a P11 phosphor and controlled by a PC via a fast 16 bit DA converter. Stimuli were presented at , a 1 MHz dot rate, a 500 Hz refresh rate, and a dot pitch of . Viewing distance was 2 m. The room was dimly illuminated by a background light () to prevent adaptation to scotopic vision. Stimulus contrast was close to 1.0. In each experiment, the conditions have been presented randomly interleaved to reduce the influence of hysteresis, learning, or fatigue in the averaged data.
Stimuli
The vernier stimuli were composed of two vertical segments. Each segment was 10′ (arc min) long, 0.5′ wide, separated by a vertical gap of 1′. A small horizontal offset was inserted between the upper and the lower segments (Figure 2A). Horizontal offset sizes ranged from 30″ to 40″ (arc sec). Offsets were chosen individually to be at least twice the offset size of the offset discrimination threshold for a single vernier stimulus of 20 ms duration as determined using the adaptive PEST procedure [61]. A sequence of two vernier stimuli with opposite offset directions was presented foveally in rapid succession. The offset direction of the first vernier (stimulus ‘A’) was chosen randomly in each trial (left or right). The second vernier (stimulus ‘B’) had an offset direction opposite to that of the first vernier. If, for example, the first vernier stimulus was offset to the left, the second vernier was offset to the right, and vice versa. Observers perceived only one fused vernier and were asked to report the position of the lower segment with respect to that of the upper segment by pressing one of two push buttons. Observers were instructed to respond as rapidly as possible, but also as accurately as possible. No feedback about performance was given. Naive observers did not know that a sequence of two vernier stimuli was presented.
Performance measure
We computed dominance, defined as the proportion of trials on which the response matched the offset direction of the first vernier stimulus. Thus, values above 50% indicate dominance of the first vernier (stimulus ‘A’); values below 50% indicate dominance of the second vernier (stimulus ‘B’). 50% vernier dominance is the point of subjective equality, i.e. first and second vernier stimulus equally contribute to performance.
Experiment 1
First vernier stimulus (‘A’) and second vernier (‘B’) were presented in immediate succession (Figure 2). Both vernier stimuli had either the same duration or the duration of one of the verniers was four times longer than the other. The total duration of the first and second vernier was 20 ms, 40 ms, 80 ms, or 160 ms. All conditions were presented in a random order. Every condition has been repeated 400 times per observer.
Experiment 2
As Experiment 1, except for that the duration of the first vernier was varied in 12 steps between 0 ms and 40 ms. The total duration + always summed up to a total of 40 ms (Figure 3A). Every condition has been repeated 400 times per observer.
Reference experiment
In Figure 1, an ISI was inserted between the first and second vernier stimulus. Observers were informed about the experimental design and asked to indicate whether the first or second vernier stimulus was offset to the right.
Reaction time analysis
Reaction times below 300 ms or above 1200 ms were excluded from analysis to reduce the impact of motor errors and unattended trials (less than 3% of the trials).
Model
We model the stimuli by a time-varying input signal , which is +1 during the presentation of stimulus ‘A’, −1 for stimulus ‘B’ and 0 otherwise. In the evidence accumulation stage of the two-stage model, the stimulus is subjected to leaky integration: Since our stimuli have high contrast, the evidence integration is modeled as a noise-free process.
For times larger than the integrated evidence is fed as the drift into the noisy drift-diffusion model at stage two. We distinguish two different cases. a) Stimuli are shorter than . At the termination of stimulus ‘B’ () the integrated evidence is stored and written into a buffer. Later, for the buffered value is used as the mean drift rate with a fixed scaling factor for the decision stage, which encompasses a standard drift-diffusion model. b) Stimuli are longer than . In this case the momentary evidence is used as the mean drift for . Again, at the end of the stimulus, the last value of the evidence is buffered and used as drift henceforth.
During stage two, in every trial, a decision variable is initialized at and evolves according to the Langevin equation where is the drift rate and is a Wiener process, which introduces noise to the decision process. A decision is made when the decision variable reaches one of two decision boundaries (decision ‘A’) or (decision ‘B’). The associated reaction time is the sum of a non-decisional time (which accounts for sensory delay , and the evidence integration and buffering times as well as motor delays) and the time when the decision variable reaches the boundary. We used the Ratcliff extension [20] of a standard drift diffusion model, in which the non-decisional time , the initial condition and the drift rate vary stochastically from trial to trial. The non-decisional time is drawn from uniform distributions with mean and width . The initial condition is drawn from uniform distributions with mean and width . The drift rate is drawn from a Gaussian distribution with mean – the output of the first stage – and standard deviation . and represent noise in the evidence accumulation.
As a reference, we used a one-stage model, which encompasses a standard drift diffusion model, in which the drift rate depends on time and is given by the input signal: . In this model, the drift becomes zero after the end of the stimulus. We also simulated leaky variants of this one-stage model, for details see Supporting Text S1.
Fitting
In the first step, the parameters , , , , , and of the decision stage were fitted to the experimentally obtained cumulative reaction time distributions by minimizing the product of the p-values of the Kolmogorov-Smirnov statistic for each stimulus condition [62], [63]. Responses to stimuli ‘A’ and ‘B’ and different stimulus conditions were fitted simultaneously using the fast-dm software of Voss & Voss [64]. For both experiments, fits were done individually for each observer. In the experiment of Figures 2, all parameters except the mean drift rate and the drift variability were the same in all stimulus conditions. The drift was calculated from stage one. Drift variability was a function of stimulus duration. In the experiment of Figures 3 and 4, only the mean drift rate was varied across conditions and calculated from stage one. In order to obtain the parameters and of the evidence integration in stage one we ran a simulation experiment with free drift rates as in Figure 4D. The obtained mean drift rates were then used to fit the time constant and the scaling factor , again separately for each observer. This fit was done using the fit-routine of MATLAB. Finally, to extract the optimal values for , we first used the data of experiment 1 with stimulus durations and fitted the parameters of both stages with the described procedure. Then, we performed a line scan of all values of and identified the value that minimized the mean square error of the measured dominance, now including the long duration of 160 ms.
Parameters are different for each observer, i.e. , , , , , , , and for stage one and .
Supporting Information
Acknowledgments
We would like to thank Marc Repnow for excellent technical support. We also would like to thank Frank Scharnowski for fruitful discussions during the planning of the experiment.
Footnotes
The authors have declared that no competing interests exist.
JR is funded by the Swiss National Science Foundation (SNF) project No. 320030-119736 “The dynamics of feature fusion”. MHH and WG receive support from the SNF Sinergia project CRSIKO-122697 “State representation in reward based learning - from spiking neuron models to psychophysics”. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Newell A. Human Problem Solving. Upper Saddle River, NJ, USA: Prentice-Hall, Inc; 1972. [Google Scholar]
- 2.Tversky A, Kahneman D. The framing of decisions and the psychology of choice. Science. 1981;211:453–458. doi: 10.1126/science.7455683. [DOI] [PubMed] [Google Scholar]
- 3.Platt ML, Glimcher PW. Neural correlates of decision variables in parietal cortex. Nature. 1999;400:233–238. doi: 10.1038/22268. [DOI] [PubMed] [Google Scholar]
- 4.Sugrue LP, Corrado GS, Newsome WT. Choosing the greater of two goods: neural currencies for valuation and decision making. Nat Rev Neurosci. 2005;6:363–375. doi: 10.1038/nrn1666. [DOI] [PubMed] [Google Scholar]
- 5.Bogacz R. Optimal decision-making theories: linking neurobiology with behaviour. Trends Cogn Sci. 2007;11:118–125. doi: 10.1016/j.tics.2006.12.006. [DOI] [PubMed] [Google Scholar]
- 6.von Neumann J, Morgenstern O. The Theory of Games and Economic Behavior. Princeton: Princeton Univ. Press; 1944. [Google Scholar]
- 7.Christopoulos GI, Tobler PN, Bossaerts P, Dolan RJ, Schultz W. Neural correlates of value, risk, and risk aversion contributing to decision making under risk. J Neurosci. 2009;29:12574–12583. doi: 10.1523/JNEUROSCI.2614-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Duda RO, Hart PE. Pattern Classification and Scene Analysis. New York: John Wiley & Sons; 1973. [Google Scholar]
- 9.Shadlen MN, Newsome WT. Neural basis of a perceptual decision in the parietal cortex (area LIP) of the rhesus monkey. J Neurophysiol. 2001;86:1916–1936. doi: 10.1152/jn.2001.86.4.1916. [DOI] [PubMed] [Google Scholar]
- 10.Beck JM, Ma WJ, Kiani R, Hanks T, Churchland AK, et al. Probabilistic population codes for bayesian decision making. Neuron. 2008;60:1142–1152. doi: 10.1016/j.neuron.2008.09.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Gold JI, Shadlen MN. The neural basis of decision making. Annu Rev Neurosci. 2007;30:535–574. doi: 10.1146/annurev.neuro.29.051605.113038. [DOI] [PubMed] [Google Scholar]
- 12.Heekeren HR, Marrett S, Ungerleider LG. The neural systems that mediate human perceptual decision making. Nat Rev Neurosci. 2008;9:467–479. doi: 10.1038/nrn2374. [DOI] [PubMed] [Google Scholar]
- 13.Deco G, Rolls ET, Romo R. Stochastic dynamics as a principle of brain function. Prog Neurobiol. 2009;88:1–16. doi: 10.1016/j.pneurobio.2009.01.006. [DOI] [PubMed] [Google Scholar]
- 14.Bogacz R, Wagenmakers EJ, Forstmann BU, Nieuwenhuis S. The neural basis of the speedaccuracy tradeoff. Trends Neurosci. 2010;33:10–16. doi: 10.1016/j.tins.2009.09.002. [DOI] [PubMed] [Google Scholar]
- 15.Reddi B. Decision making: The two stages of neuronal judgement. Curr Biol. 2001;11:R603–R606. doi: 10.1016/s0960-9822(01)00363-3. [DOI] [PubMed] [Google Scholar]
- 16.Carpenter R. Contrast, probability, and saccadic latency: Evidence for independence of detection and decision. Curr Biol. 2004;14:1576–1580. doi: 10.1016/j.cub.2004.08.058. [DOI] [PubMed] [Google Scholar]
- 17.Wald A. Sequential Analysis. New York: John Wiley & Sons; 1947. [Google Scholar]
- 18.Vickers D. Evidence for an accumulator model of psychophysical discrimination. Ergonomics. 1970;13:37–58. doi: 10.1080/00140137008931117. [DOI] [PubMed] [Google Scholar]
- 19.Reddi BAJ, Asrress KN, Carpenter RHS. Accuracy, information, and response time in a saccadic decision task. J Neurophysiol. 2003;90:3538–3546. doi: 10.1152/jn.00689.2002. [DOI] [PubMed] [Google Scholar]
- 20.Ratcliff R, Smith PL. A comparison of sequential sampling models for two-choice reaction time. Psychol Rev. 2004;111:333–367. doi: 10.1037/0033-295X.111.2.333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Smith PL, Ratcliff R. An integrated theory of attention and decision making in visual signal detection. Psychol Rev. 2009;116:283–317. doi: 10.1037/a0015156. [DOI] [PubMed] [Google Scholar]
- 22.Smith PL, Ratcliff R. Psychology and neurobiology of simple decisions. Trends Neurosci. 2004;27:161–168. doi: 10.1016/j.tins.2004.01.006. [DOI] [PubMed] [Google Scholar]
- 23.Holmes P, Brown E, Moehlis J, Bogacz R, Gao J, et al. Optimal decisions: From neural spikes, through stochastic differential equations, to behavior. IEICE Trans Fundamentals. 2005;88:2496–2503. [Google Scholar]
- 24.Bogacz R, Brown E, Moehlis J, Holmes P, Cohen JD. The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks. Psychol Rev. 2006;113:700–765. doi: 10.1037/0033-295X.113.4.700. [DOI] [PubMed] [Google Scholar]
- 25.Brown SD, Heathcote A. The simplest complete model of choice response time: linear ballistic accumulation. Cogn Psychol. 2008;57:153–178. doi: 10.1016/j.cogpsych.2007.12.002. [DOI] [PubMed] [Google Scholar]
- 26.Kepecs A, Uchida N, Zariwala HA, Mainen ZF. Neural correlates, computation and behavioural impact of decision confidence. Nature. 2008;455:227–231. doi: 10.1038/nature07200. [DOI] [PubMed] [Google Scholar]
- 27.Roxin A, Ledberg A. Neurobiological models of two-choice decision making can be reduced to a one-dimensional nonlinear diffusion equation. PLoS Comput Biol. 2008;4:e1000046. doi: 10.1371/journal.pcbi.1000046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Purcell BA, Heitz RP, Cohen JY, Schall JD, Logan GD, et al. Neurally constrained modeling of perceptual decision making. Psychol Rev. 2010;117:1113–1143. doi: 10.1037/a0020311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Salinas E. So many choices: what computational models reveal about decision-making mechanisms. Neuron. 2008;60:946–949. doi: 10.1016/j.neuron.2008.12.011. [DOI] [PubMed] [Google Scholar]
- 30.Wang XJ. Probabilistic decision making by slow reverberation in cortical circuits. Neuron. 2002;36:955–968. doi: 10.1016/s0896-6273(02)01092-9. [DOI] [PubMed] [Google Scholar]
- 31.Wong KF, Huk AC, Shadlen MN, Wang XJ. Neural circuit dynamics underlying accumulation of time-varying evidence during perceptual decision making. Front Comput Neurosci. 2007;1:6. doi: 10.3389/neuro.10.006.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Stanford TR, Shankar S, Massoglia DP, Costello MG, Salinas E. Perceptual decision making in less than 30 milliseconds. Nat Neurosci. 2010;13:379–385. doi: 10.1038/nn.2485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kiani R, Hanks TD, Shadlen MN. Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment. J Neurosci. 2008;28:3017–3029. doi: 10.1523/JNEUROSCI.4761-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Mazurek ME, Roitman JD, Ditterich J, Shadlen MN. A role for neural integrators in perceptual decision making. Cereb Cortex. 2003;13:1257–1269. doi: 10.1093/cercor/bhg097. [DOI] [PubMed] [Google Scholar]
- 35.Thorpe SJ, Fize D, Marlot C. Speed of processing in the human visual system. Nature. 1996;381:520–522. doi: 10.1038/381520a0. [DOI] [PubMed] [Google Scholar]
- 36.Brewster D. On the influence of succession of light upon the retina. Philos Mag. 1838;4:241. [Google Scholar]
- 37.Herzog MH, Parish L, Koch C, Fahle M. Fusion of competing features is not serial. Vision Res. 2003;43:1951–1960. doi: 10.1016/s0042-6989(03)00278-5. [DOI] [PubMed] [Google Scholar]
- 38.Scharnowski F, Hermens F, Herzog MH. Bloch's law and the dynamics of feature fusion. Vision Res. 2007;47:2444–52. doi: 10.1016/j.visres.2007.05.004. [DOI] [PubMed] [Google Scholar]
- 39.Efron R. The duration of the present. Ann N Y Acad Sci. 1967;138:713–729. [Google Scholar]
- 40.Churchland A, Kiani R, Chaudhuri R, Wang X, Pouget A, et al. Variance as a signature of neural computations during decision making. Neuron. 2011;69:818–831. doi: 10.1016/j.neuron.2010.12.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Macknik SL, Livingstone MS. Neuronal correlates of visibility and invisibility in the primate visual system. Nat Neurosci. 1998;1:144–9. doi: 10.1038/393. [DOI] [PubMed] [Google Scholar]
- 42.Rüter J, Kammer T, Herzog MH. When transcranial magnetic stimulation (TMS) modulates feature integration. Europ J Neurosci. 2010;32:1951–1958. doi: 10.1111/j.1460-9568.2010.07456.x. [DOI] [PubMed] [Google Scholar]
- 43.Breitmeyer B, Öğmen H. Visual Masking: Time Slices Through Conscious and Unconscious Vision. Clarendon Press; Oxford University Press; 2006. [Google Scholar]
- 44.Hermens F, Scharnowski F, Herzog MH. Spatial grouping determines temporal integration. J Exp Psychol Hum Percept Perform. 2009;35:595–610. doi: 10.1037/a0013706. [DOI] [PubMed] [Google Scholar]
- 45.Bloch AM. Experience sur la vision. Comptes Rendus de Seances de la Societe de Biologie (Paris) 1885;37:493–495. [Google Scholar]
- 46.Carpenter RHS, Reddi BAJ, Anderson AJ. A simple two-stage model predicts response time distributions. J Physiol. 2009;587:4051–4062. doi: 10.1113/jphysiol.2009.173955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Körding K. Decision theory: what “should” the nervous system do? Science. 2007;318:606–610. doi: 10.1126/science.1142998. [DOI] [PubMed] [Google Scholar]
- 48.Deneve S, Duhamel JR, Pouget A. Optimal sensorimotor integration in recurrent cortical networks: a neural implementation of kalman filters. J Neurosci. 2007;27:5744–5756. doi: 10.1523/JNEUROSCI.3985-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Huys QJM, Zemel RS, Natarajan R, Dayan P. Fast population coding. Neural Comput. 2007;19:404–441. doi: 10.1162/neco.2007.19.2.404. [DOI] [PubMed] [Google Scholar]
- 50.Marcille N. Models of Evidence Integration in Rapid Decision Making Processes. Ph.D. thesis, Ecole Polytechnique Federale Lausanne; 2011. [Google Scholar]
- 51.Cisek P, Puskas GA, El-Murr S. Decisions in changing conditions: the urgency-gating model. J Neurosci. 2009;29:11560–11571. doi: 10.1523/JNEUROSCI.1844-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ghose GM. Strategies optimize the detection of motion transients. J Vis. 2006;6:429–440. doi: 10.1167/6.4.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Ludwig CJH, Gilchrist ID, McSorley E, Baddeley RJ. The temporal impulse response underlying saccadic decisions. J Neurosci. 2005;25:9907–9912. doi: 10.1523/JNEUROSCI.2197-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Ratcliff R, McKoon G. The diffusion decision model: theory and data for two-choice decision tasks. Neural Comput. 2008;20:873–922. doi: 10.1162/neco.2008.12-06-420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Manwani A, Koch C. Detecting and estimating signals in noisy cable structures, II: information theoretical analysis. Neural Comput. 1999;11:1831–1873. doi: 10.1162/089976699300015981. [DOI] [PubMed] [Google Scholar]
- 56.Glimcher PW, Sparks DL. Movement selection in advance of action in the superior colliculus. Nature. 1992;355:542–545. doi: 10.1038/355542a0. [DOI] [PubMed] [Google Scholar]
- 57.Nachev P, Kennard C, Husain M. Functional role of the supplementary and presupplementary motor areas. Nat Rev Neurosci. 2008;9:856–869. doi: 10.1038/nrn2478. [DOI] [PubMed] [Google Scholar]
- 58.Forstmann BU, Dutilh G, Brown S, Neumann J, von Cramon DY, et al. Striatum and presma facilitate decision-making under time pressure. Proc Natl Acad Sci U S A. 2008;105:17538–17542. doi: 10.1073/pnas.0805903105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Schall JD. Neural basis of deciding, choosing and acting. Nat Rev Neurosci. 2001;2:33–42. doi: 10.1038/35049054. [DOI] [PubMed] [Google Scholar]
- 60.Bach M. The Freiburg Visual Acuity Test - automatic measurement of visual acuity. Optom Vision Sci. 1996;73:49–53. doi: 10.1097/00006324-199601000-00008. [DOI] [PubMed] [Google Scholar]
- 61.Taylor MM, Creelman CD. PEST: efficiency estimates on probability functions. J Acoust Soc Am. 1967;41:782–787. [Google Scholar]
- 62.Kolmogoroff A. Confidence limits for an unknown distribution function. Ann Math Statist. 1941;12:461–463. [Google Scholar]
- 63.Voss A, Rothermund K, Voss J. Interpreting the parameters of the diffusion model: an empirical validation. Mem Cognition. 2004;32:1206–20. doi: 10.3758/bf03196893. [DOI] [PubMed] [Google Scholar]
- 64.Voss A, Voss J. Fast-dm: a free program for efficient diffusion model analysis. Behav Res Methods. 2007;39:767–775. doi: 10.3758/bf03192967. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.