Abstract
Wagering contingent on a previous decision, or post-decision wagering, has recently been proposed to measure conscious awareness. Whilst intuitively appealing, it remains unclear whether economic context interacts with subjective confidence and how such interactions might impact on the measurement of awareness. Here we propose a signal detection model which predicts that advantageous wagers placed on the identity of preceding stimuli are affected by loss aversion, despite stimulus visibility remaining constant. This pattern of predicted results was evident in a psychophysical task where we independently manipulated perceptual and economic factors. Changes in wagering behaviour induced by changes in wager size were largely driven by changes in criterion, consistent with the model. However, for near-threshold stimuli, a reduction in wagering efficiency was also evident, consistent with an apparent but potentially illusory decrease in awareness of the stimulus. These findings challenge an assertion that post-decision wagering provides a direct index of subjective awareness.
Keywords: Post-decision wagering, Perception, Economics, Signal detection theory, Reward, Awareness, Metacognition, Psychophysics
1. Introduction
Knowledge of one’s own uncertainty regarding an outcome plays a key role in determining decision strategy (Foote & Crystal, 2007; Hampton, 2001; Kepecs, Uchida, Zariwala, & Mainen, 2008; Kiani & Shadlen, 2009; Smith, Shields, & Washburn, 2003), and may even be a central property of awareness (Cleeremans, Timmermans, & Pasquali, 2007; Dienes, 2008; Rosenthal, 2000, 2002). For example, knowing in advance that you are unlikely to pass a test makes you reluctant to take the test in the first place (Higham, 2007; Higham & Gerrard, 2005; Koriat & Goldsmith, 1996; Metcalfe & Finn, 2008), and may reflect awareness of your own (low) ability.
Such insights motivated the use of wagering contingent on decision performance (post-decision wagering) as a direct and intuitive measure of conscious awareness (Koch & Preuschoff, 2007; Persaud & McLeod, 2008; Persaud, McLeod, & Cowey, 2007). In its simplest form, a participant is required to detect whether an arbitrary stimulus is present or not and then, in a second stage, gamble on whether his/her response was correct or not. Advantageous wagers are made when participants accurately assess the level of sensory evidence on each trial, linking objective performance to the subjectivity of stimulus information (Persaud et al., 2007). Furthermore, using wagers to measure awareness can potentially overcome problems of interpretation associated with a lack of control over subjective reports (Eriksen, 1960) and the motivation of a subject (cf. Visser & Merikle, 1999). However, the interpretation of post-decision wagering data is potentially confounded by complex factors affecting how gains and losses affect performance (Clifford, Arabzadeh, & Harris, 2008; Dienes & Seth, 2009; Schurger & Sher, 2008), and a full behavioural analysis of advantages, and disadvantages, in defining consciousness based on these economic terms is lacking.
The processing stages underlying post-decision wagering can be logically decomposed using signal detection theory (SDT). In the first instance, it is assumed that objective features of the stimulus lead to a perceptual effect, modelled as a univariate random variable X (Fig. 1a), which we label “sensory evidence”. Discrimination errors at this stage are inevitable if the difference in sensory evidence between signal and noise (or other arbitrary category discriminations) is weak, leading to overlapping probability distributions (Fig. 1a). It is thus assumed that participants make a decision by splicing up an evidence axis into two halves using a decision criterion (Green & Swets, 1966). In a post-decision wagering experiment, the participant is now required to make a high or low wager on whether they were correct. This requires knowledge both of how likely it was that the first decision was correct, and their expected return from the wager (Clifford et al., 2008). The former can be recovered from the sensory evidence on a given trial (Galvin, Podd, Drga, & Whitmore, 2003). The latter, however, is psychologically more complex given evidence that people are loss averse (Kahneman & Tversky, 1979; Tversky & Kahneman, 1991), manifesting in a propensity to weight losses greater than gains when making value-based decisions. Loss aversion has been suggested to impact on post-decision wagering performance by modulating a link between the assessment of sensory evidence and overt confidence (Schurger & Sher, 2008).
To quantify how wagering on sensory evidence is influenced by economic factors, we first developed a simple computational model of post-decision wagering behaviour, building on previous work by Galvin and colleagues (Galvin et al., 2003), but now extending this framework to encompass loss aversion. Using this model, we derive precise predictions about how sensory evidence translates into advantageous wagering responses. The accuracy of metacognitive assessments can be intuited as how transparent the initial decision process is to a putative “higher” level assessment. For example, if there is ambiguity in the decision process then the categorisation of one’s own performance as being correct, or incorrect, will be subject to error. This intuition can be captured within the logic of signal detection theory (SDT), which assesses how faithfully an organism separates signal from noise (Green & Swets, 1966; Macmillan & Creelman, 2005). In standard applications of SDT (Type I), detection performance is assessed by a comparison of the proportion of “hits” and “false alarms” in a stimulus detection task. By applying the logic of SDT to post-decision wagering, we categorise a “hit” as a high wager after a correct decision and a “false alarm” as a high wager after an incorrect decision (see Table 1). This type of analysis is known as Type II SDT (Clarke, Birdsall, & Tanner, 1959; Clifford et al., 2008; Galvin et al., 2003; Kunimoto, Miller, & Pashler, 2001; Macmillan & Creelman, 2005). On the basis of these considerations, we derive a theoretical relationship linking Type I and Type II task performance by applying an ideal observer framework, enabling us to explore the relationship between post-decision wagering and changes in sensory evidence. Our specific prediction here was that post-decision wagering would be influenced by loss aversion, decoupling a link between perception and confident responding. To test our model, we examine data from a psychophysical experiment in which subjects made a high or low wager following a simple sensory judgement.
Table 1.
Type I decision | High wager | Low wager |
---|---|---|
Correct | Hit | Miss |
Incorrect | False alarm | Correct rejection |
2. Methods
2.1. Signal detection model
In signal detection theory (SDT), a Type I decision is based upon overlapping Gaussian probability distributions over a random variable X, conditional on the events signal (S) and noise (N) (Fig. 1a). Assuming an unbiased response criterion, c, for the Type I detection decision (Galvin et al., 2003), we can specify the distribution over X for the probability of the Type I response being correct or incorrect:
(1) |
(2) |
where p(C) and p(I) are the average probabilities of making a correct or incorrect response on any given trial (Macmillan & Creelman, 2005). Full derivations of Eqs. (1) and (2) can be found in (Galvin et al., 2003); the (constant) prior terms from their more general analysis are omitted here for clarity. The distributions specified by (1) and (2) are plotted graphically in Fig. 1b. It is important to note that these Type II distributions are conditional transformations based on whether the first decision was correct or not. That is, the shape of the f(x|C) curve follows the signal distribution when x > c (a Type I hit) and the noise distribution when x < c (a Type I correct rejection). Similarly, the shape of the f(x|I) curve follows shape of the noise distribution when x > c (a Type I false alarm) and the signal distribution when x < c (a Type I miss). The heights of both f(x|C) and f(x|I) are then scaled so that they sum to one.
A correct Type II response is more likely towards the left or right-hand extremes of X in Fig. 1b (high signal or high noise trials), whereas incorrect responses predominate where there is maximal overlap between signal and noise. The inherent assumption here is that the uncertainty associated with being sure of seeing something is the same as the uncertainty associated with being sure of not seeing something (the Type II distributions are symmetric around X = 0), an assumption we return to under Sections 3 and 4.
The log-likelihood of being correct on any given trial (likelihood ratio; LR) is the log of the ratio of (1) and (2):
(3) |
We assume that high wagers are made when the log-likelihood of being correct on the Type I task reaches a given criterion, βw. As the likelihood ratio is symmetric around c, there are thus two values of x for each βw, one for when x ⩽ c and one for when x > c. This corresponds to being sure that a signal was or was not present and wagering high.
In standard applications of SDT, the optimal placement of the log-likelihood ratio criterion can be calculated based on the relative utility of making a hit (H), miss (M), correct rejection (CR) or false alarm (FA) (assuming equal priors), as follows (Macmillan & Creelman, 2005):
(4) |
In Type I SDT, it is likely that the “reward” (R) for a false alarm or miss will be negative. Imagine taking part in a psychophysical task where the penalty was £1 for a false alarm and 1p for a miss, with 10p earned for correct responses. The optimal value of β would then be (0.1 − (−1))/(0.1 − (−0.01)) = 10: the odds favouring stimulus B should be 10:1 or better in order to risk a “B” response (that associated with the false alarm penalty). In our Type II analysis, we similarly assume that subjects base their response criterion, βw, on the relative utility of the four possible Type II wagering outcomes, which are mapped onto the four SDT categories of Eq. (4) using Table 1. This ratio is then adjusted by the prior probability of the trial being correct or incorrect (P(C) and P(I)):
(5) |
In contrast to Eq. (4) for Type I SDT, in Eq. (5) both hits and misses are associated with positive outcomes (trials are labelled “misses” when the Type I response was correct but only the low wager was used, leading to a lower reward than could have been obtained, but still a reward; cf. Kunimoto et al., 2001). Thus, the terms in the denominator of Eq. (5) are the positive utilities of each wager (R), whereas the outcomes in the numerator are associated with the negative costs of losing each wager (C) following incorrect responses.
Crucially, we incorporate asymmetric utility curves when calculating the values of R and C, as proposed within Prospect Theory (Kahneman & Tversky, 1979). A schematic of such a curve is shown in Fig. 1c. Risk aversion for mixed (gain or loss) gambles is explained through loss aversion, such that given the same absolute value losses carry a greater impact than gains (Tversky & Kahneman, 1991). This nonlinear relationship between gain (R) and loss (C) leads to a value of βw that is dependent on the size of the high wager, assuming the low wager remains fixed for simplicity.
The positive and negative utility of each wager is specified as follows:
(6) |
(7) |
where V is the objective value of the wager, r and t are power functions for the gain and loss domain respectively, and s is the loss aversion index. We restricted r and t such that 0 < r < t < 1, resulting in utility being concave for gains, convex for losses and more linear in losses than in gains (Fennema & Van Assen, 1999; Kobberling & Wakker, 2005). We define c ± m (as the likelihood ratio is symmetric about c) as values of x such that:
(8) |
Using the signal detection categories of Table 1, we then compute theoretical hit and false alarm rates for a range of values of m by integrating over the Type II probability distributions specified in (1), (2) and Fig. 1b:
(9) |
(10) |
2.2. Psychophysics
Thirteen participants (3 male, 10 female, mean age 27 ± 8.6 years) performed a lexical decision task spread over two sessions on consecutive days, each contributing 1440 trials to the analysis. The task was to decide whether a heavily masked stimulus was a word or a non-word, and then to gamble either “high” or “low” depending on the correctness of their initial judgement (Fig. 2). One-hundred and seventy-two four-letter words were taken from the MRC Psycholinguistic Database (Wilson, 1988), and matched for familiarity and written frequency of occurrence. A related set of pronounceable non-words was created by a random vowel change. To increase task difficulty, stimuli appeared randomly at any one of four locations surrounding a central fixation point. Threshold detection performance (79%) was assessed for each individual using a staircase procedure (Levitt, 1971); on each trial stimulus duration was 100%, 80% or 50% of this threshold. The low wager remained constant across blocks at 50p; the value of the high wager varied across blocks (£1, £2, £5 or £10). Responses were made using the left-hand (“z” or “x” keys) to indicate the word/non-word, and the right-hand (“up” or “down” arrow keys) to indicate a high or low gamble.
Stimulus duration was randomised within blocks, giving a 3 × 4 factorial design crossing high wager size (£1, £2, £5 or £10) with stimulus visibility (100%, 80% or 50% of threshold duration). This design allowed us to recover an estimate of advantageous wagering (proportion of high wagers following correct responses; “hits” in Table 1) for each cell in our design.
Participants were informed that the wager they were making was a true mixed gamble: they would win the money they wagered if their initial word/non-word decision was correct, and would have this amount deducted from their running total if they were incorrect. They were also informed that only one trial from each block was to be evaluated at random, allowing us to use relatively larger amounts of money while still encouraging participants to treat each trial as a meaningful gamble.
2.3. Analysis
To quantify a match between the psychophysical data and the model, we computed the likelihood of each subject’s wagers under the model (from the integrals in Eqs. (9) and (10)), optimised over free parameters of the utility function. We compared two parameterisations of the utility function: one with, and one without, a loss aversion constant (Tversky & Kahneman, 1991). We took the parameters r, t and s to be free in Model 1 (loss averse + power), and r and t to be free in Model 2 (power). The testable model space was constrained by requiring the loss and gain domains of the utility function to be asymmetric, as symmetric functions do not predict any effect of wager size on the Type II response criterion (see Supplementary Material for a more detailed discussion of empirical forms of the utility curve).
Type I sensitivity was set for each level of stimulus visibility using empirically estimated d′ values from each subject. For each model, we fit the free parameters to each participant’s choice data by maximising the likelihood of the observed choices, using a nonlinear optimisation algorithm (fminsearch in the Matlab Optimisation toolbox). We report negative log-likelihoods (smaller values indicate better fit), penalised for model complexity using the Bayesian information criterion (BIC) (Schwarz, 1978). To compare models at the group level we summed each subject’s BIC, resulting in a log group Bayes factor (Stephan, Weiskopf, Drysdale, Robinson, & Friston, 2007).
For signal detection analysis, Type I “word” responses were classified as hits or false alarms; “non-word” responses were classified as correct rejections or misses. For both the model and the data we computed Type II hit and false alarm rates using the classification scheme of Table 1. To allow comparison with previous datasets employing post-decision wagering (Persaud & McLeod, 2008; Persaud et al., 2007), advantageous wagers were defined as Type II hit rates (proportion of high gambles made after correct decisions). A repeated-measures analysis of variance (ANOVA) was used to analyse these proportions, following an arcsine transformation of the data to correct for heterogeneity of variance. In addition Type I SDT measures of the initial perceptual discrimination (d′ and c) and Type II sensitivity () and criterion (cw) for the wagering response were calculated as follows (where z is the inverse of the cumulative normal distribution function):
(11) |
(12) |
(13) |
(14) |
0.5 was added to all cells in calculation of both Type I and Type II SDT measures to avoid extreme proportions of H or F (Hautus, 1995). SDT measures calculated for each wager and sensitivity condition were analysed using repeated-measures ANOVAs.
3. Results
3.1. Model comparison
As an initial examination of the validity of our model, we computed estimates of advantageous wagering (the proportion of high wagers following correct responses) from both the model output and psychophysical data (Fig. 3a and b). Perceptual sensitivity levels in the model were set to match mean d′ values across subjects in the psychophysical experiment. It is clear that advantageous wagering is predicted not only to increase as a function of perceptual sensitivity, but also to decrease as a function of wager size. The latter decrease is a direct consequence of the loss-averse utility function implemented in the model.
We next tested the model predictions against psychophysical data. Advantageous wagering decreased with both decreasing stimulus visibility (effect of visibility; F(2,24) = 99.87, p < .0001) and with increasing wager size (effect of wager; F(3,36) = 10.71, p < .0001; Fig. 3b), closely matching the predictions of the model (Fig. 3a). Even when the stimulus was most visible, the size of the wager significantly modulated a tendency to gamble high following correct responses (paired t-test between £1 and £10 wagers; t(12) = 3.90, p = .002). Indeed, for the range of stimulus durations we tested here, the effect of wager size did not interact with stimulus visibility (F(6,72) < 1, p > .5).
In assessing the match between the psychophysical results and the model, we compared two parameterisations of the utility function: one with a loss aversion constant (Tversky & Kahneman, 1991) and one without. The model with the loss aversion constant was strongly preferred, despite the Bayesian information criterion (BIC) measure including a penalty for additional complexity (Fig. 3c; BIC difference of 73.2, corresponding to an e73.2 difference in Bayes factor). This suggests the transformation of perceptual sensitivity into confident wagering was indeed significantly affected by the loss aversion parameter of the model. The average utility function adopted by participants is shown in Fig. 3d. Across subjects, the loss aversion parameter s ranged from 1.94 to 4.53, with a mean of 2.74 ± 0.92. Such a parameter range is comparable with previous findings in the literature (Tom, Fox, Trepel, & Poldrack, 2007; Tversky & Kahneman, 1991).
3.2. Type I signal detection analysis
As expected, longer stimulus durations were associated with higher Type I d′ (F(2,24) = 123.47, p < .0001). Importantly, however, Type I perceptual sensitivity remained constant across changes in wager size (interaction of wager size and visibility: F(6,72) = 1.07, p > .3; Fig. 4a). As outlined in Section 2, an explicit assumption of our model is that Type I response criteria are unbiased. To evaluate this assumption we calculated Type I criterion for the word/non-word discrimination as a function of both stimulus visibility and wager size (Fig. 4b). A consistent bias towards responding “word” for the higher visibility trials was observed, leading to a significant effect of visibility on response criterion (F(2,24) = 23.48, p < .0001). However, as for d′, this bias was expressed consistently across changes in wager size (no interaction with wager size; F(6,72) = 1.31, p > .2), suggesting that these criterion shifts are not sufficient to explain biases in post-decision wagering induced by loss aversion.
3.3. Type II signal detection analysis
The results described under Model Comparison above show that a typical post-decision wagering measure – Type II hit rate – is subject to a bias that can be explained by loss aversion, despite the underlying Type I sensitivity to the stimulus remaining constant. However, participants’ wagering efficiency (ability to discriminate between correct and incorrect trials) might still be independent of their overall propensity to wager high (Higham, 2007; Kunimoto et al., 2001; but see Evans & Azzopardi, 2007). In other words, despite using the £10 wager less than the £1 wager for a given level of sensitivity (Fig. 3b), does this usage still discriminate well between correct and incorrect decisions?
To answer this question we computed Type II measures of wagering efficiency, , and wagering criterion, cw (see Section 2) from both model and psychophysics data, as a function of stimulus visibility and wager size (Fig. 5a and b). The model predicts that wager size should primarily affect cw (due to loss aversion), without affecting , which is a function of Type I d′ (Fig. 5a; left-hand panel). The data indeed showed a strong main effect of wager size on cw (F(2,24) = 45.87, p < .0001), which did not interact with stimulus visibility (F(6,72) = 0.56, p > .7), in agreement with the model (right-hand panel of Fig. 5b). The model also predicts a main effect of stimulus visibility on cw, in that wagering is generally more conservative when the task is more difficult (Fig. 5a, right-hand panel), an effect borne out in the data (F(3,36) = 8.93, p < .005).
The match between the model and the data on was less consistent. As predicted, we found a main effect of stimulus visibility (F(2,24) = 7.78, p < .005; Fig. 5a), which did not show any significant interactions with wager size (F(6,72) = 1.78, p > .1). However, for the lowest visibility, we noted a downward trend in as wager size increased, reflected in a linear effect of wager size that interacted with visibility level (F(1,12) = 8.66, p = .012). Indeed, when comparing participants’ sensitivity to their own correctness in low-visibility judgements for £1 and £10 wager blocks, dropped by almost half (from 0.91 to 0.50). To ensure that these effects were not stimulus-specific (cf. Higham, Perfect, & Bruno, 2009; Experiment 3), we reanalysed the low-visibility condition as a function of whether the stimulus was a word or non-word (Fig. 6). A main effect of wager size on was confirmed (F(3,36) = 3.95, p = .016) which did not interact with stimulus type (F(3,36) < 1). A main effect of stimulus type was also found (F(1,12) = 34.4, p < .001), replicating Higham et al.’s (2009) findings that monitoring is worse for distractors (non-words in our task) than targets (words). Overall, our results show that the size of the wager largely affects the criterion for using the high-wager response, but for near-threshold stimuli, also affects metacognitive performance.
3.4. Test of model assumptions
An inherent assumption of our model is that the wagering criterion (m in Fig. 1b) is symmetric about c. In other words, it is assumed that the same degree of loss aversion leads to similar wagering behaviour following reports of both signal and noise trials. It is possible that these criteria (c + m and c − m) are independently specified; for example, it may be easier to be sure of seeing something than to be sure of not seeing something. To examine such effects, we split and cw by whether the first response had been “word” or “non-word” (equivalent to calculating separate hit and false alarm rates for the left- and right-hand sides of the distributions in Fig. 1b). These were entered into a 2 (Type I response) × 3 (visibility) × 4 (wager size) ANOVA. A robust main effect of Type I response on Cw was found (F(1,12) = 37.3, p < .0001), which was driven by a more conservative criterion for wagering following non-word (mean cw = 0.60) compared to word responses (mean cw = −0.16). Importantly, however, this dependence of cw on the initial decision was highly consistent across both wager size (interaction with wager size; F(2,24) < 1, p > .9) and changes in stimulus visibility (F(3,36) < 1, p > .6), indicative of a stable baseline shift that does not affect the experimental manipulations of interest. In addition, we confirmed that the effects of wager size on and cw (reported in Section 3.3) were not qualified by interactions with stimulus or Type I response (word/non-word; all F < 1).
Finally, we note that the results presented in Figs. 5b and 6 are derived from point estimates of and cw, which in turn rely on the assumption of equal variance Gaussian confidence distributions, f(x|C) and f(x|I) (Green & Swets, 1966). In fact, the Type II probability distributions used in the model are known to deviate from these assumptions (Fig. 1b; Galvin et al., 2003). Such deviations produce interactions between Type II d′ and shifts in criterion (Evans & Azzopardi, 2007). However, we consider it unlikely that our observation of decreasing with wager size is an artefact of a criterion shift, for two reasons. First, the prediction based on the Type II ROC curve is that for higher wagers, more conservative criteria would actually result in slight increases in (Fig. 5a, left-hand panel; see also Evans & Azzopardi, 2007). The opposite is seen in the data for low-visibility stimuli (Fig. 5b, left-hand panel and Fig. 6). Second, the same effects hold when an overall measure of the wagering “error” rate is calculated (overall proportion of false alarms and misses in Table 1). Collapsing over stimulus visibility, greater errors were seen in the usage of a £10 wager (45 ± 5%) than in the usage of a £1 wager (34 ± 3%).
4. Discussion
Measuring conscious awareness is fraught with controversy (Seth, Dienes, Cleeremans, Overgaard, & Pessoa, 2008) where even the simplest of methods, asking for subjective reports, is dogged by the conundrum of response bias (Eriksen, 1960). Recently, wagering contingent on decision performance (post-decision wagering) has been proposed as a direct and intuitive measure of conscious awareness (Koch & Preuschoff, 2007; Persaud et al., 2007; Persaud & McLeod, 2008). In the present study, we examine how economic factors affect wagers placed on perceptual decisions, and consider the theoretical implications our results have for post-decision wagering as a measure of subjective awareness.
Our signal detection model, derived from equations developed by Galvin et al. (2003), defines the transformations of noisy sensory evidence into confidence distributions on which wagering responses are based. Due to inherent loss aversion (Kahneman & Tversky, 1979; Schurger & Sher, 2008), this model predicts that advantageous wagering (Type II hits) on a perceptual event depends on the size of the gamble, despite sensory evidence remaining constant. Our psychophysics data confirm this pattern of predicted results. Our findings suggest that loss aversion is a key modulator of the linkage between perceptual sensitivity and confident wagering. This effect was true even for the highest stimulus visibility, demonstrating a labile linkage between perception and behaviour.
It is clear from these results that the typical post-decision wagering measure – advantageous wagering, or Type II hit rate – is subject to a bias induced by loss aversion, despite underlying Type I sensitivity as measured by responses to the stimulus remaining constant. Data from a related task requiring uncertainty monitoring (a scholastic aptitude test where omission of responses was allowed), revealed an analogous reduction in test score when the penalty for incorrect answers was increased (Higham, 2007). Higham went on to show that this reduction was due to more conservative Type II response criteria in the high-penalty condition. Similarly, our model predicts that the effect of loss aversion on advantageous wagering is largely due to a criterion (cw) shift, reducing the propensity of the subject to opt for the high wager regardless of whether her first response was correct or not. In contrast, Type II wagering efficiency (; the difference between good and bad wagers in Table 1) should remain relatively constant in the face of changing wager size. In our psychophysical task, we find predictable effects of the wager size on Type II response criteria, but also observe decreases in as a function of wager size for near-threshold stimuli. In other words, for the same underlying task performance, participants become worse at discriminating between their own correct and incorrect responses as wager size increases.
Previous work has shown that Type II sensitivity and criterion are not in fact independent when derived from contingency tables such as Table 1 (Evans & Azzopardi, 2007), assuming equal variance Type I distributions (Galvin et al., 2003). Indeed, similar (but more subtle) increases in sensitivity as a function of wager size were predicted by our model (Fig. 5a, left panel). However, the empirical effect we observe is a decrease in for near-threshold stimuli as wager size increases (Fig. 5b, left panel and Fig. 6). What might cause this decrease in wagering efficiency on high-wager blocks? It is possible that the prospect of high wagers interferes with participants’ performance monitoring on low-visibility (difficult) trials. However, this account might also predict decreases in Type I task performance, which are not seen in our data. We note that recent evidence suggests that the prospect of large rewards (comparable to those used in the present study) causes paradoxical performance decrements (Mobbs et al., 2009; see also Baumeister, 1984). Whether such influences of large rewards on behaviour could also lead to a reduction in performance monitoring, consistent with the present data, is a question that requires further study.
4.1. Generation of confidence
Two inherent assumptions of our model deserve attention in order to illustrate the current limitations of a purely signal detection-based theory of confidence. A first assumption is that the initial sensory judgement is unbiased. Empirically, however, there was a consistent bias towards responding “word” for the medium and high stimulus visibility conditions. As this bias was consistent across wager size, it is unlikely to affect our conclusions regarding the effects of wager size on behaviour. However, it does affect how we characterise the link between underlying stimulus distributions and confidence, an issue we consider below. A second assumption of the model is that the criteria for wagering are symmetrical (Fig. 1b). This corresponds to equating the confidence required to wager high for words and non-words. In our behavioural data, we see a more conservative criterion for wagering high following non-word decisions. However, this shift was expressed consistently across conditions, leaving unaffected the main conclusions we infer regarding changes in cw and as a function of wager size. In our model, wagering criteria are symmetric because the same utility function is applied to symmetric signal and noise distributions. Within this framework, a baseline difference in wagering criteria may be caused by the variance of the signal (word) distribution being greater than the noise (non-word) distribution, leading to a more prominent f(I|x) distribution following non-word responses and requiring more conservative criteria to maintain a given level of wagering performance. Interestingly, greater signal variance when compared to noise in empirical ROCs has been documented in the memory literature (Ratcliff, Sheu, & Gronlund, 1992). However, it is unclear from the present data whether distributional or response factors are the main drivers of the observed asymmetry in wagering criteria.
These behavioural deviations from the model hint at a deeper issue regarding the relationship between Type I task performance and metacognitive confidence (Baranski & Petrusic, 2001; Busey, Tunnicliff, Loftus, & Loftus, 2000; Higham et al., 2009). In our signal detection model, probability distributions over the stimulus are assumed to deterministically give rise to Type II confidence distributions, without invoking the notion of intermediate processing stages. Higham et al. (2009) identify this generation of confidence from stimulus distributions as the “direct translation hypothesis”. They empirically tested this hypothesis by varying Type I response criteria in a memory paradigm, noting that Type II sensitivity should be systematically affected if Type I distributions are indeed directly translated into Type II distributions, a conclusion supported by their data (Higham et al., 2009; Experiment 3). However, recent human psychophysical data reveal dissociations between objective performance and subjective confidence, suggesting a more complex relationship between Type I and Type II distributions (Wilimzig, Tsuchiya, Fahle, Einhauser, & Koch, 2008; see also Busey & Arici, 2009). In addition, reaction time measures suggest that confidence is at least partly determined by additional processing following the decision itself (Baranski & Petrusic, 1998, 2001). In the present study, word–non-word discrimination may be a suboptimal testbed for the direct translation hypothesis, given that the evidence dimension is unlikely to be unitary. For example, some words may be easier to process than others, engendering higher fluency and thus influencing metacognitive assessment (Kelley & Lindsay, 1993; Koriat, 1993; Winkielman, Halberstadt, Fazendeiro, & Catty, 2006). Future work is needed examine the empirical form of Type I and Type II distributions under simpler psychophysical conditions.
4.2. Post-decision wagering and awareness
What do the present results say about post-decision wagering as a measure of awareness? On a psychological level, the use of advantageous wagering counts (Type II hit rate) is confounded by factors (presumably) external to stimulus awareness (see also Dienes & Seth, 2009). For example, blindsight subject GY was seen to wager high 48% of the time after correct responses to stimuli in his blind hemifield (Persaud et al., 2007; p. 257). Persaud and colleagues argue that as this proportion is no better than chance (50%), it is evidence for a lack of awareness. However, it is clear from the model and data we present here that simply altering the size of the wager can manipulate this proportion to be consistent with higher or lower awareness. Application of Type II SDT measures to GYs response counts might be a more convincing demonstration of a lack of awareness (Persaud et al. (2007), their Supplementary Table 6; but see Clifford et al. (2008)). Again, however, this measure is not perfect: we find changes in wager size produced unpredictable effects on for the types of near-threshold stimuli often used in consciousness research, and particular forms of the payoff matrix can produce values of consistent with a lack of awareness (Clifford et al., 2008). Further problems arise when making comparisons across subjects, or when comparing patients with controls: in our model fit, loss aversion varied considerably between subjects (and may vary in an even more unpredictable fashion within different patient populations), leading to the interpretation of wagering responses being confounded by individual differences in the subjective utility of the gamble (Dienes & Seth, 2009; Schurger & Sher, 2008). There are obvious instances where the ease of use and nonverbal nature of wagering may outweigh such drawbacks, for instance when measuring awareness in non-human animals and children. In these cases, care should be taken to address potentially illusory changes in awareness caused by economic factors.
On a philosophical level, even if it is possible to control for the confounding effects of loss aversion, it is unclear whether awareness can be inferred from successful wagering (Seth, 2008). Recent data demonstrates that adaptive, value-based responses can be made in response to stimuli that are below an objective threshold of awareness (Pessiglione et al., 2007, 2008). Indeed, from an economist’s perspective, a post-decision wager is a gamble in which the sensory uncertainty determines the probabilities of winning and losing. Graded changes in either the stimulus or outcome utilities will produce probabilistic changes in wagering behaviour, which may not have obvious mappings to particular states of consciousness. In contrast, explicitly taking into account this graded nature of processing using, for example, direct confidence rating scales may offer more robust means of assessing both objective sensitivity and subjective meta-sensitivity (Szczepanowski & Pessoa, 2007). In this regard our results emphasise the importance of considering both stimulus and response variables when assessing conscious awareness (Clifford et al., 2008; Evans & Azzopardi, 2007; Hulme, Friston, & Zeki, 2009).
In summary, using a combination of signal detection theory and psychophysics we show that the translation of perceptual sensitivity into a post-decision wagering response is systematically affected by economic variables, in this instance by loss aversion. Type II signal detection measures reveal that changes in behaviour induced by changes in wager size are largely driven by changes in criterion, consistent with our model. However, when stimulus visibility is low and wagers are large, a reduction in wagering efficiency is also seen. Indeed, the complex interaction between objective stimulus visibility, wager size and the subsequent willingness to gamble casts doubt on an assertion (Persaud et al., 2007) that post-decision wagering is a direct index of subjective awareness, despite its intuitive nature. Such interactions raise intriguing questions for future work into the relationship between stimulus processing, subjective awareness and the generation of metacognitive confidence.
Acknowledgements
We are indebted to Paul Azzopardi for insightful comments on an earlier version of this manuscript. We additionally thank Chris Frith, Klaas Stephan and anonymous reviewers for helpful suggestions, and Brett Wilkinson for mathematical advice. This work was carried out under a Wellcome Trust Programme Grant to RJD and MRC funding within the UCL 4 Year PhD in Neuroscience to SMF.
Footnotes
Supplementary data associated with this article can be found, in the online version, at doi:10.1016/j.concog.2009.11.002.
Appendix A. Supplementary material
References
- Baranski J.V., Petrusic W.M. Probing the locus of confidence judgments: Experiments on the time to determine confidence. Journal of Experimental Psychology: Human Perception and Performance. 1998;24:929–945. doi: 10.1037//0096-1523.24.3.929. [DOI] [PubMed] [Google Scholar]
- Baranski J.V., Petrusic W.M. Testing architectures of the decision–confidence relation. Canadian Journal of Experimental Psychology. 2001;55:195–206. doi: 10.1037/h0087366. [DOI] [PubMed] [Google Scholar]
- Baumeister R.F. Choking under pressure: Self-consciousness and paradoxical effects of incentives on skillful performance. Journal of Personality and Social Psychology. 1984;46:610–620. doi: 10.1037//0022-3514.46.3.610. [DOI] [PubMed] [Google Scholar]
- Busey T.A., Arici A. On the role of individual items in recognition memory and metacognition: Challenges for signal detection theory. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2009;35:1123–1136. doi: 10.1037/a0016646. [DOI] [PubMed] [Google Scholar]
- Busey T.A., Tunnicliff J., Loftus G.R., Loftus E.F. Accounts of the confidence–accuracy relation in recognition memory. Psychonomic Bulletin and Review. 2000;7:26–48. doi: 10.3758/bf03210724. [DOI] [PubMed] [Google Scholar]
- Clarke F.R., Birdsall T.G., Tanner W.P. Two types of ROC curves and definition of parameters. Journal of the Acoustical Society of America. 1959;31:629–630. [Google Scholar]
- Cleeremans A., Timmermans B., Pasquali A. Consciousness and metarepresentation: A computational sketch. Neural Networks. 2007;20:1032–1039. doi: 10.1016/j.neunet.2007.09.011. [DOI] [PubMed] [Google Scholar]
- Clifford C.W.G., Arabzadeh E., Harris J.A. Getting technical about awareness. Trends in Cognitive Sciences. 2008;12:54–58. doi: 10.1016/j.tics.2007.11.009. [DOI] [PubMed] [Google Scholar]
- Dienes Z. Subjective measures of unconscious knowledge. Progress in Brain Research. 2008;168:49–64. doi: 10.1016/S0079-6123(07)68005-4. [DOI] [PubMed] [Google Scholar]
- Dienes, Z., & Seth, A. (2009). Gambling on the unconscious: A comparison of wagering and confidence ratings as measures of awareness in an artificial grammar task. Consciousness and Cognition, doi:10.1016/j.concog.2009.09.009. [DOI] [PubMed]
- Eriksen C.W. Discrimination and learning without awareness – A methodological survey and evaluation. Psychological Review. 1960;67:279–300. doi: 10.1037/h0041622. [DOI] [PubMed] [Google Scholar]
- Evans S., Azzopardi P. Evaluation of a ‘bias-free’ measure of awareness. Spatial Vision. 2007;20:61–77. doi: 10.1163/156856807779369742. [DOI] [PubMed] [Google Scholar]
- Fennema H., Van Assen M. Measuring the utility of losses by means of the tradeoff method. Journal of Risk and Uncertainty. 1999;17:277–295. [Google Scholar]
- Foote A.L., Crystal J.D. Metacognition in the rat. Current Biology. 2007;17:551–555. doi: 10.1016/j.cub.2007.01.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galvin S.J., Podd J.V., Drga V., Whitmore J. Type 2 tasks in the theory of signal detectability: Discrimination between correct and incorrect decisions. Psychonomic Bulletin and Review. 2003;10:843–876. doi: 10.3758/bf03196546. [DOI] [PubMed] [Google Scholar]
- Green D.M., Swets J.A. Wiley; New York: 1966. Signal detection theory and psychophysics. [Google Scholar]
- Hampton R.R. Rhesus monkeys know when they remember. Proceedings of the National Academy of Sciences. 2001;98:5359–5362. doi: 10.1073/pnas.071600998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hautus M.J. Corrections for extreme proportions and their biasing effects on estimated values of d’. Behavior Research Methods Instruments & Computers. 1995;27:46–51. [Google Scholar]
- Higham P.A. No special K! A signal detection framework for the strategic regulation of memory accuracy. Journal of Experimental Psychology: General. 2007;136:1–22. doi: 10.1037/0096-3445.136.1.1. [DOI] [PubMed] [Google Scholar]
- Higham P.A., Gerrard C. Not all errors are created equal: Metacognition and changing answers on multiple-choice tests. Canadian Journal of Experimental Psychology. 2005;59:28–34. doi: 10.1037/h0087457. [DOI] [PubMed] [Google Scholar]
- Higham P.A., Perfect T.J., Bruno D. Investigating strength and frequency effects in recognition memory using type-2 signal detection theory. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2009;35:57–80. doi: 10.1037/a0013865. [DOI] [PubMed] [Google Scholar]
- Hulme O.J., Friston K.F., Zeki S. Neural correlates of stimulus reportability. Journal of Cognitive Neuroscience. 2009;21:1602–1610. doi: 10.1162/jocn.2009.21119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kahneman D., Tversky A. Prospect Theory – Analysis of decision under risk. Econometrica. 1979;47:263–291. [Google Scholar]
- Kelley C.M., Lindsay D.S. Remembering mistaken for knowing: Ease of retrieval as a basis for confidence in answers to general knowledge questions. Journal of Memory and Language. 1993;32:1–24. [Google Scholar]
- Kepecs A., Uchida N., Zariwala H., Mainen Z.F. Neural correlates, computation and behavioural impact of decision confidence. Nature. 2008;455:227–231. doi: 10.1038/nature07200. [DOI] [PubMed] [Google Scholar]
- Kiani R., Shadlen M.N. Representation of confidence associated with a decision by neurons in the parietal cortex. Science. 2009;324:759–764. doi: 10.1126/science.1169405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kobberling V., Wakker P.P. An index of loss aversion. Journal of Economic Theory. 2005;122:119–131. [Google Scholar]
- Koch C., Preuschoff K. Betting the house on consciousness. Nature Neuroscience. 2007;10:140–141. doi: 10.1038/nn0207-140. [DOI] [PubMed] [Google Scholar]
- Koriat A. How do we know what we know? The accessibility model of the feeling of knowing. Psychological Review. 1993;100:609–639. doi: 10.1037/0033-295x.100.4.609. [DOI] [PubMed] [Google Scholar]
- Koriat A., Goldsmith M. Monitoring and control processes in the strategic regulation of memory accuracy. Psychological Review. 1996;103:490–517. doi: 10.1037/0033-295x.103.3.490. [DOI] [PubMed] [Google Scholar]
- Kunimoto C., Miller J., Pashler H. Confidence and accuracy of near-threshold discrimination responses. Consciousness and Cognition. 2001;10:294–340. doi: 10.1006/ccog.2000.0494. [DOI] [PubMed] [Google Scholar]
- Levitt H. Transformed up–down methods in psychoacoustics. Journal of the Acoustical Society of America. 1971;49:467–477. [PubMed] [Google Scholar]
- Macmillan N.A., Creelman C.D. Lawrence Erlbaum; New York: 2005. Detection theory: A user’s guide. [Google Scholar]
- Metcalfe J., Finn B. Evidence that judgements of learning are causally related to study choice. Psychonomic Bulletin and Review. 2008;15:174–179. doi: 10.3758/pbr.15.1.174. [DOI] [PubMed] [Google Scholar]
- Mobbs D., Hassabis D., Seymour B., Marchant J.L., Weiskopf N., Dolan R.J. Choking on the money: Reward-based performance decrements are associated with midbrain activity. Psychological Science. 2009;20:955–962. doi: 10.1111/j.1467-9280.2009.02399.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Penny W.D., Stephan K.E., Mechelli A., Friston K.J. Comparing dynamic causal models. NeuroImage. 2004;22:1157–1172. doi: 10.1016/j.neuroimage.2004.03.026. [DOI] [PubMed] [Google Scholar]
- Persaud N., McLeod P. Wagering demonstrates subconscious processing in a binary exclusion task. Consciousness and Cognition. 2008;17:565–575. doi: 10.1016/j.concog.2007.05.003. [DOI] [PubMed] [Google Scholar]
- Persaud N., McLeod P., Cowey A. Post-decision wagering objectively measures awareness. Nature Neuroscience. 2007;10:257–261. doi: 10.1038/nn1840. [DOI] [PubMed] [Google Scholar]
- Pessiglione M., Petrovic P., Daunizeau J., Palminteri S., Dolan R.J., Frith C.D. Subliminal instrumental conditioning demonstrated in the human brain. Neuron. 2008;59:561–567. doi: 10.1016/j.neuron.2008.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pessiglione M., Schmidt L., Draganski B., Kalisch R., Lau H., Dolan R.J. How the brain translates money into force. A neuroimaging study of subliminal motivation. Science. 2007;316:904–906. doi: 10.1126/science.1140459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ratcliff R., Sheu C.F., Gronlund S.D. Testing global memory models using ROC curves. Psychological Review. 1992;99:518–535. doi: 10.1037/0033-295x.99.3.518. [DOI] [PubMed] [Google Scholar]
- Rosenthal D.M. Consciousness, content, and metacognitive judgments. Consciousness and Cognition. 2000;9:203–214. doi: 10.1006/ccog.2000.0437. [DOI] [PubMed] [Google Scholar]
- Rosenthal D.M. How many kinds of consciousness? Consciousness and Cognition. 2002;11:653–665. doi: 10.1016/s1053-8100(02)00017-x. [DOI] [PubMed] [Google Scholar]
- Schurger A., Sher S. Awareness, loss aversion, and post-decision wagering. Trends in Cognitive Sciences. 2008;12:209–210. doi: 10.1016/j.tics.2008.02.012. [DOI] [PubMed] [Google Scholar]
- Schwarz G. Estimating the dimension of a model. Annals of Statistics. 1978;6:461–464. [Google Scholar]
- Seth A.K. Post-decision wagering measures metacognitive content, not sensory consciousness. Consciousness and Cognition. 2008;17:981–983. doi: 10.1016/j.concog.2007.05.008. [DOI] [PubMed] [Google Scholar]
- Seth A.K., Dienes Z.n., Cleeremans A., Overgaard M., Pessoa L. Measuring consciousness: Relating behavioural and neurophysiological approaches. Trends in Cognitive Sciences. 2008;12:314–321. doi: 10.1016/j.tics.2008.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith J.D., Shields W.E., Washburn D.A. The comparative psychology of uncertainty monitoring and metacognition. Behavioural and Brain Sciences. 2003;26:317–339. doi: 10.1017/s0140525x03000086. [DOI] [PubMed] [Google Scholar]
- Stephan K.E., Weiskopf N., Drysdale P.M., Robinson P.A., Friston K.J. Comparing hemodynamic models with DCM. NeuroImage. 2007;38:387–401. doi: 10.1016/j.neuroimage.2007.07.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Szczepanowski R., Pessoa L. Fear perception: Can objective and subjective awareness measures be dissociated? Journal of Vision. 2007;7(4):1–17. doi: 10.1167/7.4.10. 10. [DOI] [PubMed] [Google Scholar]
- Tom S.M., Fox C.R., Trepel C., Poldrack R.A. The neural basis of loss aversion in decision-making under risk. Science. 2007;315:515–518. doi: 10.1126/science.1134239. [DOI] [PubMed] [Google Scholar]
- Tversky A., Kahneman D. Loss aversion in riskless choice. A reference-dependent model. Quarterly Journal of Economics. 1991;106:1039–1061. [Google Scholar]
- Visser T.A.W., Merikle P.M. Conscious and unconscious processes: The effects of motivation. Consciousness and Cognition. 1999;8:94–113. doi: 10.1006/ccog.1998.0378. [DOI] [PubMed] [Google Scholar]
- Wilimzig C., Tsuchiya N., Fahle M., Einhauser W., Koch C. Spatial attention increases performance but not subjective confidence in a discrimination task. Journal of Vision. 2008;8(5):1–10. doi: 10.1167/8.5.7. [DOI] [PubMed] [Google Scholar]
- Wilson M.D. The MRC psycholinguistic database: Machine readable dictionary, version 2. Behavioural Research Methods, Instruments and Computers. 1988;20:6–11. [Google Scholar]
- Winkielman P., Halberstadt J., Fazendeiro T., Catty S. Prototypes are attractive because they are easy on the mind. Psychological Science. 2006;17(9):799–806. doi: 10.1111/j.1467-9280.2006.01785.x. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.