Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2019 May 13;9:7327. doi: 10.1038/s41598-019-43860-w

Individual differences in working memory capacity and cue-guided behavior in humans

Sara Garofalo 1,, Simone Battaglia 1, Giuseppe di Pellegrino 1
PMCID: PMC6514037  PMID: 31086233

Abstract

Information gathered via Pavlovian and Instrumental learning can be integrated to guide behavior, in a phenomenon experimentally known as Pavlovian-to-Instrumental Transfer (PIT). In particular, in appetitive PIT, a reward-associated cue is able to enhance the instrumental response previously associated with the same (outcome-specific PIT), or a similar (general PIT), reward. The PIT effect is increasingly investigated for its numerous implications in clinical contexts as well as daily life situations. Nevertheless, the precise mechanism behind it is not yet clear. The relation between the PIT effect and high-level cognitive abilities - like working memory - is still unknown, but potentially relevant to unveil its functioning. The present study aims to examine the precise relationship between individual differences in working memory and the two forms of PIT effect, namely outcome-specific and general. For this purpose, 100 participants underwent a classical PIT paradigm. Results showed a relationship between individual working memory and outcome-specific PIT, but not general PIT. Importantly, the role of working memory was not related to the acquisition of the learning contingencies, but rather linked to an imbalance between congruent and incongruent choices. The results are discussed in terms of the adaptive and maladaptive implications for human behavior.

Subject terms: Decision, Learning and memory

Introduction

In daily life, we learn which actions and which environmental circumstances can lead us to achieve a goal. Two very basic learning mechanisms involved in this ability are the well-known instrumental and Pavlovian conditioning1. With instrumental conditioning, we learn to predict the consequences of a given action, whereas with Pavlovian conditioning we learn to predict the consequences associated with the presence of external cues. Information gathered via such learning processes, even if independently acquired, can be integrated to guide behaviour24. An example of this mechanism can be observed in the Pavlovian-to-Instrumental Transfer (PIT) effect, which tests the extent to which a Pavlovian stimulus (S) can drive instrumental responses (R) independently paired with the same or a similar outcome (O).

According to some authors, instrumental responses can be controlled by either an R-O association (e.g., pressing the button is rewarded with chocolate), or by an O-R association (e.g., I want chocolate, so I press the button)5,6. And it is also acknowledged in the literature that the presentation of a Pavlovian stimulus can activate both a general motivational representation of the potential reward value (i.e., food craving), and a representation of the specific sensory features of the precise outcome at stake (e.g., dark chocolate)7. So, how can these two mechanisms interact? The cross connection between the two paths to response control, and the two representations that a Pavlovian stimulus can elicit, is at the foundation of two separate forms of transfer, namely outcome-specific PIT and general PIT2,3,7,8. In outcome-specific PIT, the Pavlovian stimulus exerts a selective influence on instrumental responses associated with the same reinforcer via a more5,9 or less10 direct link to the outcome representation. That is, a Pavlovian stimulus associated with dark chocolate will trigger responses specifically associated with dark chocolate. Conversely, in general PIT the stimulus invigorates instrumental responses paired with motivationally similar reinforcers (i.e., reinforces with the same value) which is detached by the specific sensory properties of the outcome2,3. For example, a Pavlovian stimulus associated with dark chocolate triggers responses associated with any kind of food.

Such interaction between instrumental and Pavlovian learning systems determines a cue-guided behaviour, which can present both adaptive and maladaptive implications. It can trigger a flexible response that facilitates the achievement of a goal (e.g., obtain food when hungry), but it can also lead to an inflexible association when a response is perseverated in absence of a reward or despite a negative consequence3,11. Most literature focused on the latter implication. In fact, the PIT effect has been associated with many clinical conditions, such as addiction1215, compulsive behavior16 and other neuropsychiatric disorders, like depression17 or schizophrenia18. Previous findings11 suggested that individual differences in learning style may play a role in the predisposition to such maladaptive implications. A PIT-like effect can also be observed in marketing strategies, where a logo or a jingle can prime a potential consumer to purchase the associated product19.

Given the repercussions in clinical contexts and daily life, the PIT effect is increasingly investigated in humans and has been largely examined also in non-human animals2,3. However, the precise mechanism behind this effect is not yet clear and whether the influence that a Pavlovian stimulus can exert on instrumental choices can be attributed to more strategic cognitive components, or to more automatic determinants, is still hotly debated12,2022.

In particular, the relation between the PIT effect and high-level cognitive abilities – such as working memory - is still unknown, but potentially relevant to shed new light on this effect. A number of findings suggest that working memory is crucially involved in decision-making, especially when the ability to integrate past experiences about action and outcome values is needed to guide current decisions18,23,24. By definition, working memory is the ability to maintain information so that it can be integrated with other information and used to guide behavior; it is involved in maintaining current goals by selecting relevant information and inhibiting irrelevant stimuli25,26. Crucially, the PIT effect requires the combination of different pieces of information without re-exposure to the outcome, a process which is very likely to require working memory27. When facing multiple R− > O (instrumental learning) and S− > O (Pavlovian learning) associations, individual differences in working memory may, thus, influence the accessibility of such representations to control subsequent behavioral expressions28. Interestingly, recent findings suggest that high working memory capacity is not associated with the expression of conditioning per se (i.e., learning of the contingencies), but rather with a more effective use of the acquired information in later phases29. In particular, as compared to general PIT, outcome-specific PIT requires retaining a detailed representation of the outcomes available and of each response-outcome pairing30, thus it should rely more on working memory.

Based on these premises, the present experiment aims to test to which extent outcome-specific and general PIT effect are related to individual working memory capacity. For this purpose, 60 participants performed a PIT task (Table 1), consisting of three phases: instrumental conditioning, in which they learned the association between three instrumental responses and the relative reward or neutral outcome (Fig. 1a); Pavlovian conditioning, in which the same rewards were associated with visual stimuli (Fig. 1b); PIT, in which the participants were asked to perform an instrumental choice under extinction (i.e., in absence of rewards) while being presented with task-irrelevant Pavlovian stimuli (Fig. 1c). The aim of this last phase is to test the ability of Pavlovian stimuli to promote instrumental responses previously associated with a reward (i.e., PIT effect). Working memory ability was tested with the Automated Operation Span Task31.

Table 1.

Trial composition for all task phases.

TRIALS CONTINGENCY
Instrumental Conditioning R+1 → O1
R+2 → O2
R− → NR
Pavlovian Conditioning CS+1 → O1
CS+2 → O2
CS− → NR
PIT Outcome-specific CS+1: R1 vs R2
CS+2: R1 vs R2
General CS+1: R2 vs R−
CS+2: R1 vs R−
vs CS−: R2 vs R−
CS−: R1 vs R−

Figure 1.

Figure 1

Pavlovian-to-Instrumental Transfer (PIT) task. Visual representation of the three task phases to test for the PIT effect. During instrumental conditioning (a) participants learned the association between three instrumental responses and the associated reward or neutral outcome. During Pavlovian conditioning (b) the same rewards were associated with visual stimuli. During PIT (c) participants were asked to perform the again an instrumental choice under extinction (i.e., in the absence of rewards) while being presented with task-irrelevant Pavlovian stimuli.

The hypothesis is that individual differences in working memory will be reflected in the individual tendency to show either outcome-specific or general PIT. More precisely, supporting a detailed representation of the associations between each response and their outcomes, higher working memory capacity should be associated with larger outcome-specific PIT effect. Conversely, working memory should play a more marginal role when the transfer effect reflects enhancement of responding by a motivational component, like in the case of general PIT. Furthermore, the stage in which working memory is more relevant will be analyzed by relating individual working memory ability not only to the PIT effect itself but also to the strength of instrumental and Pavlovian learning.

Results

Instrumental conditioning

Implicit learning

To ensure learning of instrumental contingencies, the number of rewarded (R+1, R+2) and unrewarded (R−) responses (mouse clicks) performed during each 10-second trial was compared. Response (R+1/R+2/R−) served as the independent variable and the average number of responses performed as the dependent variable. Results showed a main effect of response (F(1.46, 144.38) = 192.75; p < 0.0001; part. η2 = 0.66; BF10 = 2.39e + 72; err% = 1.53). Post-hoc analysis confirmed a statistically significant difference (ps <0.0001) between R + 1 (number of responses m = 20.05; sd = 8.54) and R− (number of responses m = 7.38; sd = 5.46), as well as between R+2 (number of responses m = 21.68; sd = 8.04) and R−. Figure 2a shows means and clearly separate 95% confidence intervals. These results indicate a preference for both reward-paired responses (R+1 and R+2) as compared to the unrewarded response (R−), thus indicating successful instrumental learning. Critically, no preference between the two rewarded responses emerged.

Figure 2.

Figure 2

Cat’s eye plot of instrumental and Pavlovian learning. Panel (a) shows the number of responses performed during the instrumental conditioning, where a significant difference between reward-associated responses (R+1, R+2) and unrewarded response (R−) was found. Panel (b) shows the liking scores of the stimuli before and after undergoing Pavlovian conditioning, where an was observed increase for the reward-associated stimuli (CS+1, CS+2) and a reduction for the unrewarded stimulus (CS−). Overall, these results confirm that participants correctly learned instrumental and Pavlovian contingencies.

Explicit learning

Analysis of explicit associations confirmed that participants correctly reported the response-outcome associations on 93.33% of cases.

Pavlovian conditioning

To ensure Pavlovian learning, two measures were considered: reaction times to the presentation of the patch preceding the outcome, and the pre-post change in liking of the conditioned stimuli (CSs).

Implicit learning (CS liking)

Phase (pre/post conditioning) and CS (CS+1/CS+2/CS−) served as independent variables. Liking scores served as dependent variable. Results showed a significant phase by CS interaction (F(2.196) = 44.319; p < 0.001; part. η2 = 0.32; BF10 = 1706255; err% = 1.47). Post-hoc analysis showed a significant difference between CS+1 pre (m = 5.33; sd = 1.87) and CS+1 post (m = 5.94; sd = 1.77; p = 0.001), between CS+2 pre (m = 5.34; sd = 1.81) and CS+2 post (m = 6.30; sd = 1.75; p < 0.001), and between CS− pre (m = 5.47; sd = 1.81) and CS− post (m = 4.51; sd = 1.91; p < 0.001). Importantly: in the pre-conditioning phase, there were not differences in liking scores between all CSs (p = 1); in the post-conditioning phase, liking scores between CSs+ were significantly different from CS− (p < 0.001) but not between CS+1 and CS+2 (p = 0.28). Overall, these results confirm that Pavlovian conditioning did occur. More specifically they show that (1) before Pavlovian training all stimuli were equally liked, and that (2) after Pavlovian training the liking for both CS+1 and CS+2 increased while the liking for CS− decreased (Fig. 2b). This last evidence might be explained by the relative devaluation of the CS− when compared to the reward-associated stimuli in the post-conditioning phase.

Implicit learning (reaction times)

CS (CS+1/CS+2/CS−) served as the independent variable and reaction times as the dependent variable. Results showed a significant main effect of CS (F(1.34, 132.98) = 10.30; p = 0.0005; part. η2 = 0.09; BF10 = 1.82e + 16; err% = 0.61). Post-hoc analysis confirmed a statistically significant difference (ps < 0.006) between CS+1 (milliseconds m = 302.74; sd = 74.84) and CS− (milliseconds m = 318.95; sd = 88.01), as well as between CS+2 (milliseconds m = 295.84; sd = 67.99) and CS−, but not between CS+1 and CS+2 (p = 0.1). Participants were significantly faster when a reward was anticipated (CS+1, CS+2) as compared to when no reward was expected (CS−). This reward-specific response facilitation confirms successful Pavlovian conditioning11.

Explicit learning

Analysis of explicit associations confirmed that participants correctly reported the stimulus-outcome associations on 96.11% of cases.

Pavlovian-to-Instrumental Transfer (PIT)

Outcome-specific PIT

The outcome-specific transfer was tested considering only those trials in which CS+1 or CS+2 were presented while choosing between R+1 or R+2. The rationale of the outcome-specific transfer is to test if a CS (i.e., a reward-paired cue) is able to elicit a response independently associated with the same reinforcer. For this aim, all responses were categorized as congruent (choosing R+1 while CS+1 is presented, or choosing R+2 while CS+2 is presented) or incongruent (choosing R+1 while CS+2 is presented, or choosing R+2 while CS+1 is presented) and compared. Response (congruent/incongruent) served as the independent variable and number of responses as the dependent variable. Results showed a significant difference (F(1, 99) = 152; p < 0.0001; part. η2 = 0.61; BF10 = 1.15e + 37; err% = 1.74) between congruent (number of responses m = 8.28; sd = 2.64) and incongruent (number of responses m = 2.25; sd = 2.37) responses. Figure 3a shows means and clearly separate 95% confidence intervals. Form this results it is possible to conclude that participants showed outcome-specific transfer, as the task-irrelevant presence of a CS+ induced a preference for the instrumental response previously associated with the same reward.

Figure 3.

Figure 3

Cat’s eye plot of the Pavlovian-to-Instrumental Transfer. Panel (a) shows the outcome-specific PIT effect, as represented by a significant difference between the number of congruent (e.g., choosing R+1 when CS+1 is presented) and incongruent responses (e.g., choosing R+2 when CS+1 is presented). Panel (b) shows the general PIT effect, as represented by an increased propensity to choose the reward-associated response (R+) when a reward-associated stimulus is presented (CS+) as compared to the unrewarded stimulus is presented (CS−).

General PIT

The general transfer was tested by comparing the preference for rewarded responses when presented with a CS+, relative to the CS−. CS (CS+/CS−) served as the independent variable and the number of rewarded responses (R+1 or R+2) as the dependent variable. Results showed a significant difference (F(1, 99) = 4.81; p = 0.03; part. η2 = 0.08; BF10 = 784841584730; err% = 1.11) between CS+ (m = 5.71; sd = 5.05) and CS− (m4 = 4–09; sd = 4–72). Figure 3b shows means and clearly separate 95% confidence intervals. From this result it is possible to conclude that participants showed general transfer, as the task-irrelevant presence of a CS+ induced a preference for the instrumental response previously associated with a (different) reward, as compared to the presence of the CS−.

Working memory and indexes of learning

It was checked whether working memory could be related to the participant’s ability to learn during instrumental and Pavlovian learning tasks32. No significant correlation was found neither between working memory and instrumental learning index (t(98) = 0.81, p = 0.41, r = 0.08, 95% CI = −0.11 0.27; BF10 = 0.32) nor between working memory and Pavlovian learning index (t(98) = −0.34, p = 0.72, r = −0.03, 95% CI = −0.23 0.16; BF10 = 0.24).

Working memory and PIT

A first approach to investigate the relationship between working memory and PIT was to calculate two separate Pearson’s product-moment correlation coefficients for outcome-specific and general effect, respectively. Results showed a significant positive correlation (t(98) = 3.78, p = 0.0002, r = 0.35, 95% CI = 0.17 0.51; BF10 = 124.08) between working memory and outcome-specific transfer (Fig. 4c), whereas no correlation was found (t(98) = −0.59, p = 0.55, r = −0.06, 95% CI = −0.25 0.13; BF10 = 0.27) between working memory and general transfer (Fig. 4d).

Figure 4.

Figure 4

Correlation between Working Memory and Pavlovian-to-Instrumental Transfer. Panel (a) shows absence of correlation between working memory and instrumental learning index. Panel (b) shows absence of correlation between working memory and Pavlovian learning index. Panel (c) shows a significant positive correlation between working memory and outcome-specific PIT. Panel (d) shows absence of correlation between working memory and general PIT. The grey area represents the 95% confidence interval.

Working memory and outcome-specific PIT

To further inspect the relationship between individual differences in working memory and outcome-specific PIT, the raw number of congruent and incongruent responses was taken into account. The whole sample was divided into two groups with a median split, in order to separate individuals with high and low working memory working memory ability (median = 39). Working memory ability (high/low) and response (congruent/incongruent) served as the independent variable. The number of responses was used as the dependent variable. Results showed a significant difference (F(1, 98) = 165.14; p < 0.0001; part. η2 = 0.63; BF10 = 1.97e + 31; err% = 1) between congruent (m = 8.28; sd = 2.64) and incongruent (m = 2.25; sd = 2.37) responses. A significant interaction between working memory and response was also found (F(1, 98) = 8.62; p = 0.004; part. η2 = 0.08; BF10 = 8.36e + 32; err% = 2.77), where congruent responses were significantly higher in the high working memory group (m = 9.02; sd = 2.34) as compared to the low working memory (m = 7.56; sd = 2.74) group, and incongruent responses were significantly lower in the high working memory group (m = 1.58; sd = 1.96) as compared to the low working memory (m = 2.89; sd = 2.56) group (Fig. 5a).

Figure 5.

Figure 5

Working Memory and outcome-specific Pavlovian-to-Instrumental Transfer. Panel (a) shows the raw number of responses in the high and low working memory (WM) groups. Panel (b) shows positive correlation between working memory and the number of congruent responses. Panel (c) shows negative correlation between working memory and the number of incongruent responses. The grey area represents the 95% confidence interval.

Moreover, correlation analysis (Pearson’s product-moment) evidenced that working memory ability was not related to the overall number of responses performed during the PIT phase (t(198) = 0.05, p = 0.95, r = 0.003, 95% CI = −0.13 0.14; BF10 = 0.16), but positively correlated with the number of congruent responses (t(198) = 3.44, p = 0.0008, r = 0.32, 95% CI = 0.14 0.49; BF10 = 44.85) and negatively correlated with the number of incongruent responses (t(198) = −3.73, p = 0.0003, r = −0.35, 95% CI = −0.51 − 0.17; BF10 = 108.12) (Fig. 5b,c).

The contribution of working memory and all learning indexes

To further evaluate the specific contribution of working memory as well as other learning factors to the expression of transfer, two ordinary linear models were fitted with either outcome-specific or general PIT indexes as dependent variables and instrumental learning index (number of responses), Pavlovian learning index (change in liking), and working memory as independent factors. The model on outcome-specific transfer was statistically significant (F(3,96) = 5.60, p = 0.001, R2 = 0.15). In particular, working memory proved to be the major determinant of the outcome-specific effect. The model on general transfer was not statistically significant (F(3,96) = 0674, p = 0.57, R2 = 0.02). All estimates and relative importance metrics are reported in Table 2 and Fig. 6.

Table 2.

Contribution of working memory and learning factors to the PIT effect.

β Std. Error t p RI
Outcome-specific PIT
Working Memory 0.01 0.01 3.61 <0.001 0.82
Instrumental 0.21 0.13 1.56 0.12 0.17
Pavlovian −0.01 0.02 −0.21 0.82 0.01
General PIT
Pavlovian 0.60 0.48 1.24 0.21 0.81
Working Memory −0.02 0.04 −0.50 0.61 0.15
Instrumental 0–84 3.14 0.26 0.79 0.03

The table reports estimates of the linear model and relative importance metrics (RI).

Figure 6.

Figure 6

Contribution of working memory and learning factors. Relative importance metrics for outcome-specific (a) and general (b) PIT effect were calculated from two separate ordinary linear models. The PIT index (outcome-specific or general) was used as dependent variable and instrumental learning index, Pavlovian learning index, and working memory (WM) as independent factors. Metrics are expressed as a percentage of explained variability (R2) and factors are plotted in descending order.

Discussion

The present study examined the relationship between individual differences in working memory capacity and two forms of Pavlovian-to-Instrumental Transfer (PIT), namely outcome-specific and general. For the first time, a dissociation between outcome-specific and general PIT was related to the role played by working memory in the mediation of this effect. More specifically, individual working memory was strongly correlated with outcome-specific PIT (Fig. 4c), but no relation with general PIT (Fig. 4d). Importantly, the data here presented suggest that this relation was not secondary to an influence of working memory on the acquisition of the learning contingencies. Indeed, neither instrumental nor Pavlovian learning indexes were related to working memory (Fig. 4a,b). This evidence is in line with the idea that, rather playing a role in the acquisition of conditioning29, individual differences in working memory may become crucial for the recall and strategic use of past learning. However, since the present experiment was not specifically designed to address this hypothesis, conclusion on this issue should be cautious32.

A deeper analysis of the relation between working memory and outcome-specific PIT, further highlighted that this connection was not related to an increase in the number of responses performed (no difference in the mean number of responses between high and low working memory groups), but rather to a precise imbalance between congruent and incongruent responses (Fig. 5a). In other words, even if the overall number of responses stays the same, people with higher working memory tend to be more selective in their choices (i.e., they make more congruent responses and less incongruent responses). This interpretation was confirmed by the positive correlation found between individual working memory ability and the number of congruent responses (Fig. 5b) and by the negative correlation between individual working memory ability and the number of incongruent responses (Fig. 5c).

Finally, when looking at the specific contributors to the transfer effect, it was further confirmed that an explicit representation of the instrumental contingencies and individual working memory ability were the strongest determinants of outcome-specific PIT (Fig. 6a). This result particularly supports the hypothesis that higher working memory is associated with higher outcome-specific PIT effect because a detailed representation of all outcomes and response-outcome pairings is at the foundation of this particular form of transfer (S− > O− > R chain). Although for general PIT the model was not statistically significant, it may be interesting to note that the strongest contributor was represented by an implicit measure of Pavlovian conditioning (change in CS liking) (Fig. 6b).

Taken together, the present findings support the idea, also highlighted in previous works20,33, that outcome-specific and general PIT are sustained by different underlying mechanisms. More precisely, working memory seems to be less relevant when the Pavlovian cue informs the easy choice between two responses, of which only one was previously associated with a reward (general PIT). In such a situation, only action values need to be considered. Conversely, working memory becomes a crucial requirement when the same information from the same Pavlovian cue can guide a more difficult choice between two actions with similar motivational values (outcome-specific PIT). In this case, a precise integration between the specific nature of the outcome and the consequences associated with each response is required. In other words, if a task-irrelevant cue signals a potentially useful information for a complex choice, the working memory comes into play to integrate past experiences about R → O and S → O associations in order to guide such choice18,23,24. Importantly, such influence can have adaptive or maladaptive implications1218.

Given the current understanding of the PIT phenomenon and its implications in both daily life and clinical conditions (see introduction), it seems important to explore the extent to which Pavlovian stimuli can either be a strategic guide for behavior or exert a maladaptive control over instrumental responding. An attempt will be made to interpret the present findings in this sense.

When interpreting a phenomenon like the PIT effect, a mediating effect of the outcome representation - whether its value or its sensory features - can hardly be doubted34, but the nature of this link is still unclear. Many pieces of evidence point to the fact that the working memory system is critically involved in the guidance of action selection through executive functions, and that cognitively controlled decision-making relies on working memory21,35,36. Many findings also suggest that individual differences in working memory play no role in performances depending on automatic processes37. In this regard, it would be interesting to test whether a differentiation between more implicit and executive components of working memory (i.e., the phonological loop and the central executive) may further clarify its role in outcome-specific and general PIT23,38.

Following this narrative and the demonstrated relation with individual working memory capacity, can it be argued that outcome-specific PIT is guided by an explicit anticipation of the associative rules, thus being cognitive and strategic? Likewise, can the absence of a relation between working memory and general PIT be interpreted as evidence that more automatic processes are responsible for this form of PIT? Although a number of studies go along with both speculations12,15,18,30,34,3945, evidence is still indirect and careful consideration should be taken in drawing such conclusions. In the present study, for example, a limitation is represented by the presence of a smaller effect size for the general PIT (part. η2 = 0.14), as compared to the specific PIT (part. η2 = 0.61), which may weaken the possibility of uncovering a relation with the variables included in the subsequent analysis, including working memory.

Above all, the critical question is whether individual differences in working memory can be reflected in different facets of cue-guided behaviors and, therefore, in a different propensity to develop maladaptive behaviors. Can higher working memory be an advantage in guiding behavior? We propose that the critical role played by working memory resides in the integration of the values coming from two different sources: the cue and the response46,47. In outcome-specific PIT, multiple sources of information converge to signal a higher chance to find a reward (congruent response). The role of working memory should not be intended here as a mere cognitive resource35, but rather as the mechanism defining the best choice. The data here presented may indicate that an adaptive component of PIT, as reflected by the outcome-specific form, may go along with working memory to flexibly guide behavior toward the most convenient choice. If that is the case, higher working memory ability could constitute a competitive advantage, whereas a lack of working memory could be a potential risk factor towards the development of maladaptive conducts. Within this scenario, a clear distinction between the two mechanisms appears important for clinical purposes, as more automatic responses may be more prone to evolve in a maladaptive behavior (like drug-seeking), as compared to strategically guided actions.

Drug addiction has been conceptualized as the endpoint of a transition from voluntary, through habitual and ultimately compulsive drug use, which directly depends on interactions between Pavlovian and instrumental learning processes16,48. Drug-related forms of PIT have been clearly reported in the literature12,13,15,49,50, but there is no clear evidence about the prevalence of one form over the other (outcome-specific vs. general) in the addicted population. A possibility is that while occasional use of drugs can be initially guided by an outcome-specific PIT-like mechanism, while the endpoint to a compulsive use can be more governed by a general PIT-like mechanism. Such a distinction may have profound implications in the treatment and prevention of addiction. A propensity to present one or the other form of transfer could be intended as a marker of vulnerability to cue-induced pathological behaviors. Importantly, these implications could be not limited to addiction, but extended to compulsive behavior and neuropsychiatric disorders, like depression or schizophrenia1618.

The view proposed here and elsewhere20,33 is consistent with the idea that, at least in humans, outcome-specific and general PIT are sustained by a different cognitive mechanism, potentially involving different paths of the underlying corticostriatal loop9,16,18,51,52. A major hypothesis concerning the mechanisms behind the transition from adaptive to maladaptive behavior proposed that the initial cortical control performed by the prefrontal cortex can gradually evolve in a more striatal - ventral to dorsal - control of behavior16,48. However, more evidence is needed about the presence of distinct neural markers of outcome-specific and general PIT in humans. Furthermore, an additional question is whether the two separate systems cooperate to reach optimal behavior or are in competition for action control. Importantly, if confirmed by converging evidence, such differentiation could have major implications for understanding the determinants of adaptive and maladaptive behaviors.

Materials and Methods

Participants

One-hundred volunteers (50 female; mean age = 22.9 years, sd = 2.45; mean education = 15.1 years, sd = 1.92) with no history of neurological diseases were recruited from the student population at the University of Bologna. The number of participants was determined based on a power analysis with the following parameters: effect size = 0.35; significance level = 0.05, power = 0.95. The effect-size was estimated based on previous independent literature reporting a relationship between working memory and reward processing5356. All participants gave written informed consent to take part in the experiment. The study was conducted in accordance with institutional guidelines and the 1964 Declaration of Helsinki. It was approved by the Ethics Committee for Psychological Research of the University of Bologna.

Stimuli

Five black squares (4 cm2) were displayed on a 17-inch color monitor with a black background. The squares were highlighted by a white frame (2 mm thickness) and positioned as follows: upper center, bottom center, right center, left center and center of the screen. Three fractal images (balanced for luminance, complexity and color saturation) were used as Pavlovian conditioned stimuli (CS) and presented within the top center square. Two images of food items, surrounded by a light-yellow ring (equally sized), were used as separate rewards and a white ‘x’ was used as a neutral outcome (no-reward). The same visual environment was used in all phases of the PIT task.

Rewards were individually tailored for each participant. Before the experimental session, every participant was required to rate the subjective liking and wanting of 12 savory foods and 12 sweet foods using two separate 10-items Likert scales. Ratings ranged from 0 (not at all) to 10 (very much). Foods were presented using the same images subsequently implemented in the PIT task. Based on the subjective preference expressed, one savory food and one sweet food with similar liking and wanting ratings were selected and used as rewards in the subsequent learning task.

Procedure

The whole experiment consisted of three reinforcement learning tasks – used to assess PIT - and one working memory task – used to assess working memory. On arrival, participants were comfortably seated in a silent room and their position was centered relative to the screen, at a viewing distance of 50 cm from the screen.

Pavlovian-to-Instrumental Transfer (PIT) Task

PIT was assessed following a standard paradigm consisting of three phases, which mirrored the structure used in previous studies11: (1) Instrumental Conditioning task, in which participants learned a response-contingent reward; (2) Pavlovian Conditioning task, in which participants learned a cue-contingent reward; (3) PIT task, during which the influence of task-irrelevant Pavlovian cues on instrumental responding was tested. Participants were aware that, at the end of the experiment, they would be recompensated with an amount of food proportional to the number of food pictures (to be considered as points) visualized during all tasks. The detailed composition of trials for all task phases is reported in Table 1. In each task, participants were required to pay attention to the screen and follow the instructions reported at the beginning of the task. A few example trials were always performed. A computer running OpenSesame software57 controlled stimulus presentation.

Instrumental conditioning phase

Participants were required to learn the association of three possible responses (three white squares horizontally centered on the screen) with their respective reward. Two responses were paired with two different rewards (R+1 and R+2), while the other was paired with a neutral outcome (R−). The task followed a partial reinforcement schedule, such that when a specific response was rewarded, the same response would not be rewarded for the following 2 to 4 seconds. In each trial, only two out of the three squares were presented in white and indicated as possible responses to be selected with a mouse click. After each response, a corresponding neutral or reward outcome appeared for 1 second in the bottom square (Fig. 1a). A fixation cross with a jittered duration ranging between 1 and 1.5 seconds was used as intertrial interval. Response options were not available during this period. The rationale of this task was to learn the association between a specific instrumental response and its corresponding outcome. The response-outcome association was counterbalanced across subjects. The task consisted of a total of 27 trials (9 trials for each pair of responses), each lasting 10 seconds, during which participants were free to perform as many responses (i.e., mouse clicks) as they wished. The task lasted about 6 minutes.

Pavlovian conditioning phase

In each trial, one of three possible visual cues (fractal images) appeared for 5 seconds within the upper square, followed by a white patch within the bottom square. Participants were instructed to press the left-Ctrl button on the computer keyboard, as quickly as possible, to remove the patch and discover the outcome hidden behind it. The outcome was presented for 1 second. A fixation cross with a jittered duration ranging between 1 and 1.5 seconds was used as intertrial interval. Stimuli were not presented during this period. Two of the visual cues were associated with a reward (food picture) on 80% of trials (CS+1 and CS+2), while the other cue was associated with no reward (neutral cue) on all trials (CS−) (Fig. 1b). The stimulus-outcome association was counterbalanced across subjects. The speeded key press aimed to remove the patch and release the hidden reward has been used in previous studies11,52,58,59 as a measure of conditioning that mirrors the reward collection upon presentation of the CS+, observed in animal studies9,30,60. To avoid a possible instrumental component, participants were told that, in this phase, the reward was not dependent upon their response and that, after 3 seconds, the patch hiding it would disappear even without any key press. This was explicitly demonstrated during the example trials. Crucially, responses in the instrumental and Pavlovian tasks were orthogonalized such that mouse-clicks in two spatially different locations served as instrumental response, while a unique key-press on the left ctrl button was used here to remove the patch. The rationale was to observe faster reaction times when a reward was predicted (CS+ condition), as compared to when a neutral outcome was predicted (CS− condition). The task consisted of 75 trials (25 per each visual cue), for a total duration of about 7 minutes.

Transfer phase

Participants were given the same instructions used for instrumental conditioning, i.e. they were required to perform a choice between the two squares available in each trial. The task was identical to the instrumental conditioning phase, except that (a) while performing choices, task-irrelevant Pavlovian CSs were presented randomly within the upper square and (b) the whole task was performed under extinction, so no reward was ever delivered (Fig. 1c). Extinction is a standard procedure for assessing PIT effect, both in human and animal research, since it allows to test the influence of Pavlovian cues on instrumental responding without the confounding effects of the reward52,6163. The global rationale of this task was to test the ability of a Pavlovian cue to drive an instrumental response previously associated with a reward. Two different trials types were designed to disentangle outcome-specific and general transfer (Table 1, PIT). In outcome-specific trials, participants were presented with a task-irrelevant CS+ (CS+1 or CS+2) while required to choose between two available response options (R+1 and R+2) previously paired with two different reward outcomes, of which only one was also previously paired with the concurrently presented CS. So, for example, if the CS+1 was presented, choosing R+1 would constitute a congruent response, while choosing R+2 would constitute an incongruent response. Similarly, if the CS+2 was presented, choosing R+1 would constitute an incongruent response, while choosing R+2 would constitute a congruent response. Evidence for outcome-specific PIT would be seen if the presence of the CS induced a higher rate of congruent responses, as compared to incongruent responses. In general trials, participants were presented with a task-irrelevant CS (either reward-associated or neutral) and required to choose between two available response options none of which was compatible with the CS currently available (i.e., the responses were always associated with a different or no outcome). So, if the CS+1 was presented the available response options were R+2 and R−; if the CS+2 was presented the available response options were R+1 and R−; if the CS− was presented the available response options could be either R+1 and R− or R+2 and R−. Evidence for general PIT would be seen if participants favoured the R+ during the presentation of a CS+, as compared to the CS−. The task consisted of 36 trials of 10 seconds (6 trials for each association of cue and response pair), during which subjects were free to perform as many responses (i.e., mouse clicks) as they wished. The task lasted for about 7 minutes.

CS liking: As a further measure of learning, the visual cues used as CSs were presented before and after Pavlovian conditioning. Participants were required to rate their subjective liking on a 9-items Likert scale. The rationale was to observe an increase in liking scores for CS+1 and CS+2, after Pavlovian conditioning, but not for the CS−.

Explicit learning: At the end of the two conditioning tasks, participants were presented with all instrumental options and visual cues and were required to associate them with their corresponding outcome.

Working memory

Working memory was evaluated with the automated operational span task (AOSPAN), mirroring the same procedure extensively described elsewhere64. AOSPAN requires participants to solve a series of mathematical operations while trying to remember a set of unrelated letters. Participants were presented with one mathematical operation to solve and one letter to remember at a time. The practice session for this task was divided into three sections. The first practice section was letter span: a letter appeared on the screen, and participants were required to recall the letters in the same order in which they were presented. The recall was performed by clicking the box next to the appropriate letter in the correct order. This phase was untimed. Following this, the computer provided feedback about the number of letters correctly recalled in the current set. In the second practice session, participants practiced the mathematical operations of the task and were instructed to solve the operation as quickly as possible. After each operation, participants were given feedback on their accuracy. This practice session served to familiarise them with the mathematical session of the task as well as to calculate how long it would take each person to solve the mathematical operations, in order to account for individual differences. Afterward, the program estimated the subjective mean time required to solve the equations. This time (plus 2.5 standard deviations) was then used as a time limit for the mathematical operations of the experimental session for that individual. In the final practice session, participants performed both the letter recall and mathematical operations together, just as they would do in the real task. Finally, the program progressed to the real trials, which consisted of three sets of 3 to 4 alternations of operation and letter, for a total of 75 letters and 75 mathematical problems. The order of set size was random so that participants could not predict the number of items. In all conditions, letters remained onscreen for 800 ms. The task took approximately 20 min to complete. The score obtained was calculated as the percentage of trials correctly performed in the real task phase only.

Statistical analysis

All statistical analyses are performed using R software v3.3.2 (R Core Team, 2016) and RStudio v1.0.136 (RStudio Team, 2016). The following packages were implemented: lme4, afex, stats, lsmeans, relaimpo, plotmeans, multicon, BayesFactor. In linear mixed-effects models, subjects were always modelled as a random effect. All post-hoc analysis are Bonferroni-corrected. Reported p-values are always intended as two-tailed. The Bayes Factor is reported for all analysis as the probability associated with the alternative hypothesis over the null hypothesis (BF10). The default number of Monte Carlo samples was always 10000. The estimated proportional error (err%) associated with the Bayes Factor is reported only of higher than 0. In the text, values are reported as mean (m) and standard deviation (sd). In all figures, data are reported as cat’s-eye pictures with a 95% confidence interval65. The curves represent the sampling distribution and are centered on the sample mean. The grey indicates 95% of the area between the curves. The horizontal width of each cat’s-eye picture represents the relative likelihood of the interval. Assumptions for a correct use of parametric statistics are always assessed. For the linear model fit and relative importance calculation, the absence of correlation between all regressors was checked. Relative importance was assessed with lmg method to quantify the contribution of each individual regressor to the model66. Relative importance estimates the proportionate contribution of each regressor to the full model’s explained variance (expressed as R2), considering both its direct and its indirect (i.e., combined with the other regressors) influence in the regression equation66. Relative importance metrics were normalized to sum 100 and all indices introduced in the model were rescaled from 0 to 100.

Learning and transfer indexes

To further conduct correlation and regression analyses, performance indexes were calculated for each participant in each task.

Instrumental learning index was calculated by applying the subsequent formula to the number of responses performed: [(reward-associated − unrewarded)/total]. A positive score corresponded to a preference for reward-associated (R+1 or R+2) over unrewarded (R−) responses, and vice-versa.

Pavlovian learning was characterized by two measures, reaction times and CS liking. To create a Pavlovian learning index, we selected the measure that reported a stronger effect-size, namely CS liking (part. η2 = 0.34). The index was created by applying the following formula to the ratings: [(CS+post − CS+pre) − (CS−post − CS−pre)]. A positive score corresponded to a stronger change in liking in the CS+ condition, as compared to the CS− condition, and vice-versa.

Outcome-specific transfer index was calculated by applying the subsequent formula to the number of responses performed: [(congruent − incongruent)/total]. A positive score corresponded to a preference for congruent over incongruent responses, and vice-versa.

General transfer index was calculated by applying the subsequent formula to the preference index described above: CS+ − CS−. A positive score corresponded to a preference for reward-associated responses (R+1 or R+2) when presented with a reward-associated cue (CS+1 or CS+2), as compared to the unrewarded cue (CS−), and vice-versa. Please note that the division for the total number of responses was not applied in this case because this index was obtained from the preference index, which was already corrected for inter-individual variability, by dividing for the total number of performed responses. A further correction would then result in a misleading score.

Supplementary information

Supplementary analysis (1.3MB, docx)

Acknowledgements

The authors thank Cecilia Tonon and Marta Palano for their contribution to data collection.

Author Contributions

All authors developed the hypothesis and contributed to the study design. Testing and data collection were carried on by S.B. S.G. performed the data analysis and interpretation under the supervision of G.d.P. S.G. wrote the manuscript, and G.d.P. and S.B. provided critical revisions. All authors approved the final version of the manuscript.

Competing Interests

The authors declare no competing interests.

Footnotes

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information accompanies this paper at 10.1038/s41598-019-43860-w.

References

  • 1.Rescorla RA, Solomon RL. Two-process learning theory: Relationships between Pavlovian conditioning and instrumental learning. Psychol. Rev. 1967;74:151–182. doi: 10.1037/h0024475. [DOI] [PubMed] [Google Scholar]
  • 2.Cartoni E, Balleine B, Baldassarre G. Appetitive Pavlovian-instrumental Transfer: A review. Neurosci. Biobehav. Rev. 2016;71:829–848. doi: 10.1016/j.neubiorev.2016.09.020. [DOI] [PubMed] [Google Scholar]
  • 3.Holmes NM, Marchand AR, Coutureau E. Pavlovian to instrumental transfer: a neurobehavioural perspective. Neurosci. Biobehav. Rev. 2010;34:1277–95. doi: 10.1016/j.neubiorev.2010.03.007. [DOI] [PubMed] [Google Scholar]
  • 4.Battaglia S, Garofalo S, di Pellegrino G. Context-dependent extinction of threat memories: influences of healthy aging. Sci. Rep. 2018;8:12592. doi: 10.1038/s41598-018-31000-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Balleine BW, Ostlund SB. Still at the choice-point: Action selection and initiation in instrumental conditioning. Ann. N. Y. Acad. Sci. 2007;1104:147–171. doi: 10.1196/annals.1390.006. [DOI] [PubMed] [Google Scholar]
  • 6.Balleine BW, O’Doherty JP. Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action. Neuropsychopharmacology. 2010;35:48–69. doi: 10.1038/npp.2009.131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Dickinson, A. & Balleine, B. The Role of Learning in the Operation of Motivational Systems. In Stevens’ Handbook of Experimental Psychology 497–533 (John Wiley & Sons, Inc.), 10.1002/0471214426.pas0312 (2002).
  • 8.Alarcón DE, Bonardi C, Delamater AR. Associative Mechanisms Involved in Specific Pavlovian-to-instrumental transfer (PIT) in Human Learning Tasks. Q. J. Exp. Psychol. 2017;71:1607–1625. doi: 10.1080/17470218.2017.1342671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Corbit LH, Balleine BW. Double dissociation of basolateral and central amygdala lesions on the general and outcome-specific forms of pavlovian-instrumental transfer. J. Neurosci. 2005;25:962–970. doi: 10.1523/JNEUROSCI.4507-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cohen-Hatton SR, Haddon JE, George DN, Honey RC. Pavlovian-to-instrumental transfer: paradoxical effects of the Pavlovian relationship explained. J. Exp. Psychol. Anim. Behav. Process. 2013;39:14–23. doi: 10.1037/a0030594. [DOI] [PubMed] [Google Scholar]
  • 11.Garofalo S, di Pellegrino G. Individual differences in the influence of task-irrelevant Pavlovian cues on human behavior. Front. Behav. Neurosci. 2015;9:163. doi: 10.3389/fnbeh.2015.00163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hogarth L, Dickinson A, Duka T. The associative basis of cue-elicited drug taking in humans. Psychopharmacology (Berl). 2010;208:337–51. doi: 10.1007/s00213-009-1735-9. [DOI] [PubMed] [Google Scholar]
  • 13.Watson P, de Wit S, Cousijn J, Hommel B, Wiers RWW. Motivational Mechanisms Underlying the Approach Bias to Cigarettes. J. Exp. Psychopathol. 2013;4:250–262. doi: 10.5127/jep.030512. [DOI] [Google Scholar]
  • 14.Watson P, Wiers RW, Hommel B, De Wit S. Working for food you don’t desire. Cues interfere with goal-directed food-seeking. Appetite. 2014;79:139–148. doi: 10.1016/j.appet.2014.04.005. [DOI] [PubMed] [Google Scholar]
  • 15.Hogarth L, Balleine BW, Corbit LH, Killcross S. Associative learning mechanisms underpinning the transition from recreational drug use to addiction. Ann. N. Y. Acad. Sci. 2013;1282:12–24. doi: 10.1111/j.1749-6632.2012.06768.x. [DOI] [PubMed] [Google Scholar]
  • 16.Everitt BJ, Robbins TW. Neural systems of reinforcement for drug addiction: from actions to habits to compulsion. Nat. Neurosci. 2005;8:1481–9. doi: 10.1038/nn1579. [DOI] [PubMed] [Google Scholar]
  • 17.Boureau Y-L, Dayan P. Opponency revisited: competition and cooperation between dopamine and serotonin. Neuropsychopharmacology. 2011;36:74–97. doi: 10.1038/npp.2010.151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Morris RW, Quail S, Griffiths KR, Green MJ, Balleine BW. Corticostriatal control of goal-directed action is impaired in schizophrenia. Biol. Psychiatry. 2015;77:187–195. doi: 10.1016/j.biopsych.2014.06.005. [DOI] [PubMed] [Google Scholar]
  • 19.Smeets PM, Barnes-Holmes D. Children’s emergent preferences for soft drinks: Stimulus-equivalence and transfer. J. Econ. Psychol. 2003;24:603–618. doi: 10.1016/S0167-4870(03)00004-7. [DOI] [Google Scholar]
  • 20.Garofalo S, di Pellegrino G. Commentary: Monetary, Food, and Social Rewards Induce Similar Pavlovian-to-Instrumental Transfer Effects. Front. Behav. Neurosci. 2017;11:247. doi: 10.3389/fnbeh.2017.00136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Dolan RJ, Dayan P. Goals and habits in the brain. Neuron. 2013;80:312–25. doi: 10.1016/j.neuron.2013.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Seabrooke T, Hogarth L, Mitchell CJ. The propositional basis of cue-controlled reward seeking. Q. J. Exp. Psychol. 2016;69:2452–2470. doi: 10.1080/17470218.2015.1115885. [DOI] [PubMed] [Google Scholar]
  • 23.Cui JF, et al. Effects of working memory load on uncertain decision-making: Evidence from the Iowa Gambling Task. Front. Psychol. 2015;6:1–9. doi: 10.3389/fpsyg.2015.00162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Murray JD, Jaramillo J, Wang X-J. Working Memory and Decision-Making in a Frontoparietal Circuit Model. J. Neurosci. 2017;37:12167–12186. doi: 10.1523/JNEUROSCI.0343-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Goldman-Rakic, P. S. Circuitry of Primate Prefrontal Cortex and Regulation of Behavior by Representational Memory. In Comprehensive Physiology 373–417 (John Wiley & Sons, Inc.), 10.1002/cphy.cp010509 (2011).
  • 26.Engle RW, Laughlin JE, Tuholski SW, Conway ARA. Working memory, short-term memory, and general fluid intelligence: A latent-variable approach. J. Exp. Psychol. Gen. 1999;128:309–331. doi: 10.1037/0096-3445.128.3.309. [DOI] [PubMed] [Google Scholar]
  • 27.Schoenbaum G, Roesch M. Orbitofrontal Cortex, Associative Learning, and Expectancies. Neuron. 2005;47:633–636. doi: 10.1016/j.neuron.2005.07.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Barrett LF, Tugade MM, Engle RW. Individual differences in working memory capacity and dual-process theories of the mind. Psychol. Bull. 2004;130:553–573. doi: 10.1037/0033-2909.130.4.553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Stout DM, et al. Individual variation in working memory is associated with fear extinction performance. Behav. Res. Ther. 2018;102:52–59. doi: 10.1016/j.brat.2018.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Holland PC. Relations between Pavlovian-instrumental transfer and reinforcer devaluation. J. Exp. Psychol. Anim. Behav. Process. 2004;30:104–17. doi: 10.1037/0097-7403.30.2.104. [DOI] [PubMed] [Google Scholar]
  • 31.Unsworth N, Heitz RP, Schrock JC, Engle RW. An automated version of the operation span task. Behav. Res. Methods. 2005;37:498–505. doi: 10.3758/BF03192720. [DOI] [PubMed] [Google Scholar]
  • 32.Carter RM, Hofstotter C, Tsuchiya N, Koch C. Working memory and fear conditioning. Proc. Natl. Acad. Sci. 2003;100:1399–1404. doi: 10.1073/pnas.0334049100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Garofalo S, Robbins TW. Triggering Avoidance: Dissociable Influences of Aversive Pavlovian Conditioned Stimuli on Human Instrumental Behavior. Front. Behav. Neurosci. 2017;11:1–11. doi: 10.3389/fnbeh.2017.00063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Hardy L, Mitchell C, Seabrooke T, Hogarth L. Drug cue reactivity involves hierarchical instrumental learning: evidence from a biconditional Pavlovian to instrumental transfer task. Psychopharmacology (Berl). 2017;234:1977–1984. doi: 10.1007/s00213-017-4605-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Otto aR, Raio CM, Chiang A, Phelps Ea, Daw ND. Working-memory capacity protects model-based learning from stress. Proc. Natl. Acad. Sci. 2013;110:20941–20946. doi: 10.1073/pnas.1312011110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Le Doux, J. Emotional colouration of consciousness: how feelings come about. In Frontiers of consciousness: Chichele lectures (eds Weiskrantz, L. & Davies, M.) 69–130 (Oxford University Press, 2008).
  • 37.Rolison JJ, Evans JSBT, Walsh CR, Dennis I. The role of working memory capacity in multiple-cue probability learning. Q. J. Exp. Psychol. 2011;64:1494–1514. doi: 10.1080/17470218.2011.559586. [DOI] [PubMed] [Google Scholar]
  • 38.Hassin RR, Bargh JA, Engell AD, McCulloch KC. Implicit working memory. Conscious. Cogn. 2009;18:665–678. doi: 10.1016/j.concog.2009.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Rescorla RA. Associative relations in instrumental learning: The eighteenth Barlett memorial lecture. Q. J. Exp. Psychol. 1991;43B:1–23. [Google Scholar]
  • 40.Sebold M, et al. Don’t Think, Just Feel the Music: Individuals with Strong Pavlovian-to-Instrumental Transfer Effects Rely Less on Model-based Reinforcement Learning. J. Cogn. Neurosci. 2016;25:1–11. doi: 10.1162/jocn_a_00945. [DOI] [PubMed] [Google Scholar]
  • 41.Cosand LD, et al. Arousal, working memory, and conscious awareness in contingency learning. Conscious. Cogn. 2008;17:1105–1113. doi: 10.1016/j.concog.2008.04.007. [DOI] [PubMed] [Google Scholar]
  • 42.de Wit S, Dickinson A. Associative theories of goal-directed behaviour: a case for animal–human translational models. Psychol. Res. Psychol. Forsch. 2009;73:463–476. doi: 10.1007/s00426-009-0230-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Dickinson, A. Bolles’s psychological syllogism. In Learning, motivation, and cognition: The functional behaviorism of Robert C. Bolles (eds Bouton, M. E. & Fanselow, M. S.) 345–367 (American Psychological Association, 1997).
  • 44.Hofmann W, Gschwendner T, Friese M, Wiers RW, Schmitt M. Working Memory Capacity and Self-Regulatory Behavior: Toward an Individual Differences Perspective on Behavior Determination by Automatic Versus Controlled Processes. J. Pers. Soc. Psychol. 2008;95:962–977. doi: 10.1037/a0012705. [DOI] [PubMed] [Google Scholar]
  • 45.Hogarth L, Chase HW. Parallel goal-directed and habitual control of human drug-seeking: implications for dependence vulnerability. J. Exp. Psychol. Anim. Behav. Process. 2011;37:261–276. doi: 10.1037/a0022913. [DOI] [PubMed] [Google Scholar]
  • 46.O’Doherty JP. The problem with value. Neurosci. Biobehav. Rev. 2014;43:259–268. doi: 10.1016/j.neubiorev.2014.03.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wimmer GE, Shohamy D. Preference by association: How memory mechanisms in the hippocampus bias decisions. Science (80-.). 2012;338:270–273. doi: 10.1126/science.1223252. [DOI] [PubMed] [Google Scholar]
  • 48.Everitt BJ, Robbins TW. Drug Addiction: Updating Actions to Habits to Compulsions Ten Years On. Annu. Rev. Psychol. 2016;67:150807174122003. doi: 10.1146/annurev-psych-122414-033457. [DOI] [PubMed] [Google Scholar]
  • 49.Garbusow M, et al. Pavlovian-to-instrumental transfer in alcohol dependence: A pilot study. Neuropsychobiology. 2014;70:111–121. doi: 10.1159/000363507. [DOI] [PubMed] [Google Scholar]
  • 50.Hogarth L, Dickinson A, Wright A, Kouvaraki M, Duka T. The role of drug expectancy in the control of human drug seeking. J. Exp. Psychol. Anim. Behav. Process. 2007;33:484–496. doi: 10.1037/0097-7403.33.4.484. [DOI] [PubMed] [Google Scholar]
  • 51.Corbit LH, Janak PH, Balleine BW. General and outcome-specific forms of Pavlovian-instrumental transfer: the effect of shifts in motivational state and inactivation of the ventral tegmental area. Eur. J. Neurosci. 2007;26:3141–3149. doi: 10.1111/j.1460-9568.2007.05934.x. [DOI] [PubMed] [Google Scholar]
  • 52.Talmi D, Seymour B, Dayan P, Dolan RJ. Human Pavlovian Instrumental Transfer. J. Neurosci. 2008;28:360–368. doi: 10.1523/JNEUROSCI.4028-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hager OM, et al. Reward-dependent modulation of working memory is associated with negative symptoms in schizophrenia. Schizophr. Res. 2015;168:238–244. doi: 10.1016/j.schres.2015.08.024. [DOI] [PubMed] [Google Scholar]
  • 54.Choi JM, Padmala S, Pessoa L. Counteracting effect of threat on reward enhancements during working memory. Cogn. Emot. 2015;29:1517–1526. doi: 10.1080/02699931.2014.993596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Gong M, Li S. Learned reward association improves visual working memory. J. Exp. Psychol. Hum. Percept. Perform. 2014;40:841–856. doi: 10.1037/a0035131. [DOI] [PubMed] [Google Scholar]
  • 56.Yantis S, Anderson BA, Wampler EK, Laurent PA. Reward and attentional control in visual search. Nebr. Symp. Motiv. 2012;59:91–116. doi: 10.1007/978-1-4614-4794-8_5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Mathôt S, Schreij D, Theeuwes J. OpenSesame: An open-source, graphical experiment builder for the social sciences. Behav. Res. Methods. 2012;44:314–324. doi: 10.3758/s13428-011-0168-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Gottfried JA, O’Doherty J, Dolan RJ. Appetitive and Aversive Olfactory Learning in Humans Studied Using Event-Related Functional Magnetic Resonance Imaging. J. Neurosci. 2002;22:10829–10837. doi: 10.1523/JNEUROSCI.22-24-10829.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Gottfried JA, O’Doherty J, Dolan RJ. Encoding Predictive Reward Value in Human Amygdala and Orbitofrontal Cortex. Science (80-.). 2003;301:1104–1107. doi: 10.1126/science.1087919. [DOI] [PubMed] [Google Scholar]
  • 60.Dickinson AD, Smith J, Mirenowicz J. Dissociation of Pavlovian and instrumental incentive learning under dopamine antagonists. Behav. Neurosci. 2000;114:468–483. doi: 10.1037/0735-7044.114.3.468. [DOI] [PubMed] [Google Scholar]
  • 61.Rescorla RA. Control of instrumental performance by Pavlovian and instrumental stimuli. J. Exp. Psychol. Anim. Behav. Process. 1994;20:44–50. doi: 10.1037/0097-7403.20.1.44. [DOI] [PubMed] [Google Scholar]
  • 62.Corbit LH, Muir JL, Balleine BW. The role of the nucleus accumbens in instrumental conditioning: Evidence of a functional dissociation between accumbens core and shell. J. Neurosci. 2001;21:3251–3260. doi: 10.1523/JNEUROSCI.21-09-03251.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Bray S, Rangel A, Shimojo S, Balleine B, O’Doherty JP. The neural mechanisms underlying the influence of pavlovian cues on human decision making. J. Neurosci. 2008;28:5861–6. doi: 10.1523/JNEUROSCI.0897-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Unsworth, N., Heitz, R. P. & Engle, R. W. An automated version of the operation span task_Unsworth et al._2005. 37, 498–505 (2005). [DOI] [PubMed]
  • 65.Cumming G. The New Statistics. Psychol. Sci. 2014;25:7–29. doi: 10.1177/0956797613504966. [DOI] [PubMed] [Google Scholar]
  • 66.Gromping, U. Relative Importance for Linear Regression in R: The Package relaimpo. J. Stat. Softw. 17 (2006).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary analysis (1.3MB, docx)

Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES