Skip to main content
Frontiers in Human Neuroscience logoLink to Frontiers in Human Neuroscience
. 2013 Jan 16;6:351. doi: 10.3389/fnhum.2012.00351

Advanced Parkinson's disease effect on goal-directed and habitual processes involved in visuomotor associative learning

Fadila Hadj-Bouziane 1,2,3,*, Isabelle Benatru 4, Andrea Brovelli 5, Hélène Klinger 3,6, Stéphane Thobois 3,6,7, Emmanuel Broussolle 3,6,7, Driss Boussaoud 8, Martine Meunier 1,2,3
PMCID: PMC3560419  PMID: 23386815

Abstract

The present behavioral study re-addresses the question of habit learning in Parkinson's disease (PD). Patients were early onset, non-demented, dopa-responsive, candidates for surgical treatment, similar to those we found earlier as suffering greater dopamine depletion in the putamen than in the caudate nucleus. The task was the same conditional associative learning task as that used previously in monkeys and healthy humans to unveil the striatum involvement in habit learning. Sixteen patients and 20 age- and education-matched healthy control subjects learned sets of 3 visuo-motor associations between complex patterns and joystick displacements during two testing sessions separated by a few hours. We distinguished errors preceding vs. following the first correct response to compare patients' performance during the earliest phase of learning dominated by goal-directed actions with that observed later on, when responses start to become habitual. The disease significantly retarded both learning phases, especially in patients under 60 years of age. However, only the late phase deficit was disease severity-dependent and persisted on the second testing session. These findings provide the first corroboration in Parkinson patients of two ideas well-established in the animal literature. The first is the idea that associating visual stimuli to motor acts is a form of habit learning that engages the striatum. It is confirmed here by the global impairment in visuo-motor learning induced by PD. The second idea is that goal-directed behaviors are predominantly caudate-dependent whereas habitual responses are primarily putamen-dependent. At the advanced PD stages tested here, dopamine depletion is greater in the putamen than in the caudate nucleus. Accordingly, the late phase of learning corresponding to the emergence of habitual responses was more vulnerable to the disease than the early phase dominated by goal-directed actions.

Keywords: feedback-based learning, striatum, habits, goal-directed actions, Parkinson's disease

Introduction

The idea that habits depend on the striatum has been around for a long time. It emerged in the 1960s when it became obvious that some forms of learning were spared by the dense amnesia induced by medial temporal removal in patient H.M. (Seger and Spiering, 2011). Mishkin and colleagues were the first to formalize it (Mishkin and Petri, 1984; Mishkin et al., 1984). They defined habits as stimulus–response connections formed gradually through trial and error on the basis of reinforcement. They hypothesized that forming new habits requires the striatum as memorizing facts and events requires the medial temporal lobe (Turchi et al., 2010). This implied that habits should eventually be wiped out by Parkinson's disease (PD) as radically as facts and events are lost in amnesia. It took 12 years and a very difficult probabilistic stimulus-response task to start getting evidence supporting this claim (Knowlton et al., 1996; Frank et al., 2004; Shohamy et al., 2004a). In addition, with mild to moderate PD, patients were impaired but managed to improve over days (Frank et al., 2004; Shohamy et al., 2004b).

Then, the learning theory literature proposed to distinguish two separate habit processes: the outcome-driven goal-directed actions mediating acquisition of novel stimulus–response associations and the stimulus-driven habitual responses mediating the subsequent stabilization of newly formed bonds (Dickinson, 1985; Rescorla, 1991; Dickinson and Balleine, 1993, 1994; Dickinson, 1994; Staddon and Cerutti, 2003; Balleine and O'Doherty, 2010; Daw et al., 2011; Doll et al., 2012). These two processes are viewed as anatomically segregated within the rat striatum. Goal directed actions would depend preferentially on the rat's posterior dorsomedial striatum and habitual responses on the dorsolateral striatum (Reading et al., 1991; Packard and McGaugh, 1996; Ragozzino et al., 2002; Yin et al., 2005a,b, 2006; Stalnaker et al., 2010; Thorn et al., 2010; but see Stalnaker et al., 2010). A similar segregation may hold true for analogous regions in the primate brain, the caudate nucleus/rostral putamen, and caudal putamen regions, respectively (Miyachi et al., 1997, 2002; O'Doherty et al., 2004; Tricomi et al., 2009; Wunderlich et al., 2012a). In PD, degeneration of the dopaminergic pathways is thought to affect caudal putamen earlier and more severely than the rostral putamen/caudate complex (Redgrave et al., 2010). The new model on the striatum role in habits therefore predicts a disproportional effect of PD on habitual responses. History repeats itself, however, evidence supporting this new claim is not coming easily. Using a conflict task based on the new model, De Wit et al. (2011) tested patients with mild PD and unexpectedly found only a disease severity-dependent deficit in goal-directed actions.

Here we built on our extensive experience of conditional visuo-motor learning to readdress the question of the habit deficit associated with PD. Conditional visuo-motor learning consists in mapping, through trial-and-error, a set of stimuli onto a set of actions according to purely arbitrary rules (Petrides, 1985a,b). It has extensively been used to demonstrate the role of the dorsal striatum in habit learning in both humans and animals using different techniques (electrophysiology in rats: Carelli and Deadwyler, 1997; Jog et al., 1999; Barnes et al., 2005; Tang et al., 2007; Kim et al., 2009; Kimchi et al., 2009; electrophysiology in monkeys: Tremblay et al., 1998; Hadj-Bouziane and Boussaoud, 2003; Brasted and Wise, 2004; Pasupathy and Miller, 2005; Buch et al., 2006; Williams and Eskandar, 2006; Histed et al., 2009; neuropsychology in humans and monkeys: Petrides, 1985b, 1997; Wise et al., 1996; Passingham et al., 2000; Bussey et al., 2001; Nixon et al., 2004; neuroimaging in humans: Deiber et al., 1997; Toni and Passingham, 1999; Toni et al., 2001a,b; Simon et al., 2002; Haruno et al., 2004; O'Doherty et al., 2004; Tanaka et al., 2004; Tricomi et al., 2004; Boettiger and D'Esposito, 2005; Delgado et al., 2005; Law et al., 2005; Grol et al., 2006; Haruno and Kawato, 2006; Pessiglione et al., 2006; Brovelli et al., 2008; Tricomi et al., 2009).

Using single-cell recordings, we identified two distinct neuronal populations within the monkey striatum. One presents a transient increase in firing rate very early on during learning of novel associations, presumably reflecting the substrate for goal-directed actions. The other presents, on the opposite, a protracted increase in firing rate paralleling performance improvement and stabilizing late in learning, presumably reflecting the substrate for habitual responses (Hadj-Bouziane and Boussaoud, 2003; Hadj-Bouziane et al., 2006). Later, the first type of neurons was found to predominate in the monkey caudate nucleus, whereas the second type was more frequent in the putamen (Williams and Eskandar, 2006). In addition, we recently scanned healthy humans with fMRI on the same conditional visuo-motor task and found positive evidence of a differential involvement of the caudate nucleus and putamen in, respectively, the early vs. late learning of conditional visuo-motor associations (Brovelli et al., 2011; Amiez et al., 2012).

The present behavioral study in PD patients complements our previous approaches in monkeys and healthy humans. Its aim was to distinguish for the first time the effects of PD on performance during the early vs. late learning phases of visuo-motor associations. Conditional learning has been tested in PD patients previously but yielded inconsistent results. Learning of visuo-visual or visuo-verbal associations was often found disrupted, but not always (see respectively, Gotham et al., 1988; Taylor et al., 1990; Vriezen and Moscovitch, 1990; Sprengelmeyer et al., 1995; Pillon et al., 1998; and Marie et al., 1999 vs. Canavan et al., 1989; Postle et al., 1997; Bedard and Sanes, 2009). Learning of visuo-motor associations was studied only once, and no deficit could be detected in the sample of patients tested (Canavan et al., 1989). All these previous studies used a correction procedure that made it impossible to study early vs. late learning separately. As in Petrides (1985a,b), an incorrect response was followed by re-presentation of the same stimulus as many times as it took to get a correct answer. Here, we applied to Parkinson's patients the protocol devoid of correction procedure that we successfully used earlier in monkeys and healthy humans to reveal the striatum involvement in habit learning (monkeys: Hadj-Bouziane and Boussaoud, 2003; Hadj-Bouziane et al., 2003, 2006; humans: Simon et al., 2002; Brovelli et al., 2008, 2011; Amiez et al., 2012).

Sixteen patients and 20 age-matched controls had to learn, by trial and error, the associations between 3 visual stimuli and 3 joystick movements. Learning was pursued until a criterion of three consecutive correct responses was achieved for each association. Performance was then analyzed on a trial-by-trial basis for each association separately. The number of errors committed prior to the first correct response was used to measure the early learning phase dominated by goal-directed actions, and the number of errors committed after the first correct response up to the achievement of the learning criterion was used to measure late learning and the emergence of habitual responses. In healthy subjects, the dorsal striatum is activated, together with the dorsal fronto-parietal network and the ventrolateral prefrontal cortex, both on the incorrect and first correct trials and may reflect the processing of relevant visuomotor mappings during the early phases of learning (Brovelli et al., 2008) whereas subsequent repetitions of the correct response activate the putamen (Amiez et al., 2012).

First, given the large body of evidence that conditional visuo-motor learning engages the striatum in healthy animals and humans, we expected the learning performance of the patients to be markedly retarded. Second, based on the assumptions that goal-directed behaviors are predominantly caudate-dependent whereas habitual responses are primarily putamen-dependent, we predicted that habitual responses would be particularly impaired in our population of PD patients, for which we found earlier greater dopamine depletion in the putamen than in the caudate nucleus (Broussolle et al., 1999).

Methods

Participants

The study was approved by the local Human Research Ethics Committee and all subjects gave informed consent. Sixteen non-demented and non-depressed patients with advanced PD and 20 control subjects, without neurological or psychiatric history and matching the patients for age and level of education, participated to the study (Tables 1 and 2). All PD patients were under consideration for surgical treatment in the Department of Neurology C of the Pierre Wertheimer Neurological Hospital, Lyon, France. They all fulfilled the UK Parkinson's Disease Society, Brain Bank (UKPDSBB) diagnostic criteria (Gibb and Lees, 1988). They presented an akinetorigid syndrome, resting tremor, or both, as well as a good sensitivity to levodopa treatment, marked dyskinesias and on–off fluctuations. The Beck Depression Inventory and Mattis dementia scale were administered (on medication) to 13/16 patients (Table 2) and a full neuropsychological evaluation including the Starkstein apathy scale, the Grober and Buschke memory test and the Modified Wisconsin Card Sorting Test was available for 10/16 patients.

Table 1.

Demographic details of the control subjects matching patients for age and education level (means ± SD).

Case Sex Age (years) Education (years)
1 F 57 12
2 M 58 14
3 F 51 9
4 M 52 9
5 F 55 12
6 F 56 14
7 M 48 12
8 F 66 9
9 F 72 14
10 M 71 12
11 M 63 14
12 F 65 9
13 F 54 9
14 M 60 9
15 M 47 12
16 F 46 9
17 M 46 12
18 M 65 19
19 F 60 14
20 M 44 19
Mean 10M/ 56.8 12.2
SD 10F 8.4 3.1

Cases 1–11 were tested only once, whereas cases 12–20 were, like patients, tested twice.

Table 2.

Demographic and clinical details of the patients (means ± SD).

Case Sex Age (years) Education (years) Disease duration (years) Hoehn-Yahr on(/5) Hoehn-Yshr off(/5) Dose equ.Ldopa (mg/j) Beck Depression Invent.(/63) Mattis Dementia scale (/144)
1 M 60 9 10 2 3 1200 33 128
2 F 43 9 12 2.5 3 500 21 141
3 F 50 12 13 2 3 1125 10 138
4 M 52 9 13 2 4 1775 10 132
5 M 62 9 7 1.5 3 1290 6 124
6 M 49 12 9 1.5 3 1260 21 142
7 M 53 9 11 2 2.5 1175 11 134
8 M 67 10 10 2 3 1150 10 138
9 M 53 10 11 2 4 1500 14 133
10 M 54 19 10 1 3 1500 9 141
11 M 51 14 15 2 4 900 5 134
12 F 67 9 13 2 3 1175 15 130
13 F 61 9 12 3 5 1400
14 M 61 9 12 2 3 1200 2 135
15 F 40 14 10 2 3 600
16 M 61 19 9 1.5 2.5 1000
Mean 11M/ 55.2 11.3 11.1 1.9 3.25 1172 13 135
SD 5F 7.9 3.5 2 0.4 0.6 320 8 5

All patients underwent two testing sessions, one with and one without medication. Patients 1–8 were tested off medication first, patients 9–16 were tested on medication first. Daily doses of dopaminergic treatment were expressed as levodopa equivalent dose in mg/24 h using a common formula: 100 mg levodopa = 10 mg bromocriptine = 6 mg ropinirole = 1 mg pergolide = 1 mg lisuride = 1.2 mg pramipexole = 50 mg piribedil. Patients UPDRS III scores averaged 11/108 on medication (SD 4) and 40/108 off medication (SD 10).

Patients underwent two testing sessions (hereafter referred to as test and retest), lasting about 1 h each, and carried out 3–5 h apart on the same day. After an overnight withdrawal of medication performed for clinical purposes, patients were either tested off medication first, and then on medication (PD-1 subgroup, cases 1–8) or the reverse (PD-2 subgroup cases 9–16). The on medication state was defined as 1 h after an acute challenge of levodopa corresponding to 1.5 × the patient's current dose. The off and on medication state were evaluated based on both clinical examination and UPDRS and Hoehn & Yahr scales. The control group initially comprised 11 subjects submitted to a single session (cases 1–11). As results revealed a marked test-retest effect in patients, 9 more controls (cases 12–20) were added who, like patients, underwent two testing sessions separated by 3–5 h.

Procedure

Stimuli were computer-generated colored geometric patterns presented at the center of a computer screen (Figure 1A). There were four possible motor responses: the right, left, forward, or backward displacement of a joystick. During each trial (Figure 1B), a central cross was presented for 1 s, the stimulus for 3 s, and the feedback for 1 s; a green happy face signaled a correct response, a red sad face an incorrect one. Subjects had to make a response during the 3s-stimulus presentation otherwise the trial was aborted and a large red cross presented for 1 s.

Figure 1.

Figure 1

Experimental task. (A) One of the trios of complex geometric patterns used as stimuli. (B) Trial time-course. First, a central cross appeared at the center of the screen for 1 s. Then, one stimulus was presented for 3 s during which the subject had to guess which among the four possible joystick displacements was associated with this stimulus. A correct response was followed by a green happy face, an incorrect one by a red sad face. Both feedbacks were presented for 1 s. (C) Matrix summarizing the 12 possible associations to be tested when mapping 3 stimuli (rows) onto 4 joystick displacements (columns). Subjects can try associations randomly and memorize each stimulus—response—outcome link thus explored; such an “unorganized” way to solve the task was not infrequent in controls. Alternatively, given that each stimulus appeared once per block of three trials, one can select a movement, repeat it for every stimulus, and then reiterate the procedure for the next movement on the next block of 3 trials. This way the three correct associations are systematically found within the 12 trials represented in the matrix; PD patients particularly relied on this optimal learning strategy.

Subjects mapped 3 visual stimuli onto 3 joystick movements. Stimuli varied in shape, but were of the same overall size and color (Figure 1A). All 3 stimuli were presented once before they were presented again in a new random sequence. In other words, on the first trial, one of the 3 stimuli was randomly selected and presented; on the second trial, one of the remaining two stimuli was randomly selected and presented, regardless of the outcome of the previous trial, and then the remaining one of the 3 stimuli was presented. Testing continued like this until the subject reached the criterion of three consecutive correct responses for each stimulus. Even if two stimuli were learned, presentation of all three stimuli was pursued until the last stimulus was learned too. If patients failed to learn, however, testing was interrupted after 50 trials to minimize their discomfort. Six different trios of stimuli were learned in this way, 3 during the test, and 3 during the retest. The order of presentation of the six trios during test and retest was fixed and identical across subjects.

As each stimulus appeared once per block of three trials, the optimal way to search for the correct association was to select one movement and repeat it until it proves either correct or not associated with any stimulus (Figure 1C). Subjects were never informed about this optimal movement-oriented strategy, but each of them was asked at the end of the experiment to describe the way he/she used to solve the task. Of the 36 subjects tested, only one, a patient, reported having tried each movement in turn in a clockwise manner.

Data analysis

We calculated five variables: (1) global scores, i.e., the total number of trials and errors required to reach, or fail to reach criterion for each trio of associations, (2) response times, i.e., the time elapsed between stimulus onset and the subject's response, (3) search errors, i.e., errors made before the first correct response, (4) repetition errors, i.e., errors made after the first correct response and before the learning criterion or the limit of 50 trials, and (5) strategy scores i.e., number of repetition of the same movement after an incorrect response, expressed as a percentage of the total number of errors to criterion. Strategy scores under 30 reflects rare use of the optimal strategy consisting in repeating the same movement until it proves correct, scores above 60 to a quasi-systematic use.

We used 2 × 2 ANOVAs with group and session, Student's two-sample t-tests for separate variance and paired t-tests to assess between- and within-groups differences. The p-value for significance of these post-hoc t-tests was set at a Bonferroni-adjusted p < 0.025. Comparisons involving retest involve only the 9 controls that were tested twice like patients, whereas comparisons performed on test involved all 20 controls. In controls, Pearson's correlation analyses were used to determine whether age and strategy use predicted learning scores on test (these analyses were not performed on retest due to the small number of subjects involved). In patients, Pearson's correlation analyses were used to determine the influence of age, strategy use, disease stage, and neuropsychological variables on learning scores, and these analyses were performed on both test and retest.

Results

The 20 controls matched the 16 PD patients for age (t34 = 0.6, p = 0.58), and education level (t34 = 0.7, p = 0.48). Patients' response times were expectedly longer than controls reaching 1519 ± 344 ms on test and 1507 ± 108 ms on retest compared to 1286 ± 233 ms and 1118 ± 60 ms, respectively, in controls (group effect: F(1, 23) = 8.4, p = 0.008; session effect and interaction, Fs ≤ 0.4, ps > 0.51). They remained, however, well under 3 s, indicating that the learning deficit described below was not due do a motor inability to respond within the imparted time.

More surprising was the lack of effect of medication. As summarized in Table 3, in both the PD-1 (test off–retest on) and PD-2 (test on–retest off) subgroups, most conditional learning measures improved slightly on levodopa relative to off levodopa. Yet, none of these on/off differences reached significance whether the PD-1 and PD-2 subgroups were considered separately or together. The results below therefore focus on the session effect (test vs. retest), which, unlike medication, markedly affected patients' performance.

Table 3.

Age, education level, disease stage, and conditional learning scores (mean ± sem) of patients according to medication status.

PD-1 (n = 8) PD-2 (n = 8)
Mean SEM Mean SEM
Age (years) 54.5 2.8 56 3
Education (years) 9.9 0.5 12.9 1.5
Hoehn & Yahr off (/5) 3.1 0.1 3.4 0.3
TEST OFF ON
Reaction times (ms) 1601 136 1438 107
Trials per set (max: 50) 39.9 2.9 33.2 4.2
Search errors 13.2 2 6.7 1.2
Retention errors 10.5 2.2 9.2 3.3
Strategy score 14.8 4.7 24.6 5.8
RETEST ON OFF
Reaction times (ms) 1482 170 1532 143
Trials per set (max: 50) 28 3.9 30 3.7
Search errors 8.1 2.5 6.6 1.4
Retention errors 5.8 2.2 6.5 2.7
Strategy score 25.3 6.2 34.7 7.3

The counterbalanced within-subject design revealed no reliable effect of the acute challenge of levodopa.

Global conditional learning performance

The results are detailed for trials to criterion; unless otherwise indicated, errors to criterion yielded the same conclusions.

Global impairment and disease stage

As illustrated in Figure 2, patients systematically performed more poorly than controls (group effect, F(1, 23) = 5.8, p = 0.02) although, like controls, they improved across sessions (session effect F(1, 23) = 13.5, p = 0.001; group × session interaction, F(1, 23) = 0.5, p = 0.83 ns). They required 44% more trials per set on test (t34 = 3.8, p = 0.001), and 36% more trials per set on retest (t23 = 2.1, p = 0.02). More advanced disease stage as evaluated by Hoehn & Yahr on medication scores was correlated with more trials per set on test (test: r = 0.50, p = 0.05), and a combination of more trials per set (r = 0.65, p = 0.006), more errors per set (r = 0.67, p = 0.005) and poorer strategy scores on retest (r = −0.53, p = 0.03). We did not find reliable correlation between patients' impairment on conditional learning and their neuropsychological scores, but this negative result should be taken with caution as a complete neuropsychological evaluation could only be obtained from only 10 of our 16 patients.

Figure 2.

Figure 2

Global learning impairment in PD patients relative to controls. Number of trials (trials per set, mean, and SEM) to reach or fail to reach criterion of three consecutive correct responses for all 3 associations within the imparted 50 trials (maximum) on the test and retest session; the higher the scores, the worse the performance. For each session, scores were averaged over three trios of visuo-motor associations (***p = 0.001; *p = 0.05: between group differences as revealed by two-sample t-tests).

Potential influence of age and strategy

Controls seemed more vulnerable to aging than patients (Figure 3A). Only in controls, were scores positively correlated with age (errors: r = 0.46, p = 0.04 compared to r = 0.13, p = 0.63 ns in patients). Also, only in controls, did performance deteriorate after 60 years of age (t18 = 3.2, p = 0.005; patients: t14 = 0.8, p = 0.43 ns). The global impairment reported above was thus due largely to the younger subjects of the two groups (t19 = 3.5, p = 0.006), the difference between older subjects being marginal (t13 = 1.9, p = 0.10).

Figure 3.

Figure 3

Age class and strategy use influence on test. (A) Under 60 years of age (12/20 controls, 9/16 patients), PD deleterious effect on learning (trials per set, i.e., trials required to learn all 3 associations to criterion, mean ± SEM) was clear-cut (two-sample t-tests: ***p = 0.006). Beyond 60 years of age, the disease effect was only marginally worse than that of normal aging (two-sample t-tests: n.s., non significant p = 0.10). (B) Irrespective of age, PD effect on learning (y axis, trials per set) varied across individuals depending on their ability to use the optimal learning strategy (strategy scores, x-axis) as revealed by Pearson correlation analyses (r's and p's).

Conversely, performance was significantly correlated with strategy scores only in patients (errors: r = −0.71, p = 0.002; compared to: r = −0.31, p = 0.18 ns in controls). As illustrated in Figure 3B, patients with high strategy scores performed within controls' range. Most used the strategy without being aware of it. The one patient who used it deliberately, trying each movement in turn in a clockwise order, outperformed all other patients, and all controls but the youngest one.

Summary

As a group, the 16 early onset PD patients tested at advanced stages of the disease in the present study were unequivocally impaired in conditional visuo-motor learning. At individual level, however, the magnitude of this impairment varied with age and the ability to use the optimal learning strategy. Clear-cut in younger patients, the deficit became indistinguishable from the decline produced by normal aging in older patients, and irrespective of age, could be totally overcome by strategy use.

Early vs. late learning

Search vs. repetition errors and disease stage

On test (Figure 4A), PD patients were significantly impaired during both the early phase preceding the first correct response and the late phase following it. They made 66% more search errors (t34 = 2.6, p = 0.02), and 152% more repetition errors than controls (9.9 ± 1.9 vs. 3.9 ± 0.8; t34 = 2.9, p = 0.01). On retest (Figure 4B), a dissociation between these two deficits emerged. There, patients' early phase deficit was no longer significant (t23 = 1.2, p = 0.25), whereas their late phase deficit persisted, amounting to a marked 237% increase in repetition errors relative to controls (t23 = 2.5, p = 0.02). Another difference between the two deficits concerned their link with disease stage as evaluated by Hoehn & Yahr on medication scores. Disease stage reliably predicted repetition errors (r = 0.64, p = 0.007, on test and r = 0.70, p = 0.003, for retest), but not search errors (r = 0.02 on test and r = 0.39 for retest, p's ns).

Figure 4.

Figure 4

Early (search errors) and late (repetition errors) learning deficits in PD patients relative to controls. Patients were impaired on both measures on test (A), but not on retest (B), where only their repetition errors were significantly increased. Scores are means and SEM per set of three visuo-motor associations averaged over three sets. **p ≤ 0.02, two-sample t-tests.

Age and strategy influence

In PD patients, the receding late phase impairment on retest was paralleled by an improvement of strategy use. Specifically, patients' strategy scores, which were lower than controls' on test (19.7 ± 3.8 vs. 28.3 ± 4.7; t34 = 2.0, p = 0.05), became normal on retest (30.0 ± 3.8 vs. 25.6 ± 4.7; t23 = 0.6, p ns). The relationship between strategy scores and errors also evolved on the second session. Originally correlated with both search (r = −0.60, p = 0.01) and repetition errors (r = −0.55, p = 0.03), strategy scores became correlated solely with repetition errors (r = −0.57, p = 0.02, compared to r = −0.36, p ns, for search errors). Interestingly, comparison of controls' performance on test across age classes indicated that, like PD, normal aging preferentially affected the late learning phase (age < 60 years, n = 12 vs. age ≥ 60 years, n = 8: 2.5 ± 0.6 vs. 6.0 ± 1.4; t18 = 2.3, p = 0.04, for repetition errors, compared to 5.0 ± 0.5 vs. 7.5 ± 1.3; t18 = 1.7, p ns, for search errors).

Summary

PD disrupted both early and late learning of novel visuo-motor associations, but differentially so. The early deficit receded with additional training and associated improved strategy use, and was unrelated to disease stage. By contrast, the late deficit persisted despite additional training and better strategy use, and its magnitude was systematically correlated with disease stage as evaluated by Hoehn & Yahr on medication scores. In healthy subjects, late learning likewise seemed more vulnerable to aging than early learning.

Discussion

The present study provides the first evidence that conditional learning of associations between visual stimuli and motor acts is impaired in PD patients. Early and late learning, respectively, preceding and following the first correct response, were both disrupted, but the late learning deficit was more closely linked to disease severity and more enduring across sessions than the early learning deficit. Given that caudal putamen is particularly affected in PD patients, this second finding provides some support to the idea that putamen-dependent habitual responses are more affected by PD than caudate-dependent goal-directed actions. These two points will be discussed in turn.

Global conditional visuo-motor learning impairment in PD patients

Since Mishkin et al.'s (1984) proposal that habits depend on the striatum, the most convincing evidence supporting this claim came from studies using probabilistic tasks such as the weather prediction task (Knowlton et al., 1996; Frank et al., 2004; Shohamy et al., 2004a). By contrast, studies using conditional (motor or non-motor) learning tasks with error correction yielded inconsistencies (Gotham et al., 1988; Canavan et al., 1989; Vriezen and Moscovitch, 1990; Sprengelmeyer et al., 1995; Postle et al., 1997; Pillon et al., 1998; Marie et al., 1999). Here, by applying to PD patients the no-correction procedure used earlier in monkeys and healthy humans, we found a clear-cut learning impairment that endured over two testing sessions separated by a few hours. Two factors may have contributed to this positive finding in addition to the no-correction procedure.

The youth of a majority of our subjects is one of them: 12/20 controls and 9/16 patients indeed were under 60 years of age. Elderly subjects suffer from a decline in dopamine activity that impairs learning (Volkow et al., 1998; Floel et al., 2008) and habit tasks such as conditional learning are particularly sensitive to this normal aging effect (Levine et al., 1997; Schmitt-Eliassen et al., 2007). Had we included only our subjects over 60 years of age, we would have missed the disease effect as did Canavan et al. (1989) earlier, who compared young patients averaging 55 years of age to older controls averaging 60 years of age, that is, a pathological dopaminergic and striatal dysfunction to a physiological one. To what extent PD disrupts habit learning over and above normal aging is a question that remains to be addressed.

The second factor is that PD-induced brain damage is progressive, thereby leaving room for compensatory mechanisms. At the neural level, compensation varies with the task at hand. PD patients' normal performance in the weather prediction task is associated with an activation in temporal areas that is absent in controls (Moody et al., 2004), whereas normal performance in conditional visuo-motor learning is accompanied by greater activation in the prefrontal cortex relative to controls (Bedard and Sanes, 2009). At the behavioral level, within the present patients, those who resorted to the optimal learning strategy ended up learning normally, one patient even outperforming most controls. Thus, some early studies may have failed to detect conditional learning impairment in PD patients because of alternate strategies that went undetected.

The point that the present study does not help clarifying is the influence of PD patients' drug state on visuo-motor learning. Patients performed slightly but not significantly better after an acute challenge of levodopa than after an overnight withdrawal of medication. Several other recent studies (De Wit et al., 2011; Jocham et al., 2011; Shiner et al., 2012) also failed to find a direct effect of medication on PD patients' learning. These failures provide a reminder that above and beyond the well-known contribution of phasic dopamine to reward prediction errors, the neuromodulator role of dopamine in the brain is highly complex (see e.g., Wunderlich et al., 2012b for a recent discussion of this issue).

Goal-directed vs. habit deficits in PD patients

In recent years, the learning theory literature has emphasized that what was classically viewed as a unique anatomo-functional system in the memory literature, habit learning, may in fact engage two distinct systems. In this theory, goal-directed actions and a “model-based” form of reinforcement learning are opposed to habitual responses and a “model-free” form of reinforcement learning (for review see Doll et al., 2012). One study tested patients with mild PD to distinguish the disease effect on the two processes embedded in habit learning (De Wit et al., 2011). In agreement with the present study, the authors found a global learning impairment that was not remediated by dopaminergic medication. Yet, in apparent contradiction with both the present study and the model, which predicted that patients should suffer from a disruption of habitual responses, patients displayed only a disease severity-dependent deficit in goal-directed actions.

In light of the present data, it is possible that a habit deficit did exist in the previous study but was masked due to normal aging decline in controls, compensatory strategy in patients, or both. All three groups in the previous study (on, off, and controls) were over 60 years of age. The deleterious effect of PD on habits might simply have been indistinguishable from that of normal aging in these elderly subjects. As regards strategy, patients in De Wit et al. (2011) did not use any explicit compensatory strategy. It nevertheless remains possible that some of them developed an implicit one. This appears even more likely given that disease duration and severity were mild (6–8 years of disease instead of 11 years in the present study, Hoehn & Yahr scores < rather than >2) and that we found that the lower the disease stage, the greater the tendency to use an implicit strategy.

This said, however, there is also an important methodological difference across the two studies that may explain the discrepancy between the results. Both studies aimed at evaluating goal-directed actions and habitual responses in PD patients, but they operationalized the concepts differently. The previous study was based on the theory that a habit is an instrumental response that persists even when its outcome is no longer valued. It measured conflict-induced habits, which surprisingly produce longer response times than goal-directed actions. This represents a major deviation from the training-induced habits measured by classical associative tasks, which result in quicker habitual responding. Thus, the diverging results of the two studies vis-à-vis habitual responses might simply due to the fact that they measure two different forms of habits.

Here, we studied classical training-induced habits as measured by conditional visuo-motor learning. We focused on two learning phases operationally defined as, respectively, encompassing trials and errors preceding vs. following the first correct response. We presumed that the early phase predominantly (though probably not exclusively) relies on goal-directed actions, while the late phase reflects the emergence of habitual responses, based on two lines of evidence. First, in monkeys, there exist two learning-related neural changes, one transient, frequent in the caudate nucleus, and occurring very early in learning; the other, durable, frequent in the putamen, and stabilizing late in learning (Hadj-Bouziane and Boussaoud, 2003; Hadj-Bouziane et al., 2006; Williams and Eskandar, 2006). Second, in healthy humans, the caudate nucleus and putamen subserve different computations during learning with, as in monkeys, the caudate nucleus more involved in early learning and the putamen more involved in late learning (Brovelli et al., 2011; Amiez et al., 2012). PD disrupted both the early and late learning of novel visuo-motor associations, in agreement with the fact that, at the advanced stages studied here, the disease interferes with both the associative and sensori-motor loops. However, late learning was more closely linked to the disease and more enduringly affected by it than early learning. First, only the magnitude of the late deficit was positively correlated with Hoehn & Yahr on medication scores. Second, only the late deficit persisted on the second testing session despite performance improvement and better strategy use.

The optimal learning strategy for the present task, repeating the same movement until it proves correct, is of more help during the search for the correct response than during its repetition and, hence, may more effectively reduce patients' goal-directed deficit than their habitual responses deficit. Strategy-related compensation, however, is probably not the sole factor explaining the differential effect of PD on the two learning phases. The sensori-motor loop remains more dysfunctional than its associative counterpart as the disease progresses (Redgrave et al., 2010). We studied earlier a group of PD patients similar to the present ones, that is, non-demented, levodopa-responsive subjects fulfilling criteria for surgical treatment and, accordingly found greater dopamine depletion in the putamen than in the caudate nucleus (Broussolle et al., 1999). The greater vulnerability of late learning to the disease effects observed here for visuo-motor associations could result from this more extensive putamen damage. This speculation is in agreement with the idea of a preferential involvement of the putamen in habitual responses.

Importantly, here, we could only assess the effect of PD on the emergence of habits. We did not test over-learned habits of the kind we all use in daily life when stopping at a red traffic light or speeding through a green one, and that require very long learning periods, incompatible with neuropsychological testing. Extensively trained automatic responses do engage the posterior putamen in healthy humans (Wunderlich et al., 2012a), but it remains to be proved that there are accordingly disrupted by the putamen progressive dysfunction produced by PD.

Conclusion

The present sample of 16 early onset, non-demented, dopa-responsive patients suggests that when PD reaches moderate to severe stages it does interfere with habit learning as measured by conditional visuo-motor associative learning. This finding enriches the still meager body of data supporting the idea proposed by the memory literature that habits depend on the striatum like memories depend on the medial temporal lobe. The deficit following the first correct response seemed the most sensitive to the illness effects in the patient population tested in the present study. As Broussolle et al. (1999) found greater putamen than caudate dysfunction in similar patients, this finding is compatible with the recent view proposed by the learning theory literature that habits include two processes early, caudate-dependent, goal-directed actions and late, predominantly putamen-dependent, habitual responses. The patients' deficit was rarely dramatic, though. It was often compensated by optimization of the learning strategy, and was sometimes hardly distinguishable from normal aging decline. This showed that marked inter-individual variations occur even in a clinically quite homogeneous group.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank all the participants for taking part to this study. Fadila Hadj-Bouziane is supported by the Bettencourt Foundation and the NEURODIS Foundation; Martine Meunier by the ANR grants no. RPV08085CSA and ANR-O8-BLAN-0068-1, the AVENIR grant no. R05265CS, and a scholar award from the James S. McDonnell Foundation. Other support was contributed by CNRS, INSERM, Université de la Méditerranée (Marseille, France), Université Claude Bernard (Lyon, France), and the French Ministry of Research.

References

  1. Amiez C., Hadj-Bouziane F., Petrides M. (2012). Response selection versus feedback analysis in conditional visuo-motor learning. Neuroimage 59, 3723–3735 10.1016/j.neuroimage.2011.10.058 [DOI] [PubMed] [Google Scholar]
  2. Balleine B. W., O'Doherty J. P. (2010). Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 35, 48–69 10.1038/npp.2009.131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barnes T. D., Kubota Y., Hu D., Jin D. Z., Graybiel A. M. (2005). Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories. Nature 437, 1158–1161 10.1038/nature04053 [DOI] [PubMed] [Google Scholar]
  4. Bedard P., Sanes J. N. (2009). On a basal ganglia role in learning and rehearsing visual-motor associations. Neuroimage 47, 1701–1710 10.1016/j.neuroimage.2009.03.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Boettiger C. A., D'Esposito M. (2005). Frontal networks for learning and executing arbitrary stimulus-response associations. J. Neurosci. 25, 2723–2732 10.1523/JNEUROSCI.3697-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Brasted P. J., Wise S. P. (2004). Comparison of learning-related neuronal activity in the dorsal premotor cortex and striatum. Eur. J. Neurosci. 19, 721–740 10.1111/j.0953-816X.2003.03181.x [DOI] [PubMed] [Google Scholar]
  7. Broussolle E., Dentresangle C., Landais P., Garcia-Larrea L., Pollak P., Croisile B., et al. (1999). The relation of putamen and caudate nucleus 18F-Dopa uptake to motor and cognitive performances in Parkinson's disease. J. Neurol. Sci. 166, 141–151 [DOI] [PubMed] [Google Scholar]
  8. Brovelli A., Laksiri N., Nazarian B., Meunier M., Boussaoud D. (2008). Understanding the neural computations of arbitrary visuomotor learning through fMRI and associative learning theory. Cereb. Cortex 18, 1485–1495 10.1093/cercor/bhm198 [DOI] [PubMed] [Google Scholar]
  9. Brovelli A., Nazarian B., Meunier M., Boussaoud D. (2011). Differential roles of caudate nucleus and putamen during instrumental learning. Neuroimage 57, 1580–1590 10.1016/j.neuroimage.2011.05.059 [DOI] [PubMed] [Google Scholar]
  10. Buch E. R., Brasted P. J., Wise S. P. (2006). Comparison of population activity in the dorsal premotor cortex and putamen during the learning of arbitrary visuomotor mappings. Exp. Brain Res. 169, 69–84 10.1007/s00221-005-0130-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bussey T. J., Wise S. P., Murray E. A. (2001). The role of ventral and orbital prefrontal cortex in conditional visuomotor learning and strategy use in rhesus monkeys (Macaca mulatta). Behav. Neurosci. 115, 971–982 [DOI] [PubMed] [Google Scholar]
  12. Canavan A. G., Passingham R. E., Marsden C. D., Quinn N., Wyke M., Polkey C. E. (1989). The performance on learning tasks of patients in the early stages of Parkinson's disease. Neuropsychologia 27, 141–156 10.1016/0028-3932(89)90167-X [DOI] [PubMed] [Google Scholar]
  13. Carelli R. M., Deadwyler S. A. (1997). Cellular mechanisms underlying reinforcement-related processing in the nucleus accumbens: electrophysiological studies in behaving animals. Pharmacol. Biochem. Behav. 57, 495–504 10.1016/S0091-3057(96)00442-X [DOI] [PubMed] [Google Scholar]
  14. Daw N. D., Gershman S. J., Seymour B., Dayan P., Dolan R. J. (2011). Model-based influences on humans' choices and striatal prediction errors. Neuron 69, 1204–1215 10.1016/j.neuron.2011.02.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Deiber M. P., Wise S. P., Honda M., Catalan M. J., Grafman J., Hallett M. (1997). Frontal and parietal networks for conditional motor learning: a positron emission tomography study. J. Neurophysiol. 78, 977–991 [DOI] [PubMed] [Google Scholar]
  16. Delgado M. R., Miller M. M., Inati S., Phelps E. A. (2005). An fMRI study of reward-related probability learning. Neuroimage 24, 862–873 10.1016/j.neuroimage.2004.10.002 [DOI] [PubMed] [Google Scholar]
  17. De Wit S., Barker R. A., Dickinson A. D., Cools R. (2011). Habitual versus goal-directed action control in Parkinson disease. J. Cogn. Neurosci. 23, 1218–1229 10.1162/jocn.2010.21514 [DOI] [PubMed] [Google Scholar]
  18. Dickinson A. (1985). Actions and habits: the development of behavioral autonomy. Philos. Trans. R. Soc. Lond. B Biol. Sci. 308, 67–78 [Google Scholar]
  19. Dickinson A. (1994). Instrumental conditioning, in Animal Cognition and Learning, ed Mackintosh N. J. (London: Academic Press; ), 45–79 [Google Scholar]
  20. Dickinson A., Balleine B. (1993). Actions and responses: the dual psychology of behaviour, in Spatial Representation: Problems in Philosophy and Psychology, eds Eilan N., McCarthy R. A., Brewer B. (Malden, MA: Blackwell Publishers; ), 277–293 [Google Scholar]
  21. Dickinson A., Balleine B. W. (1994). Motivational control of goal-directed action. Anim. Learn. Behav. 22, 1–18 [Google Scholar]
  22. Doll B. B., Simon D. A., Daw N. D. (2012). The ubiquity of model-based reinforcement learning. Curr. Opin. Neurobiol. 22, 1075–1081 10.1016/j.conb.2012.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Floel A., Garraux G., Xu B., Breitenstein C., Knecht S., Herscovitch P., et al. (2008). Levodopa increases memory encoding and dopamine release in the striatum in the elderly. Neurobiol. Aging 29, 267–279 10.1016/j.neurobiolaging.2006.10.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Frank M. J., Seeberger L. C., O'Reilly R. C. (2004). By carrot or by stick: cognitive reinforcement learning in parkinsonism. Science 306, 1940–1943 10.1126/science.1102941 [DOI] [PubMed] [Google Scholar]
  25. Gibb W. R., Lees A. J. (1988). A comparison of clinical and pathological features of young- and old-onset Parkinson's disease. Neurology 38, 1402–1406 [DOI] [PubMed] [Google Scholar]
  26. Gotham A. M., Brown R. G., Marsden C. D. (1988). “Frontal” cognitive function in patients with Parkinson's disease ‘on’ and ‘off’ levodopa. Brain 111(Pt 2), 299–321 [DOI] [PubMed] [Google Scholar]
  27. Grol M. J., De Lange F. P., Verstraten F. A., Passingham R. E., Toni I. (2006). Cerebral changes during performance of overlearned arbitrary visuomotor associations. J. Neurosci. 26, 117–125 10.1523/JNEUROSCI.2786-05.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hadj-Bouziane F., Boussaoud D. (2003). Neuronal activity in the monkey striatum during conditional visuomotor learning. Exp. Brain Res. 153, 190–196 10.1007/s00221-003-1592-4 [DOI] [PubMed] [Google Scholar]
  29. Hadj-Bouziane F., Frankowska H., Meunier M., Coquelin P. A., Boussaoud D. (2006). Conditional visuo-motor learning and dimension reduction. Cogn. Process. 7, 95–104 10.1007/s10339-005-0028-4 [DOI] [PubMed] [Google Scholar]
  30. Hadj-Bouziane F., Meunier M., Boussaoud D. (2003). Conditional visuo-motor learning in primates: a key role for the basal ganglia. J. Physiol. Paris 97, 567–579 10.1016/j.jphysparis.2004.01.014 [DOI] [PubMed] [Google Scholar]
  31. Haruno M., Kawato M. (2006). Different neural correlates of reward expectation and reward expectation error in the putamen and caudate nucleus during stimulus-action-reward association learning. J. Neurophysiol. 95, 948–959 10.1152/jn.00382.2005 [DOI] [PubMed] [Google Scholar]
  32. Haruno M., Kuroda T., Doya K., Toyama K., Kimura M., Samejima K., et al. (2004). A neural correlate of reward-based behavioral learning in caudate nucleus: a functional magnetic resonance imaging study of a stochastic decision task. J. Neurosci. 24, 1660–1665 10.1523/JNEUROSCI.3417-03.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Histed M. H., Pasupathy A., Miller E. K. (2009). Learning substrates in the primate prefrontal cortex and striatum: sustained activity related to successful actions. Neuron 63, 244–253 10.1016/j.neuron.2009.06.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Jocham G., Klein T. A., Ullsperger M. (2011). Dopamine-mediated reinforcement learning signals in the striatum and ventromedial prefrontal cortex underlie value-based choices. J. Neurosci. 31, 1606–1613 10.1523/JNEUROSCI.3904-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Jog M. S., Kubota Y., Connolly C. I., Hillegaart V., Graybiel A. M. (1999). Building neural representations of habits. Science 286, 1745–1749 10.1126/science.286.5445.1745 [DOI] [PubMed] [Google Scholar]
  36. Kim H., Sul J. H., Huh N., Lee D., Jung M. W. (2009). Role of striatum in updating values of chosen actions. J. Neurosci. 29, 14701–14712 10.1523/JNEUROSCI.2728-09.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kimchi E. Y., Torregrossa M. M., Taylor J. R., Laubach M. (2009). Neuronal correlates of instrumental learning in the dorsal striatum. J. Neurophysiol. 102, 475–489 10.1152/jn.00262.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Knowlton B. J., Mangels J. A., Squire L. R. (1996). A neostriatal habit learning system in humans. Science 273, 1399–1402 10.1126/science.273.5280.1399 [DOI] [PubMed] [Google Scholar]
  39. Law J. R., Flanery M. A., Wirth S., Yanike M., Smith A. C., Frank L. M., et al. (2005). Functional magnetic resonance imaging activity during the gradual acquisition and expression of paired-associate memory. J. Neurosci. 25, 5720–5729 10.1523/JNEUROSCI.4935-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Levine B., Stuss D. T., Milberg W. P. (1997). Effects of aging on conditional associative learning: process analyses and comparison with focal frontal lesions. Neuropsychology 11, 367–381 [DOI] [PubMed] [Google Scholar]
  41. Marie R. M., Barre L., Dupuy B., Viader F., Defer G., Baron J. C. (1999). Relationships between striatal dopamine denervation and frontal executive tests in Parkinson's disease. Neurosci. Lett. 260, 77–80 10.1016/S0304-3940(98)00928-8 [DOI] [PubMed] [Google Scholar]
  42. Mishkin M., Malamut B., Bachevalier J. (1984). Memories and habits: two neural systems, in The Neurobiology of Learning and Memory, eds Lynch G., McGaugh J. L., Weinberger N. M. (New york, NY: Guilford Press; ), 65–88 [Google Scholar]
  43. Mishkin M., Petri H. L. (1984). Memories and habits: some implications for the analysis of learning and retention, in Neuropsychology of Memory, eds Squire L. R., Butters N. (New York, NY: Cambridge University Press; ), 196–210 [Google Scholar]
  44. Miyachi S., Hikosaka O., Lu X. (2002). Differential activation of monkey striatal neurons in the early and late stages of procedural learning. Exp. Brain Res. 146, 122–126 10.1007/s00221-002-1213-7 [DOI] [PubMed] [Google Scholar]
  45. Miyachi S., Hikosaka O., Miyashita K., Karadi Z., Rand M. K. (1997). Differential roles of monkey striatum in learning of sequential hand movement. Exp. Brain Res. 115, 1–5 [DOI] [PubMed] [Google Scholar]
  46. Moody T. D., Bookheimer S. Y., Vanek Z., Knowlton B. J. (2004). An implicit learning task activates medial temporal lobe in patients with Parkinson's disease. Behav. Neurosci. 118, 438–442 10.1037/0735-7044.118.2.438 [DOI] [PubMed] [Google Scholar]
  47. Nixon P. D., McDonald K. R., Gough P. M., Alexander I. H., Passingham R. E. (2004). Cortico-basal ganglia pathways are essential for the recall of well-established visuomotor associations. Eur. J. Neurosci. 20, 3165–3178 10.1111/j.1460-9568.2004.03788.x [DOI] [PubMed] [Google Scholar]
  48. O'Doherty J., Dayan P., Schultz J., Deichmann R., Friston K., Dolan R. J. (2004). Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science 304, 452–454 10.1126/science.1094285 [DOI] [PubMed] [Google Scholar]
  49. Packard M. G., McGaugh J. L. (1996). Inactivation of hippocampus or caudate nucleus with lidocaine differentially affects expression of place and response learning. Neurobiol. Learn. Mem. 65, 65–72 10.1006/nlme.1996.0007 [DOI] [PubMed] [Google Scholar]
  50. Passingham R. E., Toni I., Rushworth M. F. (2000). Specialisation within the prefrontal cortex: the ventral prefrontal cortex and associative learning. Exp. Brain Res. 133, 103–113 [DOI] [PubMed] [Google Scholar]
  51. Pasupathy A., Miller E. K. (2005). Different time courses of learning-related activity in the prefrontal cortex and striatum. Nature 433, 873–876 10.1038/nature03287 [DOI] [PubMed] [Google Scholar]
  52. Pessiglione M., Seymour B., Flandin G., Dolan R. J., Frith C. D. (2006). Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature 442, 1042–1045 10.1038/nature05051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Petrides M. (1985a). Deficits in non-spatial conditional associative learning after periarcuate lesions in the monkey. Behav. Brain Res. 16, 95–101 [DOI] [PubMed] [Google Scholar]
  54. Petrides M. (1985b). Deficits on conditional associative-learning tasks after frontal- and temporal-lobe lesions in man. Neuropsychologia 23, 601–614 [DOI] [PubMed] [Google Scholar]
  55. Petrides M. (1997). Visuo-motor conditional associative learning after frontal and temporal lesions in the human brain. Neuropsychologia 35, 989–997 10.1016/S0028-3932(97)00026-2 [DOI] [PubMed] [Google Scholar]
  56. Pillon B., Deweer B., Vidailhet M., Bonnet A. M., Hahn-Barma V., Dubois B. (1998). Is impaired memory for spatial location in Parkinson's disease domain specific or dependent on “strategic” processes? Neuropsychologia 36, 1–9 10.1016/S0028-3932(97)00102-4 [DOI] [PubMed] [Google Scholar]
  57. Postle B. R., Jonides J., Smith E. E., Corkin S., Growdon J. H. (1997). Spatial, but not object, delayed response is impaired in early Parkinson's disease. Neuropsychology 11, 171–179 [DOI] [PubMed] [Google Scholar]
  58. Ragozzino M. E., Jih J., Tzavos A. (2002). Involvement of the dorsomedial striatum in behavioral flexibility: role of muscarinic cholinergic receptors. Brain Res. 953, 205–214 10.1016/S0006-8993(02)03287-0 [DOI] [PubMed] [Google Scholar]
  59. Reading P. J., Dunnett S. B., Robbins T. W. (1991). Dissociable roles of the ventral, medial and lateral striatum on the acquisition and performance of a complex visual stimulus-response habit. Behav. Brain Res. 45, 147–161 [DOI] [PubMed] [Google Scholar]
  60. Redgrave P., Rodriguez M., Smith Y., Rodriguez-Oroz M. C., Lehericy S., Bergman H., et al. (2010). Goal-directed and habitual control in the basal ganglia: implications for Parkinson's disease. Nat. Rev. Neurosci. 11, 760–772 10.1038/nrn2915 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Rescorla R. A. (1991). Associative relations in instrumental learning: the eighteenth Barlett Memorial Lecture. Q. J. Exp. Psychol. 43B, 1–23 [Google Scholar]
  62. Schmitt-Eliassen J., Ferstl R., Wiesner C., Deuschl G., Witt K. (2007). Feedback-based versus observational classification learning in healthy aging and Parkinson's disease. Brain Res. 1142, 178–188 10.1016/j.brainres.2007.01.042 [DOI] [PubMed] [Google Scholar]
  63. Seger C. A., Spiering B. J. (2011). A critical review of habit learning and the Basal Ganglia. Front. Syst. Neurosci. 5:66 10.3389/fnsys.2011.00066 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Shiner T., Seymour B., Wunderlich K., Hill C., Bhatia K. P., Dayan P., et al. (2012). Dopamine and performance in a reinforcement learning task: evidence from Parkinson's disease. Brain 135, 1871–1883 10.1093/brain/aws083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Shohamy D., Myers C. E., Grossman S., Sage J., Gluck M. A., Poldrack R. A. (2004a). Cortico-striatal contributions to feedback-based learning: converging data from neuroimaging and neuropsychology. Brain 127, 851–859 10.1093/brain/awh100 [DOI] [PubMed] [Google Scholar]
  66. Shohamy D., Myers C. E., Onlaor S., Gluck M. A. (2004b). Role of the basal ganglia in category learning: how do patients with Parkinson's disease learn? Behav. Neurosci. 118, 676–686 10.1037/0735-7044.118.4.676 [DOI] [PubMed] [Google Scholar]
  67. Simon S. R., Meunier M., Piettre L., Berardi A. M., Segebarth C. M., Boussaoud D. (2002). Spatial attention and memory versus motor preparation: premotor cortex involvement as revealed by fMRI. J. Neurophysiol. 88, 2047–2057 [DOI] [PubMed] [Google Scholar]
  68. Sprengelmeyer R., Canavan A. G., Lange H. W., Homberg V. (1995). Associative learning in degenerative neostriatal disorders: contrasts in explicit and implicit remembering between Parkinson's and Huntington's diseases. Mov. Disord. 10, 51–65 10.1002/mds.870100110 [DOI] [PubMed] [Google Scholar]
  69. Staddon J. E., Cerutti D. T. (2003). Operant conditioning. Annu. Rev. Psychol. 54, 115–144 10.1146/annurev.psych.54.101601.145124 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Stalnaker T. A., Calhoon G. G., Ogawa M., Roesch M. R., Schoenbaum G. (2010). Neural correlates of stimulus-response and response-outcome associations in dorsolateral versus dorsomedial striatum. Front. Integr. Neurosci. 4:12 10.3389/fnint.2010.00012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Tanaka S. C., Doya K., Okada G., Ueda K., Okamoto Y., Yamawaki S. (2004). Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nat. Neurosci. 7, 887–893 10.1038/nn1279 [DOI] [PubMed] [Google Scholar]
  72. Tang C., Pawlak A. P., Prokopenko V., West M. O. (2007). Changes in activity of the striatum during formation of a motor habit. Eur. J. Neurosci. 25, 1212–1227 10.1111/j.1460-9568.2007.05353.x [DOI] [PubMed] [Google Scholar]
  73. Taylor A. E., Saint-Cyr J. A., Lang A. E. (1990). Memory and learning in early Parkinson's disease: evidence for a “frontal lobe syndrome”. Brain Cogn. 13, 211–232 [DOI] [PubMed] [Google Scholar]
  74. Thorn C. A., Atallah H., Howe M., Graybiel A. M. (2010). Differential dynamics of activity changes in dorsolateral and dorsomedial striatal loops during learning. Neuron 66, 781–795 10.1016/j.neuron.2010.04.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Toni I., Passingham R. E. (1999). Prefrontal-basal ganglia pathways are involved in the learning of arbitrary visuomotor associations: a PET study. Exp. Brain Res. 127, 19–32 [DOI] [PubMed] [Google Scholar]
  76. Toni I., Ramnani N., Josephs O., Ashburner J., Passingham R. E. (2001a). Learning arbitrary visuomotor associations: temporal dynamic of brain activity. Neuroimage 14, 1048–1057 10.1006/nimg.2001.0894 [DOI] [PubMed] [Google Scholar]
  77. Toni I., Rushworth M. F., Passingham R. E. (2001b). Neural correlates of visuomotor associations. Spatial rules compared with arbitrary rules. Exp. Brain Res. 141, 359–369 10.1007/s002210100877 [DOI] [PubMed] [Google Scholar]
  78. Tremblay L., Hollerman J. R., Schultz W. (1998). Modifications of reward expectation-related neuronal activity during learning in primate striatum. J. Neurophysiol. 80, 964–977 [DOI] [PubMed] [Google Scholar]
  79. Tricomi E., Balleine B. W., O'Doherty J. P. (2009). A specific role for posterior dorsolateral striatum in human habit learning. Eur. J. Neurosci. 29, 2225–2232 10.1111/j.1460-9568.2009.06796.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Tricomi E. M., Delgado M. R., Fiez J. A. (2004). Modulation of caudate activity by action contingency. Neuron 41, 281–292 10.1016/S0896-6273(03)00848-1 [DOI] [PubMed] [Google Scholar]
  81. Turchi J., Devan B., Yin P., Sigrist E., Mishkin M. (2010). Pharmacological evidence that both cognitive memory and habit formation contribute to within-session learning of concurrent visual discriminations. Neuropsychologia 48, 2245–2250 10.1016/j.neuropsychologia.2010.02.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Volkow N. D., Gur R. C., Wang G. J., Fowler J. S., Moberg P. J., Ding Y. S., et al. (1998). Association between decline in brain dopamine activity with age and cognitive and motor impairment in healthy individuals. Am. J. Psychiatry 155, 344–349 [DOI] [PubMed] [Google Scholar]
  83. Vriezen E. R., Moscovitch M. (1990). Memory for temporal order and conditional associative-learning in patients with Parkinson's disease. Neuropsychologia 28, 1283–1293 [DOI] [PubMed] [Google Scholar]
  84. Williams Z. M., Eskandar E. N. (2006). Selective enhancement of associative learning by microstimulation of the anterior caudate. Nat. Neurosci. 9, 562–568 10.1038/nn1662 [DOI] [PubMed] [Google Scholar]
  85. Wise S. P., Murray E. A., Gerfen C. R. (1996). The frontal cortex-basal ganglia system in primates. Crit. Rev. Neurobiol. 10, 317–356 [DOI] [PubMed] [Google Scholar]
  86. Wunderlich K., Dayan P., Dolan R. J. (2012a). Mapping value based planning and extensively trained choice in the human brain. Nat. Neurosci. 15, 786–791 10.1038/nn.3068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Wunderlich K., Smittenaar P., Dolan R. J. (2012b). Dopamine enhances model-based over model-free choice behavior. Neuron 75, 418–424 10.1016/j.neuron.2012.03.042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Yin H. H., Knowlton B. J., Balleine B. W. (2005a). Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning. Eur. J. Neurosci. 22, 505–512 10.1111/j.1460-9568.2005.04219.x [DOI] [PubMed] [Google Scholar]
  89. Yin H. H., Ostlund S. B., Knowlton B. J., Balleine B. W. (2005b). The role of the dorsomedial striatum in instrumental conditioning. Eur. J. Neurosci. 22, 513–523 10.1111/j.1460-9568.2005.04218.x [DOI] [PubMed] [Google Scholar]
  90. Yin H. H., Knowlton B. J., Balleine B. W. (2006). Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning. Behav. Brain Res. 166, 189–196 10.1016/j.bbr.2005.07.012 [DOI] [PubMed] [Google Scholar]

Articles from Frontiers in Human Neuroscience are provided here courtesy of Frontiers Media SA

RESOURCES