Corticostriatal White Matter Integrity and Dopamine D1 Receptor Availability Predict Age Differences in Prefrontal Value Signaling during Reward Learning

Lieke de Boer; Benjamín Garzón; Jan Axelsson; Katrine Riklund; Lars Nyberg; Lars Bäckman; Marc Guitart-Masip

doi:10.1093/cercor/bhaa104

. 2020 Jun 2;30(10):5270–5280. doi: 10.1093/cercor/bhaa104

Corticostriatal White Matter Integrity and Dopamine D1 Receptor Availability Predict Age Differences in Prefrontal Value Signaling during Reward Learning

Lieke de Boer ^1,^✉, Benjamín Garzón ¹, Jan Axelsson ^2,³, Katrine Riklund ^2,³, Lars Nyberg ^2,^3,⁴, Lars Bäckman ¹, Marc Guitart-Masip ^1,⁵

PMCID: PMC7472214 PMID: 32484215

Abstract

Probabilistic reward learning reflects the ability to adapt choices based on probabilistic feedback. The dopaminergically innervated corticostriatal circuit in the brain plays an important role in supporting successful probabilistic reward learning. Several components of the corticostriatal circuit deteriorate with age, as it does probabilistic reward learning. We showed previously that D1 receptor availability in NAcc predicts the strength of anticipatory value signaling in vmPFC, a neural correlate of probabilistic learning that is attenuated in older participants and predicts probabilistic reward learning performance. We investigated how white matter integrity in the pathway between nucleus accumbens (NAcc) and ventromedial prefrontal cortex (vmPFC) relates to the strength of anticipatory value signaling in vmPFC in younger and older participants. We found that in a sample of 22 old and 23 young participants, fractional anisotropy in the pathway between NAcc and vmPFC predicted the strength of value signaling in vmPFC independently from D1 receptor availability in NAcc. These findings provide tentative evidence that integrity in the dopaminergic and white matter pathways of corticostriatal circuitry supports the expression of value signaling in vmPFC which supports reward learning, however, the limited sample size calls for independent replication. These and future findings could add to the improved understanding of how corticostriatal integrity contributes to reward learning ability.

Keywords: aging, corticostriatal loops, dopamine, value-based decision making, white matter integrity

Introduction

The ability to flexibly update one’s actions based on value-related changes in the environment deteriorates with age, as shown in decision-making studies comparing older and younger adults (Samanez-Larkin et al. 2012; Chowdhury et al. 2013b; Samanez-Larkin and Knutson 2015; de Boer et al. 2017). Animal studies and neuroanatomical evidence suggest that reward learning necessary for optimal value-based decision-making in changeable environments recruits corticostriatal loops (Haber and Knutson 2010; Seger et al. 2010; Haber 2016; Smittenaar et al. 2017). These loops are modulated by dopaminergic projections from the midbrain. Activity within the loop passing through the ventromedial portion of the striatum has consistently been associated with motivational aspects of behavior (Haber and Behrens 2014). Conversely, activity within the loops passing through the dorsolateral portion of the striatum is associated with converting cognitive and motivational signals into motor programs (Bornstein and Daw 2011).

The ventromedial prefrontal cortex (vmPFC) and nucleus accumbens (NAcc) are important nodes within the motivational portion of these loops. We have previously shown that value anticipation in vmPFC is related to performance on a two-armed bandit task (TAB) (de Boer et al. 2017). In that study, this signal proved weaker in a sample of older participants, compared with younger participants. Importantly, this value anticipation signal in vmPFC correlated with performance on the TAB, even when controlling for age. This suggested that as people age, the brain’s ability to produce a strong value signal needed to perform adaptive choices may change.

Aging affects the integrity of the dopaminergic system (Bäckman et al. 2000; Bäckman et al. 2010; Rieckmann et al. 2011) and white-matter tracts in the brain (Raz et al. 2005; Yang et al. 2016; Bennett et al. 2017). The deterioration of either or both of these systems could underlie worse adaptive value-based decision-making in older adults. We have already shown that dopamine (DA) D1-R availability in NAcc predicts the strength of the value signal in vmPFC (de Boer et al. 2017). Integrity of frontostriatal pathways as measured by diffusion weighted imaging (DWI) has previously proven the important of good performance in probabilistic reward learning tasks which measure value-based decision-making ability (Samanez-Larkin et al. 2012; van de Vijver et al. 2016). The relationship between these measures and value-based decision-making performance could stem from the fact that the prefrontal value signal necessary for making adaptive value-based choices cannot properly emerge if the dopaminergic and vmPFC-accumbens integrity are affected. This dual dependence on dopaminergic modulation in the NAcc and frontostriatal connectivity is supported by the recent observation that DA transporter binding potential in NAcc has an indirect effect on reinforcement learning behavior, through frontostriatal functional connectivity (Kaiser et al. 2018).

Based on this evidence, we hypothesized that vmPFC-accumbens white matter integrity would predict the strength of value anticipation signals in vmPFC. Given that D1-R availability in NAcc is also related to the strength of value anticipation in vmPFC in the data used in this study, we expected that one of these two measures could mediate the relationship of the other measure with value anticipation in vmPFC. Alternatively, both vmPFC-accumbens white matter integrity and D1-R availability in NAcc could independently predict the strength of value anticipation signals in vmPFC. We were also expecting to find a direct relationship between vmPFC-accumbens white matter integrity and behavioral performance on the TAB.

No study has previously investigated the combined effect of dopaminergic integrity and white-matter pathway integrity on probabilistic reward learning. Here, we test our hypotheses in a sample of 22 older and 23 young participants, whose data were part of a previously published study (de Boer et al. 2017). For these participants, we report previously unpublished DWI data, as well as functional MRI data during the TAB and DA D1-R availability data with positron emission tomography (PET) available to us (de Boer et al. 2017). We used a computational model to calculate subjective value for each participant on each trial (de Boer et al. 2017).

Materials and Methods

Participants

A total of 30 healthy, cognitively high functioning older adults aged 66–75 and 30 younger adults aged 19–32 were recruited through local newspaper advertisements in Umeå Sweden. The health of all potential participants was assessed before recruitment by a questionnaire administered via telephone by research nurses. The questionnaire enquired about past and present neurological or psychiatric conditions, head trauma, diabetes mellitus, arterial hypertension that required more than two medications, addiction to alcohol or other drugs, and bad eyesight. All participants were right-handed and provided written informed consent prior to commencing the study. Ethical approval was obtained from the Umeå Regional Ethical Review Board. Participants were paid 2000 SEK ( Inline graphic $225) for participation and earned up to 149 additional SEK ($17) in the TAB.

In fMRI analyses, three older participants were excluded—one due to excessive head motion during fMRI scanning, one for only ever selecting one of the two stimuli in the task, and one due to a malfunctioning button box, resulting in no recorded responses. One additional older participant did not complete the full PET scan, but this participant’s fMRI and task data are still included in the analysis where possible. This resulted in a total of 57 participants for fMRI and task analysis (27 old [10 female], 30 young [18 female]) and 56 participants for PET analysis (26 old, 30 young). For DWI analysis, tracts between VS and vmPFC could not be reconstructed for 11 out of 57 participants. Thus, for DWI analysis, 46 participants were included (23 old [8 female], 23 young [14 female]). One of the older participants in this sample was the one that did not complete the PET scan, so for the full analysis, 45 participants (22 old, 23 young) were included.

All participants performed the Mini Mental State Examination (MMSE). Scores ranged from 26 to 30 in the young sample (mean = 29.40, SD = 0.97) and from 27 to 30 in the older sample (mean = 29.37, SD = 0.79), with no evidence of a difference between the two (P = 0.90). PET and fMRI scanning were planned 2 days apart. However, due to a technical problem with the PET scanner, 12 participants were scanned at a longer delay apart (range 4–44 days apart). On the MRI scanning day, participants completed the TAB and another unrelated task inside the MRI scanner. Participants also completed a battery of tasks outside the scanner. Only results from the TAB will be discussed here.

Two-Armed Bandit Task

The TAB task was presented in Cogent 2000 (Wellcome Trust for Neuroimaging). Figure 1a depicts a schematic representation of one TAB trial. Participants were instructed to choose the fractal stimulus they thought to be most rewarding at each trial and were informed of the changing probability of obtaining a reward for each stimulus. These probabilities varied independently from one another. Probabilities were generated using a Gaussian random walk (Daw and Doya 2006). Before scanning, participants were presented with five practice trials. The same set of Gaussian random walks was used for all participants (Fig. 1), but the assignment of random walk to stimulus identity was counterbalanced across participants.

Left: schematic representation of a trial in the TAB. Participants were presented with two fractal images on each trial and selected one of them through a button press. The maximum response time was 2000 ms, meaning the trial would count as a miss if the response time exceeded this limit and the next trial would start immediately after the next intertrial interval. If one stimulus was selected, this option was highlighted with a red frame. After 1000 ms, participants were presented with the outcome: either a green arrow pointing upwards, indicating an obtained reward of 1 SEK ($0.11), or a yellow horizontal bar, indicating no win. Each image was randomly assigned a position on the screen (left or right) on each of the 2 × 110 trials of the experiment. Reward probabilities varied throughout the experiment. Right: varying reward probabilities for obtaining a reward for each bandit on the 220 trials of the experiment.

Statistical Analysis of Brain and Behavior

All statistical analyses were performed using R version 3.5.3. As a measure of performance, we used the number of rewarded trials that each participant saw. This was equivalent to the amount of money each participant earned on the task (participants received 1 Swedish Crown per rewarded trial). Performance differences between groups were assessed with a one-tailed student’s t-test. We used the lm function in the R stats package to perform a number of multiple regressions that assessed the relationship between age, performance, white matter integrity, value anticipation in vmPFC, and DA D1-R availability in the NAcc. The assumptions of the regression models were checked by testing the residuals with Shapiro–Wilk’s test of normality and were considered normal at P > 0.05. Variance inflation factors (VIFs) were calculated with the ols_vif_tol function, part of the olsrr package. We considered predictor values to be overinflated if the VIF > 10 (O’brien 2007). In all of the analyses we performed, we included age as a covariate of no interest. However, we report all of the bivariate relationships without controlling for age in the multiple linear regression tables. As many of our predictors are collinear with age, controlling for age ensures that the observed relationships are robust across age groups. Thus, variables that are affected by age, but that can predict brain activity and performance beyond age, provide robust explanations for processes affected by age-related changes.

Computational Analysis of Behavior

To calculate trial-by-trial choice values, we used a previously reported computational model, a variation on a Bayesian Observer that has been shown to outperform alternative models using standard model comparison methods (de Boer et al. 2017). For brevity, we will present only the winning model from this analysis. For model comparison statistics (including standard reinforcement learning models using the Rescorla–Wagner updating rule) and fitting procedures, we refer to our previous publication (de Boer et al. 2017).

The winning model uses a softmax decision rule, where action propensities ( Inline graphic ) for each bandit were entered. A temperature parameter (with > 0) determined the probability that a participant chose each action a {0,1} (corresponding to each bandit)

(1)

where Inline graphic is the action propensity for bandit on trial . In the winning model, the probability of obtaining a reward derived for each bandit was represented as a beta distribution (one for each bandit)

(2)

Inline graphic was updated upon observing an outcome of each trial. From these probability distributions, we derived the mean probability of getting a reward for each bandit and its variance. We will refer to the mean probability of obtaining a reward for each bandit on a given trial as the expected value Inline graphic , which is calculated as follows:

(3)

Additionally, the variance in reward probability, which was used to calculate the action propensities Inline graphic (see further below), was defined as

(4)

The parameter values at Inline graphic in the beta distributions were 1 ( = = 1). Therefore, , reflecting an expected value at chance level on the first trial, and , reflecting the maximum possible variance on the first trial, in line with a general uncertainty about the underlying reward probability distributions. Both parameters of each beta distribution are updated on each trial as follows: when bandit Inline graphic is chosen and a reward is obtained, is increased by 1, is relaxed toward 1, and both (1a referring to the unchosen option) and are relaxed toward 1. Conversely, after reward omission, is increased by 1, but again, both and are relaxed toward 1. Hence, for the chosen bandit,

(5)

(6)

And for the unchosen bandit,

(7)

where Inline graphic and are separate individually fitted free parameters that determine the speed with which the reward probability distributions are updated (with ) and forgotten ().

In addition, the variance of the bandit that was not chosen on trail Inline graphic was added to the action propensity of that bandit on trial . Hence, for the unchosen option,

(8)

where, if Inline graphic is positive, choices were favored if they had high variance and if they were not chosen on the previous trial, which can be interpreted as an exploration bonus.

Finally, a measure of confidence was added to the value of the bandit that was not chosen on trial Inline graphic . Relative confidence was defined as the probability that a sample drawn from the distribution for bandit would be more likely to lead to a reward than a sample drawn from the distribution for bandit . A relative confidence was added to the unchosen option at trial

(9)

where Inline graphic was an individually fitted parameter that weighted the relative confidence which was calculated as follows:

(10)

where

(11)

and

(12)

MRI Acquisition

Brain images were acquired on an MR750 3T scanner (GE Medical Systems), equipped with a 32-channel phased-array head coil. T₁-weighted 3D-SPGR images were acquired using a single-echo sequence (voxel size: 0.5 × 0.5 × 1 mm, TE = 3.20, flip angle = 12°). DWI scans were acquired with a spin-EPI T₂-weighted sequence (64 slices, voxel size = 1 × 1 × 2 mm, TR = 8000 ms, TE = 84.4 ms, FoV = 25 cm, flip angle = 90°), using three repetitions, with 32 independent directions (b = 1000 s/mm²) and six b = 0 images. Functional images were acquired using a T₂*-sensitive gradient echo sequence (voxel size: 2 × 2 × 4 mm, TE = 30.0 ms, TR = 2000 ms, flip angle = 80°) and contained 37 slices of 3.4-mm thickness, with a 0.5-mm gap between slices. Volume acquisition occurred in an interleaved fashion. About 330 volumes were obtained for each of the two functional runs. During acquisition of fMRI time series, heart rate and respiratory data were collected using a breathing belt and a pulse oximeter.

Functional MRI Analysis

In-house software (dicom2usb, http://dicom-port.com/) was used to de-identify all neuroimaging scans. Functional MRI analyses were performed in SPM8 (http://www.fil.ion.ucl.ac.uk/spm/software/spm8/). The preprocessing pipeline included slice-time correction, realignment, coregistration to the T₁-weighted image, movement correction, and normalization to MNI space. For normalization, we used a diffeomorphic registration algorithm (DARTEL; Ashburner 2007) with spatial resolution after normalization 2 × 2 × 2 mm. Data were smoothed with a final Gaussian kernel equivalent to a standard 8 mm (see below). The fMRI time series data were high-pass filtered with a 128-s cut-off, and whitened with an AR(1) model. For each participant, the canonical hemodynamic response function was used to compute their statistical model.

The movement parameters produced by SPM’s coregistration algorithm showed that 15 participants moved >3 mm in any direction during functional runs. To correct for movement artifacts produced as a consequence of this, we used the ArtRepair toolbox (Mazaika et al. 2009; Levy and Wagner 2011). ArtRepair compares the amount of motion between volume acquisitions based on the mean intensity plot of all functional scans, and linearly interpolates scans in which motion exceeds a specified threshold. We used the recommended threshold value of 1.5% deviation from the mean intensity between scans. The average number of interpolated scans for our participants was 12.2 (1.8%) (SD = 19.6 [3.0%]), and one participant was excluded for showing movement >1.0 mm in >25% of scans, in line with ArtRepair’s recommendations. ArtRepair smooths the individual subject data with a Gaussian smoothing kernel of 4 mm before normalization and movement correction. A Gaussian kernel of 7 mm was then used for the normalization to MNI space, resulting in a smoothed, normalized image equivalent to a standard 8-mm smoothed normalized image.

We estimated a first-level general linear model (GLM) to look at activity corresponding to value anticipation in the brain. In this linear model, we parametrically modulated the time of choice by the expected value (Q) that belonged to the chosen option on each trial as calculated by the computational model described above. In addition, the outcome of each trial (whether the trial led to reward receipt or not) was included as a regressor at the time of outcome. This model included several other regressors of no interest to control for motion. These included SPM’s six motion regressors as well as 18 parameters that corrected for physiological noise, which we recorded with a heartbeat detector and a breathing belt during the scanning sessions. These regressors were calculated using the PhysiO toolbox version r671 (https://www.tnu.ethz.ch/en/software/tapas.html).

For each participant, we then calculated a contrast images by weighting the regressor of interest (Q at choice by 1). This contrast image was used at second level to perform a one-sample t-test across all participants. This resulted in a second-level map with a family-wise error (FWE) corrected threshold at P < 0.05, from which we extracted parameter estimates to be used in further analysis. For DWI analysis, the activity cluster in vmPFC at P(uncorrected) < 0.001 was used to facilitate the reconstruction of paths between NAcc and vmPFC.

DWI Preprocessing and Analysis

Diffusion weighted scans were corrected for motion- and current-induced distortions with FSL’s eddy_correct. To further correct for geometric distortions, the images were nonlinearly aligned with the T₁-weighted structural scan (Wu et al. 2008) with the ANTs software (Avants et al. 2011).

Tractograms were generated with the MRtrix software (Tournier et al. 2012) and filtered with the SIFT2 method (Smith et al. 2015), using anatomically constrained probabilistic tractography (Smith et al. 2012). We specified the two inclusion regions of interest (ROIs; vmPFC and NAcc, Supplementary Fig. 1) as binary mask images and accepted only streamlines that traversed both inclusion regions. We sampled until we recovered 100 streamlines between the vmPFC and NAcc ROIs. Subjects were excluded if >200 million streamlines were considered, but less than 20 were selected as probable (n = 11). FA maps were calculated using FSL’s dtifit. The tract formed by the reconstructed streamlines was used to mask the FA image, and the average within the tract became the individual’s measure of vmPFC-accumbens white matter integrity. FA values were thresholded at 0.2.

The inferior frontal fasciculus was selected as a control tract, as this tract is affected by aging (de Schotten et al. 2014), and could therefore serve as an appropriate control tract to ensure that relationships to FA and neural signals or performance were specific to the tract at hand. We also performed a control analysis including the inferior longitudinal fasciculus (Supplementary Table 1).

PET Image Acquisition and Analysis

PET images were acquired on a 690 PET/CT scanner (GE Medical Systems). A low-dose helical CT scan (20 mA, 120 kV, 0.8 s/revolution) was used for PET attenuation correction. In order to minimize head movement during image acquisition, individually fitted thermoplastic masks were used to fixate the participants’ heads (Positocasts Thermoplastic; CIVCO medical solutions). PET scanning started after an intravenous bolus injection of 200 MBq of [¹¹C]SCH23390. At the time of injection, a 55-min dynamic acquisition started (9 × 120 s, 3 × 180 s, 3 × 260 s, and 3 × 300 s), totaling 18 frames. Attenuation- and decay-corrected 256 × 256 pixel transaxial PET images were reconstructed to a 25-cm field-of-view using the Sharp IR algorithm (6 iterations, 25 subsets, 3.0-mm Gaussian post filter). Sharp IR is an advanced version of the Ordered Subset Expectation Maximization method for improving spatial resolution (Ross and Stearns 2010). The full-width half-maximum resolution was 3.2 mm. This protocol resulted in 47 tomographic slices per timeframe, with 0.98 × 0.98 × 3.3 mm³ voxels. Images were decay corrected to the start of the scan.

We used an ROI-based protocol to estimate nondisplaceable binding (BP_ND). BP_ND values were obtained by coregistering the PET time series images to the T₁-weighted MRI images using SPM. From the T₁-weighted images, we segmented subcortical ROIs using the FIRST algorithm as implemented by FSL (Patenaude et al. 2011). Based on our previous publication (de Boer et al. 2017), we were interested in the NAcc. The cerebellum was segmented with the use of Freesurfer’s recon-all algorithm (Desikan et al. 2006) and used as a reference tissue due to the lack of DA D1 receptors in this structure (Hall et al. 1994). The average time activity curves were extracted across all voxels within each ROI. Then, BP_ND was calculated with the use of the Logan method (Logan et al. 1996) as implemented in imlook4d (imlook4d version 3.5, https://sites.google.com/site/imlook4d). BP_ND values were averaged across hemispheres for the NAcc.

It should be noted that [¹¹C]SCH23390 does not only bind to D1-Rs in the brain—it also shows a (albeit much lower) affinity for 5-HT2A receptors (Ekelund et al. 2007; Slifstein et al. 2007). This has been shown to affect binding potentials in the cortex. In the NAcc, this is not as much of an issue, because the number of D1-Rs is many times greater than 5-HT2A receptors. In the cortex, however, 5-HT2A can represent up to 25% of the PET signal recorded with [¹¹C]SCH23390 (Ekelund et al. 2007).

Results

Behavior

A total of 30 older (aged 66–75) and 30 younger (aged 19–32) participants performed a probabilistic reward learning while being scanned with fMRI. A schematic of the task as well as the variable reward probabilities for each bandit across the task are displayed in Figure 1. DWI pathways could only be reconstructed for 22 older and 23 younger participants. The behavioral results for the entire sample have been previously reported (de Boer et al. 2017). For completeness, we present here the behavioral results, both for the entire sample, and the DWI sample only.

Participants earned between 106 and 149 Swedish Crowns on the task (11–16 USD, M = 128.21, SD = 10.00). Older participants performed slightly worse than younger participants in both the fMRI sample (P[one-tailed] = 0.05, Cohen’s d = 0.45, 95% confidence interval [CI]: −0.09 to 0.99) and the DWI subsample (P[one-tailed] = 0.04, Cohen’s d = 0.52, 95% CI: −0.08 to 1.13) (Fig. 2). A more elaborate comparison of performance between age groups is described in de Boer et al. 2017).

Behavioral performance on the TAB for older and younger participants, separately. Young participants performed marginally better than older participants, both in the whole sample (P[one-tailed] = 0.05, left figure, as previously published in de Boer et al. 2017), as well as in the DWI sample (P[one-tailed] = 0.04, right figure).

Previous Findings in the DWI Subsample

Because the sample size in this study is limited compared with the previous publication (de Boer et al. 2017), we first confirmed that our previous findings held in the sample considered here. As in our previous study (de Boer et al. 2017), we used computational modeling to estimate the predicted expected value for each option as participants performed the TAB task (de Boer et al. (2017), see Materials and Methods). We used an GLM approach to look at fMRI activity corresponding to value anticipation in the brain (see Materials and Methods). In this linear model, we estimated the correlates of anticipated value by parametrically modulating the time of choice by the expected value (Q) that belonged to the chosen option on each trial. In addition, the outcome of each trial (whether the trial led to reward receipt or not) was included as a regressor at the time of outcome. The value signal in vmPFC is a reliable anticipatory signal thought to reflect value computation. We chose to focus on this signal, as it can more reliably be detected as a value signal in vmPFC than the expected value component of an RPE signal (Skvortsova et al. 2014).

From the first-level subject beta maps, we created a second-level map with an FWE-corrected threshold at P < 0.05. From this map, we used the vmPFC as a functional region of interest from which we extracted parameter estimates used in further analysis. The areas that were active during value anticipation are displayed in Table 1. We previously found that value-correlated anticipatory activity in vmPFC at the time of choice significantly correlated with the performance on the TAB task as indexed by the total amount of money won (de Boer et al. 2017). This relationship could also be observed in the subsample of our study for whom DWI data were of sufficient quality for tractography analysis [r(44) = 0.51, P < 0.001, 95% CI: 0.26–0.70]. This correlation survived correction for age [r(43) = 0.45, P = 0.002, 95% CI: 0.18–0.65] and was similar for the alternative measures of performance such as the percentage of optimal switches and the percentage of optimal choices (Supplementary Tables 2 and 3).

Table 1.

Coordinates of clusters responsive to Q at the time of choice

Region	x	y	z	Cluster size	z score	P(FWE-corr, cluster)	P(FWE-corr, peak)
Left precuneus	−22	−52	12	1845	6.14	<0.001	<0.001
Right precuneus	12	−52	16	NA	5.62		<0.001
Right hippocampus	34	−36	−4	121	5.49	<0.001	0.001
vmPFC	−2	50	−8	187	5.44	<0.001	0.001
Right cuneus	12	−80	26	82	5.01	0.001	0.008

Open in a new tab

We also confirmed in the DWI sample that anticipatory value-related activity was correlated with DA D1-R BP_ND in NAcc. Thus, we correlated this anticipatory activity with D1-R BP_ND in NAcc in the entire sample, as well as in the DWI sample. Bivariate correlations were significant in the entire sample [r(54) = 0.41, P = 0.001, 95% CI: 0.17–0.61], as well as in the DWI sample [r(43) = 0.40, P = 0.006, Table 4]. When entered into a multiple regression with age, D1_R BP_ND in NAcc was the only significant predictor of the value signal in vmPFC in the entire sample (P = 0.04, 95% CI of standardized beta weight: 0.02–0.83, for age: 95% CI: −0.39 to 0.41). This predictor was, however, not significant in the DWI sample only (P = 0.10, 95% CI of standardized beta weight: −0.07 to 0.83, for age: 95% CI: −0.48 to 0.42).

Table 4.

Univariate and multivariate standardized coefficients (95% CIs) predicting the number of wins on the TAB from D1-R BPND in NAcc, FA in the connection between NAcc and vmPFC, and the expected-value signal in vmPFC

Dependent: wins	Coefficient (univariate)	Coefficient (multivariate)	VIF
Age	−0.29 (−0.58 to 0.00, P = 0.053)	−0.33 (−0.82 to 0.16, P = 0.176)	3.268340548
D1-R in NAcc	0.23 (−0.08 to 0.53, P = 0.138)	−0.20 (−0.64 to 0.24, P = 0.373)	2.670655726
vmPFC-accumbens FA	0.26 (−0.03 to 0.55, P = 0.081)	−0.11 (−0.49 to 0.27, P = 0.550)	1.973339025
Q in vmPFC	0.51 (0.25–0.77, P < 0.001)	0.54 (0.22–0.86, P = 0.002)	1.442440738

Open in a new tab

The expected-value signal is the only significant predictor of behavior in a multiple regression model. The VIF is a measure of how inflated the variance of each predictor is due to multicollinearity of the predictors. VIFs in this model are below 10, which is considered within acceptable range. Univariate coefficients represent the result of bivariate correlations whereas multivariate coefficients represent multiple regression coefficients of a model including all predictors.

Relationship Between White-Matter Integrity, Behavior, D1 BP_ND, and Neural Correlates of Value Anticipation

Next, we tested the hypothesis that vmPFC-accumbens white matter integrity was correlated to the anticipatory value signal in vmPFC. We performed tractography analysis to reconstruct the pathway between the NAcc ROI, which was used to obtain BP_ND estimates, and the vmPFC ROI in which we saw value anticipatory activity. We used fractional anisotropy (FA) in this pathway as the measure of interest, in line with previous work (Samanez-Larkin et al. (2012)). White-matter integrity in this pathway was significantly different between younger and older participants [M_young (SD) = 0.34 (0.03), M_old (SD) = 0.31 (0.02), P < 0.001, Cohen’s d = 1.43]. We correlated these FA values to the value-anticipatory activity in vmPFC. This correlation proved significant [r(44) = 0.48, P = 0.001, 95% CI: 0.22–0.68, Fig. 3]. When entered in a multiple regression, only FA, not age, was a significant predictor of value-anticipatory activity in vmPFC and survived correction for age (P = 0.01, 95% CI of standardized beta weight: 0.12–0.79, for age: 95% CI: −0.39 to 0.28).

Observed relationships (bivariate correlations) between variables investigated in this study. Age is related to both a lower D1-R BP_ND, and lower FA in the connection between vmPFC and NAcc. Both of these variables predict the expected-value signal in vmPFC during choice. This expected-value signal proved important for performance on value-based decision-making. Numbers and CIs on the lines indicate bivariate correlations and their 95% CI. The multiple regression coefficients for these relationships can be found in Tables 2 and 4. Note that in a multiple regression, D1-R BP_ND in NAcc did not survive as a significant predictor, possibly due to a lack of statistical power in the reduced sample, or as the result of a type 1 error.

We next investigated our hypotheses combining all predictors of anticipated value in vmPFC. We tested the hypothesis that FA in the vmPFC-accumbens tract would mediate the relationship between D1-R availability in NAcc and the strength of the value anticipation signal in vmPFC. Alternatively, we predicted that FA in the vmPFC-accumbens tract and D1-R availability in NAcc could independently predict the strength of value anticipation signals in vmPFC. To test these hypotheses, we first correlated FA in the vmPFC-accumbens tract with D1-R availability in NAcc. In line with a full mediation hypothesis, these variables would have to be correlated, and their correlation would be expected to cancel out the relationship between D1 receptor availability and the strength of the expected value signal in the vmPFC. However, these variables were not significantly correlated (r = 0.14, P = 0.40). Thus, we observed no evidence supporting a mediatory relationship between FA in the vmPFC-accumbens tract and D1-R availability in the NAcc.

In order to test the hypothesis that the two could independently predict the strength of the expected value signal in the vmPFC, we performed a multiple linear regression to investigate how D1-R BP_ND and FA in the vmPFC-accumbens pathway were related to the anticipated value signal in vmPFC. The results of this multiple regression analysis are displayed in Table 2. Our result was in contrast with the hypothesis that predicted a mediation of the relationship between D1-R BP_ND in NAcc and anticipatory value signal in vmPFC by vmPFC-accumbens FA. Instead, it supported the hypothesis that both BP_ND in NAcc and vmPFC-accumbens FA were independent predictors of the anticipated value signal in vmPFC (Table 2; Inline graphic =0.30, P = 0.211, = 0.41, P = 0.052, = 0.49, P = 0.006, Table 2). It should be pointed out that the significance of D1-R BP_ND in NAcc as a predictor was at what is sometimes referred to as trend level. Given the previously observed significant relationship between these variables, and the reduced sample size in this study, we choose to present this relationship as one contributing significant variance, rather than being nonsignificant. If we assumed that the effect size of the relationship between D1-R in NAcc and Q in vmPFC is at the small-to-medium effect size of 0.3 as we reported in de Boer et al. (2017)), we had significantly reduced power in the subsample reported here to detect that relationship. However, the limited sample size of this analysis should also caution against the over interpretation of this effect: we cannot exclude the possibility that this relationship is a type 1 error. The addition of another predictor of white matter integrity did not change the significance of these predictors (inferior frontal fasciculus, Supplementary Table 1), suggesting that this relationship is specific to the white matter pathway that we investigated here. The model with both FA and D1-R BP_ND in NAcc proved superior in predicting the expected-value signal in vmPFC compared with a model with FA or D1-R BP_ND as a single predictor beyond age (Table 3).

Table 2.

Univariate and multivariate standardized coefficients (95% CIs) predicting the expected-value signal in vmPFC

Dependent: Q in vmPFC	Coefficient (univariate)	Coefficient (multivariate)
Age	−0.32 (−0.61 to −0.03, P = 0.030)	0.30 (−0.17 to 0.76, P = 0.211)
D1-R in NAcc	0.41 (0.12 to 0.69, P = 0.006)	0.41 (−0.00 to 0.82, P = 0.052)
vmPFC-accumbens FA	0.48 (0.22 to 0.75, P = 0.001)	0.49 (0.15 to 0.83, P = 0.006)

Open in a new tab

Both D1-R BP_ND and FA in the connection between NAcc and vmPFC independently predicted the strength of this value signal. It should be noted that we present these effects as independent here, based on the previously reported significant relationship between BP_ND in NAcc and the expected-value signal in vmPFC. However, this relationship exceeded the threshold for significance when entered into the regression with age and FA in NAcc-vmPFC. Univariate coefficients represent the result of bivariate correlations, whereas multivariate coefficients represent multiple regression coefficients of a model including all predictors.

Table 3.

Model comparison demonstrating that a model predicting value anticipation in vmPFC with both D1-R in NAcc and vmPFC-accumbens FA is superior to a model with one of the predictors only

Model	BIC	Adjusted R-squared
Age + D1	135	0.122
Age + FA	131	0.203
Age + D1 + FA	130	0.256

Open in a new tab

Finally, given previous findings by Samanez-Larkin et al. (2012), we wanted to understand how vmPFC-accumbens white matter integrity influenced the relationship between the anticipatory value signal in vmPFC and performance on the TAB task. Therefore, we investigated the relationship between all of these variables and performance on the TAB task. Table 4 shows the univariate and multivariate estimates of the relationships between value anticipation in vmPFC, D1-R BP_ND in NAcc, and the amount of money won on the task. Univariate coefficients represent the result of bivariate correlations, whereas multivariate coefficients represent multiple regression coefficients of a model including all predictors. The table also shows the VIF for each predictor variable. VIF is a measure of how inflated the value of a predictor is in a model as a result of multicollinearity between the predictors. A rule of thumb states that a VIF > 10 is cause for further investigation (O’brien 2007). However, in our model, no VIF exceed 3.3 (Table 4). In a multiple regression, only the anticipatory value signal in vmPFC proved the predictive of performance on the TAB (Table 4; Inline graphic = −0.33, P = 0.176; = −0.20, P = 0.373; = −0.11, P = 0.550; = 0.54, P = 0.002). Other measures reported in de Boer et al. (2017)), such as the proportion of optimal choices and optimal switch behavior, showed similar relationships to these neural predictors, with Q in vmPFC as the only significant predictor (Supplementary Tables 2 and 3). This is in contrast with our hypothesis, which predicted that we would observe a direct relationship between vmPFC-accumbens FA and behavioral performance. In Figure 3, we summarize the observed relationships.

Discussion

We showed in a sample of 23 young and 22 older participants that the strength of a anticipatory value signal in vmPFC is predicted by: 1) DA D1-R BP_ND in NAcc and 2) vmPFC-accumbens white matter integrity. The anticipatory value signal in vmPFC is an important predictor of good performance on the probabilistic reward learning task used in this study. Although DA D1-R BP_ND in NAcc and vmPFC-accumbens white matter integrity did not directly predict the performance on the task, these new results suggest that both measures of corticostriatal integrity are crucial for the emergence of the value anticipatory signal.

Our findings are in line with previous studies showing that frontostriatal white matter integrity is important for value-based decision-making. Specifically, one study by Samanez-Larkin et al. (2012) showed that the integrity of white matter on the pathway between NAcc and medial PFC could predict the performance on a probabilistic monetary incentive learning task. This relationship between performance and white matter integrity survived correction for age. Similarly, a study by van de Vijver et al. (2016) has shown that some, but not all parameters reflecting integrity in frontostriatal white matter, were related to the measures of probabilistic reward learning. Frontostriatal white matter integrity was also found to be predictive of the development of delay of gratification in a longitudinal study with adolescents (Achterberg et al. 2016), suggesting that good decision-making and frontostriatal white matter integrity go hand in hand.

We previously reported that DA D1-R availability in NAcc was a significant predictor for the anticipatory value signal in vmPFC. DA receptors in the NAcc have often been implicated in successful probabilistic reward-learning (Koch et al. 2000; Salamone and Correa 2012; Shiner et al. 2012; Chowdhury et al. 2013a). It is believed that DA neurons report reward prediction errors (Schultz et al. 1997; Day et al. 2007) to target structures such as the NAcc. These dopaminergic signals in NAcc appear to be a crucial hallmark of learning (Pessiglione et al. 2006; Jocham et al. 2011) and are also necessary to continue making good decisions based on learned values (Shiner et al. 2012; Collins and Frank 2014).

A limitation of this study is that we could not reconstruct white-matter pathways in 11 out of 57 participants and, thus, lost a considerable amount of power in detecting relationships between DA, FA, and Q in vmPFC. Despite this, we could replicate the previously observed relationship between performance and value anticipation in vmPFC in the DWI subsample (de Boer et al. 2017). The two predictors we found could each explain variance in the multiple regression predicting activity in vmPFC, although the previously observed relationship between DA D1-R availability and activity in vmPFC proved fragile once the sample size was reduced. We believe that this may be related to our inability to detect this relationship with small-to-medium effect size in a small sample like the one used here. However, we cannot exclude the possibility of this relationship constituting a type 1 error. This relationship should, therefore, be replicated in an independent sample to ensure that this is not a false positive result. If this relationship does exist, this suggests that both high dopaminergic integrity in NAcc as well as high integrity in relevant white-matter pathways, contribute to a strong value signal and subsequently good performance. The fact that DA D1-R availability and vmPFC-accumbens white matter integrity predict the strength of the value anticipation signal in vmPFC is in line with the well-established theory that decision-making and reward learning are dependent on corticostriatal loops (Seger 2009; Haber 2016). Activity in these loops is modulated by dopaminergic signals that project from the midbrain to the striatum (Haber and Knutson 2010). This dopaminergic modulation allows for the emergence of value signals in prefrontal cortex. Computational evidence suggests that striatal D1 receptors, specifically, play an important role in this iterative gating process (Gruber et al. 2006).

Although we did not observe any direct relationship between frontostriatal white matter integrity and behavioral performance, our results suggest an indirect relationship. Frontostriatal white matter integrity predicted value anticipation in the vmPFC which in turn predicted behavioral performance on the TAB task. Value anticipation signals in vmPFC have consistently been shown to be crucial for the ability to perform probabilistic reward learning tasks (Noonan et al. 2010; Bartra et al. 2013; Halfmann et al. 2016), as it is the most flexible brain region when it comes to quick value computation (Haber 2016), with computations occurring just before or during an action. Our results suggest that the age-related attenuation in the anticipatory value signal may be in part attributed to the decreased integrity of the frontostriatal tract associated with older age. The fact that we did not observe a direct relationship between frontostriatal white matter integrity and behavioral performance may stem from the design of our task. Whereas previous studies have used tasks with stationary probabilistic contingencies (Samanez-Larkin et al. 2012, 2014; Eppinger et al. 2013), the reward probabilities for each stimulus fluctuated according to a random walk. Whereas our task design may maximize the occurrence of prediction errors during probabilistic reward learning, it also promotes the exploration of unchosen options which may additionally depend on frontal mechanisms unrelated to the frontostriatal path that we studied here (Boorman et al. 2009, 2011).

The exploration involved in this task may also provide an explanation for the marginal performance difference between older and younger participants. Some previous studies have shown that older adults perform on average somewhat, but not dramatically, worse than younger participants on value-based decision-making tasks (Samanez-Larkin et al. 2008, 2014; Lighthall et al. 2018). This difference is usually larger when the task involves probabilistic decisions that require participants to learn to update behavior based on changing reward contingencies (Mell et al. 2005; Mata et al. 2011; Worthy and Maddox 2012), as compared with decision-making tasks where learning is not required. Here, we only found marginal differences on a probabilistic learning task. This lack of behavioral difference in our and other studies is mirrored by a lack of difference between these older and younger participants in the strength of neural signals reflecting reward prediction errors in the NAcc (de Boer et al. 2017; Lighthall et al. 2018). Differential exploration between the two age groups and the relatively small sample size may provide an explanation for a relatively behavioral difference between age groups. Additionally, the older adults in this study are cognitively high functioning, with MMSE scores above 26.

We present an important contribution to the mechanistic understanding of decision-making in probabilistic environments. Despite the fact that our sample size is small for evaluating a mediation with five variables, our observations demonstrate two separate predictors of the integrity of corticostriatal circuitry, which may both contribute to the emergence of strong anticipatory value signals important for successful decision-making. These findings, taken together, provide insights into how age-related decay in the integrity of frontostriatal white matter as well as dopamine D1 receptor availability in the NAcc may underlie an impairment in probabilistic reward learning performance in some older adults.

Supplementary Material

suppfig1_bhaa104

Click here for additional data file.^{(138.6KB, pdf)}

suppfig2_bhaa104

Click here for additional data file.^{(34.7KB, pdf)}

extended_data_bhaa104

Click here for additional data file.^{(96KB, pdf)}

Acknowledgements

We thank Mats Erikson and Kajsa Burström for collecting the data. The study was accomplished while L.N. was holding the Söderberg’s Professorship in Medicine from Torsten and Ragnar Söderberg’s Foundation.

Funding

Swedish Research Council (VR521-2013-2589 to M.G.-M.); Humboldt Research Award (to L.B.); af Jochnick Foundation (L.B.).

Data availability

The code and processed behavioral and neural data used to create the figures in this paper are available at https://github.com/liekelotte/DWI.

References

Achterberg M, Peper JS, van Duijvenvoorde ACK, Mandl RCW, Crone EA. 2016. Frontostriatal white matter integrity predicts development of delay of gratification: a longitudinal study. J Neurosci. 36:1954–1961. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ashburner J. 2007. A fast diffeomorphic image registration algorithm. NeuroImage. 38:95–113. [DOI] [PubMed] [Google Scholar]
Avants BB, Tustison NJ, Song G, Cook PA, Klein A, Gee JC. 2011. A reproducible evaluation of ANTs similarity metric performance in brain image registration. NeuroImage. 54:2033–2044. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bartra O, McGuire JT, Kable JW. 2013. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. NeuroImage. 76:412–427. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bäckman L, Ginovart N, Dixon RA, Wahlin TB, Wahlin A, Halldin C, Farde L. 2000. Age-related cognitive deficits mediated by changes in the striatal dopamine system. Am J Psychiatry. 157:635–637. [DOI] [PubMed] [Google Scholar]
Bäckman L, Lindenberger U, Li S-C, Nyberg L. 2010. Linking cognitive aging to alterations in dopamine neurotransmitter functioning: recent data and future avenues. Neurosci Biobehav Rev. 34:670–677. [DOI] [PubMed] [Google Scholar]
Bennett IJ, Greenia DE, Maillard P, Sajjadi SA, DeCarli C, Corrada MM, Kawas CH. 2017. Age-related white matter integrity differences in oldest-old without dementia. Neurobiol Aging. 56:108–114. [DOI] [PMC free article] [PubMed] [Google Scholar]
Boorman ED, Behrens TEJ, Woolrich MW, Rushworth MFS. 2009. How green is the grass on the other side? Frontopolar cortex and the evidence in favor of alternative courses of action. Neuron. 62:733–743. [DOI] [PubMed] [Google Scholar]
Boorman ED, Behrens TE, Rushworth MF. 2011. Counterfactual choice and learning in a neural network centered on human lateral frontopolar cortex. PLoS Biol. 9:e1001093. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bornstein A, Daw ND. 2011. Multiplicity of control in the basal ganglia: computational roles of striatal subregions. Curr Opin Neurobiol. 21:374–380. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chowdhury R, Guitart-Masip M, Lambert C, Dayan P, Huys Q, Düzel E, Dolan RJ. 2013a. Dopamine restores reward prediction errors in old age. Nat Neurosci. 16:648–653. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chowdhury R, Guitart-Masip M, Lambert C, Dolan RJ, Düzel E. 2013b. Structural integrity of the substantia nigra and subthalamic nucleus predicts flexibility of instrumental learning in older-age individuals. Neurobiol Aging. 34:2261–2270. [DOI] [PMC free article] [PubMed] [Google Scholar]
Collins AGE, Frank MJ. 2014. Opponent actor learning (OpAL): modeling interactive effects of striatal dopamine on reinforcement learning and choice incentive. Psychol Rev. 121:337–366. [DOI] [PubMed] [Google Scholar]
Daw ND, Doya K. 2006. The computational neurobiology of learning and reward. Curr Opin Neurobiol. 16:199–204. [DOI] [PubMed] [Google Scholar]
Day JJ, Roitman MF, Wightman RM, Carelli RM. 2007. Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nat Neurosci. 10:1020. [DOI] [PubMed] [Google Scholar]
de Boer L, Axelsson J, Riklund K, Nyberg L, Dayan P, Bäckman L, Guitart-Masip M. 2017. Attenuation of dopamine-modulated prefrontal value signals underlies probabilistic reward learning deficits in old age. elife. 6:e26424. [DOI] [PMC free article] [PubMed] [Google Scholar]
Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, Buckner RL, Dale AM, Maguire RP, Hyman BT, et al. . 2006. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. NeuroImage. 31:968–980. [DOI] [PubMed] [Google Scholar]
Ekelund J, Slifstein M, Narendran R, Guillin O, Belani H, Guo N-N, Hwang Y, Hwang D-R, Abi-Dargham A, Laruelle M. 2007. In vivo DA D1 receptor selectivity of NNC 112 and SCH 23390. Mol Imaging Biol. 9:117–125. [DOI] [PubMed] [Google Scholar]
Eppinger B, Schuck NW, Nystrom LE, Cohen JD. 2013. Reduced striatal responses to reward prediction errors in older compared with younger adults. J Neurosci. 33:9905–9912. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gruber AJ, Dayan P, Gutkin BS, Solla SA. 2006. Dopamine modulation in the basal ganglia locks the gate to working memory. J Comput Neurosci. 20:153. [DOI] [PubMed] [Google Scholar]
Haber SN. 2016. Corticostriatal circuitry. Dialogues Clin Neurosci. 18:7–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
Haber SN, Behrens TEJ. 2014. The neural network underlying incentive-based learning: implications for interpreting circuit disruptions in psychiatric disorders. Neuron. 83:1019–1039. [DOI] [PMC free article] [PubMed] [Google Scholar]
Haber SN, Knutson B. 2010. The reward circuit: linking primate anatomy and human imaging. Neuropsychopharmacology. 35:4–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
Halfmann K, Hedgcock W, Kable J, Denburg NL. 2016. Individual differences in the neural signature of subjective value among older adults. Soc Cogn Affect Neurosci. 11:1111–1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hall H, Sedvall G, Magnusson O, Kopp J, Halldin C, Farde L. 1994. Distribution of D1- and D2-dopamine receptors, and dopamine and its metabolites in the human brain. Neuropsychopharmacology. 11:245–256. [DOI] [PubMed] [Google Scholar]
Jocham G, Klein TA, Ullsperger M. 2011. Dopamine-mediated reinforcement learning signals in the striatum and ventromedial prefrontal cortex underlie value-based choices. J Neurosci. 31:1606–1613. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kaiser RH, Treadway MT, Wooten DW, Kumar P, Goer F, Murray L, Beltzer M, Pechtel P, Whitton A, Cohen AL, et al. . 2018. Frontostriatal and dopamine markers of individual differences in reinforcement learning: a multi-modal investigation. Cereb Cortex. 28:4281–4290. [DOI] [PMC free article] [PubMed] [Google Scholar]
Koch M, Schmid A, Schnitzler HU. 2000. Role of muscles accumbens dopamine D1 and D2 receptors in instrumental and Pavlovian paradigms of conditioned reward. Psychopharmacology. 152:67–73. [DOI] [PubMed] [Google Scholar]
Levy BJ, Wagner AD. 2011. Cognitive control and right ventrolateral prefrontal cortex: reflexive reorienting, motor inhibition, and action updating. Ann N Y Acad Sci. 1224:40–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lighthall NR, Pearson JM, Huettel SA, Cabeza R. 2018. Feedback-based learning in aging: contributions and trajectories of change in striatal and hippocampal systems. J Neurosci. 38:8453–8462. [DOI] [PMC free article] [PubMed] [Google Scholar]
Logan J, Fowler JS, Volkow ND, Wang G-J, Ding Y-S, Alexoff DL. 1996. Distribution volume ratios without blood sampling from graphical analysis of PET data. J Cereb Blood Flow Metab. 16:834–840. [DOI] [PubMed] [Google Scholar]
Mata R, Josef AK, Samanez-Larkin GR, Hertwig R. 2011. Age differences in risky choice: a meta-analysis. Ann N Y Acad Sci. 1235:18–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mazaika PK, Hoeft F, Glover GH, Reiss AL. 2009. Methods and software for fMRI analysis for clinical subjects In: Presentation at the 15th Annual Meeting of the Organization for Human Brain Mapping.
Mell T, Heekeren HR, Marschner A, Wartenburger I, Villringer A, Reischies FM. 2005. Effect of aging on stimulus-reward association learning. Neuropsychologia. 43:554–563. [DOI] [PubMed] [Google Scholar]
Noonan MP, Walton ME, Behrens TEJ, Sallet J, Buckley MJ, Rushworth MFS. 2010. Separate value comparison and learning mechanisms in macaque medial and lateral orbitofrontal cortex. PNAS. 107:20547–20552. [DOI] [PMC free article] [PubMed] [Google Scholar]
O’brien RM. 2007. A caution regarding rules of thumb for variance inflation factors. Qual Quant. 41:673–690. [Google Scholar]
Patenaude B, Smith SM, Kennedy DN, Jenkinson M. 2011. A Bayesian model of shape and appearance for subcortical brain segmentation. NeuroImage. 56:907–922. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pessiglione M, Seymour B, Flandin G, Dolan RJ, Frith CD. 2006. Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature. 442:1042–1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
Raz N, Lindenberger U, Rodrigue KM, Kennedy KM, Head D, Williamson A, Dahle C, Gerstorf D, Acker JD. 2005. Regional brain changes in aging healthy adults: general trends, individual differences and modifiers. Cereb Cortex. 15:1676–1689. [DOI] [PubMed] [Google Scholar]
Rieckmann A, Karlsson S, Karlsson P, Brehmer Y, Fischer H, Farde L, Nyberg L, Bäckman L. 2011. Dopamine D1 receptor associations within and between dopaminergic pathways in younger and elderly adults: links to cognitive performance. Cereb Cortex. 21:2023–2032. [DOI] [PubMed] [Google Scholar]
Ross S, Stearns C. 2010. SharpIR. White Paper. [Google Scholar]
Salamone JD, Correa M. 2012. The mysterious motivational functions of mesolimbic dopamine. Neuron. 76:470–485. [DOI] [PMC free article] [PubMed] [Google Scholar]
Samanez-Larkin GR, Hollon NG, Carstensen LL, Knutson B. 2008. Individual differences in insular sensitivity during loss anticipation predict avoidance learning. Psychol Sci. 19:320–323. [DOI] [PMC free article] [PubMed] [Google Scholar]
Samanez-Larkin GR, Knutson B. 2015. Decision making in the ageing brain: changes in affective and motivational circuits. Nat Rev Neurosci. 16:278–289. [DOI] [PMC free article] [PubMed] [Google Scholar]
Samanez-Larkin GR, Levens SM, Perry LM, Dougherty RF, Knutson B. 2012. Frontostriatal white matter integrity mediates adult age differences in probabilistic reward learning. J Neurosci. 32:5333–5337. [DOI] [PMC free article] [PubMed] [Google Scholar]
Samanez-Larkin GR, Worthy DA, Mata R, McClure SM, Knutson B. 2014. Adult age differences in frontostriatal representation of prediction error but not reward outcome. Cogn Affect Behav Neurosci. 14:672–682. [DOI] [PMC free article] [PubMed] [Google Scholar]
de Schotten MT, Rojkova K, Volle E, Urbanski M, Humbert F, Dell’Acqua F, de Schotten MT. 2014. A selective ageing effect on the frontal lobe connections. Alzheimers Dement. 10:P37–P38. [Google Scholar]
Schultz W, Dayan P, Montague PR. 1997. A neural substrate of prediction and reward. Science. 275:1593–1599. [DOI] [PubMed] [Google Scholar]
Seger CA. 2009. The involvement of corticostriatal loops in learning across tasks, species, and methodologies In: Groenewegen HJ, Voorn P, Berendse HW, Mulder AB, Cools AR, editors. The Basal Ganglia IX. Advances in Behavioral Biology. New York: Springer, pp. 25–39. [Google Scholar]
Seger CA, Peterson EJ, Cincotta CM, Lopez-Paniagua D, Anderson CW. 2010. Dissociating the contributions of independent corticostriatal systems to visual categorization learning through the use of reinforcement learning modeling and granger causality modeling. NeuroImage. 50:644–656. [DOI] [PMC free article] [PubMed] [Google Scholar]
Shiner T, Seymour B, Wunderlich K, Hill C, Bhatia KP, Dayan P, Dolan RJ. 2012. Dopamine and performance in a reinforcement learning task: evidence from Parkinson’s disease. Brain. 135:1871–1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
Skvortsova V, Palminteri S, Pessiglione M. 2014. Learning to minimize efforts versus maximizing rewards: computational principles and neural correlates. J Neurosci. 34:15621–15630. [DOI] [PMC free article] [PubMed] [Google Scholar]
Slifstein M, Kegeles LS, Gonzales R, Frankle WG, Xu X, Laruelle M, Abi-Dargham A. 2007. [11C]NNC 112 selectivity for dopamine D1 and serotonin 5-HT2A receptors: a PET study in healthy human subjects. J Cereb Blood Flow Metab. 27:1733–1741. [DOI] [PubMed] [Google Scholar]
Smith RE, Tournier J-D, Calamante F, Connelly A. 2012. Anatomically-constrained tractography: improved diffusion MRI streamlines tractography through effective use of anatomical information. NeuroImage. 62:1924–1938. [DOI] [PubMed] [Google Scholar]
Smith RE, Tournier J-D, Calamante F, Connelly A. 2015. SIFT2: enabling dense quantitative assessment of brain white matter connectivity using streamlines tractography. NeuroImage. 119:338–351. [DOI] [PubMed] [Google Scholar]
Smittenaar P, Kurth-Nelson Z, Mohammadi S, Weiskopf N, Dolan RJ. 2017. Local striatal reward signals can be predicted from corticostriatal connectivity. NeuroImage. 159:9–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tournier J-D, Calamante F, Connelly A. 2012. MRtrix: diffusion tractography in crossing fiber regions. Int J Imaging Syst Technol. 22:53–66. [Google Scholar]
van de Vijver I, Ridderinkhof KR, Harsay H, Reneman L, Cavanagh JF, Buitenweg JIV, Cohen MX. 2016. Frontostriatal anatomical connections predict age- and difficulty-related differences in reinforcement learning. Neurobiol Aging. 46:1–12. [DOI] [PubMed] [Google Scholar]
Worthy DA, Maddox WT. 2012. Age-based differences in strategy use in choice tasks. Front Neurosci. 5:145. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wu M, Chang LC, Walker L, Lemaitre H, Barnett AS, Marenco S, Pierpaoli C. 2008. Comparison of EPI distortion correction methods in diffusion tensor MRI using a novel framework. Med Image Comput Comput Assist Interv. 11:321–329. [DOI] [PMC free article] [PubMed] [Google Scholar]
Yang AC, Tsai S-J, Liu M-E, Huang C-C, Lin C-P. 2016. The association of aging with white matter integrity and functional connectivity hubs. Front Aging Neurosci. 8:143. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

suppfig1_bhaa104

Click here for additional data file.^{(138.6KB, pdf)}

suppfig2_bhaa104

Click here for additional data file.^{(34.7KB, pdf)}

extended_data_bhaa104

Click here for additional data file.^{(96KB, pdf)}

Data Availability Statement

The code and processed behavioral and neural data used to create the figures in this paper are available at https://github.com/liekelotte/DWI.

[ref1] Achterberg M, Peper JS, van Duijvenvoorde ACK, Mandl RCW, Crone EA. 2016. Frontostriatal white matter integrity predicts development of delay of gratification: a longitudinal study. J Neurosci. 36:1954–1961. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref2] Ashburner J. 2007. A fast diffeomorphic image registration algorithm. NeuroImage. 38:95–113. [DOI] [PubMed] [Google Scholar]

[ref3] Avants BB, Tustison NJ, Song G, Cook PA, Klein A, Gee JC. 2011. A reproducible evaluation of ANTs similarity metric performance in brain image registration. NeuroImage. 54:2033–2044. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref4] Bartra O, McGuire JT, Kable JW. 2013. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. NeuroImage. 76:412–427. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref5] Bäckman L, Ginovart N, Dixon RA, Wahlin TB, Wahlin A, Halldin C, Farde L. 2000. Age-related cognitive deficits mediated by changes in the striatal dopamine system. Am J Psychiatry. 157:635–637. [DOI] [PubMed] [Google Scholar]

[ref6] Bäckman L, Lindenberger U, Li S-C, Nyberg L. 2010. Linking cognitive aging to alterations in dopamine neurotransmitter functioning: recent data and future avenues. Neurosci Biobehav Rev. 34:670–677. [DOI] [PubMed] [Google Scholar]

[ref7] Bennett IJ, Greenia DE, Maillard P, Sajjadi SA, DeCarli C, Corrada MM, Kawas CH. 2017. Age-related white matter integrity differences in oldest-old without dementia. Neurobiol Aging. 56:108–114. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref8] Boorman ED, Behrens TEJ, Woolrich MW, Rushworth MFS. 2009. How green is the grass on the other side? Frontopolar cortex and the evidence in favor of alternative courses of action. Neuron. 62:733–743. [DOI] [PubMed] [Google Scholar]

[ref9] Boorman ED, Behrens TE, Rushworth MF. 2011. Counterfactual choice and learning in a neural network centered on human lateral frontopolar cortex. PLoS Biol. 9:e1001093. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref10] Bornstein A, Daw ND. 2011. Multiplicity of control in the basal ganglia: computational roles of striatal subregions. Curr Opin Neurobiol. 21:374–380. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref11] Chowdhury R, Guitart-Masip M, Lambert C, Dayan P, Huys Q, Düzel E, Dolan RJ. 2013a. Dopamine restores reward prediction errors in old age. Nat Neurosci. 16:648–653. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref12] Chowdhury R, Guitart-Masip M, Lambert C, Dolan RJ, Düzel E. 2013b. Structural integrity of the substantia nigra and subthalamic nucleus predicts flexibility of instrumental learning in older-age individuals. Neurobiol Aging. 34:2261–2270. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref13] Collins AGE, Frank MJ. 2014. Opponent actor learning (OpAL): modeling interactive effects of striatal dopamine on reinforcement learning and choice incentive. Psychol Rev. 121:337–366. [DOI] [PubMed] [Google Scholar]

[ref14] Daw ND, Doya K. 2006. The computational neurobiology of learning and reward. Curr Opin Neurobiol. 16:199–204. [DOI] [PubMed] [Google Scholar]

[ref15] Day JJ, Roitman MF, Wightman RM, Carelli RM. 2007. Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nat Neurosci. 10:1020. [DOI] [PubMed] [Google Scholar]

[ref16] de Boer L, Axelsson J, Riklund K, Nyberg L, Dayan P, Bäckman L, Guitart-Masip M. 2017. Attenuation of dopamine-modulated prefrontal value signals underlies probabilistic reward learning deficits in old age. elife. 6:e26424. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref17] Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, Buckner RL, Dale AM, Maguire RP, Hyman BT, et al. . 2006. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. NeuroImage. 31:968–980. [DOI] [PubMed] [Google Scholar]

[ref18] Ekelund J, Slifstein M, Narendran R, Guillin O, Belani H, Guo N-N, Hwang Y, Hwang D-R, Abi-Dargham A, Laruelle M. 2007. In vivo DA D1 receptor selectivity of NNC 112 and SCH 23390. Mol Imaging Biol. 9:117–125. [DOI] [PubMed] [Google Scholar]

[ref19] Eppinger B, Schuck NW, Nystrom LE, Cohen JD. 2013. Reduced striatal responses to reward prediction errors in older compared with younger adults. J Neurosci. 33:9905–9912. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref20] Gruber AJ, Dayan P, Gutkin BS, Solla SA. 2006. Dopamine modulation in the basal ganglia locks the gate to working memory. J Comput Neurosci. 20:153. [DOI] [PubMed] [Google Scholar]

[ref21] Haber SN. 2016. Corticostriatal circuitry. Dialogues Clin Neurosci. 18:7–21. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref22] Haber SN, Behrens TEJ. 2014. The neural network underlying incentive-based learning: implications for interpreting circuit disruptions in psychiatric disorders. Neuron. 83:1019–1039. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref23] Haber SN, Knutson B. 2010. The reward circuit: linking primate anatomy and human imaging. Neuropsychopharmacology. 35:4–26. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref24] Halfmann K, Hedgcock W, Kable J, Denburg NL. 2016. Individual differences in the neural signature of subjective value among older adults. Soc Cogn Affect Neurosci. 11:1111–1120. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref25] Hall H, Sedvall G, Magnusson O, Kopp J, Halldin C, Farde L. 1994. Distribution of D1- and D2-dopamine receptors, and dopamine and its metabolites in the human brain. Neuropsychopharmacology. 11:245–256. [DOI] [PubMed] [Google Scholar]

[ref26] Jocham G, Klein TA, Ullsperger M. 2011. Dopamine-mediated reinforcement learning signals in the striatum and ventromedial prefrontal cortex underlie value-based choices. J Neurosci. 31:1606–1613. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref27] Kaiser RH, Treadway MT, Wooten DW, Kumar P, Goer F, Murray L, Beltzer M, Pechtel P, Whitton A, Cohen AL, et al. . 2018. Frontostriatal and dopamine markers of individual differences in reinforcement learning: a multi-modal investigation. Cereb Cortex. 28:4281–4290. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref28] Koch M, Schmid A, Schnitzler HU. 2000. Role of muscles accumbens dopamine D1 and D2 receptors in instrumental and Pavlovian paradigms of conditioned reward. Psychopharmacology. 152:67–73. [DOI] [PubMed] [Google Scholar]

[ref29] Levy BJ, Wagner AD. 2011. Cognitive control and right ventrolateral prefrontal cortex: reflexive reorienting, motor inhibition, and action updating. Ann N Y Acad Sci. 1224:40–62. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref30] Lighthall NR, Pearson JM, Huettel SA, Cabeza R. 2018. Feedback-based learning in aging: contributions and trajectories of change in striatal and hippocampal systems. J Neurosci. 38:8453–8462. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref31] Logan J, Fowler JS, Volkow ND, Wang G-J, Ding Y-S, Alexoff DL. 1996. Distribution volume ratios without blood sampling from graphical analysis of PET data. J Cereb Blood Flow Metab. 16:834–840. [DOI] [PubMed] [Google Scholar]

[ref32] Mata R, Josef AK, Samanez-Larkin GR, Hertwig R. 2011. Age differences in risky choice: a meta-analysis. Ann N Y Acad Sci. 1235:18–29. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref33] Mazaika PK, Hoeft F, Glover GH, Reiss AL. 2009. Methods and software for fMRI analysis for clinical subjects In: Presentation at the 15th Annual Meeting of the Organization for Human Brain Mapping.

[ref34] Mell T, Heekeren HR, Marschner A, Wartenburger I, Villringer A, Reischies FM. 2005. Effect of aging on stimulus-reward association learning. Neuropsychologia. 43:554–563. [DOI] [PubMed] [Google Scholar]

[ref35] Noonan MP, Walton ME, Behrens TEJ, Sallet J, Buckley MJ, Rushworth MFS. 2010. Separate value comparison and learning mechanisms in macaque medial and lateral orbitofrontal cortex. PNAS. 107:20547–20552. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref36] O’brien RM. 2007. A caution regarding rules of thumb for variance inflation factors. Qual Quant. 41:673–690. [Google Scholar]

[ref37] Patenaude B, Smith SM, Kennedy DN, Jenkinson M. 2011. A Bayesian model of shape and appearance for subcortical brain segmentation. NeuroImage. 56:907–922. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref38] Pessiglione M, Seymour B, Flandin G, Dolan RJ, Frith CD. 2006. Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature. 442:1042–1045. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref39] Raz N, Lindenberger U, Rodrigue KM, Kennedy KM, Head D, Williamson A, Dahle C, Gerstorf D, Acker JD. 2005. Regional brain changes in aging healthy adults: general trends, individual differences and modifiers. Cereb Cortex. 15:1676–1689. [DOI] [PubMed] [Google Scholar]

[ref40] Rieckmann A, Karlsson S, Karlsson P, Brehmer Y, Fischer H, Farde L, Nyberg L, Bäckman L. 2011. Dopamine D1 receptor associations within and between dopaminergic pathways in younger and elderly adults: links to cognitive performance. Cereb Cortex. 21:2023–2032. [DOI] [PubMed] [Google Scholar]

[ref41] Ross S, Stearns C. 2010. SharpIR. White Paper. [Google Scholar]

[ref42] Salamone JD, Correa M. 2012. The mysterious motivational functions of mesolimbic dopamine. Neuron. 76:470–485. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref43] Samanez-Larkin GR, Hollon NG, Carstensen LL, Knutson B. 2008. Individual differences in insular sensitivity during loss anticipation predict avoidance learning. Psychol Sci. 19:320–323. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref44] Samanez-Larkin GR, Knutson B. 2015. Decision making in the ageing brain: changes in affective and motivational circuits. Nat Rev Neurosci. 16:278–289. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref45] Samanez-Larkin GR, Levens SM, Perry LM, Dougherty RF, Knutson B. 2012. Frontostriatal white matter integrity mediates adult age differences in probabilistic reward learning. J Neurosci. 32:5333–5337. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref46] Samanez-Larkin GR, Worthy DA, Mata R, McClure SM, Knutson B. 2014. Adult age differences in frontostriatal representation of prediction error but not reward outcome. Cogn Affect Behav Neurosci. 14:672–682. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref47] de Schotten MT, Rojkova K, Volle E, Urbanski M, Humbert F, Dell’Acqua F, de Schotten MT. 2014. A selective ageing effect on the frontal lobe connections. Alzheimers Dement. 10:P37–P38. [Google Scholar]

[ref48] Schultz W, Dayan P, Montague PR. 1997. A neural substrate of prediction and reward. Science. 275:1593–1599. [DOI] [PubMed] [Google Scholar]

[ref49] Seger CA. 2009. The involvement of corticostriatal loops in learning across tasks, species, and methodologies In: Groenewegen HJ, Voorn P, Berendse HW, Mulder AB, Cools AR, editors. The Basal Ganglia IX. Advances in Behavioral Biology. New York: Springer, pp. 25–39. [Google Scholar]

[ref50] Seger CA, Peterson EJ, Cincotta CM, Lopez-Paniagua D, Anderson CW. 2010. Dissociating the contributions of independent corticostriatal systems to visual categorization learning through the use of reinforcement learning modeling and granger causality modeling. NeuroImage. 50:644–656. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref51] Shiner T, Seymour B, Wunderlich K, Hill C, Bhatia KP, Dayan P, Dolan RJ. 2012. Dopamine and performance in a reinforcement learning task: evidence from Parkinson’s disease. Brain. 135:1871–1883. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref52] Skvortsova V, Palminteri S, Pessiglione M. 2014. Learning to minimize efforts versus maximizing rewards: computational principles and neural correlates. J Neurosci. 34:15621–15630. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref53] Slifstein M, Kegeles LS, Gonzales R, Frankle WG, Xu X, Laruelle M, Abi-Dargham A. 2007. [11C]NNC 112 selectivity for dopamine D1 and serotonin 5-HT2A receptors: a PET study in healthy human subjects. J Cereb Blood Flow Metab. 27:1733–1741. [DOI] [PubMed] [Google Scholar]

[ref54] Smith RE, Tournier J-D, Calamante F, Connelly A. 2012. Anatomically-constrained tractography: improved diffusion MRI streamlines tractography through effective use of anatomical information. NeuroImage. 62:1924–1938. [DOI] [PubMed] [Google Scholar]

[ref55] Smith RE, Tournier J-D, Calamante F, Connelly A. 2015. SIFT2: enabling dense quantitative assessment of brain white matter connectivity using streamlines tractography. NeuroImage. 119:338–351. [DOI] [PubMed] [Google Scholar]

[ref56] Smittenaar P, Kurth-Nelson Z, Mohammadi S, Weiskopf N, Dolan RJ. 2017. Local striatal reward signals can be predicted from corticostriatal connectivity. NeuroImage. 159:9–17. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref57] Tournier J-D, Calamante F, Connelly A. 2012. MRtrix: diffusion tractography in crossing fiber regions. Int J Imaging Syst Technol. 22:53–66. [Google Scholar]

[ref58] van de Vijver I, Ridderinkhof KR, Harsay H, Reneman L, Cavanagh JF, Buitenweg JIV, Cohen MX. 2016. Frontostriatal anatomical connections predict age- and difficulty-related differences in reinforcement learning. Neurobiol Aging. 46:1–12. [DOI] [PubMed] [Google Scholar]

[ref59] Worthy DA, Maddox WT. 2012. Age-based differences in strategy use in choice tasks. Front Neurosci. 5:145. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref60] Wu M, Chang LC, Walker L, Lemaitre H, Barnett AS, Marenco S, Pierpaoli C. 2008. Comparison of EPI distortion correction methods in diffusion tensor MRI using a novel framework. Med Image Comput Comput Assist Interv. 11:321–329. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref61] Yang AC, Tsai S-J, Liu M-E, Huang C-C, Lin C-P. 2016. The association of aging with white matter integrity and functional connectivity hubs. Front Aging Neurosci. 8:143. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Corticostriatal White Matter Integrity and Dopamine D1 Receptor Availability Predict Age Differences in Prefrontal Value Signaling during Reward Learning

Lieke de Boer

Benjamín Garzón

Jan Axelsson

Katrine Riklund

Lars Nyberg

Lars Bäckman

Marc Guitart-Masip

Abstract

Introduction

Materials and Methods

Participants

Two-Armed Bandit Task

Figure 1.

Statistical Analysis of Brain and Behavior

Computational Analysis of Behavior

MRI Acquisition

Functional MRI Analysis

DWI Preprocessing and Analysis

PET Image Acquisition and Analysis

Results

Behavior

Figure 2.

Previous Findings in the DWI Subsample

Table 1.

Table 4.

Relationship Between White-Matter Integrity, Behavior, D1 BPND, and Neural Correlates of Value Anticipation

Figure 3.

Table 2.

Table 3.

Discussion

Supplementary Material

Acknowledgements

Funding

Data availability

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Relationship Between White-Matter Integrity, Behavior, D1 BP_ND, and Neural Correlates of Value Anticipation