Abstract
The judgment of whether to accept or to reject an offer is determined by positive and negative affect related to the offer, but affect also induces motivational responses. Rewarding and aversive cues influence the firing rates of many neurons in primate prefrontal and cingulate neocortical regions, but it still is unclear whether neurons in these regions are related to affective judgment or to motivation. To address this issue, we recorded simultaneously the neuronal spike activities of single units in the dorsolateral prefrontal cortex (dlPFC) and the anterior cingulate cortex (ACC) of macaque monkeys as they performed approach–avoidance (Ap–Av) and approach–approach (Ap–Ap) decision-making tasks that can behaviorally dissociate affective judgment and motivation. Notably, neurons having activity correlated with motivational condition could be distinguished from neurons having activity related to affective judgment, especially in the Ap–Av task. Although many neurons in both regions exhibited similar, selective patterns of task-related activity, we found a larger proportion of neurons activated in low motivational conditions in the dlPFC than in the ACC, and the onset of this activity was significantly earlier in the dlPFC than in the ACC. Furthermore, the temporal onsets of affective judgment represented by neuronal activities were significantly slower in the low motivational conditions than in the other conditions. These findings suggest that motivation and affective judgment both recruit dlPFC and ACC neurons but with differential degrees of involvement and timing.
Keywords: anterior cingulate cortex, decision making, macaque, prefrontal cortex, primate
Introduction
Approach–avoidance (Ap–Av) conflict arises when an individual must decide between acceptance and rejection of a prospective offer having a mixture of positive and negative attributes (Miller, 1971; Vogel et al., 1971). These judgments are determined by cost–benefit integration of positive and negative values related to each component of the combined offer (Talmi et al., 2009; Amemori and Graybiel, 2012) but concurrently induce motivational responses that affect dynamic aspects of the decision-making process (Tversky and Shafir, 1992; Diederich, 2003). For instance, macaque monkeys tend to slow down task performance or sometimes defer performance when the positive aspects of the offer are frustratingly low (Hassani et al., 2001; Watanabe et al., 2001; Isoda and Hikosaka, 2008) and when the negative aspects are not threatening (Kobayashi et al., 2006). Importantly, such motivational responses are considered to be dissociable from judgments based on affect (Lang et al., 1998; Roesch and Olson, 2004). Highly threatening, aversive offers reduce the utility value of acceptance but also are thought to activate an avoidance motivation to facilitate performance (Bradley et al., 2001, 2008). Here we used a conflict decision-making paradigm that can behaviorally dissociate affective judgment of likes and dislikes from motivational factors that control the dynamic aspects of decision-making. We used this potential in an attempt to dissociate components of value-based decision-making by examining the neuronal correlates of affective judgment and motivation.
Both the dorsolateral prefrontal cortex (dlPFC) and the anterior cingulate cortex (ACC) have been implicated in such cost–benefit integration in nonhuman primates (Kennerley et al., 2006, 2011; Hosokawa et al., 2013) and humans (Knoch et al., 2006; Talmi et al., 2009), and these regions are known to be critical for goal-directed behavior (Matsumoto et al., 2003) and value-based decision-making (Barraclough et al., 2004; Seo and Lee, 2007). Activity in the dlPFC is enhanced especially in response to motivationally intriguing goals (Watanabe, 1996; Amemori and Sawaguchi, 2006; Kennerley and Wallis, 2009). Focal microstimulation of part of the pregenual ACC induces negative bias in such value-based decision-making (Amemori and Graybiel, 2012), causally demonstrating an involvement of part of the ACC in subjective evaluation (Padoa-Schioppa, 2011; Cai and Padoa-Schioppa, 2012). However, it has not been clear whether the dlPFC and the ACC have distinct functions in the affective judgment and motivational regulation that influence neural processing as decision-making evolves over time. Using the Ap–Av decision-making paradigm, we tested whether the dlPFC and ACC of macaque monkeys, two cortical regions that have been implicated in affective processing and motivation, have dissociable activity patterns related to these functions by recording with chronic multielectrode methods (Feingold et al., 2012).
Materials and Methods
Subjects and experimental conditions.
Two female Macaca mulatta monkeys (monkey A, 6.8 kg; monkey S, 7.5 kg) were studied in experiments following procedures approved by the Committee on Animal Care of the Massachusetts Institute of Technology. Before training, the monkeys were habituated to sitting in a monkey chair, and sterile surgery was performed with anesthesia induced by intramuscularly administered atropine (0.05 mg/kg) and ketamine (10 mg/kg), followed by inhalation of 1–2.5% sevoflurane with 2 L of O2, to implant over the occipital bone a titanium head post, secured by titanium screws and bone cement. For all surgeries, the monkeys were maintained on analgesics postoperatively, and prophylactic antibiotics were injected on the day of surgery and daily thereafter for 1 week.
Task procedures.
The two female monkeys were trained to perform Ap–Av and approach–approach (Ap–Ap) tasks (Amemori and Graybiel, 2012). In the Ap–Av task (Fig. 1A), a white central fixation spot and a gray rectangular frame simultaneously appeared on a screen. When the monkey placed its hand on a designated start position in front of a joystick, an infrared photobeam sensor detected the placement, inducing the rectangular frame to turn white. The monkey was required to hold its hand in the start position for 1.5 s (fixation period). If the monkey released its hand from the start position, the rectangular frame became gray, requiring the monkey to begin again with fixation. After the fixation period, a compound visual cue, consisting of red and yellow horizontal bars, appeared on the screen. The length of the red bar corresponded to the offered amount of liquefied food, which ranged from 0.1 to 2.0 ml. The length of the yellow bar corresponded to the offered pressure of the airpuff, which ranged from 0 to 50 psi. These two offered outcomes were delivered only when the monkey made an approach decision. The length of the bars could be varied over 101 steps, independently and pseudorandomly. The cues remained on for 1.5 s (cue period), and the monkey had to maintain start-position contact during this period. If the monkey released the contact (a commission error), the trial was terminated, and an airpuff corresponding to that indicated by the length of the yellow bar was delivered to the monkey's face, accompanied by a sound signaling error. If the monkey continued the trial, then after the cue period, two target cues (a white cross and a white square) appeared above and below the cue. At the same time, a cursor (white circle) whose vertical location could be controlled by the joystick appeared at the center of the screen. The monkey was required to report its decision by using a joystick to move the cursor toward one of the two targets within 3 s (response period). The locations of the targets were alternated randomly. If the monkey did not respond within the allotted 3 s response period, the trial was counted as an omission error, and an airpuff as indicated by the yellow bar was delivered, accompanied by the error sound. When the monkey chose the square target, a sound signaling avoidance was played, and 500 ms later, the minimum and constant amount of reward (liquefied food, 0.1 ml; equivalent to the minimum offer by the red bar) was delivered to maintain the monkey's motivation to continue the task. If the monkey chose the cross target, a sound signaling approach was played, and 500 ms later, an airpuff whose pressure was indicated by the yellow bar was delivered to the monkey's face for 800 ms. Liquefied food, in an amount signaled by the red bar, was then delivered 1 s later for 1.5 s. After each trial, a 5 s intertrial interval occurred. The amount of reward and the pressure of airpuff were controlled by a computer-controlled pump (L/S Masterflex; Cole-Parmer) and a transducer (900-EIA; Control Air).
The monkeys were also trained to perform an Ap–Ap task (Fig. 1F; Amemori and Graybiel, 2012). The task design and procedural details for this task were the same as those in the Ap–Av task, except that the outcomes of the decisions were always rewards. When the monkey chose the square target, a sound signaling approach occurred, and 500 ms later, the liquid food reward was delivered for 1.5 s at an amount indicated by the length of the yellow bar. When the monkey chose the cross target, a sound signaling approach was triggered, and 500 ms later, reward was delivered for 1.5 s at an amount indicated by the length of the red bar. The amount of reward per unit length of the red bar was twice that of the yellow bar, so that the association between cue and target was comparable for the Ap–Av and Ap–Ap tasks. When commission or omission errors occurred, the trial was terminated without reward delivery.
Neuronal recording.
After behavioral training, in a second sterile surgery, a plastic 40 × 40 mm recording chamber was implanted on the skull and secured by bone cement and ceramic screws at coordinates stereotaxically determined for each monkey. The chamber contained a plastic grid with openings spaced at 1 mm intervals to serve as guides allowing parallel microelectrode penetrations. After the chamber implantation, magnetic resonance images (T2-weighted turbo spin echo, 300 μm in resolution, 1 mm slice thickness) were taken to allow subsequent identification of the electrode tracks. Then, with surgical anesthesia and sterile conditions, the skull overlying the targeted regions was removed. After recovery, sets of platinum–iridium electrodes (impedance, 0.8–1.5 MΩ; FHC) were gradually lowered into the neocortex (Feingold et al., 2012). All electrodes were held by custom-made micromanipulators affixed to the grid. For the first implant in monkey A, 36 electrodes were implanted simultaneously into the neocortical targets (24 in the dlPFC and 12 in the ACC), and 18 additional electrodes were implanted into the striatum for another experimental purpose. For the second implant in monkey A, 32 electrodes were implanted in the neocortex (18 in the dlPFC and 24 in the ACC). For the first implant in monkey S, 12 electrodes were implanted into the ACC. For the second implant in monkey S, 30 electrodes were implanted into the neocortex (12 in the dlPFC and 18 in the ACC), and 15 additional electrodes were implanted into the striatum.
The recording and task control system consisted of five networked computers and other peripheral equipment. Eye positions were monitored by an infrared eye-movement camera system (Eyelink CL; SR Research). Two computers controlled the task based on a CORTEX system developed by the National Institute of Mental Health. For recording, a digital data acquisition system (Digital Lynx; Neuralynx) collected all signals and task event markers. Signals from the microelectrodes were amplified and stored by the Digital Lynx system. Data were later classified into single-unit activities using Offline Sorter (Plexon) and were analyzed using MATLAB (Mathworks). Recordings were made while the monkeys performed each of the two (Ap–Av and Ap–Ap) tasks in alternating blocks of 150 trials (Fig. 1K). Between blocks, we inserted a 10 s interblock interval, during which a white spot appeared at the center of the screen as an explicit signal of the block change.
Econometrics models deriving chosen value and entropy.
To approximate parametrically the monkey's choice behavior, we used an econometrics model. We adopted the conditional logit model (McFadden, 1974; Train, 2003). With the monkeys' decision sampled in each session (Fig. 2A), we reversely inferred these subjective values as follows. If there are two options associated with the cross and square targets, the probability of choosing the cross target can be written as p+ = 1/(1 + exp(−(U+ − U□))), where U+ and U□ are the representative utility of each option (Fig. 2B). In the Ap–Av task, the function U+ − U□ was approximated by the first-order linear model that had the lowest Bayesian information criterion (BIC; Amemori and Graybiel, 2012). We thus parameterized the function U+ − U□ as U+ − U□ = a1x + a2y + a3, where x was the length of red bar, y was the length of yellow bar, and a1, a2, and a3 were the coefficients determined by the generalized linear model or logistic regression. We modeled each utility as U+ = UAP = a1x + a2y, and U□ = UAV = −a3. The weighted average, ChV = p+U+ + (1 − p+)U□, is interpreted as corresponding to the “chosen value” (ChV), i.e., the anticipatory value associated with the selected option (Von Neumann and Morgenstern, 1947; Padoa-Schioppa and Assad, 2006; Fig. 2D). We consider the ChV in the Ap–Av task to be enhanced by the expectation of reward and to be suppressed by the expectation of punishment, because this is the parameter that reflects valuation for value-based decision-making. In the Ap–Ap task, following BIC procedures, we adopted the first-order approximation model without a bias term as the best model, U+ − U□ = a1x + a2y, where x was the length of red bar, y was the length of yellow bar, and a1 and a2 were the coefficients determined by the logistic regression. Given that the monkeys were fully trained on this task, we considered that the monkeys had acquired the association between the cross target and the red cue and the associations between the square target and the yellow cue. Then, U+ became a function of x, and U□ became a function of y. Based on this assumption, we modeled U+ = a1x and U□ = −a2y. Based on the decisions in each session (Fig. 2E), we calculated the chosen value by ChV = p+U+ + (1 − p+)U□, where p+ is the probability of choosing the cross target, estimated by logistic regression (Fig. 2F). In both the Ap–Av and Ap–Ap tasks, the entropy of the decisions (Ep; Fig. 2C) was calculated by the model as Ep = −p+logp+ −(1 − p+) log(1 − p+).
Behavioral parameters to characterize the decision-making process.
To characterize parametrically the behavior and the decision-making process, we began with 11 variables. Among them, seven variables—the offered reward (the amount of offered reward indicated by the red bar; Rew), the offered airpuff (the offered pressure of airpuff indicated by the yellow bar; Ave), the binary value of approach (1) or avoidance (0), the chosen reward (the amount of reward that would be delivered after the choice of approach; ChR), the chosen aversion (the strength of airpuff that would be delivered after the choice of approach; ChA), the reaction times (RTs), and the direction of the joystick movement (push or pull; Mv)—were derived directly from the monkey's performance in each recording session. We also used four hypothetical variables to characterize the process of the decision-making. Two of these, ChV and Ep, were derived from the econometrics model calculated for each session. Two other variables, SD of decision (SDD) and the frequency of omission error (FOE), were derived from the corresponding averaged behavioral parameters. Omission errors were counted as trials in which the monkey did not respond during the 3 s response period.
Because of the limited number of omission errors in single sessions, we did not use these parameters as obtained from single sessions. Instead, we adopted the mean of SDD and the mean of FOE derived by averaging these variables over all recording sessions. The mean of SDD and the mean of FOE were parameterized by the combined offers as shown in Figure 1, C, E, H, and J. To calculate the pattern of choice (Fig. 1B,G), SDD (Fig. 1C,H), RT (Fig. 1D,I), and FOE (Fig. 1E,J), each datum parameterized by the combined offers was convolved with 4 × 4 square and assigned into 25 × 25 matrix. Thus, FOE was not a variable measuring the omission error in a given trial but was correlated with conditions that frequently induced the omission errors. We considered that FOE was inversely correlated with the level of motivation to perform the task. We used the Ep and SDD to characterize the degree of conflict in the Ap–Av task. Parameters derived from the averaged behavioral patterns (FOE and SDD) were obtained from the corresponding value in the matrix (Fig. 1C,E). Thus, the 11 behavioral parameters were obtained for each trial.
Before calculating the k-means clustering (see Fig. 4B) and the population activity (see Figs. 6–8, 10, 11), every dataset was “standardized” so that we could compare behavioral parameters and the unit activities across different daily sessions. Following the method described previously (Amemori and Graybiel, 2012), we linearly resized the reward amount to have the same decision boundary across different recording sessions.
Preparatory analyses to select explanatory variables for regression.
Some of these 11 variables used originally to parametrically describe the behavior above could clearly have been correlated with one another, reducing their value as explanatory variables for regression analysis (the multicollinearity problem). To select the variables necessary and feasible for multivariate regression analyses among these, we therefore performed two preparatory analyses, correlation analyses and a clustering procedure. First, we performed correlation analyses with all task-related units (Fig. 4A). For each unit, the variable that exhibited the highest correlation coefficient with the cue-period activity among the 11 variables was defined as the “feature variable” for that unit. We ranked the 11 variables by the number of units with activity showing significant correlation with them as determined by Pearson's correlation coefficients (p < 0.05). We found that Rew, Ave, ChV, RT, and FOE were ranked as the top five. These five were the feature variables for 72% (1029 among 1431 units) of the task-related units, suggesting that they were important explanatory variables to use in characterizing the dataset.
Second, to support further the validity of the selected five variables, we inserted a clustering procedure (Fig. 4B). Because the multicollinearity problem is produced by similar regression variables, we aggregated the similar units and then chose one representative variable for the groups of similar units. We used k-means clustering to group the similar units. After we standardized the data to compare across the sessions (Amemori and Graybiel, 2012), for unit i, we produced an 8 × 8 matrix, mi(x, y), where x and y were the corresponding sizes of reward and airpuff, respectively. For each combination of x and y, the mean firing rate was calculated and assigned to the corresponding element. The distance (or dissimilarity index) between unit i and j, D(i, j), was represented by 1 − rij, where rij indicated the two-dimensional correlation of mi(x, y) and mj(x, y). We used classical multidimensional scaling (or principal coordinates analysis) to compress the representation of the dissimilarity matrix, D(i, j), in which the activity of unit i was represented by extracted common features parameterized by f. We performed the k-means clustering with the scaled dissimilarity matrix, D(i, f), with different numbers of clusters (k = 1 to k = 100) and with 1000 different initialization parameters for each k. Thus, we derived 62 optimal clusters that had the lowest sum of silhouette value (Rousseeuw, 1987). For each unit in the cluster, we performed a correlation analysis between 11 hypothetical parameters (Rew, Ave, Cho, ChV, ChR, ChA, Ep, SDD, RT, FOE, and Mv) and the cue-period activity of units in the cluster. The correlation analysis was performed on a trial-by-trial basis using a single epoch that covered the entire 1.5 s cue period. For each cluster, we chose the variables that had a significant correlation (Pearson's correlation coefficient, p < 0.05) most frequently as the feature variable of the cluster. As a result, most of the clusters (51 among 62 clusters, 82%) were characterized by the five variables (Rew, Ave, ChV, RT, and FOE), suggesting that these five variables were the representative features to characterize most of the clusters of similar units. With the two analyses, we thus confirmed that the five variables were selected robustly as the important explanatory variables for the regression analyses.
Lastly, to confirm that the five selected variables were suitable for the multivariate regression, we performed multicollinearity checks for each recording session using the Belsley's collinearity test (Belsley et al., 1980). In our dataset, there were no sessions in which the set of variables exceeded the standard tolerance (a variance decomposition proportion >0.5 and a condition index >30, established in the collinetest function of MATLAB), indicating that these combinations were feasible for the multivariate regression.
All possible subset regression for neuronal classification.
To classify units, we focused on the cue period, during which the monkeys could make a decision based on the offer indicated by the cues but during which they did not yet know the direction of movement required to indicate their decision. With the cue-period activity for the Ap–Av task, we performed the all-possible subset regression analysis using the five selected explanatory variables (Rew, Ave, ChV, RT, and FOE). For every possible combination of the variables (31 combinations), we performed regression to characterize the cue-period activity of each unit on a trial-by-trial basis. The cue-period activity was defined by the raw spike count divided by the cue-period length. Both the cue-period activity and the explanatory behavioral variables were obtained for each trial. For each unit, the best-fitted model was selected among all possible combinations based on BIC (Fig. 5A). Among the combinations of variables that explained the cue-period activity significantly well (p < 0.05), the combination that produced the highest BIC score was selected. We also applied Akaike information criteria and Mallow's Cp statistic as the criteria for the model selection and confirmed that the proportion of each category was similar among all criteria (Fig. 5C). Furthermore, we also performed stepwise regression with the stepwisefit function of MATLAB. With the stepwise regression analyses, we could add and remove terms from a multiple linear model based on a series of F tests. At each step, the p value of an F statistic was computed to test models with and without the indicated term. We performed the stepwise regression algorithm, beginning with no terms in the model and using entrance/exit tolerances of 0.05/0.10 on the p values. The results of stepwise regression were also similar to the results of all-possible subset regression (Fig. 5C), suggesting a robustness of the results against different criteria.
Results
Behavioral correlates of conflict and reluctance in task performance
The Ap–Av and Ap–Ap tasks performed by monkeys A and S are illustrated in Figure 1. In the Ap–Av task, the monkeys' decisions were determined systematically based on Rew and Ave (Fig. 1B). The SDD was high along the decision boundary, suggesting a conflict state in decision-making at this boundary (Fig. 1C). The RT was also of relatively long duration along the decision boundary, but it was clearly affected by the sizes of Rew and Ave (Fig. 1D). When the Rew and Ave were small, RTs were longest, suggesting a low motivational state and a reluctance to perform the task under low payoff conditions. In the Ap–Ap task (Fig. 1F–J), the monkeys chose the option that gave the larger reward. Because the unit amount of reward associated with the unit length of the red bar was twice as large as that indicated by the yellow bar, the decision boundary in the Ap–Ap task corresponded to the offers for which either choice resulted in the same amount of reward. When the offered reward sizes were both low, the RTs were the longest, suggesting low motivation to perform the task (Fig. 1I). The pattern of the RTs in the Ap–Ap task was similar to that in the Ap–Av task. In both tasks, the RTs were longest for “low–low” combined offers (i.e., low reward and low airpuff offers in the Ap–Av task or low reward offers for both targets in the Ap–Ap task; Fig. 1D,I). Thus, we interpreted the RT as a compound of conflict and low motivational states in task performance and that the pattern of this compound was similar in the two tasks.
Dissociation of motivation and affective judgment
The affective judgments leading to acceptance and rejection of combined offers are determined by integration of positive and negative values of the offer, but they also induce motivational responses that affect the decision-making process. In an attempt to dissociate neural activity related to affective judgment and motivation, we introduced two hypothetical variables. First, we focused on the FOE as a key behavioral parameter to quantify the level of motivation (Fig. 1E). Omission errors occurred when the monkey did not respond during the 3 s response period. When the monkeys made an omission error, the trial was terminated by the delivery of a penalty airpuff whose strength was associated with the length of the yellow bar (Fig. 1A). When the monkeys did not want to receive the penalty airpuff, they had to actively choose the avoidance target. Therefore, the FOE decreased as either the Rew or the Ave increased. In fact, in both the Ap–Av and Ap–Ap tasks, the FOE was the highest for low–low combined offers (i.e., low reward and airpuff offers in the Ap–Av task or low reward offers in the Ap–Ap task; Fig. 1E,J). We interpreted the FOE as a behavioral parameter that is correlated with a low motivational condition (Roesch and Olson, 2004) and used it as the explanatory variable to define units related to low motivation. In accordance with this interpretation, we found that the FOE in the Ap–Av task was decreased by increased expectation of reward and punishment (Fig. 1E), whereas the FOE in the Ap–Ap task was increased by decreases in the size of the offered rewards (Fig. 1J).
Second, to characterize quantitatively the affective judgment of whether to accept or to reject the combined offer, we introduced a computational model (conditional logit) that estimated the ChV. In the Ap–Av task, the ChV increased linearly according to the size of offered reward and decreased linearly according to the strength of the airpuff (Fig. 2D). In contrast, in the Ap–Ap task, the ChV was determined by the larger of the two reward amounts offered by the red and yellow bars (Fig. 2F). Therefore, the ChV differed between the Ap–Av and Ap–Ap tasks. The reward and punishment differentially affected the FOE and ChV in the Ap–Av task and allowed us to dissociate estimated internal neuronal processes related to affective judgment and motivation.
Classification of units
We isolated 3109 neocortical units, of which 1076 were recorded from the right and left dlPFC of the two monkeys (689 from monkey A, 387 from monkey S; Fig. 3, blue circles) and 2033 units from the ACC of both sides (1143 from monkey A, 890 from monkey S; Fig. 3, red circles). The analytical results using approximately half of the ACC units (1065 units, 52.4% of the current dataset) were reported previously (Amemori and Graybiel, 2012), so that approximately the same numbers of units were newly recorded in the ACC and dlPFC. Among the dlPFC units, 904 (84.0%) were defined as task related, because their firing rates during the cue period were significantly different from those during the 1 s time window before the fixation cue appeared (two-sample t test, p < 0.05) and those during the pre-cue period (two-sample t test, p < 0.05). Among the ACC units, 1760 units (86.6%) were defined as task related by the same criterion. To classify the full population of 2664 task-related units, we performed all possible subset regression with BIC. First, we selected five explanatory variables (Rew, Ave, ChV, RT, and FOE) based on the results of correlation analyses (see Materials and Methods; Fig. 4). We performed regression analyses with the cue-period activity in the Ap–Av task for all possible combinations of the five explanatory variables, and we selected the combination of the variables that had the highest BIC score. Among all task-related units, 53% were significantly well accounted for (p < 0.05) by the regression models produced by the single or combinations of the variables (1409 units; Fig. 5A). Among these, 81% (1142 units) were characterized by single variables. In additional analyses, we use these 1142 classified units as the sets of dlPFC and ACC units with cue-period activity that could be accounted for (p < 0.05) by the explanatory variable parameters that we introduced (Fig. 5B). Thus, our database was confined to the units that could be identified according to the regression analysis with BIC, which was based on the firing properties recorded during performance of the Ap–Av task.
Properties of units encoding Rew and Ave
Among all classified units, 486 units (43%) were classified as units encoding offered values (Rew and Ave). The cue-period activity of Rew+ units was linearly and positively correlated with the size of offered reward (Fig. 6A, top left), and these units were observed significantly more frequently in the ACC (n = 127, 16.9% of all classified ACC units) than in the dlPFC (n = 46, 11.8% of all classified dlPFC units; Fisher's exact test, p < 0.05; Fig. 6A). Units that had activity correlated negatively with the offered reward (Fig. 6B, Rew−) did not show any bias in proportion between the ACC and dlPFC regions analyzed. In a similar way, units with activity correlated linearly and positively with the offered strengths of the airpuff were categorized as Ave+ units (Fig. 6C, top left). Ave+ units were observed significantly more frequently in the ACC (n = 109, 14.5% of all classified ACC units) than in the dlPFC (n = 35, 8.9% of all classified dlPFC units; Fisher's exact test, p < 0.05). Units with activity correlated negatively with offered airpuff were categorized as Ave− units (Fig. 6D). The dlPFC population contained more of these units than did the ACC population (Fig. 6D).
Properties of units encoding ChV and RT
Units with cue-period activity that was correlated with the ChV were notable, because these units encoded the anticipated value of the chosen option. In contrast to the Rew and Ave units, these units did not reflect objectively the offered reward value, but rather reflected the monkey's subjective evaluation (Padoa-Schioppa and Assad, 2006). The units with activity positively correlated with the ChV were classified as ChV+ units (Fig. 7A). Their activity increased as the offered reward increased and as the offered airpuff decreased, showing positive correlation with the monkey's acceptance decisions. Conversely, the units with activity negatively correlated with the ChV were classified as ChV− units (Fig. 7B), and their activity was positively correlated with the monkey's rejection decision. We found no regional bias in their proportions in the ACC and dlPFC (Fig. 7A,B, top left).
Among all classified units, 268 units were classified as RT units. The units with activity positively correlated with the RT on a trial-by-trial basis were classified as RT+ units (Fig. 7C). The dlPFC population (n = 80, 20.5% of all classified dlPFC units) contained significantly more RT+ units (Fisher's exact test, p < 0.05) than the ACC population (n = 113, 15.0% of all classified ACC units; Fig. 7C, top right). In contrast, units with activity correlated negatively with RT were classified as RT− units (Fig. 7D). These units did not exhibit significant biases in proportion (Fig. 7D, top right). Interestingly, the upcoming RT in the response period was predicted by the activity of the RT units even before the cue period. The differential activity of RT units appeared before the cue period (Fig. 7C,D, bottom), suggesting that the activity was not dependent solely on the offered cue but could also be modulated by long-term motivational states.
Properties of units encoding motivation
We regarded the condition that frequently induced omission errors as a low motivational condition (Hassani et al., 2001; Watanabe et al., 2001; Roesch and Olson, 2004). Units with activity positively correlated with the FOE were categorized as FOE+ units. These units were specifically activated in the low–low condition in which the Rew and Ave were both small. Occasional deferment of the task performance reflects a state in which the monkey reluctantly performed the decision-making. Like the proportion of RT+ units, which were also correlated with the low motivational state, the proportion of the FOE+ units in the dlPFC was significantly larger than that in the ACC (Fisher's exact test, p < 0.05; Fig. 8A, top right). Units with activity negatively correlated with FOE were categorized as FOE− units. These neurons were activated when either the reward or airpuff offer was high, reflecting the activity in the high motivational condition to perform the decision-making (Fig. 8B).
To estimate whether the dlPFC or the ACC temporally led in neuronal processing related to motivational regulation, we compared the time when the neuronal activity in each region began to discriminate the motivational conditions. For FOE+ and FOE− units, the high and low motivational conditions were defined, respectively, as having offered cues that induced higher and lower firing rates than the median in population activity (Fig. 8A,B, top left). For the high and low motivational conditions, neuronal spikes were sampled in 10 ms bins. With these data, the discriminative ability between the two conditions was determined by a Wilcoxon's rank-sum test with 200 ms sliding windows and was represented by a z value. When the neuronal activity showed a significant difference (p < 0.05) between the two conditions for more than eight consecutive 10 ms bins, we defined the time as the onset of the discrimination.
Among 194 FOE units, 53 in the dlPFC units and 84 in the ACC units exhibited significant differences in firing rate between the high and low motivational conditions (Fig. 8C). Comparing ACC and dlPFC units, we found significant differences in both the cumulative distributions of the onset of representation (Kolmogorov–Smirnov test, p < 0.01) and the temporal distribution of onset (Wilcoxon's rank-sum test, p < 0.05), suggesting that the dlPFC temporally leads the ACC in motivational regulation (Fig. 8D,E). The onset distributions were not significantly different between the dlPFC and ACC for the other groups of units (Rew, Ave, ChV, and RT units; p > 0.05 with Kolmogorov–Smirnov and Wilcoxon's rank-sum tests). These results suggest that, although the temporal patterns of activity in the dlPFC and the ACC were generally similar, activity in the dlPFC led activity in the ACC, possibly to regulate motivational aspects of the decision-making.
Effect of low motivation on neuronal activity
One of the features of motivational regulation that could be observed behaviorally was the temporal dynamics of the task performance. Given that the monkeys' decisions were delayed, and occasionally were deferred, in the low motivational states, it was possible that neuronal processes related to decision-making could also be regulated by these different motivational conditions (Tversky and Shafir, 1992; Diederich, 2003). To address this issue with our dataset, we examined neural activity according to the differential motivational conditions (Fig. 9A–C). First, we defined the low and high motivational conditions as, respectively, the preferred and nonpreferred conditions of the FOE+ units. We then compared the onset of discrimination between acceptance and rejection encoded by the ChV units in the low and high motivational conditions. The discrimination between upcoming acceptance and rejection was calculated by a Wilcoxon's rank-sum test (Fig. 9A). The onset of discrimination was defined as in Figure 8C. Among 194 ChV units, the activity of 118 units reached the criterion defining the onset in the high motivational condition, whereas the activity of 48 units reached the criterion in the low motivational condition. In both cumulative (Fig. 9B) and temporal (Fig. 9C) distribution of onsets, the discrimination was significantly earlier in the high motivational condition than in the low motivational condition (Kolmogorov–Smirnov test, p < 0.01; Wilcoxon's rank-sum test, p < 0.05). These results suggest that the low motivational condition in which the monkey occasionally deferred the decision-making actually delayed the neuronal process of deriving affective judgment.
The FOE units were defined as the units that were differentially activated by the motivational condition in which the monkey performed the decision-making reluctantly or willingly. However, because the low motivational condition was defined by the increased FOEs, it was still possible that the FOE units could be involved specifically in disengagement or omission of the task performance rather than motivational regulation. To test this possibility, we isolated a group of units that had activity directly correlated with omission of task performance. Among all task-related units, the cue period activities of 36 units exhibited significant differences between omission error and correct task performance (two-sample t test, p < 0.05), and those units were defined as omission type (Fig. 9D). Notably, these units did not overlap with the motivation (FOE) units at all (Fig. 9E), indicating a clear distinction between omission-type and FOE units. Thus, the FOE+ units seemed not to be directly related to omission of task performance itself when compared with the omission-type units, yet they were activated in the low motivational condition in which the monkey correctly performed the task. Thus, the activity of FOE+ units could reflect motivational regulation to perform the task patiently in the low payoff condition.
Information encoded by ChV and FOE units was preserved in two tasks
As shown in Figure 2, D and F, the ChV was clearly different for the Ap–Av and Ap–Ap tasks. If the ChV units represent exclusively the anticipatory value of chosen option irrespectively of the task conditions, the activity of ChV units should change across the two tasks. Conversely, as shown in Figure 1, E and J, the FOE was similar for the two tasks. If the FOE units represent exclusively motivational condition irrespectively of the task conditions, then they should not change their firing pattern between the two tasks. To examine this possibility, we compared the activity patterns of the recorded units in the Ap–Av task with the activity of the same units in the Ap–Ap task. First, we confirmed that the ChV units preserved the encoded information by changing their response to the visual cues depending on the task version. The ChV+ units (n = 119) exhibited the highest firing rates for high reward and low cost in the Ap–Av task (Fig. 10A, left), and they showed increases in activity as either the red or yellow bar increased in length in the Ap–Ap task (Fig. 10A, middle). We calculated the correlation coefficients between the cue-period activity and the ChV for the Ap–Av (Fig. 2D) and Ap–Ap (Fig. 2F) tasks. The means of the distributions of the correlation coefficients were significantly positive in both the Ap–Av (one-sample t test, p < 0.001) and Ap–Ap (p < 0.001) tasks (Fig. 10A, right). Similarly, for each ChV− unit (n = 75), we calculated the correlation coefficients between the cue-period activity (Fig. 10B, left and middle) and the ChV for the Ap–Av (Fig. 2D) and Ap–Ap (Fig. 2F) tasks. The means of the correlation coefficients of the ChV− units were significantly negative in both the Ap–Av (p < 0.001) and Ap–Ap (p < 0.001) tasks (Fig. 10B, right), suggesting the preservation of ChV information for these units (Amemori and Graybiel, 2012).
Task performance with low motivation occurred at the low–low combined offers in both the Ap–Av and Ap–Ap task conditions (i.e., low reward and low airpuff in the Ap–Av task and low reward offers in the Ap–Ap task), and these were represented by the FOE (Fig. 1E,J). We found that the FOE+ units (n = 120) also did not change their response to the cues between the two tasks (Fig. 10C, left and middle). We calculated the correlation coefficients between the cue-period activity and the FOE for the two tasks. The means of the correlation coefficients were significantly positive both in the Ap–Av (p < 0.001) and Ap–Ap (p < 0.001) tasks (Fig. 10C, right). Similarly for the activity of FOE− units (n = 74; Fig. 10D, left and middle), we calculated the correlation coefficients between the cue-period activity and the FOE for the two tasks. The means of the correlation were significantly negative in both Ap–Av (p < 0.001) and Ap–Ap (p < 0.05) tasks (Fig. 10D, right). These results suggested that the FOE units preserved the encoded information in both task versions.
Qualitatively, in the Ap–Ap task, both the ChV and motivation were minimal in the low–low condition, in which the reward offer was minimal (Figs. 1J, 2F). Thus, it was possible that the ChV and FOE units also exhibited similar activities in the Ap–Ap tasks. To assess quantitatively how the two task versions contributed to differentiate the activities of these units, we compared the ChV+ and FOE− units and also the ChV− and FOE+ units across the two tasks.
In the Ap–Av task, the activities of the ChV+ units (n = 119; Fig. 10A, left) were significantly different from those of the FOE− units (n = 74; Fig. 10D, left), especially when the size of the Ave was large and the size of the Rew was small (two-sample t test, p < 0.01; Fig. 11A). The two-dimensional correlation coefficient that quantified the similarity between ChV+ and FOE− matrices was 0.57. The firing rates of the ChV− units (n = 75; Fig. 10B, left) were significantly different from those of the FOE+ units (n = 120; Fig. 10C, left), and the difference was especially large when the airpuff size was large and the reward size was small (two-sample t test, p < 0.01; Fig. 11B). The two-dimensional correlation coefficient between ChV− and FOE+ matrices was 0.66.
By contrast, in the Ap–Ap task, ChV and motivation were similar, and the firing patterns of ChV units became similar to those of FOE units. These two types of unit fired in the low–low combined offers, in which the two reward offers associated with the different targets were both low. The mean firing rates of the ChV+ units (n = 119; Fig. 10A, middle) were significantly different from those of the FOE− units (n = 74; Fig. 10D, middle) in the high reward condition (two-sample t test, 0.01 < p < 0.05; Fig. 11C), but the difference was smaller than that in the Ap–Av task (Fig. 11A). The two-dimensional correlation coefficient between the ChV+ and FOE− matrices was 0.79, indicating that the activities of these units were similar. The mean firing rates of the ChV− units (n = 75; Fig. 10B, middle) and those of the FOE+ units (n = 120; Fig. 10C, middle) were not significantly different for all combined offers (two-sampled t test, p > 0.05; Fig. 11D). The two-dimensional correlation coefficient between ChV− and FOE+ matrices was 0.85, also indicating similar activities. These results suggest that the FOE and ChV units exhibited similar responses to the combined offers in the Ap–Ap task.
Discussion
Affective judgments about whether to accept or to reject an offer are determined by likes and dislikes, and these affective evaluations simultaneously trigger motivational responses that govern dynamic aspects of the decision-making processes. Because of this complex relationship between affective judgment and motivation, only a few studies dissociated these two neuronal processes in macaque monkeys (Roesch and Olson, 2004). By introducing an aversive stimulus as a component of the combined offer, we attempted to dissociate neuronal activity related to affective judgment from neuronal activity related to the motivation to perform the decision-making task. In the Ap–Av task, the ChV could be increased by cues indicating increasing reward but could be decreased by simultaneously delivered cues indicating increasing punishment. By contrast, the level of motivation to perform the task could be decreased by cues indicating decreases in both reward and punishment, inducing delay and occasional deferment in performance. We analyzed the activity exhibited by populations of dlPFC and ACC neurons during the decision period of the task and classified the recorded units based on the response properties to the behavioral parameters in the Ap–Av task. We found a different emphasis in the dlPFC and ACC units for motivational regulation. Despite many similarities in their responses, the dlPFC population included a greater proportion of neurons with activity related to upcoming delayed reaction and low motivational condition than did the ACC. Furthermore, the responses of the motivation-type units in the dlPFC occurred significantly earlier than those in the ACC. These results suggest that these two regions could function cooperatively but with differential emphases in affective judgment and motivational drive.
Dissociated groups of neurons related to motivation and affective judgments
Our findings are allied with studies of human value evaluation. The valence of evaluation and arousal during positive and negative valuation are considered to be dissociable, given that corresponding subjective preferences and physiological measurements of arousal are not always correlated positively (Lang et al., 1990; Bradley et al., 2001). For example, ratings of interest, viewing time, pupil size, respiration rate, and skin conductance can be increased by both pictures that are positively rated in terms of subjective preference and those that are negatively rated (Lang et al., 1998; Bradley et al., 2008; Keil et al., 2008), suggesting that states of arousal in the valuation process can be activated by stimuli of both positive and negative valence (Lang et al., 1990; Seymour et al., 2007; Codispoti et al., 2009; Lang and Bradley, 2010).
In electrophysiological studies in nonhuman primates, reward-related activity of neurons in the frontal cortex has been reported extensively (Watanabe, 1996; Kobayashi et al., 2002; Amemori and Sawaguchi, 2006; Kennerley and Wallis, 2009), and both the dlPFC and ACC have been implicated in value-based decision-making (Lee et al., 2007; Rushworth et al., 2011; Hosokawa et al., 2013). Responses of frontal cortical neurons related to aversive stimuli have also been reported (Koyama et al., 1998; Kobayashi et al., 2006). However, few studies have addressed the question of whether these neurons can differentiate valence and motivational processes. An exception is the study by Roesch and Olson (2004), who recorded neuronal activity from the orbitofrontal cortex and premotor cortex as monkeys performed a task with different visual cues indicating the size of the reward for correct performance and the size of punishment for error performance. In contrast to the task that we used here, in the task used by these investigators, the monkeys could ignore the aversive cue and choose the reward-related cue by making a saccade. Orbitofrontal neurons responded preferentially to the valence parameter (offered reward size), whereas premotor cortical neurons responded to motivational parameter (task performance and RT).
Task performance in low motivational conditions was dissociable from task disengagement
Motivation affects the dynamics of decision-making processes (Tversky and Shafir, 1992; Watanabe, 1998; Diederich, 2003) and the percentage of correct task performance (Hassani et al., 2001; Shidara and Richmond, 2002). These behavioral findings raised the possibility that the dynamic aspect of neuronal processing related to decision-making was modulated by the different motivational conditions. Accordingly, we examined the neuronal onset of decision-making represented by the ChV units, units that were sensitive to the anticipated value of the eventually chosen outcome. We found that the discrimination onsets were, as expected, delayed in the low motivational conditions in which the FOE units were activated. However, the FOE units that exhibited differential activity in relation to motivational conditions did not overlap with neurons with activity that predicted omission errors. Thus, it is unlikely that the activity of FOE units is simply attributable to the error-related disengagement.
In a series of previous studies, the interaction between the dlPFC and ACC in primates has been examined in terms of attentional modulation (Rothé et al., 2011). The dlPFC response was shown to capture early-onset attentional selectivity (Kaping et al., 2011), whereas the ACC has been suggested to be particularly relevant to detecting errors indicating failures of inhibitory control of the attentional focus (Shen et al., 2014). In parallel with the error-related signals reported in the ACC, we found that a large proportion of the dlPFC units responded to the low motivational condition in which the monkeys patiently performed the task. These results suggest that motivational regulation for the low payoff condition could be different from the actual behavioral disengagement from task performance that we recorded here as omission errors.
Differential emphasis of affective judgment and motivation among the responses of dlPFC and ACC units
Roesch and Olson (2004), in studying macaques performing tasks requiring motor actions in relation to motivation, found sharp differences between the properties of units in the OFC and premotor cortex. Here, we observed many similarities between the response properties of units recorded in the dlPFC and ACC but also found that the ACC populations contained larger proportions of units encoding offered values (Rew and Ave units), and the dlPFC population contained larger proportions of units encoding motivation-related variables (RT and FOE units). We further found that the FOE units in the dlPFC population were activated earlier than those in the ACC, possibly to lead in time the motivational processing during the decision-making.
The primate dlPFC has long been implicated in cognitive control (Dias et al., 1996; Miller and Cohen, 2001; Koechlin et al., 2003), including suppression of impulsive (Iversen and Mishkin, 1970; Dias et al., 1997) and reflexive (Sasaki et al., 1989; Hasegawa et al., 2004; Kuwajima and Sawaguchi, 2007) responses. In another line of studies, behavioral inhibition has been recognized as a critical process induced by negative emotion (Gray, 1981, 1982; Carver and White, 1994), and the human dlPFC has been implicated in this process (Shackman et al., 2009). Our results suggest that many dlPFC units were activated when the monkeys performed the task in low payoff conditions and that this activity was not related to errors of omission. The FOE+ units that had increased activity in the low payoff conditions could thus be related to the ability to perform patiently the task in the small reward conditions to gain large reward in later trials, a situation in which the dlPFC has been implicated by previous work (McClure et al., 2004; Tsujimoto and Sawaguchi, 2005; Figner et al., 2010).
Concurrent neuronal processing of affective judgment and motivation
We found that the information encoded by the neurons related to affective judgment and motivation could be preserved across the Ap–Av and Ap–Ap tasks. In principle, reward incentives enhance both the motivation to perform the task and the utility of acceptance, whereas the threat of potential punishment enhances the motivation to avoid but reduces the utility of acceptance. With the Ap–Av task, we were able to dissociate neurons encoding utility from those encoding motivation here defined as the correlate of the FOE. We found that the level of motivation in goal-directed behavior, at least in part, is processed by neuronal populations that are dissociable from populations dealing with affective judgment of the goal value. However, in the Ap–Ap task, the same neuronal populations were indistinguishable from one another, because the utility of acceptance was positively correlated with the approach motivation when the behavioral goal consisted of only rewards.
The two dissociable processes observed in the Ap–Av task, affective judgment and motivation, were concurrently activated during decision-making, raising the possibility that goal-directed behavior is organized by parallel processing dealing with behavioral regulation and goal evaluation. Our findings suggest that both the dorsolateral prefrontal and the anterior cingulate regions of the neocortex are involved in the integration of cost and benefit to derive affective judgments, and that the dlPFC could lead in time the regulation of motivational levels to govern the dynamic aspects of goal-directed behavior. Thus, the ACC and dlPFC could function in a cooperative manner during decision-making, with different relative assignments within this cooperative processing.
Footnotes
This work was supported by National Institutes of Health Grant R01 NS025529, Office of Naval Research Grant N00014-07-1-0903, CHDI Foundation Grant A-5552, the Simons Center for the Social Brain, and the National Parkinson Foundation (Lynn Diamond Fellowship). We thank Margo Cantor for her skillful assistance in training monkeys and surgical procedures, Drs. Simon Hong, Hideki Shimazu, and Patrick Tierney for their insightful comments and suggestions, Dr. Daniel Gibson for his help with data analysis, Alex McWhinnie for editing figures, and Dr. Yasuo Kubota for help preparing this manuscript.
The authors declare no competing financial interests.
References
- Amemori K, Graybiel AM. Localized microstimulation of primate pregenual cingulate cortex induces negative decision-making. Nat Neurosci. 2012;15:776–785. doi: 10.1038/nn.3088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amemori K, Sawaguchi T. Contrasting effects of reward expectation on sensory and motor memories in primate prefrontal neurons. Cereb Cortex. 2006;16:1002–1015. doi: 10.1093/cercor/bhj042. [DOI] [PubMed] [Google Scholar]
- Barraclough DJ, Conroy ML, Lee D. Prefrontal cortex and decision making in a mixed-strategy game. Nat Neurosci. 2004;7:404–410. doi: 10.1038/nn1209. [DOI] [PubMed] [Google Scholar]
- Belsley DA, Kuh E, Welsch RE. Regression diagnostics: Identifying influential data and sources of collinearity. New York: Wiley; 1980. [Google Scholar]
- Bradley MM, Codispoti M, Cuthbert BN, Lang PJ. Emotion and motivation I: Defensive and appetitive reactions in picture processing. Emotion. 2001;1:276–298. doi: 10.1037/1528-3542.1.3.276. [DOI] [PubMed] [Google Scholar]
- Bradley MM, Miccoli L, Escrig MA, Lang PJ. The pupil as a measure of emotional arousal and autonomic activation. Psychophysiology. 2008;45:602–607. doi: 10.1111/j.1469-8986.2008.00654.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai X, Padoa-Schioppa C. Neuronal encoding of subjective value in dorsal and ventral anterior cingulate cortex. J Neurosci. 2012;32:3791–3808. doi: 10.1523/JNEUROSCI.3864-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carver CS, White TL. Behavioral inhibition, behavioral activation, and affective responses to impending reward and punishment: the BIS/BAS scales. J Pers Soc Psychol. 1994;67:319–333. doi: 10.1037/0022-3514.67.2.319. [DOI] [Google Scholar]
- Codispoti M, Mazzetti M, Bradley MM. Unmasking emotion: Exposure duration and emotional engagement. Psychophysiology. 2009;46:731–738. doi: 10.1111/j.1469-8986.2009.00804.x. [DOI] [PubMed] [Google Scholar]
- Dias R, Robbins TW, Roberts AC. Dissociation in prefrontal cortex of affective and attentional shifts. Nature. 1996;380:69–72. doi: 10.1038/380069a0. [DOI] [PubMed] [Google Scholar]
- Dias R, Robbins TW, Roberts AC. Dissociable forms of inhibitory control within prefrontal cortex with an analog of the Wisconsin card sort test: restriction to novel situations and independence from “on-line” processing. J Neurosci. 1997;17:9285–9297. doi: 10.1523/JNEUROSCI.17-23-09285.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Diederich A. Decision making under conflict: Decision time as a measure of conflict strength. Psychon Bull Rev. 2003;10:167–176. doi: 10.3758/BF03196481. [DOI] [PubMed] [Google Scholar]
- Feingold J, Desrochers TM, Fujii N, Harlan R, Tierney PL, Shimazu H, Amemori K, Graybiel AM. A system for recording neural activity chronically and simultaneously from multiple cortical and subcortical regions in nonhuman primates. J Neurophysiol. 2012;107:1979–1995. doi: 10.1152/jn.00625.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Figner B, Knoch D, Johnson EJ, Krosch AR, Lisanby SH, Fehr E, Weber EU. Lateral prefrontal cortex and self-control in intertemporal choice. Nat Neurosci. 2010;13:538–539. doi: 10.1038/nn.2516. [DOI] [PubMed] [Google Scholar]
- Gray JA. A critique of Eysenck's theory of personality. In: Eysenck H, editor. A model for personality. Berlin: Springer; 1981. pp. 246–276. [Google Scholar]
- Gray JA. The neuropsychology of anxiety: an enquiry into the functions of the septo-hippocampal system. Oxford, UK: Clarendon, Oxford UP; 1982. [Google Scholar]
- Hasegawa RP, Peterson BW, Goldberg ME. Prefrontal neurons coding suppression of specific saccades. Neuron. 2004;43:415–425. doi: 10.1016/j.neuron.2004.07.013. [DOI] [PubMed] [Google Scholar]
- Hassani OK, Cromwell HC, Schultz W. Influence of expectation of different rewards on behavior-related neuronal activity in the striatum. J Neurophysiol. 2001;85:2477–2489. doi: 10.1152/jn.2001.85.6.2477. [DOI] [PubMed] [Google Scholar]
- Hosokawa T, Kennerley SW, Sloan J, Wallis JD. Single-neuron mechanisms underlying cost-benefit analysis in frontal cortex. J Neurosci. 2013;33:17385–17397. doi: 10.1523/JNEUROSCI.2221-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Isoda M, Hikosaka O. A neural correlate of motivational conflict in the superior colliculus of the macaque. J Neurophysiol. 2008;100:1332–1342. doi: 10.1152/jn.90275.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iversen SD, Mishkin M. Perseverative interference in monkeys following selective lesions of the inferior prefrontal convexity. Exp Brain Res. 1970;11:376–386. doi: 10.1007/BF00237911. [DOI] [PubMed] [Google Scholar]
- Kaping D, Vinck M, Hutchison RM, Everling S, Womelsdorf T. Specific contributions of ventromedial, anterior cingulate, and lateral prefrontal cortex for attentional selection and stimulus valuation. PLoS Biol. 2011;9:e1001224. doi: 10.1371/journal.pbio.1001224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keil A, Smith JC, Wangelin BC, Sabatinelli D, Bradley MM, Lang PJ. Electrocortical and electrodermal responses covary as a function of emotional arousal: a single-trial analysis. Psychophysiology. 2008;45:516–523. doi: 10.1111/j.1469-8986.2008.00667.x. [DOI] [PubMed] [Google Scholar]
- Kennerley SW, Wallis JD. Reward-dependent modulation of working memory in lateral prefrontal cortex. J Neurosci. 2009;29:3259–3270. doi: 10.1523/JNEUROSCI.5353-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kennerley SW, Walton ME, Behrens TEJ, Buckley MJ, Rushworth MFS. Optimal decision making and the anterior cingulate cortex. Nat Neurosci. 2006;9:940–947. doi: 10.1038/nn1724. [DOI] [PubMed] [Google Scholar]
- Kennerley SW, Behrens TE, Wallis JD. Double dissociation of value computations in orbitofrontal and anterior cingulate neurons. Nat Neurosci. 2011;14:1581–1589. doi: 10.1038/nn.2961. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knoch D, Gianotti LRR, Pascual-Leone A, Treyer V, Regard M, Hohmann M, Brugger P. Disruption of right prefrontal cortex by low-frequency repetitive transcranial magnetic stimulation induces risk-taking behavior. J Neurosci. 2006;26:6469–6472. doi: 10.1523/JNEUROSCI.0804-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kobayashi S, Lauwereyns J, Koizumi M, Sakagami M, Hikosaka O. Influence of reward expectation on visuospatial processing in macaque lateral prefrontal cortex. J Neurophysiol. 2002;87:1488–1498. doi: 10.1152/jn.00472.2001. [DOI] [PubMed] [Google Scholar]
- Kobayashi S, Nomoto K, Watanabe M, Hikosaka O, Schultz W, Sakagami M. Influences of rewarding and aversive outcomes on activity in macaque lateral prefrontal cortex. Neuron. 2006;51:861–870. doi: 10.1016/j.neuron.2006.08.031. [DOI] [PubMed] [Google Scholar]
- Koechlin E, Ody C, Kouneiher F. The architecture of cognitive control in the human prefrontal cortex. Science. 2003;302:1181–1185. doi: 10.1126/science.1088545. [DOI] [PubMed] [Google Scholar]
- Koyama T, Tanaka YZ, Mikami A. Nociceptive neurons in the macaque anterior cingulate activate during anticipation of pain. Neuroreport. 1998;9:2663–2667. doi: 10.1097/00001756-199808030-00044. [DOI] [PubMed] [Google Scholar]
- Kuwajima M, Sawaguchi T. Involvement of the lateral prefrontal cortex in conditional suppression of gaze shift. Neurosci Res. 2007;59:431–445. doi: 10.1016/j.neures.2007.08.013. [DOI] [PubMed] [Google Scholar]
- Lang PJ, Bradley MM. Emotion and the motivational brain. Biol Psychol. 2010;84:437–450. doi: 10.1016/j.biopsycho.2009.10.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lang PJ, Bradley MM, Cuthbert BN. Emotion, attention, and the startle reflex. Psychol Rev. 1990;97:377–395. doi: 10.1037/0033-295X.97.3.377. [DOI] [PubMed] [Google Scholar]
- Lang PJ, Bradley MM, Cuthbert BN. Emotion, motivation, and anxiety: brain mechanisms and psychophysiology. Biol Psychiatry. 1998;44:1248–1263. doi: 10.1016/S0006-3223(98)00275-3. [DOI] [PubMed] [Google Scholar]
- Lee D, Rushworth MF, Walton ME, Watanabe M, Sakagami M. Functional specialization of the primate frontal cortex during decision making. J Neurosci. 2007;27:8170–8173. doi: 10.1523/JNEUROSCI.1561-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matsumoto K, Suzuki W, Tanaka K. Neuronal correlates of goal-based motor selection in the prefrontal cortex. Science. 2003;301:229–232. doi: 10.1126/science.1084204. [DOI] [PubMed] [Google Scholar]
- McClure SM, Laibson DI, Loewenstein G, Cohen JD. Separate neural systems value immediate and delayed monetary rewards. Science. 2004;306:503–507. doi: 10.1126/science.1100907. [DOI] [PubMed] [Google Scholar]
- McFadden D. Conditional logit analysis of qualitative choice behavior. In: Zarembka P, editor. Frontiers in econometrics. New York: Academic; 1974. pp. 105–142. [Google Scholar]
- Miller EK, Cohen JD. An integrative theory of prefrontal cortex function. Annu Rev Neurosci. 2001;24:167–202. doi: 10.1146/annurev.neuro.24.1.167. [DOI] [PubMed] [Google Scholar]
- Miller NE. Selected papers on conflict, displacement, learned drives and theory. Experimental studies of conflict. Chicago: Aldine-Atherton; 1971. Experimental studies of conflict. [Google Scholar]
- Padoa-Schioppa C. Neurobiology of economic choice: a good-based model. Annu Rev Neurosci. 2011;34:333–359. doi: 10.1146/annurev-neuro-061010-113648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Padoa-Schioppa C, Assad JA. Neurons in the orbitofrontal cortex encode economic value. Nature. 2006;441:223–226. doi: 10.1038/nature04676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roesch MR, Olson CR. Neuronal activity related to reward value and motivation in primate frontal cortex. Science. 2004;304:307–310. doi: 10.1126/science.1093223. [DOI] [PubMed] [Google Scholar]
- Rothé M, Quilodran R, Sallet J, Procyk E. Coordination of high gamma activity in anterior cingulate and lateral prefrontal cortical areas during adaptation. J Neurosci. 2011;31:11110–11117. doi: 10.1523/JNEUROSCI.1016-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987;20:53–65. doi: 10.1016/0377-0427(87)90125-7. [DOI] [Google Scholar]
- Rushworth MF, Noonan MP, Boorman ED, Walton ME, Behrens TE. Frontal cortex and reward-guided learning and decision-making. Neuron. 2011;70:1054–1069. doi: 10.1016/j.neuron.2011.05.014. [DOI] [PubMed] [Google Scholar]
- Sasaki K, Gemba H, Tsujimoto T. Suppression of visually initiated hand movement by stimulation of the prefrontal cortex in the monkey. Brain Res. 1989;495:100–107. doi: 10.1016/0006-8993(89)91222-5. [DOI] [PubMed] [Google Scholar]
- Seo H, Lee D. Temporal filtering of reward signals in the dorsal anterior cingulate cortex during a mixed-strategy game. J Neurosci. 2007;27:8366–8377. doi: 10.1523/JNEUROSCI.2369-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seymour B, Singer T, Dolan R. The neurobiology of punishment. Nat Rev Neurosci. 2007;8:300–311. doi: 10.1038/nrn2119. [DOI] [PubMed] [Google Scholar]
- Shackman AJ, McMenamin BW, Maxwell JS, Greischar LL, Davidson RJ. Right dorsolateral prefrontal cortical activity and behavioral inhibition. Psychol Sci. 2009;20:1500–1506. doi: 10.1111/j.1467-9280.2009.02476.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen C, Ardid S, Kaping D, Westendorff S, Everling S, Womelsdorf T. Anterior cingulate cortex cells identify process-specific errors of attentional control prior to transient prefrontal-cingulate inhibition. Cereb Cortex. 2014 doi: 10.1093/cercor/bhu028. doi: 10.1093/cercor/bhu028. Advance online publication. Retrieved December 15, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shidara M, Richmond BJ. Anterior cingulate: Single neuronal signals related to degree of reward expectancy. Science. 2002;296:1709–1711. doi: 10.1126/science.1069504. [DOI] [PubMed] [Google Scholar]
- Talmi D, Dayan P, Kiebel SJ, Frith CD, Dolan RJ. How humans integrate the prospects of pain and reward during choice. J Neurosci. 2009;29:14617–14626. doi: 10.1523/JNEUROSCI.2026-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Train K. Discrete choice methods with simulation. New York: Cambridge UP; 2003. [Google Scholar]
- Tsujimoto S, Sawaguchi T. Neuronal activity representing temporal prediction of reward in the primate prefrontal cortex. J Neurophysiol. 2005;93:3687–3692. doi: 10.1152/jn.01149.2004. [DOI] [PubMed] [Google Scholar]
- Tversky A, Shafir E. Choice under conflict: the dynamics of deferred decision. Psychol Sci. 1992;3:358–361. doi: 10.1111/j.1467-9280.1992.tb00047.x. [DOI] [Google Scholar]
- Vogel JR, Beer B, Clody DE. A simple and reliable conflict procedure for testing anti-anxiety agents. Psychopharmacology. 1971;21:1–7. doi: 10.1007/BF00403989. [DOI] [PubMed] [Google Scholar]
- Von Neumann J, Morgenstern O. Theory of games and economic behavior. Princeton: Princeton UP; 1947. [Google Scholar]
- Watanabe M. Reward expectancy in primate prefrontal neurons. Nature. 1996;382:629–632. doi: 10.1038/382629a0. [DOI] [PubMed] [Google Scholar]
- Watanabe M. Cognitive and motivational operations in primate prefrontal neurons. Rev Neurosci. 1998;9:225–241. doi: 10.1515/revneuro.1998.9.4.225. [DOI] [PubMed] [Google Scholar]
- Watanabe M, Cromwell HC, Tremblay L, Hollerman JR, Hikosaka K, Schultz W. Behavioral reactions reflecting differential reward expectations in monkeys. Exp Brain Res. 2001;140:511–518. doi: 10.1007/s002210100856. [DOI] [PubMed] [Google Scholar]