Abstract
Background:
The current study aimed to further etiological understanding of the psychological mechanisms underlying negative symptoms in people with schizophrenia. Specifically, we tested whether negative symptom severity is associated with reduced retention of reward-related information over time, and thus a degraded ability to utilize such information to guide future action selection.
Methods:
44 patients with a diagnosis of schizophrenia or schizoaffective disorder and 28 healthy volunteers performed a probabilistic reinforcement-learning task involving stimulus pairs in which choices resulted in reward or in loss avoidance. Following training, participants indicated their valuation of learned stimuli in a test/transfer phase. The test/transfer phase was administered immediately following training and one-week later. Percent retention was defined as accuracy at weeklong delay divided by accuracy at immediate delay.
Results:
Both healthy controls and people with schizophrenia showed similarly robust retention of reinforcement learning over a one-week delay interval. However, in the schizophrenia group, negative symptom severity was associated with reduced retention of information regarding the value of actions across a weeklong interval. This pattern was particularly notable for stimuli associated with reward compared to loss avoidance.
Conclusions:
Our results show that, although individuals with schizophrenia may initially learn about rewarding aspects of their environment, such learning decays at a more rapid rate in patients with severe negative symptoms. Thus, previously learned reward-related information may be more difficult to access to guide future decision-making and to motivate action selection.
Keywords: Schizophrenia, Reinforcement Learning, Motivation, Negative Symptoms, Memory, Experimental Psychopathology
Introduction:
Negative symptoms such as abnormalities in motivation, are a core feature of schizophrenia (1). Such symptoms are both debilitating and resistant to current interventions (2). Thus, negative symptoms represent a critical and unmet clinical therapeutic target. In recent years, two lines of research have revealed insights into the psychological mechanisms underlying negative symptoms in schizophrenia and their associated neural correlates. First, people with schizophrenia show hedonic responses similar to controls when experiencing rewarding stimuli, but diminished responses, compared to controls (3-4), when anticipating future rewarding experiences (5-8). Thus, intact experiences of reward do not appear to have the expected impact on the subsequent initiation of goal-directed action. Second, people with schizophrenia have difficulty using the experience of reward receipt to generate and update mental representations of prospective value for actions (9-11). The results from both of these lines of research suggest difficulty using information about previous experiences to facilitate later decision-making and action initiation (12). In the current manuscript, we argue that difficulty using previously rewarding experiences to guide future behavior may be linked to poor long-term memory retention of positively valenced reward information in schizophrenia. Specifically, we test whether learning decays at a more rapid rate in patients with high negative symptom severity and thus is not fully available to guide future decision-making and behavior.
In attempting to understand how representations of value for particular actions are generated, modified, and utilized for future decision-making, researchers have relied on reinforcement learning (RL) frameworks (13). Generally, in RL models, values for actions are shaped by reward prediction errors (RPEs) – differences between expected and actual outcomes. RPEs have been closely tied to the activity of dopaminergic cells in the midbrain, with cells firing more rapidly for better-than-expected outcomes (positive RPEs) and ceasing to fire for worse-than-expected outcomes (negative RPEs) (14). Multiple studies have examined RL in schizophrenia (15-27,27-33). Results typically show that simple and/or implicit forms of RL are largely intact in schizophrenia (15), and that deficits emerge when learning is explicit and task conditions are difficult. Further, some (21,34-36) but not all (37-39) studies of RL have suggested impaired learning from rewarding stimuli but intact learning to avoid loss in schizophrenia. This dissociation provides an intriguing account of reduced goal-oriented behavior, such that individuals fail to seek rewarding aspects of their environment while also avoiding aspects associated with loss.
There has been remarkably little work in human subjects documenting the long-term retention of RL (40-42). This lack of evidence is surprising as, in real life, action preferences are often shaped by repeated outcomes across days, weeks, or months and the durability of this knowledge is basically presumed rather than actually being demonstrated. In one exception, Wimmer et al., conducted a study examining memory for value representations at a 3-week delay in healthy individuals (42). Results, at delay, showed greater memory for the value of trained stimuli when learning occurred in short, condensed sessions distributed over the course of weeks compared to a single “massed” training session. We are unaware of other studies documenting long-term retention of RL in healthy human subjects. In the schizophrenia literature, Herbener implemented an implicit preference conditioning task which associated visual patterns with particular frequencies of reward (41). People with schizophrenia and healthy controls both showed preference for the more frequently rewarded stimulus immediately following training. While controls maintained this preference at a 24-hour delay, people with schizophrenia did not, suggestive of difficulty in maintaining stimulus-reward relationships over time. Negative symptoms did not relate to delay performance, possibly due to the implicit nature of the learning paradigm. In other studies, individuals with schizophrenia have shown relatively intact performance on implicit RL paradigms and no systematic relationships between RL performance and negative symptom severity (15,43).
Several years ago, our group published a study examining probabilistic RL in schizophrenia (21). In a training phase, participants were presented with four separate pairs of stimuli, one at a time. Two of the stimulus pairs were associated with potential gain and the other two pairs were associated with avoiding a potential loss. Immediately following training, a test/transfer phase was administered. In this phase, participants were presented with both the original training pairs, as well as novel pairing (i.e., pairings of two stimuli, each from a separate training pair) and asked to choose the stimulus they preferred. Participants did not receive feedback in the test/transfer phase. People with schizophrenia with severe negative symptoms showed impaired learning of rewarding pairs compared to controls; however, the groups were similar in learning to avoid losses. Further, in the test/transfer phase, controls showed a clear preference for stimuli that frequently led to reward versus stimuli that frequently avoided loss, whereas people with schizophrenia did not. Finally, using a hybrid computational model with the ability to assess the relative contributions of actor-critic learning (thought to reflect basal-ganglia function) versus Q-learning (thought to reflect orbital frontal cortex function) to decision-making, results showed reduced reliance on Q-learning in people with schizophrenia. Taken together, these results suggested impairment in learning representations of positive value in schizophrenia, and difficulty using such information when making decisions.
Given the small number of studies assessing long-term retention of RL and the potential importance of value representation maintenance to negative symptoms in schizophrenia, we aimed to extend the results of Gold et al. (2012) by analyzing a previously unreported task condition of the original sample. Specifically, the goal of the current study was to examine whether schizophrenia and negative symptom severity are associated with a failure to retain information regarding the expected value of actions over time. To this end, we assessed preferences for task stimuli immediately following learning (reported in the original paper) and at a one-week delay (unreported). We hypothesized that that individuals with schizophrenia would show reduced retention of RL across a weeklong interval compared to controls. Further, we hypothesized that reductions in retention would be most pronounced among people with high negative symptom severity.
Methods:
Participants
Forty-four outpatients meeting DSM-IV criteria for schizophrenia (SZ) or schizoaffective disorder and 28 demographically similar healthy control (HC) subjects participated in the study. SZ subjects had been on a stable medication regimen for at least 4 weeks and were considered clinically-stable at the time of testing. Patients were recruited from the Maryland Psychiatric Research Center and from local outpatient psychiatric clinics.
The HC subjects were recruited from the community via random phone number dialing, Internet and newspaper advertisements, and word of mouth among recruited participants. They had no current Axis I or Axis II diagnoses, as established by the Structured Clinical Interview for DSM-IV-Axis I Disorders (44), and Structured Interview for DSM-IV Personality (45). Control subjects also reported no family history of psychosis and were not taking psychotropic medications. All participants had no history of significant neurological injury or disease and reported no significant medical or substance use disorders. Participants provided informed consent for a protocol approved by the University of Maryland Institutional Review Board.
Symptom and Neuropsychological Assessment:
Negative symptoms were assessed in participants with schizophrenia using a 22-item version of the Scale for the Assessment of Negative Symptoms (SANS) (46). This version omits Attention items, which have been shown to relate more to cognition than negative symptoms (47). Items from the SANS were summed to create an overall score (SANS Total). From the SANS, we also extracted factors assessing the severity of experiential (asociality, anhedonia, and avolition) and expressive (blunted affect and alogia) negative symptoms (21). We report both SANS total, as well as correlations between experiential and expressive symptom domains. The Brief Psychiatric Rating Scale (BPRS) was administered to participants with schizophrenia in order to determine severity of positive symptoms and global expression of symptomatology (48).
Both HC and schizophrenia participants completed measures of word reading (Wide Range Achievement Test – 4th Edition) and the MATRICS (Measurement and Treatment Research to Improve Cognition in Schizophrenia Consensus Cognitive) battery (49). The groups did not significantly differ in age, sex, ethnicity, or parental education. The SZ group reported significantly less personal education than the HC group (Table 1). Symptom severity, medication information, and neuropsychological test scores are also listed in Table 1.
Table 1:
Demographic, Symptom, and Neuropsychological Variables
Variable | HC Group (n= 28) |
SZ Group (n = 44) |
Test Statistic | p-value |
---|---|---|---|---|
Demographic and Clinical Characteristics | ||||
Age, mean (SD) | 41.2 (9.4) | 41.8 (10.0) | t = −0.3 | 0.8 |
Education, mean (SD) | ||||
Participant | 14.8 (1.9) | 12.3 (1.7) | t = 5.9 | <0.01 |
Maternal | 13.1 (2.2) | 13.2 (3.0) | t = −0.1 | 0.92 |
Paternal | 13.5 (2.7) | 13.3 (3.4) | t = 0.3 | 0.59 |
Sex, No. | ||||
Male | 17 | 32 | χ2 = 1.14 | 0.31 |
Female | 11 | 12 | ||
Race/Ethnicity, No. | ||||
Black | 14 | 21 | χ2 = 0.04 | 0.85 |
White | 14 | 23 | ||
Current Prescribed Psychiatric Medications | ||||
Antipsychotics No. | ||||
Atypical | 30 | |||
Typical | 8 | |||
Typical and Atypical | 6 | |||
Antidepressant No. | 11 | |||
Mood Stabilizers No. | 7 | |||
Anxiolytic No. | 5 | |||
Anticholinergic No. | 4 | |||
Neuropsychological Test and Symptom Data | ||||
Clinical Rating Scale, mean (SD) | ||||
BPRS total | 65.0 (37.5) | |||
BPRS positive cluster | 10.0 (4.8) | |||
SANS Total | 36.6 (15.9) | |||
Neuropsychology score | ||||
WRAT | 104.5 (14.5) | 96.7 (13.8) | t = 2.3 | 0.03 |
WTAR | 105.5 (13.8) | 97.2 (13.1) | t = 2.6 | 0.01 |
MATRICS Composite | 48.5 (12.1) | 29.4 (12.8) | t = 6.3 | < 0.01 |
MATRICS visual memory | 46.8 (13.5) | 36.9 (14.5) | t = 2.9 | 0.01 |
Reinforcement-Learning Task:
Training Phase
Participants completed a probabilistic incentive learning task (21). During this task, participants were presented with 4 pairs of landscape stimuli, one pair at a time. Two of the pairs were associated with potential gains; the optimal response was reinforced on 90% of the trials for one pair and 80% of the trials for the other pair. Two of the pairs involved learning to avoid losses; the optimal response avoided losing on 90% of the trials for one pair and 80% of the trials for the other pair. All participants completed a brief 12-trial practice session to ensure task comprehension, followed by 160 learning trials (each pair presented 40 times) with all pair types presented in a randomized order.
Test/Transfer Phase
A test/transfer phase was administered both immediately following the learning phase and at approximately one-week delay (mean = 7.14 days, SD = 0.84). In these 64 trials, the original 4 training pairs were each presented 4 times (16 total trials), and 24 novel pairings were each presented twice (48 total trials). For novel pairings, each stimulus from the training phase was presented with every other stimulus. No feedback was administered during the test/transfer phase. Instead, participants were instructed to pick the item in the pair that they thought was “best” based on their earlier learning. With exception of separate randomization of trial presentation, structure of the immediate and weeklong test/transfer phases was identical. Participants were not told during the initial study visit that they would be asked to repeat aspects of the task one-week later. A manuscript describing the learning and immediate test/transfer phase has already been published (21). The current report focused on percent retention at one-week delay, although we do report broad measures from the training phase (e.g., total accuracy during learning) in order to better characterize retention effects.
For analysis purposes, specific individual pairings in the test/transfer phase were grouped based on whether they appeared in the training phase and according to the frequency and types of outcomes with which individual stimuli were associated in the training phase. The purpose of these groupings was to reduce noise within the data by increasing the number of trials within a condition. First, presentations of the 4 original training pairs (16 presentations) were averaged, creating a single measure of learning for test item pairings (test pairs). Second, presentations where an optimal stimulus (i.e., 80% or 90% winner, 80% or 90% loss avoider) was paired with a suboptimal stimulus (i.e., 20% or 10% winner, 20% or 10% loss avoider) that had been presented with a different stimulus during the initial training phase were averaged, creating a single measure of generalization of trained items to novel decision-making contexts (transfer pairs). Given previous evidence of intact learning from losses but impaired learning from gains in schizophrenia (21, 34-36), we conducted follow-up analyses of stimuli associated with gain and loss avoidance separately.
Statistical Analysis:
Diagnostic group differences were assessed using an independent samples t-test with percent retention (weeklong delay accuracy divided by immediate delay accuracy) as a dependent variable. Analyses examining effects of negative symptoms were conducted both dichotomously (as we have done previously (21)) and continuously. For dichotomous analyses, we conducted an ANOVA with a 3-level between-subjects factor of group (HC, Low-Neg, High Neg) with SZ groups separated based on a median split of the experiential items of the SANS (21) and a dependent variable of percent retention. For continuous analyses, we conducted spearman rank-order correlations between percent retention and symptom severity measures. Follow-up analyses were conducted examining gain and loss avoidance pairings separately. We also conducted a hierarchical logistic regression in order to determine whether choice behavior at weeklong delay was predicted by the difference between frequencies of selecting particular stimuli during training or the difference between the experienced values of stimuli during training (supplement). In this analysis, the dependent variable was performance at one-week delay in order to observe the precise type of information that was retained at delay. Finally, we conducted follow-up analyses excluding individuals whose accuracy measures were not significantly above chance at immediate delay (supplement).
Results:
Figure 1 shows test/transfer phase performance at both immediate and one-week delay. Both groups showed surprisingly high levels of retention for previously optimal stimuli at weeklong delay. Unexpectedly, test-pair percent retention, relative to immediate performance, was similar for the HC and SZ groups (88.2% for HCs; 86.2% for the SZ group; t(70) = 0.52, p = 0.61; Figure 1A). Percent retention for transfer pairs was also similar between groups (t(70) = −0.25, p = 0.81; Figure 1B). Results were similar when using a 3-level group factor (after splitting the patient group according to negative symptom severity), suggesting that retention did not differ between HCs, low negative symptom patients, and high negative symptom patients when analyzed dichotomously (supplement).
Figure 1: Accuracy at immediate and weeklong delay for trained and transfer pairs.
Error bars represent standard error of the mean.
Figure 2 shows the relationship between continuous negative symptom severity (SANS Total) and percent retention. Given the non-normal distribution of study variables spearman rank-order correlations were performed. For test pairs, negative symptom severity was significantly associated with percent retention (Figure 2A; rho = −0.38, p = 0.01), such that individuals with more severe negative symptoms retained less information over time. This effect held when controlling for severity of positive symptoms. The effect was in the same direction but was not significant for transfer pairs (Figure 2D; rho = −0.14; p = 0.38). Associations between positive symptoms, aggregate symptom severity, and percent retention were not significant (p-values > 0.07), suggesting a relationship specifically with negative symptoms (supplement). Negative symptom effects were similar to the aforementioned results when separated into experiential and expressive subdomains, suggesting relations between retention and general negative symptom severity (supplement). When restricting the sample to participants who performed significantly above chance at immediate delay, associations between test-pair retention and negative symptoms was similar to the overall sample but failed to reach significance (supplement).
Figure 2: Scatterplots illustrating associations between information loss and negative symptom severity.
Note removal of outliers does not significantly alter the significance of associations
Effects of Valence - Gain vs. Loss Avoidance
Because several previous studies (including our own) have reported impaired learning from rewarding outcomes but intact learning to avoid losses in people with schizophrenia (21,34-36), we conducted follow-up analyses examining effects of feedback with respect to the negative symptom associations observed above (Figure 3). For test pairs (Figure 2B & 2E), associations between negative symptom severity and retention were similar for gain and loss avoidance (Gain: rho = −0.31; p = 0.04; Loss avoidance: rho = −0.33; p = 0.03; William’s t-test = −0.14, p = 0.89). For transfer pairs (Figure 2C & 2F), associations between negative symptom severity and retention were stronger for gain compared to loss at a trend level (Gain: rho = −0.37; p = 0.01; Loss avoidance: rho = −0.02; p = 0.91; William’s test = −1.8, p = 0.074). Negative symptom effects were similar experiential and expressive subdomains (supplement).
Figure 3: Accuracy at immediate and weeklong delay, separated by feedback type, for test and transfer pairs.
Error bars represent standard error of the mean.
Correlations with Cognition and Learning Phase Task Performance
We addressed contributions of cognitive impairment and learning phase accuracy in two ways. First, we conducted spearman correlations between percent retention and 1) standardized neurocognitive measures (MATRICS Composite Score, WRAT4 Standard Score), 2) performance during the training phase (total accuracy). These analyses revealed that neurocognitive measures and training phase performance were not significantly correlated with percent retention (Table 2). However, training phase performance, immediate delay accuracy, and weeklong delay accuracy were significantly correlated (supplement). Second, we conducted a multiple regression analysis including immediate test/transfer phase accuracy, training phase accuracy, MATRICS Composite score, WRAT4, and SANS Total Score as simultaneous predictors of percent retention. In these analyses, negative symptoms remained a significant predictor of percent retention (Table 3). Correlations between negative symptoms and percent retention remained significant when including equivalent dose as a covariate (supplement).
Table 2:
Spearman Correlations between Training Phase Performance, Neurocognitive measures, and percent retention.
Healthy Control | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Test Pairs | Transfer Pairs | |||||||||||
All | Gain | Loss | All | Gain | Loss | |||||||
rho | p | rho | p | rho | p | rho | p | rho | p | rho | p | |
LEARNING ACC | −0.26 | 0.19 | −0.05 | 0.81 | −0.25 | 0.20 | 0.07 | 0.74 | 0.08 | 0.69 | −0.15 | 0.44 |
MATRICS TOTAL | 0.03 | 0.88 | 0.16 | 0.43 | 0.09 | 0.65 | 0.01 | 0.95 | 0.12 | 0.56 | −0.10 | 0.60 |
WRAT4SS | −0.23 | 0.24 | −0.14 | 0.49 | −0.05 | 0.80 | −0.18 | 0.35 | 0.04 | 0.83 | −0.23 | 0.23 |
Schizophrenia | ||||||||||||
Test Pairs | Transfer Pairs | |||||||||||
All | Gain | Loss | All | Gain | Loss | |||||||
rho | p | rho | p | rho | p | rho | p | rho | p | rho | p | |
LEARNING ACC | −0.24 | 0.13 | −0.12 | 0.44 | −0.28 | 0.07 | −0.07 | 0.65 | −0.05 | 0.76 | −0.10 | 0.53 |
MATRICS TOTAL | −0.05 | 0.74 | −0.01 | 0.94 | −0.13 | 0.43 | −0.01 | 0.93 | 0.13 | 0.39 | −0.12 | 0.45 |
WRAT4SS | 0.16 | 0.31 | 0.21 | 0.17 | 0.03 | 0.88 | 0.17 | 0.28 | 0.12 | 0.46 | 0.13 | 0.42 |
Table 3:
Multiple Regression Analyses predicting long delay performance from negative symptoms, learning phase performance, immediate delay performance, and neurocognitive measures.
ALL TEST PAIRS (DEPENDENT VARIABLE PERCENT RETENTION) | ||||
---|---|---|---|---|
Beta | SE | t-value | p-value | |
SANS TOTAL | −0.008 | 0.003 | −2.255 | 0.03 |
IMMEDIATE DELAY | −0.002 | 0.005 | −0.296 | 0.769 |
MATRICS OVERALL | 0.001 | 0.004 | 0.191 | 0.85 |
WRAT4SS | −0.152 | 0.308 | −0.495 | 0.624 |
LEARNING PHASE ACCURACY | 0.373 | 0.469 | 0.796 | 0.431 |
GAIN TRANSFER PAIRS (DEPENDENT VARIABLE PERCENT RETENTION) | ||||
Beta | SE | t-value | p-value | |
SANS TOTAL | −0.009 | 0.003 | −3.017 | 0.005 |
MATRICS OVERALL | −0.003 | 0.004 | −0.629 | 0.533 |
WRAT4SS | 0.003 | 0.004 | 0.71 | 0.482 |
IMMEDIATE DELAY | −0.903 | 0.367 | −2.461 | 0.019 |
LEARNING PHASE ACCURACY | 0.525 | 0.544 | 0.964 | 0.341 |
Experienced Value vs. Frequency of Choice
Because retention of value-based choices over long delays has not been extensively studied, and the above analyses do not indicate exactly what was retained, it is plausible that rather than retaining the learned value of each stimulus, participants simply remembered which stimulus they had selected most often during the initial training session and used that information to make decisions. Such stimulus-response learning would facilitate appropriate generalization in most cases but is nevertheless dissociable from learned values, given individual participant histories of choices and outcomes. Indeed, logistic regression analysis confirmed that both prior frequency of selection and experienced expected values were independent predictors of week-long transfer performance (supplement). Notably, however, negative symptoms were parametrically and selectively related to decreased reliance on expected value to make decisions at delay (Figure 4). This effect was similar for expressive and experiential negative symptoms.
Figure 4: Correlations between Experienced Value and Frequency and Negative Symptoms:
Individual dots represent the effect of experienced value (left) and frequency (right) on choice behavior at weeklong delay for each individual subject. Results showed that the effect of experienced value, but not frequency, varied parametrically as a function of negative symptom severity. Thus, individuals with high negative symptom severity do not appear to be utilizing experienced value information, to the same degree as healthy controls, when making decisions at weeklong delay.
Discussion:
We document several important findings from both basic and clinical science perspectives. First, we show robust percent retention of RL over a one-week delay interval. This was true across groups. While strong long-term retention of action-outcome contingencies is well-documented (50-52), particularly in the motor and procedural learning literatures, remarkably little evidence of such effects has been provided using choice-based, RL paradigms in human participants (42). The current result addresses this lacuna in the literature, demonstrating that individuals not only retain stimulus-response contingencies, but also mental representations of value for particular stimuli.
From a clinical perspective, we show that individual differences in percent retention, for rewarding stimuli, are related to negative symptom severity. Such findings are consistent with recent theories suggesting that people with schizophrenia show difficulty in using information about previously rewarding experiences to facilitate future decision-making (3,9,10,12). Specifically, we provide preliminary evidence for a hypothesis that these difficulties might arise, in part, due to a failure to retain high-fidelity representations of the value of actions across time, and thus a degraded ability to utilize such representations to guide future action selection. Finally, it is striking that we did not observe significant diagnostic group differences. We consider this noteworthy as people with schizophrenia typically perform more poorly than HCs across most demanding cognitive tasks. Thus, our failure to find between-group effects was unexpected.
As stated in the introduction, the current work was largely motivated by Gold et al., 2012 who found that negative symptoms were associated with deficit representations of positive expected value (21). Our manuscript extends this finding by showing that high negative symptom patients have difficulties maintaining reward-related information over the course of a week. Further, we show that people with high negative symptom severity are less likely to use expected value to make decisions at weeklong delay, similar to findings of reduced reliance on Q-learning (thought to be driven by orbital frontal cortex function) in schizophrenia. Despite these consistencies with prior results, some discrepancies must also be noted. First, while we observed significant correlations between negative symptom severity and percent retention using continuous measures, we failed to find significant relationships between retention and negative symptom severity when using a dichotomous approach. Second, while Gold et al., 2012 reported correlations between RL measures and experiential negative symptoms, the current analyses reveal associations between negative symptom severity and percent retention of RL more broadly with relatively equivalent correlations for both experiential and expressive domains.
The current findings are also partly consistent with Herbener (2009) demonstrating reduced long-term retention of preferences for implicitly learned stimuli at a 24-hour delay in schizophrenia. While we failed to observe such group differences, we did find associations between negative symptom severity and percent retention of explicitly-learned preferences. The observed differences between our results and the findings of Herbener may be due to considerable differences in task design. First, Herbener assessed implicit RL (i.e., participants were not aware of associations between stimuli and probability of reward receipt), whereas our design assessed explicit RL. Importantly, implicit RL has not been shown to be associated with negative symptoms in previous reports (43, 53). Second, feedback in the Herbener paradigm was predetermined regardless of the participant’s choices, whereas feedback in our design depended on participant choice. Third, Herbener assessed retention at a 24-hour delay, whereas we assessed retention at one-week delay.
The current manuscript is also partially consistent with an older literature examining emotional memory in schizophrenia (54). Although the findings from this literature are mixed, studies assessing memory at longer (>24 hrs) delay (41, 55-58) report deficits in retention of emotional information for people with schizophrenia compared to HCs more consistently than those assessed at shorter delays (56, 59-62). Several of these reports have shown enhanced recall of negatively-valenced stimuli than for positively-valenced stimuli in SZ (54). Such findings are consistent with the current result of poor percent retention of gain information in individuals with severe negative symptoms.
Despite the large body of RL research in human subjects, there has been remarkably little work regarding how value representations are retained beyond a single testing session (40-42,55-58). The current report extends such findings by suggesting that representations of value for learned stimuli are retained at very high levels, over the course of a weeklong delay. While the current findings are behavioral, it is important to consider the potential neural substrates that may be underlying reduced retention of value representations in high negative symptom patients. We might speculate, given previous evidence of degraded expected value representations in people with schizophrenia with severe negative symptoms (21), that reduced top-down frontal communication with the basal ganglia could account for reduced retention. Consistent with this account, high negative symptom patients showed a weaker effect of expected value on decision-making at one-week delay. However, reduced retention could reflect deficit dopamine-dependent mechanisms in the basal ganglia and frontal cortex, memory encoding mechanisms within the hippocampus, or an interaction among frontal, hippocampal, and striatal systems. Interestingly, a recent study in healthy participants found that striatal RPE signaling strengthens formation of episodic memories, suggesting interplay between striatal and hippocampal systems during RL (63). Thus, future clinical studies may benefit from leveraging work in the basic sciences that attempts to link RL with memory consolidation (63-66).
Another important direction for future research would be to determine the potential roles of incentive salience in modulating observed effects. In the current study, participants were not told that they would repeat portions of the experimental task at a one-week delay. Recent work has suggested that the relative contributions of various RL systems to decision-making varies depending on incentive contexts (67). Further, individual differences in relationships between these incentive effects and trait dimensions relevant to psychopathology have been noted (68). It will be important for future work to determine whether the retention effects seen in the current study may be modulated by incentive context in healthy populations, and if such modulation may vary with aspects of psychopathology
Limitations
Our sample size was modest. Future work will be needed to replicate the current findings in a larger sample. Second, the SANS was used to assess negative symptoms. Newer measures are available that better reflect the field’s current conceptualization of negative symptoms (69,70). We were unable to use such measures as the data presented in the current study were collected prior to the development of these measures. Third, the current RL paradigm did not include a non-reinforced, “neutral”, learning pair. Thus, we are unable to fully account for the extent to which the current findings are representative of general retention difficulties versus difficulties unique to learning and retaining rewards. Fourth, all participants with schizophrenia were taking anti-psychotic medications at the time of study completion, which may have influenced choice behavior due to influence on dopamine systems.
Summary
Our results show that, although individuals with schizophrenia may initially learn about rewarding actions, such learning decays at a more rapid rate in patients with severe negative symptoms and may be more difficult to access to guide future decision-making and to motivate action selection. Further work is necessary to replicate these findings and examine the interaction between RL and memory processes to gain a better mechanistic account of reduced maintenance of value representations in people with schizophrenia.
Supplementary Material
Acknowledgements:
We would like to thank the study participants who generously gave their time to complete the study protocol. We also thank Sharon August, MA; Leeka Hubzin, MA; Jacqueline Kiwanuka, MBA, and Dhivya Pahwa, BA, contributed to the study.
Financial Disclosure Statement:
This work was supported by grant R01 MH080066 from the National Institute of Mental Health. JAW, JMG, and MJF report that they perform consulting for Hoffman La Roche. JMG has also consulted for Takeda and Lundbeck and receives royalty payments from the Brief Assessment of Cognition in Schizophrenia. JAW also consults for NCT Holdings. The current experiments were not related to any consulting activity. All authors report no biomedical financial interests or potential conflicts of interest.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References:
- 1.Bleuler E (1992): Dementia Praecox or the Group of Schizophrenias. Am J Psychiatry. 149: 1733–1734. [Google Scholar]
- 2.Milev P, Ho B-C, Arndt S, Andreasen NC (2005): Predictive Values of Neurocognition and Negative Symptoms on Functional Outcome in Schizophrenia: A Longitudinal First-Episode Study With 7-Year Follow-Up. Am J Psychiatry. 162: 495–506. [DOI] [PubMed] [Google Scholar]
- 3.Kring AM, Moran EK (2008): Emotional Response Deficits in Schizophrenia: Insights From Affective Science. Schizophr Bull. 34: 819–834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Cohen AS, Minor KS (2010): Emotional Experience in Patients With Schizophrenia Revisited: Meta-analysis of Laboratory Studies. Schizophr Bull. 36: 143–150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gard DE, Gard MG, Kring AM, John OP (2006): Anticipatory and consummatory components of the experience of pleasure: A scale development study. J Res Pers. 40: 1086–1102. [Google Scholar]
- 6.Kring AM, Siegel EH, Barrett LF (2014): Unseen Affective Faces Influence Person Perception Judgments in Schizophrenia. Clin Psychol Sci. 2: 443–454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Moran EK, Kring AM (2018): Anticipatory Emotion in Schizophrenia. Clin Psychol Sci a J Assoc Psychol Sci. 6: 63–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wynn JK, Horan WP, Kring AM, Simons RF, Green MF (2010): Impaired anticipatory event-related potentials in schizophrenia. Int J Psychophysiol. 77: 141–149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Barch DM, Dowd EC (2010): Goal Representations and Motivational Drive in Schizophrenia: The Role of Prefrontal-Striatal Interactions. Schizophr Bull. 36: 919–934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gold JM, Waltz JA, Prentice KJ, Morris SE, Heerey EA (2008): Reward Processing in Schizophrenia: A Deficit in the Representation of Value. Schizophr Bull. 34: 835–847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Deserno L, Boehme R, Heinz A, Schlagenhauf F (2013): Reinforcement Learning and Dopamine in Schizophrenia: Dimensions of Symptoms or Specific Features of a Disease Group? Front Psychiatry. 4: 172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kring AM, Barch DM (2014): The motivation and pleasure dimension of negative symptoms: Neural substrates and behavioral outputs. Eur Neuropsychopharmacol. 24: 725–736. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sutton RS, Barto AG (1999): Book Reviews Reinforcement Learning. . Retrieved February 17, 2020, from https://s3.amazonaws.com/academia.edu.documents/6985553/ivry_rev.pdf?response-content-disposition=inline%3Bfilename%3DReinforcementJearning.pdf&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAIWOWYYGZ2Y53UL3A%2F20200218%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20200218T023104Z&X-Amz-Expires=3600&X-Amz-SignedHeaders=host&X-Amz-Signature=d7805ad5f5df2f13e616da293d9d752b1747a7db1729f3c283435db12c658088. [Google Scholar]
- 14.Schultz W, Dayan P, Montague PR (1997): A neural substrate of prediction and reward. Science. 275: 1593–9. [DOI] [PubMed] [Google Scholar]
- 15.Barch DM, Carter CS, Gold JM, Johnson SL, Kring AM, MacDonald AW, et al. (2017): Explicit and implicit reinforcement learning across the psychosis spectrum. J Abnorm Psychol. 126: 694–711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Collins AGE, Brown JK, Gold JM, Waltz JA, Frank MJ (2014): Working memory contributions to reinforcement learning impairments in schizophrenia. J Neurosci. 34: 13747–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Culbreth AJ, Gold JM, Cools R, Barch DM (2016): Impaired Activation in Cognitive Control Regions Predicts Reversal Learning in Schizophrenia. Schizophr Bull. 42: 484–493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Culbreth AJ, Westbrook A, Daw ND, Botvinick M, Barch DM (2016): Reduced model-based decision-making in schizophrenia. J Abnorm Psychol. 125: 777–787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Dowd EC, Barch DM (2012): Pavlovian reward prediction and receipt in schizophrenia: relationship to anhedonia. PLoS One. 7: e35622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dowd EC, Frank MJ, Collins A, Gold JM, Barch DM (2016): Probabilistic Reinforcement Learning in Patients With Schizophrenia: Relationships to Anhedonia and Avolition. Biol Psychiatry Cogn Neurosci Neuroimaging. 1: 460–473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gold JM, Waltz JA, Matveeva TM, Kasanova Z, Strauss GP, Herbener ES, et al. (2012): Negative symptoms and the failure to represent the expected reward value of actions: Behavioral and computational modeling evidence. Arch Gen Psychiatry. 69: 129–138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Gradin VB, Kumar P, Waiter G, Ahearn T, Stickle C, Milders M, et al. (2011): Expected value and prediction error abnormalities in depression and schizophrenia. Brain. 134: 1751–1764. [DOI] [PubMed] [Google Scholar]
- 23.Heerey EA, Bell-Warren KR, Gold JM (2008): Decision-Making Impairments in the Context of Intact Reward Sensitivity in Schizophrenia. Biol Psychiatry. 64: 62–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hernaus D, Gold JM, Waltz JA, Frank MJ (2018): Impaired Expected Value Computations Coupled With Overreliance on Stimulus-Response Learning in Schizophrenia. Biol Psychiatry Cogn Neurosci Neuroimaging. 3: 916–926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kéri S, Kelemen O, Szekeres G, Bagóczky N, Erdélyi R, Antal A, et al. (2000): Schizophrenics know more than they can tell: Probabilistic classification learning in schizophrenia. Psychol Med. 30: 149–155. [DOI] [PubMed] [Google Scholar]
- 26.Koch K, Schachtzabel C, Wagner G, Schikora J, Schultz C, Reichenbach JR, et al. (2010): Altered activation in association with reward-related trial-and-error learning in patients with schizophrenia. Neuroimage. 50: 223–232. [DOI] [PubMed] [Google Scholar]
- 27.Murray GK, Corlett PR, Clark L, Pessiglione M, Blackwell AD, Honey G, et al. (2008): Substantia nigra/ventral tegmental reward prediction error disruption in psychosis. Mol Psychiatry. 13: 267–276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Schlagenhauf F, Huys QJM, Deserno L, Rapp MA, Beck A, Heinze H-J, et al. (2014): Striatal dysfunction during reversal learning in unmedicated schizophrenia patients. Neuroimage. 89: 171–180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Strauss GP, Frank MJ, Waltz JA, Kasanova Z, Herbener ES, Gold JM (2011): Deficits in Positive Reinforcement Learning and Uncertainty-Driven Exploration Are Associated with Distinct Aspects of Negative Symptoms in Schizophrenia. Biol Psychiatry. 69: 424–431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Waltz JA, Frank MJ, Wiecki TV., Gold JM (2011): Altered probabilistic learning and response biases in schizophrenia: Behavioral evidence and neurocomputational modeling. Neuropsychology. 25: 86–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Waltz JA, Gold JM (2007): Probabilistic reversal learning impairments in schizophrenia: Further evidence of orbitofrontal dysfunction. Schizophr Res. 93: 296–303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Waltz JA, Kasanova Z, Ross TJ, Salmeron BJ, McMahon RP, Gold JM, Stein EA (2013): The roles of reward, default, and executive control networks in set-shifting impairments in schizophrenia. PLoS One. 8: e57257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Hernaus D, Frank MJ, Brown EC, Brown JK, Gold JM, Waltz JA (2019): Impaired Expected Value Computations in Schizophrenia Are Associated With a Reduced Ability to Integrate Reward Probability and Magnitude of Recent Outcomes. Biol Psychiatry Cogn Neurosci Neuroimaging. 4: 280–290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Cheng GLF, Tang JCY, Li FWS, Lau EYY, Lee TMC (2012): Schizophrenia and risk-taking: Impaired reward but preserved punishment processing. Schizophr Res. 136: 122–127. [DOI] [PubMed] [Google Scholar]
- 35.Reinen J, Smith EE, Insel C, Kribs R, Shohamy D, Wager TD, Jarskog LF (2014): Patients with schizophrenia are impaired when learning in the context of pursuing rewards. Schizophr Res. 152: 309–310. [DOI] [PubMed] [Google Scholar]
- 36.Waltz JA, Frank MJ, Robinson BM, Gold JM (2007): Selective Reinforcement Learning Deficits in Schizophrenia Support Predictions from Computational Models of Striatal-Cortical Dysfunction. Biol Psychiatry. 62: 756–764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Cicero DC, Martin EA, Becker TM, Kerns JG (2014): Reinforcement learning deficits in people with schizophrenia persist after extended trials. Psychiatry Res. 220: 760–764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Fervaha G, Agid O, Foussias G, Remington G (2013): Impairments in both reward and punishment guided reinforcement learning in schizophrenia. Schizophr Res. 150: 592–593. [DOI] [PubMed] [Google Scholar]
- 39.Moran EK, Culbreth AJ, Kandala S, Barch DM (2019): From neuroimaging to daily functioning: A multimethod analysis of reward anticipation in people with schizophrenia. J Abnorm Psychol. 128: 723–734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Grogan JP, Tsivos D, Smith L, Knight BE, Bogacz R, Whone A, Coulthard EJ (2017): Effects of dopamine on reinforcement learning and consolidation in Parkinson’s disease. Elife. 6. doi: 10.7554/eLife.26801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Herbener ES (2009): Impairment in Long-Term Retention of Preference Conditioning in Schizophrenia. Biol Psychiatry. 65: 1086–1090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wimmer GE, Li JK, Gorgolewski KJ, Poldrack RA (2018): Reward Learning over Weeks Versus Minutes Increases the Neural Representation of Value in the Human Brain. J Neurosci. 38: 7649–7666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Heerey EA, Bell-Warren KR, Gold JM (2008): Decision-making impairments in the context of intact reward sensitivity in schizophrenia. Biol Psychiatry. 64: 62–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.First M, Spitzer RL, Robert L, Gibbon M, Williams JBW (2005): Structured Clinical Interview for DSM-IV-TR Axis I Disorders-Patient Edition. New York: (SCIDI/P) New York: State Psychiatric Unit. [Google Scholar]
- 45.First MB, Gibbon M, Spitzer R, Williams J, Benjamin L (1997): Structured Clinical Interview for DSM-IV Axis II Personality Disorders (SCID-II). Am Psychiatr Press Inc. . [Google Scholar]
- 46.Andreasen NC (1989): Scale for the Assessment of Negative Symptoms (SANS). Br J Psychiatry. 155: 53–58. [PubMed] [Google Scholar]
- 47.Vadhan NP, Serper MR, Harvey PD, Chou JCY, Cancro R (2001): Convergent validity and neuropsychological correlates of the schedule for the assessment of negative symptoms (SANS) attention subscale. J Nerv Ment Dis. 189: 637–641. [DOI] [PubMed] [Google Scholar]
- 48.Overall JE, Gorham DR (1962): The Brief Psychiatric Rating Scale. Psychol Rep. 10: 799–812. [Google Scholar]
- 49.Nuechterlein KH, Green MF (2006): MATRICS Consensus Cognitive Battery Manual. Los Angeles, MATRICS Assess. . [Google Scholar]
- 50.Abe M, Schambra H, Wassermann EM, Luckenbaugh D, Schweighofer N, Cohen LG (2011): Reward Improves Long-Term Retention of a Motor Memory through Induction of Offline Memory Gains. Curr Biol. 21: 557–562. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Dail TK, Christina RW (2004): Distribution of Practice and Metacognition in Learning and Long-Term Retention of a Discrete Motor Task. Res Q Exerc Sport. 75: 148–155. [DOI] [PubMed] [Google Scholar]
- 52.Doyon J, Penhune V, Ungerleider LG (2003): Distinct contribution of the cortico-striatal and cortico-cerebellar systems to motor skill learning. Neuropsychologia. 41: 252–262. [DOI] [PubMed] [Google Scholar]
- 53.Barch DM, Carter CS, Gold JM, Johnson SL, Kring AM, MacDonald AW, et al. (2017): Explicit and implicit reinforcement learning across the psychosis spectrum. J Abnorm Psychol. 126: 694–711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Herbener ES (2008, July 21): Emotional memory in schizophrenia. Schizophr Bull. 34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Calev A, Edelist S (1993): Affect and Memory in Schizophrenia: Negative Emotion Words Are Forgotten Less Rapidly than Other Words by Long-Hospitalized Schizophrenics. Psychopathology. 26: 229–235. [DOI] [PubMed] [Google Scholar]
- 56.Hall J, Harris JM, McKirdy JW, Johnstone EC, Lawrie SM (2007): Emotional memory in schizophrenia. Neuropsychologia. 45: 1152–1159. [DOI] [PubMed] [Google Scholar]
- 57.Herbener ES, Rosen C, Khine T, Sweeney JA (2007): Failure of positive but not negative emotional valence to enhance memory in schizophrenia. J Abnorm Psychol. 116: 43–55. [DOI] [PubMed] [Google Scholar]
- 58.Neumann A, Blairy S, Lecompte D, Philippot P (2007): Specificity deficit in the recollection of emotional memories in schizophrenia. Conscious Cogn. 16: 469–484. [DOI] [PubMed] [Google Scholar]
- 59.Danion J-M, Kazes M, Huron C, Karchouni N (2003): Do Patients With Schizophrenia Consciously Recollect Emotional Events Better Than Neutral Events? Am J Psychiatry. 160: 1879–1881. [DOI] [PubMed] [Google Scholar]
- 60.Horan WP, Green MF, Kring AM, Nuechterlein KH (2006): Does Anhedonia in Schizophrenia Reflect Faulty Memory for Subjectively Experienced Emotions? and Veterans Affairs Greater Los Angeles Healthcare System. . doi: 10.1037/0021-843X.115.3.496. [DOI] [PubMed] [Google Scholar]
- 61.Koh SD, Grinker RR, Marusarz TZ, Forman PL (1981): Affective Memory and Schizophrenic Anhedonia. Schizophr Bull. 7: 292–307. [DOI] [PubMed] [Google Scholar]
- 62.Koh SD, Kayton L, Peterson RA (1976): Affective encoding and consequent remembering in schizophrenic young adults. J Abnorm Psychol. 85: 156–166. [DOI] [PubMed] [Google Scholar]
- 63.Jang AI, Nassar MR, Dillon DG, Frank MJ (2019): Positive reward prediction errors during decision-making strengthen memory encoding. Nat Hum Behav. 3: 719–732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Gerraty RT, Davidow JY, Wimmer GE, Kahn I, Shohamy D (2014): Transfer of learning relates to intrinsic connectivity between hippocampus, ventromedial prefrontal cortex, and large-scale networks. J Neurosci. 34: 11297–303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Shadlen MN, Shohamy D (2016): Decision Making and Sequential Sampling from Memory. Neuron. 90: 927–939. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Wimmer GE, Braun EK, Daw ND, Shohamy D (2014): Episodic memory encoding interferes with reward learning and decreases striatal prediction errors. J Neurosci. 34: 14901–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Kool W, Gershman SJ, Cushman FA (2017): Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems. Psychol Sci. 28: 1321–1333. [DOI] [PubMed] [Google Scholar]
- 68.Patzelt EH, Kool W, Millner AJ, Gershman SJ (2019): Incentives Boost Model-Based Control Across a Range of Severity on Several Psychiatric Constructs. Biol Psychiatry. 85: 425–433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Kirkpatrick B, Strauss GP, Nguyen L, Fischer BA, Daniel DG, Cienfuegos A, Marder SR (2011): The Brief Negative Symptom Scale: Psychometric Properties. Schizophr Bull. 37: 300–305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Kring AM, Gur RE, Blanchard JJ, Horan WP, Reise SP (2013): The Clinical Assessment Interview for Negative Symptoms (CAINS): Final Development and Validation. Am J Psychiatry. 170: 165–172. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.