Skip to main content
PLOS One logoLink to PLOS One
. 2021 Dec 10;16(12):e0260444. doi: 10.1371/journal.pone.0260444

Investigation of reward learning and feedback sensitivity in non-clinical participants with a history of early life stress

Matthew Paul Wilkinson 1, Chloe Louise Slaney 1, Jack Robert Mellor 1, Emma Susan Jane Robinson 1,*
Editor: Sarah Whittle2
PMCID: PMC8664195  PMID: 34890390

Abstract

Early life stress (ELS) is an important risk factor for the development of depression. Impairments in reward learning and feedback sensitivity are suggested to be an intermediate phenotype in depression aetiology therefore we hypothesised that healthy adults with a history of ELS would exhibit reward processing deficits independent of any current depressive symptoms. We recruited 64 adults with high levels of ELS and no diagnosis of a current mental health disorder and 65 controls. Participants completed the probabilistic reversal learning task and probabilistic reward task followed by depression, anhedonia, social status, and stress scales. Participants with high levels of ELS showed decreased positive feedback sensitivity in the probabilistic reversal learning task compared to controls. High ELS participants also trended towards possessing a decreased model-free learning rate. This was coupled with a decreased learning ability in the acquisition phase of block 1 following the practice session. Neither group showed a reward induced response bias in the probabilistic reward task however high ELS participants exhibited decreased stimuli discrimination. Overall, these data suggest that healthy participants without a current mental health diagnosis but with high levels of ELS show deficits in positive feedback sensitivity and reward learning in the probabilistic reversal learning task that are distinct from depressed patients. These deficits may be relevant to increased depression vulnerability.

1. Introduction

Early life stress (ELS) is a major risk factor in the development of depression [14]. ELS has also been found to lower the threshold of stress required to precipitate depression [5], a key major trigger in healthy populations [6]. Elevated levels of childhood stress lead to widespread functional and morphological alterations in the adult brain with the hippocampus, amygdala and prefrontal cortex being most impacted [7,8]. Amongst other functions, these regions are vital mediators of reward learning: the ability of reward in the environment to modulate future behaviour [916]. However, how ELS influences the developing brain to predispose individuals to psychiatric illness is not yet understood [7,17,18].

Reward learning deficits have been proposed to be an intermediate phenotype in the aetiology and maintenance of depression [1922]. While many tasks have been used to probe aspects of reward learning in ELS, few of these have been used in depressed patientso [23]. Two tasks that have been used in depressed patients are the probabilistic reward task, which measures reward learning (PRT [24]) and the probabilistic reversal learning task, which measures reward learning alongside both feedback sensitivity and cognitive flexibility (PRLT [25]). Depressed patients have a reduced bias towards the more highly rewarded stimulus in the PRT, a test of reward learning where participants respond to ambiguous stimuli that are unequally rewarded [21]. Deficits in reward learning measured in the PRT have been observed to both predict the risk of disease development [26] and persistence [20,27] therefore demonstrating a link between depression vulnerability and performance in this task. Another widely used task is the PRLT where participants are presented two stimuli that are probabilistically and unequally rewarded. Following learning of the underlying rule the contingencies switch and depressed patients show impaired accuracy and increased sensitivity to probabilistic negative feedback following the rule change [28,29]. The PRLT can therefore be used to measure parameters relating to reward learning, feedback sensitivity and cognitive flexibility [30,31]. Interestingly in the context of ELS, acute stress has also been observed to impair reward learning in the PRT [32,33] suggesting a potential link between stress, reward processing deficits and depression aetiology. Results from these tasks therefore suggest that reward learning impairments could be a relevant intermediate phenotype in the development of depression and that ELS could have a contributary role.

Previous studies have therefore investigated reward processing deficits in people who have experienced ELS. Hanson and colleagues [34] recruited adolescents with a history of physical abuse who then completed a probabilistic learning task where they showed lower associative learning compared to controls. Changes in reward learning have also been reported within another probabilistic reward task, the probabilistic stimulus selection task (PSST). Women with a history of childhood sexual abuse and a diagnosis of Major Depressive disorder (MDD) showed decreased performance on trials requiring learning of previously rewarded information compared to MDD only and control groups [35]. While there is relatively little literature regarding reward learning in ELS there is much more known about the effects of ELS upon cognitive flexibility and other aspects of reward processing [23,36,37]. Neuroimaging studies have shown that participants with ELS show reduced striatal reward anticipation [38] while results from other behavioural tasks have shown that ELS is associated with reduced reward responsiveness [39] and differences in exploitation strategy [37,40]. Finally, of relevance to the PRLT, studies have observed reduced cognitive flexibility in children that have suffered ELS [36,41] although in tasks other than the PRLT.

Although these studies provide valuable insights, they use different tasks to those previously used to study depressed populations making direct comparisons difficult. Additionally, utilisation of tasks such as the PRT and PRLT, that have been successfully translated for rodent use [16,31,42], means that any insights from these tasks in humans can be used translationally to probe the mechanistic underpinnings. Finally, many of these studies have been carried out in adolescents where the brain is still developing as opposed to mature adults. Studies are therefore needed in adults without a current mental health diagnosis to understand if any reward processing changes are present in individuals at higher risk of mental health disorders.

In this study it was aimed to investigate the relationship between ELS and reward processing using translational tasks that have previously been used in both depressed populations and rodents allowing greater comparability of results. We hypothesised that ELS is associated with impairments in reward processing and feedback sensitivity in an otherwise healthy adult population. Due to the exploratory nature of this study we were not able to make more specific hypotheses. In order to address this hypothesis we split our investigation into three questions: Firstly, does ELS effect reward learning parameters, secondly, are these changes due to depression symptomology and finally, does ELS interact with either acute or lifetime stress to influence reward learning parameters? Two groups of adult participants that self-reported no current diagnosis of a mental health condition or Parkinson’s disease were recruited online and completed a survey of adverse childhood experiences [43] before being split into high and no ELS groups based upon this. Participants completed both the PRT (points based) and PRLT to enable comparisons between the different elements of reward learning measured in the tasks. The PRLT data was additionally analysed using a Q-learning model to probe reward learning parameter changes. Participants were asked about stress exposure to enable exploratory analysis investigating if life stress interacts with ELS to cause reward processing deficits. Participants with a history of ELS showed evidence for decreased positive feedback sensitivity in the PRLT, however neither high ELS nor control participants showed a response bias in the points based PRT. As this was the first reported attempt to utilise the PRT in an online environment we recruited an additional population of control participants using a modified task design using direct compensation to validate that the PRT can be successfully performed online.

2. Methods

All procedures were approved by the Faculty of Life Sciences and Faculty of Science Research Ethics Committee at the University of Bristol and were performed in accordance with the declaration of Helsinki in addition to all other institutional and national guidelines. The study protocol was pre-registered (www.osf.io/538yk). All participants provided full written consent for the collection, analysis and publication of their data which is available open access (www.ofs.io/63e8j) and were reimbursed at a rate of £6.00 per hour.

2.1 Participants

In order to recruit the final study population a total of 586 participants were recruited using the Prolific (www.prolific.co) online platform to complete an online screening questionnaire (see S1 Fig for study overview and Table 1 for participant demographics). These participants were 25–65 years of age, fluent in English, resident in the UK and had no mild cognitive impairments or dementia. Participants completed the early life stress questionnaire [43] (ELSQ) which asks if participants had prior exposure to specific adverse childhood experiences (ACEs). Participants were also asked “Have you got a current diagnosis of a mental health disorder or Parkinson’s disease?”.

Table 1. Demographic and self-report measures in the study population.

Measure No ELS (n = 65) High ELS (n = 64) Test statistic p
Gender (% male) 44.6 37.5 χ2(2) = 2.5 0.28
Age (years) 37.3 ± 1.30 38.0 ± 1.24 U = 1936.0 0.50
Education (% graduates) 64.6 65.6 χ2(5) = 4.9 0.43
Ethnicity (% white) 95.4 82.8 χ2(4) = 8.7 0.070
Relationship status (% single) 18.5 28.1 χ2(3) = 1.9 0.60
Employment status (% full time) 64.6 60.9 χ2(5) = 3.5 0.61
Monetary concerns (% agree/strongly agree) 36.9 56.3 χ2(3) = 4.4 0.22
ELSQ 0 ± 0 4.36 ± 0.17 - -
Social status 6.2 ± 0.17 5.2 ± 0.21 U = 1397.5 0.001
BDI-II 9.4 ± 1.0 15.2 ± 1.22 U = 1315.5 0.0003
SHAPS 1.4 ± 0.25 2.56 ± 0.32 U = 1496.5 0.004
SHAPS-C 24.3 ± 0.67 26.4 ± 0.86 t119.4 = -1.92 0.057
Lifetime stress 472.8 ± 22.4 529.2 ± 23.9 t127 = -1.72 0.088
Last year stress 111.4 ± 12.3 139.8 ± 17.0 U = 1939.5 0.51

Values are shown for each group as mean ± standard error with significant p values (p ≤ 0.05) indicated in bold.

Participants who met the inclusion criteria for high ELS or no ELS and did not report a diagnosis of a mental health disorder or Parkinson’s were then invited to take part in the main experiment within a week of screening and were allocated into two groups. A no ELS group (n = 65) contained people reporting no ACEs on the ELSQ while a high ELS group (n = 64) consisted of those who reported ≥3 ACEs (estimated to be the top tercile of the population [43]). In this second phase of the experiment participants entered demographic information before completing the MacArthur Scale of Subjective Social Status [44], Beck’s depression inventory II [45] (BDI-II), the Snaith Hamilton pleasure scale [46] (SHAPS) and the Holmes and Rahe stress scale [47]. The SHAPS was additionally scored using the SHAPS-C criteria [48]. For the stress scale participants were asked if each event occurred in either their adult life or the last year to provide a measure of both total adult lifetime stress and recent stress. For all stages of the experiment participants were instructed to use a desktop or laptop only and that they should be in a quiet place with minimal distractions. Sample size was estimated for a medium effect size (Cohen’s d = 0.5) and 80% power for a t-test at 64 participants per group. While other studies have investigated different dimensions of ELS (i.e. emotional abuse vs psychosocial neglect) with regards to cognitive outcome [8] the present study was not powered to enable this.

Early life stress was highly prevalent in the study population with only 21.0% of participants having no adverse childhood experiences (ACEs) and 44.4% of the population suffering three or more ACEs in their childhood (see S2 Fig). 16.0% of respondents self-reported a diagnosis of a mental health disorder or Parkinson’s with this being associated with a higher ELSQ score (Mann-Whitney, U = 15725, p < 0.0001).

The two study groups were well matched with respect to gender, age, education, ethnicity, relationship status, employment status and the presence of monetary worries (see Table 1). However, high ELS participants had a self-reported lower social status coupled with higher depression scores in the BDI-II and elevated anhedonia scores in the SHAPS questionnaires. There was no evidence of a difference between groups when participants were asked about stress they encountered in both the last year and their adult lives. When the BDI-II scores were classified into either minimal, mild, moderate or severe depression (see S3 Fig [45]) participants from the high ELS group were more likely to be in greater severity depression groupings (chi2, χ2(3) = 12.9, p = 0.005). Similarly when SHAPS scores were classified into either normal (≤2) or abnormal (≥3) hedonic responses [46] members of the high ELS group were more likely to have abnormal scores (see S3 Fig, chi2, χ2(1) = 6.3, p = 0.012).

2.2 Behavioural testing

Following completion of self-report measures, participants completed the Probabilistic reversal learning task [30,49] followed by the Probabilistic reward task [24]. To complete the tasks participants were required to download and install the Millisecond Inquisit web player (Millisecond, US) which ran both tasks using Millisecond Inquisit v6.2.1. Participants were instructed they were able to earn an additional £2.00 for high performance on the behavioural tasks.

2.2.1 Probabilistic reversal learning task

The PRLT was conducted as previously described [30,49] using the task from the Millisecond test library [50]. Participants were instructed to choose between a “lucky” (rich) and “unlucky” (lean) pattern to maximise points. Selection of the rich stimulus enabled participants to gain a point 80% of the time and lose a point 20% of the time with the lean stimulus having opposite contingencies. If no stimulus was chosen within 2s then this was classified as incorrect and participants lost a point. Participants progressed in the task until meeting the reversal criterion, following this the contingencies reversed such that the rich stimuli became lean and vice versa. This criterion was set randomly between 10 to 15 consecutive correct rich choices to stop participants counting to the criterion. Participants first completed a practice phase where they had to achieve the criterion for a single reversal before proceeding to the main task which was completed in three blocks each limited to 9 minutes where participants could reverse as many times as able within a block. Participants who did not pass the practice phase were excluded from analysis.

Data were analysed as previously described [51]. Win-stay probability was defined as the probability that if a participant was rewarded for selecting a stimulus they would select the same stimulus for the next trial. Lose-shift probability was conversely the probability that if a participant lost a point at a stimulus they would switch to the opposite stimulus for the next trial. This enabled win-stay and lose-shift probabilities to be used as measures of positive and negative feedback sensitivity respectively. These were subdivided into either true feedback which matches with the underlying task rules (e.g. being rewarded for selecting the rich stimulus), or misleading feedback (e.g. being rewarded for selecting the lean stimulus) which opposes the underlying task rule. The number of rule changes, how many times participants were able to meet criterion for a rule change, accuracy and response latency per block were additionally analysed. A Qlearn reinforcement learning model was applied to data as previously described [51,52]. The model enables assessment of the ability of participants to integrate feedback over time (learning rate parameter) and the degree of choice variability that participants have when selecting stimuli (beta, low values represent more random choices while high values show a deterministic choice strategy). Additionally it is possible to use the model to generate a perfect strategy given the same inputs as the participants and then compare participant performance to this as a measure of overall reward learning ability to maximise reward (subjective accuracy). Additionally, data per phase (practice, acquisition of the first rule in block 1 and the following two reversals) was analysed consisting of participant accuracy, errors to criterion and win-stay/lose-shift probability.

2.2.2 Probabilistic reward task

The PRT was conducted as previously described [24] using the task from the Millisecond test library [53]. Participants were instructed to identify whether the mouth of a presented cartoon face was long or short (≈11% difference in mouth length) to win points over 3 blocks of 100 trials. Participants were shown a face before a mouth was rapidly presented for 100ms with participants given up to 1750ms to respond. Feedback was not provided on every trial but unknown to participants one mouth was rewarded with points three times more often than the other (rich = 60%, lean = 20%). Response key and rich/lean stimuli assignments were counterbalanced across participants and responses that were quicker than 150ms or slower than 1750ms were excluded from analysis. Additional responses that differed by more than 3 standard deviations from the mean following natural log transformation of latencies for each participant were excluded from analysis. Response bias (logB), a measure of reward learning, and discriminability (logD), a measure of task difficulty, were calculated as:

logB=Richcorrect×LeanincorrectRichincorrect×Leancorrect (1)
lodD=Richcorrect×LeancorrectRichincorrect×Leanincorrect (2)

2.2.3 Directly rewarded probabilistic reward task

Because neither group in the points rewarded PRT showed a response bias we recruited an additional cohort of 81 participants to test the hypothesis that if the reward modality was restored back to the direct monetary reward as used in laboratory tasks the PRT could be successfully carried out online. Details of this additional study are available in S1 File.

2.3 Data analysis

Demographic and self-report measures were compared between groups using either χ2, t-tests or Mann-Whitney U tests where appropriate. Analysis of the main study was structured into three layers starting with a primary analysis for each measure directly comparing between no ELS and high ELS groups. Where data were not normally distributed then efforts were first made to transform data to normality and where this was not possible Mann-Whitney U tests were completed. Normality was assessed by visual inspection of the QQ-plot alongside use of the Shapiro-Wilk test. Win-stay by block data were transformed using the bestNormalize package in R [54]. Where measures were split by a within subject factor such as block or feedback type these were analysed with repeated measures ANOVAs. Where Mauchly’s test identified a violation of the Sphericity assumption then this was corrected using the Huynh-Feldt correction. Two sample unpaired t-tests were used for direct group comparisons.

The second stage of analysis aimed to assess the effect of ELS upon a parameter while controlling for the effects of depression symptomology. Due to differences in social status, BDI-II score and SHAPS score between the no ELS and high ELS groups, principal component analysis (PCA) was conducted to reduce the dimensionality of these variables (see S1 and S2 Tables). Because only principal component 1 (PC1) differed between groups and explained 94.6% of variance this was used in ANCOVAs (analysis of covariance) to analyse whether parameter changes were due to ELS or due to changes in depression symptomology accounted for by the PC1 component.

Finally, an exploratory analysis stage was used to understand if stress and gender interacted with ELS to modify reward learning. For this stage generalised linear mixed models (GLMMs) were used due to the number of factors in the analysis. GLMM models were built containing the factors: gender, ELS, lifetime stress, last year stress and age. Models were fit using the glmmTMB package in R 4.0 [55,56] with model refinement conducted utilising stepwise deletion based upon Akaike information criterion before being compared with a null model to protect against overfitting. PC1 was also added to each model following final model selection to assess the effects of depression symptomology. To ensure that any differences between primary/secondary and exploratory analyses were not due to differences in statistical approach, analysis was also repeated using GLMMs with the results being consistent between the two approaches (see S5 Table).

Statistical analysis was conducted in SPSS v26 (IBM, US), MATLAB 2018a (Mathworks, USA) and R 4.0 [55] with output graphics constructed in GraphPad Prism 8 (GraphPad, US). For all analysis α was set at 0.05. All data is shown as mean ± SE with a bar and stars showing evidence of a main effect of ELS in the primary analysis. * ≤ 0.05, ** < 0.01, *** < 0.001, **** < 0.0001.

3. Results

3.1 Probabilistic reversal learning task

In the overall reward learning measures of the number of rule changes that participants were able to complete (Fig 1A) and accuracy (Fig 1B) there was no evidence for a difference between no and high ELS groups. However, participants with a history of high ELS did have a slower average response latency (Fig 1C, RM-ANOVA, F1,126 = 5.03, p = 0.027) with both groups getting equally faster over the course of the three blocks (RM-ANOVA, F1.88,236.7 = 16.1, p < 0.0001). In secondary analysis, designed to probe if parameter changes were due to ELS or underlying depression symptomology, there was no evidence of an effect of depression symptomology (RM-ANCOVA, PCA1: p > 0.05) with the main effect of ELS persisting (RM-ANCOVA, ELS: F1,125 = 4.9, p = 0.028). Exploratory analysis intended to probe if a parameter change was mediated by an interaction between ELS and stress measures was conducted on overall reaction times. This did not replicate a main effect of group but did observe older participants having slower reaction times (GLMM, Z = 2.8, p = 0.005). This analysis also indicated weak evidence of an interaction between group and lifetime stress (GLMM, Z = 1.55, p = 0.065) but further investigation did not reveal an effect of lifetime stress in either group.

Fig 1. Overall reward learning and reinforcement learning in the PRLT.

Fig 1

(A) Rule changes within each block, (B) accuracy by block and (C) average response latency per block. From the Q-learning reinforcement learning model: (D) learning rate, (E) β, the inverse of the softmax temperature and a measure of choice variability and (F) subjective accuracy, participant accuracy compared to a model predicted perfect strategy.

When data were analysed using the Q-learning reinforcement learning model a trend emerged towards high ELS participants having a lower learning rate compared to the no ELS study population (Fig 1D, t-test, t127 = 1.78, p = 0.077). Secondary analysis revealed no effect of PCA component 1 upon learning rate but removed any evidence for an effect of ELS. In exploratory analysis a main effect of ELS was observed (GLMM, Z = 2.1, p = 0.037) with the addition of PC1 impairing model fit (ΔAIC = 1.69, χ2(1) = 0.31, p = 0.57). Additionally, a relationship between stress in the last year and learning rate was observed whereby increased stress in the last year decreased learning rate (GLMM, Z = -2.3, p = 0.024). There was no difference in choice variability (Fig 1E) or accuracy compared to a model predicted perfect strategy (Fig 1F) between groups.

Participants with a history of high ELS exhibited reduced positive feedback sensitivity (PFS, Fig 2A, RM-ANOVA, F1,122 = 10.4, p = 0.002) which persisted once depression symptomology was accounted for using PCA component 1 (RM-ANOVA, F1,121 = 6.6, p = 0.01). Exploratory analysis revealed an interaction between ELS and both lifetime stress (GLMM, Z = -2.15, p = 0.031) and last year stress (GLMM, Z = -1.99, p = 0.047). Further investigation revealed effects of both stress types upon PFS in the low ELS group only (GLMM, lifetime stress: Z = -2.35, p = 0.019, last year stress: Z = -2.2, p = 0.026). In this low ELS group higher lifetime stress was associated with greater PFS but higher stress in the last year was associated with decreased PFS. However it should be noted that although all suggested terms were removed from the model the overall model was a poorer fit than the null when measured by AIC (Δ AIC = 7.3, χ2(13) = 18.7, p = 0.13).

Fig 2. High ELS participants exhibited lower positive feedback sensitivity than those without a history of ELS.

Fig 2

Win-stay probability overall (A) and subdivided into true and misleading feedback (B). Overall Lose-shift probability (C) and additionally subdivided into true and misleading feedback (D).

The effect of ELS upon PFS was consistent across feedback that matched (true feedback) or clashed (misleading feedback) with the underlying task rules (Fig 2B, Mann-Whitney U, true: U = 1443, p = 0.03; misleading: U = 1337, p = 0.005). This effect appeared to be constrained to PFS with no corresponding changes in lose-shift probability between no ELS and high ELS groups (Fig 2C and 2D).

When initial learning in the PRLT task was assessed, it was apparent that although ELS and control participants performed similarly during the practice phase there was a learning deficit during acquisition of the first reversal criterion in block 1 as evidenced by increased errors to criterion (Fig 3A, Mann-Whitney U, U = 1580, p = 0.045) and decreased accuracy (Fig 3B, Mann-Whitney U, U = 1584, p = 0.036). Both groups of participants however performed equally well at achieving criterion for a second and third reversal. Unlike the overall measures there was no difference in win-stay probability between groups (Fig 3C), however there was a trend for high ELS participants to show increased negative feedback sensitivity (NFS) in the practice phase (Fig 3D, Mann-Whitney U, U = 1532, p = 0.052).

Fig 3. High ELS participants show impaired learning in the acquisition phase of block 1.

Fig 3

(A) Errors made while reaching criterion for each phase, (B) accuracy within each phase, (C and D) win-stay and lose-shift probabilities for each phase of block 1 and practice respectively.

3.2 Probabilistic reward task

There was no evidence that participants developed a response bias towards the more highly rewarded stimulus in any block (Fig 4A) nor was there evidence for a response bias developing between blocks (Fig 4B). However, participants with a history of high ELS did show an impaired ability to discriminate between stimuli (Fig 4C, ANOVA, F1,127 = 4.8, p = 0.030). Secondary analysis revealed that this difference between groups appeared to be driven by differences in depression symptomology with the effect of ELS disappearing when PCA component 1 was included in the analysis (ANCOVA, PCA1: F1,126 = 6.08, p = 0.015; ELS: F1,126 = 1.7, p = 0.19). Exploratory analysis further revealed a main effect of lifetime stress with higher lifetime stress corresponding to increased discrimination ability (GLMM, Z = 2.6, p = 0.007). An effect of gender was also revealed (GLMM, Z = 2.04, p = 0.04) with males showing increased discrimination ability. Finally, there was no difference between groups in response latencies (Fig 4D) nor was there an effect of stimulus upon response latency (Fig 4E).

Fig 4. Participants with a history of ELS show decreased discriminability in the PRT.

Fig 4

(A) Response bias to the more highly rewarded stimulus, (B) response bias development between blocks, (C) discriminability between long and short face lengths, (D) average response latency split by block and (E) response latency split by stimulus type.

Consistent with Pizzagalli and colleagues [21] the probability of misclassifying a stimulus based upon the preceding trial outcome was also analysed (S3 Table). Participants with a history of high levels of ELS were more likely to misclassify rich stimuli if either the previous trial was a not rewarded rich trial or a lean not rewarded trial with these measures roughly corresponding with rich lose-shift and lean lose-stay probability in the PRLT respectively.

4. Discussion

This study was designed to investigate whether healthy adults without a current psychiatric illness but with a history of ELS show alterations in reward processing and feedback sensitivity. To this end nearly 600 participants were screened with ELS being highly prevalent in the population with 79.0% of participants experiencing one or more ACE and 44.4% experiencing three or more.

Participants with a history of high ELS had higher self-report depression and anhedonia symptoms. Although participants stated they did not have a diagnosis of depression, 54.7% of high ELS and 26.2% of no ELS participants showed at least mild symptoms based upon the BDI-II questionnaire. BDI-II scores in no ELS (mean: 9.4 ± 1.0) and high ELS (mean: 15.2 ± 1.2) participants were higher than controls for similar studies [21,35,57] (range 1.3–3.62) but lower than depressed patients [21] (mean: 32.1 ± 8.6) or participants described as exhibiting a high BDI [57] (>16). These data are consistent with a large societal burden of un-diagnosed depression [5860]. It should be noted that the present study was undertaken during the Covid-19 global pandemic with it being estimated that levels of depression had doubled during this period [61]. The high number of participants reporting mild to severe symptoms of depression is a major limitation of this study as depression and anhedonia are well known to reduce reward learning in both the PRLT [29] and PRT [20,21,24]. It is also worth considering that 75% of adults with mental health conditions experience the onset of symptoms before aged 24 [62]. This means that the study population, all 25 years of age or greater, is potentially biased towards those more protected from mental health disorders. In keeping with large bodies of previous literature this study used retrospective reporting however it is worth noting the difficulties in correlating retrospective and prospective measures of ELS [63] suggesting that these might encompass different populations with different mechanistic links between ELS and depression vulnerability. Studies have also provided interesting insights when ELS is split into different modalities [64,65], however the present study was not powered enough to merit investigating this.

4.1 Probabilistic reversal learning task

In the PRLT participants with high ELS displayed decreased positive feedback sensitivity compared to controls as measured by win-stay probability. This finding was independent of depression symptomology and specific to PFS with no changes observed in lose-shift probability. This contrasts to depressed patients performing the PRLT who have been reported to show no change in positive feedback sensitivity [28,29]. Blunted striatal responses to reward in participants with a history of ELS have been previously reported [66,67] which speculatively may underlie the decreased PFS observed in the present study. Consistent with the present study, women with MDD and a history of childhood sexual abuse have also been found to have impaired performance in the PSST but only for trials requiring use of previously rewarded information and not those requiring use of previously punished information [35]. Within the PRLT depressed patients have been observed to show increased sensitivity to misleading negative feedback [28,29]. This was not observed in the high ELS cohort in the present study. In other tasks depressed patients have also been reported to show increased NFS alongside attenuated PFS [25,6872]. These findings suggest that ELS is associated with changes in feedback sensitivity in the PRLT differently to depression with ELS decreasing PFS but not effecting NFS while depression has an opposite effect. It should however be noted that participants with ELS were more likely to have lifetime substance abuse [73] which may confound the results. Substance abuse is known to induce changes in feedback sensitivity and reward learning in addicts [74].

The PRLT also allows for assessment of reinforcement learning through the analysis of rule changes and accuracy in addition to parameters calculated through use of the reinforcement learning model. In contrast with our hypothesis, there was no evidence that ELS affected rule changes which is surprising considering evidence that both depression and ELS can impair cognitive flexibility [41,75]. The task in this study utilised performance-based reversals compared to previous ones using a fixed reversal interval therefore comparisons with depressed patients are challenging. However Murphy and colleagues did report that depressed patients made more errors following reversal [29]. Although rule changes were used as the main behavioural reward learning output, when data were analysed with the Q-learning model a trend towards decreased learning rate was observed in high ELS participants. This became significant in exploratory analysis and decreased associative learning has been previously observed in juveniles previously exposed to physical abuse [34]. There is a lack of consistent evidence in depression studies as to whether model free learning rate differs between patients and controls [76,77]. However, a recent study in a PRLT did report that depressed patients did show a decreased learning rate with this decreased learning rate being a better predictor of depression scores than simple task outputs. [72]. These findings warrant future investigation due to this study being only powered to detect group differences between two groups meaning that ANCOVA and exploratory analysis is likely to be underpowered.

A slower response latency was also observed in high ELS participants which was specific to the PRLT with no congruent changes seen in the PRT. This discrepancy may be related to differing cognitive demands with the PRLT potentially requiring greater working memory. Changes in response latency have not previously been observed in depressed patients carrying out the PRLT [28,29].

No directly comparable studies have been carried out in humans. However, maternally separated marmosets, an animal model of ELS, showed no change in simple visual discrimination compared to controls but showed impairments when the contingencies reversed [78]. This is similar to that seen in both depressed and bipolar patients in the human PRLT [29,79] who acquire the initial rule successfully but then are impaired following reversal. This compares to ELS participants in the present study who performed equally well in the practice phase and reversal phases but showed a deficit in acquisition of the first rule in block 1. This suggests a potential impairment in the ability to generalise the task rules between the practice and acquisition phase. However previous probabilistic learning studies did not include a practice phase meaning that this likely changed the way participants processed the start of block 1. This might explain the contrast with Pechtel and Pizzagalli, 2013 who reported that women with remitted MDD and ELS learnt acquisition in the PSST at the same rate as controls [35].

Exploratory analysis aimed to investigate if stress in adult life would modulate the relationship between reward processing deficits and ELS. There was little evidence for this except for an observed interaction between PFS and ELS whereby stress only influenced PFS in participants without a history of ELS. Higher lifetime stress led to greater PFS but higher stress in the last year was associated with decreased win-stay probability. There are few previous studies investigating similar constructs but Berghorst and colleagues reported that after stress induction those who had higher cortisol reactivity and self-reported negative affect had lower reward but not punishment sensitivity [33]. Additionally, it is worth noting that due to the relatively poor model fit for this exploratory analysis that these findings should be taken as preliminary due to the risk of data overfitting.

4.2 Probabilistic reward task

In contrast to previous studies employing the PRT neither ELS nor control groups showed a response bias toward the more highly rewarded stimulus [21,24] suggesting a general failure of all participants to modulate their responses as a function of reward. This lack of response bias in the main study potentially indicates that the reward information was not salient enough therefore participants focussed upon correctly discriminating between mouth lengths. This makes comparison to previous literature challenging however it is worth noting that depressed patients have been reported to have a decreased response bias with no change in discrimination ability [20,21]. There are no previously published studies carrying out the PRT online. The online testing could have been one reason for the lack of response bias and likely leads to the high variability seen in the data as it is not possible to ensure that participants are completing the task in as controlled an environment as would be possible by laboratory testing.

However, another possibility for the lack of response bias seen was a key difference between laboratory and online versions of the task. While all other aspects of the task were similar, participants in the online task were informed that high performance would lead to a bonus payment with the actual reward in the task being points. Previous studies instead used direct monetary compensation in the task [24]. When the second population of control participants was tested in the PRT using direct monetary compensation a response bias was seen in blocks 1 and 3, however there did not appear to be evidence for this bias strengthening over time like previously observed [24]. There was also robust evidence for participants responding more quickly to the rich than lean stimulus as also has been reported [24,27]. While this difference in compensatory mechanism may underly the difference in control population performance in the two implementations of the task it could also be because the direct reward population had lower BDI and SHAPS scores. However, when the direct reward experiment was re-analysed with no BDI and SHAPS cut-offs such that it more closely approximated the control population in the main study the results were much the same as with the cut-offs (S5 Fig). These data therefore suggest that it is possible to successfully implement the PRT in an online setting using the directly rewarded task with the availability of reliable online psychological tasks being key under current circumstances.

While difficult to interpret for previously discussed reasons, participants with high levels of ELS did show impairments in discrimination, a measure of task difficulty. This however appeared to be driven by changes in depression symptomology as opposed to ELS specifically.

4.3 Conclusions

These data suggest that participants who do not self-report a diagnosis of a mental health condition but do have a history of ELS show impairments in positive feedback sensitivity and reward learning in the PRLT compared to controls. These impairments may be important in understanding how ELS predisposes to depression with reduced reward learning being a key feature in MDD patients [22]. However, high levels of potentially undiagnosed depression are a potential confound and highlight a potential wider issue in terms of the number of people who meet criteria for MDD but are not formally diagnosed or receiving care. Future studies are needed to replicate these findings, investigate the neural circuit changes underlying these reward learning impairments and investigate whether these findings are directly related to psychiatric risk.

Supporting information

S1 Fig. Study overview.

Participants were screened by ELSQ score and then formed into two study groups: no ELS and high ELS.

(PNG)

S2 Fig. Early life stress in an online study population.

(A) ELSQ scores in the study population. (B) Mental health disorder/Parkinson’s self-report diagnosis by ELSQ score (Mann-Whitney, U = 15725, p < 0.0001). N = 586 participants. (C) ELSQ scores in the study population split by modality of adverse childhood experience.

(TIF)

S3 Fig. Interpretation of BDI-II and SHAPS scores in the no and high ELS populations.

Scores were interpreted following Beck et al., 1996 and Snaith et al., 1995. (A) BDI-II split by severity of depression (chi2, χ2(3) = 12.9, p = 0.005) and (B) SHAPS split by normal or abnormal hedonic responses (chi2, χ2(1) = 6.3, p = 0.012). N = 129 participants (65 no ELS, 64 high ELS).

(TIF)

S4 Fig. Direct monetary reward in the PRT using a control population led to a reward induced bias.

(A) While no overall effect of block was observed, a response bias was observed in blocks 1 and 3 (Wilcoxon signed ranks test, block 1: W = 1087.5, p = 0.001, block 3: W = 916.5, p = 0.038). (B) There was little evidence for response bias strengthening across blocks. Discriminability (C) and response latency (D) did not appear to change over the course of a session. (E) Participants were faster to respond to the rich stimulus than lean (Wilcoxon matched pairs signed ranks test, W = 814, p = 0.0007). N = 56 participants.

(TIF)

S5 Fig. Direct monetary reward in the PRT using a control population led to a reward induced bias (no participants excluded due to BDI or SHAPS scores).

(A) While no overall effect of block was observed, a response bias was observed in blocks 1 and 3 (Wilcoxon signed ranks test, block 1: W = 1174, p = 0.003, block 3: W = 1105, p = 0.004). (B) Response bias strengthened between blocks 3 and 2 (Wilcoxon signed ranks test, W = 873, p = 0.040). Discriminability (C) and response latency (D) did not appear to change over the course of a session. (E) Participants were faster to respond to the rich stimulus than lean (Wilcoxon matched pairs signed ranks test, W = 1372, p < 0.0001). N = 81 participants.

(TIF)

S1 Table. Principal component analysis of social scale, SHAPS and BDI-II scores.

The mean ± standard error are shown for each group with the relevant statistical comparison.

(DOCX)

S2 Table. Principal component analysis component loadings.

(DOCX)

S3 Table. Miss-rates, the chance of mis-categorising a stimulus, by previous trial.

Data is shown as mean ± standard error and significant p-values are shown in bold.

(DOCX)

S4 Table. Demographic and self-report measures in the directly rewarded PRT control population.

Values are shown for each group as mean ± standard error where appropriate.

(DOCX)

S5 Table. Comparison of ANOVA/ANCOVA analytical approach used as main analysis with results obtained using generalised linear mixed models.

Measures by task phase were analysed as multiple Mann-Whitney tests due to non-parametricity of data. #1: Acquisition, Z = -1.38, p = 0.17, First reversal, Z = -2.0, p = 0.045, #2: Acquisition, Z = -1.44, p = 0.15, First reversal, Z = -2.1, p = 0.036, #3: Acquisition, Z = -1.41, p = 0.16, First reversal, Z = -1.2, p = 0.24, #4: Acquisition, Z = -1.95, p = 0.05, First reversal, Z = -0.92, p = 0.36.

(DOCX)

S1 File. Supplementary methods for the directly rewarded PRT in a control population.

(DOCX)

Acknowledgments

The authors would like to thank Michelle Taylor for advice on statistical analysis.

Data Availability

All data files are available on the open science framework platform (10.17605/OSF.IO/63E8J).

Funding Statement

This work was primarily funded by the BBSRC SWBio DTP PhD programme (www.bbsrc.ukri.org, grant numbers: BB/J014400/1 and BB/M009122/1) awarded to MPW. Additional support was also provided from the Wellcome Trust Neural Dynamics PhD studentship (www.wellcome.org, grant number: 108899/B/15/Z) awarded to CLS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Green JG, McLaughlin KA, Berglund PA, Gruber MJ, Sampson NA, Zaslavsky AM, et al. Childhood adversities and adult psychopathology in the National Comorbidity Survey Replication (NCS-R) I: Associations with first onset of DSM-IV disorders. Arch Gen Psychiatry. 2010;67: 113–123. doi: 10.1001/archgenpsychiatry.2009.186 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Agid O, Shapira B, Zislin J, Ritsner M, Hanin B, Murad H, et al. Environment and vulnerability to major psychiatric illness: a case control study of early parental loss in major depression, bipolar disorder and schizophrenia. Mol Psychiatry. 1999;4: 163. doi: 10.1038/sj.mp.4000473 [DOI] [PubMed] [Google Scholar]
  • 3.Lemoult J, Humphreys KL, Tracy A, Hoffmeister J, Ip E, Gotlib IH. Meta-Analysis: Exposure to Early Life Stress and Risk for Depression in Childhood and Adolescence. J Am Acad Child Adolesc Psychiatry. 2019. doi: 10.1016/j.jaac.2019.10.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.McCauley J, Kern DE, Kolodner K, Dill L, Schroeder AF, DeChant HK, et al. Clinical characteristics of women with a history of childhood abuse: unhealed wounds. Jama. 1997;277: 1362–8. doi: 10.1001/jama.277.17.1362 [DOI] [PubMed] [Google Scholar]
  • 5.Hammen C, Henry R, Daley SE. Depression and sensitization to stressors among young women as a function of childhood adversity. J Consult Clin Psychol. 2000;68: 782–787. doi: 10.1037//0022-006X.68.5.782 [DOI] [PubMed] [Google Scholar]
  • 6.Kessler RC. The effects of stressful life events on depression. Annu Rev Psychol. 1997;48: 191–214. doi: 10.1146/annurev.psych.48.1.191 [DOI] [PubMed] [Google Scholar]
  • 7.Tottenham N. A review of adversity, the amygdala and the hippocampus: a consideration of developmental timing. Front Hum Neurosci. 2009;3: 68. doi: 10.3389/neuro.09.068.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Cohodes EM, Kitt ER, Baskin-Sommers A, Gee DG. Influences of early-life stress on frontolimbic circuitry: Harnessing a dimensional approach to elucidate the effects of heterogeneity in stress exposure. Dev Psychobiol. 2020; 1–20. doi: 10.1002/dev.21969 [DOI] [PubMed] [Google Scholar]
  • 9.Kennerley SW, Walton ME. Decision making and reward in frontal cortex: complementary evidence from neurophysiological and neuropsychological studies. Behav Neurosci. 2011;125: 297–317. doi: 10.1037/a0023575 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Rushworth MFS, Noonan MP, Boorman ED, Walton ME, Behrens TE. Frontal Cortex and Reward-Guided Learning and Decision-Making. Neuron. 2011;70: 1054–1069. doi: 10.1016/j.neuron.2011.05.014 [DOI] [PubMed] [Google Scholar]
  • 11.Palombo DJ, Hayes SM, Reid AG, Verfaellie M. Hippocampal contributions to value-based learning: Converging evidence from fMRI and amnesia. Cogn Affect Behav Neurosci. 2019;19: 523–536. doi: 10.3758/s13415-018-00687-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dickerson KC, Delgado MR. Contributions of the hippocampus to feedback learning. Cogn Affect Behav Neurosci. 2015;15: 861–877. doi: 10.3758/s13415-015-0364-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Okatan M. Correlates of reward-predictive value in learning-related hippocampal neural activity. Hippocampus. 2009;19: 487–506. doi: 10.1002/hipo.20535 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Baxter MG, Murray EA. The amygdala and reward. Nat Rev Neurosci. 2002;3: 563–573. doi: 10.1038/nrn875 [DOI] [PubMed] [Google Scholar]
  • 15.Wassum KM, Izquierdo A. The basolateral amygdala in reward learning and addiction. Neurosci Biobehav Rev. 2015/09/02. 2015;57: 271–283. doi: 10.1016/j.neubiorev.2015.08.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Der-Avakian A, Barnes S, Markou A, Pizzagalli D. Translational Assessment of Reward and Motivational Deficits in Psychiatric Disorders. Curr Top Behav Neurosci. 2016;28: 231–262. doi: 10.1007/7854_2015_5004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Fogelman N, Canli T. Early Life Stress, Physiology, and Genetics: A Review. Front Psychol. 2019;10: 1668. Available: https://www.frontiersin.org/article/10.3389/fpsyg.2019.01668. doi: 10.3389/fpsyg.2019.01668 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Smith KE, Pollak SD. Early life stress and development: potential mechanisms for adverse outcomes. J Neurodev Disord. 2020;12: 34. doi: 10.1186/s11689-020-09337-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Whitton AE, Treadway MT, Pizzagalli DA. Reward processing dysfunction in major depression, bipolar disorder and schizophrenia. Curr Opin Psychiatry. 2015;28: 7–12. doi: 10.1097/YCO.0000000000000122 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Vrieze E, Pizzagalli DA, Demyttenaere K, Hompes T, Sienaert P, De Boer P, et al. Reduced reward learning predicts outcome in major depressive disorder. Biol Psychiatry. 2013;73: 639–645. doi: 10.1016/j.biopsych.2012.10.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Pizzagalli DA, Losifescu D, Hallet LA, Ratner KG, Fava M. Reduced Hedonic Capacity in Major Depressive Disorder: Evidence from a Probabilistic Reward Task. J Psychiatr Res. 2008;43: 76–87. doi: 10.1016/j.jpsychires.2008.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Halahakoon DC, Kieslich K, O’Driscoll C, Nair A, Lewis G, Roiser JP. Reward Processing Behavior in Depressed Participants Relative to Healthy Volunteers: A Systematic Review and Meta-analysis. JAMA Psychiatry. 2020. doi: 10.1001/jamapsychiatry.2020.2139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Herzberg MP, Gunnar MR. Early life stress and brain function: Activity and connectivity associated with processing emotion and reward. Neuroimage. 2020;209: 116493. doi: 10.1016/j.neuroimage.2019.116493 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Pizzagalli DA, Jahn AL, O’Shea JP. Toward an objective characterization of an anhedonic phenotype: A signal-detection approach. Biol Psychiatry. 2005;57: 319–327. doi: 10.1016/j.biopsych.2004.11.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Elliott R, Sahakian BJ, Herrod JJ, Robbins TW, Paykel ES. Abnormal response to negative feedback in unipolar depression: evidence for a diagnosis specific impairment. J Neurol Neurosurg Psychiatry. 1997;63: 74–82. doi: 10.1136/jnnp.63.1.74 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Bress JN, Foti D, Kotov R, Klein DN, Hajcak G. Blunted neural response to rewards prospectively predicts depression in adolescent girls. Psychophysiology. 2013;50: 74–81. doi: 10.1111/j.1469-8986.2012.01485.x [DOI] [PubMed] [Google Scholar]
  • 27.Pechtel P, Dutra SJ, Goetz EL, Pizzagalli DA. Blunted reward responsiveness in remitted depression. J Psychiatr Res. 2013;47: 1864–1869. doi: 10.1016/j.jpsychires.2013.08.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Taylor Tavares J V., Clark L, Furey ML, Williams GB, Sahakian BJ, Drevets WC. Neural basis of abnormal response to negative feedback in unmedicated mood disorders. Neuroimage. 2008;42: 1118–1126. doi: 10.1016/j.neuroimage.2008.05.049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Murphy FC, Michael A, Robbins TW, Sahakian BJ. Neuropsychological impairment in patients with major depressive disorder: the effects of feedback on task performance. Psychol Med. 2003;33: 455–467. doi: 10.1017/s0033291702007018 [DOI] [PubMed] [Google Scholar]
  • 30.Cools R, Clark L, Owen AM, Robbins TW. Defining the Neural Mechanisms of Probabilistic Reversal Learning Using Event-Related Functional Magnetic Resonance Imaging. J Neurosci. 2002;22: 4563–4567. doi: 20026435 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Bari A, Theobald DE, Caprioli D, Mar AC, Aidoo-Micah A, Dalley JW, et al. Serotonin modulates sensitivity to reward and negative feedback in a probabilistic reversal learning task in rats. Neuropsychopharmacology. 2010;35: 1290–1301. doi: 10.1038/npp.2009.233 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Bogdan R, Pizzagalli DA. Acute Stress Reduces Reward Responsiveness: Implications for Depression. Biol Psychiatry. 2006;60: 1147–1154. doi: 10.1016/j.biopsych.2006.03.037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Berghorst LH, Bogdan R, Frank MJ, Pizzagalli DA. Acute stress selectively reduces reward sensitivity. Front Hum Neurosci. 2013;7: 1–15. doi: 10.3389/fnhum.2013.00001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Hanson JL, van den Bos W, Roeber BJ, Rudolph KD, Davidson RJ, Pollak SD. Early adversity and learning: implications for typical and atypical behavioral development. J Child Psychol Psychiatry Allied Discip. 2017;58: 770–778. doi: 10.1111/jcpp.12694 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Pechtel P, Pizzagalli DA. Disrupted reinforcement learning and maladaptive behavior in women with a history of childhood sexual abuse: A high-density event-related potential study. JAMA Psychiatry. 2013;70: 499–507. doi: 10.1001/jamapsychiatry.2013.728 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Harms MB, Shannon Bowen KE, Hanson JL, Pollak SD. Instrumental learning and cognitive flexibility processes are impaired in children exposed to early life stress. Dev Sci. 2018;21: e12596. doi: 10.1111/desc.12596 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Letkiewicz AM, Silton RL, Mimnaugh KJ, Miller GA, Heller W, Fisher J, et al. Childhood abuse history and attention bias in adults. Psychophysiology. 2020;57: e13627. doi: 10.1111/psyp.13627 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Mehta MA, Gore-Langton E, Golembo N, Colvert E, Williams SCR, Sonuga-Barke E. Hyporesponsive reward anticipation in the basal ganglia following severe institutional deprivation early in life. J Cogn Neurosci. 2010;22: 2316–2325. doi: 10.1162/jocn.2009.21394 [DOI] [PubMed] [Google Scholar]
  • 39.Sheridan MA, McLaughlin KA, Winter W, Fox N, Zeanah C, Nelson CA. Early deprivation disruption of associative learning is a developmental pathway to depression and social problems. Nat Commun. 2018;9: 2216. doi: 10.1038/s41467-018-04381-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Humphreys KL, Lee SS, Telzer EH, Gabard-Durnam LJ, Goff B, Flannery J, et al. Exploration—exploitation strategy is dependent on early experience. Dev Psychobiol. 2015;57: 313–321. doi: 10.1002/dev.21293 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Zhou X, Meng Y, Schmitt HS, Montag C, Kendrick KM, Becker B. Cognitive flexibility mediates the association between early life stress and habitual behavior. Pers Individ Dif. 2020;167: 110231. 10.1016/j.paid.2020.110231. [DOI] [Google Scholar]
  • 42.Kangas BD, Wooldridge LM, Luc OT, Bergman J, Pizzagalli DA. Empirical validation of a touchscreen probabilistic reward task in rats. Transl Psychiatry. 2020;10. doi: 10.1038/s41398-020-00969-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Cohen RA, Hitsman BL, Paul RH, McCaffery J, Stroud L, Sweet L, et al. Early life stress and adult emotional experience: An international perspective. Int J Psychiatry Med. 2006;36: 35–52. doi: 10.2190/5R62-9PQY-0NEL-TLPA [DOI] [PubMed] [Google Scholar]
  • 44.Adler NE, Epel ES, Castellazzo G, Ickovics JR. Relationship of subjective and objective social status with psychological and physiological functioning: preliminary data in healthy white women. Heal Psychol Off J Div Heal Psychol Am Psychol Assoc. 2000;19: 586–592. doi: 10.1037//0278-6133.19.6.586 [DOI] [PubMed] [Google Scholar]
  • 45.Beck A, Steer R, Brown G. Beck Depression Inventory. Second Edition. San Antonio, TX; 1996.
  • 46.Snaith RP, Hamilton M, Morley S, Humayan A, Hargreaves D, Trigwell P. A scale for the assessment of hedonic tone the Snaith-Hamilton Pleasure Scale. Br J Psychiatry. 1995;167: 99–103. doi: 10.1192/bjp.167.1.99 [DOI] [PubMed] [Google Scholar]
  • 47.Holmes TH, Rahe RH. The social readjustment rating scale. J Psychosom Res. 1967;11: 213–218. doi: 10.1016/0022-3999(67)90010-4 [DOI] [PubMed] [Google Scholar]
  • 48.Ameli R, Luckenbaugh DA, Gould NF, Holmes MK, Lally N, Ballard ED, et al. SHAPS-C: the Snaith-Hamilton pleasure scale modified for clinician administration. PeerJ. 2014;2: e429–e429. doi: 10.7717/peerj.429 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Waegeman A, Declerck CH, Boone C, Seurinck R, Parizel PM. Individual differences in behavioral flexibility in a probabilistic reversal learning task: An fMRI study. J Neurosci Psychol Econ. 2014;7: 203–218. doi: 10.1037/npe0000026 [DOI] [Google Scholar]
  • 50.Millisecond. Probabilistic Reversal Learning Task. 2020. Available: www.millisecond.com.
  • 51.Wilkinson MP, Grogan JP, Mellor JR, Robinson ESJ. Comparison of conventional and rapid-acting antidepressants in a rodent probabilistic reversal learning task. Brain Neurosci Adv. 2020;4: 1–11. doi: 10.1177/2398212820907177 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Grogan JP, Tsivos D, Smith L, Knight BE, Bogacz R, Whone A, et al. Effects of dopamine on reinforcement learning and consolidation in Parkinson ‘ s disease. Elife. 2017;6: 1–23. doi: 10.7554/eLife.26801 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Millisecond. Probabilistic Reward Task. 2020. Available: www.millisecond.com.
  • 54.Peterson RA, Cavanaugh JE. Ordered quantile normalization: a semiparametric transformation built for the cross-validation era. J Appl Stat. 2019; 1–16. doi: 10.1080/02664763.2019.1630372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2020. Available: https://www.r-project.org/. [Google Scholar]
  • 56.Brooks ME, Kristensen K, van Benthem KJ, Magnusson A, Berg CW, Nielsen A, et al. glmmTMB balances speed and flexibility among packages for zero-inflated generalized linear mixed modeling. R J. 2017;9: 378–400. doi: 10.32614/rj-2017-066 [DOI] [Google Scholar]
  • 57.Pizzagalli DA, Bogdan R, Ratner KG, Jahn AL. Increased Perceived Stress is Associated with Blunted Hedonic Capacity: Potential Implications for Depression Research. Behav Res Ther. 2007;45: 2742–2753. doi: 10.1016/j.brat.2007.07.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Li C, Ford ES, Zhao G, Ahluwalia IB, Pearson WS, Mokdad AH. Prevalence and correlates of undiagnosed depression among U.S. adults with diabetes: The Behavioral Risk Factor Surveillance System, 2006. Diabetes Res Clin Pract. 2009;83: 268–279. doi: 10.1016/j.diabres.2008.11.006 [DOI] [PubMed] [Google Scholar]
  • 59.Lotfaliany M, Bowe SJ, Kowal P, Orellana L, Berk M, Mohebbi M. Depression and chronic diseases: Co-occurrence and communality of risk factors. J Affect Disord. 2018;241: 461–468. doi: 10.1016/j.jad.2018.08.011 [DOI] [PubMed] [Google Scholar]
  • 60.Lewis K, Marrie RA, Bernstein CN, Graff LA, Patten SB, Sareen J, et al. The Prevalence and Risk Factors of Undiagnosed Depression and Anxiety Disorders Among Patients With Inflammatory Bowel Disease. Inflamm Bowel Dis. 2019;25: 1674–1680. doi: 10.1093/ibd/izz045 [DOI] [PubMed] [Google Scholar]
  • 61.Office for National Statistics. Coronavirus and depression in adults, Great Britain: June 2020. 2020. Available: https://www.ons.gov.uk/peoplepopulationandcommunity/wellbeing/articles/coronavirusanddepressioninadultsgreatbritain/june2020.
  • 62.Kessler RC, Berglund P, Demler O, Jin R, Merikangas KR, Walters EE. Lifetime Prevalence and Age-of-Onset Distributions of DSM-IV Disorders in the National Comorbidity Survey Replication. Arch Gen Psychiatry. 2005;62: 593–602. doi: 10.1001/archpsyc.62.6.593 [DOI] [PubMed] [Google Scholar]
  • 63.Baldwin JR, Reuben A, Newbury JB, Danese A. Agreement Between Prospective and Retrospective Measures of Childhood Maltreatment: A Systematic Review and Meta-analysis. JAMA Psychiatry. 2019;76: 584–593. doi: 10.1001/jamapsychiatry.2019.0097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Dennison MJ, Sheridan MA, Busso DS, Jenness JL, Peverill M, Rosen ML, et al. Neurobehavioral markers of resilience to depression amongst adolescents exposed to child abuse. Journal of Abnormal Psychology. McLaughlin, Katie A.: Department of Psychology, University of Washington, Box 351525, Seattle, WA, US, 98195, mclaughk@uw.edu: American Psychological Association; 2016. pp. 1201–1212. doi: 10.1037/abn0000215 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Pechtel P, Pizzagalli DA. Effects of early life stress on cognitive and affective function: An integrated review of human literature. Psychopharmacology (Berl). 2011;214: 55–70. doi: 10.1007/s00213-010-2009-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Hanson JL, Nacewicz BM, Sutterer MJ, Cayo AA, Schaefer SM, Rudolph KD, et al. Behavioral problems after early life stress: Contributions of the hippocampus and amygdala. Biol Psychiatry. 2015;77: 314–323. doi: 10.1016/j.biopsych.2014.04.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Hanson JL, Albert D, Iselin A-MR, Carré JM, Dodge KA, Hariri AR. Cumulative stress in childhood is associated with blunted reward-related brain activity in adulthood. Soc Cogn Affect Neurosci. 2016;11: 405–412. doi: 10.1093/scan/nsv124 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Foti D, Hajcak G. Depression and reduced sensitivity to non-rewards versus rewards: Evidence from event-related potentials. Biological. 2009;81: 1–8. doi: 10.1016/j.biopsycho.2008.12.004 [DOI] [PubMed] [Google Scholar]
  • 69.Webb CA, Auerbach RP, Bondy E, Stanton CH, Foti D, Pizzagalli DA. Abnormal neural responses to feedback in depressed adolescents. J Abnorm Psychol. 2017;126: 19–31. doi: 10.1037/abn0000228 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Mueller EM, Pechtel P, Cohen AL, Douglas SR, Pizzagalli DA. Potentiated processing of negative feedback in depression is attenuated by anhedonia. Depress Anxiety. 2015/01/23. 2015;32: 296–305. doi: 10.1002/da.22338 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Herzallah M, Moustafa A, Natsheh J, Abdellatif S, Taha M, Tayem Y, et al. Learning from negative feedback in patients with major depressive disorder is attenuated by SSRI antidepressants. Front Integr Neurosci. 2013;7: 67. doi: 10.3389/fnint.2013.00067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Mukherjee D, Filipowicz ALS, Vo K, Satterthwaite TD, Kable JW. Reward and punishment reversal-learning in major depressive disorder. J Abnorm Psychol. 2020;129: 810–823. doi: 10.1037/abn0000641 [DOI] [PubMed] [Google Scholar]
  • 73.Enoch M-A. The role of early life stress as a predictor for alcohol and drug dependence. Psychopharmacology (Berl). 2010/07/02. 2011;214: 17–31. doi: 10.1007/s00213-010-1916-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Volkow ND, Wang G-J, Fowler JS, Tomasi D, Telang F, Baler R. Addiction: decreased reward sensitivity and increased expectation sensitivity conspire to overwhelm the brain’s control circuit. Bioessays. 2010;32: 748–755. doi: 10.1002/bies.201000042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Murphy FC, Michael A, Sahakian BJ. Emotion modulates cognitive flexibility in patients with major depression. Psychol Med. 2012;42: 1373–1382. doi: 10.1017/S0033291711002418 [DOI] [PubMed] [Google Scholar]
  • 76.Chen C, Takahashi T, Nakagawa S, Inoue T, Kusumi I. Reinforcement learning in depression: A review of computational research. Neurosci Biobehav Rev. 2015;55: 247–267. doi: 10.1016/j.neubiorev.2015.05.005 [DOI] [PubMed] [Google Scholar]
  • 77.Robinson OJ, Chase HW. Learning and Choice in Mood Disorders: Searching for the Computational Parameters of Anhedonia. Comput psychiatry (Cambridge, Mass). 2017/12/29. 2017;1: 208–233. doi: 10.1162/CPSY_a_00009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Pryce CR, Dettling AC, Spengler M, Schnell CR, Feldon J. Deprivation of Parenting Disrupts Development of Homeostatic and Reward Systems in Marmoset Monkey Offspring. Biol Psychiatry. 2004;56: 72–79. doi: 10.1016/j.biopsych.2004.05.002 [DOI] [PubMed] [Google Scholar]
  • 79.Gorrindo T, Blair RJR, Budhani S, Dickstein DP, Pine DS, Leibenluft E. Deficits on a probabilistic response-reversal task in patients with pediatric bipolar disorder. Am J Psychiatry. 2005;162: 1975–1977. doi: 10.1176/appi.ajp.162.10.1975 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Alexandra Kavushansky

23 Jul 2021

PONE-D-21-19019

Investigation of reward learning and feedback sensitivity in non-clinical participants with a history of early life stress.

PLOS ONE

Dear Dr. Wilkinson,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Sep 05 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Alexandra Kavushansky, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following in the Acknowledgments Section of your manuscript:

“This work was primarily funded by the BBSRC SWBio DTP PhD programme (grant numbers: BB/J014400/1 and BB/M009122/1) awarded to MPW. Additional support was also provided from the Wellcome Trust Neural Dynamics PhD studentship (grant number: 108899/B/15/Z) awarded to CLS”

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

 “This work was primarily funded by the BBSRC SWBio DTP PhD programme (www.bbsrc.ukri.org, grant numbers: BB/J014400/1 and BB/M009122/1) awarded to MPW. Additional support was also provided from the Wellcome Trust Neural Dynamics PhD studentship (www.wellcome.org, grant number: 108899/B/15/Z) awarded to CLS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.”

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

3. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

4. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

Additional Editor Comments:

In addition to the issues raised by the reviewers, I'll like to ask to add the information about the test that was used to verify normal distribution of the data.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: No

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This manuscript describes an online study using two reinforcement learning tasks, comparing adult individuals reporting high levels of early-life adversity (ELA) and those who do not. While ELA is a risk factor in the development of depression, the current study excluded those reporting a psychiatric diagnosis. A motivation for the study was to investigate whether ELA may lead to a deficit reward processing that could be seen as a vulnerability to developing depression. A strength of the current study was the measurement of several demographic factors related to depression, stress, and socio-economic status that enabled the authors to use regression to examine the effects of ELA independently. One weakness that should be mentioned as a caveat is the fact that lifetime substance use is likely to have varied between the groups and could have contributed to the results.

The main finding was the demonstration of a reduced reward sensitivity (win-stay) behavior in participants in the probabilistic reversal learning task. This effect remained even when accounting for current levels of depression. There was no effect on reversal learning, and thus cognitive flexibility did not seem affected. There was a trend for a slower learning rate, but this was fairly subtle. Overall, the selectivity of the deficit in reward processing was thus interesting.

It appears that the implementation of the second task (probabilistic reward) did not yield interpretable results in the online format. The second control group showed that using actual monetary reward improved reward sensitivity in this online format, and thus it would be more informative to use this version in a future study examining ELA. As it stands, I feel that the data from this task do not add much as neither group showed reward sensitivity in this implementation.

Reviewer #2: The paper “Investigation of reward learning and feedback sensitivity in non-clinical participants with a history of early life stress” examines the relationships between early stressful event exposures and multiple facets of reward learning. While an important question with potentially interesting findings, the manuscript in its current form lacks the theoretical and methodological clarity necessary to interpret the presented results.

1. On a whole, the manuscript is difficult to read and follow. I found myself having to search for clarifying details, finding them in places you would not typically expect – i.e. methods in the introduction or discussion – or not all. The paper in its current form requires extensive re-writing and clarification before I can effectively evaluate the presented results. I have provided more concrete suggestions for this could be addressed below.

2. The introduction lacks strong theoretical justification for study. This is not because there is no reason to believe the question would be one of interest, but rather due to the current structure of the introduction.

a. The introduction sets the paper up as though it will be examining whether alterations in reward learning processes moderate the relationship between early life stress (ELS) and depression. However, this is not what is tested in the results – the primary analyses of interest focus on the relationship between ELS and learning parameters, while controlling for depressive symptoms. The introduction would benefit from a more extensive review of the existing evidence related to ELS and reward learning – an area that has grown rapidly over the last 10 years (for review see Herzberg & Gunnar, 2020) – and a more thorough explanation of how the current manuscript addresses an outstanding question within that literature.

Herzberg & Gunnar (2020). Early life stress and brain function: Activity and Connectivity associated with processing emotion and reward. Neuroimage, 209: 116493.

b. The introduction uses a lot of jargon without clarifying what these terms refer to and why they may be of interest. For example, authors discuss the effects of depression on “the probabilistic reward learning” and “the probabilistic reversal learning task” on page 3, but provide little information on what the specific tasks referred are testing or why the findings are of interest. It’s important to provide this information to help justify the current study, especially as many variations of probabilistic reward learning tasks have been used in the literature examining early life stress and reward learning.

c. The authors should provide more specific hypotheses than just ELS will be associated with altered reward processing and feedback sensitivity. There is enough existing literature to allow for concrete predictions. The justification for each hypothesized effect should also be made concrete.

d. Relatedly, despite the fact the authors utilize a probabilistic reversal learning task, they include no discussion of existing research examining reversal learning/cognitive flexibility and ELS. Research finding ELS is associated with less cognitive flexibility during reversal learning (see Harms et al, 2017) and different learning strategies (see Letkiewicz et al, 2020) is directly relevant to the questions being asked here.

Harms et al (2017). Instrumental learning and cognitive flexibility processes are impaired in children exposed to early life stress. Developmental Science, 21(4): e12596.

Letkiwicz et al (in press). Frontoparietal network activity during model-based reinforcement learning updates is reduced among adolescents with sever sexual abuse. Journal of Psychiatric Science.

3. The Methods are also unclear and difficult to follow making it hard to interpret the presented results.

a. It is not clear why the authors used repeated measures ANCOVA for some of the analyses and GLMMs for other. GLMMs have no issue handling repeated measures data. The authors should justify why they used the different approaches for different outcomes.

b. It is unclear what the follow up study is addressing. In the introduction, it is stated that it was “due to the failure of both groups to meet the primary endpoint” and then in the methods and discussion it is stated this is because neither group showed a response bias towards the high reward option. Why the follow up study is helpful needs to be made clearer and more explicit. This information should reside in the Methods, not interspersed throughout the Introduction, Methods, and Discussion. Additionally, I would recommend moving the follow up study in its entirety to the Supplemental Materials as it doesn’t appear to be directly relevant to the primary question of interest.

c. It would be helpful to provide more information about what the model parameters from the Q learning model represent. Readers familiar with computational learning models will know this, but more naieve readers may not. This can be as simple as explaining the learning rate represents speed of feedback integration and will also help readers better interpret the results.

4. The results should be re-written in line with the introduction to make it clear how the different analyses speak to the hypotheses of interest. As currently written, the findings seem somewhat spurious and it is unclear how meaningful they are.

More minor comments

1. Are the effects similar if you run the analyses treating ELS as continuous rather than dichotomous?

2. The manuscript should be thoroughly read for typos and grammatical errors.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Barbara Knowlton

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Dec 10;16(12):e0260444. doi: 10.1371/journal.pone.0260444.r002

Author response to Decision Letter 0


3 Sep 2021

We appreciate the time taken in reviewing this manuscript. Please find a detailed response to every comment raised in the included cover letter.

Decision Letter 1

Sarah Whittle

20 Sep 2021

PONE-D-21-19019R1Investigation of reward learning and feedback sensitivity in non-clinical participants with a history of early life stress.PLOS ONE

Dear Dr. Wilkinson,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Nov 04 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Sarah Whittle

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: No

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: No

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: Thank you to the authors for their detailed responses and revisions. The revisions have improved the manuscript, particularly in regards to clarifying the motivation for the study and the methods. However, there are still a few issues that could be addressed to improve the work.

1. I appreciate the authors’ rationale for their use of the different analytical techniques for the different analyses (ANOVA, ANCOVA, GLMM). However, I still have concerns about the use of the different techniques – in particular, it makes it difficult to determine whether differences in the effects of ELS in the ANOVA/ANCOVA and GLMM analyses are due to the incorporation of additional predictors in the GLMM or the different analytical techniques. Could the authors re-run all analyses using GLMMs to help clarify this? As long as the results are comparable, I do not think these need to be reported in the main manuscript, but they would be helpful as Supplemental for readers with similar concerns.

2. While the revisions have improved the readability of the manuscript, there are still areas where the language is unclear making it difficult to follow. I advise the authors read through the manuscript carefully to improve readability and flow of the text. Some specific examples of where this would be useful are outlined below:

a. The abstract could be revised for clarity. In particular, the second sentence with the hypothesis is confusing unless you’ve read the paper already – the way it is written makes it sound like the hypothesis is contradictory to current literature (i.e. it’s stated that depression is associated with reward deficits but then the hypothesis is that ELS will be associated with reward deficits but depression will not). Additionally, given the study the modified PRT in control participants has been moved to the supplemental, it’s not clear this is necessary in the abstract.

b. I recommend explicitly stating the goal of the study before introducing the hypothesis in the last paragraph of the introduction. The goal of assessing the relationship between ELS and reward processing using tasks that can be directly compared with those that have been used in the literature on depression is an important one and it will help readers to have that made clear before reading the methods and results. Additionally, it might help to outline the three sub questions (Does ELS affect reward learning parameters, is this due to depression, and does ELS interact with acute stress) here to orient the readers to how the results will be conveyed.

c. Could you speak to how the results tie to the work that has been done using these tasks in patients with depression in the discussion as that is one of the motivations for the study outlined in the introduction?

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Dec 10;16(12):e0260444. doi: 10.1371/journal.pone.0260444.r004

Author response to Decision Letter 1


4 Nov 2021

Please see attached cover letter providing detailed responses to each comment by the reviewers.

Decision Letter 2

Sarah Whittle

10 Nov 2021

Investigation of reward learning and feedback sensitivity in non-clinical participants with a history of early life stress.

PONE-D-21-19019R2

Dear Dr. Wilkinson,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Sarah Whittle

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Acceptance letter

Sarah Whittle

3 Dec 2021

PONE-D-21-19019R2

Investigation of reward learning and feedback sensitivity in non-clinical participants with a history of early life stress.

Dear Dr. Wilkinson:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Sarah Whittle

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Study overview.

    Participants were screened by ELSQ score and then formed into two study groups: no ELS and high ELS.

    (PNG)

    S2 Fig. Early life stress in an online study population.

    (A) ELSQ scores in the study population. (B) Mental health disorder/Parkinson’s self-report diagnosis by ELSQ score (Mann-Whitney, U = 15725, p < 0.0001). N = 586 participants. (C) ELSQ scores in the study population split by modality of adverse childhood experience.

    (TIF)

    S3 Fig. Interpretation of BDI-II and SHAPS scores in the no and high ELS populations.

    Scores were interpreted following Beck et al., 1996 and Snaith et al., 1995. (A) BDI-II split by severity of depression (chi2, χ2(3) = 12.9, p = 0.005) and (B) SHAPS split by normal or abnormal hedonic responses (chi2, χ2(1) = 6.3, p = 0.012). N = 129 participants (65 no ELS, 64 high ELS).

    (TIF)

    S4 Fig. Direct monetary reward in the PRT using a control population led to a reward induced bias.

    (A) While no overall effect of block was observed, a response bias was observed in blocks 1 and 3 (Wilcoxon signed ranks test, block 1: W = 1087.5, p = 0.001, block 3: W = 916.5, p = 0.038). (B) There was little evidence for response bias strengthening across blocks. Discriminability (C) and response latency (D) did not appear to change over the course of a session. (E) Participants were faster to respond to the rich stimulus than lean (Wilcoxon matched pairs signed ranks test, W = 814, p = 0.0007). N = 56 participants.

    (TIF)

    S5 Fig. Direct monetary reward in the PRT using a control population led to a reward induced bias (no participants excluded due to BDI or SHAPS scores).

    (A) While no overall effect of block was observed, a response bias was observed in blocks 1 and 3 (Wilcoxon signed ranks test, block 1: W = 1174, p = 0.003, block 3: W = 1105, p = 0.004). (B) Response bias strengthened between blocks 3 and 2 (Wilcoxon signed ranks test, W = 873, p = 0.040). Discriminability (C) and response latency (D) did not appear to change over the course of a session. (E) Participants were faster to respond to the rich stimulus than lean (Wilcoxon matched pairs signed ranks test, W = 1372, p < 0.0001). N = 81 participants.

    (TIF)

    S1 Table. Principal component analysis of social scale, SHAPS and BDI-II scores.

    The mean ± standard error are shown for each group with the relevant statistical comparison.

    (DOCX)

    S2 Table. Principal component analysis component loadings.

    (DOCX)

    S3 Table. Miss-rates, the chance of mis-categorising a stimulus, by previous trial.

    Data is shown as mean ± standard error and significant p-values are shown in bold.

    (DOCX)

    S4 Table. Demographic and self-report measures in the directly rewarded PRT control population.

    Values are shown for each group as mean ± standard error where appropriate.

    (DOCX)

    S5 Table. Comparison of ANOVA/ANCOVA analytical approach used as main analysis with results obtained using generalised linear mixed models.

    Measures by task phase were analysed as multiple Mann-Whitney tests due to non-parametricity of data. #1: Acquisition, Z = -1.38, p = 0.17, First reversal, Z = -2.0, p = 0.045, #2: Acquisition, Z = -1.44, p = 0.15, First reversal, Z = -2.1, p = 0.036, #3: Acquisition, Z = -1.41, p = 0.16, First reversal, Z = -1.2, p = 0.24, #4: Acquisition, Z = -1.95, p = 0.05, First reversal, Z = -0.92, p = 0.36.

    (DOCX)

    S1 File. Supplementary methods for the directly rewarded PRT in a control population.

    (DOCX)

    Data Availability Statement

    All data files are available on the open science framework platform (10.17605/OSF.IO/63E8J).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES