The Effect of Reduced Learning Ability on Avoidance in Psychopathy: A Computational Approach

Takeyuki Oba; Kentaro Katahira; Hideki Ohira

doi:10.3389/fpsyg.2019.02432

2019 Nov 1;10:2432.

PMCID: PMC6838140 PMID: 31736830

Abstract

Individuals with psychopathy often show deficits in learning, which often have negative consequences. Several theories have been proposed to explain psychopathic behaviors, but the learning mechanisms in psychopathy are still unclear. To clarify the learning anomalies in psychopathy, we fitted reinforcement learning (RL) models to behavioral data. We conducted two experiments to examine the effect of psychopathy as a group difference (Experiment 1) and as a continuum (Experiment 2). Forty-three undergraduates (in Experiment 1) and fifty-five undergraduate and graduate students (in Experiment 2) performed a go/no-go based learning task with accompanying rewards or punishments. Although we observed no differences in learning performance among the levels of psychopathic traits, the learning rate for the positive prediction error in the loss domain was lower for those with high-psychopathic trait than for those with low-psychopathic trait. This finding indicates that individuals with high-psychopathic traits update an action value less when they avoid a negative outcome. Our model can represent previous theories under a computational framework and provide a new perspective on impaired learning in psychopathy.

Keywords: psychopathy, reinforcement learning model, learning rate, prediction error, avoidance learning

Introduction

Psychopathy is a group of personality traits described by callousness, lack of empathy, shallow affect, and impulsivity (Cleckley, 1976), and these traits can be divided into emotional detachment and externalizing behavior (Hare, 2003). Because of such features, individuals with psychopathy often commit antisocial behaviors and harm others (Hemphill et al., 1998; Leistico et al., 2008). However, a wide range of people who do not commit crimes may possess psychopathic traits (Levenson et al., 1995; Gao and Raine, 2010) because impaired emotional functions, rather than impulsivity, constitute the core element of psychopathy (Hare, 2003; Blair, 2006). Indeed, persons with high psychopathy who are recruited from a non-clinical population often show some behaviors similar to those of psychopathic offenders (Lynam et al., 1999; Osumi et al., 2007b; Kahane et al., 2015; Pletti et al., 2017).

One of the remarkable features related to psychopathy is a failure to learn from negative consequences, such as an electric shock, a monetary loss, or a loss of points (Lykken, 1957; Blair et al., 2006; von Borries et al., 2010). Many studies have reported that individuals with psychopathy showed deficient performance in several types of learning that are needed to change one’s own behavior through unpleasant experiences. A major paradigm for the evaluation of learning abilities in psychopathy is a go/no-go based learning task. Individuals with psychopathy often fail to withdraw a response to a stimulus that leads to punishment, but they rarely fail to respond to a stimulus that leads to a reward (Newman and Kosson, 1986; Newman and Schmitt, 1998; Lynam et al., 1999; Finger et al., 2011). Moreover, learning deficits have often been observed among psychopathic persons with low trait anxiety (Lykken, 1957; Newman and Schmitt, 1998). This finding indicates that individuals with psychopathy have difficulty in learning to adjust their behaviors based on negative outcomes. Clarifying the mechanisms of learning with negative results for individuals with psychopathy is thought to be important because learning deficits may cause abnormal moral development and behavior (Blair, 2017).

The reasons why individuals with psychopathy have difficulty learning from negative results have been debated. A classic explanation for the characteristics of psychopathy is the low-fear hypothesis, which suggests that diminished reactions to threatening stimuli underlie psychopathic features (Lykken, 1957; Patrick et al., 1993; Hoppenbrouwers et al., 2016). In this hypothesis, individuals with high psychopathy are less susceptible to negative stimuli; thus, their learning performance is insufficient compared to that of individuals with low psychopathy. In this regard, researchers have developed several neurocognitive models for psychopathy, such as the integrated emotion system (IES) theory, which highlights the amygdala and orbitofrontal cortex (OFC) functions that are assumed to form stimulus-outcome associations and to select appropriate actions after a reversal of contingency (Blair, 2006). In contrast, Newman and colleagues argued that impairments related to psychopathy stem from abnormal attentional systems (Hiatt et al., 2004; Zeier et al., 2009; Newman et al., 2010; Newman and Baskin-Sommers, 2016). This theory, the response modulation hypothesis, assumes that learning impairments in psychopathy occur due to the disregard for a disadvantageous sign while attending to a goal-related stimulus. While these theories have led to important findings, they seem to lack evidence to directly describe the learning deficits.

Reinforcement learning (RL) models can provide insight into the learning deficits in psychopathy by providing a computational framework for describing how advantages are maximized and disadvantages are minimized through experience (Sutton and Barto, 1998). A key component of RL models, especially a delta learning rule, is the prediction error (PE), which is the difference between an anticipated value and an actual received value. Several studies have shown neural activities correlated with the PE algorithms in classical and instrumental learning (Schultz et al., 1997; O’Doherty et al., 2003). This method allows us to summarize large dynamic data sets (i.e., trial-by-trial choice data) with very few parameters, such as a learning rate (i.e., the extent of modification to the error) and a subjective impact of outcomes (i.e., choice randomness). Using the learning parameters, the RL models can map psychopathology, such as schizophrenia (Culbreth et al., 2016) and major depression (Kunisato et al., 2012; Huys et al., 2013). This approach to studying mental illness using computational models is called computational psychiatry (Montague et al., 2012; Huys et al., 2016), and the RL models can provide details regarding learning mechanisms and anomalies. Thus, the RL models can describe learning impairment in psychopathy and explore how it corresponds to the abovementioned theories.

Several pioneering studies have explored the computational characteristics of learning abilities for individuals with psychopathy. Using a reward learning task in which a partner gives advice on the choice of behavior, Brazil et al. (2013a) found that some psychopathic traits were negatively correlated with the weights of the subjective probabilities for reward and social information. Blair et al. (2004) applied a Hebbian learning rule to simulate actual learning performance in psychopathic offenders and revealed that a model that represented impairments in stimulus-punishment associations could replicate the performance of individuals with psychopathy. Aisbitt and Murphy (2016) identified a learning characteristic related to psychopathy from a learning model thought to be affected by attention. They showed that the effect of competing cues in a learning task decreased with the extent of psychopathy; this result was predicted by the model that Aisbitt and Murphy used. In the go/no-go based learning task, White et al. (2013) demonstrated that adolescents with conduct problems showed smaller blood oxygen level-dependent (BOLD) signals correlated with action values that were estimated from an RL model. Brazil et al. (2017) used a computational framework to model fluctuations of BOLD signals during threat conditioning and showed that psychopathic traits were positively related to the fluctuations. These findings contribute to the theoretical, behavioral, and neurobiological understanding of learning deficits in psychopathy. However, Brazil et al. (2013a) did not test the effect of negative consequences on learning. Blair et al. (2004) and Aisbitt and Murphy (2016) did not report group differences for the model parameters because these studies used models to predict learning performance. Brazil et al. (2017) and White et al. (2013) mainly examined neural activities related to learning models. Thus, these studies have not examined the learning parameters associated with avoidance learning in psychopathy.

This article aims to examine the learning mechanisms in psychopathy using RL models. These models can provide parameters that characterize certain aspects of learning, and we searched for the relationships between RL parameters and psychopathy. We conducted two experiments to examine the relations of psychopathy as a group difference and as a continuum. We hypothesized that the abnormal learning process in psychopathy is related to aberrant valence systems such as reward-punishment and/or positive-negative PE processes. In line with the low-fear hypothesis, individuals with psychopathy have shown poor reactions to fear conditioning (Birbaumer et al., 2005) and weak physiological responses to unpleasant images (Blair et al., 1997; Osumi et al., 2007b). Moreover, the IES theory predicts that psychopathic traits are related to a weaker ability to build a stimulus-outcome association (Blair, 2006). Therefore, according to the low-fear hypothesis and the IES theory, individuals with high psychopathic traits should be slower to build negative associations (i.e., should have a lower learning rate in the loss domain) than individuals in a control group. In contrast, the response modulation hypothesis relies on data suggesting that when individuals with psychopathy concentrate on a target stimulus, they tend to attenuate the interference by another stimulus (Hiatt et al., 2004; Zeier et al., 2009). If the response modulation hypothesis is valid, learning parameters related to a reward system (especially the learning rate for positive PE in a gain condition) in the high-psychopathic trait group would be expected to be higher than parameters related to a punishment system. In addition, we sought other parameters that may contribute to learning in psychopathy.

Experiment 1

We first used an extreme groups approach to compare learning parameters between individuals with high and low levels of psychopathy. This method has the advantage of increased statistical power (Preacher et al., 2005; Katahira and Yamashita, 2017). The goal of Experiment 1 was to identify which learning parameters differed between individuals with high and low levels of psychopathic traits.

Materials and Methods

Participants

Data were obtained from 46 undergraduate students who met specific criteria, which are described later. All participants completed the Japanese version of the Levenson Self-Report Psychopathy Scale (LSRP: Levenson et al., 1995; Sugiura and Sato, 2005) and the trait anxiety scale from the Japanese version of the State-Trait Anxiety Inventory (STAI: Spielberger et al., 1970; Shimizu and Imae, 1981). We determined the sample size following previous studies (Brazil et al., 2013a; Aisbitt and Murphy, 2016; Pletti et al., 2017). The participants were divided into high- and low-psychopathic trait groups based on the criteria, and each group consisted of 23 participants. Two participants in the high-psychopathic trait group and one participant in the low-psychopathic trait group were excluded from the analysis because they performed poorly due to misunderstanding the instructions for executing the task in this experiment. Therefore, the students with high-psychopathic trait consisted of 21 participants (15 males and 6 females, mean age = 19.24, SD = 0.77), and the students with low-psychopathic trait consisted of 22 participants (13 males and 9 females, mean age = 19.05, SD = 0.90). All participants gave their written informed consent and received ¥1,000 for participation. This study was approved by the Ethics Committee of Nagoya University.

When we recruited candidates for this experiment, we used certain criteria derived from a screening session in which 411 university students completed both of the questionnaires described above. The first criterion was whether individuals had primary psychopathy scores on the LSRP 0.5 SD above or 0.5 SD below the average for the screening session (M = 33.01, SD = 6.36; thus, 0.5 SD = 3.18), which was also used for group allocation. The LSRP can measure the primary and secondary psychopathic traits that correspond to emotional detachment and impulsivity, respectively (see section Measurements for details). We define psychopathy as emotional dysfunction rather than impulsivity because several prior studies have reported defects in emotional responses (Blair et al., 1997; Osumi et al., 2007b), and primary psychopathic traits are theoretically unique to psychopathy (Blair, 2006). Moreover, Blair et al. (2006) revealed that impulsive traits related to secondary psychopathy were unlikely to predict learning performance (however, see Lynam et al., 1999). Therefore, we focused on the difference in primary psychopathy and allowed secondary psychopathy to be matched at the average level in the two groups. The other criterion was used to control for trait anxiety. Learning deficits in psychopathy were often obtained only when individuals had high scores for psychopathic traits and low anxiety (Lykken, 1957; Newman and Schmitt, 1998). Therefore, we refrained from recruiting people with anxiety traits greater than 1 SD above the average score of the screening session (M = 47.76, SD = 8.83). A summary of these personality traits is shown in Table 1.
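The recruitment cutoffs implied by these screening statistics can be worked out directly. The following is a minimal sketch, not the procedure actually used: the function name `allocate` and its return labels are hypothetical, the treatment of scores exactly at a cutoff is an assumption, and the additional matching on secondary psychopathy is not modeled.

```python
# Screening statistics reported in the text.
PP_MEAN, PP_SD = 33.01, 6.36   # primary psychopathy (LSRP)
TA_MEAN, TA_SD = 47.76, 8.83   # trait anxiety (STAI)

PP_HIGH_CUTOFF = PP_MEAN + 0.5 * PP_SD   # 33.01 + 3.18
PP_LOW_CUTOFF = PP_MEAN - 0.5 * PP_SD    # 33.01 - 3.18
TA_CUTOFF = TA_MEAN + 1.0 * TA_SD        # anxiety exclusion threshold


def allocate(primary_psychopathy, trait_anxiety):
    """Return 'high', 'low', or None (not recruited) for one screened student.

    Assumption: boundary scores are treated as not meeting a criterion.
    """
    if trait_anxiety > TA_CUTOFF:
        return None  # excluded because trait anxiety is more than 1 SD above the mean
    if primary_psychopathy > PP_HIGH_CUTOFF:
        return "high"
    if primary_psychopathy < PP_LOW_CUTOFF:
        return "low"
    return None  # within 0.5 SD of the screening mean
```

Under these assumptions, the primary psychopathy cutoffs are about 36.19 and 29.83, and the trait anxiety cutoff is about 56.59.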

TABLE 1.

Means and standard deviations of LSRP and STAI scores by group.

	High-psychopathic trait (n = 21)	Low-psychopathic trait (n = 22)	t-value	p-value
PP	42.76 (4.62)	25.64 (2.44)	15.29	< 0.001
SP	20.67 (2.33)	19.55 (2.54)	1.51	0.140
TA	42.00 (6.32)	44.27 (7.19)	1.10	0.278

PP, primary psychopathy; SP, secondary psychopathy; TA, trait anxiety. Standard deviations are in parentheses.
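As a consistency check, the t-values in Table 1 can be approximately recovered from the reported means, standard deviations, and group sizes. The sketch below assumes a pooled-variance (Student's) two-sample t-test, which the text does not state explicitly; the helper name `pooled_t` is hypothetical.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Absolute two-sample t statistic with pooled variance."""
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    return abs(mean1 - mean2) / standard_error


# (high mean, high SD, low mean, low SD) as reported in Table 1.
table1 = {
    "PP": (42.76, 4.62, 25.64, 2.44),
    "SP": (20.67, 2.33, 19.55, 2.54),
    "TA": (42.00, 6.32, 44.27, 7.19),
}

t_values = {
    scale: pooled_t(m_high, sd_high, 21, m_low, sd_low, 22)
    for scale, (m_high, sd_high, m_low, sd_low) in table1.items()
}
```

Because the inputs are already rounded to two decimals, the recomputed values agree with the tabled ones only to within rounding.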

Measurements

We used the Japanese version of the LSRP (Levenson et al., 1995; Sugiura and Sato, 2005) to assess the participants’ psychopathic tendencies. The reliability and validity of the LSRP have been examined by Lynam et al. (1999) and Osumi et al. (2007a), and the scale has been used in several studies (Osumi et al., 2007b; Kahane et al., 2015; Pletti et al., 2017). The LSRP has two subscales corresponding to primary psychopathy and secondary psychopathy. Primary psychopathy encompasses callousness and a manipulative attitude toward others (e.g., “People who are stupid enough to get ripped off usually deserve it”), whereas secondary psychopathy involves impulsivity and stimulation-seeking behavior (e.g., “I don’t plan anything very far in advance”). The primary psychopathy subscale consists of 16 items, and the secondary psychopathy subscale includes 10 items. Cronbach’s alpha statistics calculated from the screening session data were 0.790 for primary psychopathy and 0.599 for secondary psychopathy. Each item is rated on a four-point Likert-type scale [from disagree strongly (1) to agree strongly (4)].

The trait anxiety scale from the STAI (Spielberger et al., 1970) is a 20-item self-report questionnaire that measures the level of anxiety in daily life (e.g., “I lack self-confidence”). We used a Japanese version of the STAI, the validity of which was examined by Shimizu and Imae (1981). Cronbach’s alpha for this scale in the screening session was 0.859. Each STAI item is rated on a four-point Likert-type scale [from not at all (1) to very much so (4)].

Learning Task

The experimental task was a probabilistic go/no-go learning task almost identical to that used by Guitart-Masip et al. (2012). The experiment was controlled by PsychoPy v1.80.30 (Peirce, 2009). In this paradigm, participants were required to learn approach or avoidance actions from positive or negative outcomes (Figure 1). At the start of each trial, a fixation cross appeared for 1.5 s on the computer screen. Then, one of four fractal images was presented as a condition stimulus. Participants had to decide whether to press the space key while the fractal image was displayed for 2 s. After the fractal disappeared, feedback showing a gain of ¥10, a loss of ¥10, or neither a gain nor a loss was presented, depending on the action taken in response to the fractal. The feedback was presented for 1 s, and then the next trial began.

FIGURE 1.

Task structure in this experiment. Correct actions often lead to desirable results (gaining 10 yen for the gain cue and preventing a loss of money for the avoidance cue), whereas incorrect actions generally lead to undesirable consequences (omitting the reward and receiving the punishment). The pointing finger depicts a go action.

The four fractal images were randomly assigned to four conditions consisting of go to gain, no-go to gain, go to avoid, and no-go to avoid. The reward (+¥10) or no reward (¥0) feedback was shown in the gain trial, while the punishment (−¥10) or no punishment (¥0) result appeared in the loss trial. The outcomes were probabilistic: the correct response led to the favorable outcome on 80% of trials and to the unfavorable outcome on 20%, whereas the incorrect response led to the unfavorable outcome on 80% of trials and to the favorable outcome on 20%. Each of the four conditions was presented 60 times; thus, the participants completed a total of 240 trials. The trial order was randomized for a block that included the four conditions. At the end of the experiment, each participant received the total amount that they had earned. The participants were told that the outcomes were probabilistic, and they were required to find the correct response by trial and error to maximize their earnings.
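The outcome contingencies described above can be sketched in a few lines of Python. This is an illustration only, not the PsychoPy script used in the experiment: the condition labels, function names, and seed are hypothetical, and the blocking scheme (each block containing the four conditions once in shuffled order) is one assumed reading of the randomization.

```python
import random

# Each condition maps to (correct action, outcome domain).
CONDITIONS = {
    "go_to_gain": ("go", "gain"),
    "nogo_to_gain": ("nogo", "gain"),
    "go_to_avoid": ("go", "loss"),
    "nogo_to_avoid": ("nogo", "loss"),
}
P_FAVORABLE_IF_CORRECT = 0.8    # from the text
P_FAVORABLE_IF_INCORRECT = 0.2  # from the text


def feedback(condition, action, rng):
    """Return the outcome in yen (+10, 0, or -10) for one trial."""
    correct_action, domain = CONDITIONS[condition]
    if action == correct_action:
        p_favorable = P_FAVORABLE_IF_CORRECT
    else:
        p_favorable = P_FAVORABLE_IF_INCORRECT
    favorable = rng.random() < p_favorable
    if domain == "gain":
        return 10 if favorable else 0   # reward or no reward
    return 0 if favorable else -10      # no punishment or punishment


def make_trial_order(rng, n_blocks=60):
    """Assumed blocking: 60 blocks, each holding the four conditions shuffled."""
    order = []
    for _ in range(n_blocks):
        block = list(CONDITIONS)
        rng.shuffle(block)
        order.extend(block)
    return order


rng = random.Random(0)  # arbitrary seed for reproducibility
trial_order = make_trial_order(rng)
```

With 60 blocks of four conditions, `trial_order` contains 240 trials and 60 presentations of each condition, matching the totals given in the text.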

Reinforcement Learning Models

To assess the characteristics of learning, we applied delta rule RL models, including a combination of several parameters related to this experiment. All models are designed to assign an action value to each action for making decisions. Here, we consider action a (go or no-go) in response to stimulus s (a fractal image) on trial t for the action value Q_t(a_t, s_t). The action value for a chosen action is updated based on the following equation:

Q_{t+1}(a_t, s_t) = Q_t(a_t, s_t) + ε δ_t \quad (1)

δ_t = ρ r_t - Q_t(a_t, s_t) \quad (2)

where ε is the learning rate governing the degree to which the value is updated. The subjective impact of outcome ρ is a free parameter representing the effect size of the result. The outcome value r_t is 1 for a gain, -1 for a loss, or 0 for no gain or loss in trial t. The term ρr_t−Q_t(a_t, s_t) is the PE described as δ_t. Learning proceeds with a decision for each action according to the values, and the probabilities of implementing an action are calculated by the softmax function:

p_t(a_t, s_t) = \frac{\exp(W_t(a_t, s_t))}{\sum_{a'} \exp(W_t(a', s_t))} \quad (3)

where W_t(a_t, s_t) is an action weight corresponding to Q_t(a_t, s_t), except in the models with specific parameters.
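Equations 1 to 3 can be illustrated with a short sketch. The function names and the numerical values below are hypothetical and chosen only to make the arithmetic easy to follow; they are not estimates from the data.

```python
import math


def update_q(q, reward, epsilon, rho):
    """Delta-rule update (Eqs. 1 and 2); returns the new value and the PE."""
    delta = rho * reward - q          # prediction error, Eq. 2
    return q + epsilon * delta, delta  # value update, Eq. 1


def go_probability(w_go, w_nogo):
    """Softmax over the two action weights (Eq. 3), computed stably."""
    largest = max(w_go, w_nogo)
    exp_go = math.exp(w_go - largest)
    exp_nogo = math.exp(w_nogo - largest)
    return exp_go / (exp_go + exp_nogo)


# Illustrative values only: a gain (r = 1) after an initial value of 0.
new_q, delta = update_q(q=0.0, reward=1, epsilon=0.3, rho=2.0)
p_go = go_probability(w_go=new_q, w_nogo=0.0)
```

In this example the PE is ρr − Q = 2.0 and the updated value is 0 + 0.3 × 2.0 = 0.6, so the go action becomes more likely than the no-go action.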

We used two additional parameters that were validated in prior studies to explain the go/no-go learning task (Guitart-Masip et al., 2012, 2014). One parameter was the action bias, which is a tendency to press a button regardless of learning. The bias parameter b is added to the action value of the go action to form the action weight:

W_t(a_t, s_t) = \begin{cases} Q_t(a_t, s_t) + b & \text{if } a_t = \text{go} \\ Q_t(a_t, s_t) & \text{else} \end{cases} \quad (4)

The other parameter was the Pavlovian factor, which expresses the effect of a stimulus value. Several studies have reported that stimuli resulting in rewards tend to block action inhibition, while stimuli leading to punishment tend to discourage reactions even though they are not the correct responses (Guitart-Masip et al., 2012, 2014). The action weight is adapted by the Pavlovian factor π as follows:

W_t(a_t, s_t) = \begin{cases} Q_t(a_t, s_t) + π V_t(s_t) & \text{if } a_t = \text{go} \\ Q_t(a_t, s_t) & \text{else} \end{cases} \quad (5)

V_{t+1}(s_t) = V_t(s_t) + ε (ρ r_t - V_t(s_t)) \quad (6)

The stimulus value V_t(s_t) is updated with the same parameters used by the action value.
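A minimal sketch of how the action bias and the Pavlovian factor enter the action weights is given below. Combining both terms additively in a single weight for the go action is an assumption made here for illustration, since the two parameters are introduced in separate equations; setting `b` or `pi` to zero recovers each case. The function names are hypothetical.

```python
def action_weights(q_go, q_nogo, v, b=0.0, pi=0.0):
    """Return (w_go, w_nogo); only the go weight receives b and pi * V."""
    w_go = q_go + b + pi * v  # assumed additive combination of both terms
    w_nogo = q_nogo           # the no-go weight equals its action value
    return w_go, w_nogo


def update_v(v, reward, epsilon, rho):
    """Stimulus-value update using the same epsilon and rho as the action value."""
    return v + epsilon * (rho * reward - v)


# Illustrative values only: a stimulus with a negative value (loss cue).
w_go, w_nogo = action_weights(q_go=0.2, q_nogo=0.1, v=-0.5, b=0.3, pi=0.4)
```

With a positive π, a negative stimulus value lowers the go weight (here 0.2 + 0.3 + 0.4 × (−0.5) = 0.3), which is how the Pavlovian factor discourages responding to punishment-predicting stimuli.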

We hypothesized that psychopathic traits may be associated with deterioration in the process related to valence; thus, we divided certain parameters to obtain more detail about learning in psychopathy. The learning rate can be separated according to the positive PE (δ > 0) and negative PE (δ < 0). Models that comprise separate learning rates for the signed PE allow an asymmetric effect on learning depending on whether the outcome is better or worse than expected (Cazé and van der Meer, 2013). Furthermore, the learning rate can be divided between the gain and loss domains, indicating that the updating of values in the gain domain can differ from that in the loss domain. The resulting four learning rates correspond to four situations: a positive PE in a gain (gain: ε_GP), a negative PE in a gain (absence of reward: ε_GN), a positive PE in a loss (avoidance of monetary loss: ε_LP), and a negative PE in a loss (loss: ε_LN). The subjective impact of outcomes can also differ between a gain (ρ_G) and a loss (ρ_L), indicating that the subjective magnitude of positive reinforcers may not be equal to that of negative reinforcers. In sum, we examined 12 parameters and sought the best combination of these parameters.
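The choice among the four learning rates can be sketched as a lookup on the domain and the sign of the PE. The dictionary keys, function name, and numerical values are hypothetical; the handling of δ = 0 is an arbitrary assumption that does not affect the update, because the update is then zero.

```python
def select_learning_rate(domain, delta, rates):
    """Pick the learning rate for a 'gain' or 'loss' trial given the PE sign."""
    domain_key = "G" if domain == "gain" else "L"
    sign_key = "P" if delta > 0 else "N"  # assumption: delta == 0 uses the N rate
    return rates[domain_key + sign_key]


# Illustrative values only, not estimates from the data.
rates = {"GP": 0.20, "GN": 0.10, "LP": 0.40, "LN": 0.15}

# Loss trial in which the loss is avoided: r = 0 while the action value is negative.
rho, reward, q = 1.0, 0, -0.5
delta = rho * reward - q  # 0 - (-0.5) = 0.5, a positive PE in the loss domain
epsilon = select_learning_rate("loss", delta, rates)
```

This example shows why successful avoidance produces a positive PE: receiving nothing is better than the expected loss, so the ε_LP rate governs the update.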

Model Fitting and Comparison

Free parameters were estimated for each participant via a hierarchical type II maximum likelihood estimation, and the procedures were identical to those used in previous studies (see Huys et al., 2011; Guitart-Masip et al., 2012 for details). This method assumes that each individual's parameters are drawn from a population-level distribution for that parameter. We assumed that the population-level distribution for each parameter is a normal distribution. Certain parameters were transformed into a suitable form. To perform the estimation, the likelihood was maximized by the expectation-maximization procedure using the Laplace approximation to calculate the posterior probability. We used the Rsolnp package in R¹ to optimize the likelihood functions.

These models were evaluated with the integrated Bayesian information criterion (iBIC). A smaller iBIC value represents a better model (Huys et al., 2011). Briefly, the iBIC was calculated by using the following procedure. Using parameter values randomly generated from the population distributions, the likelihood was calculated multiple times (1,000 times here) for each participant's data. Next, after dividing the total likelihood of each participant by the number of samples (1,000), these amounts were summed for all participants. Finally, the cost for the number of parameters was added to this value (see Huys et al., 2011 for details). The iBIC values are approximations of the log marginal likelihoods with a penalty for the number of free parameters.
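The procedure can be sketched as follows. This is an illustration under stated assumptions rather than the exact computation used: it assumes that each participant's sampled likelihoods are averaged and then combined across participants on the log scale, and that the penalty takes the BIC-like form of the number of population-level parameters times the log of the total number of choices. The function and argument names are hypothetical.

```python
import math


def log_mean_exp(log_values):
    """Log of the mean of exp(log_values), computed without underflow."""
    largest = max(log_values)
    total = sum(math.exp(value - largest) for value in log_values)
    return largest + math.log(total / len(log_values))


def ibic(sample_log_likelihoods, n_population_parameters, n_choices_total):
    """Assumed iBIC: -2 * approximate log marginal likelihood + penalty.

    sample_log_likelihoods: one list per participant, holding the log
    likelihood of that participant's choices under each sampled parameter set.
    """
    log_marginal = sum(log_mean_exp(row) for row in sample_log_likelihoods)
    penalty = n_population_parameters * math.log(n_choices_total)
    return -2.0 * log_marginal + penalty
```

Working with log likelihoods matters in practice, because the likelihood of 240 choices is far too small to average directly in floating point.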

Results

Learning Performance

For the numbers of errors, we conducted a 2 (psychopathic tendency: high/low) × 2 (correct action: go/no-go) × 2 (domain: gain/loss) repeated-measures ANOVA (Figure 2). This analysis revealed a main effect of action [F(1, 41) = 6.315, p = 0.016, η_p² = 0.134]. Participants made more errors when they needed to suppress a response than when they were required to respond. Consistent with the findings of prior studies, a significant interaction between action and domain was found [F(1, 41) = 19.532, p < 0.001, η_p² = 0.323]. Shaffer’s post hoc test indicated that participants were likely to fail to obtain rewards more often by action inhibition (M = 0.383, SD = 0.329) than by using the go response (M = 0.113, SD = 0.202; p < 0.001), while they showed better performance with the no-go response (M = 0.166, SD = 0.103) than with the go response (M = 0.271, SD = 0.221) for avoiding a loss of money (p = 0.006). Moreover, the level of error was higher with the go action when participants were engaged in avoiding a monetary loss than when they were engaged in pursuing benefits (p = 0.001). In contrast, the number of failures for the no-go response was larger in the gain condition than in the loss condition (p < 0.001). For the statistical effects of psychopathic tendency, neither the main effect nor the interactions were significant in learning performance [main effect: F(1, 41) = 1.114, p = 0.297, η_p² = 0.026; psychopathic tendency × action: F(1, 41) = 0.004, p = 0.949, η_p² = 0.0001; psychopathic tendency × domain: F(1, 41) = 0.055, p = 0.816, η_p² = 0.001; psychopathic tendency × domain × action: F(1, 41) = 0.958, p = 0.334, η_p² = 0.023].
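The reported effect sizes can be checked against the F statistics using the standard identity η_p² = (F × df_effect) / (F × df_effect + df_error). The sketch below applies it to two of the values above; the function name is hypothetical.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared recovered from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# Main effect of action and the action x domain interaction, as reported above.
eta_action = partial_eta_squared(6.315, 1, 41)
eta_action_by_domain = partial_eta_squared(19.532, 1, 41)
```

Both results agree with the reported values of 0.134 and 0.323 to within rounding.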

FIGURE 2.

Error rates in each condition for both groups. Dots indicate the data for each participant. Error bars represent standard errors.

Model Selection

Several models that had a specific constellation of free parameters were compared to determine which model yielded the best prediction of the choice data by using the iBIC. Using a stepwise procedure for comparing models, we added one free parameter to a model and accepted the plausible parameter that decreased the iBIC the most at each step. First, as depicted in Figure 3A, the Pavlovian factor π reduced the iBIC of the basic model (one learning rate ε and one subjective impact of outcomes ρ) over the other parameters. The iBIC of the model with π was diminished by separation of the learning rates for positive and negative PEs (ε_P and ε_N). The learning rates that were further divided between gains and losses (ε_GP, ε_GN, ε_LP, and ε_LN) also decreased the iBIC value. Finally, the action bias parameter b reduced the iBIC. The subjective impact of outcomes among gains (ρ_G) and losses (ρ_L) did not reduce the iBIC. The winning model included four different learning rates (ε_GP, ε_GN, ε_LP, and ε_LN) and one subjective impact of outcomes ρ, action bias b, and the Pavlovian factor π. Figure 3B shows a prediction of the winning model for the actual choice data.
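The stepwise comparison can be sketched as a greedy forward search. This is a simplified illustration: it treats every candidate extension as independently addable, whereas in the comparison above some extensions refine others (for example, dividing the learning rates by domain after dividing them by PE sign). The function name is hypothetical, and `score` stands for any user-supplied function that fits a model and returns its iBIC.

```python
def forward_select(base_features, candidates, score):
    """Greedy forward selection that minimizes score (lower is better).

    Returns the selected feature set, its score, and the order of additions.
    """
    current = frozenset(base_features)
    best = score(current)
    remaining = set(candidates)
    path = []
    while remaining:
        # Score every one-step extension of the current model.
        scored = [(score(current | {c}), c) for c in sorted(remaining)]
        new_best, choice = min(scored)
        if new_best >= best:
            break  # no candidate lowers the criterion any further
        current, best = current | {choice}, new_best
        remaining.remove(choice)
        path.append(choice)
    return current, best, path
```

A candidate that never lowers the criterion is simply left out, which mirrors the treatment of the separate subjective impacts ρ_G and ρ_L here.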

FIGURE 3.

(A) Each iBIC value for RL models. ε, learning rate; ρ, subjective impact of outcomes; b, action bias; π, Pavlovian factor. The subscripts represent the following: P, positive PE; N, negative PE; G, gain domain; L, loss domain. The brightness represents the number of parameters (as the number of parameters increases, the bar becomes darker). The diamond shape represents the winning model. (B) Average probabilities of choosing a go response in each trial for four conditions and the model predictions. The solid lines indicate the proportions of the go responses in each trial across participants, and the dashed lines show the predictions of the winning model. The black and gray lines represent the high- and low-psychopathy groups, respectively.

Group Differences of the Parameters

We addressed the main question of how learning processes differ between individuals with high and low psychopathic trait scores. Using the winning model, we first checked the learning rates. A 2 (psychopathic tendency: high/low) × 2 (type of PE: positive/negative) × 2 (domain: gain/loss) repeated-measures ANOVA was performed (Figure 4). Shaffer’s post hoc test was used when a significant interaction was found. The ANOVA was significant for each main effect [psychopathic tendency: F(1, 41) = 4.988, p = 0.031, η_p² = 0.109; type of PE: F(1, 41) = 23.401, p < 0.001, η_p² = 0.363; domain: F(1, 41) = 22.378, p < 0.001, η_p² = 0.353]. The results indicated that participants who scored high on psychopathic tendencies showed less change in their action value than participants who scored low on psychopathy. Furthermore, the learning rates for the loss condition and the positive PE were larger than the learning rates for the gain condition and the negative PE. The interaction between the type of PE and domain was significant [F(1, 41) = 17.642, p < 0.001, η_p² = 0.301], suggesting that participants showed greater change in their action value when they avoided monetary loss than when they experienced monetary gain (p < 0.001) or loss (p < 0.001). Furthermore, a three-way interaction of psychopathic tendency × domain × type of PE was found [F(1, 41) = 5.291, p = 0.027, η_p² = 0.114]. This analysis showed that compared to the participants with low-psychopathic traits, the high-psychopathic trait participants possessed a lower learning rate for the positive PE in the loss condition (high-psychopathic students: M = 0.330, SD = 0.228, low-psychopathic students: M = 0.494, SD = 0.264; p = 0.036), indicating that individuals with high-psychopathic traits showed reduced value updating when avoiding monetary loss. However, both groups exhibited a higher learning rate for avoidance (ε_LP) than for the other conditions (high-psychopathic students: M = 0.170, SD = 0.125, p = 0.009 for ε_GP, M = 0.160, SD = 0.176, p = 0.018 for ε_LN; low-psychopathic students: M = 0.169, SD = 0.127, p < 0.001 for ε_GP, M = 0.154, SD = 0.156, p < 0.001 for ε_LN). These results were replicated when using the other models that included four learning rates (i.e., 4 learning rates + one subjective impact of outcomes + the Pavlovian factor or 4 learning rates + the Pavlovian factor + 2 subjective impacts of outcomes).

FIGURE 4.

Learning rates for each condition in the high- and low-psychopathic trait groups. Error bars and dots represent standard errors and individual data, respectively.

We further examined the relationships between psychopathic traits and other parameters. We performed t-tests between the groups for each parameter but found no significant effects [subjective impact ρ : t(41) = 0.251, p = 0.803, d = 0.077; bias: t(41) = 0.164, p = 0.871, d = 0.050; Pavlovian π : t(25.272) = 1.161 [with the Welch correction], p = 0.257, d = 0.360].

Experiment 2

In Experiment 1, we observed the group difference in learning rates for positive PE in the loss domain (i.e., slow to learn from the experience of avoidance). The extreme groups approach that we used in Experiment 1 can improve the statistical power but contains several problems (Preacher et al., 2005). Furthermore, many studies have investigated the effects of psychopathy as a continuum (Lynam et al., 1999; Newman et al., 2010; Brazil et al., 2013a; Kahane et al., 2015; Aisbitt and Murphy, 2016). We further examined whether psychopathy-related traits are linearly related to the learning parameters.