Abstract
In classical fear conditioning, neutral conditioned stimuli that have been paired with aversive physical unconditioned stimuli eventually trigger fear responses. Here, we tested whether aversive mental images systematically paired with a conditioned stimulus also cause de novo fear learning in the absence of any external aversive stimulation. In two experiments (N = 45 and N = 41), participants were first trained to produce aversive, neutral, or no imagery in response to three different visual-imagery cues. In a subsequent imagery-based differential-conditioning paradigm, each of the three cues systematically coterminated with one of three different neutral faces. Although the face that was paired with the aversive-imagery cue was never paired with aversive external stimuli or threat-related instructions, participants rated it as more arousing, unpleasant, and threatening and displayed relative fear bradycardia and fear-potentiated startle. These results could be relevant for the development of fear and related disorders without trauma.
Keywords: fear; fear conditioning; imagery; aversive conditioning; eyeblink reflex; open data, open materials
Classical fear conditioning provides a powerful model to explain the acquisition of fear in humans and nonhuman animals (LeDoux, 2014) and is often used to explain the development of anxiety disorders (Lissek et al., 2005; Mineka & Oehlberg, 2008). For example, in this framework, the emergence of a dog phobia could be explained as a consequence of being bitten by a dog; the dog would be the conditioned stimulus (CS), and the bite would be the aversive unconditioned stimulus (US) that becomes associated with the CS. A critical issue with this model, however, is that many patients with anxiety disorders do not recall such experiences (e.g., an aversive US such as a dog bite) in their past (Murray & Foote, 1979; Rachman, 1977), and this raises the question of whether an aversive US must physically occur in order for fear learning to occur.
Observational or vicarious fear-conditioning studies suggest that merely observing someone else receiving an aversive stimulation after CS presentation rather than experiencing the aversive stimulation oneself may suffice for fear learning to occur (Mineka, Davidson, Cook, & Keir, 1984), possibly because of overlapping neural representations of observing someone else in pain and experiencing pain oneself (Morrison, Lloyd, Di Pellegrino, & Roberts, 2004; Olsson & Phelps, 2007). Given that the neural representation of pain may also be activated by merely imagining a painful stimulation (Fairhurst, Fairhurst, Berna, & Tracey, 2012; Jackson, Brunet, Meltzoff, & Decety, 2006; Ogino et al., 2007), associative fear learning could also be based on mental images of the US (King, 1973; Lewis, O’Reilly, Khuu, & Pearson, 2013) and could even occur in the total absence of any physical or observed aversive stimulation. If this is the case, stimulus-contingent aversive imagery could provide an explanation for how fear may develop without aversive in vivo experiences and thus be of high relevance for understanding and treating anxiety disorders.
Previous studies that have investigated the role of an imagined US in conditioning (for a review, see Dadds, Bovbjerg, Redd, & Cutmore, 1997) either provided explicit instructions on CS–US contingencies (e.g., as used by Arabian, 1982; Soeter & Kindt, 2012) or first conditioned the CS with the physical US and then modulated an existing CS–US association with further US imagery (Jones & Davey, 1990). An open question is whether mental images of an aversive US may cause de novo fear conditioning in the total absence of any physically aversive stimulation, explicit instructions, or previously established CS–US associations. Translated to everyday life, can a person who was never bitten by a dog (and who neither observed how someone else was bitten nor was informed that dogs may bite) develop dog phobia, only because of aversive imagery when seeing a dog?
To investigate this hypothesis, we trained participants to produce specific mental images at the presentation of particular imagery cues. In a subsequent differential fear-conditioning procedure, we systematically paired CSs with these imagery cues but not with an actual US. Two different positive CSs (CSs+) were presented to disentangle CS responses related to aversive imagery from CS responses related to imagery per se. One CS+ was paired with a cue for aversive imagery (aversive CS+) and the other CS+ was paired with a cue for neutral imagery (neutral CS+). In addition, a negative CS (CS–) was presented and paired with an irrelevant stimulus that was physically similar to the imagery cues but was not supposed to prompt any imagery. After an acquisition phase, participants underwent an extinction phase in which the CSs were presented without the respective cues, to further investigate whether imagery-based conditioned fear is extinguished in the same manner as conditioned fear with a physical US (Dadds et al., 1997).
Study 1: Conditioning With an Imagined Thumbtack
Method
Participants and procedure
A total of 45 individuals (age: M = 22.67 years, SD = 2.5; 36 female, 9 male) with normal or corrected-to-normal vision and without neurological, cardiovascular, or psychiatric conditions participated in this study for course credit. Although we observed large effects for CS+ versus CS– differences in earlier studies with highly potent physically aversive USs (Sperl, Panitz, Hermann, & Mueller, 2016), we expected smaller (medium-size) effects with mental images of aversive events as USs. Under the assumption of medium correlations between measurements (r = .3), an alpha error probability of .05, and a power of .8, we used G*Power (Faul, Erdfelder, Lang, & Buchner, 2007) to calculate that a total sample size of 37 would be required to achieve a medium effect size (ηp2) of .06 for between-conditions differences. A sample size of 45 would allow for a potential data loss of up to 20%.
Participants signed informed consent, filled out a battery of questionnaires to test hypotheses unrelated to the current study, and completed a brief interview, after which they had electrodes attached for recording of the electrocardiogram (ECG) and electrodermal activity (EDA). Afterward, they were seated for a 5-min resting phase. Following an imagery training (see below), participants underwent the imagery-based fear-conditioning paradigm (Fig. 1). At the end, electrodes were detached, a postexperimental interview was conducted, and participants were debriefed and compensated for participation. The study was approved by the local ethics committee of the University of Marburg Psychology Department.
CSs and imagery cues
Three different faces with a neutral expression served as the aversive CS+, neutral CS+, and CS–, respectively (faces obtained from the Ekman faces series; Ekman & Friesen, 1976). The particular CS type of each face was counterbalanced across participants. Three different geometric shapes (a red square, a blue ellipse, and a yellow hexagon) served as imagery cues (aversive cue, neutral cue, no-image cue); assignment of shape and cue type was counterbalanced.
Imagery scripts
Prior to the current study, an online survey with 29 individuals had been conducted to determine a scenario that was considered highly aversive and could be vividly imagined by most individuals. From 10 different scenarios, participants found the “thumbtack-in-the-heel” scenario (see below) to be both highly aversive (M = 4.24, SD = 0.79, on a scale from 1 to 5) and vividly imaginable (M = 4.59, SD = 0.57, on a scale from 1 to 5). The “stepping-on-a-coin” scenario was chosen as the nonaversive control scenario because it also provides phasic tactile stimulation of the foot with a metallic object. Imagery scripts were created following the recommendations of Lang (1979) and included a response component to increase the vividness of the imagery (e.g., “your muscles cramp due to the pain of the thumbtack”).
The script for the thumbtack-in-the-heel scenario was as follows:
Imagine the following situation: You walk barefoot through a room, and your right foot steps on a thumbtack. You can feel the thin needle sinking into your heel as you step on the pin with your entire weight. The pain is piercing and intense and spreads from your heel into your leg. Every [red square/blue ellipse/yellow hexagon] that appears on the screen evokes the feeling of the needle pushing into your heel and the piercing and intense pain going through your body. The stinging pain is extremely unpleasant and barely tolerable. Focus on the pain you are experiencing. You can feel how it spreads from your right heel and you are cramping. You do not want to experience the stinging pain again. With every [red square/blue ellipse/yellow hexagon], you feel the thumbtack pushing into your heel.
The script for the stepping-on-a-coin scenario was as follows:
Imagine the following situation: You walk barefoot through a room, and your right foot steps on a 1-cent coin. You can feel the round metal under your heel when you step on it. The coin feels cool but it is not unpleasant. Every [red square/blue ellipse/yellow hexagon] that appears on the screen evokes the feeling of the round, cool coin under your heel. The contact is not unpleasant and is easily tolerable. Focus on the contact; you are relaxed. With every [red square/blue ellipse/yellow hexagon] that appears on the screen, you feel the round, cool coin under your heel.
The script for the control cue was as follows:
Whenever this [red square/blue ellipse/yellow hexagon] appears on the screen, you do not have to imagine anything. Just sit in your chair, observe the [red square/blue ellipse/yellow hexagon], and think of nothing in particular.
Imagery training
The imagery training was completed prior to the imagery-based-conditioning procedure and started with an auditory recording of the imagery scripts (recordings in German are available at https://doi.org/10.5281/zenodo.2591593). After the auditory instructions were given, participants were reminded two times about each cue–scenario association by instructions on the screen. If necessary, this reminder was repeated until participants were able to report the correct associations.
Imagery-based-conditioning paradigm
The imagery-based-conditioning paradigm consisted of an initial habituation phase, a subsequent acquisition phase, and a final extinction phase. The habituation phase consisted of three presentations of each CS for 10 s (intertrial interval, or ITI, jittered from 8 to 10 s) in random order. During each of two sequential acquisition blocks, every CS was presented 10 times for 10 s each, again with a jittered ITI of 8 to 10 s. Of the 10 CS presentations, 8 coterminated with the imagery cue centrally superimposed on the CS for the last 3 s (80% reinforcement). The two extinction blocks were identical to the acquisition blocks, except that the imagery cues were never shown.
Ratings
Before and after each phase and block, participants rated the valence and arousal of each of the three CSs on a 5-point Likert-type scale. In addition, participants reported their subjective experience of fear, anger, and disgust when looking at the CSs. Furthermore, participants were shown the three cues and asked to indicate whether they associated an image with each cue and, if so, to rate the unpleasantness of that image (from 0, not unpleasant at all, to 10, extremely unpleasant). Participants could also report having no image associated with a cue; these responses, which occurred exclusively to the no-image cue, were coded as 0 for statistical analyses.
Psychophysiological-data recording and reduction
The ECG and EDA were recorded with a BioSemi ActiveTwo system (BioSemi, Amsterdam, The Netherlands) with the Common Mode Sense and the Driven Right Leg electrodes attached to the right leg (sampling rate = 1024 Hz). For ECG measurement, Ag/AgCl electrodes (4-mm diameter) were applied in a lead-two configuration. In BrainVision Analyzer 2 (Brain Products, Gilching, Germany), the ECG was band-pass filtered (−3 dB at 1 Hz and 30 Hz, fourth-order two-way Butterworth filter, 24 dB/octave roll-off), and R spikes were detected automatically with the EKG Markers solution in the Analyzer software. R spikes were corrected manually if necessary, and nonusable data (e.g., premature systoles, excessive movement artifacts) were removed. Using custom-made MATLAB scripts (MATLAB Version 9.2; The MathWorks, Natick, MA), we then converted the ECG to a time course of interbeat intervals (IBIs), in which the value at each time point reflected the latency between the preceding and the next R spike (Mueller, Stemmler, Hennig, & Wacker, 2013). The IBI time series was then segmented into epochs ranging from −1,000 to 7,000 ms relative to CS onset (CS-evoked IBI) or from −1,000 to 10,000 ms relative to cue onset (cue-evoked IBI), baseline-corrected relative to −1,000 to 0 ms, downsampled to 2 Hz, and averaged across all trials by block and condition.
Heart rate responses to a CS during fear acquisition typically showed a triphasic response pattern (Lipp, 2007) consisting of an initial deceleration (D1), a transient acceleration (A1), and a second deceleration (D2). For analysis of CS-evoked IBIs, the maximum values were extracted for the time periods from 0 ms to 2,000 ms (D1) and 5,000 ms to 7,000 ms (D2), and the minimum values were extracted from 2,000 ms to 5,000 ms (A1). To remove the influence of the preceding components, we then computed peak-to-peak values, for example, the value for A1 was referenced to D1 (corrected A1 = A1 – D1), and the value for D2 was referenced to A1 (corrected D2 = D2 – A1). In addition to analyzing the three components separately, we also analyzed the mean IBI for the entire epoch from 0 ms to 7,000 ms (results are provided in the Supplemental Material available online). For cue-evoked IBIs, in which only a biphasic response was observed, the maximum value from 0 ms to 4,000 ms (D1) and the minimum value from 4,000 ms to 10,000 ms (A1) were taken.
EDA was recorded at the thenar and hypothenar of the nondominant hand with two Ag/AgCl electrodes (5-mm diameter, exosomatic measurement, 1 µA at 16 Hz AC). Electrodes were filled with isotonic (0.5% NaCl) electrolyte medium. Raw EDA was low-pass filtered off-line (1 Hz, same filter specifics as for ECG) and downsampled to 128 Hz. Ledalab 3.4.9 (implemented in MATLAB 9.2) was used for artifact correction and through-to-peak analyses (Benedek & Kaernbach, 2010a, 2010b). All data were visually screened, and technical artifacts were interpolated with spline or cubic interpolation.
Skin conductance responses (SCRs) were defined as the sum of SCR amplitudes of significant SCRs within 1,000 and 5,000 ms after CS or cue onset. SCRs smaller than 0.01 µS were considered zero responses. SCRs were logarithmized, ln(µS + 1), before averaging to obtain a normal distribution. Finally, as in the ECG analysis, SCR through-to-peak scores were averaged within blocks and conditions. Additional, more fine-grained SCR analyses with range correction and exclusion of nonresponders are provided in the Supplemental Material.
Statistical analyses
For analyzing responses to the imagery cues, repeated measures analyses of variance (ANOVAs) with the factors cue type (aversive vs. neutral vs. none) and block of acquisition (first vs. second) were conducted. For analyzing responses to CSs, the repeated measures ANOVAs included the factors CS type (aversive CS+ vs. neutral CS+ vs. CS–) and block of acquisition (first vs. second). Main effects were followed up by pairwise post hoc t tests. Greenhouse-Geisser correction was used when applicable. Statistical analyses were performed with SPSS 24.
Results
Responses to imagery cues
Subjective ratings
A Block (after first acquisition vs. after second acquisition) × Cue Type (aversive vs. neutral vs. none) ANOVA on the pleasantness of mental images revealed a main effect of cue type (p < .001; see Fig. 2; for further statistics, see Table S1 in the Supplemental Material), indicating that participants rated the image they had after being shown the aversive cue to be significantly more aversive than the image they had in response to the neutral cue. The Block × Cue Type interaction was not significant (p = .36), indicating that images remained aversive throughout the course of acquisition.
Physiological responses
The Block × Cue Type ANOVA on the D1 component revealed no main effects or interactions (all ps > .15). The Block × Cue Type ANOVA on the A1 IBI component revealed a main effect of cue type (see Fig. 2). Follow-up analyses indicated that these effects were driven by increased acceleration to the aversive cue versus the neutral cue (p = .007) but not between the aversive cue and the no-image cue (p = .24) or the neutral cue and the no-image cue (p = .23). Similarly, for SCR, the main effect of cue type was significant (p = .010). Direct comparisons revealed increased responses to the aversive cue as opposed to the neutral cue during the two acquisition blocks (p = .001; see Fig. 2), as well as to the aversive cue as opposed to the no-image cue (p = .029). There was no difference between the neutral cue and the no-image cue (p = .878).
Together, analyses of ratings and physiological responses thus confirmed that the aversive cue evoked imagery that was perceived as highly unpleasant and was accompanied by increased heart rate and SCR.
Responses to CSs
Subjective ratings
ANOVAs on self-rated fear after habituation confirmed that participants rated all faces to be similarly fear evoking prior to conditioning (ps ≥ .441). Importantly, after the first and second acquisition blocks, however, the Block × CS Type ANOVA on fear ratings revealed a main effect of CS type (p < .001; see Table S1). As shown in Figure 3, participants rated faces that had been paired with the thumbtack-image cue (aversive CS+) as significantly more fear evoking than faces that had been paired with the coin-image cue (neutral CS+), t(44) = 3.36, p = .002, or with the no-image cue (CS–), t(44) = 3.56, p < .001. Very similar main effects of CS type emerged for the anger, disgust, arousal, and valence ratings (all ps < .008; see Fig. 4).
Similar to the ANOVAs during acquisition, results of the Block × CS Type ANOVA on ratings during extinction (i.e., after termination of cue presentation) revealed main effects of CS type for fear, anger, disgust, arousal, and valence ratings (all ps < .033), indicating that faces previously paired with aversive images continued to evoke negative feelings even if they were no longer paired with image cues.
To test whether extinction reduced negative feelings relative to acquisition, we additionally performed a Phase × CS Type ANOVA in which the factor phase consisted of the last block of acquisition versus the last block of extinction. This ANOVA yielded a significant Phase × CS Type interaction for fear (p = .008) and arousal (p = .029) ratings; the effect of CS type (i.e., the differential fear response) decreased from acquisition toward the end of extinction (fear: ηp2s = .16 vs. .05; arousal: ηp2s = .23 vs. .08). There were no significant interactions for valence, anger, or disgust ratings (ps ≥ .155).
Physiological responses
During habituation, the D1 component did not differ as a function of CS type (p = .48). During acquisition, the Block × CS Type ANOVA on the D1 component revealed an interaction of block and CS type (p = .023). ANOVAs within blocks revealed that during the beginning of acquisition, IBI did not differ between CS types (p = .559), whereas CS type modulated IBI in the second block of acquisition (p = .023), as shown in Figure 5. As expected (Notterman, Schoenfeld, & Bersh, 1952; Panitz, Hermann, & Mueller, 2015; Sperl et al., 2016), direct comparisons indicated stronger deceleration for the aversive CS+ than the neutral CS+ (14 vs. 4 ms; p = .016). In addition, there was stronger deceleration for the CS– than the neutral CS+ (p = .035). The Block × CS Type ANOVA on the other IBI components revealed no significant main effects or interactions involving CS type (all ps > .5).
Consistent with successful extinction, the Block × CS Type ANOVA on D1 during extinction did not reveal any main effects or interactions (ps > .6). The Block × CS Type ANOVAs on the other IBI components and on EDA revealed no significant main effects or interactions involving CS type during habituation (ps > .07), acquisition (ps ≥ .5), or extinction (ps ≥ .09).
Study 2: Conditioning With an Imagined Electric Shock
The first study showed that, when contingently paired with aversive mental images, CSs elicit fear responses at the subjective and cardiovascular levels. The aim of Study 2 was to determine whether imagery-based fear conditioning would also work with shorter CS durations and a US that is more typical for classical fear-conditioning studies (i.e., imagery of an electric shock). Moreover, to rule out demand effects, fear-potentiated startle, which is a physiological marker outside of conscious control (Hamm & Weike, 2005; Lipp, 2007), was assessed in Study 2. Finally, to further explore similarities between imagery-based fear conditioning and fear conditioning based on physical USs, we tested whether imagery of the US after extinction triggers a return of fear (i.e., reinstatement) because it is commonly observed after physical US presentations (Hermans et al., 2005).
Method
Participants and procedure
In Study 2, 41 individuals (age: M = 23.76 years, SD = 2.6; 29 female, 12 male) participated for course credit. This sample size allowed us to test the central hypotheses with a power of .8, an alpha error probability of .05, and a drop-out rate of 10%. The study was approved by the local ethics committee of the University of Marburg Psychology Department. The overall procedure was identical to that in Study 1, and participants again filled out a battery of questionnaires to test hypotheses unrelated to the current report.
Imagery scripts
In contrast to participants in Study 1, participants in Study 2 were instructed to imagine receiving a strong electric shock on the forearm (aversive imagery) or receiving a mild vibration on the forearm (neutral imagery). The script for feeling a painful electric shock was as follows:
Imagine the following situation: You sit in a chair; your hands are on the arm rests. An electrode is attached to your left wrist. The electrode provides a short but powerful electric shock whenever a [red square/blue triangle/yellow circle] appears on the screen. The shock spreads throughout your whole body. Every time the [red square/blue triangle/yellow circle] appears on the screen, you receive a painful electric shock. The pain is extremely uncomfortable and barely tolerable. Focus on the pain you are experiencing. You can feel how it spreads throughout your entire body and how your muscles are cramping. You do not wish to experience this pain again. With every [red square/blue triangle/yellow circle], you experience the electric shock again.
The script for feeling a vibration was as follows:
Imagine the following situation: You sit in a chair; your hands are on the arm rests. A wristband is attached to your left wrist. The wristband provides a short vibration whenever a [red square/blue triangle/yellow circle] appears on the screen. Every time the [red square/blue triangle/yellow circle] appears on the screen, you experience this vibration. You can feel the vibration spread throughout your whole body. The sensation is not at all uncomfortable and is easily tolerable. Focus on the vibration you are experiencing. You can feel how it spreads throughout your entire body, and your muscles are relaxed. With every [red square/blue triangle/yellow circle], you experience the vibration again.
The script for the control cue was as follows:
Whenever this [red square/blue triangle/yellow circle] appears on the screen, you do not have to imagine anything. Just sit in your chair, observe the [red square/blue triangle/yellow circle], and think of nothing in particular.
Original imagery scripts in the German language are available at https://doi.org/10.5281/zenodo.2591593.
Imagery-based-conditioning paradigm
The imagery-based-conditioning paradigm was identical to that in Study 1 with the following exceptions. First, the CS was presented for 8 s instead of 10 s. Second, participants saw the aversive-imagery cue once after the first extinction block (reinstatement cue). Third, a circle and a triangle were used instead of the ellipse and hexagon cues, respectively. The ITI, reinforcement rate, cue presentation time, and number of CS presentations were identical to those in Study 1.
Dependent variables
Ratings
As the effects of conditioning on different affect-rating scales were largely redundant in Study 1, we collected only CS-associated arousal and valence in Study 2. CS ratings (i.e., arousal and valence) from only 40 participants were analyzed because 1 participant claimed after the experiment to have misunderstood the questions. The assessment and analysis were identical to those in Study 1.
Physiological responses
Procedures for SCR and ECG recording and analysis were largely identical to those in Study 1. However, because of the shorter CS presentation latency, only the ECG recording from −1,000 ms to 5,000 ms relative to CS and cue onset were analyzed. Accordingly, the CS-evoked IBI included a D1 component from 0 ms to 2,000 ms, an A1 component from 2,000 ms to 5,000 ms, and a D2 component from 4,000 ms to 5,000 ms. The cue-evoked IBI included a D1 component that was measured as the maximum IBI from 0 ms to 2,000 ms and an A1 component that was measured as the minimum IBI from 2,000 ms to 5,000 ms. To remove the influence of the preceding components in the CS-evoked IBI, we then referenced the value for A1 to D1 (corrected A1 = A1 – D1) and the value for D2 was referenced to A1 (corrected D2 = D2 – A1). Only trials not containing startle probes (see the next section) were used for SCR and IBI analyses. One participant had to be excluded from SCR analyses because of missing data, and 1 participant had to be excluded from IBI analyses for the extinction phase because of excessive artifacts in the ECG recording during that phase.
Fear-potentiated startle
After the resting phase, the startle probe—50 ms duration, 85 dB(A) white-noise burst, 1 ms rise–fall time—was presented five times to allow for an initial startle habituation. In the acquisition and extinction phases, the startle probe was presented during five presentations of each CS in each block (potential window: 2–4 s after CS) and during six ITIs in each block (between 2 s into the ITI and 1 s before its end). Electromyography (EMG) was measured below the left eye on the musculus orbicularis oculi using two Ag/AgCl electrodes (4-mm diameter) and analyzed according to recommendations from Blumenthal et al. (2005). It was first band-pass filtered from 28 Hz to 500 Hz, rectified and low-pass filtered with a time constant of 10 ms, segmented from −50 ms to 250 ms relative to startle onset, and then baseline-corrected from −50 ms to 0 ms.
Because of the good signal-to-noise ratio of startle responses, data were not aggregated within blocks but instead analyzed at the single-trial level (Sevenster, Beckers, & Kindt, 2013; Soeter & Kindt, 2010, 2012) to allow visualization of the learning dynamics during imagery-based fear conditioning. To this end, the maximum value between 20 ms and 150 ms was assessed for each trial in which a startle response was observed that did not begin earlier than 20 ms after startle onset. Single-trial startle magnitudes were T standardized (M = 50, SD = 10) within each participant using ITI startle magnitudes as the reference distribution. In cases of nonresponse, missing values were interpolated on the basis of the value of the preceding available trial of the same category. If there was no preceding trial in that block and category, the value from the succeeding available trial was taken instead. If there was no trial in one acquisition block and category, the participant was excluded, yielding a final sample for startle analysis of 29 for acquisition and 26 for extinction. Finally, single-trial startle responses were normalized as the percentage of the first startle response during the respective condition. For the statistical analysis of fear-potentiated startle, a CS Type × Trial ANOVA was performed.
Results
Responses to image cues
Subjective ratings
The Block × Cue Type ANOVA on the unpleasantness ratings of the cue-related image revealed a main effect of cue (p < .001), indicating that the image prompted by the aversive cue was rated as more unpleasant than the image prompted by the neutral cue or the no-image cue, whereas there was no difference between ratings for the neutral cue and the no-image cue.
Physiological responses
As in Study 1, the Block × Cue Type ANOVA on the A1 revealed a significant main effect of cue type (p = .013), whereas the same ANOVA on the D1 component during acquisition revealed no main effects or interactions (all ps > .3). Mirroring the results of Study 1 (see Fig. 2d), follow-up analyses indicated that these effects were driven by a stronger acceleration component to the aversive cue than to the neutral cue (p = .022) and to the aversive cue than to the no-image cue (p = .015) but not to the neutral cue than to the no-image cue (p = .52). A Block × Cue Type ANOVA on SCR revealed only a main effect of block (p = .046), which was due to smaller responses in the second block than the first block, and no other significant main effects or interactions (all ps ≥ .309). As in Study 1, the aversive cue thus evoked a mental image that was perceived as highly unpleasant and was accompanied by increased heart rate. Meanwhile, in this study, increased SCR to the aversive-imagery cue was not observed.
Responses to CSs
Subjective ratings
At baseline and after habituation, participants rated all faces to be similarly arousing and pleasant (ps ≥ .305). The CS Type × Block ANOVA on the arousal ratings revealed a main effect of CS type (p = .017; see Fig. 4). Post hoc t tests showed increased arousal for the aversive CS+ than for the neutral CS+ (p = .020) and the aversive CS+ than the CS– (p = .021) but not between the neutral CS+ and the CS– (p = .742). With regard to the valence ratings, the ANOVA revealed a significant effect of block (p = .033) and a trend for CS type (p = .074). In line with the arousal ratings and Study 1, exploratory t tests indicated increased unpleasantness of the aversive CS+ compared with the neutral CS+ (p = .030). There were no significant differences between ratings for the aversive CS+ and the CS– (p = .330) or the neutral CS+ and the CS– (p = .170).
During extinction, the Block (before reinstatement vs. after reinstatement) × CS Type ANOVAs on arousal and valence ratings showed a significant main effect of CS type only for the arousal ratings (p = .004), which was not further modulated by block (p = .641), thus providing no evidence for complete extinction or reinstatement. Similarly, an ANOVA on ratings after the last block of acquisition and the last block of extinction revealed no interactions of block and CS type for the arousal or valence ratings (ps > .6). Taken together, the ratings of Study 2 provide no evidence for successful extinction or reinstatement.
Peripheral measures
During habituation, there was no effect of CS type in any of the three cardiac components (ps > .32). The Block × CS Type ANOVA on D1 for acquisition revealed a marginally significant interaction of block and CS type (p = .070; see Fig. 4), comparable with the significant effect in Study 1. The overall pattern mirrored the results of Study 1, suggesting marginally higher IBI and relative deceleration to the aversive CS+ and CS– compared with the neutral CS+ in the second but not the first block of acquisition (see Fig. 5). As in Study 1, the Block × CS Type ANOVAs on D1 during extinction and the analyses of the other cardiac components and of SCR during any of the three phases revealed no significant main effects or interactions involving CS type (ps > .12).
Fear-potentiated startle
The CS Type × Trial ANOVA revealed a main effect of CS type (p = .037) and a main effect of trial (p < .001). Post hoc t tests on the main effect of CS type revealed a significant difference between the aversive CS+ and the CS– (p = .023) and (marginally) between the aversive CS+ and the neutral CS+ (p = .054) but not between the neutral CS+ and the CS– (p = .970).
Trial-wise t tests revealed that startle probes during the aversive CS+ evoked enhanced startle responses relative to the CS– in Trial 3 (p = .043) of the first acquisition block and in Trial 1 (p = .036), Trial 2 (p = .084), Trial 3 (p = .019), and Trial 4 (p = .022) of the second acquisition block and that the aversive CS+ evoked enhanced startle relative to the neutral CS+ in Trial 1 (p = .035), Trial 2 (p = .054), Trial 3 (p = .021), and Trial 4 (p = .001) of the second acquisition block (see Fig. 6). Specifically, a dishabituation between the last trial of the first acquisition block (Trial 5) and the first trial of the second acquisition block (Trial 6, following a short break) was observed for the aversive CS+ (from 76% to approximately 81% of the first startle response), whereas the response during the other two CSs remained at about 68%.
The CS Type × Trial ANOVA for the startle responses during extinction revealed only main effects of trial (p < .001) but no other significant main effects or interactions (ps ≥ .42). Trial-wise t tests for extinction revealed no enhanced startle responses for the aversive CS+ relative to the neutral CS+ (ps ≥ .073) or CS– (ps ≥ .169).
Discussion
The goal of the current research was to test whether fear can be conditioned de novo with aversive mental images as USs only. To this end, we conducted two studies in which different neutral face photographs were contingently paired with specific cues that had been previously trained to prompt aversive, neutral, or no imagery in 41 and 45 participants, respectively. Across studies, participants rated neutral faces as more fear evoking, unpleasant, and arousing, and they responded with relative cardiac deceleration and fear-potentiated startle if the faces had been paired with aversive imagery compared with neutral or no imagery. Because these findings indicate that associative fear learning may occur in the total absence of aversive physical stimulation, vicarious experiences, or explicit instructions, our results are relevant for understanding how phobias and anxiety disorders may develop in the absence of prior physically aversive experiences.
Most importantly, CS ratings revealed that faces, which were initially perceived as neutral, were later rated as more unpleasant, arousing, and fear evoking if they had been paired with cues for aversive as opposed to neutral or no imagery. It can be assumed that these cues prompted participants to produce the intended mental images, given that participants rated the mental images in response to the aversive cue as highly unpleasant and showed cardiac acceleration and increased SCRs to the aversive cue. Because (a) none of the faces had ever been paired with an aversive physical stimulus, (b) no instructions regarding the faces had ever been given, and (c) participants had not observed anyone else receiving an aversive stimulation in response to the faces, the higher arousal, negative valence, and fear ratings to the aversive CS+ than to the neutral CS+ and the CS– in Studies 1 and 2 can be ascribed only to the different mental images prompted by the associated cues.
At the cardiac level, the aversive CS+ evoked more cardiac deceleration compared with the neutral CS+ after the first acquisition block in Study 1 and (marginally significantly) in Study 2. Because fear-conditioned CSs+ generally evoke cardiac deceleration or “fear bradycardia” (Notterman et al., 1952; Panitz et al., 2015; Sperl et al., 2016), this finding further supports successful imagery-based fear learning from Block 1 to Block 2. In addition, the two nonthreatening stimuli differed from each other; there was a relative acceleration to the neutral CS+ as opposed to the CS–, consistent with imagery tasks evoking cardiac acceleration (Vrana & Lang, 1990).
With regard to the eyelid startle magnitude, which is believed to be a relatively pure correlate of stimulus valence in both classical fear conditioning and fear imagery (Hamm & Weike, 2005; Vrana & Lang, 1990), noise bursts given during aversive CS+ evoked stronger startle responses than bursts given during the neutral CS+ or CS–. As with cardiac deceleration, this effect increased throughout the course of learning and was particularly pronounced in the second half of the acquisition phase. At the same time, we did not observe higher electrodermal responses to the CS+ than to the CS– as we have found with physical USs using the same type of CS (Mueller, Panitz, Hermann, & Pizzagalli, 2014; Panitz et al., 2018; Sperl et al., 2016). Furthermore, in Study 2, imagery of a US did not trigger a reinstatement as would be expected with a physical US presentation (Hermans et al., 2005). Together, this suggests that de novo fear acquisition based on imagery mirrors physical-US-based fear conditioning with regard to some factors (i.e., subjective report, fear-potentiated startle, fear bradycardia) but not all factors (i.e., EDA, reinstatement).
An open question is whether the fear conditioning, as observed in both studies, was actually caused by the mental images that were paired with the CSs. Alternatively, the aversive-imagery cues themselves may have acquired aversive properties during the initial imagery-training procedure and served as a second-order conditioning US (Rizley & Rescorla, 1972). Although this is somewhat speculative at this point, such a mechanism may have far-reaching clinical implications because it would suggest that cues remotely associated with aversive imagery (rather than aversive imagery per se) may cause new fear learning. To probe the involvement of second-order conditioning, researchers may in the future control for cue valence, for example, by collecting cue valence ratings or by applying more indirect approaches to assess stimulus valence. Alternatively, researchers may include a control group that receives the initial imagery training but is instructed to not engage in imagining when cues are presented during conditioning.
Furthermore, the observed fear responses to the aversive CS+ may not have been caused by the actual imagery of an aversive event but may instead relate to propositional knowledge. Although this is a general issue of human associative-learning studies (Mitchell, De Houwer, & Lovibond, 2009), the startle potentiation during the aversive CS+ compared with both the neutral CS+ and CS– in Study 2 shows that associative learning could also be observed with regard to threat responses that are largely outside of cognitive control (Hamm & Weike, 2005; Lipp, 2007). Moreover, the observed relative fear bradycardia to the aversive CS+ in Studies 1 and 2 supports the notion that the aversive CS+ indeed triggered fear responses across multiple response systems, suggesting that the acquired association of the aversive CS+ and aversive imagery goes beyond merely propositional knowledge.
The type of learning that is captured with this novel paradigm is potentially relevant for anxiety disorders and other aspects of human functioning, particularly in light of the relevance of imagery for mental disorders and their treatment (Pearson, Naselaris, Holmes, & Kosslyn, 2015). This type of imagery-based learning connects truly existing external stimuli to threatening images or, by extension, reality to fantasy. With such connections, the emergence of dog phobia does not require being bitten by a dog but it would suffice to merely imagine being bitten when encountering dogs. Similarly, imagery of social embarrassment, suffocation, back pain, or even terrorist attacks may be highly relevant for the emergence and treatment of social phobia, agoraphobia, pain disorder, and social prejudice, when contingently paired with seeing other individuals, subway trains, movements, or foreigners, respectively.
It should be noted that the content and time course of experimentally induced imagery cannot be perfectly controlled. After contingencies are learned, participants may initiate imagery before cues are presented. As a consequence, recordings of conditioned responses after CS presentations may have been confounded with unconditioned responses to the mental images. In contrast to this assumption, however, we observed a dissociation of unconditioned responses and conditioned responses at the cardiovascular level, which is typically found in classical fear-conditioning studies (Lipp, 2007): relative cardiac deceleration to the aversive CS+, but cardiac acceleration to the US or, in the current studies, the aversive cue. Moreover, participants may have visualized an image when US presentations were not intended (e.g., during the CS–, nonreinforced trials, or the extinction phase) or, alternatively, may have avoided unpleasant mental images by not vividly imagining the US or not imagining the US at all. Because such behavior may have led to enhanced or reduced imagery-based conditioning, respectively, the reported effect sizes may not accurately reflect the actual potential of aversive images to induce fear learning in real life.
Taken together, the present studies showed that subjective and physiological fear responses were evoked by neutral faces, which were never paired with any aversive physical stimuli, any observations, or any explicit instructions but only with cues for aversive imagery. When contingently paired with neutral stimuli, particular images may thus lead to de novo conditioning, which is of potential relevance for anxiety disorders, social prejudice, and other dimensions of human functioning.
Supplemental Material
Supplemental material, MuellerSupplementalMaterial_rev for Aversive Imagery Causes De Novo Fear Conditioning by Erik M. Mueller, Matthias F. J. Sperl and Christian Panitz in Psychological Science
Supplemental Material
Supplemental material, Mueller_OpenPracticesDisclosure_rev for Aversive Imagery Causes De Novo Fear Conditioning by Erik M. Mueller, Matthias F. J. Sperl and Christian Panitz in Psychological Science
Acknowledgments
We thank Harald Lachnit for helpful discussions concerning the study and Alexander Petermann, Julia Zachlehner, Anne Gerhard, and Jing Zhao for great assistance in planning, conducting, and analyzing parts of the study.
Footnotes
Action Editor: Ian H. Gotlib served as action editor for this article.
Author Contributions: E. M. Mueller developed the study concept. All the authors contributed to the study design. C. Panitz programmed the experiment. Testing and data collection were performed by students supervised by C. Panitz and M. F. J. Sperl. Data preprocessing was performed by M. F. J. Sperl (electrodermal activity) and C. Panitz (all other physiological variables). E. M. Mueller analyzed and interpreted the data. C. Panitz confirmed the reproducibility of all reported results. E. M. Mueller drafted the manuscript, and C. Panitz and M. F. J. Sperl provided critical revisions. E. M. Mueller and M. F. J. Sperl created the figures. M. F. J. Sperl made the data, analysis scripts, and codebooks publicly available. All the authors approved the final manuscript for submission.
ORCID iD: Matthias F. J. Sperl https://orcid.org/0000-0002-5011-0780
Declaration of Conflicting Interests: The author(s) declared that there were no conflicts of interest with respect to the authorship or the publication of this article.
Supplemental Material: Additional supporting information can be found at http://journals.sagepub.com/doi/suppl/10.1177/0956797619842261
Open Practices:
Deidentified data for both experiments, a codebook, data-analysis scripts, and audio recordings of the imagery scripts have been made publicly available via Zenodo and can be accessed at https://doi.org/10.5281/zenodo.2591593. Face stimuli were part of the Ekman face series (Ekman & Friesen, 1976) and cannot be posted without permission. The stimulus IDs are JJ3-4, EM2-4, and GS1-4. The design and analysis plans for the studies were not preregistered. The complete Open Practices Disclosure for this article can be found at http://journals.sagepub.com/doi/suppl/10.1177/0956797619842261. This article has received the badges for Open Data and Open Materials. More information about the Open Practices badges can be found at http://www.psychologicalscience.org/publications/badges.
References
- Arabian J. M. (1982). Imagery and Pavlovian heart rate decelerative conditioning. Psychophysiology, 19, 286–293. doi: 10.1111/j.1469-8986.1982.tb02565.x [DOI] [PubMed] [Google Scholar]
- Benedek M., Kaernbach C. (2010. a). A continuous measure of phasic electrodermal activity. Journal of Neuroscience Methods, 190, 80–91. doi: 10.1016/j.jneumeth.2010.04.028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benedek M., Kaernbach C. (2010. b). Decomposition of skin conductance data by means of nonnegative deconvolution. Psychophysiology, 47, 647–658. doi: 10.1111/j.1469-8986.2009.00972.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blumenthal T. D., Cuthbert B. N., Filion D. L., Hackley S., Lipp O. V., van Boxtel A. (2005). Committee report: Guidelines for human startle eyeblink electromyographic studies. Psychophysiology, 42, 1–15. doi: 10.1111/j.1469-8986.2005.00271.x [DOI] [PubMed] [Google Scholar]
- Cumming G. (2014). The new statistics: Why and how. Psychological Science, 25, 7–29. doi: 10.1177/0956797613504966 [DOI] [PubMed] [Google Scholar]
- Dadds M. R., Bovbjerg D. H., Redd W. H., Cutmore T. R. (1997). Imagery in human classical conditioning. Psychological Bulletin, 122, 89–103. doi: 10.1037/0033-2909.122.1.89 [DOI] [PubMed] [Google Scholar]
- Ekman P., Friesen W. V. (1976). Pictures of facial affect. Palo Alto, CA: Consulting Psychologists Press. [Google Scholar]
- Fairhurst M., Fairhurst K., Berna C., Tracey I. (2012). An fMRI study exploring the overlap and differences between neural representations of physical and recalled pain. PLOS ONE, 7(10), Article e48711. doi: 10.1371/journal.pone.0048711 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Faul F., Erdfelder E., Lang A.-G., Buchner A. (2007). G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behavior Research Methods, 39, 175–191. doi: 10.3758/BF03193146 [DOI] [PubMed] [Google Scholar]
- Hamm A. O., Weike A. I. (2005). The neuropsychology of fear learning and fear regulation. International Journal of Psychophysiology, 57, 5–14. doi: 10.1016/j.ijpsycho.2005.01.006 [DOI] [PubMed] [Google Scholar]
- Hermans D., Dirikx T., Vansteenwegen D., Baeyens F., Van den Bergh O., Eelen P. (2005). Reinstatement of fear responses in human aversive conditioning. Behaviour Research and Therapy, 43, 533–551. doi: 10.1016/j.brat.2004.03.013 [DOI] [PubMed] [Google Scholar]
- Jackson P. L., Brunet E., Meltzoff A. N., Decety J. (2006). Empathy examined through the neural mechanisms involved in imagining how I feel versus how you feel pain. Neuropsychologia, 44, 752–761. doi: 10.1016/j.neuropsychologia.2005.07.015 [DOI] [PubMed] [Google Scholar]
- Jones T., Davey G. C. (1990). The effects of cued UCS rehearsal on the retention of differential ‘fear’ conditioning: An experimental analogue of the ‘worry’ process. Behaviour Research and Therapy, 28, 159–164. doi: 10.1016/0005-7967(90)90028-H [DOI] [PubMed] [Google Scholar]
- King D. L. (1973). An image theory of classical conditioning. Psychological Reports, 33, 403–411. doi: 10.2466/pr0.1973.33.2.403 [DOI] [PubMed] [Google Scholar]
- Lang P. J. (1979). Presidential address, 1978: A bio-informational theory of emotional imagery. Psychophysiology, 16, 495–512. doi: 10.1111/j.1469-8986.1979.tb01511.x [DOI] [PubMed] [Google Scholar]
- LeDoux J. E. (2014). Coming to terms with fear. Proceedings of the National Academy of Sciences, USA, 111, 2871–2878. doi: 10.1073/pnas.1400335111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewis D. E., O’Reilly M. J., Khuu S. K., Pearson J. (2013). Conditioning the mind’s eye: Associative learning with voluntary mental images. Clinical Psychological Science, 1, 390–400. doi: 10.1177/2167702613484716 [DOI] [Google Scholar]
- Lipp O. V. (2007). Human fear learning: Contemporary procedures and measurement. In Craske M. G., Hermans D., Vansteenwegen D. (Eds.), Fear and learning: From basic processes to clinical implications (pp. 37–52). Washington, DC: American Psychological Association. [Google Scholar]
- Lissek S., Powers A. S., McClure E. B., Phelps E. A., Woldehawariat G., Grillon C., Pine D. S. (2005). Classical fear conditioning in the anxiety disorders: A meta-analysis. Behaviour Research and Therapy, 43, 1391–1424. doi: 10.1016/j.brat.2004.10.007 [DOI] [PubMed] [Google Scholar]
- Masson M. E. J., Loftus G. R. (2003). Using confidence intervals for graphically based data interpretation. Canadian Journal of Experimental Psychology, 57, 203–220. doi: 10.1037/h0087426 [DOI] [PubMed] [Google Scholar]
- Mineka S., Davidson M., Cook M., Keir R. (1984). Observational conditioning of snake fear in rhesus monkeys. Journal of Abnormal Psychology, 93, 355–372. doi: 10.1037/0021-843X.93.4.355 [DOI] [PubMed] [Google Scholar]
- Mineka S., Oehlberg K. (2008). The relevance of recent developments in classical conditioning to understanding the etiology and maintenance of anxiety disorders. Acta Psychologica, 127, 567–580. doi: 10.1016/j.actpsy.2007.11.007 [DOI] [PubMed] [Google Scholar]
- Mitchell C. J., De Houwer J., Lovibond P. F. (2009). The propositional nature of human associative learning [Target article and commentaries]. Behavioral & Brain Sciences, 32, 183–246. doi: 10.1017/S0140525X09000855 [DOI] [PubMed] [Google Scholar]
- Morrison I., Lloyd D., Di Pellegrino G., Roberts N. (2004). Vicarious responses to pain in anterior cingulate cortex: Is empathy a multisensory issue? Cognitive, Affective & Behavioral Neuroscience, 4, 270–278. doi: 10.3758/CABN.4.2.270 [DOI] [PubMed] [Google Scholar]
- Mueller E. M., Panitz C., Hermann C., Pizzagalli D. A. (2014). Prefrontal oscillations during recall of conditioned and extinguished fear in humans. The Journal of Neuroscience, 34, 7059–7066. doi: 10.1523/JNEUROSCI.3427-13.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mueller E. M., Stemmler G., Hennig J., Wacker J. (2013). 5-HTTLPR and anxiety modulate brain-heart covariation. Psychophysiology, 50, 441–453. doi: 10.1111/psyp.12016 [DOI] [PubMed] [Google Scholar]
- Murray E. J., Foote F. (1979). The origins of fear of snakes. Behaviour Research and Therapy, 17, 489–493. doi: 10.1016/0005-7967(79)90065-2 [DOI] [PubMed] [Google Scholar]
- Notterman J. M., Schoenfeld W. N., Bersh P. J. (1952). Partial reinforcement and conditioned heart rate response in human subjects. Science, 115, 77–79. doi: 10.1126/science.115.2978.77 [DOI] [PubMed] [Google Scholar]
- Ogino Y., Nemoto H., Inui K., Saito S., Kakigi R., Goto F. (2007). Inner experience of pain: Imagination of pain while viewing images showing painful events forms subjective pain representation in human brain. Cerebral Cortex, 17, 1139–1146. doi: 10.1093/cercor/bhl023 [DOI] [PubMed] [Google Scholar]
- Olsson A., Phelps E. A. (2007). Social learning of fear. Nature Neuroscience, 10, 1095–1102. doi: 10.1038/nn1968 [DOI] [PubMed] [Google Scholar]
- Panitz C., Hermann C., Mueller E. M. (2015). Conditioned and extinguished fear modulate functional corticocardiac coupling in humans. Psychophysiology, 52, 1351–1360. doi: 10.1111/psyp.12498 [DOI] [PubMed] [Google Scholar]
- Panitz C., Sperl M. F. J., Hennig J., Klucken T., Hermann C., Mueller E. M. (2018). Fearfulness, neuroticism/anxiety, and COMT Val158Met in long-term fear conditioning and extinction. Neurobiology of Learning and Memory, 155, 7–20. doi: 10.1016/j.nlm.2018.06.001 [DOI] [PubMed] [Google Scholar]
- Pearson J., Naselaris T., Holmes E. A., Kosslyn S. M. (2015). Mental imagery: Functional mechanisms and clinical applications. Trends in Cognitive Sciences, 19, 590–602. doi: 10.1016/j.tics.2015.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rachman S. (1977). The conditioning theory of fear acquisition: A critical examination. Behaviour Research and Therapy, 15, 375–387. doi: 10.1016/0005-7967(77)90041-9 [DOI] [PubMed] [Google Scholar]
- Rizley R. C., Rescorla R. A. (1972). Associations in second-order conditioning and sensory preconditioning. Journal of Comparative and Physiological Psychology, 81, 1–11. doi: 10.1037/h0033333 [DOI] [PubMed] [Google Scholar]
- Sevenster D., Beckers T., Kindt M. (2013). Prediction error governs pharmacologically induced amnesia for learned fear. Science, 339, 830–833. doi: 10.1126/science.1231357 [DOI] [PubMed] [Google Scholar]
- Soeter M., Kindt M. (2010). Dissociating response systems: Erasing fear from memory. Neurobiology of Learning and Memory, 94, 30–41. doi: 10.1016/j.nlm.2010.03.004 [DOI] [PubMed] [Google Scholar]
- Soeter M., Kindt M. (2012). Erasing fear for an imagined threat event. Psychoneuroendocrinology, 37, 1769–1779. doi: 10.1016/j.psyneuen.2012.03.011 [DOI] [PubMed] [Google Scholar]
- Sperl M. F. J., Panitz C., Hermann C., Mueller E. M. (2016). A pragmatic comparison of noise burst and electric shock unconditioned stimuli for fear conditioning research with many trials. Psychophysiology, 53, 1352–1365. doi: 10.1111/psyp.12677 [DOI] [PubMed] [Google Scholar]
- Vrana S. R., Lang P. J. (1990). Fear imagery and the startle-probe reflex. Journal of Abnormal Psychology, 99, 189–197. doi: 10.1037/0021-843X.99.2.189 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplemental material, MuellerSupplementalMaterial_rev for Aversive Imagery Causes De Novo Fear Conditioning by Erik M. Mueller, Matthias F. J. Sperl and Christian Panitz in Psychological Science
Supplemental material, Mueller_OpenPracticesDisclosure_rev for Aversive Imagery Causes De Novo Fear Conditioning by Erik M. Mueller, Matthias F. J. Sperl and Christian Panitz in Psychological Science