Abstract
Conflict between response tendencies is ubiquitous in every day performance. Capabilities that resolve such conflicts are therefore mandatory for successful goal achievement. The present study investigates the potential of evaluative and motivational inner speech to help conflict resolution. In our study we assessed six tasks commonly used to measure conflict resolution capabilities and cognitive flexibility in 163 participants. Participants additionally answered questionnaires concerned with their habitual usage of inner speech such as silently rehearsing task instructions and evaluating performance. We found reduced conflict effects in tasks using symbolic, non-verbal stimuli for participants with higher self-reported use of evaluative and motivational inner speech. Overall, our findings suggest that silent self-talk and performance monitoring are beneficial for conflict resolution over and above constructs such as intelligence and working memory capacity that account for mean RT differences among participants.
Subject terms: Human behaviour, Working memory, Cognitive control, Personality, Attention
Introduction
In everyday tasks, we often encounter opposing response tendencies, which lead to the experience of conflict1–3. Mechanisms for conflict resolution and in consequence goal achievement are commonly subsumed under the umbrella term of cognitive control. Investigating the processes underlying cognitive control, frameworks incorporating processes such as inhibition of irrelevant information or switching of attention have been proposed (e.g., by Friedman and Miyake4). In another line of (developmental) research, Luria5 highlighted the role of language for successful cognitive control and goal achievement. Likewise, Vygotskij6 argued that language and especially the extended representational power it represents, accelerates cognitive development and allows for more effortful processing, for instance inhibiting irrelevant information. Recently, Gruber and Goschke7 emphasized the close link between language and cognitive control, assuming verbal representations underlying performance in tasks that require conflict resolution (i.e., conflict tasks). Yet so far, surprisingly little research has been done to investigate the impact of individual preferences for language use and its benefits for conflict tasks in healthy adult participants. Mostly this research has focused on paradigms in which participants were either instructed to use language to help performance8 or asked to perform a secondary articulation task. In these studies it was found that language either helped or interfered with the primary task requirements9–12. Habitual usage of inner speech as a personality trait has not been considered as a predictor for performance in conflict tasks, different to other fields such as athletic performance13.
Inner speech, i.e., the subjective experience of self-directed language without audible articulation14 is usually investigated in contexts of health psychology, related to self-regulation15 and psycho-social well-being16–18. Starting from the ideas of Luria and Vygotski, we predicted that habitually talking to oneself during conflict tasks is beneficial for conflict resolution. Recently, we investigated habitual inner speech in a Simon task. We found that habits related to motivational (i.e., “you can do that”) and evaluative (i.e, “well done”) inner speech predict the size of the Simon effect19. The Simon effect reflects the difference in performance for trials in which the task-irrelevant position of the stimulus on the screen mismatches with the to-be given response compared to trials in which stimulus-location and response-location match. Those participants who engage more in motivational and evaluative inner speech had smaller Simon effects compared to less engaged participants, suggesting better inhibition of irrelevant information. These results lend support to the hypothesis that task-related contents of inner speech, i.e. instructing oneself or evaluating task performance, help to maintain focus on task-relevant features and resolve response conflict. We assume that both, motivational aspects (i.e., instructing oneself silently which key to press) as well as evaluative aspects (i.e., monitoring accuracy of behaviour and commenting on it), are relevant for performance benefits and predict smaller conflict effects for participants relying on inner speech in paradigms commonly used to investigate conflict resolution capabilities.
For the present study, we opted for tasks that rely on endogenous regulation of control. In addition, we assessed two switching tasks. We chose this addition to investigate a wider range of presumed cognitive control processes4 that might be susceptible to inner speech habits. We reasoned that inner speech might not only help to resolve conflict but also foster cognitive flexibility assessed in the switching tasks. Next to the overall requirement of conflict resolution, the specific task characteristics (i.e., stimulus material) may modulate the usage of inner speech. Therefore, we assessed tasks using verbal stimulus material such as words and tasks that employed figural/symbolic stimulus material such as shapes or arrows only. Our predictions were that modulation of behavior due to inner speech processes should be less detectable in tasks using verbal material as most participants would engage in covert articulation to guide performance (i.e., rehearsing stimulus-response mappings). In contrast, in non-verbal conflict tasks, retrieving nonverbal symbolic representations does not necessarily require inner speech, making global or task-related motivational and evaluative inner speech helpful to maintain focus and effort in the ongoing task. Therefore, individual differences in influences of inner speech should be readily visible in those tasks.
The nonverbal tasks chosen comprised the Simon task20 that affords a reaction towards a stimulus feature such as shape while ignoring the stimulus position and the flanker task21 for arrows in which participants had to react to the direction of the central arrow and ignore the flanking arrows. The verbal tasks chosen comprised the Stroop task22 that required the identification of the ink color in which a color word was printed while ignoring its meaning and the flanker task21 for letters where participants had to react to the identity of the central letter and ignore the flanking letters. The task-switching tasks comprised a non-verbal and verbal version in which either shapes or words had to be classified.
In all conflict tasks the amount of conflict is assessed as the difference between congruent (i.e., flankers and targets match) and incongruent (flankers and targets mismatch) trials. The main dependent variable in the switching task is the switch cost, i.e., the difference between task repetition and task switch trials. We predicted reduced conflict effects and switch costs in participants relying more on inner speech. Those reductions should also be more obvious in tasks not employing verbal material. We also reasoned that not all types of inner speech behavior might be equivalently helpful (see for instance, detrimental effects of rumination behavior17). Therefore, motivational and evaluative inner speech should have the biggest influence as these are closely linked to required behavior (see also19).
Inner Speech was assessed in quantity and quality using three questionnaires (Varieties of Inner Speech Questionnaire [VISQ]15; Self Talk Scale [STS]16, and the Inner Speech Scale [ISS]23). Additionally, we measured general intelligence (IQ) and working memory capacity (WMC) as control variables as the size of conflict effects are bound to overall mean RTs24,25. Controlling for variables known to affect mean RT such as intelligence and working memory capacity therefore seemed prudent. To sum up, starting from ideas of close linkage between language use and cognitive control we hypothesized that participants habitually engaging in inner speech processes should show reduced conflict effects and smaller switch costs. This reduction should be especially evident in task using non-verbal stimulus material as the involvement of verbal recoding enhances task-underlying representations and render them susceptible for proactive control processes that reduce the impact of conflicting response tendencies26. This predictive value of inner speech habits should exist incrementally over and above differences in intelligence and working memory capacity.
Results
Factor structure of the inner speech questionnaires and correlations with big five, depression, working memory capacity and intelligence
We originally had planned a hierarchical measurement model with one latent variable accounting for the shared variance of all inner speech scales (see registration, https://osf.io/xeth7/), based on the idea of a higher-order component of inner speech frequency16. VISQ and STS subscales, however, were only weakly correlated (mean ), leading to an unacceptable fit of the preregistered one-factor measurement model (, ). Separate confirmatory factor analyses (CFAs) for the VISQ and STS showed acceptable fit for models with latent factors for each of the four assumed subscales and the corresponding items as manifest variables (VISQ, , ; STS , ), replicating the factor structures of the original versions. Single factor models did not yield acceptable fit (VISQ, , ; , ; ISS , ), precluding the use of total scores especially for the VISQ but also for the STS and the ISS. For the subsequent analyses we therefore used the subscale scores of the VISQ and STS.
There were no meaningful correlations of the inner speech scales with the Big 5 (mean for the VISQ subscales and for the STS subscales). There were small positive correlations of the inner speech scales with the ADS (mean for the VISQ subscales and for the STS subscales). In the sample of participants that took part in the lab study, none of the inner speech subscales correlated with WMC (mean for the VISQ subscales and mean for the STS subscales) or IQ (mean for the VISQ subscales and mean for the STS subscales). For full correlation matrizes between predictors and control variables see Tables S1–S2 in Supplementary materials.
Predicting task performance
The variances and covariances of the response times (RTs) in the four conflict and two task switching tasks were analyzed by Hierarchical Linear Modeling27 using HLM728. Level 1 modeled the within-subjects variability by predicting each single RT from a dummy variable for each task coding general RT and a task-specific contrast that modeled the RT difference between congruent and incongruent trials with in each task. The congruency contrasts were coded +0.5 for incongruent trials and −0.5 for congruent trials, implying that a positive coefficient indicates slower responses in incongruent compared to congruent trials (i.e., a Simon effect). Level 2 modeled between-subject variability of the task-specific parameters coding general RT and conflict. Split-half reliability of the individual participants’ task parameters and their correlations are depicted in Table 1.
Table 1.
Task parameter reliabilities (diagonal) and their correlations (above the diagonal).
| Task parameter | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| general RT | ||||||||||||
| 1. Simon | 0.94 | 0.71 | 0.76 | 0.67 | 0.79 | 0.66 | −0.33 | 0.38 | 0.26 | 0.69 | −0.08 | 0.17 |
| 2. Arrow flanker | 0.90 | 0.57 | 0.61 | 0.59 | 0.59 | −0.10 | 0.79 | 0.22 | 0.58 | 0.24 | 0.14 | |
| 3. Pictorial switching | 0.94 | 0.56 | 0.62 | 0.70 | −0.50 | 0.24 | 0.38 | 0.61 | 0.12 | 0.20 | ||
| 4. Stroop | 0.93 | 0.65 | 0.58 | 0.12 | 0.37 | 0.11 | 0.77 | 0.03 | 0.13 | |||
| 5. Letter flanker | 0.93 | 0.67 | −0.23 | 0.40 | 0.09 | 0.63 | 0.12 | 0.03 | ||||
| 6. Verbal switching | 0.91 | −0.48 | 0.39 | 0.19 | 0.49 | 0.41 | 0.35 | |||||
| conflict | ||||||||||||
| 7. Simon | 0.87 | 0.14 | −0.28 | 0.10 | −0.21 | 0.17 | ||||||
| 8. Arrow flanker | 0.74 | 0.03 | 0.53 | 0.18 | 0.03 | |||||||
| 9. Pictorial switching | 0.90 | 0.05 | −0.25 | 0.28 | ||||||||
| 10. Stroop | 0.73 | −0.04 | 0.14 | |||||||||
| 11. Letter flanker | −0.09 | 0.16 | ||||||||||
| 12. Verbal switching | 0.64 | |||||||||||
Note. N = 155. Critical values of |r| for p < 0.05 are rc = 0.13 (one-tailed) and rc = 0.16 (two-tailed).
We expected higher correlations of the conflict parameters within the nonverbal vs. verbal tasks than across these tasks, suggesting different amounts of reliance on inner speech. Whereas intercorrelations of the task parameters were high for general RT (mean ), they were very low for conflict (mean for nonverbal tasks, and for verbal tasks). The preregistered measurement model with two latent factors accounting for the shared variance of conflict in nonverbal tasks vs. verbal tasks and empirical Bayes estimates of the six task-specific conflict parameters as manifest variables had an unacceptable fit (, ). Fit was only marginally better for a one-factor model (, ) and a model with two latent factors accounting for the shared variance of conflict in inhibition (Simon, Stroop, flanker) vs. switching tasks (, ). These results led us to abandon the pre-registered measurement model and adopt a piecewise analysis strategy in line with recent research29–31 that observed similarly low intercorrelations yet good reliabilities.
For the piecewise analyses we included the particiants WMC, IQ as well as inner speech questionnaire scores in the level-2 model and simultaneously predicted the individual participants’ task-specific parameters for general RT and conflict for all six task. We ran two separate analyses for the subscale scores of the VISQ and STS. The coefficients for fixed effects of general RT and conflict for each task are depicted in Tables 2 and 3 (for error analysis see Tables S5–S6 in Supplementary material). Scores of all level-2 predictors were entered as standard scores (, ). Thus, the level-2 coefficients indicated the time in milliseconds participant’s general RT (i.e., the dummy variable for each task) or the RT increment for incongruent compared to congruent trials (i.e., the task-specific conflict parameter) changed if the WMC, IQ or questionnaire score increased by one standard deviation.
Table 2.
Coefficients (robust standard errors) for fixed effects of general RT and conflict for each task, with participants WMC, IQ as well as their VISQ subscale scores as simultaneous predictors in the Level-2 model.
| Task | Intercept | WMC | IQ | VISQ | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DLG | CND | OTP | E/M | ||||||||||||
| nonverbal | |||||||||||||||
| Simon | general RT | 544 | (7) | −11 | (8) | −28 | (10) | −1 | (8) | 13 | (7) | 7 | (9) | 4 | (9) |
| conflict | 6 | (3) | −1 | (5) | 0 | (3) | 2 | (2) | −2 | (3) | 4 | (3) | −6 | (3) | |
| flanker | general RT | 515 | (6) | −1 | (8) | −30 | (10) | 5 | (6) | 20 | (8) | 12 | (10) | −13 | (8) |
| conflict | 98 | (4) | −7 | (6) | −11 | (4) | 4 | (4) | 9 | (6) | 6 | (4) | −10 | (5) | |
| switching | general RT | 958 | (19) | −9 | (24) | −62 | (25) | 6 | (21) | 40 | (18) | 10 | (20) | 13 | (21) |
| conflict | 109 | (7) | 12 | (8) | −11 | (9) | 8 | (7) | 11 | (7) | 4 | (8) | 2 | (7) | |
| verbal | |||||||||||||||
| Stroop | general RT | 648 | (11) | −5 | (14) | −51 | (17) | −3 | (10) | 13 | (11) | 16 | (14) | −3 | (13) |
| conflict | 39 | (3) | −8 | (4) | −6 | (4) | −2 | (3) | 3 | (4) | 4 | (3) | −1 | (4) | |
| flanker | general RT | 622 | (9) | −22 | (12) | −20 | (11) | 3 | (9) | 3 | (10) | 5 | (10) | −1 | (10) |
| conflict | 26 | (2) | 3 | (3) | −3 | (3) | −2 | (2) | −1 | (2) | 3 | (2) | 0 | (2) | |
| switching | general RT | 1031 | (18) | −36 | (23) | −72 | (22) | 13 | (20) | 33 | (18) | 23 | (21) | 13 | (23) |
| conflict | 130 | (7) | 21 | (9) | −25 | (9) | 13 | (7) | 14 | (8) | 17 | (8) | 1 | (8) | |
Note. N = 144, approx. d.f. = 137. WMC Working Memory Capacity, IQ General Intelligence, VISQ: DLG = dialogic inner speech, i.e. talking to oneself going back and forth, CND = condensed inner speech, i.e., shortcutting sentences, using brief phrases, OTP = other people in inner speech, i.e., hearing other people talk in my head, E/M = evaluative and motivational inner speech, i.e. telling myself not to do things. Significant coefficients (p < 0.05) are printed bold.
Table 3.
Coefficients (robust standard errors) for fixed effects of general RT and conflict for each task, with participants WMC, IQ as well as their STS subscale scores as simultaneous predictors in the Level-2 model.
| Task | Intercept | WMC | IQ | STS | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SAS | SRE | SCR | SMA | ||||||||||||
| nonverbal | |||||||||||||||
| Simon | general RT | 544 | (7) | −15 | (8) | −26 | (10) | −9 | (10) | 5 | (10) | 17 | (11) | 2 | (11) |
| conflict | 6 | (3) | −1 | (5) | 0 | (3) | 3 | (3) | 7 | (3) | −7 | (3) | −3 | (4) | |
| flanker | general RT | 515 | (6) | −2 | (8) | −30 | (11) | 17 | (9) | −6 | (11) | −2 | (9) | −13 | (10) |
| conflict | 98 | (4) | −7 | (6) | −12 | (5) | 14 | (7) | −1 | (7) | 0 | (6) | −18 | (9) | |
| switching | general RT | 958 | (20) | −15 | (26) | −54 | (26) | −10 | (26) | 17 | (26) | 21 | (26) | −3 | (29) |
| conflict | 109 | (7) | 10 | (8) | −9 | (10) | −12 | (10) | 4 | (9) | 8 | (10) | 0 | (12) | |
| verbal | |||||||||||||||
| Stroop | general RT | 648 | (11) | −10 | (15) | −50 | (18) | −19 | (16) | 19 | (14) | 1 | (17) | 1 | (16) |
| conflict | 39 | (3) | −8 | (4) | −6 | (3) | −1 | (4) | −4 | (4) | −1 | (4) | 0 | (5) | |
| flanker | general RT | 622 | (9) | −24 | (12) | −16 | (11) | 1 | (12) | 9 | (11) | 3 | (13) | 9 | (14) |
| conflict | 26 | (2) | 3 | (3) | −2 | (3) | 6 | (3) | −3 | (3) | −4 | (3) | 1 | (3) | |
| switching | general RT | 1031 | (18) | −44 | (24) | −64 | (24) | −3 | (24) | 19 | (24) | 17 | (23) | −16 | (28) |
| conflict | 130 | (8) | 17 | (10) | −21 | (10) | 1 | (11) | 2 | (11) | 11 | (12) | −5 | (11) | |
Note. N = 144, approx. d.f. = 137. WMC Working Memory Capacity, IQ General Intelligence, STS: SAS = Social Assessment, SRE = Self-Reinforcement, SCR = Self-Criticism, SMA Self-Management. Significant coefficients (p < 0.05) are printed bold.
As expected, coefficients for conflict in all conflict tasks were positive, ranging from 6 ms (Simon) to 98 ms (arrow flanker), indicating that responses are slower for incongruent compared to congruent trials. We also observed large switch costs (109 ms and 130 ms for the non-verbal and the verbal switching task, respectively). Most interesting were of course the cross-level interactions. We predicted a contribution of WMC to task performance in all tasks. When entered as a level-2 predictor together with general intelligence, however, WMC did not predict reduced general RT or RT difference scores in any of the four conflict tasks and both switching tasks. For participants scoring high in IQ we predicted better performance in our verbal tasks. When entered as a level-2 predictor together with WMC, general intelligence predicted reduced general RT for the Simon, arrow flanker, Stroop and both switching tasks, as well as reduced conflict for arrow flanker and Stroop task.
For participants habitually engaging in inner speech processes we predicted reduced conflict effects and switch costs. Of the VISQ subscales (Table 2), evaluative and motivational inner speech predicted the conflict parameter in the Simon task as well as the arrow flanker task in the expected direction. For participants with a subscale score one standard deviation above the mean the Simon effect was 6 ms smaller and the arrow flanker was 10 ms smaller. Of the STS subscales (Table 3), Self-criticism predicted the conflict parameter in the Simon task whereas Self-Management predicted the conflict in the arrow flanker task in the expected direction. For participants with a subscale score one standard deviation above the mean (compared to an average score), the Simon effect was 7 ms smaller and the arrow flanker was 18 ms smaller. There were no effects of the other subscales of VISQ or STS on the conflict contrast or the switch costs. Parameter estimates for the inner speech scales were comparable when the ADS was added as an additional predictor at level-2.
Discussion
In our study, we were interested to what degree inner speech habits (i.e., self-instructions and evaluation) benefit cognitive control. To this end, we assessed performance in four commonly used conflict tasks using symbolic and verbal stimuli, as well as two task switching paradigms and inner speech habits using three questionnaires. Although we could replicate the factor structure within the respective questionnaires, we could not establish a common factor across questionnaires representing overall frequency of inner speech. This supports the view of content-specific inner speech serving separate functions, and we therefore examined the predictive value of individual subscales for our performance measures.
As for the questionnaires, we could also not establish a common cognitive control factor for the conflict tasks and the switching tasks. There was considerable variability in individual task performance across participants, and although congruency effects and switch costs varied in size and reliability across tasks, they were sizable and reliable enough for correlations to emerge, given the sample size. Yet, substantial positive correlations were found only for the arrow flanker and Stroop task and both switching tasks. The otherwise unrelated conflict parameters point to a diverse range of specific abilities necessary to solve response requirements in these tasks, making only two latent factors less likely to account for individual differences in cognitive control across our tasks32.
In consequence, we adopted piecewise analysis strategy and found that inner speech habits related to motivational and evaluative aspects of performance predict better performance in the Simon as well as the arrow flanker tasks, supporting our hypothesis that depending on context, task-related inner speech habits do have an implication over and above communication in that participants are less affected by response conflict and henceforth experience less interference and performance costs. Furthermore, we also predicted that those influences should be more visible in tasks employing non-verbal, symbolic stimuli such as shapes or arrows which do not foster verbal strategies because of stimulus material. This prediction was confirmed for two tasks (Simon and arrow flanker) in which we found influences of inner speech on. Unlike other studies that were mainly concerned with the both detrimental as well as supportive influence of instructed inner speech on concurrent actions8,9,12,33, our study discerns the benefits of inner speech habits as a language-related personality trait for conflict resolution.
Interestingly (and importantly), influences of motivational and evaluative inner speech persisted when we controlled for working memory capacity and intelligence, in line with our finding that inner speech habits were unrelated to and hence its benefits cannot be explained by greater working memory capacity or higher intelligence. We speculate that the questionnaire items assessing motivational and evaluative inner speech indeed capture more local processes of interference control or performance monitoring. These more local processes directly translate into actual performance, whereas global characteristics (i.e., processing speed as assessed as general RT level) are linked to overarching constructs such as intelligence and working memory capacity.
However, although interesting our results still suffer from limitations. Our first concern refers to the assessment of different contents of inner speech. In the questionnaires used, most of the items that offer some exemplification are rather broad and unspecific (i.e., “I talk through my plans”). The items within the evaluative and motivational inner speech subscale, however, are quite concrete and performance-related. This concreteness might have been necessary for the observed predictive value regarding task performance and may explain the absent effects for the other subscales. Second, out of the four conflict tasks and the switching tasks the influence of inner speech habits was restricted the Simon task as well as the arrow flanker task. Those tasks go along with strong response activation features for irrelevant responses in case of incongruent trials. Next, both tasks consist of only four stimulus-response (S–R) episodes (i.e., left and right pointing arrows as targets and flankers) that might allow for rapid automatization within a small number of trials. With a higher number of S–R epsiodes, recurrently retrieving the instructions and activating the phonological representation of the relevant stimulus features may render motivational inner speech and thereby individual differences less likely. Our third concern refers to the missing influence of working memory. To ensure instruction-compliant behavior, we displayed the relevant S–R mapping throughout the tasks, possibly reducing working memory load. This might have reduced variability among participants as working memory capacity was less relevant and therefore equalized it as predictor of performance.
To summarize, in our study being concerned with the influence of habitual use of inner speech on cognitive control, we found benefits of motivational and evaluative inner speech for performance in tasks allowing for fast automatization and focused attention such as the Simon task and the arrow flanker task, incremental to the predictive value of intelligence and working memory capacity. Future studies are necessary to examine whether more or less concrete contents of inner speech are helpful, over and above sequential self-instructions, and if the benefits extend to more complex tasks or tasks affording sustained attention.
Methods
The study was carried out in accordance with guidelines formulated in the Declaration of Helsinki. The Ethics Committee of the German Society for Psychology approved the study (MG 042014). Informed consent was secured from all participants.
Participants
502 participants were sampled from the participants’ pool of the Julius-Maximilians University of Würzburg. 32 participants had to be excluded due to suspiciously fast responding, false responses to two careless-responder catch items or because they opted for “do not use my data”. Data of the remaining sample (N = 470, 150 female, mean age = 24.6 years, SD = 4.73 years) were used to calculate the psychometric properties and the confirmatory factor analyses of the questionnaires. Of those 470 participants, 163 took part in the lab study and received course credit or €25. Questionnaire data of these participants were comparable to the full sample (all for equivalence tests34 with ). In eight participants, one of the ten experimental tasks crashed, yielding complete experimental data for 155 participants (38 male), with a mean age 24.5 years (SD 3.6 years). Due to experimenter error, IQ data were missing for 11 participants, reducing the sample size to 144 participants (36 male, mean age 24.5 years, SD 3.6 years) for analyses controlling for intelligence. All participants reported normal or corrected-to-normal vision.
Questionnaires
After signing up for the study, participants were invited to fill in the following questionnaires: For inner speech habits we assessed the Varieties of Inner Speech Questionnaire15 (VISQ, 18 items); the Self Talk Scale16 (STS, 16 items), and the Inner Speech Scale23 (ISS, 18 items). The VISQ assesses inner speech habits along four dimensions (dialogic inner speech, condensed inner speech, other people in inner speech, and evaluative and motivational inner speech), whereas the STS and ISS are instruments coming from health psychology backgrounds and are used commonly to assess inner speech habits more in style of coping behavior. Additionally, we assessed short forms of the Big Five Inventory35 (BFI-S, 15 items), the General Depression Scale36 (“Allgemeine Depressions Skala – Kurzform”, ADS-K, 15 items) and the Edinburgh Handedness Inventory37,38 (EHI-SH, four items). The assessment was performed online using SosciSurvey39 and took about 15 to 20 min.
Reliability analyses of the inner speech questionnaire data were based on McDonalds ω40 and were performed using the R package “mbess41”. The reliability values of the VISQ subscales were acceptable to good, considering the length of the scales (four to five items each): For the subscale dialogic inner speech, ω = 0.72 [CI 0.68–0.76], for the subscale condensed inner speech, ω = 0.79 [CI 0.76–0.82], for the subscale addressing the other people in inner speech, ω = 0.86 [CI 0.83–0.88] and for the evaluative and motivational inner speech subscale, ω = 0.71 [CI 0.65–0.75]. The reliability values of the STS subscales were acceptable to good, Social Assessment: ω = 0.81 [CI 0.78–0.84], Self-Reinforcement: ω = 0.82 [CI 0.79–0.85], Self-Criticism: ω = 0.76 [CI 0.72–0.79], Self-Management: ω = 0.75 [CI 0.70–0.78].
Measures of IQ and Working Memory Capacity
The short version of the Berliner Intelligenz Struktur Test42 (BIS) comprises 16 paper-pencil tests that can be aggregated to a measure of general intelligence (AI for “Allgemeine Intelligenz”). Raw scores were transformed to Standard scores using the norms given in the manual. The Working Memory Capacity Battery43 (WMC) consists of four tasks, an updating task (memory updating, MU), two span tasks (OS; sentence span, SS), and a spatial task (spatial short-term memory, SSTM). Due to a coding error in the two span tasks only the maintenance scores were saved but no data for the processing component (i.e., judging an equation as correct or incorrect). Mean percent correct values were computed for all four tasks. Since the scores were found to load on a single latent variable, we ran a principal component analysis of the mean percent correct values to obtain a measure of the shared variance of the tasks. The first two eigenvalues were 2.54 and 0.81; accounting for 63.4% and 20.2% of the variance. Thus, one factor was retained and its standardized factor score was used for the analyses.
Tasks
Tasks were programmed and run using Eprime 2.0.10.356 Professional (Psychology Software Tools, Pittsburgh, PA). Stimulus color (unless indicated otherwise) was white on darkgrey background. All tasks started with an instruction that comprised a brief description of task affordances followed by practice trials comprising all stimuli to be encountered in the task and their assigned responses. In case of open questions, the experimenter explained the task orally. All tasks and blocks within a task were started self-determined. The new tasks were announced before the instruction sheets were displayed. All tasks consisted of 32 practice trials and 192 test trials (three blocks of 64 trials for the switching tasks and two blocks of 96 test trials for all other tasks). All trials started with a fixation cross (Calibri 44 points) or in case of task switching with the cue for 500 ms and were followed by a feedback using a smiling or frowning face (2.54 cm in diameter) dependent on response accuracy for 250 ms followed by a blank for 500 ms. S–R mappings for the actual task at hand were displayed at the bottom of the screen. Different response keys were used for each task and no upper response time limit was given. After each block, participants were informed about their percentage of errors and invited to take a short rest. All participants performed the following tasks in two blocks of three tasks each: The non-verbal tasks block consisted of the Arrow Flanker, Simon and pictorial Task Switching. The verbal tasks block consisted of the Letter Flanker, Stroop and verbal Task Switching.
Non-verbal tasks block
In the arrow flanker task participants had to indicate the direction (left vs. right) of a central arrow that was flanked by two additional arrows to the right and the left (i.e., distractors). These distractors could either point in the same direction as the central arrow (i.e., congruent trials) or in the opposite direction (i.e., incongruent trials). The imperative stimulus consisting of five arrows (“>” or “<” in Calibri 44 points), presented centrally until response with keys “X” and “M” of a standard German QWERTZ-keyboard was given.
For the Simon task, participants had to indicate whether a given stimulus was a square or a diamond using the “A” or the “L” key. The imperative stimulus was presented 7 cm to the left or the right of the fixation cross. For the Simon task, congruent trials (i.e., location of the imperative stimulus and required response matched) were equally frequent as incongruent trials (i.e., location of the imperative stimulus and required response did not match). The imperative stimulus, a diamond or a square with 2.54 cm × 2.54 cm edge length was presented either left or right of the fixation cross and remained visible until response.
In the pictorial task switching task, participants had to classify shapes either as round or angular or whether they were symmetrical or not. Six shapes were chosen for this task and comprised an ellipse, a cloud, a lake-like shape, a drop as well as a square, a hexagon, a freely designed shape without meaning but no curves and an asymmetric star. All shapes covered an area of 4 × 5 cm on the screen and were presented centrally. The cues (3 × 3 cm) indicating which task had to be performed were a circle superimposed on a triangle for the round/angular task and a heart with vertical straight line for the symmetry task. Assigned response keys were “1” in case of a symmetrical or angular object and “9” for asymmetrical and round objects. A trial started with the presentation of the cue at the top for 500 ms followed the imperative stimulus. The cue, S–R mapping and stimulus remained visible until the response. Task switches, i.e. trials in which the current task was different from the task in the previous trial were equally frequent as task repetitions in which the task stayed the same across two consecutive trials.
Verbal tasks block
In the Stroop task participants were asked to indicate the color in which as color word was written. In incongruent trials, the print color did not match the meaning of the color word (i.e., the word “red” printed in blue), for congruent trials print color and word meaning matched (i.e,, “red” printed in red). Response keys were “Y”, “V”, “N” and “−” for the colors red, blue, green and yellow, respectively. Stimuli were colored color words (“rot”, “blau”, “grün” and “gelb” in German for red, blue, green and yellow) written in Calibri 44 points and presented centrally. Each participant encountered four congruent and four incongruent stimuli but all incongruent stimuli were presented counterbalanced over participants. Half of the trials participants encountered were congruent and the other half incongruent.
In the letter flanker task, participants were required to classify the central letter out of a string comprised of five letters as consonant or vowel. The two letters to the left or right of the central letter belonged either to the same category (i.e., “A” being flanked by “E”, congruent) or to the other category (i.e., “E” being flanked by “F”, incongruent). Participants responded to vowels by using the “K” key of the keyboard and to consonants by using the “D” key of the keyboard. Congruent and incongruent trials were equally frequent.
In the verbal task switching task participants were asked to judge a noun whether it was concrete or abstract or whether it went along with a female (f) or male (m) article in German. Stimuli words were love (f), fidelity (f), cloud (f), flower (f), pride (m), frustration (m), tree (m), and pot (m). All words were written in Calibri 44 points and presented centrally. In German, all words consisted of 4–5 letters. Cues were the words “Artikel” (article) for the male/female article categorization and “Nomen” (noun) for the concrete vs. abstract categorization. Participants used the keys “Q” for male and abstract nouns and the “P” key for female and concrete nouns. Procedure was similar to the non-verbal task switching task.
Reliability of task parameters
To estimate split-half reliability of the task parameters, multilevel models (cf. Results section) were run independently for each task with separate general RT and conflict parameters for each half of the respective task. Within all tasks, split-half reliabilities of the general RT parameters were excellent (all ). Split-half correlations of the conflict parameters were more varied. Reliabilities were acceptable to excellent for the nonverbal tasks (arrow flanker , Simon , pictorial switch ). Of the verbal tasks, the Stroop and switching task had barely acceptable reliabilities ( and , respectively), wheres for the letter flanker task both halfs were essentially uncorrelated ().
Procedure
Participants were tested in pairs on two desktop PCs, running Windows 7 connected to 19” flat screen monitors with a 1280 × 1024 pixel resolution and USB connected standard German QWERTZ keyboards. Order of tasks blocks and WMC battery was counterbalanced across participants, resulting in about 40 participants for the four conditions (verbal vs. nonverbal tasks block first; WMC battery before or after the tasks). The three Stroop mappings were evenly distributed across the four conditions. Overall, testing amounted to 3 hours.
Data Trimming
We removed all practice blocks as well as errors (4.0%) and trials following an error (3.9%). Furthermore, the first trial in every block was also removed as were all trials with reaction times shorter than or equal to 50 ms and trials with reaction times longer than 2500 ms (1.3%). The trimming procedure was identical across all analyses.
Supplementary information
Acknowledgements
This research was funded by a DFG Grant GA2105/2-1 to Miriam Gade. The Ethics Committee of the German Society for Psychology approved the study (MG 042014).
Author Contributions
This is a shared first authorship. Miriam Gade and Marko Paelecke contributed equally to the study, the design, programming, data analysis and writing of the manuscript.
Data Availability
The datasets generated during and/or analysed during the current study are available in the OSF repository, https://osf.io/9kx7y/?view_only=5c7a84693b384d4a87b71950338d9dd0.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary information accompanies this paper at 10.1038/s41598-019-45836-2.
References
- 1.Berlyne DE. Uncertainty and conflict: A point of contact between information-theory and behavior-theory concepts. Psychological Review. 1957;64:329–339. doi: 10.1037/h0041135. [DOI] [PubMed] [Google Scholar]
- 2.Botvinick MM, Braver TS, Barch DM, Carter CS, Cohen JD. Conflict monitoring and cognitive control. Psychological Review. 2001;108:624–652. doi: 10.1037/0033-295X.108.3.624. [DOI] [PubMed] [Google Scholar]
- 3.Botvinick MM, Cohen JD, Carter CS. Conflict monitoring and anterior cingulate cortex: An update. Trends in Cognitive Sciences. 2004;8:539–546. doi: 10.1016/j.tics.2004.10.003. [DOI] [PubMed] [Google Scholar]
- 4.Friedman NP, Miyake A. Unity and diversity of executive functions: Individual differences as a window on cognitive structure. Cortex. 2017;86:186–204. doi: 10.1016/j.cortex.2016.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Luria, A. R. The Role of Speech in the Regulation of Normal and Abnormal Behavior. (Liveright, 1961).
- 6.Vygotskij, L. S. Denken und Sprechen: Psychologische Untersuchungen. (Beltz, 1962).
- 7.Gruber O, Goschke T. Executive control emerging from dynamic interactions between brain systems mediating language, working memory and attentional processes. Acta Psychologica. 2004;115:105–121. doi: 10.1016/j.actpsy.2003.12.003. [DOI] [PubMed] [Google Scholar]
- 8.Kray J, Kipp KH, Karbach J. The development of selective inhibitory control: The influence of verbal labeling. Acta Psychologica. 2009;130:48–57. doi: 10.1016/j.actpsy.2008.10.006. [DOI] [PubMed] [Google Scholar]
- 9.Baddeley AD, Chincotta D, Adlam A. Working memory and the control of action: Evidence from task switching. Journal of Experimental Psychology: General. 2001;130:4. [PubMed] [Google Scholar]
- 10.Goschke, T. Intentional Reconfiguration and Involuntary Persistence in Task-Set Switching. In Control of cognitive processes (eds Monsell, S. & Driver, J.) 331–355 (MIT Press, 2000).
- 11.Mayr U, Kleffner-Canucci K, Kikumoto A, Redford MA. Control of task sequences: What is the role of language? Journal of Experimental Psychology: Learning, Memory, and Cognition. 2014;40:376–384. doi: 10.1037/a0035221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Miyake A, Emerson MJ, Padilla F, Ahn J-C. Inner speech as a retrieval aid for task goals: The effects of cue type and articulatory suppression in the random task cuing paradigm. Acta Psychologica. 2004;115:123–142. doi: 10.1016/j.actpsy.2003.12.004. [DOI] [PubMed] [Google Scholar]
- 13.Hardy J. Speaking clearly: A critical review of the self-talk literature. Psychology of Sport and Exercises. 2006;7:81–97. doi: 10.1016/j.psychsport.2005.04.002. [DOI] [Google Scholar]
- 14.Alderson-Day B, Fernyhough C. Inner speech: Development, cognitive functions, phenomenology, and neurobiology. Psychological Bulletin. 2015;141:5. doi: 10.1037/bul0000021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.McCarthy-Jones S, Fernyhough C. The varieties of inner speech: Links between quality of inner speech and psychopathological variables in a sample of young adults. Consciousness and Cognition. 2011;20:1586–1593. doi: 10.1016/j.concog.2011.08.005. [DOI] [PubMed] [Google Scholar]
- 16.Brinthaupt TM, Hein MB, Kramer TE. The Self-Talk Scale: Development, Factor Analysis, and Validation. Journal of Personality Assessment. 2009;91:1. doi: 10.1080/00223890802484498. [DOI] [PubMed] [Google Scholar]
- 17.Whitmer AJ, Banich MT. Inhibition Versus Switching Deficits in Different Forms of Rumination. Psychological Science. 2007;18:546–553. doi: 10.1111/j.1467-9280.2007.01936.x. [DOI] [PubMed] [Google Scholar]
- 18.Whitmer AJ, Gotlib IH. Switching and backward inhibition in major depressive disorder: The role of rumination. Journal of Abnormal Psychology. 2012;121:570–578. doi: 10.1037/a0027474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Gade, M., Paelecke, M. & Rey-Mermet, A. Simon Says - on the Influence of Stimulus Material, Stimulus Arrangement and Inner Speech Habit on the Simon Effect (Manuscript submitted for publication, 2018). [DOI] [PubMed]
- 20.Simon JR. Reactions toward the source of stimulation. Journal of Experimental Psychology. 1969;81:174–176. doi: 10.1037/h0027448. [DOI] [PubMed] [Google Scholar]
- 21.Eriksen BA, Eriksen CW. Effects of noise letters upon the identification of a target letter in a nonsearch task. Perception & Psychophysics. 1974;16:143–149. doi: 10.3758/BF03203267. [DOI] [Google Scholar]
- 22.Stroop JR. Studies of interference in serial verbal reactions. Journal of Experimental Psychology. 1935;18:643–662. doi: 10.1037/h0054651. [DOI] [Google Scholar]
- 23.Siegrist M. Inner speech as a cognitive process mediating self-consciousness and inhibiting self-deception. Psychological Reports. 1995;76:259–265. doi: 10.2466/pr0.1995.76.1.259. [DOI] [PubMed] [Google Scholar]
- 24.Hommel B. Interactions between stimulus-stimulus congruence and stimulus-response compatibility. Psychological Research. 1997;59:248–260. doi: 10.1007/BF00439302. [DOI] [Google Scholar]
- 25.Pratte MS, Rouder JN, Morey RD, Chuning F. Exploring the differences in distributional properties between Stroop and Simon effects using delta plots. Attention, Perception, & Psychophysics. 2013;72:2013–2025. doi: 10.3758/APP.72.7.2013. [DOI] [PubMed] [Google Scholar]
- 26.Braver, T. S., Gray, J. R. & Burgess, G. C. Explaining the many varieties of working memory variation: Dual mechanisms of cognitive control. In Variation in Working Memory (eds Conway, A., Jarrold, C., Kane, M. & Miyake, A.) 76–106 (Oxford University Press, 2007).
- 27.Raudenbush, S. W. & Bryk, A. S. Hierarchical Linear Models: Applications and Data Analysis Methods. (Sage, 2002).
- 28.Raudenbush, S. W., Bryk, A. S., Cheong, Y. F., Congdon, R. & Du Toit, M. Hierarchical linear and nonlinear modeling (HLM7). (Scientific Software International, 2011).
- 29.Pettigrew C, Martin RC. Cognitive declines in healthy aging: Evidence from multiple aspects of interference resolution. Psychology and Aging. 2014;29:187–204. doi: 10.1037/a0036085. [DOI] [PubMed] [Google Scholar]
- 30.Rey-Mermet A, Gade M, Oberauer K. Should we stop thinking about inhibition? Searching for individual and age differences in inhibition ability. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2018;44:501–526. doi: 10.1037/xlm0000450. [DOI] [PubMed] [Google Scholar]
- 31.Rey-Mermet, A., Gade, M., Souza, A. S., von Bastian, C. C. & Oberauer, K. Is executive control related to working memory capacity and fluid intelligence? Journal of Experimental Psychology: General (2019). [DOI] [PubMed]
- 32.Karr JE, et al. The unity and diversity of executive functions: A systematic review and re-analysis of latent variable studies. Psychological Bulletin. 2018;144:1147–1185. doi: 10.1037/bul0000160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Tullett AM, Inzlicht M. The voice of self-control: Blocking the inner voice increases impulsive responding. Acta Psychologica. 2010;135:252–256. doi: 10.1016/j.actpsy.2010.07.008. [DOI] [PubMed] [Google Scholar]
- 34.Lakens D. Equivalence Tests: A Practical Primer for t Tests, Correlations, and Meta-Analyses. Social Psychological and Personality Science. 2017;8:355–362. doi: 10.1177/1948550617697177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Schupp, J. & Gerlitz, J.-Y. BFI-S: Big Five lnventory-SOEP. In Zusammenstellung Sozialwissenschaftlicher Items und Skalen. ZIS Version (Vol. 12) (ed. Glöckner-Rist, A.) (GESIS, 2008).
- 36.Hautzinger, M., & Bailer, M. ADS-Allgemeine Depressionsskala. Beltz (2003).
- 37.White K, Ashton R. Handedness assessment inventory. Neuropsychologia. 1976;14:261–264. doi: 10.1016/0028-3932(76)90058-0. [DOI] [PubMed] [Google Scholar]
- 38.Veale JF. Edinburgh Handedness Inventory – Short Form: A revised version based on confirmatory factor analysis. Laterality: Asymmetries of Body, Brain and Cognition. 2013;19:2. doi: 10.1080/1357650X.2013.783045. [DOI] [PubMed] [Google Scholar]
- 39.Leiner, D. J. SoSci Survey (Version 2.6.00), https://www.soscisurvey.de (2016).
- 40.McDonald, R. P. Test Theory: A Unified Treatment. (Lawrence Erlbaum Associates Publishers, 1999).
- 41.Dunn TJ, Baguley T, Brunsden V. From alpha to omega: A practical solution to the pervasive problem of internal consistency estimation. British Journal of Psychology. 2014;105:399–412. doi: 10.1111/bjop.12046. [DOI] [PubMed] [Google Scholar]
- 42.Jäger, A. O., Süß, H.-M. & Beauducel, A. Berliner Intelligenzstruktur-Test: BIS-Test. (Hogrefe, 1997).
- 43.Lewandowsky S, Oberauer K, Yang L-X, Ecker UKH. A working memory test battery for MATLAB. Behavior Research Methods. 2010;42:571–585. doi: 10.3758/BRM.42.2.571. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated during and/or analysed during the current study are available in the OSF repository, https://osf.io/9kx7y/?view_only=5c7a84693b384d4a87b71950338d9dd0.
