Skip to main content
Journal of Speech, Language, and Hearing Research : JSLHR logoLink to Journal of Speech, Language, and Hearing Research : JSLHR
. 2026 Mar 10;69(4):1595–1630. doi: 10.1044/2025_JSLHR-25-00303

A Meta-Analytical Review of Executive Function Skills in Adults Who Stutter

Levi C Ofoe a,, Katerina Ntourou b, Shari Clifton c, Geoffrey A Coalson d
PMCID: PMC13086193  PMID: 41805429

Abstract

Purpose:

Executive function has been identified as a potential area of vulnerability in individuals who stutter. The present study identified and analyzed the data across empirical studies of the executive function skills of adults who do (AWS) and do not stutter (AWNS).

Method:

Electronic databases, literature reviews, and reference sections of articles and dissertations were searched to identify candidate studies that examined behavioral measures of working memory, inhibition, and/or cognitive flexibility. A total of 39 studies met the eligibility criteria for this meta-analysis. A random-effects model was applied to estimate the pooled effect sizes (Hedges's g) and 95% confidence intervals.

Results:

AWS were significantly less accurate than AWNS on measures of working memory (Hedges's g = −0.41, p < .001), including nonword repetition (Hedges's g = −0.57, p < .001), forward digit span (Hedges's g = −0.23, p = .02), backward digit span (Hedges's g = −0.38, p = .004), and operation span tasks (Hedges's g = −0.37, p = .018). AWS performed comparably to AWNS on inhibition measures (Hedges's g = −0.10, p = .37). An insufficient number of published studies were available to conduct a meaningful analysis of cognitive flexibility.

Conclusions:

Present findings suggest that AWS, as a group, exhibit weaknesses in one component of executive function—working memory—compared to AWNS. Additional research is necessary to determine potential differences in inhibition and cognitive flexibility in AWS.


Executive function is a comprehensive term used to describe the general-purpose cognitive processes that underlie goal-directed thoughts and actions. Executive function comprises three core components: working memory, inhibition, and cognitive flexibility (Diamond, 2013; Miyake et al., 2000). Executive function skills emerge during infancy and continue to develop into adolescence and early adulthood (Diamond, 1988; Ferguson et al., 2021). Executive function skills are associated with psychosocial development, mental health, and overall quality of life across the lifespan (Brown & Landgraf, 2010; Carlson, 2005; Garon et al., 2008; Gathercole et al., 2004; Miller et al., 2011). Over the past 2 decades, more than 35 individual studies have examined the executive function skills of children who do and do not stutter, yielding mixed results (Anderson & Ofoe, 2019). Based on a meta-analytic review, which has more power to statistically detect between-group differences and provides a more accurate summary and direction of effects when compared to individual studies, Ofoe et al. (2018) found that executive function skills in young children who stutter, on average, are less robust than those of their typically fluent peers in two main areas: (a) experimental/behavioral measures of phonological and verbal short-term memory and (b) parent report questionnaires/measures on inhibition. Findings from more recent studies suggest that the observed weaknesses in executive function skills in children who stutter may persist beyond the preschool-age years (e.g., Eichorn & Pirutinsky, 2021; Paphiti & Eggers, 2022).

According to the executive function model of developmental stuttering (Anderson & Ofoe, 2019), there are multiple connections between executive function and stuttering. One such link is that weaknesses in executive function skills may directly contribute to the development and/or maintenance of stuttering. In a reciprocal way, according to Anderson and Ofoe's hypotheses, the frequent breakdown in the fluency of language production over a period of time could also directly weaken executive function skills. An alternative trajectory, in an indirect manner, involves the intertwined relationship between the executive function skills and language processes, such that weaknesses in one may directly affect the other and subsequently affect the fluency of language production. Along this latter route, the connection between executive function and stuttering is through language processes (i.e., deficits in executive function skills may weaken language skills, which in turn may affect fluency skills). However, other researchers have also suggested potential links between more specific aspects of executive function and (dis)fluent speech production (i.e., working memory: Eichorn et al., 2016; inhibition: Neef et al., 2018; cognitive flexibility: Eichorn et al., 2018; Eichorn & Pirutinsky, 2021), albeit inconsistently (for a review, see Anderson & Ofoe, 2019). Theories regarding the role of executive function and the instances of stuttering are varied, including (but are not limited to) difficulty storing and/or identifying inaccurate phonological information in working memory (Bajaj, 2007; Eichorn et al., 2016), difficulty suppressing an incorrect speech plan before it occurs (e.g., Anderson & Wagovich, 2017), and difficulty revising motoric plans after an error is encountered (e.g., Eggers et al., 2013; Eichorn et al., 2018). Beyond speech production, executive function skills in adults have also been hypothesized to underlie potential cognitive and affective consequences of stuttering (e.g., Coalson et al., 2025; Usler, 2022).

Executive function and attention are closely related constructs. However, there has been considerable debate in the literature about the concept of, and/or the processes involved in, attention (Chun et al., 2011; Cowan et al., 2005; Kaldas, 2022). Some researchers have explained attention succinctly as “everybody knows what attention is” (James, 1890). In contrast, others have suggested that “no one knows, or can ever know, exactly what attention is” (Hommel et al., 2019, p. 2299), given its near-ubiquitous impact in any behavioral task. While not a core component of executive functions (Diamond, 2013), attention is generally considered the foundational, multifaceted cognitive construct that supports the operations of the executive function components (e.g., working memory, inhibition, and cognitive flexibility) when engaged in a purposeful behavior (Baddeley, 2002). According to some researchers, there are three subcomponents of attention: alerting (also known as sustained, vigilance, or alertness) attention, which is considered the fundamental form of attention and involves being ready to respond to a stimulus about to appear; orienting (also known as selection or scanning) attention, which is related to using a cue to locate a target stimulus; and executive (also known as focused, selective, supervisory, or conflict resolution) attention, which is involved in monitoring and resolving conflicts (Fan et al., 2002; Posner & Boies, 1971; Raz & Buhle, 2006).

Regardless of the different classifications and/or terminologies related to attention as an underlying cognitive construct, Doneva (2020) conducted a meta-analytical review of attentional skills in adults who do (AWS) and do not stutter (AWNS). The results showed that, when compared to AWNS, AWS were less effective and/or less efficient on behavioral attentional tasks that involved dual tasks (12 studies), selective (also known as focused or executive) attention (nine studies), and general attention (21 studies), suggesting that attention may play a role in the persistence of stuttering in adults. In light of these aforesaid results, the current study focuses on the primary components of executive function (i.e., memory, inhibition, and cognitive flexibility), while acknowledging that these constructs (i.e., attention and the core executive function components) are highly interrelated and not entirely independent.

To date, however, no meta-analytic study has been conducted on the differences in executive function skills between AWS and AWNS. Similar to children who stutter, AWS performance on behavioral measures of executive function relative to AWNS is mixed. It is possible that some cross-study discrepancies in the adult stuttering literature are due to methodological inconsistencies (e.g., task selection, sample size, outcome measures). In light of this, the goal of the current study was to systematically identify, synthesize, and analyze the quantitative data across individual studies that have examined the executive function skills of AWS to assess potential between-group differences, if present, relative to AWNS. To provide a contextual framework for the study, we will first discuss the three core components of executive function—working memory, inhibition, and cognitive flexibility—and then briefly review the existing literature on executive function skills in AWS.

Working Memory

The first core executive function skill is working memory. First proposed as a concept by Miller et al. (1960), working memory refers to a multicomponent, capacity-limited system responsible for temporarily storing (i.e., short-term memory) and manipulating information for complex cognitive purposes (i.e., working memory; Baddeley, 2000; Baddeley & Hitch, 1974; Diamond, 2013). As a resource-limited system, its short-term memory component refers to the cognitive capacity for holding information in mind for a brief period (Cowan, 2008), whereas the working memory component involves manipulating and updating the current information in mind (Baddeley, 2012). While there are at least 10 different models of working memory (see Cowan, 2017; Miyake & Shah, 1999), Baddeley and Hitch's (1974) original model has tended to receive the most focus (D'Esposito & Postle, 2015). Baddeley and Hitch's model consists of four distinct but interrelated components: (a) the phonological loop (or a domain-specific verbal short-term memory), (b) the visuospatial sketchpad (or domain-specific visual–spatial short-term memory), (c) the episodic buffer (a system that integrates the verbal and visuospatial information), and (d) the central executive (the attentional command center over processing, manipulation, and recall of verbal and visuospatial information; cf. Baddeley & Hitch, 1994). Thus, Baddeley and Hitch's model of working memory was selected for our meta-analytical review, given that it is a modular framework compared to the other models (Cowan, 2008; Engle, 2002; Oberauer, 2019).

One component of Baddeley and Hitch's working memory model, the phonological loop, consists of a store, which holds information for a limited amount of time (typically about 2 s), and an embedded rehearsal mechanism, which refreshes the information in the store to prevent decay (Baddeley, 2004; Baddeley et al., 1998). Researchers have examined the phonological store of short-term memory using a variety of simple span tasks. One such task used to measure short-term memory is the nonword repetition task (Estes et al., 2007; Gupta, 2003), which involves participants hearing and accurately repeating a string of nonlexical “words” of increasing length and complexity, typically from one syllable (e.g., jup) to seven syllables or more (e.g., dookershatupietazawm; Wagner et al., 2013). Another type of span task is the forward digit span task, which is similar to nonword repetition but requires participants to repeat progressively longer series of digits in the correct order immediately after presentation (e.g., Dempster, 1981; Engle et al., 1999; Wechsler, 2009). For these tasks, participants are required to store the phonological information in short-term memory and, as needed, rehearse it verbatim to ensure accurate responses.

Unlike short-term memory, which does not require the manipulation of information, working memory requires participants to actively process the information. Working memory can be assessed with more complex span tasks, as they require active manipulation of the contents. For example, backward digit span tasks require participants to accurately repeat a series of digits in the reverse order. Another type of working memory task is the operation span task (OSpan; Unsworth et al., 2005), which requires participants to complete an additional, unrelated intervening task between each to-be-remembered item. During the OSpan task, participants must retain and verbally recall a series of letters while solving interspersed math equations (e.g., participant sees a letter, then solves a math question, then sees another letter, then solves another math question, and so on). At the end of a two- to seven-item trial set, for example, participants are asked to recall the letters aloud in the order of occurrence. The dual nature of complex span tasks, in comparison to simple span tasks, provides a distinction between storage and rehearsal in working memory and the central executive processing required for attentional control.

Inhibition

The second core executive function skill is inhibition—the ability to intentionally resist the tendency to execute a dominant or automatic response (Miyake et al., 2000; see also Banich & Depue, 2015; Hall & Fong, 2015). Researchers have classified inhibition based on different theoretical taxonomies or empirical functions (e.g., Friedman & Miyake, 2004; Howard et al., 2014; Nigg, 2000). Nigg (2000) classified inhibition (also termed executive inhibition) into four distinct conceptual subcategories: (a) interference control, (b) behavioral/motor inhibition, (c) cognitive inhibition, and (d) oculomotor inhibition. Some of these subcategories of inhibition have been theorized to influence stuttered speech at different stages, particularly interference control (e.g., the speaker has difficulty disengaging with inaccurate speech motor plans) and behavioral/motor inhibition (e.g., the speaker has difficulty suppressing an inaccurate speech plan).

According to Nigg's classification, interference control refers to the ability to maintain one's response in the presence of a competing, distracting, or interfering stimulus (Dempster, 1993).1 There are conceptually different tasks that involve interference control. For example, the Eriksen flanker task (Eriksen & Eriksen, 1974) requires participants to press a left/right arrow button in response to a centered targeted stimulus (arrow, >) flanked by other nontarget stimuli (arrows; < >) in the same direction (congruent condition; >>> > >>>) or opposite direction (incongruent condition; <<< > < <<). Response time is the difference in the mean response time between the congruent and incongruent trials (Eriksen, 1995), and response latency is thought to reflect the participants' ability to resist irrelevant stimuli to achieve a specific goal.

Behavioral/motor inhibition refers to a conscious suppression of a dominant behavioral response or movement during a task, such as the Go-NoGo task or stop-signal tasks (SSTs).2 During an SST, participants press a corresponding button (e.g., the right button for X and the left button for O) as fast as possible but withhold response when an auditory/visual signal follows the target (Logan, 1994; Verbruggen & Logan, 2008). The participants' stop-signal reaction time (SSRT) is thought to reflect the ability to suppress a dominant or automatic response after it has been initiated.

Cognitive inhibition refers to the ability to effortfully resist previously useful information, thoughts, or directives from intruding into working memory (Dempster, 1993; Harnishfeger, 1995). For example, during a cued recall task (Tehan & Humphreys, 1995; Tolan & Tehan, 1999), participants are required to quickly and accurately retrieve a target word from working memory from the most recent block of stimuli while ignoring phonologically or semantically related information from preceding trials. Oculomotor inhibition refers to the active suppression of an oculomotor response to gaze toward visual cues flashing in a different location within their visual field (Everling & Fischer, 1998; Hallett, 1978; Hutton & Ettinger, 2006; Magnusdottir et al., 2019).

Cognitive Flexibility

Cognitive flexibility, the third core executive function component, refers to the ability to adapt or switch one's responses based on updated rules, strategies, and perspectives (Miyake et al., 2000). Cognitive flexibility typically emerges gradually during the preschool years between 3 and 5 years old (Cragg & Chevalier, 2012; Doebel & Zelazo, 2015; Ferguson et al., 2021), and its development is considered to be contingent on the development of working memory and inhibition skills (Davidson et al., 2006; Garon et al., 2008). Common behavioral measures of cognitive flexibility are the Trail Making Test (Bowie & Harvey, 2006), in which participants connect a sequence of numbers in ascending order (Part A) and subsequently connect alternating numbers and letters (Part B), and the Wisconsin Card Sorting Task (Grant & Berg, 1948), wherein, without the examiner's initial instructions, participants sort cards based on a predetermined set of rules (e.g., sort by color) and then switch to a new rule (e.g., sort by shape). Participants use the examiner's feedback (i.e., correct or incorrect) on each trial to inform their performance on the next trial. The type and frequency of errors are thought to reflect participants' ability to adapt or modify their responses after new information is presented and/or errors have occurred.

Cognitive flexibility as a subcomponent of executive function is distinct from, yet interrelated with, working memory and inhibition. In the literature, various terms have been used to refer to cognitive flexibility, including set shifting, task shifting/switching, cognitive adaptability, and mental/representational flexibility (Ionescu, 2012). Whereas working memory can be viewed as the “mental workspace” (i.e., activates, maintains, and processes the information necessary for a goal-directed behavior; e.g., maintaining the integrity of a speech plan prior to production) and inhibition can be conceptualized as the “mental brake” (i.e., stop or cancel a failed and/or irrelevant goal-directed behavior; e.g., inhibiting moments of stuttered speech prior to or during production), cognitive flexibility can be conceptualized as the “mental pivot”—that is, the ability to quickly revise an original goal in response to changes in an external or internal environment (e.g., pivoting from motoric behaviors inherent to stuttered speech to motoric behaviors associated with fluent speech production). In this way, cognitive flexibility relies on working memory (e.g., the storage and processing of stimulus–response patterns) and inhibition (e.g., the suppression of the original information or resultant behavior; Garon et al., 2008). That said, cognitive flexibility has been linked to numerous aspects of mental health (e.g., emotional regulation [Baas et al., 2008; Kashdan & Rottenberg, 2010], resilience [Nakhostin-Khayyat et al., 2024], anxiety/depression mitigation [Huang et al., 2024; Johnco et al., 2013]), academic and cognitive performance (e.g., problem solving [Hopper et al., 2020], literacy [Johann et al., 2020; Magalhães et al., 2020], divergent thinking [Baas et al., 2008]), and social functioning (e.g., empathy [Yan et al., 2020], domestic and marital well-being [Howlett et al., 2022; Koesten et al., 2009]).

Executive Function Skills in Adults Who Stutter

Speech-language planning and production is a complex and dynamic task that involves the efficient, effective, and timely coordination and monitoring of multiple interacting processes (e.g., lexical retrieval, morphological encoding, phonological encoding, monitoring; Dell et al., 2014; Levelt, 1989; Levelt et al., 1999; Nozari & Novick, 2017). Considering the dynamic and attention-demanding nature of speech-language production, fluent speech production seems to draw upon speakers' domain-general executive function skills (e.g., Berninger et al., 2017; Shao et al., 2012; Sikora et al., 2016; Ye & Zhou, 2009). Furthermore, successful conversational interactions depend on the communication partners' ability to inhibit or suppress internal competing linguistic messages or thoughts (e.g., “I wonder what they think of me when I stutter”) and/or external distractions (e.g., listener reactions, time pressure). For AWS, weaknesses in executive function skills may contribute to an increased cognitive load during speech planning, resulting in greater difficulty sustaining fluent speech.

Furthermore, as previously mentioned, executive function skills are closely associated with psychosocial functioning and quality of life. In the field of stuttering, researchers have reported that AWS often experience psychosocial stressors, reduced socioemotional functioning, and reduced quality of life due to their stuttering (e.g., Blumgart et al., 2010; Boyle, 2015; Craig et al., 2009; Craig & Tran, 2006). Therefore, given the relationship between executive function skills, psychosocial functioning, and quality of life in adults, it is important to examine the executive function skills of AWS. In the stuttering literature, some, but not all, studies conducted with AWS and AWNS suggest that AWS may have less robust executive function skills when compared to AWNS.

Working Memory

In the case of working memory, there is evidence that AWS are less accurate during tasks of nonword repetition compared to AWNS (e.g., Byrd et al., 2012; Coalson & Byrd, 2017), suggesting difficulties in phonological working memory; however, other studies have reported no between-talker group differences using similar tasks (e.g., Brown, 2015; Sasisekaran, 2013; Smith et al., 2010). For example, Byrd et al. (2012) examined short-term verbal memory in 14 AWS and 14 AWNS using a modified version of Dollaghan and Campbell's (1998) nonword repetition task at four- and seven-syllable lengths but found between-group differences only at the seven-syllable length level, suggesting that AWS may exhibit weaknesses in phonological working memory. On the other hand, Coalson and Byrd (2017) found between-group differences (26 AWS, 26 AWNS) in nonword recall accuracy for stimuli as short as two syllables in length.

Inhibition

A similar pattern is reflected in findings across studies that examined inhibition, in which some studies have reported that AWS have weaknesses in behavioral inhibition when compared to AWNS (e.g., Markett et al., 2016; Treleaven & Coalson, 2020), while others have failed to find any significant between-group differences (Bakhtiar & Eggers, 2023). Yet still, others also failed to find any differences in oculomotor inhibition between the two groups (e.g., Gkalitsiou et al., 2020). To illustrate, Markett et al. (2016) examined behavioral inhibition between 28 AWS and 28 AWNS using a manual-response SST and found statistically significant differences between the two groups in response time, but not in accuracy, suggesting that AWS may exhibit weaknesses in behavioral inhibition. Conversely, Bakhtiar and Eggers (2023) did not find any significant differences between 13 AWS and 14 AWNS in the accuracy or reaction time on a verbal SST. Similarly, Gkalitsiou et al. (2020) did not observe significant differences between 17 AWS and 17 AWNS in the accuracy or reaction time on an antisaccade task, indicating comparable inhibition skills between the two groups.

Cognitive Flexibility

Unlike memory and inhibition, fewer studies have focused on cognitive flexibility in AWS (e.g., Doneva et al., 2018; Gkalitsiou, 2018; Wingate, 1966). Markett et al. (2016) examined the cognitive flexibility skills of 28 AWS and 28 AWNS using the category switch task (Mayr & Kliegl, 2000). However, the authors found no significant differences in the accuracy (p = .14) or reaction time (p = .48) between the two groups. On the other hand, Wingate (1966) examined 12 AWS and 12 AWNS using the Test of Behavioral Rigidity (Schaie, 1955) and found results to suggest that the AWS are less flexible when compared to the AWNS.

One possible reason for the discrepancies across individual studies may be variations in task selection and sample size, which can make it more challenging to detect between-group differences. However, by synthesizing and analyzing conceptually similar tasks across individual studies, we can increase the sample size and increase the likelihood of detecting between-group differences, if they exist. It is possible that between-group differences in executive function skills exist, but to date, individual studies may have been insufficiently powered to yield consistent patterns in findings. It is also equally possible that AWS exhibit no significant differences in executive function skills, or specific subdomains of executive function, compared to AWNS. Thus, the purpose of the present study was to consolidate and summarize the extant quantitative data across conceptually similar tasks in the literature on executive function skills in AWS and AWNS, thereby enhancing the power to detect subtle differences between the two groups to clarify the role of executive function in AWS. More specifically, the quantitative data from empirical studies of working memory, inhibition, and cognitive flexibility between AWS and AWNS will be identified using a search strategy, and the quantitative data related to the conceptually related tasks will be extracted, synthesized, and analyzed. Based on the findings from the Ofoe et al. (2018) study and the evidence that executive function skills continue to develop into adulthood (Ferguson et al., 2021), it is expected that, as a group, AWS will exhibit weaknesses in executive function skills compared to AWNS. Specifically, the following three research questions were examined:

  1. Do AWS differ from AWNS on behavioral measures of working memory?

  2. Do AWS differ from AWNS on behavioral measures of inhibition?

  3. Do AWS differ from AWNS on behavioral measures of cognitive flexibility?

Method

Inclusion Criteria

This study was not subject to an institutional review board approval process. To be included in the meta-analysis, a study had to (a) include adults at least 18 years old who do and do not stutter, with the two talker groups not being matched or equated on one or more measures of executive function (talker group classification criteria were free to vary across studies; see Appendix A for summary of group classification criteria); (b) report methods and results in English; and (c) provide quantitative information (e.g., mean and standard deviation) amenable to effect size calculation for at least one core component of executive function (e.g., working memory, cognitive flexibility, inhibition). Studies that included working memory tasks as secondary tasks to a primary monitoring task were excluded (e.g., phoneme monitoring tasks). In cases where two or more studies reported on overlapping samples (e.g., Eichorn et al., 2016, 2023), only the study with the bigger sample size (e.g., Eichorn et al., 2016) and/or the one with more measures/outcomes was included, to ensure independence of effect sizes. To mitigate publication bias, increase the sample size to improve statistical power, and strengthen the generalizability of the findings, both published (e.g., peer-reviewed journal articles) and unpublished (e.g., theses, dissertations) studies were included.

Data Sources and Study Selection

The literature search was conducted in accordance with the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines by the third author, a librarian with experience in conducting systematic reviews, in consultation with the first two authors, primarily using the MEDLINE database searched via the Ovid interface and the ProQuest Dissertations & Theses Global database. A preliminary MEDLINE search was completed in February 2022, and based on team discussions, further iterations of the searches were conducted in May 2022 and a year later, in June 2023, to capture any newly published studies prior to finalizing the review. Final search strategies reflecting controlled vocabulary terms, keywords, and the search functionality employed (e.g., truncation, field-specific searching, proximity operators) are provided in Appendix B.

The MEDLINE search strategy combined controlled vocabulary (e.g., “Stuttering/”, “Executive Function/”) with keyword terms and Boolean logic (e.g., “stutter$.mp.” OR “stammer$.mp.”, and “exec$ function$” OR “memor$ adj2 working$”) to capture a broad range of relevant literature on stuttering and executive functioning. Search functionalities such as truncation (e.g., stutter$, inhibit$), proximity operators (e.g., adj2), and field-specific searching were employed. A detailed account of the complete search strategy is available in Appendix B to ensure transparency and reproducibility.

To ensure comprehensive coverage of dissertations and theses, additional searches were completed in May 2022 by the first two authors using the following sources: Networked Digital Library of Theses and Dissertations (https://ndltd.org/), Open Access Theses and Dissertations (https://oatd.org/), and the Louisiana State University Scholarly Repository (https://repository.lsu.edu/about.html). Limits to English or other languages, publication date, or publication type were not utilized in any of the searches completed for this project.

All identified studies were initially screened based on their title and abstract in Covidence, a web-based program for systematically managing studies in a meta-analytic review. The first and second authors screened 58% and 42% of the identified studies, respectively, using the outlined criteria. For reliability purposes, each author also screened a subset (20% of studies) of the other rater's studies. The interrater agreement was excellent (κ = .91), and any discrepancies were discussed between the two raters until consensus was reached. Full texts of all studies that passed the first level of screening (N = 98) were then retrieved and were independently reviewed for final inclusion by the first two authors and the fourth author. All cases of disagreement in eligibility assessment were resolved through a detailed discussion between all three raters.

Data Management and Extraction

A summary of the number of studies excluded during the review process is provided in Figure 1. A total of 39 studies qualified for inclusion. Descriptive data (e.g., age range of participants, construct[s] measured, task description) and quantitative data (e.g., sample sizes, means, standard deviations, statistical tests) were extracted from each of the 39 studies that met inclusion criteria, and they were manually coded into a Microsoft Excel spreadsheet. The descriptive data for all included studies can be found in Table 1.

Figure 1.

A PRISMA flowchart for shortlisting studies for a review. The flowchart has identification, screening, and included phases. The steps are as follows. 1. Identification. Step 1. Studies identified, n equals 953. Duplicates removed, n equals 394. 2. Screening. Step 2. Studies screened, n equals 559. Studies excluded, n equals 436. Step 3. Studies assessed for eligibility, n equals 123. Studies excluded, n equals 84. 3. Included. Step 4. Studies included in review, n equals 39. Out of the 39 studies shortlisted for the review, n equals 26 are concerned with Working memory, n equals 10 are concerned with Inhibitory control, and n equals 3 are concerned with Working memory and inhibitory control.

Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) flow diagram for systematic review.

Table 1.

Descriptive information of studies that examined executive function between adults who do (AWS) and do not stutter (AWNS).

Author(s) N Gender (M/F) Age (years), M (SD) Construct Task Task, M (SD) Effect sizes
AWS AWNS AWS AWNS AWS AWNS AWS AWNS
Baird (1996) a 22 22 22/0 22/0 28.2 (n/a) 28.2 (n/a) WM FDSb-W 7.18 (1.53) 7.95 (1.21) −0.55
WM BDSb-W 5.68 (1.46) 5.77 (1.31) −0.06
Bakhtiar & Eggers (2023) c 13 14 9/4 9/5 31.0 (13.29) 28.18 (11.49) BI SST 358 (91) 311 (86) −0.23
Bauerly (2012) a 11 12 11/0 12/0 33.4 (6.4) 32.20 (5.2) WM LNS 14.82 (2.60) 13.42 (2.50) 0.53
Brocklehurst & Corley (2011) c 32 32 8/24 9/23 38 39 WM FDS 6.6 (n/a) 7.2 (n/a) −0.41
B. T. Brown (2015) a 15 15 11/4 12/3 27.1 (8.15) 26.01 (3.45) WM NRTd 28.40 (8.56) 32.80 (2.04) −0.69
Byrd et al. (2015) c 10 10 9/1 9/1 31 (9.31) 31.2 (7.69) WM NWRe-4sy n/af n/af −0.32
NWRe-7sy n/af n/af −1.03
Byrd et al. (2017) c 15 15 12/3 12/3 28.3 (8.2) 28.6 (8.2) WM NWR 8.00 (2.00) 9.00 (2.00) −0.49
Coalson & Byrd (2015) c 11 11 7/4 7/4 22.27 (6.45) 20.36 (2.62) WM NWR 12.36 (1.96) 12.80 (2.04) −0.21
WM FDSg 10.82 (2.27) 11.00 (1.90) −0.08
WM BDSg 6.55 (2.16) 8.18 (2.32) −0.70
Coalson & Byrd (2015) c 11 11 7/4 7/4 22.27 (2.97) 23.45 (4.97) WM NWR 12.27 (1.74) 12.18 (1.47) 0.05
WM FDSg 11.09 (2.26) 8.82 (1.25) 1.20
WM BDSg 6.82 (1.47) 5.82 (1.72) 0.60
Coalson et al. (2017) c 10 10 9/1 3/7 28.6 (12.16) 20.9 (0.88) WM NWR 18.2 (2.20) 19.10 (1.52) −0.46
WM FDS 19.70 (2.75) 20.80 (4.42) −0.29
WM BDS 15.4 (3.50) 15.5 (2.76) −0.03
Eichorn et al. (2016) c 19 20 12/7 11/9 27.21 (4.18) 25.60 (4.58) WM OSpan 42.56 (17.39) 45.85 (16.16) −0.19
WM SSpan 19.61 (12.52) 20.65 (7.04) −0.10
Eichorn et al. (2023) c 18 20 11/7 11/9 27.1 (4.3) 25.6 (4.6) WM OSpan-3 7.83 (2.09) 8.25 (1.65) −0.22
OSpan-4 9.56 (3.91) 10.80 (3.21) −0.34
OSpan-5 11.39 (4.13) 11.00 (4.47) 0.09
OSpan-6 8.33 (5.87) 10.20 (6.49) −0.29
OSpan-7 5.44 (7.80) 5.60 (6.26) −0.02
WM SSpan-2 4.56 (1.65) 5.3 (1.17) −0.51
SSpan-3 5.33 (3.01) 6.60 (2.68) −0.44
SSpan-4 5.56 (4.78) 6.00 (3.78) −0.10
SSpan-5 4.17 (4.93) 2.75 (3.02) 0.34
Elsherif et al. (2021) c 30 84 22/8 7/77 29.47 (8.91) 20.18 (1.03) WM MFD 15.33 (3.08) 16.65 (2.14) −0.54
NWR 9.90 (2.90) 13.68 (1.73) −1.79
Garnett et al. (2023) c 18 18 12/6 12/6 29.4 (11.4) 25.3 (6.9) WM OSpan 34.50 (17.90) 43.20 (16.80) −0.49
Gkalitsiou (2018) a 15 15 11/4 11/4 23.27 (5.68) 23.47 (6.21) WM NIH-WM 98.60 (10.97) 102.00 (12.10) −0.29
WM N-back-1-auditory 618.33 (19.02) 608.67 (19.00) −0.49
N-back-1-visual 466.29 (23.17) 460.20 (23.16) −0.26
N-back-2-auditory 790.61 (19.24) 736.84 (19.20) −2.72
N-back-2-visual 614.60 (23.28) 579.23 (23.25) −1.48
N-back-3-auditory 841.39 (19.66) 825.42 (19.59) −0.79
N-back-3-visual 656.92 (23.48) 639.01 (23.48) −0.74
IC FICT 108.73 (14.71) 101.47 (11.54) 0.53
Gkalitsiou et al. (2020) c 17 17 14/3 14/3 23.41 (4.0) 23.29 (3.88) IC OMT 262.36 (6.82) 274.18 (6.82) 0.41
Haj-Tas (2007) a 14 14 10/4 10/4 32.40 (14.05) 32.20 (12.76) WM FDS 9.50 (3.28) 10.93 (2.56) −0.47
WM BDS 7.29 (2.92) 9.07 (2.87) −0.60
Höbler et al. (2023) c 18 18 14/4 13/5 32.56 (7.28) 31.78 (6.02) WM FDSh 7.39 (1.46) 7.83 (1.10) −0.33
WM BDSh 5.06 (1.16) 6.33 (1.37) −0.98
Jackson (2012) a 5 10 3/2 0/10 24.65 (6.48) 21.48 (1.79) WM MFD 12.6 (2.30) 10.8 (1.99) 0.81
WM NWR 8.80 (2.95) 8.70 (2.50) 0.04
Lescht (2017) c 15 15 9/6 9/6 30.87 (3.26) 30.67 (3.21) WM FDSi 8.27 (1.16) 8.67 (1.18) −0.33
FDSb 8.60 (1.24) 9.13 (1.30) −0.41
Markett et al. (2016) c 28 28 22/6 22/6 28.54 (9.63) 31.39 (10.87) BI SST t = 2.23 p = .03 −0.59
Markett et al. (2016) c 28 28 20/8 20/8 27.57 (9.13) 27.21 (8.69) BI SST t = 2.25 p = .03 −0.60
Maxfield (2017)b,c 16 16 12/4 12/4 25.8 (n/a) 24.4 (n/a) BI GNG n/aj n/aj −0.22
McGill et al. (2016) a 22 22 18/4 18/4 27.95 (7.61) 27.86 (6.99) WM NWR 9.32 (1.84) 10.27 (2.03) −0.48
WM NIH-WM 103.54 (14.07) 111.47 (7.89) −0.68
IC NIH-FICT 112.64 (9.46) 114.35 (10.95) 0.05
McGill et al. (2016) c 13 13 8/5 8/5 28.15 (7.97) 28.5 (7.49) WM NWR 9.69 (2.63) 12.38 (1.26) −1.26
Mersov et al. (2016) c 12 12 10/2 10/2 32 (6) 30 (8) WM FDS 11.3 (2.9) 12.4 (2.1) −0.42
Ning et al. (2017) c 15 15 13/2 13/2 28.5 (9.8) 27.8 (7.9) BI GNG 5.33 (2.13) 4.60 (2.61) −0.31
Sasisekaran (2013) c 9 9 8/1 8/1 32.2 (14.7) 31.8 (14.6) WM NRT-1sy 91.70 (7.40) 95.10 (7.20) −0.44
NRT-2sy 97.20 (3.60) 98.50 (2.80) −0.38
NRT-3sy 97.10 (4.0) 98.80 (3.50) −0.43
NRT-4sy 89.90 (7.90) 92.80 (6.0) −0.39
WM FDS-W 11.3 (2.1) 11.4 (1.51) −0.05
WM BDS-W 7.1 (2.9) 7.7 (2.1) −0.23
Sasisekaran & Weisberg (2014) c 10 10 9/1 9/1 30.7 (15.1) 30.8 (14.9) WM NRT-PC p = .62f −0.22
NRT-NPC p = .006f −1.33
NRT-4sy p = .18f −0.60
NRT-6sy p = .01f −1.23
WM FDS 11.8 (1.7) 11.0 (23) 0.38
WM BDS 7.0 (2.9) 7.6 (1.6) −0.25
WM SR 88.70 (5.60) 90.10 (9.40) −0.17
Schwartze & Kotz (2020) c 12 12 11/1 n/a 33.3 (7.45) 32.6 (7.25) WM BDS n/a n/af −1.03
Smits-Bandstra & Gracco (2015) c 14 14 8/6 8/6 65.1 (5.7) 65.5 (5.6) WM FDS-W 12.1 (1.9) 12.1 (2.3) 0.00
WM BDS-W 8.5 (2.1) 8.9 (2.4) −0.17
Subramanian (2001) a 12 12 11/1 11/1 28.69 (n/a) 28.85 (n/a) IC ST-M n/ak n/ak 0.07
ST-V n/ak n/ak 0.43
WM NRT-C 52.64 (1.75) 54.00 (2.32) −0.64
NRT-V 35.64 (2.42) 36.64 (2.11) −0.42
Tendera (2019) a 31 31 22/9 21/10 28.1 (6.44) 26.4 (5.7) BI GNG-M 315.5 (24.59) 321.6 (27.5) 0.23
GNG-V 315.1 (33.57) 319 (29.15) 0.12
Tichenor (2020) a 40 42 29/11 30/12 27.05 (11.2) 24.07 (5.86) WM FDS 12 (2.75) 12 (2.27) 0.00
WM NWR 10.65 (1.98) 10.79 (2.27) −0.07
WM OSpan 21.27 (9.65) 25.36 (9.98) −0.41
WM SSpan 13.22 (6.02) 14.28 (5.23) −0.19
WM RSpan-B1 10.84 (4.65) 11.86 (4.12) −0.23
WM RSpan-B2 10.30 (5.03) 12.24 (5.26) −0.37
Treleaven & Coalson (2021) c 17 17 14/3 9/8 29.59 (9.19) 24.59 (8.83) BI SST 204.5 (15.5) 211.9 (10.15) 0.56
Vallely (2011) a 12 14 10/2 12/2 28.02 (7.35) 27.06 (7.25) WM NWR 8.08 (1.73) 9.00 (2.39) −0.42
MFD 10.17 (3.1) 11.14 (2.35) −0.35
Van der Linden (2013) a , l 18 18 n/a n/a n/a n/a BI SST 476.21 (152.81) 339.90 (140.39) −0.93
Van Lieshout et al. (2014) c 10 10 7/3 7/3 30.6 (7.6) 28.4 (9.2) IC CST n/ak n/ak −0.08
Vincent et al. (2012) 15 15 15/0 15/0 21.11 (n/a) 22.1 (n/a) WM DS-W 19.8 (5.1) 17.8 (3.6) 0.44
WM LNS 6.10 (2.30) 6.50 (1.60) −0.20
Webster (1990) c 24 24 12/12 12/12 29.8 (n/a) 24.6 (n/a) WM FDS-M 8.3 (2.4) 10.0 (1.9) −0.76
WM FDS-F 8.7 (2.4) 10.7 (2.1) −0.86
WM BDS-M 7.0 (1.3) 8.4 (2.9) −0.60
WM BDS-F 6.3 (2.3) 8.3 (2.7) −0.77
Wiltshire (2019) a 38 21 32/6 16/5 31.5 (7.2) 28.6 (6.4) BI SST 217.18 (64.92) 211.05 (67.22) −0.09
Yang et al. (2019) c 19 19 16/3 16/3 26.26 (3.43) 24.84 (2.79) WM FDS 8.79 (1.40) 9.21 (1.31) −0.30
19 18m 16/3 n/a 26.26 (3.43) n/a WM BDS 5.84 (1.70) 7.00 (1.61) −0.68

Note.N indicates the sample size. “Age” refers to mean age in years. M = male; F = female; n/a = the authors did not provide the information; WM = working memory; FDS = forward digit span; W = Wechsler (1981); BI = behavioral/motor inhibition; SST = stop-signal task; LNS = letter–number sequencing recall task; NRT = nonword repetition test (Dollaghan & Campbell, 1998); NWR = nonword repetition task—Comprehensive Test of Phonological Processing (Wagner et al., 1999); Sy = syllable(s); BDS = backward digit span; OSpan = operation span task; SSpan = symmetry span; MFD = memory for digits—Comprehensive Test of Phonological Processing (Wagner et al., 1999); NIH-WM = National Institutes of Health Toolbox for Assessment of Neurological and Behavioral Function; IC = interference control; NIH-FICT = NIH-Flanker Inhibitory Control Test; OMT = oculomotor task; GNG = Go-NoGo; PC = Permissible Combinations; NPC = Non-Permissible Combinations; SR = sentence recall; ST = Stroop (verbal) task; ST-M = ST-manual; ST-V = ST-verbal; NRT-C = NRT percent consonants correct; NRT-V = NRT percent vowels correct; GNG-V = Go-NoGo-Verbal; GNG-M = GNG-manual; RSpan = reading span; B1 = Block 1; B2 = Block 2; CST = classic Stroop task; DS-W = Digit Span–Wechsler.

a

Dissertation/thesis.

b

Peer-reviewed article.

c

The different syllable variant of the task.

d

Adaptation of Dollaghan and Campbell's (1998) test.

e

Raw score of the longest digit span.

f

Adapted from the Comprehensive Test of Phonological Processes.

g

Three AWS and three AWNS were excluded from the final analysis.

h

One participant from the AWNS group failed to complete the task.

i

Responses were typed.

j

Adapted from the Wechsler Adult Intelligence Scale.

k

Reported sample size and p value.

l

See study for the congruent and incongruent reaction time values.

m

See study for the accuracy of the NoGo trials.

Statistical Analysis

All statistical analyses were performed with the Comprehensive Meta-Analysis (Version 4; Borenstein et al., 2022) software package. Hedges's g, a variation of the standardized mean difference correcting for potential bias due to small sample sizes (Hedges & Olkin, 1985), and 95% confidence intervals (CIs) were calculated for each study and construct (see Appendix C for the glossary). When multiple levels of a single measure were extracted from a study (e.g., nonword repetition accuracy for four- and seven-syllable nonwords), the effect sizes for each level were averaged to yield a single effect size for that study. A random-effects model was used in all analyses, weighing the studies based on their sample size and standard error.

The Cochrane Q statistic, using a criterion alpha of .1, instead of the conventional level of .05, and I2 were used to assess heterogeneity across studies (Higgins et al., 2003). Higgins et al. (2003) recommend using a significance level of .1 rather than the conventional .05, due to the limited power of the Cochrane Q test to detect heterogeneity, especially when few studies are included in a meta-analysis. Publication bias was evaluated through a combination of visual inspection of funnel plots, Duval and Tweedie's trim-and-fill procedure (Duval & Tweedie, 2000), and Egger's regression test. Because funnel plot asymmetry can arise from factors other than publication bias (e.g., heterogeneity), the results were interpreted with attention to the direction of any missing studies. Specifically, we considered whether the location of imputed studies aligned with the direction in which bias would be expected to operate (i.e., AWS performing better than AWNS). Finally, we evaluated the robustness of the results for each construct/measure (e.g., nonword repetition) by conducting the “one study–removed” sensitivity analysis. In this approach, the analysis was repeated multiple times, each time excluding one study from the analysis. This iterative process helps determine if any single study disproportionately influences the overall effect size and, consequently, the conclusions are drawn.

Results

The descriptive characteristics of studies included in the meta-analysis are presented in Table 1. The Coalson and Byrd (2015) study included two independent samples, so it provided two separate effect size estimates for the analyses. Of the 39 studies included in the analysis, 23 (59%) were conducted in the United States; seven (17.9%), in Canada; three (7.7%), in the United Kingdom; three (7.7%), in China; two (5.1%), in Germany; and one (2.6%), in the Netherlands. Twenty-seven (75%) of the 39 studies were reported in peer-reviewed journal articles, nine (23.1%) were doctoral dissertations, and three (7.7%) were master's theses. In the majority of the studies (69%), participants were assigned to AWS or AWNS talker groups based on traditional stuttering measures (e.g., percentage of stuttering-like disfluencies from conversational or reading samples [Yairi & Ambrose, 2005], stuttering severity measures based on the Stuttering Severity Instrument [Riley, 1994, 2009]; see Appendix A). For the remaining studies, group membership was primarily based on self-identification as an adult who stutters and/or by clinical judgment of a speech-language pathologist. Twenty-four studies reported that participants in the AWS and AWNS groups were equated or matched on education. The remaining studies did not provide any information on the educational level of the two talker groups. Of the 39 included studies, 17 measured participants' vocabulary skills, of which only two (Haj-Tas, 2007; Sasisekaran & Weisberg, 2014) reported a statistically significant difference between AWS and AWNS, with AWS demonstrating lower vocabulary skills. The remaining 15 studies did not find a statistically significant difference in receptive or expressive vocabulary between AWS and AWNS. Finally, only six studies assessed participants' nonverbal intelligence, with the overwhelming majority using the Test of Nonverbal Intelligence–Fourth Edition (TONI-4; L. Brown et al., 2010) and one study using the Raven's Standard Progressive Matrices (Raven, 1960). None of the studies reported a statistically significant difference in nonverbal intelligence between AWS and AWNS.

Of the 39 total studies, 26 studies included working memory measures, 10 studies included inhibition measures, and three studies included measures for both. An insufficient number of studies on cognitive flexibility were obtained from the search for any meaningful analysis (n = 3; S. Doneva et al., 2018; Gkalitsiou, 2018; Markett et al., 2016). Therefore, quantitative analysis of studies that examined cognitive flexibility was not conducted.

Working Memory

Nonword Repetition

A total of 14 studies examined verbal short-term memory in 225 AWS and 288 AWNS using nonword repetition tasks. As shown in Figure 2, results of the random-effects estimate of the mean effect size revealed a weighted moderate effect size of Hedges's g = −0.57, p < .001 (SE = 0.14, 95% CI [−0.30, −0.85]). This mean effect size suggests that AWS, as a group, scored more than half a standard deviation below AWNS on nonword repetition measures.

Figure 2.

The forest plot shows the study name, statistics for each study, and random relative weight in random. The statistics for each study lists the Hedges's g, lower limit, upper limit, and p value. The data are as follows. 1. Study name: Tichenor, 2020. Hedges' g: negative 0.07. Lower limit: negative 0.49. Upper limit: 0.36. p value: 0.767. Relative weight: 8.57. 2. Study name: Sasisekaran and Weisberg, 2014. Hedges' g: negative 0.80. Lower limit: negative 1.25. Upper limit: negative 0.36. p value: 0.000. Relative weight: 8.45. 3. Study name: Sasisekaran, 2013. Hedges' g: negative 0.41. Lower limit: negative 0.86. Upper limit: 0.03. p value: 0.069. Relative weight: 8.43. 4. Study name: Elsherif et al., 2021. Hedges' g: negative 1.79. Lower limit: negative 2.27. Upper limit: negative 1.32. p value: 0.000. Relative weight: 8.18. 5. Study name: Subramanian, 2001. Hedges' g: negative 0.53. Lower limit: negative 1.09. Upper limit: 0.03. p value: 0.062. Relative weight: 7.48. 6. Study name: McGill, 2016. Hedges' g: negative 0.48. Lower limit: negative 1.07. Upper limit: 0.10. p value: 0.107. Relative weight: 7.21. 7. Study name: Byrd et al., 2015. Hedges' g: negative 0.65. Lower limit: negative 1.27. Upper limit: negative 0.04. p value: 0.038. Relative weight: 7.00. 8. Study name: Byrd et al., 2017. Hedges' g: negative 0.49. Lower limit: negative 1.19. Upper limit: 0.22. p value: 0.178. Relative weight: 6.28. 9. Study name: Brown, 2015. Hedges' g: negative 0.69. Lower limit: negative 1.41. Upper limit: 0.03. p value: 0.060. Relative weight: 6.20. 10. Study name: Vallely, 2011. Hedges' g: negative 0.42. Lower limit: negative 1.18. Upper limit: 0.33. p value: 0.274. Relative weight: 5.93. 11. Study name: Coalson and Byrd, 2015b. Hedges' g: 0.05. Lower limit: negative 0.75. Upper limit: 0.86. p value: 0.896. Relative weight: 5.59. 12. Study name: Coalson and Byrd, 2015a. Hedges' g: negative 0.21. Lower limit: negative 1.02. Upper limit: 0.59. p value: 0.607. Relative weight: 5.57. 13. Study name: McGill et al., 2016. Hedges' g: negative 1.26. Lower limit: negative 2.08. Upper limit: negative 0.44. p value: 0.003. Relative weight: 5.48. 14. Study name: Coalson et al., 2017. Hedges' g: negative 0.46. Lower limit: negative 1.31. Upper limit: 0.40. p value: 0.294. Relative weight: 5.28. 15. Study name: Jackson, 2012. Hedges' g: 0.04. Lower limit: negative 0.97. Upper limit: 1.05. p value: 0.945. Relative weight: 4.36. 16. Pooled. Hedges' g: negative 0.57. Lower limit: negative 0.85. Upper limit: negative 0.30. p value: 0.000. 17. Prediction interval. Hedges' g: negative 0.57. Lower limit: negative 1.53. Upper limit: 0.39. Negative values of Hedges' g implies that the study favors AWNS and positive values of Hedges' g implies that the study favors AWS.

Forest plot for the mean difference in nonword repetition skills between adults who do (AWS) and do not stutter (AWNS). Squares represent effect size per study, with size proportional to its weight in the analysis. Horizontal lines indicate 95% confidence intervals (CIs) per study. The red diamond represents the overall effect size and corresponding 95% CI. The red prediction interval reflects the expected range within which the true effect sizes of future studies fall.

A visual inspection of the funnel plot (see Figure 3) indicated potential evidence of publication bias, and the Duval and Tweedie's trim-and-fill method estimated that at least five effect sizes with negative Hedges's g values would be necessary to make the funnel symmetrical. However, if publication bias were present, it would be more likely to exist against studies showing positive effects (i.e., AWS performing better than AWNS). Because the five missing studies fall in the opposite direction of where bias would be expected, and the Egger's regression test was nonsignificant (β = 1.22, SE = 1.71, p = .24), one could argue that the average effect size estimate for the nonword repetition measure is not affected by publication bias. Other factors (e.g., heterogeneity) could also contribute to funnel asymmetry.

Figure 3.

A forest plot displays the statistics for a number of studies and the random relative weight associated with each study. The study statistics lists the Hedges' g value, lower limit, upper limit, and the p value. The data are as follows. 1. Study name: Elsherif et al., 2021. Hedges' g: negative 0.54. Lower limit: negative 0.96. Upper limit: negative 0.12. p value: 0.011. Relative weight: 9.20. 2. Study name: Tichenor, 2020. Hedges' g: 0.00. Lower limit: negative 0.43. Upper limit: 0.43. p value: 1.000. Relative weight: 9.03. 3. Study name: Brocklehurst and Corley, 2011. Hedges' g: negative 0.41. Lower limit: negative 0.90. Upper limit: 0.08. p value: 0.099. Relative weight: 7.95. 4. Study name: Lescht, 2017. Hedges' g: negative 0.37. Lower limit: negative 0.87. Upper limit: 0.13. p value: 0.145. Relative weight: 7.82. 5. Study name: Webster, 1990. Hedges' g: negative 0.81. Lower limit: negative 1.38. Upper limit: negative 0.24. p value: 0.006. Relative weight: 6.71. 6. Study name: Baird, 1996. Hedges' g: negative 0.55. Lower limit: negative 1.14. Upper limit: 0.04. p value: 0.069. Relative weight: 6.42. 7. Study name: Yang et al., 2019. Hedges' g: negative 0.30. Lower limit: negative 0.93. Upper limit: 0.32. p value: 0.343. Relative weight: 5.98. 8. Study name: Hobler et al., 2023. Hedges' g: negative 0.33. Lower limit: negative 0.98. Upper limit: 0.31. p value: 0.311. Relative weight: 5.77. 9. Study name: Smits-Bandstra and Gracco, 2015. Hedges' g: 0.00. Lower limit: negative 0.72. Upper limit: 0.72. p value: 1.000. Relative weight: 4.97. 10. Study name: Haj-Tas, 2007. Hedges' g: negative 0.47. Lower limit: negative 1.20. Upper limit: 0.26. p value: 0.205. Relative weight: 4.87. 11. Study name: Vallely, 2011. Hedges' g: negative 0.35. Lower limit: negative 1.10. Upper limit: 0.41. p value: 0.368. Relative weight: 4.66. 12. Study name: Mersov et al., 2016. Hedges' g: negative 0.42. Lower limit: negative 1.20. Upper limit: 0.36. p value: 0.293. Relative weight: 4.41. 13. Study name: Coalson and Byrd, 2015a. Hedges' g: negative 0.08. Lower limit: negative 0.89. Upper limit: 0.72. p value: 0.840. Relative weight: 4.23. 14. Study name: Coalson et al., 2017. Hedges' g: negative 0.29. Lower limit: negative 1.13. Upper limit: 0.56. p value: 0.506. Relative weight: 3.93. 15. Study name: Sasisekaran and Weisberg, 2014. Hedges' g: 0.38. Lower limit: negative 0.47. Upper limit: 1.23. p value: 0.381. Relative weight: 3.91. 16. Study name: Coalson and Byrd, 2015b. Hedges' g: 1.20. Lower limit: 0.32. Upper limit: 2.07. p value: 0.008. Relative weight: 3.70. 17. Study name: Sasisekaran, 2013. Hedges' g: negative 0.05. Lower limit: negative 0.93. Upper limit: 0.83. p value: 0.908. Relative weight: 3.69. 18. Jackson, 2012. Hedges' g: 0.81. Lower limit: negative 0.24. Upper limit: 1.86. p value: 0.131. Relative weight: 2.77. 19. Pooled. Hedges' g: negative 0.23. Lower limit: negative 0.42. Upper limit: negative 0.04. p value: 0.018. 20. Prediction interval. Hedges' g: negative 0.23. Lower limit: negative 0.78. Upper limit: 0.32. Studies with negative value of Hedges' g favors AWNS and studies with positive value of Hedges' g favors AWS.

Funnel plot for studies that examined nonword repetition skills of adults who do (AWS) and do not stutter (AWNS). The white circles represent individual studies, and white diamonds represent the pooled estimate. Five imputed studies, based on Duval and Tweedie's trim-and-fill method, are shown as filled circles, with filled diamonds reflecting the imputed/“adjusted” Hedges's g estimates.

Analysis of heterogeneity revealed a significant variation in effect sizes across the studies, Q = 39.80, df = 14, p < .001, suggesting that effect sizes differ more than would be expected by chance alone. The I2 statistic was 64.82%, indicating that approximately 65% of the variance in observed effects reflects variance in true effects rather than sampling error. Based on the prediction interval, shown at the bottom of the forest plot, the true effect size in 95% of all comparable future studies will fall between −1.53 and 0.39. Finally, the direction and magnitude of Hedges's g persisted after sequentially excluding each study, suggesting that the pooled effect size was not greatly affected by any one study.

Forward Digit Span

Seventeen studies assessed verbal short-term memory between 308 AWS and 371 AWNS using forward digit span measures. As shown in Figure 4, based on the pooled distribution, results of the random-effects estimate of the mean effect size revealed a weighted small effect size of Hedges's g = −0.23, p = .02 (SE = 0.09, 95% CI [−0.42, −0.04]). This mean effect size suggests that AWS, as a group, scored about a third of a standard deviation below AWNS on forward digit span measures. A visual inspection of the funnel plot and the Duval and Tweedie's trim-and-fill method failed to reveal any evidence of publication bias.

Figure 4.

A forest plot lists the study statistics and random relative weights for a number of studies. The study statistics are Hedges' g, Lower limit, Upper limit, and p value. The data are as follows. 1. Study name: Elsherif et al., 2021. Hedges' g: negative 0.54. Lower limit: negative 0.96. Upper limit: negative 0.12. p value: 0.011. 2. Study name: Tichenor, 2020. Hedges' g: 0.00. Lower limit: negative 0.43. Upper limit: 0.43. p value: 0.000. 3. Study name: Brocklehurst and Corley, 2011. Hedges' g: negative 0.41. Lower limit: negative 0.90. Upper limit: 0.08. p value: 0.099. 4. Study name: Lescht, 2017. Hedges' g: negative 0.37. Lower limit: negative 0.87. Upper limit: 0.13. p value: 0.145. 5. Study name: Webster, 1990. Hedges' g: negative 0.81. Lower limit: negative 1.38. Upper limit: negative 0.24. p value: 0.006. 6. Study name: Baird, 1996. Hedges' g: negative 0.55. Lower limit: negative 1.14. Upper limit: 0.04. p value: 0.069. 7. Study name: Yang et al., 2019. Hedges' g: negative 0.30. Lower limit: negative 0.93. Upper limit: 0.32. p value: 0.343. 8. Study name: Hobler et al., 2023. Hedges' g: negative 0.33. Lower limit: negative 0.98. Upper limit: 0.31. p value: 0.311. 9. Study name: Smits-Bandstra and Gracco, 2015. Hedges' g: 0.00. Lower limit: negative 0.72. Upper limit: 0.72. p value: 0.000.

Forest plot for the mean difference in forward digit span skills between adults who do (AWS) and do not stutter (AWNS). Squares represent effect size per study, with size proportional to its weight in the analysis. Horizontal lines indicate 95% confidence intervals (CIs) per study. The red diamond represents the overall effect size and corresponding 95% CI. The red prediction interval reflects the expected range within which the true effect sizes of future studies fall.

Analysis of heterogeneity revealed significant variation in effect sizes across the studies, Q = 26.38, df = 17, p = .07. Based on the I2 statistic, approximately 35% of the variance in observed effects reflects variance in true effects rather than sampling error. Based on the prediction interval, the true effect size in 95% of all comparable future studies will fall between −0.78 and 0.32. A “one study–removed” sensitivity analysis indicated that the results are robust and stable, as the average effect size remained essentially unchanged with any one study removed.

Backward Digit Span

Ten studies assessed verbal working memory between 162 AWS and 161 AWNS using the backward digit span measures. As shown in Figure 5, based on the pooled distribution, results of the random-effects estimate of the mean effect size revealed a weighted moderate effect size of Hedges's g = −0.38, p = .004 (SE = 0.13, 95% CI [−0.64, −0.12]), with AWNS, as a group, outperforming the AWS group. A visual inspection of the funnel plot and the Duval and Tweedie's trim-and-fill method did not reveal any evidence of publication bias.

Figure 5.

A forest plot of the study statistics and the relative weight for a number of studies. The data are as follows. 1. Study name: Webster, 1990. Hedges\u2019 g: negative 0.71. Lower limit: negative 1.29. Upper limit: negative 0.14. p value: 0.015. Relative weight: 12.50. 2. Study name: Baird, 1996. Hedges\u2019 g: negative 0.06. Lower limit: negative 0.64. Upper limit: 0.52. p value: 0.830. Relative weight: 12.34. Relative weight: 12.34. 3. Study name: Yang et al., 2019. Hedges\u2019 g: negative 0.68. Lower limit: negative 1.33. Upper limit: negative 0.04. p value: 0.039. Relative weight: 10.65. 4. Study name: Hobler et al., 2023. Hedges\u2019 g: negative 0.98. Lower limit: negative 1.66. Upper limit: negative 0.30. p value: 0.005. Relative weight: 10.05. 5. Study name: Smits-Bandstra and Gracco, 2015. Hedges\u2019 g: negative 0.17. Lower limit: negative 0.89. Upper limit: 0.55. p value: 0.640. Relative weight: 9.21. 6. Study name: Haj-Tas, 2007. Hedges\u2019 g: negative 0.60. Lower limit: negative 1.33. Upper limit: 0.14. p value: 0.112. Relative weight: 8.93. 7. Study name: Coalson and Byrd, 2015b. Hedges\u2019 g: 0.60. Lower limit: negative 0.22. Upper limit: 1.42. p value: 0.152. Relative weight: 7.54. 8. Study name: Coalson et al., 2017. Hedges\u2019 g: negative 0.03. Lower limit: negative 0.87. Upper limit: 0.81. p value: 0.943. Relative weight: 7.45. 9. Study name: Sasisekaran and Weisberg, 2014. Hedges\u2019 g: negative 0.25. Lower limit: negative 1.09. Upper limit: 0.60. p value: 0.568. Relative weight: 7.32. 10. Study name: Sasisekaran, 2013. Hedges\u2019 g: negative 0.23. Lower limit: negative 1.11. Upper limit: 0.66. p value: 0.616. Relative weight: 7.27. 11. Pooled. Hedges\u2019 g: negative 0.38. Lower limit: negative 0.64. Upper limit: negative 0.12. p value: 0.004. Relative weight: 6.76. 12. Prediction interval. Hedges\u2019 g: negative 0.38. Lower limit: negative 0.97. Upper limit: 0.22.

Forest plot for the mean difference in backward digit span skills between adults who do (AWS) and do not stutter (AWNS). Squares represent effect size per study, with size proportional to its weight in the analysis. Horizontal lines indicate 95% confidence intervals (CIs) per study. The red diamond represents the overall effect size and corresponding 95% CI. The red prediction interval reflects the expected range within which the true effect sizes of future studies fall.

Analysis of heterogeneity did not support a significant variation in effect sizes across the studies, Q = 13.83, df = 10, p = .18. Based on the I2 statistic, approximately 28% of the variance in observed effects reflects variance in true effects rather than sampling error. Based on the prediction interval, the true effect size in 95% of all comparable future studies will fall between −0.97 and 0.22. Sensitivity analysis with one study removed did not change the average effect size estimate.

OSpan

Three studies examined complex verbal working memory skills between 77 AWS and 80 AWNS using the OSpan measures. As shown in Figure 6, based on the pooled distribution, the results of the random-effects estimate of the mean effect size revealed a weighted small–moderate effect size of Hedges's g = −0.37, p = .018 (SE = 0.16, 95% CI [−0.69, −0.06]), with AWS, as a group, scoring lower on OSpan tasks than the AWNS group. A visual inspection of the funnel plot failed to reveal any evidence of publication bias. An analysis of heterogeneity revealed the absence of variability across studies (I2 = 0), Q = 0.487, df = 2, p = .784. Given the zero between-study variance, a prediction interval was not calculated, and a sensitivity analysis was not performed. However, these results should be interpreted with caution because estimates of heterogeneity based on fewer than 10 studies may be unreliable (Borenstein, 2020).

Figure 6.

A Forest plot displays the statistics and the random relative weights for a number of studies. The statistics are Hedges' g, standard error, lower limit, upper limit, and p value. The data are as follows. 1. Study name: Tichenor, 2020. Hedges' g: negative 0.41. Standard error: 0.22. Lower limit: negative 0.85. Upper limit: 0.02. p value: 0.062. Relative weight: 51.52. 2. Study name: Eichorn et al., 2016. Hedges' g: negative 0.19. Standard error: 0.31. Lower limit: negative 0.81. Upper limit: 0.42. p value: 0.541. Relative weight: 25.47. 3. Study name: Garnett et al., 2023. Hedges' g: negative 0.49. Standard error: 0.33. Lower limit: negative 1.14. Upper limit: 0.16. p value: 0.139. Relative weight: 23.01. 4. Pooled. Hedges' g: negative 0.37. Standard error: 0.16. Lower limit: negative 0.69. Upper limit: negative 0.06. p value: 0.018. A negative value of Hedges' g favors AWNS and a positive value of Hedges' g favors AWS.

Forest plot for the mean difference in operation span (OSpan) skills between adults who do (AWS) and do not stutter (AWNS). Squares represent effect size per study, with size proportional to its weight in the analysis. Horizontal lines indicate 95% confidence intervals (CIs) per study. The red diamond represents the overall effect size and corresponding 95% CI.

Overall Working Memory

Some studies used working memory measures (e.g., list sorting working memory task) that did not fit into the discrete categories of nonword repetition, forward digit span, backward digit span, and OSpan. These measures were combined with all the other measures to generate an overall working memory effect size. In total, 29 studies assessed working memory skills. As shown in Figure 7, the results of the random-effects estimate of the overall mean effect size revealed a weighted medium effect size of Hedges's g = −0.41 (SE = 0.07, 95% CI [−0.55, −0.27]).

Figure 7.

A forest plot displays the Hedges\u2019 g value, the p value, and the random relative weight for a number of studies. The data are as follows. 1. Study name: Tichenor, 2020. Hedges\u2019 g: negative 0.21. p value: 0.019. Relative weight: 4.78.2. Study name: Gkalitsiou, 2018. Hedges\u2019 g: negative 0.82. p value: 0.000. Relative weight: 4.56. 3. Study name: Elsherif et al., 2021. Hedges\u2019 g: negative 1.09. p value: 0.000. Relative weight: 4.47. 4. Sasisekaran and Weisberg, 2014. Hedges\u2019 g: negative 0.45. p value: 0.008. Relative weight: 4.24. 5. Study name: Sasisekaran, 2013. Hedges\u2019 g: negative 0.32. p value: 0.083. Relative weight: 3.99. 6. Study name: Webster, 1990. Hedges\u2019 g: negative 0.75. p value: 0.000. Relative weight: 3.90. 7. Study name: Baird, 1996. Hedges\u2019 g: negative 0.30. p value: 0.154. Relative weight: 3.87. 8. Study name: McGill, 2016. Hedges\u2019 g: negative 0.58. p value: 0.007. Relative weight: 3.76. 9. Study name: Eichom et al., 2016. Hedges\u2019 g: negative 0.15. p value: 0.510. Relative weight: 3.67. 10. Study name: Yang et al., 2019. Hedges\u2019 g: negative 0.49. p value: 0.032. Relative weight: 3.57. 11. Study name: Hobler et al., 2023. Hedges\u2019 g: negative 0.64. p value: 0.007. Relative weight: 3.55. 12. Study name: Coalson and Byrd, 2015a. Hedges\u2019 g: 0.58. p value: 0.017. Relative weight: 3.48. 13. Study name: Coalson et al., 2017. Hedges\u2019 g: negative 0.26. p value: 0.305. Relative weight: 3.44. 15. Study name: Brocklehurst and Corley, 2011. Hedges\u2019 g: negative 0.41. p value: 0.099. Relative weight: 3.43. 16. Study name: Vincent et al., 2012. Hedges\u2019 g: 0.12. p value: 0.638. Relative weight: 3.39. 17. Study name: Lescht, 2017. Hedges\u2019 g: negative 0.37. p value: 0.145. Relative weight: 3.39. 18. Study name: Smits-Bandstra and Gracco, 2015. Hedges\u2019 g: negative 0.09. p value: 0.741. Relative weight: 3.32. 19. Study name: Haj-Tas, 2007. Hedges\u2019 g: negative 0.53. p value: 0.043. Relative weight: 3.27. 20. Study name: Vallely, 2011. Hedges\u2019 g: negative 0.38. p value: 0.159. Relative weight: 3.18. 21. Study name: Subramanian, 2001. Hedges\u2019 g: negative 0.53. p value: 0.062. Relative weight: 3.05. 22. Study name: Byrd et al., 2015. Hedges\u2019 g: negative 0.65. p value: 0.038. Relative weight: 2.76. 23. Study name: Garnett et al., 2023. Hedges\u2019 g: negative 0.49. p value: 0.139. Relative weight: 2.60. 24. Study name: Byrd et al., 2017. Hedges\u2019 g: negative 0.49. p value: 0.178. Relative weight: 2.35. 25. Study name: Brown, 2015. Hedges\u2019 g: negative 0.69. p value: 0.060. Relative weight: 2.31. 26. Study name: Jackson, 2012. Hedges\u2019 g: 0.41. p value: 0.273. Relative weight: 2.27. 27. Study name: Mersov et al., 2016. Hedges\u2019 g: negative 0.42. p value: 0.293. Relative weight: 2.08. 28. Study name: Bauerly, 2012. Hedges\u2019 g: 0.53. p value: 0.196. Relative weight: 2.00. 29. Study name: McGill et al., 2016. Hedges\u2019 g: negative 1.26. p value: 0.003. Relative weight: 1.95. 30. Study name: Schwartze and Kotz, 2020. Hedges\u2019 g: negative 1.03. p value: 0.015. Relative weight: 1.93. 31. Pooled. Hedges\u2019 g: negative 0.41. p value: 0.000. 32. Prediction interval. Hedges\u2019 g: negative 0.41. Negative Hedges\u2019 g value indicates that the study favors AWNS. Positive Hedges\u2019 g value indicates that the study favors AWS.

Forest plot for the mean difference in overall working memory skills between adults who do (AWS) and do not stutter (AWNS). Squares represent effect size per study, with size proportional to its weight in the analysis. Horizontal lines indicate 95% confidence intervals (CIs) per study. The red diamond represents the overall effect size and corresponding 95% CI. The red prediction interval reflects the expected range within which the true effect sizes of future studies fall.

A visual inspection of the funnel plot (see Figure 8) indicated potential evidence of publication bias, and the Duval and Tweedie's trim-and-fill method estimated that at least seven effect sizes with positive Hedges's g values would be necessary to make the funnel symmetrical. However, the Egger's regression test showed a nonsignificant intercept (β = .06, SE = .80, p = .46), suggesting that the average effect size estimate for studies examining working memory may not necessarily be affected by publication bias.

Figure 8.

A funnel plot. The vertical axis represents the standard error and the horizontal axis represents Hedges' g. A vertical line representing the overall effect intersects the horizontal axis at negative 0.3. Two diagonal lines representing the 95 percent confidence interval intersect the horizontal axis at negative 1.3 and positive 0.7, respectively. Almost all the white points are inside the funnel. Black data points are equally distributed inside and outside the funnel. A white diamond centered at an Hedges' g value of negative 0.4 and a black diamond centered at an Hedges' g value of negative 0.3 are marked.

Funnel plot for studies that examined the working memory skills of adults who do and do not stutter. The white circles represent individual studies, and white diamonds represent the pooled estimate. Five imputed studies, based on Duval and Tweedie's trim-and-fill method, are shown as filled circles, with filled diamonds reflecting the imputed/“adjusted” Hedges's g estimates.

Analysis of heterogeneity revealed significant variation in effect sizes across the studies, Q = 78.125, df = 30, p < .001. Based on the I2 statistic, approximately 63% of the variance in observed effects reflects variance in true effects rather than sampling error. This heterogeneity is not surprising given the different tasks/measures included in the analysis. Based on the prediction interval, the true effect size in 95% of all comparable future studies will fall between −1.02 and 0.21. A “one study–removed” sensitivity analysis indicated that the results are robust and stable, as the average effect size remained essentially unchanged when any one study was removed.

We also conduct a separate analysis, including only the 13 studies (out of 29) that reported no statistically significant differences in language or vocabulary skills between the two talker groups. The overall effect size for working memory was −0.55, which remained statistically significant (p < .001). This suggests that the observed differences in working memory skills are unlikely to be attributable to differences in vocabulary. However, it should be noted that these 13 studies assessed only receptive and/or expressive vocabulary and did not include measures of higher order language processing.

Inhibition

Interference Control

Four studies examined interference control between 59 AWS and 59 AWNS using the Stroop or flanker measures. As shown in Figure 9, results of the random-effects estimate of the mean effect size revealed a weighted small and nonsignificant effect size of Hedges's g = 0.19 (SE = 0.15, 95% CI [−0.12, 0.49]). This result suggests that AWS and AWNS do not differ significantly in interference control inhibition. A visual inspection of the funnel plot and the Duval and Tweedie's trim-and-fill method did not reveal any evidence of publication bias.

Figure 9.

A forest plot displays the study statistics and the random relative weight of a number of studies. The study statistics lists the Hedges' g value, lower limit, upper limit, and p value. The data are as follows. 1. Study title: Subramanian, 2001. Hedges' g: 0.21. Lower limit: negative 0.27. Upper limit: 0.68. p value: 0.399. Relative weight: 40.83. 2. Study title: McGill, 2016. Hedges' g: 0.05. Lower limit: negative 0.53. Upper limit: 0.63. p value: 0.859. Relative weight: 27.56. 3. Study title: Gkalisiou, 2018. Hedges' g: 0.53. Lower limit: negative 0.18. Upper limit: 1.24. p value: 0.140. Random relative weight: 18.45. 4. Study title: Van Lieshout et al., 2014. Hedges' g: negative 0.08. Lower limit: negative 0.92. Upper limit: 0.76. p value: 0.858. Relative weight: 13.16. 5. Pooled. Hedges' g: 0.19. Lower limit: negative 0.12. Upper limit: 0.49. p value: 0.230. Negative value of Hedges' g indicates that the studies favor AWNS and positive value of Hedges' g indicates that the studies favor AWS.

Forest plot for the mean difference in interference inhibition control skills between adults who do (AWS) and do not stutter (AWNS). Squares represent effect size per study, with size proportional to its weight in the analysis. Horizontal lines indicate 95% confidence intervals (CIs) per study. The red diamond represents the overall effect size and corresponding 95% CI.

An analysis of heterogeneity revealed the absence of variability across studies (I2 = 0), Q = 1.51, df = 3, p = .70. Given the zero between-study variance, a prediction interval was not calculated, and a sensitivity analysis was not performed. However, as previously mentioned, these results need to be interpreted with caution because estimates of heterogeneity based on fewer than 10 studies could be unreliable (Borenstein, 2020; Borenstein et al., 2021).

Behavioral/Motor Inhibition

Eight studies examined behavioral/motor inhibition of 204 AWS and 188 AWNS using the Go-NoGo and stop-signal measures. The Markett et al. (2016) study included two independent samples, so it provided two separate effect size estimates to the analysis. As shown in Figure 10, based on the pooled distribution, results of the random-effects estimate of the mean effect size revealed a nonsignificant effect size of Hedges's g = −0.25 (SE = 0.14, 95% CI [−0.52, 0.02]). This mean effect size suggests that AWS, as a group, scored similarly to AWNS on behavioral/motor inhibition measures.

Figure 10.

A forest plot lists the study statistics consisting of Hedges\u2019 g value, lower limit, upper limit, and p value, followed by the random relative weight for a number of studies. The data are as follows. 1. Study title: Treleaven and Coalson, 2021. Hedges\u2019 g: 0.25. Lower limit: negative 0.09. Upper limit: 0.59. p value: 0.147. Relative weight: 14.88. 2. Study title: Tendera, 2019. Hedges\u2019 g: 0.17. Lower limit: negative 0.18. Upper limit: 0.52. p value: 0.331. Relative weight: 14.70. 3. Study title: Maxfield, 2017. Hedges\u2019 g: negative 0.22. Lower limit: negative 0.71. Upper limit: 0.27. p value: 0.385. Relative weight: 11.82. 4. Study title: Wiltshire, 2019. Hedges\u2019 g: negative 0.09. Lower limit: negative 0.62. Upper limit: 0.43. p value: 0.732. Relative weight: 11.19. 5. Study title: Markett et al., 2016a. Hedges\u2019 g: negative 0.59. Lower limit: negative 1.11. Upper limit: negative 0.06. p value: 0.029. Relative weight: 11.16. 6. Study title: Markett et al., 2016b. Hedges\u2019 g: negative 0.59. Lower limit: negative 1.12. Upper limit: negative 0.07. p value: 0.028. Relative weight: 11.16. 7. Study title: Van den Linden, 2013. Hedges\u2019 g: negative 0.91. Lower limit: negative 1.58. Upper limit: negative 0.24. p value: 0.008. Relative weight: 8.82. 8. Study title: Ning et al., 2017. Hedges\u2019 g: negative 0.30. Lower limit: negative 1.00. Upper limit: 0.40. p value: 0.404. Relative weight: 8.43. 9. Study title: Bakhtiar and Eggers, 2023. Hedges\u2019 g: negative 0.52. Lower limit: negative 1.26. Upper limit: 0.23. p value: 0.175. Relative weight: 7.84. 10. Pooled. Hedges\u2019 g: negative 0.25. Lower limit: negative 0.52. Upper limit: 0.02. p value: 0.069. 11. Prediction interval. Hedges\u2019 g: negative 0.25. Lower limit: negative 1.06. Upper limit: 0.56.

Forest plot for the mean difference in behavioral/motor inhibition control skills between adults who do (AWS) and do not stutter (AWNS). Squares represent effect size per study, with size proportional to its weight in the analysis. Horizontal lines indicate 95% confidence intervals (CIs) per study. The red diamond represents the overall effect size and corresponding 95% CI. The red prediction interval reflects the expected range within which the true effect sizes of future studies fall.

A visual inspection of the funnel plot (see Figure 11) indicated potential evidence of publication bias, and the Duval and Tweedie's trim-and-fill method estimated that at least four effect sizes with positive Hedges's g values would be necessary to make the funnel symmetrical (adjusted effect size = 0.03, 95% CI [−0.26, 0.33]). Furthermore, analysis of heterogeneity revealed significant variation in effect sizes across the studies, Q = 20.09, df = 8, p = .01. Based on the I2 statistic, approximately 60% of the variance in observed effects reflects variance in true effects rather than sampling error. Based on the prediction interval, the true effect size in 95% of all comparable future studies will fall between −1.06 and 0.56.

Figure 11.

A funnel plot. The y-axis represents the standard error and the x-axis represents Hedges' g. The vertical line representing the overall effect intersects the x-axis at 0.1. The slant lines representing the 95 percent confidence intervals intersect the x-axis at negative 0.7 and positive 0.8. White open circular data points are mostly distributed between the vertical line and the left slant line. A white diamond is marked between x-values of 0 and negative 0.3. A single black filled circle is within the right slant line. Two other filled black circular points are outside the right slant line. A black diamond is marked between x-values of negative 0.1 and positive 0.1.

Funnel plot for studies that examined behavioral/motor inhibition skills of adults who do (AWS) and do not stutter (AWNS). The white circles represent individual studies, and white diamonds represent the pooled estimate. The imputed studies, based on Duval and Tweedie's trim-and-fill method, are shown as filled circles, with filled diamonds reflecting the imputed/“adjusted” Hedges's g estimates.

Finally, we performed a sensitivity analysis with the “one study–removed” (see Figure 12) approach. Upon removing the Treleaven and Coalson (2021) and Tendera (2019) studies, the pooled effect size changed from −0.25 to −0.33 and −0.32, respectively, and became statistically significant, with AWS scoring significantly lower than AWNS in measures of inhibition. Thus, it seems that these two studies had a notable impact on the pooled effect size. Therefore, while the overall trend supports that AWS exhibit poorer inhibitory control than AWNS, the findings highlight the importance of carefully considering the influence of individual studies on the overall effect size.

Figure 12.

A forest plot lists the statistics with study removed, the Hedges\u2019 g and the 95 percent confidence interval for a number of studies. The statistics are point, lower limit, upper limit, and the p value. The data are as follows. 1. Study title: Treleaven and Coalson, 2021. Point: negative 0.33. Lower limit: negative 0.59. Upper limit: negative 0.06. p value: 0.015. 2. Study title: Markett et al., 2016a. Point: negative 0.21. Lower limit: negative 0.49. Upper limit: 0.08. p value: 0.152. 3. Study title: Markett et al., 2016b. Point: negative 0.21. Lower limit: negative 0.49. Upper limit: 0.08. p value: 0.153. 4. Study title: Wiltshire, 2019. Point: negative 0.28. Lower limit: negative 0.58. Upper limit: 0.03. p value: 0.073. 5. Study title: Maxfield, 2017. Point: negative 0.26. Lower limit: negative 0.57. Upper limit: 0.04. p value: 0.092. 6. Study title: Van den Linden, 2013. Point: negative 0.18. Lower limit: negative 0.43. Upper limit: 0.08. p value: 0.177. 7. Study title: Bakhtiar and Eggers, 2023. Point: negative 0.23. Lower limit: negative 0.52. Upper limit: 0.06. p value: 0.115. 8. Study title: Ning et al., 2017. Point: negative 0.25. Lower limit: negative 0.55. Upper limit: 0.04. p value: 0.094. 9. Study title: Tendera, 2019. Point: negative 0.32. Lower limit: negative 0.61. Upper limit: negative 0.03. p value: 0.029. 10. Pooled. Point: negative 0.25. Lower limit: negative 0.52. Upper limit: 0.02. p value: 0.069. 11. Prediction interval. Point: negative 0.25. Lower limit: negative 1.06. Upper limit: 0.56. p value: 0.000. A negative value of Hedges\u2019 g indicates that the study favors AWNS and a positive value of Hedges\u2019 g indicates that the study favors AWS.

Forest plot for the one study–removed sensitivity analysis. Each line/row displays the pooled effect size and its 95% confidence interval when the respective study was excluded from the analysis.

Oculomotor Inhibition

During the timeframe for this review, only one study (Gkalitsiou et al., 2020) examined oculomotor inhibition measures between AWS and AWNS. Given that a minimum of two studies is technically required to conduct a meaningful meta-analysis (Borenstein et al., 2021; Valentine et al., 2010), a descriptive summary of the study is provided as follows: Gkalitsiou et al. (2020) examined oculomotor inhibition in 17 AWS and 17 AWNS using the antisaccade task. On this computer-based task, participants were required to inhibit their prepotent instinct to look at a sudden target stimulus (prosaccade) and instead make a voluntary eye movement in the opposite direction of the stimulus (antisaccade). The authors did not find significant differences in accuracy or reaction time in the antisaccade trials between AWS and AWNS, suggesting a lack of deficit in oculomotor inhibition in AWS.

Cognitive Inhibition

No study was identified that examined cognitive inhibition measures between AWS and AWNS.

Overall Inhibition

Similar to the overall working memory analysis, we calculated an overall inhibition effect size by combining the four studies on interference control, the eight studies on behavioral/motor inhibition, and the study on oculomotor inhibition (Gkalitsiou et al., 2020). The Markett et al. (2016) study included two independent samples, so it provided two separate effect size estimates to the analysis. The results of the random-effects estimate of the overall mean effect size revealed a weighted small and nonsignificant effect size of Hedges's g = −0.10 (SE = 0.11, 95% CI [−0.31, 0.12]). These results suggest that AWS and AWNS do not differ significantly in inhibition (see Figure 13). A visual inspection of the funnel plot and the Duval and Tweedie's trim-and-fill method did not reveal any evidence of publication bias.

Figure 13.

A forest plot lists the study statistics and the relative random weight for a number of studies. The study statistics are Hedges\u2019 g, lower limit, upper limit, and the p value. The data are as follows. 1. Study title: Treleaven and Coalson, 2021. Hedges\u2019 g: 0.25. Lower limit: negative 0.09. Upper limit: 0.59. p value: 0.147. Relative weight: 11.54. 2. Study title: Tendera, 2019. Hedges\u2019 g: 0.17. Lower limit: negative 0.18. Upper limit: 0.52. p value: 0.331. Relative weight: 11.39. 3. Study title: Maxfield, 2017. Hedges\u2019 g: negative 0.22. Lower limit: negative 0.71. Upper limit: 0.27. p value: 0.385. Relative weight: 8.97. 4. Study title: Wiltshire, 2019. Hedges\u2019 g: negative 0.09. Lower limit: negative 0.62. Upper limit: 0.43. p value: 0.732. Relative weight: 8.45. 5. Study title: Markett et al., 2016a. Hedges\u2019 g: negative 0.59. Lower limit: negative 1.11. Upper limit: negative 0.06. p value: 0.029. 6. Study title: Markett et al., 2016b. Hedges\u2019 g: negative 0.59. Lower limit: negative 1.12. Upper limit: negative 0.07. p value: 0.028. Relative weight: 8.42. 7. Study title: McGill, 2016. Hedges\u2019 g: 0.05. Lower limit: negative 0.53. Upper limit: 0.63. p value: 0.859. Relative weight: 7.69. 8. Study title: Van der Linden, 2013. Hedges\u2019 g: negative 0.91. Lower limit: negative 1.58. Upper limit: negative 0.24. p value: 0.008. Relative weight: 6.55. 9. Study title: Ning et al., 2017. Hedges\u2019 g: negative 0.30. Lower limit: negative 1.00. Upper limit: 0.40. p value: 0.404. Relative weight: 6.24. 10. Study title: Gkalitsiou, 2018. Hedges\u2019 g: 0.53. Lower limit: negative 0.18. Upper limit: 1.24. p value: 0.140. Relative weight: 6.15. 11. Study title: Bakhtiar and Eggers, 2023. Hedges\u2019 g: negative 0.52. Lower limit: negative 1.26. Upper limit: 0.23. p value: 0.175. Relative weight: 5.75. 12. Study title: Subramanian, 2001. Hedges\u2019 g: 0.43. Lower limit: negative 0.35. Upper limit: 1.21. p value: 0.284. Relative weight: 5.44. 13. Study title: Van Lieshout et al., 2014. Hedges\u2019 g: negative 0.08. Lower limit: negative 0.92. Upper limit: 0.76. p value: 0.858. Relative weight: 4.94. 14. Pooled. Hedges\u2019 g: negative 0.13. Lower limit: negative 0.36. Upper limit: 0.10. p value: 0.260. 15. Prediction interval. Hedges\u2019 g: negative 0.13. Lower limit: negative 0.82. Upper limit: 0.56.

Forest plot for the mean difference in overall inhibition between adults who do (AWS) and do not stutter (AWNS). Squares represent effect size per study, with size proportional to its weight in the analysis. Horizontal lines indicate 95% confidence intervals (CIs) per study. The red diamond represents the overall effect size and corresponding 95% CI. The red prediction interval reflects the expected range within which the true effect sizes of future studies fall.

Analysis of heterogeneity revealed significant variation in effect sizes across the studies, Q = 27.03, df = 13, p = .01. Based on the I2 statistic, approximately 52% of the variance in observed effects reflects variance in true effects rather than sampling error. This heterogeneity is not surprising given the different tasks/measures included in the analysis. Based on the prediction interval, the true effect size in 95% of all comparable future studies will fall between −0.75 and 0.56. Finally, the “one study–removed” sensitivity analysis indicated that the average effect size remained small and not statistically significant with any one study removed.

Cognitive Flexibility

To date, three studies (i.e., S. Doneva et al., 2018; Gkalitsiou, 2018; Markett et al., 2016) that examined cognitive flexibility in AWS and AWNS met the inclusion criteria for the present study. However, as may be recalled, while a minimum of two studies is required to conduct a meta-analysis (Borenstein et al., 2021), three studies may likewise also be insufficient to assess meaningful heterogeneity and perform a sensitivity analysis. Thus, we provide a descriptive summary of these studies as follows: Markett et al. (2016) used the category switch task (Mayr & Kliegl, 2000) to examine the cognitive flexibility skills of 28 AWS and 28 AWNS and found no significant differences in the accuracy (p = .14) or reaction time (p = .48) between the two groups. Subsequently, Doneva et al. (2018) utilized the Visual Elevator subtest from the Test of Everyday Attention (Robertson et al., 1994) to investigate attentional switching/cognitive flexibility in 50 AWS and 50 AWNS. They also found no significant differences in the estimated time per switch between the two groups, p = .72. Finally, Gkalitsiou (2018) also found no significant differences between the two groups after administering the NIH Dimensional Change Card Sorting (DCCS) Test to 15 AWS and 15 AWNS, p = .13. Overall, the current evidence suggests that there are no significant differences between AWS and AWNS in cognitive flexibility skills; however, such findings should be interpreted with caution given the paucity of studies in this core domain of executive function. In addition to the variation in sample sizes, there are important differences in the tasks across the studies. Whereas the Test of Everyday Attention has normative data for adults and is typically used in clinical settings, the category switch task is experimental, with unknown ecological validity. On the other hand, the NIH DCCS is adaptable to different age groups and has moderate ecological validity (Zelazo et al., 2014). Given the methodological variations, further studies are needed to provide additional insight into the nature of cognitive flexibility skills in AWS.

Discussion

The purpose of this study was to compare the performance of AWS with that of AWNS on behavioral tasks of executive function in three specific subdomains: working memory, inhibition, and cognitive flexibility. A review of published studies and dissertations was systematically conducted, followed by a meta-analysis of the quantitative measures in each subdomain across the eligible studies. Results indicate that AWS, as a group, performed significantly lower than AWNS on tasks of working memory. However, effect size differences between the two groups were within 1 SD, suggesting that, on average, AWS demonstrate relative, rather than clinical, differences in working memory when compared to AWNS. No significant difference in inhibition skills was identified across studies. Cognitive flexibility could not be assessed due to the low number of published studies meeting eligibility criteria for meaningful analysis. These findings support the notion that, like children who stutter, AWS demonstrate subtle yet significant difficulties in at least one subdomain of executive function skills.

Working Memory

Across tasks, overall working memory abilities were significantly lower for AWS than AWNS (n = 31 studies; Hedges's g = −0.39, moderate effect size; p < .001). In terms of working memory tasks, AWS were found to be significantly less accurate on tasks of nonword repetition (n = 15 studies; Hedges's g = −0.57, moderate effect size; p < .001), forward digit span (n = 18 studies; Hedges's g = −0.23, small effect size; p = .018), and backward digit span (n = 11 studies; Hedges's g = −0.38, moderate effect size; p = .004). Sensitivity analyses confirmed these findings were present after conservative, step-down re-analysis. A final analysis of OSpan revealed potential group differences favoring AWNS (Hedges's g = −0.49, small-to-moderate effect size; p = .017); however, due to the limited number of studies (n = 4), results for OSpan should be interpreted with caution. Although no publication bias was confirmed for any specific task (i.e., nonword repetition, forward or backward digit span, OSpan), such potential was identified for analysis of overall working memory and behavioral inhibition.

The findings that AWS exhibited lower accuracy on nonword repetition and forward digit span tasks are consistent with the meta-analytical review of children who stutter by Ofoe et al. (2018). Like Ofoe et al., the between-group difference in accuracy on the forward span tasks was less pronounced than the differences on the nonword repetition tasks. The magnitude of group differences was also similar for AWS in the present study and children who stutter in the Ofoe et al. study, in that accuracy did not exceed 1 SD of the mean for either age group. Based on the similarity between studies, one may apply similar interpretations across age groups in each study. First, given the larger effect size for nonword repetition relative span tasks, differences in working memory in AWS appear to be more evident in the absence of long-term lexical knowledge. Second, given the modest (yet significant) effect size favoring AWNS across tasks, difficulties in working memory in AWS are likely not clinically meaningful and do not reflect a clinically disordered system. Instead, these differences may reflect a working memory system that is less robust and/or more vulnerable to breakdown as demands increase, and particularly when access to long-term lexical knowledge is limited.

However, unlike Ofoe et al. (2018), the inclusion of standardized measures (e.g., nonword repetition: Comprehensive Test of Phonological Processes [CTOPP], Comprehensive Test of Phonological Processes–Second Edition; Wagner et al., 1999, 2013) to assess nonword repetition (10 of the 15 studies) for adult participants provided an opportunity to examine how differences in phonological working memory in each group compare to the general population. Review of the standard scores of AWS across the studies further supports the notion that difficulties in phonological working memory relative to AWNS, although significant, are subtle. For example, of the 10 studies that included a standardized measure of nonword repetition, only one study reported mean standard scores for AWS below 2 SDs (i.e., Elsherif et al., 2021; standard score M = 7). Moreover, of the four studies that provided individual participant CTOPP scores for nonword repetition, a similar proportion of participants from each group performed within typical limits (±2 SDs) compared to the general population (i.e., 70% of 50 AWS, 71% of 55 AWNS). That being said, the distribution of group scores differed. Specifically, 22% of the 30% of AWS who did not score within the typical limits scored 2 SDs below the population mean, compared to only 5% of AWNS (out of 29% of AWNS who did not score within the typical limits). Although the CTOPP does not provide scores by syllable length, low overall scores typically indicate that individuals reached ceiling at relatively short nonword lengths, suggesting difficulties even with items as short as three syllables. Thus, our finding that a greater percentage of AWS than AWNS scored 2 SDs below the population mean suggests that a sizable subset of AWS in these studies was less accurate than the general population during initial encoding and storage of shorter phonological stimuli (e.g., < 3 syllables) that likely do not require rehearsal prior to repetition. Difficulty with syllables as short as two to three syllables, which would be expected to improve with age, is surprising for the adult speakers and suggests that, in addition to difficulties in phonological rehearsal, difficulties with initial storage and encoding observed for younger children by Ofoe et al. at two syllables do not necessarily disappear with age for a nontrivial number of AWS.

Inhibition

Across inhibition tasks, AWS demonstrated overall comparable performance compared to AWNS (Hedges's g = −0.10, p = .37). In terms of interference control, AWS performed similarly to AWNS (Hedges's g = 0.19, p = .23) during tasks that required ignoring distracting stimuli (i.e., flanker tasks, Stroop tasks, continuous processing tasks), although the number of studies identified and eligible for analysis was low (n = 4). In terms of behavioral/motor inhibition, AWS performed similarly to AWNS (Hedges's g = −0.25, small effect size; p = .07) during tasks that require suppression of a dominant movement or response (i.e., SST, Go-NoGo). An insufficient number of studies investigating oculomotor inhibition between AWS and AWNS were identified for analysis, and no studies on cognitive inhibition in AWS and AWNS were identified for analysis.

In terms of cross-study comparison with children who stutter, Ofoe et al. (2018) reported that significant differences in behavioral inhibition between children who do and do not stutter were restricted to parent report measures. Ofoe et al. reported no group differences on controlled, experimental tasks based on the limited number of studies that incorporated behavioral tasks of inhibition (e.g., Go-NoGo, SST, grass-snow/baa-meow) similar to those included in this meta-analysis, although the number of child studies that included such tasks was low (n = 3). As noted by Ofoe et al., laboratory administration of inhibition tasks may be too challenging for children, particularly complex tasks such as the grass-snow/baa-meow, wherein participants are required to not just withhold response but also interrupt an in-progress response and/or respond in the presence of conflicting information. Findings from the present study suggest that, even for adult participants who are equipped to complete challenging experimental tasks across a larger number of studies (n = 13), participants who stutter performed comparably to nonstuttering participants during behavioral measures of inhibition.

It is important to note that sensitivity analysis indicated that, upon removal of two studies (i.e., Tendera, 2019; Treleaven & Coalson, 2021), AWS performed significantly more poorly than AWNS. Both studies compared stop-signal responses between AWS and AWNS, and unlike the other inhibition studies included in our analyses, both asked participants to provide manual as well as vocal responses. It is possible that averaging across two distinct response modalities resulted in atypically faster overall responses for AWS. However, notable between-group disparities were reported for only one study and in one modality—Treleaven and Coalson (2021; manual responses, difference in means = 66 ms)—with all other between-group differences being similar across studies per modality (differences in means ranged from 4 to 9 ms). No other systematic factor shared by these studies (e.g., presentation modality, stimuli complexity, gender balance, stop-signal calculation procedures) was identified to potentially account for the stronger inhibition by the AWS observed in the studies by Tendera (2019) and Treleaven and Coalson (2021). However, the standardized residuals for these studies (1.28 and 1.52, respectively) did not exceed the standard threshold for removal (> 3) in absolute magnitude (Hedges & Olkin, 1985; Viechtbauer & Cheung, 2010). Furthermore, the significant heterogeneity (Q = 20.09, p = .01) limits confidence in labeling the group performance of these specific AWS cohorts as outliers. Given the wide range in prediction intervals (−1.06 to 0.56), it is difficult to determine the true effect size due to heterogeneity among the groups. As such, the present outcomes suggest that while AWS may exhibit inhibition differences relative to AWNS upon inclusion of additional future studies, current data only provide support to the observation that inhibition skills are heterogeneous in AWS and, across studies, statistically comparable in AWNS.

A further consideration when interpreting findings related to inhibition measures is the expected methodological differences across studies, such as variation in stimulus presentation (i.e., auditory, visual), stimulus type (i.e., semantic, nonsemantic), response modalities (i.e., manual, vocal), and outcome measures. For example, Maxfield (2017) included a combination of semantic and phonemic cues during stimulus presentation. In these cases, we opted to average across conditions rather than further subdivide analyses. In other cases, decisions regarding outcome similarity by the authors were required to bolster comparison across studies. For example, Tendera (2019) used a standard SST but chose a less common outcome measure (i.e., point of subjective equality) rather than SSRT. We elected to include these findings related to “reactive” interference, rather than “proactive” interference, as the underlying processes of reactive interference were akin (but not identical) to standard SSRT outcomes used in other eligible studies. We acknowledge that these decisions, which were made to maintain statistical power and facilitate cross-study comparison, may obscure a more nuanced interpretation of data and potential inhibition differences in AWS. In summary, the present findings did not indicate statistical differences in inhibition in AWS, but the low number of studies, potential publication bias for behavioral inhibition, and heterogeneity across studies provide a window for discovery in future studies examining AWS inhibition skills.

Cognitive Flexibility

As may be recalled, only three studies have reported information on cognitive flexibility in AWS and AWNS within the timeframe and parameters of the systematic review. However, it is noteworthy that none of these three studies reported significant differences in cognitive flexibility between AWS and AWNS. Ofoe et al. (2018) also reported a similarly low number of studies investigating cognitive flexibility in children who stutter (n = 2; search ended in December 2016), which precluded meaningful analysis. It should be noted that a number of studies examining differences in cognitive flexibility between CWS and CWNS (e.g., Anderson et al., 2020; Eichorn & Pirutinsky, 2021; Ntourou et al., 2018; Paphiti & Eggers, 2022; Paphiti et al., 2022) have been conducted since Ofoe et al.'s systematic review. Collectively, and similar to working memory and inhibition, findings from these more recent studies indicate subtle challenges in cognitive flexibility for younger individuals who stutter. Given the interrelationship between each component of executive function skills (see Davidson et al., 2006) and the growth of data in cognitive flexibility in younger children who stutter, further investigation of this component of executive function in AWS and AWNS is warranted.

Executive Function and Stuttering Persistence

The current findings could be interpreted through the lens of the executive function model of developmental stuttering (Anderson & Ofoe, 2019), which posits that stuttering could directly or indirectly tax executive function skills or vice versa. Thus, it is possible that a lifetime of frequent fluency breakdowns could partially, and over time, contribute to weaknesses in executive function skills in AWS. According to Anderson and Ofoe (2019), weaknesses in the components of executive function and aspects of attention could disrupt the development and maintenance of robust linguistic representations in the mental lexicon (e.g., Gathercole, 2006). Subsequently, attempts to retrieve the less stable representations from the mental lexicon for production may interfere with the flow of speech. Within the context of Levelt et al.'s (1999) model of speech planning, stuttering may reflect difficulties in monitoring or maintaining the speech plan within working memory prior to production, resulting in articulatory resets. That is, the assembly of abstract speech plans (i.e., rapid merging of segmental and metrical information stored in the mental lexicon) immediately prior to production may be periodically delayed and exceed the relatively brief timeframe for speech planning prior to (fluent) speech production. In the context of Dell's interactive activation model (Dell et al., 1997) and its most recent update (Foygel & Dell, 2000), lexical selection involves managing activation and competition among candidate representations. Thus, weak inhibitory control or reduced working memory capacity may impair this process, leading to interference between partially activated items and, consequently, disfluent output. Thus, considering the findings from the present study, as well as those of Ofoe et al. (2018) and Doneva (2020) on the differences in selective/focus attention between AWS and AWNS, it is possible that aspects of attention and executive function may contribute to the development and/or persistence of stuttering.

Furthermore, regarding the components of executive function, given the similarity in performance on working memory tasks between CWS and AWS relative to fluent peers, it is tempting to conclude that executive function, particularly phonological working memory, is related to stuttering persistence. Findings extend differences in phonological memory between young children who stutter (Ofoe et al., 2018) and between young children who persist and recover from stuttering (Spencer & Weber-Fox, 2014; Walsh et al., 2021). Collectively, these findings suggest that weaknesses in phonological working memory skills present early in life in children who stutter are also present in adults whose stuttering persisted and therefore provide indirect support of the link between phonological working memory and stuttering persistence (e.g., Walsh et al., 2018). These cross-sectional data, however, cannot directly speak to the trajectory of these individual participants. For example, based on these data, we cannot conclude that participants who performed poorly as adults in these studies also struggled at younger ages. We also cannot account for the executive function skills of those children who no longer stutter but who may, hypothetically, have exhibited similarly low executive function skills as a child. Further comparison of AWS and adults who recovered from stuttering during childhood would be required to confirm the potential link between persistence and phonological working memory. Additional considerations—such as concomitant phonological or language issues as well as treatment history—would also warrant attention as potential mediating factors. Together, a more comprehensive account would be to include all studies, across ages, and account for persistence rather than, or in addition to, age.

Executive Function Skills, Psychosocial Functioning, and Quality of Life in AWS

As previously noted, executive function skills are linked to psychosocial development, mental health, and overall quality of life across the lifespan (Brown & Landgraf, 2010; Miller et al., 2011). Theoretically, weaknesses in aspects of attention and executive function skills could compromise the efficient planning and execution of speech-language production in people who stutter. In the event of an increased cognitive load, these weaknesses may lead to disruptions in the fluency of their speech-language production. Conversely, people who stutter may attempt to counteract the frequent speech breakdowns by excessively relying on their diminished executive function and attentional resources to produce fluent speech, which may further weaken their cognitive skills and impact other processes, including socioemotional skills (see Anderson & Ofoe, 2019; Choo et al., 2020). Given that speech and communication skills are essential for social and emotional functioning (Rautakoski et al., 2021; van Barreveld et al., 2025), disruptions in the speech fluency of individuals who stutter may increase their risk of developing speech-related anxiety (Blumgart et al., 2010; Cabel et al., 2002; Craig & Tran, 2006; Iverach et al., 2011), which can impact the overall effectiveness of verbal communication in social settings. Over time, the vulnerabilities and disruptions in the flow of speech during social interactions may cause or exacerbate feelings of shame, embarrassment, and lower self-esteem, which may lead to withdrawal or avoidance of social situations and impact their education, career choices, and overall quality of life (Craig et al., 2009; Daniels & Gabel, 2004; Hayhow et al., 2002; Koedoot et al., 2011; Langevin & Prasad, 2012). Beyond the potential relationship between speech fluency and executive function, it is possible that negative experiences with stuttering may be mediated by executive function. For example, AWS have reported increased repetitive negative thinking related to previous communication experiences (Tichenor & Yaruss, 2020)—a process linked to one's (in)efficiency when updating working memory in AWNS (Zetsche et al., 2018). However, Coalson et al. (2025) found that skills related to updating the content of working memory were not associated with increased rumination in AWS. Although further investigation of the link between executive function and quality of life in AWS has yet to reach consensus, there is clear potential to explore this hypothesis in future studies.

Limitations and Future Studies

There are a number of limitations that warrant consideration. First, the age of inclusion was typically 18 years, which may be too young to capture significant milestones in executive function (e.g., Korzeniowski et al., 2021). Second, as is common in meta-analyses, these findings represent group trends rather than individual differences in executive function skills. That is, these outcomes should not be seen as prescriptive to all individuals who stutter but rather as overall differences that inform factors that contribute to, and inform, stuttering as a condition. We also acknowledge that the measures used in these studies are, for the most part, broad metrics that require a behavioral response from the participants. To maintain a sufficient sample size for robust meta-analyses, studies were grouped by methodology in accordance with the overarching theoretical framework within each domain of executive function (e.g., Baddeley and Hitch's framework of working memory, Nigg's taxonomy of inhibition). The outcomes of this study, however, cannot account for group differences attributable to study-specific variations or modifications of tasks (e.g., stimulus presentation, response modality, calculation of outcome measures). Even during the administration of more standardized measures (e.g., the nonword repetition task on CTOPP), outcomes may be influenced by variations in speech rate, strategy use, examiner familiarity, fluent versus disfluent trials, and other factors. Thus, although valuable, we acknowledge that such factors are typically allowed to vary freely across studies and may obscure differences (or lack thereof) obtained through these subtle but important variations. Furthermore, such behavioral measures, while they are direct measures of the participants' performance, may not be sensitive enough to capture underlying differences more apt for examination by neurophysiological measures. Additionally, future studies may consider incorporating and reporting self-report measures, which involve reflections about thoughts, feelings, and behavior in different situations, in addition to the behavioral measures to provide a more comprehensive perspective of executive function skills between AWS and AWNS (see Dang et al., 2020).

Also, differences in participant characteristics related to executive function (e.g., language skills, attention, nonverbal intelligence) may also help explain the observed talker-group differences. In this meta-analysis, less than 50% of the primary studies (i.e., 17 out of 39) included a language measure. Even fewer studies screened for attention-deficit/hyperactivity disorder (ADHD) in the participants (e.g., Tichenor, 2020; Treleaven & Coalson, 2021), despite the intersection between ADHD and stuttering (Tichenor et al., 2021), or addressed co-occurring mental health concerns that may be related to atypical executive function skills (e.g., depression: Zetsche et al., 2018; anxiety: Moran, 2016). Another subject-level characteristic that could also be contributing to variability across studies is nonverbal intelligence. However, only six of the 39 studies (Eichorn et al., 2016; Elsherif et al., 2021; Gkalitsiou, 2018; McGill, 2016; Tichenor, 2020; Treleaven & Coalson, 2021) included a measure of nonverbal intelligence—five using the TONI-4 (Brown et al., 2010) and one using Raven's Standard Progressive Matrices (Raven, 1960). Thus, to strengthen future meta-analytic work and improve comparability across studies, researchers could incorporate potential moderators into their designs and report descriptive data on relevant variables (e.g., attentional skills, nonverbal intelligence) alongside primary outcomes.

Conclusions

The purpose of this study was to conduct a meta-analytic review of the primary components of the executive function skills of AWS relative to AWNS. The present findings suggest that AWS, as a group, differ subtly yet significantly from AWNS in working memory. Differences in inhibition did not reach significance, although outliers and potential publication bias were detected. At present, insufficient data are available to conduct meaningful cross-study comparisons of cognitive flexibility, and future investigations may consider focusing on this executive function subdomain in adults who stutter.

Data Availability Statement

All data generated or analyzed during this study are included in this published article.

Acknowledgments

This research was partly supported by a research seed grant from the University of West Georgia's College of Education, awarded to Levi C. Ofoe; a grant from the Oklahoma Center for the Advancement of Science and Technology (HR 21-052-2) awarded to Katerina Ntourou; and a grant from the National Institutes of Health (R21DC018109-01A1) awarded to Geoffrey A. Coalson.

Appendix A

Talker Classification Criteria Across Studies That Examined Executive Function Between Adults Who Do (AWS) and Do Not Stutter (AWNS)

Study Group SSI-3 /SSI-4 % SLD Likert SEV scales OASES Self-ID SLP judgment Other Total criteria
Baird (1996) AWS s 1
AWNS 1
Bakhtiar & Eggers (2023) AWS c+r 2
AWNS 1
Bauerly (2012) AWS NR
AWNS NR
Brocklehurst & Corley (2011)a AWS c+r r Self-ratings of hierarchical “fluency difficulty” 4
AWNS r 2
B. T. Brown (2015) AWS Self-reported history of SLDs 2
AWNS NR
Byrd et al. (2015) AWS c+r 2
AWNS NR
Byrd et al. (2017) AWS s s 4
AWNS NR
Coalson & Byrd (2015) AWS c+r 3
AWNS 2
Coalson & Byrd (2015) AWS c+r 3
AWNS 2
Coalson et al. (2017) AWS c 3
AWNS NR
Eichorn et al. (2016)b AWS s 4
AWNS 1
Elsherif et al. (2021) AWS c+r 2
AWNS NR
Garnett et al. (2023) AWS s 1
AWNS 1
Gkalitsiou (2018) AWS s 3
AWNS 2
Gkalitsiou et al. (2020) AWS r 2
AWNS NR
Haj-Tas (2007) AWS c Self-rated severity across situations 1
AWNS 1
Höbler et al. (2023) AWS c+r 2
AWNS NR
Jackson (2012) AWS Support group membership 1
AWNS NR
Lescht (2017) AWS c+r 1
AWNS NR
Markett et al. (2016) AWS 2
AWNS No family history 2
Maxfield (2017) AWS c 2
AWNS 1
McGill et al. (2016)a AWS 2
AWNS NR
McGill et al. (2016) AWS s s 4
AWNS 3
Mersov et al. (2016) AWS s 2
AWNS 1
Ning et al. (2017) AWS s 2
AWNS 1
Sasisekaran (2013) AWS s 2
AWNS 2
Sasisekaran & Weisberg (2014) AWS s 1
AWNS 2
Schwartze & Kotz (2020) AWS c 1
AWNS 1
Smits-Bandstra & Gracco (2015)b AWS c+r 2
AWNS 1
Subramanian (2001) AWS c 3
AWNS 3
Tendera (2019) AWS c+r 2
AWNS 1
Tichenor (2020) AWS 1
AWNS 1
Treleaven & Coalson (2021) AWS s 3
AWNS 3
Vallely (2011) AWS s 1
AWNS 1
Van der Linden (2013) AWS 1
AWNS 1
Van Lieshout et al. (2014) AWS c+r 2
AWNS 2
Vincent et al. (2012) AWS c 2
AWNS 3
Webster (1990) AWS Self-reported severity overall 2
AWNS NR
Wiltshire (2019) AWS s 1
AWNS NR
Yang et al. (2019) AWS s 2
AWNS NR
Total AWS 24 5 9 4 24 19 0
AWNS 3 5 3 3 19 9 0

Note. SLP judgment includes in-experiment confirmation and prior diagnosis and/or treatment by a licensed SLP. SSI-3/SSI-4 = Stuttering Severity Instrument–Third Edition (Riley, 1994) and Stuttering Severity Instrument–Fourth Edition (Riley, 2009); %SLD = percentage of stuttering-like disfluencies within speech sample(s); SEV = stuttering severity; OASES = Overall Assessment of the Speaker's Experience of Stuttering (Yaruss & Quesal, 2010); ID = identification; SLP = speech-language pathologist; s = speech sample (unspecified); c = conversational sample; r = reading sample; NR = criteria not reported.

a

Groups differentiated by significant difference on OASES-Section IIIA.

b

Groups differentiated by significant differences on SSI-4 raw score.

Appendix B

The Date and Search Strategy for the Study

Search Executed: May 27, 2022

Search Sets Forwarded for Review: #24 | #25

Database: Ovid MEDLINE(R) and Epub Ahead of Print, In-Process, In-Data-Review & Other Non-Indexed Citations and Daily <1946 to May 26, 2022>

Search Strategy:

  1. Stuttering/

  2. stutter$.mp.

  3. stammer$.mp.

  4. or/1–3

  5. Executive Function/

  6. (exec$ function$ or exec$ control$).mp.

  7. central$ exec$.mp.

  8. Memory, Short-Term/

  9. (memor$ adj2 (working$ or short-term$ or shortterm$ or short$ term$ or immediat$)).mp.

  10. Mental Recall/

  11. recall$.mp.

  12. exp Inhibition, Psychological/

  13. inhibit$.mp.

  14. exp Reaction Time/

  15. (react$ adj2 tim$).mp.

  16. (cognit$ adj2 (load$ or flexib$ or stress$)).mp.

  17. (repetiti$ adj2 (non-word$ or nonword$)).mp.

  18. (dual$ task$ or phoneme$ elision$ or information$ updat$ or phonolog$ encod$ or n-back$ or nback$ or stop$ signal$ or phon$ complex$).mp.

  19. monitoring$.mp.

  20. or/5–19

  21. 4 and 20

  22. remove duplicates from 21

  23. (adult$ or man or men or woman or women or adolescen$ or teen$ or youth$).mp.

  24. 22 and 23 (310)

  25. 22 not 24 (236)

Search Executed: September 2, 2022

Search Set Forwarded for Review: #10

Database: ProQuest Dissertations & Theses Global

Search Strategy:

  1. noft(stutter* OR stammer*)

  2. noft(“exec* function*” OR “exec* control*” OR “central* exec*”)

  3. noft(memor* N/2 (working* OR “short* term*” OR shortterm* OR immediat*))

  4. noft(recall* OR inhibit*)

  5. noft(react* N/2 tim*)

  6. noft(cognit* N/2 (load* OR flexib* OR stress*))

  7. noft(repetiti* N/2 (“non-word*” OR nonword*))

  8. noft(“dual* task*” OR “phoneme* elision*” OR “information* updat*” OR “phonolog* encod*” OR “n-back*” OR nback* OR “stop* signal*” OR “phon* complex*”)

  9. 2 OR 3 OR 4 OR 5 OR 6 OR 7 OR 8

  10. 1 AND 9

Appendix C

A Glossary of the Meta-Analysis Terms Used to Examine Executive Function Between Adults Who Do and Do Not Stutter

Term Meaning
Hedges's g An effect size measure that expresses the standardized mean difference between two groups, adjusted for small sample size bias.
Cochrane Q (Q statistic) A test of heterogeneity in meta-analysis that assesses whether the variability in effect sizes across studies is greater than would be expected by chance, indicating true differences rather than random sampling error.
I 2 index A statistic that specifies the degree of actual variation rather than random chance among studies in a meta-analysis.
Publication bias The systematic error that occurs when published literature overrepresents studies with significant or favorable results, leading to a biased estimation of the true effect.
Funnel plot Visual tool assessing the possible existence of publication bias.
Duval and Tweedie's trim-and-fill method An analytic technique that tests for funnel plot asymmetry and imputes missing studies to evaluate the impact of publication bias.
Sensitivity analysis A rigorous method used to examine whether the results or direction of effects are unduly influenced by specific studies.
Egger's regression A linear regression test used to examine the potential publication bias via a funnel plot asymmetry.
Random-effects model A type of meta-analysis model that considers the actual effect size variation from study to study due to population differences, rather than assuming that each study shares the same effect size due to samples drawn from the same population.
“One study–removed” analysis A meta-analytic technique where the overall effect size is recalculated repeatedly, each time omitting a single study, to assess whether results are disproportionately influenced by any one study.

Funding Statement

This research was partly supported by a research seed grant from the University of West Georgia's College of Education, awarded to Levi C. Ofoe; a grant from the Oklahoma Center for the Advancement of Science and Technology (HR 21-052-2) awarded to Katerina Ntourou; and a grant from the National Institutes of Health (R21DC018109-01A1) awarded to Geoffrey A. Coalson.

Footnotes

1

Some researchers associate interference control with attention (Friedman & Miyake, 2004). However, attention as a cognitive construct was not considered in the present study (see Doneva, 2020, for a meta-analytic review of attention skills in AWS and AWNS).

2

Behavioral/motor inhibition tasks can be further categorized into simple and complex, with the Go-NoGo task and SSTs classified as simple response inhibition tasks as these tasks do not require a subdominant response (Garon et al., 2008).

References

References denoted with asterisks (*) indicate studies included in the meta-analysis.

  1. Anderson, J. D., & Ofoe, L. C. (2019). The role of executive function in developmental stuttering. Seminars in Speech and Language , 40, 305–319. 10.1055/s-0039-1692965 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Anderson, J. D., & Wagovich, S. A. (2017). Explicit and implicit verbal response inhibition in preschool-age children who stutter. Journal of Speech, Language, and Hearing Research, 60(4), 836–852. 10.1044/2016_JSLHR-S-16-0135 [DOI] [Google Scholar]
  3. Anderson, J. D., Wagovich, S. A., & Ofoe, L. (2020). Cognitive flexibility for semantic and perceptual information in developmental stuttering. Journal of Speech, Language, and Hearing Research , 63(11), 3659–3679. 10.1044/2020_JSLHR-19-00119 [DOI] [Google Scholar]
  4. Baas, M., De Dreu, C. K., & Nijstad, B. A. (2008). A meta-analysis of 25 years of mood-creativity research: Hedonic tone, activation, or regulatory focus? Psychological Bulletin, 134(6), 779–806. 10.1037/a0012815 [DOI] [PubMed] [Google Scholar]
  5. Baddeley, A. (2000). Short-term and working memory. In Tulving E. & Craik F. I. M. (Eds.), The Oxford handbook of memory (Vol. 4, pp. 77–92). Oxford University Press. 10.1093/oso/9780195122657.003.0005 [DOI] [Google Scholar]
  6. Baddeley, A. (2004). Working memory. In Balota D. A. & Marsh E. J. (Eds.), Cognitive psychology: Key readings (pp. 355–361). Psychology Press. [Google Scholar]
  7. Baddeley, A. (2012). Working memory: Theories, models, and controversies. Annual Review of Psychology , 63(1), 1–29. 10.1146/annurev-psych-120710-100422 [DOI] [Google Scholar]
  8. Baddeley, A., Gathercole, S., & Papagno, C. (1998). The phonological loop as a language learning device. Psychological Review , 105(1), 158–173. 10.1037/0033-295x.105.1.158 [DOI] [PubMed] [Google Scholar]
  9. Baddeley, A. D. (2002). Is working memory still working? European Psychologist, 7(2), 85–97. 10.1027/1016-9040.7.2.85 [DOI] [Google Scholar]
  10. Baddeley, A. D., & Hitch, G. (1974). Working memory. In Bower G. H.(Ed.), The psychology of learning and motivation (Vol. 8, pp. 47–89). >Academic Press. 10.1016/S0079-7421(08)60452-1 [DOI] [Google Scholar]
  11. Baddeley, A. D., & Hitch, G. J. (1994). Developments in the concept of working memory. Neuropsychology , 8(4), 485–493. 10.1037/0894-4105.8.4.485 [DOI] [Google Scholar]
  12. *. Baird, B. R. (1996). EEG activation in the frontal lobes of stutterers and nonstutterers during working memory tasks [Unpublished doctoral dissertation/master's thesis] . Virginia Polytechnic and State University. [Google Scholar]
  13. Bajaj, A. (2007). Working memory involvement in stuttering: Exploring the evidence and research implications. Journal of Fluency Disorders, 32(3), 218–238. 10.1016/j.jfludis.2007.03.002 [DOI] [PubMed] [Google Scholar]
  14. *. Bakhtiar, M., & Eggers, K. (2023). Exogenous verbal response inhibition in adults who do and do not stutter. Journal of Fluency Disorders, 75, Article 105957. 10.1016/j.jfludis.2022.105957 [DOI] [Google Scholar]
  15. Banich, M. T., & Depue, B. E. (2015). Recent advances in understanding neural systems that support inhibitory control. Current Opinion in Behavioral Sciences , 1, 17–22. 10.1016/j.cobeha.2014.07.006 [DOI] [Google Scholar]
  16. *. Bauerly, K. R. (2012). Motor learning abilities in adults who stutter [Unpublished doctoral dissertation/master's thesis]. University of Toronto. [Google Scholar]
  17. Berninger, V., Abbott, R., Cook, C. R., & Nagy, W. (2017). Relationships of attention and executive functions to oral language, reading, and writing skills and systems in middle childhood and early adolescence. Journal of Learning Disabilities , 50(4), 434–449. 10.1177/0022219415617167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Blumgart, E., Tran, Y., & Craig, A. (2010). Social anxiety disorder in adults who stutter. Depression and Anxiety, 27(7), 687–692. 10.1002/da.20657 [DOI] [PubMed] [Google Scholar]
  19. Borenstein, M. (2020). Research note: In a meta-analysis, the I2 index does not tell us how much the effect size varies across studies. Journal of Physiotherapy, 66(2), 135–139. 10.1016/j.jphys.2020.02.011 [DOI] [PubMed] [Google Scholar]
  20. Borenstein, M., Hedges, L. V., Higgins, J. P., & Rothstein, H. R. (2021). Introduction to meta-analysis. Wiley. [Google Scholar]
  21. Borenstein, M., Hedges, L. V., Higgins, J. P., & Rothstein, H. R. (2022). Comprehensive meta-analysis (Version 4) [Computer software]. Biostat.
  22. Bowie, C. R., & Harvey, P. D. (2006). Administration and interpretation of the Trail Making Test. Nature Protocols, 1(5), 2277–2281. 10.1038/nprot.2006.390 [DOI] [PubMed] [Google Scholar]
  23. Boyle, M. P. (2015). Relationships between psychosocial factors and quality of life for adults who stutter. American Journal of Speech-Language Pathology, 24(1), 1–12. 10.1044/2014_AJSLP-14-0089 [DOI] [PubMed] [Google Scholar]
  24. *. Brocklehurst, P. H., & Corley, M. (2011). Investigating the inner speech of people who stutter: Evidence for (and against) the covert repair hypothesis. Journal of Communication Disorders, 44(2), 246–260. 10.1016/j.jcomdis.2010.11.004 [DOI] [PubMed] [Google Scholar]
  25. *. Brown, B. T. (2015). Neurocorrelates of speech–motor planning and execution in adults and children who stutter [Unpublished doctoral dissertation/master's thesis]. University of Iowa. [Google Scholar]
  26. Brown, L., Sherbenou, R. J., & Johnsen, S. K. (2010). Test of Nonverbal Intelligence–Fourth Edition (TONI-4). Pro-Ed. [Google Scholar]
  27. Brown, T. E., & Landgraf, J. M. (2010). Improvements in executive function correlate with enhanced performance and functioning and health-related quality of life: Evidence from 2 large, double-blind, randomized, placebo-controlled trials in ADHD. Postgraduate Medicine , 122(5), 42–51. 10.3810/pgm.2010.09.2200 [DOI] [PubMed] [Google Scholar]
  28. *. Byrd, C. T., Coalson, G. A., Yang, J., & Moriarty, K. (2017). The effect of phonetic complexity on the speed of single-word productions in adults who do and do not stutter. Journal of Communication Disorders, 69, 94–105. 10.1016/j.jcomdis.2017.06.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. *. Byrd, C. T., McGill, M., & Usler, E. (2015). Nonword repetition and phoneme elision in adults who do and do not stutter: Vocal versus nonvocal performance differences. Journal of Fluency Disorders, 44, 17–31. 10.1016/j.jfludis.2015.01.004 [DOI] [PubMed] [Google Scholar]
  30. Byrd, C. T., Vallely, M., Anderson, J. D., & Sussman, H. (2012). Nonword repetition and phoneme elision in adults who do and do not stutter. Journal of Fluency Disorders , 37(3), 188–201. 10.1016/j.jfludis.2012.03.003 [DOI] [PubMed] [Google Scholar]
  31. Cabel, R. M., Colcord, R. D., & Petrosino, L. (2002). Self-reported anxiety of adults who do and do not stutter. Perceptual and Motor Skills, 94(3), 775–784. 10.2466/pms.2002.94.3.775 [DOI] [PubMed] [Google Scholar]
  32. Carlson, S. M. (2005). Developmentally sensitive measures of executive function in preschool children. Developmental Neuropsychology , 28(2), 595–616. 10.1207/s15326942dn2802_3 [DOI] [PubMed] [Google Scholar]
  33. Choo, A. L., Smith, S. A., & Li, H. (2020). Associations between stuttering, comorbid conditions and executive function in children: A population-based study. BMC Psychology, 8(1), Article 113. 10.1186/s40359-020-00481-7 [DOI] [Google Scholar]
  34. Chun, M. M., Golomb, J. D., & Turk-Browne, N. B. (2011). A taxonomy of external and internal attention. Annual Review of Psychology, 62(1), 73–101. 10.1146/annurev.psych.093008.100427 [DOI] [Google Scholar]
  35. *. Coalson, G. A., & Byrd, C. T. (2015). Metrical encoding in adults who do and do not stutter. Journal of Speech, Language, and Hearing Research, 58(3), 601–621. 10.1044/2015_JSLHR-S-14-0111 [DOI] [Google Scholar]
  36. Coalson, G. A., & Byrd, C. T. (2017). Nonword repetition in adults who stutter: The effects of stimuli stress and auditory–orthographic cues. PLOS ONE , 12(11), Article e0188111. 10.1371/journal.pone.0188111 [DOI] [Google Scholar]
  37. *. Coalson, G. A., Byrd, C. T., & Kuylen, A. (2017). Uniqueness point effects during speech planning in adults who do and do not stutter. Folia Phoniatrica et Logopaedica, 69(5–6), 191–208. 10.1159/000485657 [DOI] [PubMed] [Google Scholar]
  38. Coalson, G. A., Croft, R., & Byrd, C. T. (2025). Working memory capacity and rumination in adults who stutter. Journal of Fluency Disorders, 84, Article 106119. 10.1016/j.jfludis.2025.106119 [DOI] [Google Scholar]
  39. Cowan, N. (2008). What are the differences between long-term, short-term, and working memory? Progress in Brain Research , 169, 323–338. 10.1016/S0079-6123(07)00020-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Cowan, N. (2017). The many faces of working memory and short-term storage. Psychonomic Bulletin & Review, 24(4), 1158–1170. 10.3758/s13423-016-1191-6 [DOI] [PubMed] [Google Scholar]
  41. Cowan, N., Elliott, E. M., Saults, J. S., Morey, C. C., Mattox, S., Hismjatullina, A., & Conway, A. R. (2005). On the capacity of attention: Its estimation and its role in working memory and cognitive aptitudes. Cognitive Psychology, 51(1), 42–100. 10.1016/j.cogpsych.2004.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Cragg, L., & Chevalier, N. (2012). The processes underlying flexibility in childhood. The Quarterly Journal of Experimental Psychology , 65(2), 209–232. 10.1080/17470210903204618 [DOI] [PubMed] [Google Scholar]
  43. Craig, A., Blumgart, E., & Tran, Y. (2009). The impact of stuttering on the quality of life in adults who stutter. Journal of Fluency Disorders, 34(2), 61–71. 10.1016/j.jfludis.2009.05.002 [DOI] [PubMed] [Google Scholar]
  44. Craig, A., & Tran, Y. (2006). Fear of speaking: Chronic anxiety and stammering. Advances in Psychiatric Treatment, 12(1), 63–68. 10.1192/apt.12.1.63 [DOI] [Google Scholar]
  45. Dang, J., King, K. M., & Inzlicht, M. (2020). Why are self-report and behavioral measures weakly correlated? Trends in Cognitive Sciences, 24(4), 267–269. 10.1016/j.tics.2020.01.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Daniels, D. E., & Gabel, R. M. (2004). The impact of stuttering on identity construction. Topics in Language Disorders, 24(3), 200–215. 10.1097/00011363-200407000-00007 [DOI] [Google Scholar]
  47. Davidson, M. C., Amso, D., Anderson, L. C., & Diamond, A. (2006). Development of cognitive control and executive functions from 4 to 13 years: Evidence from manipulations of memory, inhibition, and task switching. Neuropsychologia , 44(11), 2037–2078. 10.1016/j.neuropsychologia.2006.02.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Dell, G. S., Nozari, N., & Oppenheim, G. M. (2014). Lexical access: Behavioral and computational considerations. In Ferreira V., Goldrick M., & Miozzo M. (Eds.), The Oxford handbook of language production (pp. 88–104). Oxford University Press. [Google Scholar]
  49. Dell, G. S., Schwartz, M. F., Martin, N., Saffran, E. M., & Gagnon, D. A. (1997). Lexical access in aphasic and nonaphasic speakers. Psychological Review, 104(4), 801–838. 10.1037/0033-295X.104.4.801 [DOI] [PubMed] [Google Scholar]
  50. Dempster, F. N. (1981). Memory span: Sources of individual and developmental differences. Psychological Bulletin , 89(1), 63–100. 10.1037/0033-2909.89.1.63 [DOI] [Google Scholar]
  51. Dempster, F. N. (1993). Resistance to interference: Developmental changes in a basic processing mechanism. In Howe M. L. & Pasnak R. (Eds.), Emerging themes in cognitive development: Volume 1. Foundations (pp. 3–27). Springer-Verlag. 10.1007/978-1-4613-9220-0_1 [DOI] [Google Scholar]
  52. D'Esposito, M., & Postle, B. R. (2015). The cognitive neuroscience of working memory. Annual Review of Psychology , 66(1), 115–142. 10.1146/annurev-psych-010814-015031 [DOI] [Google Scholar]
  53. Diamond, A. (1988). Abilities and neural mechanisms underlying AB performance. Child Development , 59(2), 523–527. 10.2307/1130330 [DOI] [PubMed] [Google Scholar]
  54. Diamond, A. (2013). Executive functions. Annual Review of Psychology , 64(1), 135–168. 10.1146/annurev-psych-113011-143750 [DOI] [Google Scholar]
  55. Doebel, S., & Zelazo, P. D. (2015). A meta-analysis of the Dimensional Change Card Sort: Implications for developmental theories and the measurement of executive function in children. Developmental Review, 38, 241–268. 10.1016/j.dr.2015.09.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Dollaghan, C., & Campbell, T. F. (1998). Nonword repetition and child language impairment. Journal of Speech, Language, and Hearing Research, 41(5), 1136–1146. 10.1044/jslhr.4105.1136 [DOI] [Google Scholar]
  57. Doneva, S., Davis, S., & Cavenagh, P. (2018). Comparing the performance of people who stutter and people who do not stutter on the Test of Everyday Attention. Journal of Clinical and Experimental Neuropsychology , 40(6), 544–558. 10.1080/13803395.2017.1386162 [DOI] [PubMed] [Google Scholar]
  58. Doneva, S. P. (2020). Adult stuttering and attentional ability: A meta-analytic review. International Journal of Speech-Language Pathology , 22(4), 444–453. 10.1080/17549507.2019.1665710 [DOI] [PubMed] [Google Scholar]
  59. Duval, S., & Tweedie, R. (2000). Trim and fill: A simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics , 56(2), 455–463. 10.1111/j.0006-341x.2000.00455.x [DOI] [PubMed] [Google Scholar]
  60. Eggers, K., De Nil, L. F., & Van den Bergh, B. R. (2013). Inhibitory control in childhood stuttering. Journal of Fluency Disorders, 38(1), 1–13. 10.1016/j.jfludis.2012.10.001 [DOI] [PubMed] [Google Scholar]
  61. Eichorn, N., Hall, J., & Marton, K. (2023). Complex working memory in adults with and without stuttering disorders: Performance patterns and predictive relationships. Journal of Fluency Disorders, 77, Article 105993. 10.1016/j.jfludis.2023.105993 [DOI] [Google Scholar]
  62. Eichorn, N., Marton, K., & Pirutinsky, S. (2018). Cognitive flexibility in preschool children with and without stuttering disorders. Journal of Fluency Disorders, 57, 37–50. 10.1016/j.jfludis.2017.11.001 [DOI] [PubMed] [Google Scholar]
  63. *. Eichorn, N., Marton, K., Schwartz, R. G., Melara, R. D., & Pirutinsky, S. (2016). Does working memory enhance or interfere with speech fluency in adults who do and do not stutter? Evidence from a dual-task paradigm. Journal of Speech, Language, and Hearing Research, 59(3), 415–429. 10.1044/2015_JSLHR-S-15-0249 [DOI] [Google Scholar]
  64. Eichorn, N., & Pirutinsky, S. (2021). Cognitive flexibility and effortful control in school-age children with and without stuttering disorders. Journal of Speech, Language, and Hearing Research , 64(3), 823–838. 10.1044/2020_JSLHR-20-00440 [DOI] [Google Scholar]
  65. *. Elsherif, M. M., Wheeldon, L. R., & Frisson, S. (2021). Do dyslexia and stuttering share a processing deficit? Journal of Fluency Disorders, 67, Article 105827. 10.1016/j.jfludis.2020.105827 [DOI] [Google Scholar]
  66. Engle, R. W. (2002). Working memory capacity as executive attention. Current Directions in Psychological Science, 11(1), 19–23. 10.1111/1467-8721.00160 [DOI] [Google Scholar]
  67. Engle, R. W., Tuholski, S. W., Laughlin, J. E., & Conway, A. R. (1999). Working memory, short-term memory, and general fluid intelligence: A latent-variable approach. Journal of Experimental Psychology: General , 128(3), 309–331. 10.1037/0096-3445.128.3.309 [DOI] [PubMed] [Google Scholar]
  68. Eriksen, B. A., & Eriksen, C. W. (1974). Effects of noise letters upon the identification of a target letter in a nonsearch task. Perception & Psychophysics , 16(1), 143–149. 10.3758/BF03203267 [DOI] [Google Scholar]
  69. Eriksen, C. W. (1995). The flankers task and response competition: A useful tool for investigating a variety of cognitive problems. Visual Cognition , 2(2–3), 101–118. 10.1080/13506289508401726 [DOI] [Google Scholar]
  70. Estes, K. G., Evans, J. L., & Else-Quest, N. M. (2007). Differences in the nonword repetition performance of children with and without specific language impairment: A meta-analysis. Journal of Speech, Language, and Hearing Research , 50(1), 177–195. 10.1044/1092-4388(2007/015) [DOI] [Google Scholar]
  71. Everling, S., & Fischer, B. (1998). The antisaccade: A review of basic research and clinical studies. Neuropsychologia , 36(9), 885–899. 10.1016/S0028-3932(98)00020-7 [DOI] [PubMed] [Google Scholar]
  72. Fan, J., McCandliss, B. D., Sommer, T., Raz, A., & Posner, M. I. (2002). Testing the efficiency and independence of attentional networks. Journal of Cognitive Neuroscience, 14(3), 340–347. 10.1162/089892902317361886 [DOI] [PubMed] [Google Scholar]
  73. Ferguson, H. J., Brunsdon, V. E. A., & Bradford, E. E. F. (2021). The developmental trajectories of executive function from adolescence to old age. Scientific Reports, 11(1), Article 1382. 10.1038/s41598-020-80866-1 [DOI] [Google Scholar]
  74. Foygel, D., & Dell, G. S. (2000). Models of impaired lexical access in speech production. Journal of Memory and Language, 43(2), 182–216. 10.1006/jmla.2000.2716 [DOI] [Google Scholar]
  75. Friedman, N. P., & Miyake, A. (2004). The relations among inhibition and interference control functions: A latent-variable analysis. Journal of Experimental Psychology: General, 133(1), 101–135. 10.1037/0096-3445.133.1.101 [DOI] [PubMed] [Google Scholar]
  76. *. Garnett, E. O., McAuley, J. D., Wieland, E. A., Chow, H. M., Zhu, D. C., Dilley, L. C., & Chang, S.-E. (2023). Auditory rhythm discrimination in adults who stutter: An fMRI study. Brain and Language, 236, Article 105219. 10.1016/j.bandl.2022.105219 [DOI] [Google Scholar]
  77. Garon, N., Bryson, S. E., & Smith, I. M. (2008). Executive function in preschoolers: A review using an integrative framework. Psychological Bulletin , 134(1), 31–60. 10.1037/0033-2909.134.1.31 [DOI] [PubMed] [Google Scholar]
  78. Gathercole, S. E. (2006). Nonword repetition and word learning: The nature of the relationship. Applied Psycholinguistics , 27(4), 513–543. 10.1017/S0142716406060383 [DOI] [Google Scholar]
  79. Gathercole, S. E., Pickering, S. J., Knight, C., & Stegmann, Z. (2004). Working memory skills and educational attainment: Evidence from National Curriculum Assessments at 7 and 14 years of age. Applied Cognitive Psychology , 18(1), 1–16. 10.1002/acp.934 [DOI] [Google Scholar]
  80. *. Gkalitsiou, Z. (2018). Phonological working memory in adults who stutter [Unpublished doctoral dissertation/master's thesis]. The University of Texas at Austin. [Google Scholar]
  81. *. Gkalitsiou, Z., Byrd, C., & Griffin, Z. (2020). Executive control in adults who stutter: The antisaccade task. Journal of Speech, Language, and Hearing Research , 63(6), 1688–1699. 10.1044/2020_JSLHR-19-00045 [DOI] [Google Scholar]
  82. Grant, D. A., & Berg, E. (1948). A behavioral analysis of degree of reinforcement and ease of shifting to new responses in a Weigl-type card-sorting problem. Journal of Experimental Psychology , 38(4), 404–411. 10.1037/h0059831 [DOI] [PubMed] [Google Scholar]
  83. Gupta, P. (2003). Examining the relationship between word learning, nonword repetition, and immediate serial recall in adults. The Quarterly Journal of Experimental Psychology: Human Experimental Psychology , 56(7), 1213–1236. 10.1080/02724980343000071 [DOI] [PubMed] [Google Scholar]
  84. *. Haj-Tas, M. (2007). Effect of syntactic structure on speech production in adults who stutter [Unpublished doctoral dissertation/master's thesis]. University of Florida. [Google Scholar]
  85. Hall, P. A., & Fong, G. T. (2015). Temporal self-regulation theory: A neurobiologically informed model for physical activity behavior. Frontiers in Human Neuroscience, 9, Article 117. 10.3389/fnhum.2015.00117 [DOI] [Google Scholar]
  86. Hallett, P. E. (1978). Primary and secondary saccades to goals defined by instructions. Vision Research , 18(10), 1279–1296. 10.1016/0042-6989(78)90218-3 [DOI] [PubMed] [Google Scholar]
  87. Harnishfeger, K. K. (1995). The development of cognitive inhibition: Theories, definitions, and research evidence. In Dempster F. N. & Brainerd C. J. (Eds.), Interference and inhibition in cognition (pp. 175–204). Academic Press. 10.1016/B978-012208930-5/50007-6 [DOI] [Google Scholar]
  88. Hayhow, R., Cray, A. M., & Enderby, P. (2002). Stammering and therapy views of people who stammer. Journal of Fluency Disorders , 27(1), 1–17. 10.1016/S0094-730X(01)00102-4 [DOI] [PubMed] [Google Scholar]
  89. Hedges, L. V., & Olkin, I. (1985). Statistical methods for meta-analysis . Academic Press. [Google Scholar]
  90. Higgins, J. P., Thompson, S. G., Deeks, J. J., & Altman, D. G. (2003). Measuring inconsistency in meta-analyses. BMJ , 327(7414), Article 557. 10.1136/bmj.327.7414.557 [DOI] [Google Scholar]
  91. *. Höbler, F., Bitan, T., Tremblay, L., & De Nil, L. (2023). Explicit benefits: Motor sequence acquisition and short-term retention in adults who do and do not stutter. Journal of Fluency Disorders, 75, Article 105959. 10.1016/j.jfludis.2023.105959 [DOI] [Google Scholar]
  92. Hommel, B., Chapman, C. S., Cisek, P., Neyedli, H. F., Song, J.-H., & Welsh, T. N. (2019). No one knows what attention is. Attention, Perception, & Psychophysics, 81(7), 2288–2303. 10.3758/s13414-019-01846-w [DOI] [Google Scholar]
  93. Hopper, L. M., Jacobson, S. L., & Howard, L. H. (2020). Problem solving flexibility across early development. Journal of Experimental Child Psychology, 200, Article 104966. 10.1016/j.jecp.2020.104966 [DOI] [Google Scholar]
  94. Howard, S. J., Johnson, J., & Pascual-Leone, J. (2014). Clarifying inhibitory control: Diversity and development of attentional inhibition. Cognitive Development , 31, 1–21. 10.1016/j.cogdev.2014.03.001 [DOI] [Google Scholar]
  95. Howlett, C. A., Wewege, M. A., Berryman, C., Oldach, A., Jennings, E., Moore, E., Karran, E. L., Szeto, K., Pronk, L., Miles, S., & Moseley, G. L. (2022). Back to the drawing board—The relationship between self-report and neuropsychological tests of cognitive flexibility in clinical cohorts: A systematic review and meta-analysis. Neuropsychology , 36(5), 347–372. 10.1037/neu0000796 [DOI] [PubMed] [Google Scholar]
  96. Huang, W., Fang, Y., Tan, X., & Zhao, J. (2024). Childhood trauma, stressful life events, and depression: Exploring the mediating effect of cognitive flexibility. Psychological Trauma: Theory, Research, Practice, and Policy, 16(Suppl. 1), S45–S52. 10.1037/tra0001337 [DOI] [PubMed] [Google Scholar]
  97. Hutton, S. B., & Ettinger, U. (2006). The antisaccade task as a research tool in psychopathology: A critical review. Psychophysiology , 43(3), 302–313. 10.1111/j.1469-8986.2006.00403.x [DOI] [PubMed] [Google Scholar]
  98. Ionescu, T. (2012). Exploring the nature of cognitive flexibility. New Ideas in Psychology, 30(2), 190–200. 10.1016/j.newideapsych.2011.11.001 [DOI] [Google Scholar]
  99. Iverach, L., Menzies, R. G., O'Brian, S., Packman, A., & Onslow, M. (2011). Anxiety and stuttering: Continuing to explore a complex relationship. American Journal of Speech-Language Pathology , 20(3), 221–232. 10.1044/1058-0360(2011/10-0091) [DOI] [PubMed] [Google Scholar]
  100. *. Jackson, L. S. (2012). False memories in adults who do and do not stutter [Unpublished doctoral dissertation/master's thesis]. The University of Texas at Austin. [Google Scholar]
  101. James, W. (1890). The principles of psychology (Vol. 1). Henry Holt and Co. 10.1037/10538-000 [DOI] [Google Scholar]
  102. Johann, V., Könen, T., & Karbach, J. (2020). The unique contribution of working memory, inhibition, cognitive flexibility, and intelligence to reading comprehension and reading speed. Child Neuropsychology, 26(3), 324–344. 10.1080/09297049.2019.1649381 [DOI] [PubMed] [Google Scholar]
  103. Johnco, C., Wuthrich, V. M., & Rapee, R. M. (2013). The role of cognitive flexibility in cognitive restructuring skill acquisition among older adults. Journal of Anxiety Disorders, 27(6), 576–584. 10.1016/j.janxdis.2012.10.004 [DOI] [PubMed] [Google Scholar]
  104. Kaldas, A. (2022). Attention: A descriptive taxonomy. History and Philosophy of the Life Sciences, 44(4), Article 63. 10.1007/s40656-022-00538-3 [DOI] [Google Scholar]
  105. Kashdan, T. B., & Rottenberg, J. (2010). Psychological flexibility as a fundamental aspect of health. Clinical Psychology Review, 30(7), 865–878. 10.1016/j.cpr.2010.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Koedoot, C., Bouwmans, C., Franken, M.-C., & Stolk, E. (2011). Quality of life in adults who stutter. Journal of Communication Disorders, 44(4), 429–443. 10.1016/j.jcomdis.2011.02.002 [DOI] [PubMed] [Google Scholar]
  107. Koesten, J., Schrodt, P., & Ford, D. J. (2009). Cognitive flexibility as a mediator of family communication environments and young adults' well-being. Health Communication, 24(1), 82–94. 10.1080/10410230802607024 [DOI] [PubMed] [Google Scholar]
  108. Korzeniowski, C., Ison, M. S., & de Anglat, H. D. (2021). A summary of the developmental trajectory of executive functions from birth to adulthood. In Gargiulo P. Á. & Mesones Arroyo H. L. (Eds.), Psychiatry and neuroscience update: From epistemology to clinical psychiatry (pp. 459–473). Springer. 10.1007/978-3-030-61721-9_33 [DOI] [Google Scholar]
  109. Langevin, M., & Prasad, N. N. (2012). A stuttering education and bullying awareness and prevention resource: A feasibility study. Language, Speech, and Hearing Services in Schools, 43(3), 344–358. 10.1044/0161-1461(2012/11-0031) [DOI] [PubMed] [Google Scholar]
  110. *. Lescht, E. (2017). Text recognition skills and text recall ability of adults who do and do not stutter in oral and typed modes [Unpublished doctoral dissertation/master's thesis] . University of Maryland. [Google Scholar]
  111. Levelt, W. J. M. (1989). Speaking: From intention to articulation. MIT Press. 10.7551/mitpress/6393.001.0001 [DOI] [Google Scholar]
  112. Levelt, W. J. M., Roelofs, A., & Meyer, A. S. (1999). A theory of lexical access in speech production. Behavioral and Brain Sciences , 22(1), 1–38. 10.1017/S0140525X99001776 [DOI] [PubMed] [Google Scholar]
  113. Logan, G. D. (1994). On the ability to inhibit thought and action: A user's guide to the stop signal paradigm. In Dagenbach D. & Carr T. H. (Eds.), Inhibitory processes in attention, memory, and language (pp. 189–239). Academic Press. [Google Scholar]
  114. Magalhães, S., Carneiro, L., Limpo, T., & Filipe, M. (2020). Executive functions predict literacy and mathematics achievements: The unique contribution of cognitive flexibility in grades 2, 4, and 6. Child Neuropsychology, 26(7), 934–952. 10.1080/09297049.2020.1740188 [DOI] [PubMed] [Google Scholar]
  115. Magnusdottir, B. B., Faiola, E., Harms, C., Sigurdsson, E., Ettinger, U., & Haraldsson, H. M. (2019). Cognitive measures and performance on the antisaccade eye movement task . Journal of Cognition , 2(1), Article 3. 10.5334/joc.52 [DOI] [Google Scholar]
  116. *. Markett, S., Bleek, B., Reuter, M., Prüss, H., Richardt, K., Müller, T., Yaruss, J. S., & Montag, C. (2016). Impaired motor inhibition in adults who stutter—Evidence from speech-free stop-signal reaction time tasks. Neuropsychologia, 91, 444–450. 10.1016/j.neuropsychologia.2016.09.008 [DOI] [PubMed] [Google Scholar]
  117. *. Maxfield, N. D. (2017). Semantic and phonological encoding times in adults who stutter: Brain electrophysiological evidence. Journal of Speech, Language, and Hearing Research, 60(10), 2906–2923. 10.1044/2017_JSLHR-L-16-0309 [DOI] [Google Scholar]
  118. Mayr, U., & Kliegl, R. (2000). Task-set switching and long-term memory retrieval. Journal of Experimental Psychology: Learning, Memory, and Cognition, 26(5), 1124–1140. 10.1037//0278-7393.26.5.1124 [DOI] [PubMed] [Google Scholar]
  119. *. McGill, M., Sussman, H., & Byrd, C. T. (2016). From grapheme to phonological output: Performance of adults who stutter on a word jumble task. PLOS ONE, 11(3), Article e0151107. 10.1371/journal.pone.0151107 [DOI] [Google Scholar]
  120. *. McGill, M. N. (2016). Multimodal nonvocal nonword matching in adults who stutter [Unpublished doctoral dissertation/master's thesis]. The University of Texas at Austin. [Google Scholar]
  121. *. Mersov, A.-M., Jobst, C., Cheyne, D. O., & De Nil, L. (2016). Sensorimotor oscillations prior to speech onset reflect altered motor networks in adults who stutter. Frontiers in Human Neuroscience, 10, Article 443. 10.3389/fnhum.2016.00443 [DOI] [Google Scholar]
  122. Miller, G. A., Galanter, E., & Pribram, K. H. (1960). Plans and the structure of behavior. Henry Holt and Co. 10.1037/10039-000 [DOI] [Google Scholar]
  123. Miller, H. V., Barnes, J. C., & Beaver, K. M. (2011). Self-control and health outcomes in a nationally representative sample. American Journal of Health Behavior , 35(1), 15–27. 10.5993/AJHB.35.1.2 [DOI] [PubMed] [Google Scholar]
  124. Miyake, A., Friedman, N. P., Emerson, M. J., Witzki, A. H., Howerter, A., & Wager, T. D. (2000). The unity and diversity of executive functions and their contributions to complex “frontal lobe” tasks: A latent variable analysis. Cognitive Psychology , 41(1), 49–100. 10.1006/cogp.1999.0734 [DOI] [PubMed] [Google Scholar]
  125. Miyake, A., & Shah, P. (Eds.). (1999). Models of working memory: Mechanisms of active maintenance and executive control. Cambridge University Press. 10.1017/CBO9781139174909 [DOI] [Google Scholar]
  126. Moran, T. P. (2016). Anxiety and working memory capacity: A meta-analysis and narrative review. Psychological Bulletin, 142(8), 831–864. 10.1037/bul0000051 [DOI] [PubMed] [Google Scholar]
  127. Nakhostin-Khayyat, M., Borjali, M., Zeinali, M., Fardi, D., & Montazeri, A. (2024). The relationship between self-regulation, cognitive flexibility, and resilience among students: A structural equation modeling. BMC Psychology, 12(1), Article 337. 10.1186/s40359-024-01843-1 [DOI] [Google Scholar]
  128. Neef, N. E., Anwander, A., Bütfering, C., Schmidt-Samoa, C., Friederici, A. D., Paulus, W., & Sommer, M. (2018). Structural connectivity of right frontal hyperactive areas scales with stuttering severity. Brain, 141(1), 191–204. 10.1093/brain/awx316 [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Nigg, J. T. (2000). On inhibition/disinhibition in developmental psychopathology: Views from cognitive and personality psychology and a working inhibition taxonomy. Psychological Bulletin , 126(2), 220–246. 10.1037/0033-2909.126.2.220 [DOI] [PubMed] [Google Scholar]
  130. *. Ning, N., Peng, D., Liu, X., & Yang, S. (2017). Speech timing deficit of stuttering: Evidence from contingent negative variations. PLOS ONE, 12(1), Article e0168836. 10.1371/journal.pone.0168836 [DOI] [Google Scholar]
  131. Nozari, N., & Novick, J. (2017). Monitoring and control in language production. Current Directions in Psychological Science , 26(5), 403–410. 10.1177/0963721417702419 [DOI] [Google Scholar]
  132. Ntourou, K., Anderson, J. D., & Wagovich, S. A. (2018). Executive function and childhood stuttering: Parent ratings and evidence from a behavioral task. Journal of Fluency Disorders, 56, 18–32. 10.1016/j.jfludis.2017.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  133. Oberauer, K. (2019). Working memory and attention—A conceptual analysis and review. Journal of Cognition, 2(1), Article 36. 10.5334/joc.58 [DOI] [Google Scholar]
  134. Ofoe, L. C., Anderson, J. D., & Ntourou, K. (2018). Short-term memory, inhibition, and attention in developmental stuttering: A meta-analysis. Journal of Speech, Language, and Hearing Research , 61(7), 1626–1648. 10.1044/2018_JSLHR-S-17-0372 [DOI] [Google Scholar]
  135. Paphiti, M., & Eggers, K. (2022). Cognitive flexibility in younger and older children who stutter. Frontiers in Psychology, 13, Article 1017319. 10.3389/fpsyg.2022.1017319 [DOI] [Google Scholar]
  136. Paphiti, M., Jansson-Verkasalo, E., & Eggers, K. (2022). Complex response inhibition and cognitive flexibility in school-aged Cypriot-Greek-speaking children who stutter. Frontiers in Psychology, 13, Article 991138. 10.3389/fpsyg.2022.991138 [DOI] [Google Scholar]
  137. Posner, M. I., & Boies, S. J. (1971). Components of attention. Psychological Review, 78(5), 391–408. 10.1037/h0031333 [DOI] [Google Scholar]
  138. Rautakoski, P., Ursin, P. A., Carter, A. S., Kaljonen, A., Nylund, A., & Pihlaja, P. (2021). Communication skills predict social-emotional competencies. Journal of Communication Disorders, 93, Article 106138. 10.1016/j.jcomdis.2021.106138 [DOI] [Google Scholar]
  139. Raven, J. C. (1960). Guide to using the Standard Progressive Matrices. Lewis. [Google Scholar]
  140. Raz, A., & Buhle, J. (2006). Typologies of attentional networks. Nature Reviews Neuroscience, 7(5), 367–379. 10.1038/nrn1903 [DOI] [PubMed] [Google Scholar]
  141. Riley, G. D. (1994). Stuttering Severity Instrument–Third Edition (SSI-3). Pro-Ed. [Google Scholar]
  142. Riley, G. D. (2009). Stuttering Severity Instrument–Fourth Edition (SSI-4). Pro-Ed. [Google Scholar]
  143. Robertson, I. H., Ward, T., Ridgeway, V., & Nimmo-Smith, I. (1994). The Test of Everyday Attention: TEA: Manual. Thames Valley Test Company. [Google Scholar]
  144. *. Sasisekaran, J. (2013). Nonword repetition and nonword reading abilities in adults who do and do not stutter. Journal of Fluency Disorders, 38(3), 275–289. 10.1016/j.jfludis.2013.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  145. *. Sasisekaran, J., & Weisberg, S. (2014). Practice and retention of nonwords in adults who stutter. Journal of Fluency Disorders, 41, 55–71. 10.1016/j.jfludis.2014.02.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  146. Schaie, K. W. (1955). A test of behavioral rigidity. Journal of Abnormal and Social Psychology, 51(3), 604–610. 10.1037/h0046287 [DOI] [Google Scholar]
  147. *. Schwartze, M., & Kotz, S. A. (2020). Decreased sensitivity to changing durational parameters of syllable sequences in people who stutter. Language, Cognition and Neuroscience, 35(2), 179–187. 10.1080/23273798.2019.1642499 [DOI] [Google Scholar]
  148. Shao, Z., Roelofs, A., & Meyer, A. S. (2012). Sources of individual differences in the speed of naming objects and actions: The contribution of executive control. The Quarterly Journal of Experimental Psychology, 65(10), 1927–1944. 10.1080/17470218.2012.670252 [DOI] [PubMed] [Google Scholar]
  149. Sikora, K., Roelofs, A., Hermans, D., & Knoors, H. (2016). Executive control in spoken noun-phrase production: Contributions of updating, inhibiting, and shifting. The Quarterly Journal of Experimental Psychology, 69(9), 1719–1740. 10.1080/17470218.2015.1093007 [DOI] [PubMed] [Google Scholar]
  150. Smith, A., Sadagopan, N., Walsh, B., & Weber-Fox, C. (2010). Increasing phonological complexity reveals heightened instability in inter-articulatory coordination in adults who stutter. Journal of Fluency Disorders, 35(1), 1–18. 10.1016/j.jfludis.2009.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  151. *. Smits-Bandstra, S., & Gracco, V. (2015). Retention of implicit sequence learning in persons who stutter and persons with Parkinson's disease. Journal of Motor Behavior, 47(2), 124–141. 10.1080/00222895.2014.961890 [DOI] [PubMed] [Google Scholar]
  152. Spencer, C., & Weber-Fox, C. (2014). Preschool speech articulation and nonword repetition abilities may help predict eventual recovery or persistence of stuttering. Journal of Fluency Disorders, 41, 32–46. 10.1016/j.jfludis.2014.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  153. *. Subramanian, A. (2001). Identification of traits transmitted associated with stuttering [Unpublished doctoral dissertation/master's thesis] . University of Illinois at Urbana-Champaign. [Google Scholar]
  154. Tehan, G., & Humphreys, M. S. (1995). Transient phonemic codes and immunity to proactive interference. Memory and Cognition, 23(2), 181–191. 10.3758/BF03197220 [DOI] [PubMed] [Google Scholar]
  155. *. Tendera, A. (2019). Reactive and proactive response inhibition in neurodevelopmental stuttering [Unpublished doctoral dissertation/master's thesis]. University of Alberta. [Google Scholar]
  156. Tichenor, S., & Yaruss, J. S. (2020). Repetitive negative thinking, temperament, and adverse impact in adults who stutter. American Journal of Speech-Language Pathology, 29(1), 201–215. 10.1044/2019_AJSLP-19-00077 [DOI] [PubMed] [Google Scholar]
  157. *. Tichenor, S. E. (2020). Attention and stuttering: Differentiating word-form encoding and working memory differences in adults who stutter [Unpublished doctoral dissertation]. Michigan State University.
  158. Tichenor, S. E., Johnson, C. A., & Yaruss, J. S. (2021). A preliminary investigation of attention-deficit/hyperactivity disorder characteristics in adults who stutter. Journal of Speech, Language, and Hearing Research, 64(3), 839–853. 10.1044/2020_JSLHR-20-00237 [DOI] [Google Scholar]
  159. Tolan, G. A., & Tehan, G. (1999). Determinants of short-term forgetting: Decay, retroactive interference, or proactive interference? International Journal of Psychology, 34(5–6), 285–292. 10.1080/002075999399585 [DOI] [Google Scholar]
  160. Treleaven, S. B., & Coalson, G. A. (2020). Manual response inhibition and quality of life in adults who stutter. Journal of Communication Disorders , 88, Article 106053. 10.1016/j.jcomdis.2020.106053 [DOI] [Google Scholar]
  161. *. Treleaven, S. B., & Coalson, G. A. (2021). Verbal response inhibition in adults who stutter. Journal of Speech, Language, and Hearing Research, 64(9), 3382–3397. 10.1044/2021_JSLHR-20-00739 [DOI] [Google Scholar]
  162. Unsworth, N., Heitz, R. P., Schrock, J. C., & Engle, R. W. (2005). An automated version of the operation span task. Behavior Research Methods, 37(3), 498–505. 10.3758/BF03192720 [DOI] [PubMed] [Google Scholar]
  163. Usler, E. R. (2022). Why stuttering occurs: The role of cognitive conflict and control. Topics in Language Disorders, 42(1), 24–40. 10.1097/TLD.0000000000000275 [DOI] [PMC free article] [PubMed] [Google Scholar]
  164. Valentine, J. C., Pigott, T. D., & Rothstein, H. R. (2010). How many studies do you need? A primer on statistical power for meta-analysis. Journal of Educational and Behavioral Statistics, 35(2), 215–247. 10.3102/1076998609346961 [DOI] [Google Scholar]
  165. *. Vallely, M. N. (2011). Phonological working memory in adults who do and do not stutter [Unpublished doctoral dissertation/master's thesis]. The University of Texas at Austin. [Google Scholar]
  166. van Barreveld, M., Scheper, A., Vissers, C., Duinmeijer, I., & Hakvoort, B. (2025). Social–emotional functioning and quality of life in language disorders: A systematic review of development from childhood to adolescence. International Journal of Language & Communication Disorders, 60(3), Article e70039. 10.1111/1460-6984.70039 [DOI] [Google Scholar]
  167. *. Van der Linden, L. (2013). Cognitive control and stuttering [Unpublished doctoral dissertation/master's thesis]. University of Ghent. [Google Scholar]
  168. *. van Lieshout, P., Ben-David, B., Lipski, M., & Namasivayam, A. (2014). The impact of threat and cognitive stress on speech motor control in people who stutter. Journal of Fluency Disorders, 40, 93–109. 10.1016/j.jfludis.2014.02.003 [DOI] [PubMed] [Google Scholar]
  169. Verbruggen, F., & Logan, G. D. (2008). Response inhibition in the stop-signal paradigm. Trends in Cognitive Sciences, 12(11), 418–424. 10.1016/j.tics.2008.07.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  170. Viechtbauer, W., & Cheung, M. W. (2010). Outlier and influence diagnostics for meta-analysis. Research Synthesis Methods, 1(2), 112–125. 10.1002/jrsm.11 [DOI] [PubMed] [Google Scholar]
  171. *. Vincent, I., Grela, B. G., & Gilbert, H. R. (2012). Phonological priming in adults who stutter. Journal of Fluency Disorders, 37(2), 91–105. 10.1016/j.jfludis.2012.01.001 [DOI] [PubMed] [Google Scholar]
  172. Wagner, R. K., Torgesen, J. K., & Rashotte, C. A. (1999). Comprehensive Test of Phonological Processes (CTOPP). Pro-Ed. [Google Scholar]
  173. Wagner, R. K., Torgesen, J. K., Rashotte, C. A., & Pearson, N. A. (2013). Comprehensive Test of Phonological Processes–Second Edition (CTOPP-2). Pro-Ed. 10.1037/t52630-000 [DOI] [Google Scholar]
  174. Walsh, B., Christ, S., & Weber, C. (2021). Exploring relationships among risk factors for persistence in early childhood stuttering. Journal of Speech, Language, and Hearing Research, 64(8), 2909–2927. 10.1044/2021_JSLHR-21-00034 [DOI] [Google Scholar]
  175. Walsh, B., Usler, E., Bostian, A., Mohan, R., Gerwin, K. L., Brown, B., Weber, C., & Smith, A. (2018). What are predictors for persistence in childhood stuttering? Seminars in Speech and Language, 39(4), 299–312. 10.1055/s-0038-1667159 [DOI] [PMC free article] [PubMed] [Google Scholar]
  176. *. Webster, W. G. (1990). Concurrent cognitive processing and letter sequence transcription deficits in stutterers. Canadian Journal of Psychology, 44(1), 1–13. 10.1037/h0084238 [DOI] [PubMed] [Google Scholar]
  177. Wechsler, D. (1981). The psychometric tradition: Developing the Wechsler Adult Intelligence Scale. Contemporary Educational Psychology, 6(2), 82–85. 10.1016/0361-476X(81)90035-7 [DOI] [Google Scholar]
  178. Wechsler, D. (2009). Wechsler Adult Intelligence Scale–Fourth Edition (WAIS-IV). Pearson. 10.1037/t15169-000 [DOI] [Google Scholar]
  179. *. Wiltshire, C. (2019). Investigating speech motor control using vocal tract imaging, fMRI, and brain stimulation [Unpublished doctoral dissertation/master's thesis]. University of Oxford. [Google Scholar]
  180. Wingate, M. E. (1966). Behavioral rigidity in stutterers. Journal of Speech and Hearing Research, 9(4), 626–629. 10.1044/jshr.0904.626 [DOI] [Google Scholar]
  181. Yairi, E., & Ambrose, N. (2005). Early childhood stuttering: For clinicians, by clinicians . Pro-Ed. [Google Scholar]
  182. Yan, Z., Hong, S., Liu, F., & Su, Y. (2020). A meta-analysis of the relationship between empathy and executive function. PsyCh Journal, 9(1), 34–43. 10.1002/pchj.311 [DOI] [PubMed] [Google Scholar]
  183. *. Yang, Y., Jia, F., Fox, P. T., Siok, W. T., & Tan, L. H. (2019). Abnormal neural response to phonological working memory demands in persistent developmental stuttering. Human Brain Mapping, 40(1), 214–225. 10.1002/hbm.24366 [DOI] [PMC free article] [PubMed] [Google Scholar]
  184. Yaruss, J. S., & Quesal, R. W. (2010). Overall Assessment of the Speaker's Experience of Stuttering (OASES). Pearson. [Google Scholar]
  185. Ye, Z., & Zhou, X. (2009). Executive control in language processing. Neuroscience & Biobehavioral Reviews, 33(8), 1168–1177. 10.1016/j.neubiorev.2009.03.003 [DOI] [PubMed] [Google Scholar]
  186. Zelazo, P. D., Anderson, J. E., Richler, J., Wallner-Allen, K., Beaumont, J. L., Conway, K. P., Gerhson, R., & Weintraub, S. (2014). NIH Toolbox Cognition Battery (CB): Validation of executive function measures in adults. Journal of the International Neuropsychological Society, 20(6), 620–629. 10.1017/S1355617714000472 [DOI] [PMC free article] [PubMed] [Google Scholar]
  187. Zetsche, U., Bürkner, P.-C., & Schulze, L. (2018). Shedding light on the association between repetitive negative thinking and deficits in cognitive control—A meta-analysis. Clinical Psychology Review, 63, 56–65. 10.1016/j.cpr.2018.06.001 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data generated or analyzed during this study are included in this published article.


Articles from Journal of Speech, Language, and Hearing Research : JSLHR are provided here courtesy of American Speech-Language-Hearing Association

RESOURCES