Abstract
Women are thought to fare better in verbal abilities, especially in verbal-fluency and verbal-memory tasks. However, the last meta-analysis on sex/gender differences in verbal fluency dates from 1988. Although verbal memory has only recently been investigated meta-analytically, a comprehensive meta-analysis is lacking that focuses on verbal memory as it is typically assessed, for example, in neuropsychological settings. On the basis of 496 effect sizes and 355,173 participants, in the current meta-analysis, we found that women/girls outperformed men/boys in phonemic fluency (ds = 0.12–0.13) but not in semantic fluency (ds = 0.01–0.02), for which the sex/gender difference appeared to be category-dependent. Women/girls also outperformed men/boys in recall (d = 0.28) and recognition (ds = 0.12–0.17). Although effect sizes are small, the female advantage was relatively stable over the past 50 years and across the life span. Published articles reported stronger female advantages than unpublished studies, and first authors reported better performance for members of their own sex/gender. We conclude that a small female advantage in phonemic fluency, recall, and recognition exists and is partly subject to publication bias. Considerable variance suggests further contributing factors, such as participants’ language and country/region.
Keywords: verbal ability, phonemic fluency, semantic fluency, verbal memory, age, author effects
After more than 100 years of psychological research, sex/gender differences in cognitive abilities are still heavily debated (for reviews, see Halpern, 2012; Hyde, 2014). Spatial and mathematical abilities, in which men are commonly believed to excel, are very well researched. For instance, a male advantage in mental rotation, the ability to rotate complex figures in one’s mind, has been reported in several meta-analyses with effect sizes ranging from Cohen’s d = 0.56 to 0.73 (Linn & Petersen, 1985; Voyer et al., 1995; Zell et al., 2015). By comparison, much less is known about verbal abilities, in which women/girls are commonly believed to excel. There is no unitary concept of verbal abilities, but the term relates to all aspects of open or inner language production and comprehension. Meta-analyses reported female advantages with medium effect sizes for writing ability (ds = 0.53–0.61; Hedges & Nowell, 1995) and reading comprehension (ds = 0.23–0.68; Reilly, 2012; Stoet & Geary, 2013). Verbal intelligence/reasoning (Feingold, 1988) and vocabulary (Hyde & Linn, 1988), on the other hand, did not reveal a female advantage (effect sizes smaller than d = 0.05; Hyde, 2005, 2014).
The two verbal abilities, however, that textbooks and review articles typically refer to when claiming the existence of a female advantage are verbal fluency (sometimes also called “word fluency”) and verbal memory (Andreano & Cahill, 2009; Halpern, 2012; Hamson et al., 2016; Hyde, 2014; Kimura, 2000; Miller & Halpern, 2014). Verbal-fluency and verbal-memory tests correlate with general cognitive abilities (Alexander & Smales, 1997; Kraan et al., 2013) and are frequently used in psychological assessments of developmental impairments in children (Gaillard et al., 2003; Pennington & Ozonoff, 1996), impairments and rehabilitation after stroke (Baldo et al., 2006; Barker-Collo & Feigin, 2006), and cognitive decline in dementia (Collie & Maruff, 2000; Zhao et al., 2013).
Verbal Fluency
Verbal fluency refers to the ability to generate (orally or in writing) as many words as possible that fulfill a certain criterion, normally under time restrictions. The criterion is typically either semantic, also called “categorical fluency” (e.g., naming animals, fruits, etc.) or phonemic (e.g., naming words that begin with a specific letter), also called “lexical/letter fluency.” Virtually all articles that claim women’s/girls’ superiority in verbal fluency refer to a landmark meta-analysis by Hyde and Linn (1988), who examined sex/gender differences in a few verbal abilities. The authors concluded that “speech production” or “verbal production” favored women by d = 0.33. However, the definition of “speech production” (“as occurs in essay writing or measures of spoken language,” p. 55) is different from the verbal-fluency definition above, and consequently, some studies in Hyde and Linn (1988) assessed different verbal abilities, such as quality of essays or written sentences (Harris & Seibel, 1976; Wormack, 1979) or how many words 4-year-old children speak (Brownell & Smith, 1973). Moreover, the meta-analysis was based on only 14 studies, whereas the Web of Knowledge revealed that approximately 7,500 references have included the term “verbal fluency” since 1988.
Phonemic Versus Semantic Fluency, Age, Cohort Effects, and Gender of First/Last Author
Heister (1982) found a female advantage when participants were asked to generate words beginning with the letters “S” and “M” (phonemic fluency), whereas no sex/gender differences emerged for naming things that are red or round (semantic fluency). Other studies reported a female advantage in semantic fluency (Acevedo et al., 2000) or did not find a sex/gender difference in either phonemic or semantic fluency (Kavé, 2005). Overall, it is unclear whether a female advantage exists in both semantic and phonemic fluency.
Furthermore, it is unclear at what age the putative female advantage arises and whether it changes across the life span. Some studies suggest a steeper decline in older men compared with women (Maylor et al., 2007; Rodriguez-Aranda & Martinussen, 2006), whereas de Frias et al. (2006) found that the female advantage in semantic fluency was stable between 35 and 80 years. On the basis of semantic fluency data from more than 30,000 individuals (ages 50–84) in 14 European countries, Weber et al. (2014, 2017) showed that women from younger cohorts performed better than women from older cohorts. Sex/gender differences also varied across European countries. Both findings were interpreted to show the impact of better access of women to resources and education (Weber et al., 2014, 2017). So far, it is unclear whether sex/gender differences in verbal fluency change with age or across cohorts.
Finally, Hyde and Linn (1988) found that female first authors reported a stronger female advantage (d = 0.15) than male first authors (d = 0.08). However, this finding was based on all verbal abilities, and although statistically significant, the difference was considered to be unsubstantial. In the current study, we sought to replicate the findings by Hyde and Linn but more specifically with respect to verbal fluency. In addition, we also investigated the influence of gender of the last author, who is often the supervisor or more senior researcher overseeing the research effort.
Verbal-Episodic Memory
As with verbal ability, there is no unitary definition of verbal memory. Nevertheless, there is a multitude of empirical data on what researchers considered verbal memory. Several studies found better performance in women (Catani et al., 2007; de Frias et al., 2006; Herlitz et al., 1997; P. A. Lowe et al., 2003), and a narrative review concluded that “females show an advantage at verbal memory” (Andreano & Cahill, 2009, p. 260). However, other studies found no sex/gender differences in verbal memory (Munnelly, 2016; Parsons et al., 2005). Meta-analyses on this issue were lacking until recently. Voyer et al. (2021) focused specifically on verbal working memory and found an overall significant female advantage that, however, was practically zero (Hedges’ g = 0.03). Furthermore, sex/gender differences varied across different sample and task parameters: Tasks with cued recall (g = 0.08) and free recall (g = 0.15) had a slightly elevated female advantage, whereas there was a male advantage in complex span (g = 0.04) and no significant sex/gender difference in serial recall (g < 0.01) and simple span (g < 0.01).
Another meta-analysis (Asperholm et al., 2019) investigated sex/gender differences in long-term memory, specifically episodic memory. Long-term memory is typically divided into declarative (explicit) and nondeclarative (implicit) memory; declarative memory comprises episodic memory (i.e., the ability to remember specific events or situations at a particular place at a particular time) and semantic memory (i.e., the ability to remember concepts and facts). Asperholm et al. (2019) investigated sex/gender differences in episodic memory for different stimuli, including images, movies, faces, routes, locations, and verbal content such as words/sentences. Verbal content showed a small female advantage (g = 0.28). A wide range of studies/tasks were included in the verbal-episodic category, and the authors investigated whether the female advantage varied across, for example, neutral stimuli versus emotional stimuli, intentionally learned versus incidentally learned, or recall versus recognition. Subsequent analyses of moderator variables, such as age, publication year, or geographical region, took into account whether the stimulus material was verbal, images, movies, or faces but did not distinguish between incidental/intentional, emotional/neutral, or recall/recognition, and only peer-reviewed articles were included.
Like Asperholm et al. (2019), in the present study, we were interested in episodic long-term memory and thus discarded studies/tasks that primarily assess working memory. In contrast to Asperholm et al., we had a narrower focus on verbal-episodic memory, which we investigated with a broader literature search. That is, we examined exclusively verbal-episodic memory (not memory for routes and locations) and included only studies with neutral stimuli (vs. emotional stimuli) in which participants learned material intentionally (vs. incidentally). The intentional learning of neutral stimuli is a key feature of frequently used neuropsychological tests on verbal long-term memory, such as the California Verbal Learning Test (CVLT; Delis et al., 2000), the Rey Auditory Verbal Learning Test (RAVLT; Schmidt, 1996), or the Wechsler Memory Scale (WMS; Wechsler, 2009). Further in contrast to Asperholm et al., the literature search of the current study also included “gray” literature, such as PhD/master’s theses, to investigate whether sex/gender differences are subject to publication effects. Moreover, the current study examined, for the first time, possible effects of first/last authors’ gender on sex/gender differences in verbal-episodic memory. Finally, we performed these analyses separately for recognition (i.e., when cues are provided for the material that had to be memorized) and recall (i.e., absence or lack of cues) because the female advantage appeared to be consistently larger for recall than for recognition (Asperholm et al., 2019; Voyer et al., 2021). The fact that only 14 and 18 of our 168 included studies overlapped with Voyer et al. (2021) and Asperholm et al., respectively, demonstrates that different aspects of verbal memory were investigated in the current study. Henceforth, we thus use the term “verbal-episodic memory” to refer to the data that were analyzed in the present study and “verbal memory” to refer to verbal memory in general.
Aims and Hypotheses
A female advantage is frequently assumed in verbal fluency and verbal memory. For verbal fluency, this assumption is based on an early meta-analysis by Hyde and Linn (1988) that required an update. For verbal memory, a meta-analysis was missing that focuses specifically on verbal-episodic memory—complementary to two recent meta-analyses about verbal working memory (Voyer et al., 2021) and episodic memory in general (Asperholm et al., 2019). In the present study, we thus aimed to reveal the magnitude of the putative female advantage in verbal fluency and verbal-episodic memory. For both, we additionally examined the impact of potentially modulating factors such as publication year, type of publication (articles vs. PhD/master theses), participants’ age, semantic fluency versus phonemic fluency, recall versus recognition, and gender of first/last author. We hypothesized a female advantage (a) in both verbal fluency and verbal-episodic memory of intentionally learned neutral stimuli (Andreano & Cahill, 2009; Halpern, 2012; Miller & Halpern, 2014), (b) that has increased over the past 50 to 60 years because of better access to education for women (Weber et al., 2014, 2017), (c) that emerges across all age groups but becomes larger in older adults (Maylor et al., 2007; Rodriguez-Aranda & Martinussen, 2006), and (d) that is affected by the gender of the first (Hyde and Linn, 1988) and last authors.
Method
The meta-analysis, including literature search, study selection, data analysis, and presentation of results, was performed following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines (Moher et al., 2009) and the recommendations for meta-analyses described by Borenstein et al. (2009). Data analysis was carried out with Comprehensive Meta-Analysis (Version 3.3.070; Borenstein et al., 2014).
Literature search and study selection
Search terms and databases
Between October 22 and 29, 2016, the databases PsycINFO, ISI Web of Knowledge, and PubMed were searched for relevant literature. Between September 13 and 19, 2019, we additionally searched the ProQuest Dissertation & Theses database to identify unpublished PhD and master’s theses. For the search terms and number of identified references, see Table S1 in the Supplemental Material available online. An additional 16 studies were identified through other sources, such as comprehensive literature reviews and references used in previously identified publications. After removing 38,322 duplicates, the remaining 28,305 hits were screened for suitability. Screening comprised reading both title and full abstract. In isolated cases, references were excluded based solely on title, for example, when the title indicated that the reference was a review or meta-analysis without original data or that the topic of the reference was outside the scope of the present meta-analysis (e.g., “Persephone in the Underworld: The Motherless Hero in Novels by Burney, Radcliffe, Austen, Bronte, Eliot, and Woolf”). Older PhD and master’s theses often did not have abstracts, in which case the whole thesis was screened. Details about the exclusion criteria and procedure during screening are provided in the Supplemental Material.
Study selection: final inclusion criteria
Of the 2,984 references that were included after screening of abstract/title, 72 full texts could not be obtained. The remaining 2,912 references then underwent a full-text search for eligibility. Inclusion criteria were:
- Use of phonemic/semantic-fluency and/or verbal-episodic-memory (recognition/recall) tests that comply with the aforementioned definitions of verbal fluency and verbal-episodic memory. Examples of verbal-fluency tests are the Controlled Oral Word Association Test (COWAT; Benton, 1967) or the F-A-S Test (Spreen & Benton, 1977), the Thurstone Word Fluency Test (Thurstone & Thurstone, 1962), or any test in which participants had to generate as many words as possible starting/ending with or containing certain letters and to provide as many examples as possible for a specific category. Not included were data from tests such as finding synonyms or essay writing (which were considered too peripheral for verbal fluency). Anagram tasks were excluded on the grounds that they draw on numerical and spatial abilities (Wilson et al., 1954).
- For verbal-episodic memory, we excluded tasks that measured exclusively or predominantly working memory such as digit span forward or backward from the Wechsler Adult Intelligence Scales (Wechsler, 2008). Examples of included verbal-episodic-memory tests are the Visual Verbal Learning Test (Brand & Jolles, 1985), the RAVLT, and the CVLT. Logical Memory II and Logical Memory Recognition (remembering a story) from the WMS were included, but not Logical Memory I because this subtest is more related to verbal working memory. If multiple verbal-episodic-memory parameters were provided (e.g., delayed recall, total recall, recall), we retained the total score; otherwise, the provided scores were kept. Learning in all verbal-episodic-memory measures had to be intentional (i.e., incidental learning measures were not included).
- For both verbal fluency and episodic memory, we excluded tasks that employed emotional stimuli because they could be confounded with sex/gender differences in emotional processing (Kret & De Gelder, 2012; Stevens & Hamann, 2012). For example, affective semantic-fluency categories such as “pleasant/unpleasant” or “joy/fear” (e.g., Gawda & Szepietowska, 2013a, 2013b) were not included.
- Verbal-fluency/episodic-memory stimuli were not presented laterally, that is, to one specific hemisphere. For example, tasks that employed laterality paradigms were not considered because of sex/gender differences in hemispheric asymmetry (Hirnstein et al., 2019).
- Verbal-fluency/episodic-memory tasks were not performed simultaneously with other tasks because multitasking abilities might vary across men and women (Hirnstein et al., 2018).
- The publication contained quantitative, empirical data (i.e., no reviews, study protocols, meta-analyses), which allowed computation of the effect size and the exact number (or percentages) of male and female participants. Only “pure” verbal-fluency and verbal-episodic-memory measures were included. That is, if covariates such as intelligence had been factored in, the data were excluded. If only aggregate scores were provided from test batteries that included both eligible and ineligible tasks, data were excluded. Finally, when studies reported multiple verbal-fluency/episodic-memory tasks but provided only statistical parameters to compute effect sizes for tests that found significant sex/gender differences (and insufficient statistical parameters for tests that did not find sex/gender differences), the whole study was discarded to avoid introducing a bias toward significant results.
- There were at least 10 male and 10 female participants in the sample to mitigate the effect of spurious findings with very small sample sizes.
- Participants were healthy individuals without a mental or other condition that could affect verbal-fluency/episodic-memory performance (e.g., depression, Alzheimer’s disease, learning disability) and were not under the influence of any kind of substance, medicine, or other factors that might influence cognitive performance (e.g., sleep deprivation, noise exposure). Data from control groups could be included unless control subjects were selected for specific features (e.g., intelligence, age, socioeconomic status) to match clinical groups.
- Participants were not preselected for a specific feature that could potentially be related to verbal-fluency/episodic-memory performance (e.g., participants with a certain gene combination, participants who performed better than average on a creativity test, samples with homosexual participants only).
- The publication was written in English, German, or any Scandinavian language.
- Cohen’s d was within the range of −4.0 to 4.0; values outside this range were deemed unrealistic. The range of included effect sizes was −1.07 to 1.42.
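The two purely numeric criteria above (minimum group sizes and a plausible effect-size range) lend themselves to a mechanical check. The following is a minimal sketch of such a filter; the class and function names are hypothetical and are not taken from the study's materials, and the remaining criteria (test type, sample health, language) would still require manual coding.

```python
from dataclasses import dataclass


@dataclass
class EffectSize:
    """Hypothetical record for one effect size (names are illustrative)."""
    d: float        # Cohen's d, positive = female advantage
    n_male: int
    n_female: int


MIN_PER_GROUP = 10   # at least 10 male and 10 female participants
MAX_ABS_D = 4.0      # effect sizes beyond +/-4.0 were deemed unrealistic


def passes_numeric_criteria(es: EffectSize) -> bool:
    """Return True if the sample-size and effect-size-range criteria are met."""
    enough_participants = (
        es.n_male >= MIN_PER_GROUP and es.n_female >= MIN_PER_GROUP
    )
    plausible_effect = abs(es.d) <= MAX_ABS_D
    return enough_participants and plausible_effect
```

For instance, a sample with nine men would be rejected regardless of its effect size, as would any sample with |d| above 4.0.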
For cases in which inclusion criteria were met but the study lacked important quantitative information (e.g., number of men/women/boys/girls, means, or p values), authors were contacted with a request to provide the relevant data, along with any other relevant data they had or knew of. Out of 45 contacted authors, nine provided relevant data.
In total, 496 effect sizes from 168 references were included for quantitative analysis, comprising data from 355,173 participants (men/boys = 178,409, women/girls = 176,764). For a more detailed overview of the study-selection process, including reasons that led to exclusion, see Figure 1. For a complete list of all included references and effect sizes, see Table S2 in the Supplemental Material.
Fig. 1.
PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flow diagram showing the study-selection process.
Statistical analysis
For each relevant measure from the included references above, standardized differences in means (Cohen’s d) were computed from the available statistical information. If the male/female distribution was given in percentages, they were converted into integers. The effect direction was set such that positively signed values indicate a female advantage and negatively signed values indicate a male advantage. A value of zero indicates the absence of any male/female advantage. We consistently applied the random-effects model because (a) we expected substantial between-studies variance and (b) we aimed to generalize our findings to the entire population. Moreover, we consistently used subgroups in a reference as the unit of analysis (vs. using the whole reference as the unit of analysis). That is, if a study included a verbal-episodic-memory measure from two age groups (e.g., one 50–59 and another 60–69), those subgroups were treated as separate measures rather than combining them into one measure.
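To make the sign convention and the percentage conversion concrete, the sketch below computes Cohen's d from group means and standard deviations using the standard pooled-SD formulas given by Borenstein et al. (2009). It is an illustration only: the analyses reported here were run in Comprehensive Meta-Analysis, the function names are hypothetical, and effect sizes derived from other statistics (e.g., t or p values) would need different conversions.

```python
import math


def counts_from_percent(n_total, pct_female):
    """Convert a percentage split into integer group sizes (n_male, n_female)."""
    n_female = round(n_total * pct_female / 100)
    return n_total - n_female, n_female


def cohens_d(mean_f, sd_f, n_f, mean_m, sd_m, n_m):
    """Standardized mean difference; positive values = female advantage.

    Returns (d, variance of d) based on the pooled within-group SD.
    """
    pooled_var = ((n_f - 1) * sd_f**2 + (n_m - 1) * sd_m**2) / (n_f + n_m - 2)
    d = (mean_f - mean_m) / math.sqrt(pooled_var)
    var_d = (n_f + n_m) / (n_f * n_m) + d**2 / (2 * (n_f + n_m))
    return d, var_d
```

With made-up values of 12.0 versus 11.0 words (SD = 4.0, n = 50 per group), this gives d = 0.25 in favor of women/girls.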
Several studies reported multiple outcomes for each sample/subsample. For example, a study could provide data from two different tests that both measure recall. It is likely that those tests were correlated with each other and that the magnitude of that correlation affects the variance and, thus, the likelihood of finding statistically significant results (Borenstein et al., 2009). Because these correlations were rarely reported, we ran each analysis twice: once with r = 0, assuming perfect independence of the outcomes, and once with r = 1.0, assuming perfect correlation between outcomes. In most cases, both analyses yielded similar results. For ease of reading, we always report the perfect independence results first. All tables/figures were based on the assumption of perfect independence.
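The influence of the assumed correlation can be illustrated with the composite-variance formula described by Borenstein et al. (2009) for the mean of several outcomes from the same sample. This is a sketch under that assumption, with a hypothetical function name; it is not a description of the software's internal procedure. The differing numbers of effect sizes in the two sets of analyses suggest that, under independence, outcomes were entered as separate effect sizes rather than averaged.

```python
import math


def composite_effect(ds, variances, r):
    """Mean of m correlated outcomes and the variance of that mean.

    r is the correlation assumed between every pair of outcomes
    (0 = perfect independence, 1 = perfect correlation).
    """
    m = len(ds)
    mean_d = sum(ds) / m
    total = sum(variances)
    for i in range(m):
        for j in range(m):
            if i != j:
                total += r * math.sqrt(variances[i] * variances[j])
    return mean_d, total / m**2
```

For two outcomes with equal variance V, the composite variance is V/2 under r = 0 and V under r = 1, so assuming perfect correlation is the more conservative choice.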
Overall sex/gender effects
First, we computed the overall sex/gender effect separately for verbal fluency and verbal-episodic memory. Then, we computed the overall sex/gender effect for each of the following four verbal-ability measures: phonemic and semantic fluency as measures of verbal fluency and recognition and recall as measures of verbal-episodic memory. One study had aggregated phonemic- and semantic-fluency scores into a combined verbal-fluency score (DeWan, 2006), whereas another had aggregated recognition and recall scores into combined verbal-episodic-memory scores (Rouch et al., 2005). Effect sizes from these studies were thus kept in the overall verbal-fluency/episodic-memory analysis but excluded from the recognition/recall/phonemic/semantic-fluency analysis.
For all these analyses, we provide the Q statistic (testing the null hypothesis that all studies in the analysis shared a common effect size), I² (the proportion of observed variance that reflects differences in true effect sizes rather than sampling error), and T² (the variance of true effect sizes) as indicators of how much the sex/gender effect varied across studies. To address the issue of publication bias, we report Egger’s regression (two-tailed; Egger et al., 1997) and funnel plots (see Fig. S1 in the Supplemental Material).
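The quantities named above can be sketched in a few lines. The code below assumes the DerSimonian-Laird (method-of-moments) estimator of T² and a normal-approximation confidence interval, and computes Egger's intercept by regressing the standardized effect (d/SE) on precision (1/SE). Function names are hypothetical, and the results are not guaranteed to match the output of Comprehensive Meta-Analysis in every detail.

```python
import math


def random_effects(ds, variances):
    """Random-effects summary with Q, I2 (in %), and T2 (method of moments)."""
    w = [1 / v for v in variances]
    fixed = sum(wi * di for wi, di in zip(w, ds)) / sum(w)
    q = sum(wi * (di - fixed) ** 2 for wi, di in zip(w, ds))
    df = len(ds) - 1
    c = sum(w) - sum(wi**2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c)
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    # Random-effects weights add T2 to each within-study variance.
    w_star = [1 / (v + tau2) for v in variances]
    pooled = sum(wi * di for wi, di in zip(w_star, ds)) / sum(w_star)
    se = math.sqrt(1 / sum(w_star))
    return {
        "d": pooled,
        "ci": (pooled - 1.96 * se, pooled + 1.96 * se),
        "z": pooled / se,
        "Q": q, "df": df, "I2": i2, "T2": tau2,
    }


def egger_intercept(ds, variances):
    """Intercept of the regression of d/SE on 1/SE (requires varying SEs)."""
    se = [math.sqrt(v) for v in variances]
    x = [1 / s for s in se]
    y = [d / s for d, s in zip(ds, se)]
    k = len(ds)
    mean_x, mean_y = sum(x) / k, sum(y) / k
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return mean_y - (sxy / sxx) * mean_x
```

An intercept near zero indicates a symmetric funnel plot; a positive intercept indicates that smaller (less precise) studies tend to report larger female advantages.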
Effects of publication year, publication type, age, and gender of first/last authors
To investigate whether sex/gender differences change with publication year (as an indicator for changes over time), vary across publication type (articles vs. PhD/master’s theses), age, and the gender of the first/last authors, we ran a set of metaregressions. Metaregressions have the advantage that they allow investigating the effect of one factor while controlling for a set of other factors (Borenstein et al., 2009). Here again, we assumed that the true effect size varied across studies and thus applied a random-effects model (method of moments). All tests were two-sided and based on z distribution.
Six covariates were created for the metaregressions: (a) The continuous covariate “publication year” simply coded the year when a reference was published. (b) “Publication type” was a categorical covariate that could either be “published article” or “PhD/master’s thesis.” (c) Age was analyzed with two covariates: “mean age” as a continuous variable, which was either obtained directly from the corresponding reference or, in case that information was missing, computed on the basis of the age range (e.g., an age range of 40–60 would lead to a mean age of 50). If age ranges were provided separately for men/boys and women/girls, we took the youngest and oldest age from either sex/gender. If mean ages were provided separately for women/girls and men/boys, we calculated a weighted overall mean. Using mean age alone, however, has two shortcomings. First, several studies provided only age information such as “>70 years,” which made it impossible to calculate a mean. Second, many studies have enormous age ranges. For example, approximately 20% of studies had age ranges of 40 years and more, which rendered mean age a rather coarse indicator. (d) For this reason, we created a second covariate to examine age effects: “age groups.” This was a categorical covariate, theoretically grounded in the Medical Subject Heading, the standardized vocabulary used in the Medline database for indexing, developed by National Library of Medicine. According to this classification, the following age categories were formed: “child/child preschool” (2–12), “adolescent” (13–18), “adult” (19–44), “middle aged” (45–64), and “aged” (65+). Effect sizes were grouped into those categories using the reported age range of the corresponding study. For example, an effect size based on a sample with an age range of 20 to 27 was classified as adult. An effect size based on an age range of 17 to 40 was coded blank and excluded from the age-groups analysis. 
As a consequence, the number of effect sizes was substantially higher for mean age (92%, 455/496) than for age groups (51%, 253/496). Although both age measures have their respective shortcomings, we combined both because this allows a reasonable estimate of age effects (see also Voyer et al., 2021). Finally, (e) and (f) were the categorical covariates “first author gender” and “last author gender,” respectively, each of which was either male or female. In case of single-author studies, the author was coded as first author and was not included in the analysis of last-author effects.
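The age coding described above can be summarized as a small set of rules. The sketch below uses the category boundaries and examples given in the text; the function names are hypothetical, and open-ended information such as “>70 years” is represented here by an infinite upper bound, which is an assumption made for illustration.

```python
# Age categories as listed in the text (lower and upper bounds in years).
AGE_GROUPS = [
    ("child/child preschool", 2, 12),
    ("adolescent", 13, 18),
    ("adult", 19, 44),
    ("middle aged", 45, 64),
    ("aged", 65, float("inf")),
]


def mean_age_from_range(min_age, max_age):
    """Fallback mean age when only an age range is reported."""
    return (min_age + max_age) / 2


def weighted_mean_age(mean_f, n_f, mean_m, n_m):
    """Overall mean age when means are reported separately by sex/gender."""
    return (mean_f * n_f + mean_m * n_m) / (n_f + n_m)


def age_group(min_age, max_age):
    """Return the category containing the whole range, or None (coded blank)."""
    for label, low, high in AGE_GROUPS:
        if low <= min_age and max_age <= high:
            return label
    return None
```

A sample aged 20 to 27 is classified as adult, whereas a sample aged 17 to 40 spans two categories and is therefore left blank, which is why fewer effect sizes are available for the age-groups covariate than for mean age.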
The categorical covariates described above were dummy-coded in order to be entered into the metaregression. This was done such that published articles, males, and adult served as reference groups for publication type, first/last author gender, and age groups, respectively. We did not include language as a covariate because there were too few non-English reports of data. For comparison, 263 out of 496 effect sizes (53%) were reported in English, whereas the second most frequent language, Dutch, comprised only 40 effect sizes (8%).
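The dummy coding can be illustrated as follows. This is a sketch only: the variable and level names are hypothetical labels chosen to mirror the covariates in the text, with published articles, male authors, and adults as the reference groups.

```python
# Reference level for each categorical covariate (coded as all zeros).
REFERENCE = {
    "publication_type": "published article",
    "first_author_gender": "male",
    "age_group": "adult",
}

LEVELS = {
    "publication_type": ["published article", "PhD/master's thesis"],
    "first_author_gender": ["male", "female"],
    "age_group": [
        "child/child preschool", "adolescent", "adult", "middle aged", "aged",
    ],
}


def dummy_code(record):
    """Return one 0/1 indicator per non-reference level of each covariate."""
    row = {}
    for variable, reference in REFERENCE.items():
        for level in LEVELS[variable]:
            if level != reference:
                row[f"{variable}={level}"] = int(record[variable] == level)
    return row
```

Each regression coefficient for an indicator then expresses the difference in effect size between that level and the reference group, holding the other covariates constant.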
We ran a sequence of metaregressions for each verbal ability (i.e., recall, recognition, phonemic/semantic fluency) separately. The first metaregression always included the covariates publication year, mean age, publication type, and first-author gender. This was done to maximize the number of available effect sizes. Age groups was not entered into the first metaregression because of multicollinearity with mean age and because only half of the effect sizes could be assigned to a specific age group (see above). We thus ran a second metaregression that included age group and all significant covariates from the first metaregression as a control (except for mean age because of multicollinearity). Last-author gender was also not entered into the first metaregression because of multicollinearity with publication type: None of the PhD/master’s theses have a last author. Therefore, we ran a third metaregression for published articles that included only last-author gender and all significant covariates from the first metaregression as a control (except for publication type because of multicollinearity).
Results
Overall sex/gender differences
Effect sizes of the most frequent verbal-fluency and verbal-episodic-memory measures are presented in Table 1.
Table 1.
Descriptive Overview of Sex/Gender Differences in Verbal-Fluency and Verbal-Episodic-Memory Measures
| Verbal ability | Test/measure | Effect size |
|---|---|---|
| Verbal fluency | Total effect | d = 0.07 [0.04, 0.10], k = 290 |
| Phonemic fluency | Total effect | d = 0.13 [0.09, 0.16], k = 135 |
| | Generic starting letter(s) | d = 0.12 [0.07, 0.18], k = 59 |
| | Controlled Oral Word Association Test/F-A-S Test | d = 0.14 [0.08, 0.20], k = 55 |
| | Four-word sentences | d = 0.03 [−0.20, 0.26], k = 5 |
| Semantic fluency | Total effect | d = 0.02 [−0.02, 0.06], k = 147 |
| | Category: animals | d = −0.13 [−0.16, −0.09], k = 58 |
| | Categories: animals and fruits/vegetables/food | d = 0.11 [0.03, 0.18], k = 26 |
| | Objects with specific color | d = 0.19 [0.13, 0.25], k = 10 |
| | Categories: animals, fruits/vegetables/food, and action verbs | d = 0.25 [−0.03, 0.53], k = 8 |
| | Fruits/vegetables/food | d = 0.31 [0.16, 0.47], k = 8 |
| Verbal-episodic memory | Total effect | d = 0.23 [0.19, 0.26], k = 206 |
| Recall | Total effect | d = 0.28 [0.23, 0.32], k = 136 |
| | California Verbal Learning Test | d = 0.42 [0.32, 0.52], k = 28 |
| | Rey Auditory Verbal Learning Test | d = 0.39 [0.29, 0.48], k = 24 |
| | Generic word list | d = 0.17 [0.06, 0.28], k = 16 |
| | Delayed Memory for Names/Visual-Auditory Learning from Woodcock Johnson Psycho-Educational Battery–Revised | d = −0.13 [−0.27, 0.01], k = 12 |
| | 10 Word Learning Test from CERAD | d = 0.18 [0.07, 0.28], k = 10 |
| | Ten-Words Test | d = 0.26 [0.13, 0.39], k = 7 |
| | Deese, Roediger, and McDermott task | d = 0.15 [0.02, 0.28], k = 7 |
| Recognition | Total effect | d = 0.12 [0.06, 0.17], k = 66 |
| | Rey Auditory Verbal Learning Test | d = 0.22 [0.12, 0.33], k = 18 |
| | California Verbal Learning Test | d = 0.17 [0.06, 0.29], k = 13 |
| | Deese, Roediger, and McDermott task | d = 0.15 [0.04, 0.27], k = 7 |
| | Storytelling delayed recognition | d = −0.07 [−0.18, 0.04], k = 7 |
| | Storytelling immediate recognition | d = 0.02 [−0.09, 0.13], k = 7 |
Note: Values in brackets represent 95% confidence intervals; k = number of effect sizes included. Effect sizes are provided assuming independence between multiple outcomes in the same study. Effect sizes in each subcategory were combined with a random-effects model, assuming a common among-study variance component across subcategories. That is, T² was computed for each subcategory and then pooled across subcategories. Only tests with at least seven effect sizes are provided, except for phonemic fluency, for which the three most frequent tests are provided. CERAD = Consortium to Establish a Registry for Alzheimer’s Disease.
Verbal fluency
Assuming perfect independence between multiple outcomes in the same study, we found that the overall effect size was d = 0.07 with a 95% confidence interval (CI) of 0.04 to 0.10, based on 290 effect sizes. The female advantage deviated significantly from zero, Z = 5.10, p < .001. There was substantial heterogeneity among studies, Q(289) = 2085.1, p < .001, I² = 86.1%, T² = 0.02. Egger’s regression intercept of −0.10 was not significant, t(288) = 0.54, p = .591.
Assuming perfect correlation between multiple outcomes in the same study, we found that the pattern of significant and nonsignificant effects was unchanged: d = 0.07, 95% CI = [0.04, 0.10], Z = 4.60, p < .001, Q(209) = 1784.3, p < .001, I² = 88.3%, T² = 0.02, Egger’s intercept = −0.13, t(208) = 0.52, p = .602, based on 210 effect sizes.
Verbal-episodic memory
Assuming perfect independence, we found a significant female advantage, d = 0.23, 95% CI = [0.19, 0.26], Z = 13.09, p < .001, based on 206 effect sizes. Heterogeneity was substantial, Q(205) = 1622.7, p < .001, I² = 87.4%, T² = 0.04. Egger’s intercept was 1.08, t(204) = 3.94, p < .001. Assuming perfect correlation, we found that the pattern of significant and nonsignificant effects was unchanged: d = 0.26, 95% CI = [0.21, 0.30], Z = 11.39, p < .001, Q(132) = 1194.1, p < .001, I² = 88.9%, T² = 0.04, Egger’s intercept = 1.18, t(131) = 3.45, p < .001, based on 133 effect sizes.
Phonemic fluency
There was a significant female advantage, d = 0.13, 95% CI = [0.09, 0.16], Z = 6.75, p < .001, based on 135 effect sizes. There was significant heterogeneity, Q(134) = 272.3, p < .001, I² = 50.8%, T² = 0.01. Egger’s intercept was 0.19, t(133) = 1.04, p = .30. Assuming perfect correlation, we found that all effects remained significant/nonsignificant: d = 0.12, 95% CI = [0.09, 0.16], Z = 6.97, p < .001, Q(128) = 226.9, p < .001, I² = 43.6%, T² = 0.01, Egger’s intercept = 0.20, t(127) = 1.14, p = .25, based on 129 effect sizes.
Semantic fluency
There was no significant sex/gender difference in semantic fluency, d = 0.02, 95% CI = [−0.02, 0.06], Z = 1.00, p = .315, based on 147 effect sizes. The effect varied significantly across studies, Q(146) = 1782.6, p < .001, I² = 91.8%, T² = 0.03, and Egger’s intercept was −0.61, t(145) = 1.78, p = .078. Assuming perfect correlation, we found that all effects remained significant/nonsignificant: d = 0.01, 95% CI = [−0.02, 0.05], Z = 0.70, p = .482, Q(136) = 1740.1, p < .001, I² = 92.2%, T² = 0.03, Egger’s intercept = −0.68, t(135) = 1.86, p = .065, based on 137 effect sizes.
Recall
There was a significant female advantage, d = 0.28, 95% CI = [0.23, 0.32], Z = 12.54, p < .001, based on 136 effect sizes. The effect varied largely between studies, Q(135) = 1217.0, p < .001, I² = 88.9%, T² = 0.04. Egger’s intercept was 1.32, t(134) = 3.94, p < .001. Assuming perfect correlation, we found that all effects remained significant/nonsignificant: d = 0.28, 95% CI = [0.24, 0.33], Z = 11.90, p < .001, Q(123) = 1155.3, p < .001, I² = 89.4%, T² = 0.04, Egger’s intercept = 1.35, t(123) = 3.85, p < .001, based on 124 effect sizes.
Recognition
There was a significant female advantage, d = 0.12, 95% CI = [0.06, 0.17], Z = 4.42, p < .001, based on 66 effect sizes. The effect varied significantly across studies, Q(65) = 257.1, p < .001, I² = 74.7%, T² = 0.02. Egger’s intercept was 1.27, t(64) = 3.11, p = .003. Assuming perfect correlation, we found that all effects remained significant/nonsignificant: d = 0.17, 95% CI = [0.10, 0.24], Z = 4.78, p < .001, Q(49) = 164.9, p < .001, I² = 70.3%, T² = 0.03, Egger’s intercept = 1.08, t(48) = 2.42, p = .019, based on 50 effect sizes.
Metaregressions for moderator variables
The first set of metaregressions contained the predictors publication year, publication type, first-author gender, and mean age. Assuming perfect independence, we found that all four models explained a significant proportion of between-studies variance: phonemic fluency, Q(4) = 15.75, p = .003, R² = 3.6%, based on 125 effect sizes; semantic fluency, Q(4) = 28.94, p < .001, R² = 51.0%, based on 129 effect sizes; recall, Q(4) = 28.76, p < .001, R² = 23.5%, based on 124 effect sizes; and recognition, Q(4) = 33.03, p < .001, R² = 31.3%, based on 65 effect sizes. Assuming perfect correlation, we found that all four models remained significant: phonemic fluency, Q(4) = 18.04, p = .001, R² = 11.2%, based on 119 effect sizes; semantic fluency, Q(4) = 35.66, p < .001, R² = 53.2%, based on 120 effect sizes; recall, Q(4) = 25.89, p < .001, R² = 23.9%, based on 111 effect sizes; and recognition, Q(4) = 23.80, p < .001, R² = 36.2%, based on 49 effect sizes.
Published articles versus PhD/master’s theses
Published articles consistently reported significantly higher female performance than PhD/master’s theses: phonemic fluency, Z = 2.00, p = .045, B = −0.093; semantic fluency, Z = 2.77, p = .006, B = −0.108; recall, Z = 4.01, p < .001, B = −0.243; and recognition, Z = 4.58, p < .001, B = −0.390 (see Fig. 2). Assuming perfect correlation, we found that all four effects remained significant.
Fig. 2.
Effect of publication type. The asterisk denotes significant difference between published articles and PhD/master’s theses. Central lines represent means of the respective category, and upper and lower lines are confidence intervals. Figures are based on assuming perfect independence between multiple measures from the same sample or subsample.
Gender of first author
Female first authors reported significantly stronger female advantages in phonemic fluency (Z = 2.44, p = .015, B = 0.107), semantic fluency (Z = 3.69, p < .001, B = 0.134), and recognition (Z = 4.31, p < .001, B = 0.271) compared with male first authors (see Fig. 3). No significant difference between male and female first authors emerged in recall (Z = 1.36, p = .175, B = 0.076). Assuming perfect correlation, we found that all effects remained significant/nonsignificant.
Fig. 3.
Gender of first-author effect. The asterisk denotes significant difference between female and male first authors. Central lines represent means of the respective category, and upper and lower lines are confidence intervals. Figures are based on assuming perfect independence between multiple measures from the same sample or subsample.
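The Z statistics reported for the metaregression coefficients can be converted into two-tailed p values with the standard-normal distribution. The short Python sketch below shows that conversion; the helper name `two_tailed_p` is hypothetical, and small differences in the third decimal relative to the reported p values can arise because the reported Z values are rounded.

```python
import math


def two_tailed_p(z):
    """Two-tailed p value for a standard-normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# Z values taken from the first-author-gender results above.
for label, z in [("phonemic fluency", 2.44), ("recall", 1.36)]:
    print(f"{label}: Z = {z:.2f}, p = {two_tailed_p(z):.3f}")
```

For phonemic fluency, Z = 2.44 gives p ≈ .015, in line with the reported value.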
Publication year
The female advantage significantly decreased in phonemic fluency (Z = 2.401, p = .016, B = −0.004) and recall (Z = 2.02, p = .044, B = −0.005) with publication year. However, the effect became nonsignificant in phonemic fluency if the oldest study (Elias, 1951) was removed (Z = 1.91, p = .057, B = −0.002). Neither semantic fluency (Z = 1.63, p = .103, B = −0.004) nor recognition (Z = 1.43, p = .152, B = −0.004) changed significantly with publication year (see Fig. S2 in the Supplemental Material). Assuming perfect correlation, we found that the effect in recall was no longer significant (Z = 1.73, p = .085, B = −0.005) and that all other effects remained nonsignificant (after removing Elias, 1951).
Mean age
In phonemic fluency, the female advantage became significantly smaller with increasing mean age (Z = 2.46, p = .014, B = −0.002). By contrast, the female advantage became significantly larger with increasing mean age in recall (Z = 2.07, p = .038, B = 0.002). However, the effect was nonsignificant (Z = 1.76, p = .078, B = 0.002) after removing the study with the oldest mean-age sample, which also had an unusually high female advantage (Bleecker et al., 1988). No significant mean-age effect emerged in semantic fluency (Z = 1.94, p = .052, B = −0.001) and recognition (Z = 0.05, p = .959, B < −0.001; see Fig. S3 in the Supplemental Material). Assuming perfect correlation, we found that the female advantage decreased significantly with age in semantic fluency (Z = 2.45, p = .014, B = −0.002) and increased significantly in recall also if Bleecker et al. (1988) was removed (Z = 2.03, p = .043, B = 0.002). All other effects remained significant/nonsignificant.
Age groups
A new set of metaregressions was computed that contained age groups and all significant covariates from the first set of metaregressions described above. Mean age was never retained because of multicollinearity with age groups.
The results are presented in Table 2. Age groups as a whole (i.e., with all age categories combined) varied significantly only in semantic fluency, Q(4) = 102.6, p < .001, based on 77 effect sizes. More specifically, the sex/gender difference in the middle-aged (Z = 2.01, p = .045, B = 0.093) and aged (Z = 7.65, p < .001, B = −0.273) groups differed significantly from that in the adult reference group. Neither the child/child preschool group nor the adolescent group differed significantly from adults (all Zs ≤ 1.57, all ps ≥ .117). Moreover, there were no significant overall effects of age groups in phonemic fluency, Q(4) = 5.49, p = .241, based on 63 effect sizes; recall, Q(4) = 7.54, p = .110, based on 67 effect sizes; and recognition, Q(4) = 6.85, p = .144, based on 35 effect sizes. In phonemic fluency, none of the individual age groups differed significantly from the adult reference group either (all Zs ≤ 1.56, all ps ≥ .119). In recall, the child/child preschool group had a significantly smaller female advantage than the adult group (Z = 2.15, p = .032, B = 0.200). In recognition, the adolescent (Z = 2.11, p = .035, B = 0.275) and child/child preschool (Z = 2.05, p = .040, B = 0.202) groups had a significantly higher female advantage than the adult reference group, but in the case of adolescents, this was based on only three effect sizes.
Table 2.
Descriptive Overview of Age-Group Effects
| Age group | Phonemic fluency | Semantic fluency | Recall | Recognition |
|---|---|---|---|---|
| Child/child preschool (≤ 12 years) | d = 0.13 [0.06, 0.25], k = 29 | d = 0.09 [−0.02, 0.17], k = 30 | d = 0.05 [−0.06, 0.17], k = 15 | d = 0.13 [−0.04, 0.31], k = 7 |
| Adolescent (13–18 years) | d = 0.22 [0.03, 0.41], k = 5 | d = 0.03 [−0.25, 0.30], k = 2 | d = 0.13 [−0.06, 0.31], k = 7 | d = 0.11 [−0.14, 0.35], k = 3 |
| Adult (19–44 years) | d = 0.24 [0.07, 0.41], k = 7 | d = 0.15 [0.10, 0.21], k = 8 | d = 0.28 [0.17, 0.39], k = 15 | d = 0.02 [−0.10, 0.13], k = 9 |
| Middle aged (45–64 years) | d = 0.13 [0.03, 0.23], k = 7 | d = 0.25 [0.17, 0.32], k = 6 | d = 0.34 [0.24, 0.45], k = 9 | d = 0.13 [−0.04, 0.28], k = 6 |
| Aged (≥ 65 years) | d = 0.06 [−0.03, 0.15], k = 15 | d = −0.10* [−0.14, −0.07], k = 31 | d = 0.17 [0.09, 0.24], k = 21 | d = 0.06 [−0.09, 0.21], k = 10 |
Note: Values in brackets represent 95% confidence intervals; k = number of effect sizes included. Boldface type indicates that individual age groups differed significantly from the reference group “adult.” Verbal-ability measures in boldface type indicate that the sex/gender difference varied significantly across all age groups. This table may contain more effect sizes than the metaregression because the metaregression includes only studies with information on all covariates. Values are based on assuming perfect independence between multiple measures from the same sample or subsample.
Assuming perfect correlation, we found that all age-group effects in phonemic fluency (63 effect sizes) and semantic fluency (74 effect sizes) remained significant/nonsignificant. In recall, age groups as a whole remained nonsignificant, but now only the aged subsample had a significantly smaller female advantage than the adult group (Z = 2.30, p = .021, B = −0.127, based on 62 effect sizes). In recognition, age groups as a whole remained nonsignificant, and none of the individual age groups differed significantly from adults (all Zs ≤ 1.78, all ps ≥ .075, based on 26 effect sizes).
Gender of last author
A third set of metaregressions was computed for published articles only; it contained last-author gender and all significant covariates from the respective first set of metaregressions. Publication type was not included because of multicollinearity. Last-author gender became significant only in semantic fluency (Z = 2.50, p < .001, B = −0.09, based on 90 effect sizes), in which male last authors reported a stronger female advantage than female last authors. No significant differences between male and female last authors emerged in phonemic fluency (Z = 1.68, p = .093, B = 0.087, based on 72 effect sizes), recall (Z = 0.72, p = .474, B = 0.031, based on 70 effect sizes), and recognition (Z = 0.35, p = .729, B = −0.021, based on 53 effect sizes; see Fig. S4 in the Supplemental Material). Assuming perfect correlation, we found that all effects remained significant/nonsignificant.
Discussion
Using a meta-analytical approach, we investigated whether women/girls perform better than men/boys in verbal fluency and verbal-episodic memory with neutral stimuli that were memorized intentionally and which factors moderated the female advantage.
Small but robust female advantage in phonemic but not semantic fluency
Women/girls performed significantly better in phonemic fluency than men/boys (d = 0.13), but there was no significant female advantage in semantic fluency (ds = 0.01–0.02). When both were combined into a single verbal-fluency score, a significant female advantage remained (d = 0.07), but largely by virtue of the large number of included effect sizes (k = 290). The female advantage is thus limited to phonemic fluency, and even here it is markedly lower than in the landmark meta-analysis by Hyde and Linn (1988), who reported a small effect (d = 0.33). This discrepancy might be partly due to a different definition of verbal fluency used in the present meta-analysis, which also included a much larger number of studies (168 vs. 14), thereby providing higher precision.
The overall effect size for phonemic fluency (ds = 0.12–0.13) is practically identical to that for the COWAT/F-A-S (d = 0.14), the most frequently used test/starting-letter combination, and to that obtained when generic starting letters or combinations of generic starting letters are pooled (d = 0.12). To illustrate the magnitude of the female advantage, if men/boys report a mean of 36 words, an effect of d = 0.14 would translate into an advantage of roughly 1.4 words for women/girls (M = 37.4) if a realistic standard deviation of 10 words is assumed.
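The conversion from a standardized effect to raw words in the illustration above can be sketched in a few lines of Python. The sketch assumes that d is the mean difference divided by a standard deviation common to both groups; the function names are hypothetical.

```python
def raw_difference(d, sd):
    """Raw-score difference implied by Cohen's d and a common standard deviation."""
    return d * sd


def implied_female_mean(male_mean, d, sd):
    """Mean for women/girls implied by the mean for men/boys, d, and the common SD."""
    return male_mean + raw_difference(d, sd)


# Values from the illustration above: M = 36 words, d = 0.14, SD = 10 words.
print(round(raw_difference(0.14, 10), 2))           # 1.4
print(round(implied_female_mean(36, 0.14, 10), 2))  # 37.4
```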
The large number of studies and effect sizes in the present meta-analysis allowed testing whether the observed sex/gender difference in semantic fluency depended on the specific category participants were tasked with. The results revealed that men/boys generally named more animals (d = −0.13), whereas women/girls named more fruits/food/vegetables (d = 0.31). When both categories were combined, which several studies did, the effect size was slightly positive (d = 0.11), indicating a slight female advantage. These findings support the view that there is no overall female advantage in semantic fluency and that sex/gender differences are category-dependent (e.g., Laws, 2004; Sokołowski et al., 2020). Category dependency is also likely to account in part for the enormous heterogeneity in semantic fluency: The proportion of observed variance that reflects differences in true effect sizes (rather than sampling error) was 92%. Yet further research is needed to study those categories in more detail.
Small but robust female advantage in verbal-episodic memory
We found a significant female advantage for verbal-episodic memory, in general, with effect sizes between d = 0.23 and d = 0.26. Furthermore, the female advantage was stronger in recall (d = 0.28) than in recognition (ds = 0.12–0.17). Both findings are in line with Asperholm et al. (2019), who reported an overall female advantage of g = 0.28 for episodic memory with verbal content and a female advantage for recall (gs = 0.28–0.31) and recognition (g = 0.17). Note that the studies included in both meta-analyses had only little overlap, which highlights the robustness of the female advantage. Recognition is generally considered easier than recall (e.g., Postman et al., 1948). Therefore, the female advantage might be smaller in the less difficult recognition tasks.
The strongest female advantage arose for the CVLT (d = 0.42) and the RAVLT (d = 0.39). By contrast, when the two tasks from the Woodcock-Johnson Psycho-Educational Battery–Revised (delayed memory for names and visual-auditory learning) were combined, there was a male advantage (d = −0.13). However, because all 12 effect sizes were taken from the same study (Cotten, 1991), generalization of these findings is questionable. In recognition, the CVLT (d = 0.17) and RAVLT (d = 0.22) also demonstrated a female advantage. The only task that showed a male advantage (i.e., storytelling delayed recognition; d = −0.07) was not significant (the confidence interval includes zero), and again all seven effect sizes were from the same study (Murre et al., 2013). To illustrate the magnitude of the female advantage in verbal-episodic memory, imagine a hypothetical study with the CVLT in which participants need to memorize a list with 16 nouns. If one assumes a realistic standard deviation of three words and M = 10 for men, Cohen’s d = 0.42 (the largest effect size found for verbal-episodic memory) translates into a female advantage of roughly one word (M = 11.26).
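The CVLT illustration above can be checked with a short Python sketch. As a complementary reading that is not part of the reported analyses, the sketch also converts d into the probability that a randomly chosen woman scores higher than a randomly chosen man. That second step assumes normally distributed scores with equal variances in both groups, and both function names are hypothetical.

```python
import math


def implied_mean(reference_mean, d, sd):
    """Group mean implied by a reference mean, Cohen's d, and a common SD."""
    return reference_mean + d * sd


def probability_of_superiority(d):
    """P(random score from group A > random score from group B).

    ASSUMPTION: normal distributions with equal variances, in which case
    the probability is Phi(d / sqrt(2)) = 0.5 * (1 + erf(d / 2)).
    """
    return 0.5 * (1.0 + math.erf(d / 2.0))


# Values from the CVLT illustration above: M = 10 words, d = 0.42, SD = 3 words.
print(round(implied_mean(10, 0.42, 3), 2))          # 11.26
print(f"{probability_of_superiority(0.42):.2f}")    # 0.62
```

Under these assumptions, even the largest effect in verbal-episodic memory leaves the two distributions heavily overlapping.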
Whereas the present meta-analysis together with Asperholm et al. (2019) suggests a small but robust female advantage for verbal-episodic memory, Voyer et al. (2021) demonstrated that the female advantage in verbal working memory is practically zero. The largest female advantage reported by the authors was g = 0.15 for free recall. This may be because certain tasks, which showed a reliable female advantage in the present study, for example the CVLT, were also included in Voyer et al. The distinction between episodic long-term and working memory is not always clear cut, and there are good arguments why the CVLT taps into both memory processes. In general, however, the findings from all three meta-analyses suggest that the female advantage in verbal memory is not universal and emerges especially when information needs to be transferred to long-term memory, whereas it is very small or absent in working memory.
The female advantage is small but relevant
By comparison, the female advantage in verbal-episodic memory and phonemic fluency is smaller than in other verbal abilities, such as reading achievement (ds = 0.23–0.68; Reilly, 2012; Stoet & Geary, 2013) or writing abilities (ds = 0.53–0.61; Hedges & Nowell, 1995). In general, medium to large sex/gender differences were the exception, which is in line with the “gender-similarity hypothesis” (Hyde, 2005, 2014), according to which most sex/gender differences are in the small to medium range.
Verbal-episodic-memory and phonemic-fluency tasks are frequently used for assessing psychological impairments (Barker-Collo & Feigin, 2006; Collie & Maruff, 2000; Pennington & Ozonoff, 1996). The present study corroborates previous findings that standard tests, such as the CVLT (Kramer et al., 2003), RAVLT (Bleecker et al., 1988), and COWAT (Halari et al., 2005), reliably show a female advantage. This implies that sex/gender should be taken into account when phonemic fluency and verbal-episodic memory are used in the clinical/diagnostic context.
Stronger female advantage in published articles than PhD/master’s theses
We found support for the notion that the female advantage in verbal fluency and verbal-episodic memory is subject to publication bias. First, Egger’s regression and the funnel plots (see Fig. S1 in the Supplemental Material) suggest a “small study effect” for verbal-episodic memory, in general, as well as recall and recognition. That is, especially small studies with significant results favoring women/girls were more likely to be included in our meta-analysis than small studies favoring men/boys. Egger’s regression, however, was not significant for verbal, phonemic, or semantic fluency, which suggests the small-study effect is generally stronger in verbal-episodic memory.
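In its common formulation, Egger's regression regresses each study's standardized effect (d divided by its standard error) on its precision (1 divided by the standard error); an intercept that deviates from zero indicates funnel-plot asymmetry, that is, a small-study effect. The Python sketch below implements that formulation with ordinary least squares. It is a minimal sketch under stated assumptions: the function name and the seven example studies are hypothetical, and the software used for the reported analyses is not stated in this section. The degrees of freedom, k − 2, match the reported pattern (e.g., t(288) for 290 effect sizes).

```python
import math


def egger_intercept(effects, std_errors):
    """Return (intercept, standard error of the intercept, df) of Egger's regression.

    The regression is ordinary least squares of d / SE on 1 / SE.
    """
    y = [d / se for d, se in zip(effects, std_errors)]  # standardized effects
    x = [1.0 / se for se in std_errors]                 # precisions
    n = len(y)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    df = n - 2
    residual_ss = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    se_intercept = math.sqrt(residual_ss / df * (1.0 / n + mean_x ** 2 / sxx))
    return intercept, se_intercept, df


# Hypothetical studies in which smaller studies (larger SE) report larger effects.
example_se = [0.05, 0.08, 0.10, 0.15, 0.20, 0.25, 0.30]
example_d = [0.20, 0.24, 0.21, 0.33, 0.38, 0.36, 0.52]
b0, se_b0, df = egger_intercept(example_d, example_se)
print(f"intercept = {b0:.2f}, t({df}) = {b0 / se_b0:.2f}")
```

A positive intercept in this setup corresponds to the pattern described above, in which small studies tend to report larger female advantages than large studies.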
In addition, we found that the female advantage in all four reported verbal abilities was higher in published articles than in PhD/master’s theses. The difference ranged between d = 0.09 and d = 0.39. In fact, for recognition, the female advantage was not significant in PhD/master’s theses. By using metaregressions, factors such as publication year, age, or first/last-author gender were controlled for. Therefore, it is unlikely that the publication-type effect was a mere artifact of, for instance, an overrepresentation of unpublished studies in a particular age group. Likewise, the publication bias is unlikely to arise from lower quality in non-peer-reviewed PhD/master’s theses: If this were the case, we would expect randomly weaker or larger sex/gender differences. However, we found consistently stronger female advantage in published articles. The most parsimonious explanation is therefore that studies are more likely to be published when they find the anticipated female advantage.
First-authors’ gender affects sex/gender difference
The metaregression further revealed that the first author’s gender affects the magnitude of the sex/gender difference in phonemic fluency, semantic fluency, and recognition, but not recall. Both male and female first authors consistently reported stronger performance for members of their own gender. The effect was in the range of ds = 0.11 to 0.27 and was controlled for age, publication type, and publication year. Hyde and Linn (1988) reported a similar first-author bias but with smaller effect size (d = 0.07) and across a wide range of verbal abilities. We speculate that the first-author bias represents an in-group bias in which members of one’s own group are favored over out-group members. With these data, it is not possible to disentangle whether female first authors overreport or male first authors underreport the female advantage.
We also found a last-author effect in semantic fluency in which male last authors reported a significantly stronger female advantage than female last authors. This result is difficult to interpret because the sex/gender effect in semantic fluency is category-dependent, as described above. None of the other three measures (i.e., phonemic fluency, recall, and recognition) yielded significant last-author effects, and thus we refrain from speculations regarding last-author effects in the present study.
No clear cohort or age effects
The female advantage decreased significantly with publication year for recall (when perfect independence between multiple outcomes was assumed), but the effect was small (B = −0.004) and did not emerge when perfect correlation was assumed. No significant effect was found for recognition (see also Asperholm et al., 2019). Likewise, the significant publication-year effect in phonemic fluency disappeared when one outlier was removed. Overall, sex/gender effects reported here were relatively stable over time.
Age effects were neither in line with the previously reported stronger deterioration in older men compared with older women (Graves et al., 2017; Kramer et al., 2003; Rodriguez-Aranda & Martinussen, 2006) nor with an inverted U-shaped curve with smaller sex/gender differences in earlier and later life (Asperholm et al., 2019). When the analysis was based on mean age, a significant coefficient (B = −0.002) was found only in phonemic fluency, which implies that the female advantage was reduced by d = 0.02 over a 10-year period, a small effect. When the analysis was based on age groups, none of the three verbal-ability measures that showed a reliable female advantage yielded a significant overall age-groups effect. In some cases, certain age groups differed significantly from the adult reference group (see Table 2), but most comparisons with adults were not significant. In general, findings for the three measures that yielded a female advantage indicated relatively stable sex/gender differences throughout the life span (see also de Frias et al., 2006).
Semantic fluency was the only verbal domain that showed a significant overall age-group effect: Middle-aged participants (45–64, d = 0.25) showed the strongest female advantage, followed by adults (19–44, d = 0.15) and children (2–12, d = 0.09). Participants age 65 or older even showed a significant male advantage (d = −0.10). However, we refrain from interpretations because the female advantage was strongly category-dependent.
Limitations
First, the statistical indicators showed considerable variance. The null hypothesis, according to which there is only one true underlying effect size, was rejected in all analyses. Including data from very heterogeneous samples can be considered an asset because it increases the generalizability of our findings. However, although we investigated several moderator variables, there are other potentially relevant factors that we did not examine, such as (a) specific categories for semantic fluency, (b) test language, (c) monolingual versus bilingual participants, and (d) participants’ country/region of origin. The fact that most studies were carried out in the United States and United Kingdom and used native English-speaking participants might hamper generalizability. For example, a recent study did not find that the female advantage in phonemic fluency varied across countries, but only the UK, Italy, and Norway were investigated (Moè et al., 2021). However, the female advantage in reading comprehension has been demonstrated to vary across countries (Reilly, 2012; Stoet & Geary, 2013).
Second, we analyzed age effects with two approaches (age means and age groups) that each have their advantages and disadvantages. Age means allowed including more effect sizes at the expense of precision because the single number of age mean becomes meaningless in samples with large age ranges. Age groups allowed examining sex/gender differences in clearly defined developmental periods but at the expense of losing effect sizes that do not fall in an age category. As a result, some of the age groups have very few effect sizes (e.g., two or three), and we thus refrained from interpreting too much into significant differences between specific age groups. Conducting those analyses seemed nevertheless justified, and the lack of clear age effects may in part be due to the complex nature of sex/gender differences across age.
Third, we contacted authors whose work we had already identified as suitable for our meta-analysis and where only key statistical parameters were missing for calculating effect sizes. We did not reach out to authors who simply used tests/tasks that we considered adequate, and we also did not contact forums or researchers in the field of verbal fluency/memory. We further reached out only to authors who provided contact details in published articles, which were unavailable for authors of PhD/master’s theses. Moreover, we did not include data from Google Scholar because the massive number of references (> 200,000) was simply unfeasible to process. Thus, although the present meta-analysis compiled a large body of data, we might have missed several primary studies.
Conclusion and future avenues
Analyzing data from 168 studies, 496 effect sizes, and 355,173 participants, the present meta-analysis suggests that a small but robust female advantage in verbal fluency and verbal-episodic memory exists. With respect to verbal fluency, the female advantage emerged only in phonemic fluency, whereas sex/gender differences in semantic fluency appeared strongly category-dependent. The female advantage, especially in phonemic fluency, is smaller than previously shown (Hyde & Linn, 1988). However, phonemic fluency and verbal-episodic memory measures are frequently used in psychological/diagnostic settings, which highlights the need for taking sex/gender effects into account. A discussion of how the female advantage arises and what the underlying brain mechanisms are is beyond the scope of the present meta-analysis, but as argued for other cognitive sex/gender differences, we propose that the female advantage emerges from an intricate interaction of biological, psychological, and sociocultural factors (Halpern, 2012; Halpern & Tan, 2001; Hausmann, 2017; Jäncke, 2018).
The female advantage is affected by publication bias in two forms: Published articles reported larger female advantages than unpublished research, and both male and female first authors reported better performance for participants of their own gender. Although we found evidence for the existence of publication bias, it did not fully account for the female advantage reported here.
In general, meta-analyses focusing on cognitive abilities favoring women/girls are rare (for notable exceptions, see Asperholm et al., 2019; Voyer et al., 2007, 2021; Voyer & Voyer, 2014). Apart from including the additional factors listed above, future studies should investigate publication bias and first-author/last-author effects in cognitive abilities in which men/boys typically excel (e.g., mental rotation). This has been largely ignored so far. Finally, more studies should adopt a biopsychosocial approach and more routinely include sex/gender-related, nonbinary factors (e.g., sex hormones, self-efficacy, gender stereotypes) and their interactions, which might explain individual differences in verbal abilities and other cognitive domains better than sex/gender.
Supplemental Material
Supplemental material, sj-docx-1-pps-10.1177_17456916221082116 for Sex/Gender Differences in Verbal Fluency and Verbal-Episodic Memory: A Meta-Analysis by Marco Hirnstein, Josephine Stuebs, Angelica Moè and Markus Hausmann in Perspectives on Psychological Science
Acknowledgments
We thank Kylie Wong and Emily George for their tremendous help with the screening of references.
Cognitive differences between men/boys and women/girls arise from a complex interplay of biological, psychological, and sociocultural factors. These factors would be so intertwined that it would not be logical to distinguish between biology (“sex”) and social environment (“gender”). In the current study, we therefore aimed for a neutral terminology and avoided “sex” or “gender” as separate terms and instead used “sex/gender” whenever possible. In certain contexts, however, it would be inappropriate to use “sex/gender” when addressing specific biological or social constructs, such as gender equality, gender stereotypes, sex hormones, or sex chromosomes. When addressing first/last-author effects, we refer to gender because we identified authors as males or females simply on the basis of their first name, not knowing their biological sex or gender identity.
Transparency
Action Editor: Laura A. King
Editor: Laura A. King
The author(s) declared that there were no conflicts of interest with respect to the authorship or the publication of this article.
Funding: This work was supported by the Bergen Research Foundation (Grant BFS2016REK03) to M. Hirnstein.
References
References marked with an asterisk indicate studies included in the meta-analysis.
- *Abdel Aziz K., Khater M. S., Emara T., Tawfik H. M., Rasheedy D., Mohammedin A. S., Tolba M. F., El-Gabry D. A., Qassem T. (2017). Effects of age, education, and gender on verbal fluency in healthy adult Arabic-speakers in Egypt. Applied Neuropsychology: Adult, 24(4), 331–341. 10.1080/23279095.2016.1185424 [DOI] [PubMed] [Google Scholar]
- *Acevedo A., Loewenstein D. A., Barker W. W., Harwood D. G., Luis C., Bravo M., Hurwitz D. A., Aguero H., Greenfield L., Duara R. (2000). Category Fluency Test: Normative data for English- and Spanish-speaking elderly. Journal of the International Neuropsychological Society, 6(7), 760–769. [DOI] [PubMed] [Google Scholar]
- *Agard C. N. (2008). Urine creatinine levels and neurocognitive functioning in African-American adults (Publication No. 3335300) [Doctoral dissertation, Howard University]. ProQuest Dissertations & Theses Global.
- Alexander J. R. M., Smales S. (1997). Intelligence, learning and long-term memory. Personality and Individual Differences, 23(5), 815–825. 10.1016/S0191-8869(97)00054-8 [DOI] [Google Scholar]
- *Alexiou T. (2005). Cognitive development, aptitude and language learning in Greek young learners (Publication No. 10798115) [Doctoral dissertation, Swansea University]. ProQuest Dissertations & Theses Global.
- Andreano J. M., Cahill L. (2009). Sex influences on the neurobiology of learning and memory. Learning & Memory, 16(4), 248–266. 10.1101/lm.918309
- *Ardila A., Rosselli M., Matute E., Guajardo S. (2005). The influence of the parents’ educational level on the development of executive functions. Developmental Neuropsychology, 28(1), 539–560. 10.1207/s15326942dn2801_5
- Asperholm M., Hogman N., Rafi J., Herlitz A. (2019). What did you do yesterday? A meta-analysis of sex differences in episodic memory. Psychological Bulletin, 145(8), 785–821. 10.1037/bul0000197
- *Auriacombe S., Fabrigoule C., Lafont S., Jacqmin-Gadda H., Dartigues J.-F. (2010). Letter and category fluency in normal elderly participants: A population-based study. Aging, Neuropsychology, and Cognition, 8(2), 98–108. 10.1076/anec.8.2.98.841
- Baldo J. V., Schwartz S., Wilkins D., Dronkers N. F. (2006). Role of frontal versus temporal cortex in verbal fluency as revealed by voxel-based lesion symptom mapping. Journal of the International Neuropsychological Society, 12(6), 896–900. 10.1017/s1355617706061078
- *Banks P. G., Dickson A. L., Plasay M. T. (1987). The verbal selective reminding test: Preliminary data for healthy elderly. Experimental Aging Research, 13(4), 203–207. 10.1080/03610738708259326
- Barker-Collo S., Feigin V. (2006). The impact of neuropsychological deficits on functional stroke outcomes. Neuropsychology Review, 16(2), 53–64. 10.1007/s11065-006-9007-5
- *Baxter L. C. (1998). Dual-task interference effects on sex differences in verbal memory. Linguistics and Language Behavior Abstracts.
- Benton A. L. (1967). Problems of test construction in the field of aphasia. Cortex, 3(1), 32–58. 10.1016/S0010-9452(67)80005-4
- *Blair A. S. (2002). The effects of acculturation level on verbal learning in a sample of Hispanics of Mexican-American extraction (Publication No. 3074024) [Doctoral dissertation, George Fox University]. ProQuest Dissertations & Theses Global.
- Bleecker M. L., Bolla-Wilson K., Agnew J., Meyers D. A. (1988). Age-related sex differences in verbal memory. Journal of Clinical Psychology, 44(3), 403–411.
- *Bolla K. I., Gray S., Resnick S. M., Galante R., Kawas C. (1998). Category and letter fluency in highly educated older adults. The Clinical Neuropsychologist, 12(3), 330–338. 10.1076/clin.12.3.330.1986
- *Bolla K. I., Lindgren K. N., Bonaccorsy C., Bleecker M. L. (1990). Predictors of verbal fluency (FAS) in the healthy elderly. Journal of Clinical Psychology, 46(5), 623–628.
- *Bolla-Wilson K., Bleecker M. L. (1986). Influence of verbal intelligence, sex, age, and education on the Rey Auditory Verbal Learning Test. Developmental Neuropsychology, 2(3), 203–211. 10.1080/87565648609540342
- Borenstein M., Hedges L. V., Higgins J. P. T., Rothstein H. R. (2009). Introduction to meta-analysis. John Wiley & Sons.
- Borenstein M., Hedges L. V., Higgins J. P. T., Rothstein H. R. (2014). Comprehensive meta-analysis. BioSTAT.
- Brand N., Jolles J. (1985). Learning and retrieval rate of words presented auditorily and visually. The Journal of General Psychology, 112(2), 201–210.
- *Brandling-Bennett E. M. (2006). Categorization during typical development (Publication No. 3238634) [Doctoral dissertation, Washington University in St. Louis]. ProQuest Dissertations & Theses Global.
- *Brocki K. C., Bohlin G. (2004). Executive functions in children aged 6 to 13: A dimensional and developmental study. Developmental Neuropsychology, 26(2), 571–593. 10.1207/s15326942dn2602_3
- *Brosnan M. D. (1973). Developmental changes in memory processes for verbal and pictorial material (Publication No. 7415125) [Doctoral dissertation, Purdue University]. ProQuest Dissertations & Theses Global.
- Brownell W., Smith D. R. (1973). Communication patterns, sex, and length of verbalization in speech of four-year-old children. Speech Monographs, 40(4), 310–316. 10.1080/03637757309375809
- *Brucki S. M., Rocha M. S. (2004). Category fluency test: Effects of age, gender and education on total scores, clustering and switching in Brazilian Portuguese-speaking subjects. Brazilian Journal of Medical and Biological Research, 37(12), 1771–1777. 10.1590/s0100-879x2004001200002
- *Burton L. A., Henninger D. (2013). Sex differences in relationships between verbal fluency and personality. Current Psychology, 32(2), 168–174. 10.1007/s12144-013-9167-4
- *Capitani E., Laiacona M., Basso A. (1998). Phonetically cued word-fluency, gender differences and aging: A reappraisal. Cortex, 34(5), 779–783. 10.1016/S0010-9452(08)70781-0
- *Carstairs J. R., Shores E. A., Myors B. (2012). Australian norms and retest data for the Rey Auditory and Verbal Learning Test. Australian Psychologist, 47(4), 191–197. 10.1111/j.1742-9544.2012.00086.x
- Catani M., Allin M. P. G., Husain M., Pugliese L., Mesulam M. M., Murray R. M., Jones D. K. (2007). Symmetries in human brain language pathways correlate with verbal recall. Proceedings of the National Academy of Sciences, USA, 104(43), 17163–17168.
- *Cerhan J. R., Folsom A. R., Mortimer J. A., Shahar E., Knopman D. S., McGovern P. G., Hays M. A., Crum L. D., Heiss G. (1998). Correlates of cognitive function in middle-aged adults. Gerontology, 44(2), 95–105. 10.1159/000021991
- *Chan A. S., Poon M. W. (1999). Performance of 7- to 95-year-old individuals in a Chinese version of the Category Fluency Test. Journal of the International Neuropsychological Society, 5(6), 525–533. 10.1017/s135561779956606x
- *Chan R. C. K., Wong M., Chen E. Y. H., Lam L. C. W. (2003). Semantic categorisation and verbal fluency performance in a community population in Hong Kong: A preliminary report. Hong Kong Journal of Psychiatry, 13(4), 14–20.
- *Chang Y. J. (1992). P300 event-related potential in normal young adults: Correlation with memory and imagery variables (Publication No. 1351382) [Master’s thesis, California State University, Long Beach]. ProQuest Dissertations & Theses Global.
- *Chipman K. A. (1998). No sex difference on incidental picture memory, despite better verbal memory in women (Publication No. MQ32474) [Master’s thesis, The University of Western Ontario]. ProQuest Dissertations & Theses Global.
- *Cohen D. L. (1975). Sex differences in the organization of spatial abilities in older men and women (Publication No. 302815814) [Doctoral dissertation, University of Southern California]. ProQuest Dissertations & Theses Global.
- *Cole G. C. (1991). Alexithymia, Stroop interference, and verbal abilities: Sex differences (Publication No. 303971489) [Doctoral dissertation, Simon Fraser University]. ProQuest Dissertations & Theses Global.
- Collie A., Maruff P. (2000). The neuropsychology of preclinical Alzheimer’s disease and mild cognitive impairment. Neuroscience & Biobehavioral Reviews, 24(3), 365–374. 10.1016/S0149-7634(00)00012-9
- *Comilang K. A. (2003). The influence of internalized cultural and racial socialization on the neuropsychological test performance of Asian Americans (Publication No. 3103275) [Doctoral dissertation, Boston College]. ProQuest Dissertations & Theses Global.
- *Contador I., Almondes K., Fernandez-Calvo B., Boycheva E., Puertas-Martin V., Benito-Leon J., Bermejo-Pareja F. (2016). Semantic verbal fluency: Normative data in older Spanish adults from NEDICES Population-Based Cohort. Archives of Clinical Neuropsychology, 31(8), 954–962. 10.1093/arclin/acw071
- *Corona-LoMonaco M. E. (2000). Impact of language and culture on a neuropsychological screening battery for Hispanics (Publication No. 85564578) [Doctoral dissertation, University of Southern California]. ProQuest Dissertations & Theses Global.
- *Cory J. M. (2003). Sociocultural influences in neuropsychological testing with Latinos of Mexican origin (Publication No. 3107073) [Doctoral dissertation, Colorado State University]. ProQuest Dissertations & Theses Global.
- *Cotten M. A. (1991). Gender differences on the Woodcock-Johnson Revised Tests of Cognitive Abilities and Tests of Achievement (Publication No. 303966823) [Doctoral dissertation, Texas Woman’s University]. ProQuest Dissertations & Theses Global.
- *Crossley M., D’Arcy C., Rawson N. S. (1997). Letter and category fluency in community dwelling Canadian seniors: A comparison of normal participants to those with dementia of the Alzheimer or vascular type. Journal of Clinical and Experimental Neuropsychology, 19(1), 52–62. 10.1080/01688639708403836
- *Dadin C. O., Salgado R., Fernandez A. (2009). Ciclos naturales de las hormonas sexuales y diferencias entre sexos en memoria [Natural sex hormone cycles and gender differences in memory]. Actas Espanolas de Psiquiatria, 37(2), 68.
- *de Frias C. M., Nilsson L.-G., Herlitz A. (2006). Sex differences in cognition are stable over a 10-year period in adulthood and old age. Aging, Neuropsychology, and Cognition, 13(3–4), 574–587. 10.1080/13825580600678418
- Delis D. C., Kramer J. H., Kaplan E., Ober B. A. (2000). California Verbal Learning Test (2nd ed.). The Psychological Corp.
- *DeWan L. K. (2006). Childhood developmental trends in executive function as measured by the Delis-Kaplan Executive Function System: An exploration of gender differences (Publication No. 304908180) [Doctoral dissertation, George Fox University]. ProQuest Dissertations & Theses Global.
- *Dias N. M., Menezes A., Seabra A. G. (2013). Age differences in executive functions within a sample of Brazilian children and adolescents. Spanish Journal of Psychology, 16, Article E9. 10.1017/sjp.2013.12
- *Egelko S. E. (1983). Cognitive sequelae of right cerebrovascular accident: Issues of verbal deficit and sex differential patterns in visuospatial and verbal performance (Publication No. 8326686) [Doctoral dissertation, Fordham University]. ProQuest Dissertations & Theses Global.
- Egger M., Davey Smith G., Schneider M., Minder C. (1997). Bias in meta-analysis detected by a simple, graphical test. BMJ, 315(7109), 629–634. 10.1136/bmj.315.7109.629
- *Elias J. Z. (1951). Non-intellective factors in certain intelligence and achievement tests: An analysis of factors in addition to the cognitive entering into the intelligence and achievement scores of children at the sixth grade level (Publication No. 0002503) [Doctoral dissertation, New York University]. ProQuest Dissertations & Theses Global.
- *Ernest C. H. (1983). Imagery and verbal ability and recognition memory for pictures and words in males and females. Educational Psychology, 3(3–4), 227–244. 10.1080/0144341830030307
- *Farace E. (1996). Gender differences in relationships between degree of brain lateralization and cognitive ability (Publication No. 9701315) [Doctoral dissertation, University of Virginia]. ProQuest Dissertations & Theses Global.
- *Fares R. (2011). Normative data of two measures of verbal fluency for Arabic/English bilinguals. Linguistics and Language Behavior Abstracts.
- *Faul J. D. (2008). The effect of lifecourse socioeconomic position and health on trajectories of cognitive function in older adults (Publication No. 3304967) [Doctoral dissertation, University of Michigan]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- Feingold A. (1988). Cognitive gender differences are disappearing. American Psychologist, 43(2), 95–103. 10.1037/0003-066X.43.2.95
- *Findlay L., Bernier J., Tuokko H., Kirkland S., Gilmour H. (2010). Validation of cognitive functioning categories in the Canadian Community Health Survey–Healthy Aging. Health Reports, 21(4), 85–100. https://www.ncbi.nlm.nih.gov/pubmed/21269015
- *Fraser D. A. (1986). Identifying effective memory strategies: A human experiment in information science (mnemonics, cognition) (Publication No. 8623521) [Doctoral dissertation, Columbia University]. ProQuest Dissertations & Theses Global.
- Gaillard W. D., Sachs B. C., Whitnah J. R., Ahmad Z., Balsamo L. M., Petrella J. R., Braniecki S. H., McKinney C. M., Hunter K., Xu B., Grandin C. B. (2003). Developmental aspects of language processing: fMRI of verbal fluency in children and adults. Human Brain Mapping, 18(3), 176–185. 10.1002/hbm.10091
- *Gates J. K. (1986). The relation of hemisphere asymmetry and cognitive ability in young children (Publication No. T-29774) [Doctoral dissertation, The University of Chicago]. ProQuest Dissertations & Theses Global.
- *Gavin M. R. (1988). The performance of healthy elderly adults on behavioral tasks associated with frontal lobe functioning (Publication No. 303674594) [Doctoral dissertation, University of Georgia]. ProQuest Dissertations & Theses Global.
- *Gawda B., Szepietowska E. M. (2013a). Impact of unconscious emotional schemata on verbal fluency—Sex differences and neural mechanisms. Neuroquantology, 11(3), 443–450.
- *Gawda B., Szepietowska E. M. (2013b). Semantic and affective verbal fluency: Sex differences. Psychological Reports, 113(1), 1258–1268.
- *Gerdeman E. M. (1975). The contribution of social intelligence to predictive accuracy of interpersonal perception (Publication No. 7514509) [Doctoral dissertation, Loyola University Chicago]. ProQuest Dissertations & Theses Global.
- *González H. M., Mungas D., Haan M. N. (2005). A semantic verbal fluency test for English- and Spanish-speaking older Mexican-Americans. Archives of Clinical Neuropsychology, 20(2), 199–208. 10.1016/j.acn.2004.06.001
- *Gould K. L. (1972). Relationships of creativity, reading comprehension, intelligence, and response to a literature selection for fourth grade inner-city children (Publication No. 64189725) [Doctoral dissertation, The Ohio State University]. ProQuest Dissertations & Theses Global.
- Graves L. V., Moreno C. C., Seewald M., Holden H. M., Van Etten E. J., Uttarwar V., McDonald C. R., Delano-Wood L., Bondi M. W., Woods S. P., Delis D. C., Gilbert P. E. (2017). Effects of age and gender on recall and recognition discriminability. Archives of Clinical Neuropsychology, 32(8), 972–979. 10.1093/arclin/acx024
- *Greenstein Y., Blachstein H., Vakil E. (2010). Interrelations between attention and verbal memory as affected by developmental age. Child Neuropsychology, 16(1), 42–59. 10.1080/09297040903066891
- *Gregory A. M. (2002). Gender differences in the encoding of occupations: A release from proactive interference method in an aural-verbal modality (Publication No. 1409474) [Master’s thesis, California State University, Fullerton]. ProQuest Dissertations & Theses Global.
- *Greshner C. L. (2000). Relationships between physical and psychological measures of masculinity, femininity, and androgyny, and performance on sexually dimorphic cognitive tests (Publication No. MQ61434) [Master’s thesis, Simon Fraser University]. ProQuest Dissertations & Theses Global.
- *Gur R. C., Ragland J. D., Moberg P. J., Turner T. H., Bilker W. B., Kohler C., Siegel S. J., Gur R. E. (2001). Computerized neurocognitive scanning: I. Methodology and validation in healthy people. Neuropsychopharmacology, 25(5), 766–776. 10.1016/S0893-133X(01)00278-0
- *Halari R. (2003). The relationship between gonadal hormones and neurocognitive functioning in healthy men and women and patients with schizophrenia (Publication No. 301634326) [Doctoral dissertation, The City, University of London]. ProQuest Dissertations & Theses Global.
- Halari R., Hines M., Kumari V., Mehrotra R., Wheeler M., Ng V., Sharma T. (2005). Sex differences and individual differences in cognitive performance and their relationship to endogenous gonadal hormones and gonadotropins. Behavioral Neuroscience, 119(1), 104–117. 10.1037/0735-7044.119.1.104
- Halpern D. F. (2012). Sex differences in cognitive abilities (4th ed.). Psychology Press, Taylor and Francis Group.
- Halpern D. F., Tan U. (2001). Stereotypes and steroids: Using a psychobiosocial model to understand cognitive sex differences. Brain and Cognition, 45(3), 392–414.
- Hamson D. K., Roes M. M., Galea L. A. M. (2016). Sex hormones and cognition: Neuroendocrine influences on memory and learning. Comprehensive Physiology, 6(3), 1295–1337. 10.1002/cphy.c150031
- Harris M. B., Seibel C. E. (1976). Effects of sex, occupation, and confidence of model and sex and grade of subject on imitation of language behaviors. Developmental Psychology, 12(1), 89–90. 10.1037/0012-1649.12.1.89
- *Harrison J. E., Buxton P., Husain M., Wise R. (2000). Short test of semantic and phonological fluency: Normal performance, validity and test-retest reliability. British Journal of Clinical Psychology, 39(Pt. 2), 181–191.
- Hausmann M. (2017). Why sex hormones matter for neuroscience: A very short review on sex, sex hormones, and functional brain asymmetries. Journal of Neuroscience Research, 95(1–2), 40–49. 10.1002/jnr.23857
- *Hausmann M., Schoofs D., Rosenthal H. E., Jordan K. (2009). Interactive effects of sex hormones and gender stereotypes on cognitive sex differences—A psychobiosocial approach. Psychoneuroendocrinology, 34(3), 389–401.
- *Havlena J. E. (1990). The influence of age, gender, and motivation on the development of memory strategy use (Publication No. 1341726) [Master’s thesis, California State University, Fullerton]. ProQuest Dissertations & Theses Global.
- *Hazin I., Leite G., Oliveira R. M., Alencar J. C., Fichman H. C., Marques P. d. N., de Mello C. B. (2016). Brazilian normative data on letter and category fluency tasks: Effects of gender, age, and geopolitical region. Frontiers in Psychology, 7, Article 684. 10.3389/fpsyg.2016.00684
- Hedges L. V., Nowell A. (1995). Sex-differences in mental test-scores, variability, and numbers of high-scoring individuals. Science, 269(5220), 41–45. 10.1126/science.7604277
- *Heister G. (1982). Sex differences in verbal fluency: A short note. Current Psychology, 2(1), 257–260. 10.1007/bf03186768
- *Held M. M. (2013). The effects of gender, culture, and acculturation on verbal fluency in Filipino Americans and Caucasian European Americans (Publication No. 1641420952) [Doctoral dissertation, Palo Alto University]. ProQuest Dissertations & Theses Global.
- *Hendrawan D., Hatta T., Ohira H. (2015). Do the letters F, A and S represent Indonesian letter fluency stimuli? Asia-Pacific Psychiatry, 7(1), 64–71. 10.1111/appy.12082
- *Herlitz A., Airaksinen E., Nordström E. (1999). Sex differences in episodic memory: The impact of verbal and visuospatial ability. Neuropsychology, 13(4), 590–597. 10.1037/0894-4105.13.4.590
- *Herlitz A., Nilsson L. G., Backman L. (1997). Gender differences in episodic memory. Memory & Cognition, 25(6), 801–811. 10.3758/bf03211324
- *Herlitz A., Reuterskiöld L., Lovén J., Thilers P. P., Rehnman J. (2013). Cognitive sex differences are not magnified as a function of age, sex hormones, or puberty development during early adolescence. Developmental Neuropsychology, 38(3), 167–179. 10.1080/87565641.2012.759580
- *Hirnstein M., Coloma Andrews L., Hausmann M. (2014). Gender-stereotyping and cognitive sex differences in mixed- and same-sex groups. Archives of Sexual Behavior, 43(8), 1663–1673. 10.1007/s10508-014-0311-5
- *Hirnstein M., Freund N., Hausmann M. (2012). Gender stereotyping enhances verbal fluency performance in men (and women). Zeitschrift fur Psychologie / Journal of Psychology, 220(2), 70–77. 10.1027/2151-2604/a000098
- Hirnstein M., Hugdahl K., Hausmann M. (2019). Cognitive sex differences and hemispheric asymmetry: A critical review of 40 years of research. Laterality: Asymmetries of Body, Brain and Cognition, 24(2), 204–252. 10.1080/1357650X.2018.1497044
- Hirnstein M., Larøi F., Laloyaux J. (2018). No sex difference in an everyday multitasking paradigm. Psychological Research, 83(2), 286–296. 10.1007/s00426-018-1045-0
- *Holland L. J. (1969). Profile types of the urban junior high school student: Utilizing measures of cognitive factors, achievement, aptitude, and background information (Publication No. 302457169) [Doctoral dissertation, Wayne State University]. ProQuest Dissertations & Theses Global.
- *Hurks P. P. (2013). Administering design fluency tests in school-aged children: Analyses of design productivity over time, clustering, and switching. The Clinical Neuropsychologist, 27(7), 1131–1149. 10.1080/13854046.2013.821170
- Hyde J. S. (2005). The gender similarities hypothesis. American Psychologist, 60(6), 581–592.
- Hyde J. S. (2014). Gender similarities and differences. Annual Review of Psychology, 65(1), 373–398. 10.1146/annurev-psych-010213-115057
- Hyde J. S., Linn M. C. (1988). Gender differences in verbal-ability—A meta-analysis. Psychological Bulletin, 104(1), 53–69.
- *Isomura A. J. (2002). Verbal and spatial explicit memory performance among Japanese Americans and European Americans: Gender and ethnic differences (Publication No. 3055184) [Doctoral dissertation, Pacific Graduate School of Psychology]. ProQuest Dissertations & Theses Global.
- *Iverson G. L., Brooks B. L., Ashton Rennison V. L. (2014). Minimal gender differences on the CNS vital signs computerized neurocognitive battery. Applied Neuropsychology: Adult, 21(1), 36–42. 10.1080/09084282.2012.721149
- Jäncke L. (2018). Sex/gender differences in cognition, neurophysiology, and neuroanatomy. F1000Research, 7, Article F1000 Faculty Rev-805. 10.12688/f1000research.13917.1
- *John S., Rajashekhar B. (2014). Word retrieval ability on semantic fluency task in typically developing Malayalam-speaking children. Child Neuropsychology, 20(2), 182–195.
- *John S., Rajashekhar B., Guddattu V. (2016). Word retrieval ability on phonemic fluency in typically developing children. Applied Neuropsychology: Child, 5(4), 252–263. 10.1080/21622965.2015.1050099
- *Johnston J. O. (1965). Relationships between intelligence and personality variables (Publication No. 302199034) [Doctoral dissertation, Oklahoma State University]. ProQuest Dissertations & Theses Global.
- *Jordan L. M. (2014). Verbal fluency: Norms for the Lakota population in semantic and phonemic fluency tasks (Publication No. 1767318648) [Master’s thesis, The University of North Dakota]. ProQuest Central; ProQuest Dissertations & Theses Global.
- *Kavé G. (2005). Phonemic fluency, semantic fluency, and difference scores: Normative data for adult Hebrew speakers. Journal of Clinical and Experimental Neuropsychology, 27(6), 690–699. 10.1080/13803390490918499
- *Kemmotsu N. (2010). Performance of Japanese Americans on selected cognitive instruments (Publication No. 3412246) [Doctoral dissertation, University of California, San Diego and San Diego State University]. ProQuest Dissertations & Theses Global.
- *Kempler D., Teng E. L., Dick M., Taussig I. M., Davis D. S. (1998). The effects of age, education, and ethnicity on verbal fluency. Journal of the International Neuropsychological Society, 4(6), 531–538.
- *Kesse-Guyot E., Andreeva V. A., Jeandel C., Ferry M., Hercberg S., Galan P. (2012). A healthy dietary pattern at midlife is associated with subsequent cognitive performance. Journal of Nutrition, 142(5), 909–915. 10.3945/jn.111.156257
- *Khalil M. S. (2010). Preliminary Arabic normative data of neuropsychological tests: The verbal and design fluency. Journal of Clinical and Experimental Neuropsychology, 32(9), 1028–1035. 10.1080/13803391003672305
- *Kim J. K., Kang Y. (1999). Normative study of the Korean-California Verbal Learning Test (KCVLT). The Clinical Neuropsychologist, 13(3), 365–369. 10.1076/clin.13.3.365.1740
- Kimura D. (2000). Sex and cognition. MIT Press.
- *Kimura D., Clarke P. G. (2002). Women’s advantage on verbal memory is not restricted to concrete words. Psychological Reports, 91(3, Suppl.), 1137–1142. 10.2466/pr0.2002.91.3f.1137
- *Kimura D., Seal B. N. (2003). Sex differences in recall of real or nonsense words. Psychological Reports, 93(1), 263–264. 10.2466/pr0.2003.93.1.263
- *Knaus T. A. (2003). Sex differences in the anatomy of human perisylvian regions: Frontal and temporal cortical language areas (Publication No. 3084118) [Doctoral dissertation, Tulane University]. ProQuest Dissertations & Theses Global.
- *Knight R. G., McMahon J., Green T. J., Skeaff C. M. (2006). Regression equations for predicting scores of persons over 65 on the Rey Auditory Verbal Learning Test, the Mini-Mental State Examination, the Trail Making Test and semantic fluency measures. British Journal of Clinical Psychology, 45(Pt. 3), 393–402. 10.1348/014466505x68032
- *Kosmidis M. H., Vlahou C. H., Panagiotaki P., Kiosseoglou G. (2004). The verbal fluency task in the Greek population: Normative data, and clustering and switching strategies. Journal of the International Neuropsychological Society, 10(2), 164–172. 10.1017/S1355617704102014
- Kraan C., Stolwyk R. J., Testa R. (2013). The abilities associated with verbal fluency performance in a young, healthy population are multifactorial and differ across fluency variants. Applied Neuropsychology: Adult, 20(3), 159–168. 10.1080/09084282.2012.670157
- *Kramer J. H., Delis D. C., Kaplan E., Odonnell L., Prifitera A. (1997). Developmental sex differences in verbal learning. Neuropsychology, 11(4), 577–584. 10.1037/0894-4105.11.4.577
- *Kramer J. H., Yaffe K., Lengenfelder J., Delis D. C. (2003). Age and gender interactions on verbal memory performance. Journal of the International Neuropsychological Society, 9(1), 97–102. 10.1017/s1355617703910113
- Kret M. E., De Gelder B. (2012). A review on sex differences in processing emotional signals. Neuropsychologia, 50(7), 1211–1221. 10.1016/j.neuropsychologia.2011.12.022
- *LaFevor M. E. (2017). Examining concurrent validity, reliability, and sex and age normative values of the Impact Quick Test-Pediatric version (Publication No. 10272206) [Doctoral dissertation, Michigan State University]. ProQuest Dissertations & Theses Global.
- *Lanting S., Haugrud N., Crossley M. (2009). The effect of age and sex on clustering and switching during speeded verbal fluency tasks. Journal of the International Neuropsychological Society, 15(2), 196–204. 10.1017/S1355617709090237
- *Larrabee G. J., Crook T. H. (1993). Do men show more rapid age-associated decline in simulated everyday verbal memory than do women? Psychology and Aging, 8(1), 68–71. 10.1037/0882-7974.8.1.68
- *Laws K. R. (2004). Sex differences in lexical size across semantic categories. Personality and Individual Differences, 36(1), 23–32. 10.1016/S0191-8869(03)00048-5
- *Lero D. S. (1974). The effects of timed and untimed assessment on creativity test performance (Publication No. 302724281) [Doctoral dissertation, Purdue University]. ProQuest Dissertations & Theses Global.
- *Lewin C., Wolgers G., Herlitz A. (2001). Sex differences favoring women in verbal but not in visuospatial episodic memory. Neuropsychology, 15(2), 165–173. 10.1037/0894-4105.15.2.165
- *Liang H. (2013). Cognitive dysfunction and mental health status in Ketamine and poly-drug abusers (Publication No. 3578875) [Doctoral dissertation, The Chinese University of Hong Kong]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- *Lindblad S. G. (1995). Gender, age, and level of achievement differences on the Woodcock-Johnson revised tests of cognitive abilities and tests of achievement - early development battery (Publication No. 304270104) [Doctoral dissertation, Texas Woman’s University]. ProQuest Dissertations & Theses Global.
- Linn M. C., Petersen A. C. (1985). Emergence and characterization of sex-differences in spatial ability—A meta-analysis. Child Development, 56(6), 1479–1498.
- *Lowe C. N. (1998). The association between frontal lobe development and social perspective taking in children (Publication No. 304479777) [Master’s thesis, Mount Saint Vincent University]. ProQuest Dissertations & Theses Global.
- Lowe P. A., Mayfield J. W., Reynolds C. R. (2003). Gender differences in memory test performance among children and adolescents. Archives of Clinical Neuropsychology, 18(8), 865–878. 10.1016/S0887-6177(02)00162-2
- *Lundervold A. J., Wollschlager D., Wehling E. (2014). Age and sex related changes in episodic memory function in middle aged and older adults. Scandinavian Journal of Psychology, 55(3), 225–232. 10.1111/sjop.12114
- *Mathuranath P. S., George A., Cherian P. J., Alexander A., Sarma S. G., Sarma P. S. (2003). Effects of age, education and gender on verbal fluency. Journal of Clinical and Experimental Neuropsychology, 25(8), 1057–1064.
- *Maylor E. A., Reimers S., Choi J., Collaer M. L., Peters M., Silverman I. (2007). Gender and sexual orientation differences in cognition across adulthood: Age is kinder to women than to men regardless of sexual orientation. Archives of Sexual Behavior, 36(2), 235–249. 10.1007/s10508-006-9155-y
- *McKay K. E. (1995). Implicit and explicit recognition memory function: A developmental study of normal, reading disabled and ADHD children and adults (Publication No. 9605632) [Doctoral dissertation, City University of New York]. ProQuest Dissertations & Theses Global.
- Miller D. I., Halpern D. F. (2014). The new science of cognitive sex differences. Trends in Cognitive Sciences, 18(1), 37–45. 10.1016/j.tics.2013.10.011
- Moè A., Hausmann M., Hirnstein M. (2021). Gender stereotypes and incremental beliefs in STEM and non-STEM students in three countries. Relationships with performance in cognitive tasks. Psychological Research, 85, 554–567. 10.1007/s00426-019-01285-0
- *Mohd S. T. (1997). Assessing reading-related skills in Arabic-speaking children (Publication No. 9801124) [Doctoral dissertation, University of Florida]. ProQuest Dissertations & Theses Global.
- Moher D., Liberati A., Tetzlaff J., Altman D. G., & The PRISMA Group. (2009). Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLOS Medicine, 6(7), Article e1000097. 10.1371/journal.pmed.1000097
- *Moulden D. J. A. (1992). Development of the dichotic right ear advantage (REA) for the inferred lateralization of language: Dichotic word listening norms (Publication No. 89140240) [Master’s thesis, Laurentian University]. ProQuest Dissertations & Theses Global.
- *Mullins M. R. (1977). The relationship of verbal abilities to cognitive complexity (Publication No. 1696062658) [Master’s thesis, University of Nebraska at Omaha]. ProQuest Dissertations & Theses Global.
- *Munnelly M. (2016). Gender differences in verbal and visual memory (Publication No. 1814218463) [Master’s thesis, Kean University]. ProQuest Dissertations & Theses Global.
- *Murre J. M. J., Janssen S. M. J., Rouw R., Meeter M. (2013). The rise and fall of immediate and delayed memory for verbal and visuospatial information from late childhood to late adulthood. Acta Psychologica, 142(1), 96–107. 10.1016/j.actpsy.2012.10.005
- *Nida R. E. (1986). A comparative investigation of young children’s recall memory proficiency in naturalistic and laboratory settings (Publication No. 8718680) [Doctoral dissertation, The University of North Carolina at Greensboro]. ProQuest Dissertations & Theses Global.
- *O’Hara R., Miller E., Liao C. P., Way N., Lin X., Hallmayer J. (2006). COMT genotype, gender and cognition in community-dwelling, older adults. Neuroscience Letters, 409(3), 205–209. 10.1016/j.neulet.2006.09.047
- Parsons T. D., Rizzo A. R., Zaag C. v. d., McGee J. S., Buckwalter J. G. (2005). Gender differences and cognition among older adults. Aging, Neuropsychology, and Cognition, 12(1), 78–88. 10.1080/13825580590925125
- *Pedersen L. L. (2005). The relationship between behavioral and performance-based measures of executive function in preschool children (Publication No. 3168583) [Doctoral dissertation, Texas Woman’s University]. ProQuest Dissertations & Theses Global.
- Pennington B. F., Ozonoff S. (1996). Executive functions and developmental psychopathology. Journal of Child Psychology and Psychiatry, 37(1), 51–87. 10.1111/j.1469-7610.1996.tb01380.x
- *Phillips B. L. (1977). A comparison of the McCarthy Scales of Children’s Abilities to the WPPSI and the Columbia Mental Maturity Scale (Publication No. EP16129) [Master’s thesis, University of Wyoming]. ProQuest Dissertations & Theses Global.
- *Pino Escobar G. M. (2017). Do bilingual and monolingual children differ? Measuring and comparing attentional control skills in the verbal and non-verbal domains (Publication No. 2033936385) [Master’s thesis, Western Sydney University]. ProQuest Dissertations & Theses Global.
- Postman L., Jenkins W. O., Postman D. L. (1948). An experimental comparison of active recall and recognition. The American Journal of Psychology, 61, 511–519. 10.2307/1418315
- *Prigatano G. P., Gray J. A., Lomay V. T. (2008). Verbal (animal) fluency scores in age/grade appropriate minority children from low socioeconomic backgrounds. Journal of the International Neuropsychological Society, 14(1), 143–147. 10.1017/S1355617708080089
- *Rae G. (1979). The role of auditory and visual sensory modalities in reading and verbal memory (Publication No. U444206) [Doctoral dissertation, University of Aberdeen]. ProQuest Dissertations & Theses Global.
- *Rahman Q., Abrahams S., Wilson G. D. (2003). Sexual-orientation-related differences in verbal fluency. Neuropsychology, 17(2), 240–246. 10.1037/0894-4105.17.2.240
- *Ratcliff G., Dodge H., Birzescu M., Ganguli M. (2003). Tracking cognitive functioning over time: Ten-year longitudinal data from a community-based study. Applied Neuropsychology: Adult, 10(2), 76–88. 10.1207/S15324826AN1002_03
- Reilly D. (2012). Gender, culture, and sex-typed cognitive abilities. PLOS ONE, 7(7), Article e39904. 10.1371/journal.pone.0039904
- *Renteria L. (2005). Validation of the Spanish language Wechsler Adult Intelligence Scale (3rd edition) in a sample of American, urban, Spanish speaking Hispanics (Publication No. 3180959) [Doctoral dissertation, Loyola University Chicago]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- *Reynolds A. G. (1967). Associational fluency and written language expression (Publication No. 302282905) [Master’s thesis, The University of Western Ontario]. ProQuest Dissertations & Theses Global.
- *Riva D., Nichelli F., Devoti M. (2000). Developmental aspects of verbal fluency and confrontation naming in children. Brain and Language, 71(2), 267–284. 10.1006/brln.1999.2166
- Rodriguez-Aranda C., Martinussen M. (2006). Age-related differences in performance of phonemic verbal fluency measured by Controlled Oral Word Association Task (COWAT): A meta-analytic study. Developmental Neuropsychology, 30(2), 697–717. 10.1207/s15326942dn3002_3
- *Rosen M. L. (1995). Concept formation: An integrated developmental model to explain performance differences between ADHD and non-ADHD children (Publication No. 304213785) [Doctoral dissertation, Illinois Institute of Technology]. ProQuest Dissertations & Theses Global.
- *Rosselli M., Ardila A., Matute E., Inozemtseva O. (2009). Gender differences and cognitive correlates of mathematical skills in school-aged children. Child Neuropsychology, 15(3), 216–231. 10.1080/09297040802195205
- *Rosselli M., Tappen R., Williams C., Salvatierra J., Zoller Y. (2009). Level of education and category fluency task among Spanish speaking elders: Number of words, clustering, and switching strategies. Neuropsychology, Development, and Cognition. Section B, Aging, Neuropsychology and Cognition, 16(6), 721–744. 10.1080/13825580902912739
- *Rouch I., Wild P., Ansiau D., Marquié J. C. (2005). Shiftwork experience, age and cognitive performance. Ergonomics, 48(10), 1282–1293. 10.1080/00140130500241670
- *Rubin L. H. (2009). Effects of sex steroid hormones on cognition in schizophrenia (Publication No. 3394236) [Doctoral dissertation, University of Illinois at Chicago]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- *Ryan J. P., Atkinson T. M., Dunham K. T. (2004). Sports-related and gender differences on neuropsychological measures of frontal lobe functioning. Clinical Journal of Sport Medicine, 14(1), 18–24. 10.1097/00042752-200401000-00004
- *Sakamoto M. (2009). Comparing Alzheimer’s disease and vascular dementia profiles on neuropsychological tests among Japanese elders (Publication No. 3358503) [Doctoral dissertation, Drexel University]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- *Sakurai H., Hanyu H., Murakami M., Kume K., Takata Y., Onuma T., Akai T., Iwamoto T. (2011). The category “animals” is more appropriate than the category “vegetables” to measure semantic category fluency. Geriatrics & Gerontology International, 11(3), 374–375. 10.1111/j.1447-0594.2010.00667.x
- *Sandel N. (2016). High school lacrosse and soccer players’ neurocognitive performance and symptoms before and after concussion (Publication No. 3738377) [Doctoral dissertation, Widener University]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- *Schallmo M. P., Kassel M. T., Weisenbach S. L., Walker S. J., Guidotti-Breting L. M., Rao J. A., Hazlett K. E., Considine C. M., Sethi G., Vats N., Pecina M., Welsh R. C., Starkman M. N., Giordani B., Langenecker S. A. (2015). A new semantic list learning task to probe functioning of the Papez circuit. Journal of Clinical and Experimental Neuropsychology, 37(8), 816–833. 10.1080/13803395.2015.1052732
- Schmidt M. (1996). Rey auditory verbal learning test: A handbook. Western Psychological Services.
- *Sims R. C. (2007). Neurocognitive correlates of cardiovascular risk factors in Blacks (Publication No. 3299679) [Doctoral dissertation, Howard University]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- *Sinay R. D. (1967). Creative aptitude patterns of college honors students (Publication No. 6807200) [Doctoral dissertation, University of Southern California]. ProQuest Dissertations & Theses Global.
- *Snitz B. E., Unverzagt F. W., Chang C. C., Bilt J. V., Gao S., Saxton J., Hall K. S., Ganguli M. (2009). Effects of age, gender, education and race on two tests of language ability in community-based older adults. International Psychogeriatrics, 21(6), 1051–1062. 10.1017/S1041610209990214
- Sokołowski A., Tyburski E., Sołtys A., Karabanowicz E. (2020). Sex differences in verbal fluency among young adults. Advances in Cognitive Psychology, 16(2), 92–102. 10.5709/acp-0288-1
- *Soleman R. S., Schagen S. E., Veltman D. J., Kreukels B. P., Cohen-Kettenis P. T., Lambalk C. B., Wouters F., Delemarre-van de Waal H. A. (2013). Sex differences in verbal fluency during adolescence: A functional magnetic resonance imaging study in gender dysphoric and control boys and girls. Journal of Sexual Medicine, 10(8), 1969–1977. 10.1111/jsm.12083
- *Sosa A. L., Albanese E., Prince M., Acosta D., Ferri C. P., Guerra M., Huang Y., Jacob K. S., de Rodriguez J. L., Salas A., Yang F., Gaona C., Joteeshwaran A., Rodriguez G., de la Torre G. R., Williams J. D., Stewart R. (2009). Population normative data for the 10/66 Dementia Research Group cognitive test battery from Latin America, India and China: A cross-sectional survey. BMC Neurology, 9, Article 48. 10.1186/1471-2377-9-48
- *Speer P., Wersching H., Bruchmann S., Bracht D., Stehling C., Thielsch M., Knecht S., Lohmann H. (2014). Age- and gender-adjusted normative data for the German version of Rey’s Auditory Verbal Learning Test from healthy subjects aged between 50 and 70 years. Journal of Clinical and Experimental Neuropsychology, 36(1), 32–42. 10.1080/13803395.2013.863834
- Spreen O., Benton A. L. (1977). Neurosensory center comprehensive examination for aphasia: Manual of directions (Rev. ed.). Neuropsychology Laboratory, University of Victoria.
- *Stanulis R. G. (1977). Hemispheric function and memory: Effect of handedness of verbal and imaginal processes (Publication No. 7805228) [Doctoral dissertation, Wayne State University]. ProQuest Dissertations & Theses Global.
- Stevens J. S., Hamann S. (2012). Sex differences in brain activation to emotional stimuli: A meta-analysis of neuroimaging studies. Neuropsychologia, 50(7), 1578–1593. 10.1016/j.neuropsychologia.2012.03.011
- *Stoddard E. (2007). Measuring learning modalities with neuropsychological memory measures in a college population (Publication No. 3275174) [Doctoral dissertation, Drexel University]. ProQuest Dissertations & Theses Global.
- Stoet G., Geary D. C. (2013). Sex differences in mathematics and reading achievement are inversely related: Within- and across-nation assessment of 10 years of PISA data. PLOS ONE, 8(3), Article e57988. 10.1371/journal.pone.0057988
- *Sunderaraman P., Blumen H. M., DeMatteo D., Apa Z. L., Cosentino S. (2013). Task demand influences relationships among sex, clustering strategy, and recall: 16-word versus 9-word list learning tests. Cognitive and Behavioral Neurology, 26(2), 78–84. 10.1097/WNN.0b013e31829de450
- *Sundermann E. E., Biegon A., Rubin L. H., Lipton R. B., Mowrey W., Landau S., Maki P. M., & Alzheimer’s Disease Neuroimaging Initiative. (2016). Better verbal memory in women than men in MCI despite similar levels of hippocampal atrophy. Neurology, 86(15), 1368–1376. 10.1212/WNL.0000000000002570
- *Swift P. T. (1999). Validation of a Spanish language test of verbal learning and memory: The “Perri Test de Aprendizaje Verbal y Memoria” (Publication No. 304531435) [Doctoral dissertation, University of Connecticut]. ProQuest Dissertations & Theses Global.
- *Tallberg I. M., Ivachova E., Jones Tinghag K., Ostberg P. (2008). Swedish norms for word fluency tests: FAS, animals and verbs. Scandinavian Journal of Psychology, 49(5), 479–485. 10.1111/j.1467-9450.2008.00653.x
- *Tanaka T. R. (2005). Gender and ethnic differences on select verbal and visuospatial measures among older European and Japanese Americans (Publication No. 3179491) [Doctoral dissertation, Pacific Graduate School of Psychology]. ProQuest Dissertations & Theses Global.
- *Temple C. M., Cornish K. M. (1993). Recognition memory for words and faces in schoolchildren: A female advantage for words. British Journal of Developmental Psychology, 11(4), 421–426. 10.1111/j.2044-835X.1993.tb00613.x
- *Thilers P. P., MacDonald S. W., Herlitz A. (2007). Sex differences in cognition: The role of handedness. Physiology & Behavior, 92(1–2), 105–109. 10.1016/j.physbeh.2007.05.035
- *Thomas L. L., Curtis A. T., Bolton R. (1978). Sex differences in elicited color lexicon size. Perceptual and Motor Skills, 47(1), 77–78. 10.2466/pms.1978.47.1.77
- *Thornburg H. L. (1973). An investigation of interrelations of abilities in Guilford’s structure-of-intellect (Publication No. 7412212) [Doctoral dissertation, University of Illinois at Urbana-Champaign]. ProQuest Dissertations & Theses Global.
- Thurstone L., Thurstone T. (1962). Primary mental abilities (Rev. ed.). Science Research Associates.
- *Tombaugh T. N., Kozak J., Rees L. (1999). Normative data stratified by age and education for two measures of verbal fluency: FAS and animal naming. Archives of Clinical Neuropsychology, 14(2), 167–177.
- *Tuck S. (2012). The development of verbal fluency in children: An examination of switching, clustering, metastrategic awareness, and timing effects (Publication No. 1530411653) [Doctoral dissertation, York University]. ProQuest Dissertations & Theses Global.
- *Vakil E., Blachstein H. (1997). Rey AVLT: Developmental norms for adults and the sensitivity of different memory measures to age. The Clinical Neuropsychologist, 11(4), 356–369. 10.1080/13854049708400464
- *van Hooren S. A., Valentijn A. M., Bosma H., Ponds R. W., van Boxtel M. P., Jolles J. (2007). Cognitive functioning in healthy older adults aged 64–81: A cohort study into the effects of age, sex, and education. Neuropsychology, Development, and Cognition. Section B, Aging, Neuropsychology and Cognition, 14(1), 40–54. 10.1080/138255890969483
- *Vannorsdall T. D. (2006). White matter hyperintensities: Neuropsychological correlates in a community-based sample (Publication No. 3254864) [Doctoral dissertation, University of Maryland, Baltimore County]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- Voyer D., Postma A., Brake B., Imperato-McGinley J. (2007). Gender differences in object location memory: A meta-analysis. Psychonomic Bulletin & Review, 14(1), 23–38.
- Voyer D., Saint Aubin J., Altman K., Gallant G. (2021). Sex differences in verbal working memory: A systematic review and meta-analysis. Psychological Bulletin, 147(4), 352–398. 10.1037/bul0000320
- Voyer D., Voyer S., Bryden M. P. (1995). Magnitude of sex-differences in spatial abilities—A meta-analysis and consideration of critical variables. Psychological Bulletin, 117(2), 250–270. 10.1037/0033-2909.117.2.250
- Voyer D., Voyer S. D. (2014). Gender differences in scholastic achievement: A meta-analysis. Psychological Bulletin, 140(4), 1174–1204. 10.1037/a0036620
- *Wagner M. D. (1980). Receptive and expressive language differences between lower and middle-income black children and their relationship to performance on tasks requiring spatial and conceptual inferences (Publication No. 8021286) [Doctoral dissertation, Emory University]. ProQuest Dissertations & Theses Global.
- *Walburn K. D. (2014). Effects of physical activity and stress events on a spatial memory task and Stroop task in college students (Publication No. 1563919) [Master’s thesis, University of Nebraska at Omaha]. ProQuest Dissertations & Theses Global.
- Weber D., Dekhtyar S., Herlitz A. (2017). The Flynn effect in Europe—Effects of sex and region. Intelligence, 60, 39–45. 10.1016/j.intell.2016.11.003
- Weber D., Skirbekk V., Freund I., Herlitz A. (2014). The changing face of cognitive gender differences in Europe. Proceedings of the National Academy of Sciences, USA, 111(32), 11673–11678. 10.1073/pnas.1319538111
- Wechsler D. (2008). Wechsler Adult Intelligence Scale (4th ed.). Pearson.
- Wechsler D. (2009). Wechsler Memory Scale IV (WMS-IV). The Psychological Corp.
- *Weiss E. M., Kemmler G., Deisenhammer E. A., Fleischhacker W. W., Delazer M. (2003). Sex differences in cognitive functions. Personality and Individual Differences, 35(4), 863–875. 10.1016/S0191-8869(02)00288-X
- *Weiss E. M., Ragland J. D., Brensinger C. M., Bilker W. B., Deisenhammer E. A., Delazer M. (2006). Sex differences in clustering and switching in verbal fluency tasks. Journal of the International Neuropsychological Society, 12(4), 502–509. 10.1017/S1355617706060656
- *Westcott P. (1969). Age differences in strategies for free recall (Publication No. 7017225) [Doctoral dissertation, Yale University]. ProQuest Dissertations & Theses Global.
- *Wilkosc M., Markowska A., Zajac-Lamparska L., Skibinska M., Szalkowska A., Araszkiewicz A. (2016). A lack of correlation between brain-derived neurotrophic factor serum level and verbal memory performance in healthy Polish population. Frontiers in Neural Circuits, 10, Article 39. 10.3389/fncir.2016.00039
- *Willis M. P. (1997). Cognitive capacities that contribute to 5- and 6-year-old children’s ability to demonstrate nonverbal and verbal understanding of spatial relational polar opposites (Publication No. 9728323) [Doctoral dissertation, Columbia University]. ProQuest Dissertations & Theses Global.
- Wilson R. C., Guilford J. P., Christensen P. R., Lewis D. J. (1954). A factor-analytic study of creative-thinking abilities. Psychometrika, 19(4), 297–311. 10.1007/BF02289230
- *Wolkenberg F. A. (1999). Differences between men and women in a lexical decision task and the role of hemispheric processing (Publication No. 9941846) [Doctoral dissertation, Emory University]. ProQuest Dissertations & Theses Global.
- Wormack L. (1979). Cognitive predictors of articulation in writing. Perceptual and Motor Skills, 48(3, Suppl.), 1151–1156. 10.2466/pms.1979.48.3c.1151
- *Wulandari S. W., Hendrawan D. (2020). Trust your abilities more than the stereotype: Effect of gender-stereotype threat and task difficulty on word production, clustering, and switching in letter fluency. Pertanika Journal of Social Sciences and Humanities, 28(4), Article 2567. 10.47836/pjssh.28.4.05
- *Xu H. (2018). Association between migration and cognitive function among middle aged and older adults: A comparison between China and India (Publication No. 2039532064) [Doctoral dissertation, Duke University]. ProQuest Dissertations & Theses Global.
- *Yeudall L. T., Fromm D., Reddon J. R., Stefanyk W. O. (1986). Normative data stratified by age and sex for 12 neuropsychological tests. Journal of Clinical Psychology, 42(6), 918–946.
- *Yi A. S. (2007). A methodological approach to gender differences in cognition among older adults (Publication No. 3262586) [Doctoral dissertation, Fuller Theological Seminary, School of Psychology]. Health Research Premium Collection; ProQuest Central; ProQuest Dissertations & Theses Global.
- *Yonker J. E., Eriksson E., Nilsson L. G., Herlitz A. (2003). Sex differences in episodic memory: Minimal influence of estradiol. Brain and Cognition, 52(2), 231–238. 10.1016/s0278-2626(03)00074-5
- *Young M. C. (2002). Anterior aphasia as a natural category of acquired cognitive-communicative impairment: Implications for cognitive neurolinguistic theory, experimental methods, and clinical practice (Publication No. 3089491) [Doctoral dissertation, The University of Texas at Austin]. ProQuest Dissertations & Theses Global.
- Zell E., Krizan Z., Teeter S. R. (2015). Evaluating gender similarities and differences using metasynthesis. American Psychologist, 70(1), 10–20. 10.1037/a0038208
- Zhao Q., Guo Q., Hong Z. (2013). Clustering and switching during a semantic verbal fluency test contribute to differential diagnosis of cognitive impairment. Neuroscience Bulletin, 29(1), 75–82. 10.1007/s12264-013-1301-7
- *Zhu W. (2015). Association of objectively measured physical activity with cognitive function in Black and White older adults: Reasons for Geographic and Racial Differences in Stroke (REGARDS) study (Publication No. 3701737) [Doctoral dissertation, Arizona State University]. ProQuest Dissertations & Theses Global.
- *Zoccoli S. L. (2005). The COWA: An empirical study of phonemic and semantic verbal fluency (Publication No. 305434002) [Master’s thesis, Southern Methodist University]. ProQuest Dissertations & Theses Global.
Supplementary Materials
Supplemental material, sj-docx-1-pps-10.1177_17456916221082116 for Sex/Gender Differences in Verbal Fluency and Verbal-Episodic Memory: A Meta-Analysis by Marco Hirnstein, Josephine Stuebs, Angelica Moè and Markus Hausmann in Perspectives on Psychological Science



