Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Aug 1.
Published in final edited form as: Mem Cognit. 2016 Aug;44(6):922–936. doi: 10.3758/s13421-016-0608-9

Variation in Strategy Use Across Measures of Verbal Working Memory

Alexandra B Morrison 2,1, Gail M Rosenbaum 2,1, Damien Fair 3, Jason M Chein 2
PMCID: PMC4976054  NIHMSID: NIHMS774754  PMID: 27038310

Abstract

The working memory (WM) literature contains a number of tasks that vary on dimensions such as when or how memory items are reported. In addition to the ways in which WM tasks are designed to differ, tasks may also diverge according to the strategies participants use during task performance. The present study included seven tasks from the WM literature, each requiring short-term retention of verbal items. Following completion of a small number of trials from each task, individuals completed a self-report questionnaire to identify their primary strategy. Results indicated substantial variation across individuals for a given task, and within the same individual across tasks. Moreover, while direct comparisons between tasks showed that some tasks evinced similar patterns of strategy use despite differing task demands, others showed markedly different patterns of self-reported strategy use. A community detection algorithm aimed at identifying groups of individuals based on their profile of strategic choices revealed unique communities of individuals who are dependent on specific strategies under varying demands. Together, the findings suggest that researchers using common working memory paradigms should very carefully consider the implications of variation in strategy use when interpreting their findings.


A diverse range of topics in the cognitive and psychological sciences involve the study of short-term working memory (WM) – a system often conceptualized as a flexible mental workspace used for the storage and manipulation of information during the planning and execution of everyday cognitive behaviors (e.g., Baddeley, 2003; Cowan, 2001; Logie, 2003). Accordingly, there is widespread use of tasks designed to index the functioning of WM, and over the past few decades, the number and variety of WM assessment tasks has proliferated. With such a widely utilized, yet diverse, collection of WM tasks, an obvious question arises: How similar are these tasks to one another with respect to the mental processes, and strategies, that they evoke? While traditional behavioral research (Cowan, 2001; Luck & Vogel, 2013), psychometric studies (Engle, Tuholski, Laughlin, & Conway, 1999; Kane et al., 2004), and cognitive neuroimaging investigations (Nee et al., 2013; Wager & Smith, 2003) all provide some evidence that distinct WM tasks do draw upon shared processes, they also provide evidence that these tasks can operate quite differently from one another with respect to the behavioral phenomena they elicit (Morrison, Conway, & Chein, 2014; Oberauer, 2003; Ricker & Cowan, 2014), the variance they account for across individuals (Unsworth, Fukuda, Awh, & Vogel, 2014), and the specific brain circuitry that they activate (Chein, Moore, & Conway, 2011; Henson, Shallice, Gorno-Tempini, & Dolan, 2002; Zanto, Clapp, Rubens, Karlsson, & Gazzaley, 2016).

One likely, but under-investigated, source of these differences regards the cognitive strategies that individuals deploy when performing alternative WM tasks. Although there has been some limited exploration of strategy use in various WM paradigms (as discussed in greater detail below), many studies seem to be implemented with little thought to the issue of potential variation in strategy use. Others seem to assume, usually implicitly, that there is a de facto strategy associated with a given WM task, and that engagement in this strategy is more or less uniform across individuals. In fact, WM measures are sometimes explicitly chosen from among the alternatives because of their assumed strategic properties (i.e., the task is assumed to demand, or to preclude, some particular strategy), and interpretations frequently depend directly on the assumed, but usually not demonstrated or assessed, engagement (or non-engagement) of a particular strategy (e.g. phonological rehearsal of memory items).

In the present study, we set out to explore the distribution of self-reported strategy use across a varied set of WM assessment measures with the goal of providing a more extensive description of how strategy selection varies both within and across tasks, and among individuals. To this end, we sampled strategy patterns across a number of WM measures tested in the same population of individuals, in the same experimental setting, and using the same extended pool of verbal stimuli. Where possible, these tasks were designed to differ systematically with respect to specific properties, thus allowing for an evaluation of the particular features of a task that are most closely associated with alternative strategies.

WM Tasks and Their Properties

Working memory research emerged historically from the study of short-term memory; with the emphasis shifting from simple, passive, storage of information (short-term memory) to a focus on the ability to operate upon, and mentally “work” with that information in the service of ongoing cognition (working memory) (Baddeley & Hitch, 1974). Despite the shift in emphasis, many tasks that were originally conceived as assessments of short-term memory, including variants of the Immediate Serial Recall and Sternberg Item Recognition tasks, remain among the most widely used measures in the working memory literature (see D’Esposito & Postle, 2015; Hurlstone, Hitch, & Baddeley, 2014; Jonides et al., 2008; Logie, DellaSala, Laiacona, Chalmers, & Wynn, 1996; Oberauer & Lewandowsky, 2008). However, the shift in emphasis also brought with it the introduction of new tasks into the WM researcher’s arsenal (and increased the use of some previously less common tasks), including, for example, variants of the Complex Working Memory Span task (Conway et al., 2005; Daneman & Carpenter, 1980; Oberauer, Lewandowsky, Farrell, Jarrold, & Greaves, 2012; Redick et al., 2012; Unsworth, Heitz, Schrock, & Engle, 2005) and Running Memory Span task (Broadway & Engle, 2010; Bunting, Cowan, & Saults, 2006; Cowan et al., 2005; Pollack, Johnson, & Knaff, 1959).

To pursue ever more nuanced aspects of the processes that underlie WM and its capacity limits, researchers have designed and deployed an increasingly extensive array of different working memory measures. While there are far too many WM paradigms and variants to list them exhaustively, a subset of these tasks is shown schematically in Figure 1. All of these paradigms share the core requirement that some type of information must be maintained over the short term, and it is this essential feature that makes them “working memory” tasks. Moreover, a robust literature establishes that this shared aspect of WM tasks at least partially explains why they typically correlate positively with one another, and why they significantly predict individual differences in many complex cognitive abilities, such as reading comprehension, problem solving, and fluid intelligence (Kane et al., 2004; Unsworth et al., 2014).

Figure 1.

Figure 1

Schematic of the seven working memory tasks included in the present study.

Unfortunately, our understanding of the ways in which seemingly similar WM tasks diverge, and the implications of this divergence, is limited. We do know that task differences are important – for instance, even fundamental characteristics of WM, such as capacity estimates (see Cowan, 2001 for review), forgetting rates (Ricker & Cowan, 2014) and primacy and recency effects (Morrison et al., 2014; Oberauer, 2003) are known to vary across tasks – but we know surprisingly little about which specific qualities of each WM paradigm have this influence on the observed findings. One way to gain some traction in our understanding is to engage in task analysis (to evaluate the specific characteristics of a given task implementation) and to explore how task demands affect the strategies that a given task elicits or encourages.

Some dimensions of WM task variation can be fairly readily characterized (Figure 2 provides a list of some of these dimensions). For example, an obvious difference between Immediate Serial Recall and Item Recognition tasks lies in the way in which memory is reported (i.e. response demands); as their names suggest, Immediate Serial Recall requires recall, while Item Recognition requires recognition. These two tasks additionally differ in terms of their emphasis on the serial order in which memoranda are maintained and reported; Immediate Serial Recall requires ordered recollection, while Item Recognition does not. To retain the requirement for item recall (as in Immediate Serial Recall), but without the emphasis on serial order of presentation, some researchers have instead used a Free Recall paradigm (Beaman & Jones, 1998; Marsh, Sorqvist, Hodgetts, Beaman, & Jones, 2015; Neely & LeCompte, 1999; Unsworth, Spillers, & Brewer, 2010) in which participants are allowed to remember and report items in whatever order they desire. Another factor differentiating WM paradigms regards the duration of the retention interval, with some tasks involving the assessment of memory (essentially) immediately after presentation (as in Immediate Serial Recall), and others imposing a longer, though still “short-term”, delay interval between initial presentation and retrieval (as in Delayed Serial Recall (e.g., Chein & Fiez, 2001; Farrell, 2006). WM tasks are also distinguished by the presence or absence of intervening distractors (i.e. extraneous stimuli or processing requirements) and the relative need for memory updating (replacing the previous contents of WM with newly relevant information). For instance, Complex Span tasks (e.g., Conway et al., 2005) require that information be maintained and updated across periods of responding to a secondary processing task, while Running Memory Span (e.g., Bunting et al., 2006) tasks require that the contents of WM be continuously updated such that earlier (now extraneous) information is replaced by more recently presented information.

Figure 2.

Figure 2

Top: Demands present or absent in each task. Bottom: Graph of percentage of participants employing each of the more common strategy responses. “Other” includes participants who reported a strategy other than one listed, as well as those who reported less common strategies, such as Look and Sound. ISR = Immediate Serial Recall; DSR = Delayed Serial Recall; CWMS = Complex Working Memory Span; IR = Item Recognition; FR = Free Recall; RMS = Running Memory Span; MI = Missing Item.

There are, of course, many other dimensions that can be varied through the specific parameterization and implementation of a given WM paradigm; stimulus qualities, speed of item presentation, modality of presentation, and list length, to name a few. These factors also undoubtedly influence the specific processes and strategies that are evoked in support of task performance. Since there is an almost infinite number of WM tasks that can be created by systematically varying each factor, one cannot hope to exhaustively categorize all possible variants of WM tasks. However, by beginning with a finite and relatively common set of WM tasks and parameterizations, we can begin to understand whether (and how) strategy use varies as a function of the specific elements of a WM task.

Assessment of WM Strategies

There is a small but informative extant literature on strategy use in WM tasks, with various different approaches used to collect strategy information. These approaches include: 1) investigating the behavioral impacts of instructing participants to engage in a predetermined strategy (Carretti, Borella, & De Beni, 2007; McNamara & Scott, 2001; St Clair-Thompson, Stevens, Hunt, & Bolder, 2010; Turley-Ames & Whitfield, 2003), 2) allowing participants to control the pace of item presentation and measuring looking time as a correlate of the amount of strategic processing (Engle, Cantor, & Carullo, 1992), 3) designing stimuli in a manner that is more amenable to certain strategies, like grouping items or organizing them based on prior knowledge (Bor, Cumming, Scott, & Owen, 2004; Bor & Owen, 2007), 4) assessing overt verbal behaviors during the task’s inter-trial intervals (Lehmann & Hasselhorn, 2007), and 5) asking participants to self-report their strategy following each individual trial or group of trials, through either open-ended or structured questionnaires (e.g., Dunlosky & Kane, 2007; Richardson, 1998)

Studies using the last approach are the most common, and have already been conducted with Immediate Serial Recall (Logie et al., 1996), Free Recall (Hertzog, McGuire, & Lineweaver, 1998) and Complex Working Memory Span measures such as Operation Span and Reading Span (Bailey, Dunlosky, & Kane, 2011; Dunlosky & Kane, 2007; Friedman & Miyake, 2004). These prior studies are informative in several ways. First, they provide an initial characterization of the range of specific strategies that might be evoked when performing WM tasks. Prior investigations also consistently show that strategy use is not homogenous even within a given task, and that performance can be contingent on strategy choice – strategies such as visual imagery, forming sentences with memory items, and grouping items have been found to be normatively effective strategies, while simply reading or repeating (i.e., phonologically rehearsing) memory items has been found to be normatively less effective (Bailey et al., 2011; Dunlosky & Kane, 2007). Strategy use is also found to be a contributing factor to the relationship between performance on WM tasks and other types of cognitive tasks (e.g., reading comprehension and episodic memory), in particular when the same strategy can be used to support performance in both the WM and additionally assessed cognitive tasks (Bailey, Dunlosky, & Kane, 2008). Furthermore, flexible use of strategies may be a characteristic of higher performing individuals (Dunlosky & Kane, 2007).

Although this body of prior work provides an important foundation for the present investigation, the specific methods used in past studies have important limitations with respect to the present study objectives. Here we aim to inform researchers asking questions such as, “If I choose a certain WM task, what types of strategies should I expect from participants?” and “To what extent can I assume that alternative WM tasks are interchangeable and will produce comparable distributions of strategy use?” Extensions of this research may ultimately help the field answer questions such as “Which WM task is best suited for my research question?” Results from prior studies are helpful, but limited because of the restricted range of tasks considered, and because of variation in the experimental context, stimuli, and strategy assessment methods used across studies. Accordingly, the present study was designed to assess a range of self-reported strategies for a larger set of verbal WM tasks, each assessed under the same basic experimental conditions. Further, while prior work on WM strategy use has focused on the relationship between performance and strategy choice, our immediate focus is not on the way in which strategy use can optimize performance, but on the ways in which experimental design factors can impact strategy use.

Present Study

The present work explores differences in strategy use in seven verbal WM tasks by examining differences among individuals in the same task, and within individuals across multiple tasks. We examine the data from several points of view. With respect to tasks, we evaluate the distribution of strategies employed for a given task, the relative uniformity of intra-task strategy choice, inter-task variation in strategy selection, and the relationships between strategy choice and specific task dimensions/characteristics. With respect to individuals, we evaluate consistency of strategy choice across tasks, and the existence of communities of individuals who behave similarly to one another across variations in task demands.

Method

Participants

Two-hundred and twenty Temple University undergraduate students participated in this study (137 female, 76 male, 7 did not specify their sex) with a mean age just under 21 years (M = 20.80, SD = 4.67, range 17 to 56). All participants were awarded course credit.

Working Memory Tasks

Selection of Tasks

Each participant completed seven verbal working memory tasks (see Figure 1). This subset of tasks was selected following a review of the WM literature, which produced an extensive archive of tasks, varying on many dimensions. Inclusion of a particular WM measure in the present study was based on three constraints: 1) The frequency of use in the literature – with the aim of including more common tasks, 2) The ability to form relatively parallel/analogous versions of each paradigm in a way that would support task comparisons and isolate specific relevant task properties, and 3) The amount of time needed to instruct and execute each measure. It was with these constraints in mind that the following six tasks were selected: Immediate Serial Recall, Delayed Serial Recall, Free Recall, Item Recognition, Running Memory Span, and Complex Span. These six tasks are alike in many respects, but vary on specific dimensions, as clarified in Figure 2.

A seventh task, the Missing Item task (Beaman & Jones, 1997; Buschke, 1963; Jones & Macken, 1993), was selected to be more disparate from the other six tasks, and hence, to evoke what we expected would be a demonstrably different pattern of self-reported strategy use, thus serving as a control task for the sensitivity of the self-report strategy measure to variation in strategy use. The missing item task differs from the other tasks we administered in several ways. In the present study it was the only task that required a different set of stimuli (digits spelled out zero through nine). Rather than requiring the retention of a novel set of words, this task relies on memory for a set of items known to the participants prior to the onset of a trial, and with an inherent ordering structure that differs from the order of presentation during the task. Since the complete stimulus set is known a priori, and all but one item from the set is presented in each trial, participants are thought to use a unique strategy to support task performance, wherein each presented item is mentally noted and “checked off” the mental list of potential items (Beaman & Jones, 1997).

Task Parameterization

The specific parameters for each of the seven tasks were selected with two goals in mind: consistency with methods from the prior literature, and maximal matching of parameters in order to support comparison of tasks differing only according to key variables of interest. To support comparison of strategies across tasks, a common set of procedures and stimuli were used throughout - all tasks required memory for verbal memoranda, were presented visually, and used sequential, rather than simultaneous (a single array of items), presentation. Items were also always presented for one second each, with no gap between successive stimulus presentations (no inter-stimulus-interval) in any of the tasks, with the exception of the Complex Span task wherein an intervening processing interval imposed a necessary inter-stimulus delay (but there was still no interval between the memory items and the processing task).

The goal of maximizing the match between tasks sometimes required deviation from the most typical implementation parameters in the extant literature, and the most notable of these deviations are detailed below. Likewise, in order to maintain consistency with the past literature while also supporting certain pairwise task comparisons it was necessary at times to set parameters (e.g., list length, timing of processing decisions in the operation span task, the inclusion a delay prior to retrieval) in such a way that actually confounded pair-wise comparisons between more disparate tasks. Again, these potential confounds are considered in the discussion.

Stimuli and Procedure

All seven tasks involved memory for verbal items. Word items used for all but the missing item task were selected from the MRC Psycholinguistic Database (Wilson, 1988) with the following constraints: stimuli were all one syllable, contained a maximum of six letters, had an imagability score of at least 500, and a written frequency of at least 50. Within these constraints, 70 words were selected (see Supplemental Materials A). For each task, one of the lists of 10 phonologically distinct words was selected, except for Running Memory Span (which required 20 words per trial; see task explanation below) and the Missing Item task (which used number words as stimuli). For a given task the same word list was used repeatedly across a practice trial and six experimental trials. On a given task trial, items were sampled randomly without replacement from among the items on the selected list. The missing item task necessarily involves memory for a known set of items, and accordingly, the digits 1 through 9 spelled out as words (e.g. one, two, three) were always used for this task.

Individuals participated in a single one-hour session. Sessions were run in groups of between two and ten participants. Each session consisted of all seven working memory tasks. Task stimuli were projected onto a 5’ × 7’ screen. Participants viewed the screen from 7–10 feet away. Words were presented in white on a black background in Arial font size 26, subtending approximately 0.44×1.42–2.48 degrees of visual angle (depending on the length of the word). For all tasks, responses were provided in written form, and given on a task-appropriate response sheet provided immediately prior to testing of each task. For recall tasks, the response sheets included blank spaces where each of the presented words could be written for each trial. For the one recognition task (item recognition), a checkbox for “yes” and “no” responses could be checked off for each trial. For the Complex Span task, additional slots were included for responding to the intervening math processing items.

For each task, participants were instructed verbally and additionally read written instructions. Every task included a practice trial, followed by a brief question and answer period to insure understanding. Visibility of stimuli was confirmed by all participants following the practice trial for each task, and this was followed by 6 experimental trials for each task. Following each task, participants completed a strategy questionnaire for that task only, and after the completion of the strategy questionnaire, participants were introduced to the next task. This cycle of tasks and questionnaires continued for the remainder of the session.

The order of task presentation was counterbalanced across experimental group sessions. Based on the counterbalancing scheme, each individual task occurred sometimes as the first task in the session (i.e. it was the first task attempted by participants, and was therefore the first task for which strategy assessment was conducted), and sometimes as a later task in the session. The counterbalancing also assured a varied sequence of task ordering for each session, such that task history would not have a systematic impact on strategy selection.

Tasks

Immediate Serial Recall

Participants were shown six words, one at a time, for one second each, without an inter-stimulus interval (ISI). Immediately following presentation of the last item, participants were prompted, by an icon showing a hand in the act of writing, to report all six items in the order in which they had been presented.

Item Recognition

This task differed from the Immediate Serial Recall task in only the response requirements. Six items were presented one at a time, for one second each, without an ISI. At the conclusion of each trial, participants were then shown a single probe word and asked to respond “yes” if the word was on the list and “no” if it was not. The word was present on the list 50% of the time and absent 50% of the time (lures were chosen from the 4 unsampled stimulus items on the 10-item list). A set size of six was chosen to maintain a consistent set size between the Immediate Serial Recall task and the Item Recognition task. While Item Recognition tasks are often implemented with simultaneous presentation of the memoranda, for consistency with other tasks we used sequential presentation.

Delayed Serial Recall

Nine words were displayed one at a time, for one second each, and with no ISI. Following item presentation there was a delay of 10 seconds, during which a fixation cross was presented on the center of the screen. After the delay, participants were prompted (by icon) to recall the nine words in serial order.

Free Recall

The free recall task was identical to the Delayed Serial Recall task, with the exception of the response requirements. Nine words were presented sequentially, for one second each and with no ISI. After a 10 second delay/fixation period, participants were prompted to report the words in any order they chose. While the free recall task has been previously implemented with and without a delay (e.g., Beaman & Jones, 1998), a delay before recall was included here to allow for a close comparison between the free recall task and the delayed serial recall task. Although atypical for this task in the literature, in order to be consistent with the other tasks, stimuli were also sampled from the same 10-item list of words across all six trials (and thus, although presentation order varied, only one item differed from one trial to the next). Since participants were free to recall the items in any order, it could be expected that free recall of the items would become relatively stereotyped, and more accurate, across trials.

Complex Span

Participants were shown five words and, between the presentation of each word, were asked to judge the veracity of a solved arithmetic equation (e.g. (5+3)/2=4). Memoranda were displayed for 1 second each. In prototypical versions of this type of Complex Span task (referred to as the Operation Span task), either an experimenter insures that each successive item is presented immediately after responding to the equation is completed (Kane & Engle, 2003), or automated timing is tailored to an individual’s speed of solving a set of practiced mathematical operations (Unsworth et al., 2005). In order to present the task to a group of participants, we allotted a fixed time for participants to solve the math problems (6 s), based on the average response duration obtained from a large sample (N>100) of prior participants who had completed the automated version of the task in our research lab (Chein, unpublished data).

Running Memory Span

This task tests memory for the final items of a list of unpredictable length. Successive stimuli were presented for one second each, one at a time, and with no ISI. The length of the list varied unpredictably across trials (12–20 items); thus, we used a pool of 20 words for this task (rather than 10, as in other tasks). When the list ended, participants were prompted (by icon) to report the last six words that had been shown, in the presented order.

Missing Item

For the Missing Item task, the set of digits zero through nine were spelled out as words. Nine of the ten digits were presented for one second each, sequentially, and without an ISI. After item presentation was completed, participants were prompted to report the single item that was not presented in the trial.

Strategy Assessment and Distribution Analyses

A strategy questionnaire was administered immediately following each task (see Supplemental Materials A). The 10 strategies included on the questionnaire were constructed based on a comprehensive search through the strategy literature, open ended strategy reports provided by participants at the conclusion of a one month working memory training study where training included complex span working memory tasks (Chein & Morrison, 2010)1, and open ended strategy reporting provided during a brief pilot of the present experiment. The 10 strategies identified from these sources were assessed via endorsement of the following statements, with the words in bold lettering intended to highlight the characteristic quality of each strategy (words shown in parenthesis after each statement indicate our nomenclature for these strategies, but were not shown on the questionnaire): I silently repeated the items (Rehearsal), I remembered the words in groups (Grouping), I used the meaning of the words to remember or connect them (Semantic), I thought about other things that could relate to the words (Association), I pictured the way the words looked on the screen (Look), I created a visual image based on the meaning of the words (Imagery), I thought about the way the words sounded (Sound), I simply concentrated on the words (Concentrate), I answered based on what words seemed recent or familiar (Familiarity), I expected certain words to appear and mentally checked them off as they arrived (Checklist). On the questionnaire, participants could also indicate that they had used another unidentified strategy, no particular strategy, or didn’t understand the task instructions/demands.

The order in which the list of strategies was presented was counterbalanced across sessions so that selection would not be biased according to location in the list (i.e., because a given strategy appeared more often near the top or bottom of the page). However, for a given participant, the strategy list always appeared in the same order throughout the session. During analysis it was determined that some statements were endorsed very infrequently, and the associated strategies were ultimately collapsed into a single Other category along with any additional previously unidentified strategies that participants described. Additionally, the None category was grouped with Concentrate (see Results for further explanation of collapsed categories).

The strategy questionnaire included two pages, intended to assess primary and secondary strategy use, respectively. On page one of the questionnaire, participants were asked to endorse a single strategy from the list as the strategy that best described their approach to the task. This selection was considered their “primary strategy.” On page two, which was identical to the first page with the exception of the instructions at the top of the page, participants were asked to indicate other strategies that were used to a lesser extent (i.e., secondary strategies) during task performance. The current paper focuses on primary strategies only. Secondary strategies were very erratically and variably reported (e.g., many participants never reported them, while others reported using nearly every strategy at some point within a single task). We were accordingly concerned about the validity and reliability of this component of the instrument, and opted to focus exclusively on primary strategy use.

Main analyses utilized chi-square testing to determine which tasks evoked consistent and varied patterns of strategy choice. Additional descriptive analyses explored within-individual variation in strategy choice across specific task pairings. Finally, as a further way to identify patterns within the strategy choice data, we also deployed community detection algorithms based on graph theory (using the Connectivity Toolbox in Matlab; [Rubinov & Sporns, 2010]). These data-driven algorithms allowed us to explore whether participants naturally grouped into “communities” based on their across-task strategy distribution profiles. A detailed explanation of these analyses can be found in Supplemental Materials E.

Results

Strategy Variation within and across Tasks

An initial aim of the present study was to provide a characterization of the range and distribution of strategies that individuals engage when performing alternative WM tasks. Across all tasks, the strategy assessment data indicated that certain strategies were often reported as a primary strategy, while others were rarely reported. In order to support direct comparisons of the strategy distribution patterns across tasks, infrequently reported strategies (i.e. those reported less than 2.5% of the time, representing less than about 35 total reports of use) were collapsed together with cases where participants reported using a strategy not listed (those reporting Other) to create a single Other category (Suplemental Materials B provides a chart with the complete, uncollapsed data). Collapsed strategies included Look, Sound, and Association.

Additionally, some participants reported using no strategy (selected None) on certain tasks (2.23 %). Because it was reported infrequently, we considered grouping these responses in the Other category. Alternatively, it occurred to us in retrospect that the None response might be used in a fashion equivalent to the Concentrate option. Like concentrate, selection of None could indicate a “default” approach to the task at hand with no other supplementary strategy. The latter grouping of None was corroborated by the fact that the frequency of use of these two strategy responses (None, Concentrate) was strongly correlated across tasks (r=.64, p =.06 one-tailed; the correlation does not reach significance because there were only 7 contributing cases). Accordingly, in final analyses we grouped, these two response types (None and Concentrate) into a single class.2 Last, data were treated as missing for the rare occasions when participants reported that they did not understand the task (1.33% of the time for each task, on average).

At an initial, descriptive, level there were several similarities in strategy distribution across all of the tasks (see Figure 2 for a graphical representation of the strategy distributions, and Supplemental Materials B for an uncollapsed version, as well as a table with exact percentages). Unsurprisingly, the most commonly reported strategy was Rehearsal, making up 39.85% of all strategy reports. Grouping was the next most popular strategy, comprising 18.87% of primary strategy reports. In fact, for all tasks except the Missing Item task, Rehearsal and Grouping made up at least half of the primary strategies reported.

Despite commonalities across tasks, there were also notable differences. Chi-square tests were used to determine whether strategy use differed significantly across tasks (see Table 1). Indeed, even though the majority of the tasks shared the two most popular strategies, a chi-square test including all task strategy frequencies indicated significant differences, (χ2(42) = 455.45, p < .001). As anticipated, Missing Item showed the most disparate strategy distribution, and pairwise chi-square tests confirmed that the frequency of strategy choices in the Missing Item task differed significantly from every other task strategy distribution (all ps < .001). Of note, participants reported using the Checklist strategy more often than in any other task (24.63% in Missing Item vs. 1.82% across all other tasks), though this strategy was by no means universal in this task.

Table 1.

Chi-square results.

χ2 df P
All tasks 455.45 42 <.001
Without MI 189.45 35 <.001
ISR vs. IR 25.83 7 <.001
ISR vs. CWMS 4.69 7 <.001
ISR vs. DSR 8.40 7 .30
ISR vs. FR 40.57 7 <.001
ISR vs. RMS 65.09 7 <.001
ISR vs. MI 139.84 7 <.001
IR vs. CWMS 27.05 7 <.001
IR vs. DSR 19.94 7 .006
IR vs. FR 9.72 7 .21
IR vs. RMS 33.94 7 <.001
IR vs. MI 76.51 7 <.001
CWMS vs. DSR 23.39 7 <.002
CWMS vs. FR 42.80 7 <.001
CWMS vs. RMS 57.79 7 <.001
CWMS vs. MI 135.73 7 <.001
DSR vs. FR 33.02 7 <.001
DSR vs. RMS 52.34 7 <.001
DSR vs. MI 130.80 7 <.001
FR vs. RMS 24.77 7 <.001
FR vs. MI 74.50 7 <.001
RMS vs. MI 112.11 7 <.001

Because the Missing Item strategy distribution was so different from the other tasks, we repeated chi-square analyses while excluding the Missing Item task, which still produced a highly significant result (χ2(35) = 189.45, p < .001). To further explain this omnibus effect, additional chi-square tests were conducted comparing the strategy differences between pairs of tasks. Including the pair-wise comparisons between Missing Item and every other task, we ran a total of 21 pair-wise chi-squares. To control for family-wise error rate, we used a Bonferroni-corrected cutoff p-value of .0024 (.05/21=.0024). Of interest, only three of the 21 tests did not reach significance even after correction: Immediate Serial Recall and Delayed Serial Recall (χ2(7) = 8.40, p =.30); Item Recognition and Free Recall, (χ2(7) = 9.72, p =.21); and Item Recognition and Delayed Serial Recall (χ2(7) = 19.94, p =.006). Chi-square tests were significant for all other pairs of tasks, suggesting that the strategy distributions across most tasks demonstrated significant variability.

To examine which task dimensions influenced strategy selection and which did not, we conducted an additional series of chi-square analyses based on the following specific task dimensions (see Figure 2 & Table 2): delay length (immediate vs. delayed), list length (short vs. long), order requirement (ordered vs. free), response demands (recall vs. recognition), and updating/distraction demands (passive storage vs. storage with updating/distraction). Tasks falling within each dimensional class were combined, and chi-square values computed. Strategy distributions did not differ based on delay or list length (ps > .10). However, strategy selection did vary based on serial order requirements (ordering), response demands (recall), and the updating/distraction dimension (ps < .001).

Table 2.

Chi-squares collapsed across task dimensions.

Dimension Tasks + Tasks − χ2 df p
Delay DSR, FR ISR, IR, CWMS, RMS 11.35 7 .12
Ordering ISR, CWMS, DSR, RMS IR, FR 41.08 7 <.001
Recall ISR, CWMS, DSR, FR IR, RMS 49.79 7 <.001
Updating/
Distraction
CWMS, RMS ISR, IR, DSR, FR 40.77 7 <.001
List Length 6 items – ISR, IR 9 items – DSR, FR 11.94 7 .10

Exploration of Cross-Task and Strategy “Contamination”

Our experimental design necessitated that participants complete several WM tasks successively, providing a strategy report after each task. This design made it possible that earlier exposure to the strategy-use questionnaire could influence participants’ strategy choice in subsequent tasks (e.g., introduce novel strategies that the participant might want to try later in the session). Participants’ strategy reports may also have been influenced by exposure to prior tasks in the session. To statistically address these possible “contamination” effects, we ran additional chi-square tests comparing the distribution of reported strategies when a task occurred as the first task in the session (insuring no prior to exposure to the strategy report sheet or other tasks) to the distribution when the same task was presented later in the experiment. Only one of the tasks (IR) significantly differed in strategy distribution when it was presented first compared to later (χ2=18.51, p = .01). However, this test did not survive correction for multiple comparisons (.05/7 tasks = p <.007). Therefore, there is very limited evidence that a different variety of strategies were reported after exposure to the strategy sheet and/or other tasks. Stacked graphs displaying strategy reports for participants’ completing each task as the first in the session, compared to those who completed the same task later in the session, are presented in Supplemental Materials C.

Task Performance as a Function of Strategy

Table 3 summarizes performance in each of the WM tasks. While the main focus of the present study was to investigate strategy use and variation across WM tasks, some prior work has found a link between participants’ strategy use and their task performance (Bailey et al., 2011; Dunlosky & Kane, 2007). Accordingly, we ran a series of tests to investigate whether task performance varied systematically with strategy choice. Specifically, for each task independently, we conducted a one-way ANOVA on task accuracy, treating strategy choice as a between-subjects factor. These analyses indicated no significant variation in performance as a function of strategy choice (all Bonferroni-corrected and uncorrected ps > .1). Additional post-hoc analysis of the performance data is reported in Supplemental Materials D.

Table 3.

Mean and Standard Deviation of Response Accuracy for Each Task.

Task Mean (SD)
Immediate Serial Recall (avg. % recalled in position, out of 6 items) 66% (21)
Delayed Serial Recall (avg. % recalled in position, out of 9 items) 33% (15)
Complex Working Memory Span (avg. % recalled in position, out of 5 items) 75% (3)
Item Recognition (avg. % old/new recognition accuracy) 88% (19)
Free Recall (avg. % recalled out of 9 items) 66% (15)
Running Memory Span (avg. % recalled in position, out of 6 items) 35% (16)
Missing Item (avg. % items accurately checked-off as missing) 59% (26)

Strategy Variation across Individuals

We were not only interested in how certain tasks encourage the use of distinct strategies, but also in how individuals choose different strategies across tasks. It is possible that some participants are more likely to keep strategy use constant across tasks, while others are more likely to switch strategies depending on the task at hand. Figure 3 shows frequency counts detailing the correspondence (data falling on the main diagonal), or lack of correspondence (data falling off of the diagonal) in individuals’ strategy choices across several task pairings. Figure 3a displays the correspondence between Immediate Serial Recall and Delayed Serial Recall, whose overall strategy distributions did not significantly differ from one another. Despite the similar distributions, about half of the participants indicated use of different primary strategies (i.e. switched strategies) to perform the two tasks. In Item Recognition compared to Free Recall (Figure 3b), whose strategy distributions also did not significantly differ, only about a third of participants maintained their strategy across the tasks. So, it seems that even when the overall distribution of strategies is similar across tasks, individuals often do not maintain consistent strategies across tasks. Indeed, the degree of individual variation is comparable in those cases to that shown in Figure 3c, which displays the correspondence between Immediate Serial Recall and Operation Span, which were significantly different in overall strategy distribution.

Figure 3.

Figure 3

Figure 3

Number of participants using each strategy in a. Immediate Serial Recall compared to Delayed Serial Recall; b. Item Recognition compared to Free Recall; and c. Immediate Serial Recall compared to Operation Span. R=Recall; G=Grouping; S=Semantics; I=Image; C=Concentrate; Rec=Recent; Ch=checklist; O=Other

Strategy Use Profiles

The lack of within-individual consistency, especially for tasks with similar overall strategy distributions, was intriguing. To further probe individual strategy variation, we next turned to a community detection algorithm based on graph theory. This data-driven approach explored the possibility that there are naturally occurring groups of individuals who vary their strategies the same way under changing task demands. The algorithm we applied required a complete strategy profile for each individual (e.g., would not accommodate participants with missing strategy data on one or more tasks), and fails for cases where the identical strategy is reported for every task. Accordingly, subjects with any missing data (N=55) and those who made consistent use of only one strategy (N=12) were excluded from these analyses. Also, we excluded Missing Item strategy data from these analyses in order to focus on variation in strategy use within more proximate tasks. As in the analyses above, we maintained the strategy grouping of None and Concentrate, and excluded participants who did not understand the task. For this analysis, we also added the Checklist strategy to the Other category, as it was very sparsely reported in tasks other than the Missing Item task. After these exclusions, a total of 153 participants remained. Strategy data from these participants was submitted to network analysis, which revealed a highly modular and robust network with 3 distinct subgroups (Q = .58; threshold=.55).

To better understand the results of this grouping algorithm, and to explore what factors may have driven the robust groupings, we examined how participants in each group employed different strategies across tasks. This examination revealed striking differences in strategy use between groups, as shown in Figure 4 (the “Semantics” and “Imagery” strategies almost no variation by group, and for simplicity, these strategies are not shown in the figure). These groupings were driven primarily by the tasks for which Rehearsal (the most-reported strategy across all groups and tasks) was less often reported, as well as by the predominating alternate strategies for those who did not employ Rehearsal. For instance, no participants in Group 1 employed Rehearsal in the Running Memory Span task; instead, they generally reported Concentrate or Other for that task. Meanwhile, fewer members of Group 2 reported using Rehearsal during Item Recognition; both relative to their use of this strategy on other tasks and compared to the use of rehearsal by the other two groups during Item Recognition. As Group 1 did for Running Memory Span, Group 2 tended to report Concentrate or Other as the primary alternative strategy during Item Recognition. Group 3 were less frequent “rehearsers” during Free Recall, during which they tended to make greater use of the Familiarity, Other, and Concentrate strategies.

Figure 4.

Figure 4

Charts displaying the relative proportion of reported strategy use across tasks and divided by subtype (Group). Group 1: N=60; Group 2: N=32; Group 3: N=53. ISR = Immediate Serial Recall; DSR = Delayed Serial Recall; CWMS = Complex Working Memory Span; IR = Item Recognition; FR = Free Recall; RMS = Running Memory Span.

The correlation heat map presented in Table 4 provides another useful visualization for understanding the subtyping analysis. It is clear from this table that Group 1 differs most greatly from Groups 2 and 3 on Running Memory Span, while Group 2 differs on Item Recognition and Group 3 differs on Free Recall.

Table 4.

Correlations in strategy use between subtyping groups for each of the tasks.

graphic file with name nihms774754t1.jpg

Note. Darker cells represent more negative correlations while lighter cells represent strong positive correlations. ISR = Immediate Serial Recall; DSR = Delayed Serial Recall; CWMS = Complex Working Memory Span; IR = Item Recognition; FR = Free Recall; RMS = Running Memory Span.

Discussion

Various factors drive investigators to make use of a given WM task – the prevalence of use in the prior literature associated with the phenomenon under investigation, emphasis on specific aspects of WM (e.g. executive vs. domain-specific storage), the degree to which the task is amenable to experimental constraints or suited to the population of interest, the amount of time it takes to obtain reliable data from the task, etc. These factors all relate to the appropriateness of the paradigm to the research question under investigation. However, given the very wide use of WM assessment tasks, surprisingly little is known about how specific task demands actually affect the strategies that participants choose. Without knowing about how task properties affect strategy use, it is difficult (if not impossible) to actually determine the appropriateness of a given paradigm.

The broad goal of the present study was to initiate a more extensive investigation of the relationship between WM tasks and strategy use. One of our specific aims was to simply characterize the range of strategies employed during tasks common to the WM literature. We investigated the hypothesis that even modest changes in task parameters (e.g., recall versus recognition) could lead to different strategy allocation by participants. Our analyses revealed that strategy distributions differed significantly across most, but not all, tasks. Further analyses showed that some parameters (an ordering demand, updating requirement, or distraction) appear to affect the distribution of strategy use, while others (presence/absence of a delay, length of the memory list) do not, at least within the boundary conditions that we investigated.

The finding that find that strategy use is heterogeneous, even within a given WM task, is consistent with the extant literature on strategy use during working memory tasks (Logie et al., 1996; Hertzog, McGuire, & Lineweaver, 1998). However, while others have found performance to be contingent on the choice of strategy (Bailey et al., 2011; Dunlosky & Kane, 2007; Friedman & Miyake, 2004), we did not find that performance within any of the tasks varied as a function of strategy use. The lack of a robust influence of strategy on performance may be related to the small number of trials that were included per task in the present study. While we prioritized inclusion of several tasks over inclusion of a larger number of trials per task, in future studies inclusion of a greater number of trials per task may increase the power to replicate the performance differences observed in earlier studies.

The heterogeneity in strategy choice that we observed has important implications when trying to draw inferences across studies using disparate WM tasks. Sometimes assuming common underlying strategies may be justified, but other times these assumptions may not be valid. For instance, the present results endorse the conclusion that participants employ a similar range of strategies in an Immediate Serial Recall task and a Delayed Serial Recall task, and, accordingly, make a stronger case for directly comparing performance obtained from these two tasks. In contrast, it seems that ordering demands are a strong determinant of strategy choice, and hence, comparisons between immediate serial recall and item recognition, for example, may not be well justified.

Equally profound are the implications of these findings as they pertain to studies using neuro-investigative (e.g., fMRI, EEG) methods. The implications are twofold: First, if different WM tasks are associated with unique strategy distributions, and by inference, unique neural substrates, then researchers need to use caution in comparing activation patterns evoked from a given task to those evoked from a different task, despite both being putative measures of WM. Second, if participants in a study sample use heterogeneous strategies in a single task, then the neural processes they engage while completing that task should also be expected to be heterogeneous, thus adding substantial noise to already noisy methods. As a safeguard of sorts, researchers may want to probe their participants to understand which strategies they are using, and examine group differences in brain activation based on strategy use, provided that the sample is large enough.

Our subtyping analysis also affords novel insights. This analysis suggests that within any study sample, it is likely that robust groups of participants will approach the same tasks differently. These findings are particularly interesting because they expose particular contexts in which participants will stray from the predominating rehearsal strategy. For example, based on the present results, it can be expected that a subgroup of participants will approach Free Recall differently from the majority of other participants, responding based on what is familiar or by concentrating on the words, rather than engaging in effortful rehearsal. Once again, this is an important finding not only with respect to behavioral research, but also for neuro-investigative studies. For instance, we might expect that activation profiles will be similar for members of the same strategy group, but disparate for members of opposing groups.

To take the potential implications one step further, the great degree of heterogeneity we observe in strategy selection – across task demands, within individuals (across tasks), and between groups of individuals – raises questions about the very existence of a common underlying working memory “system” or resource. Indeed, the results lend some support to an emerging perspective that abandons the notion that working memory functions as a singular cognitive system that is recruited whole-cloth in support of performance, and assumes instead that the specific processes marshaled in a given task environment are entirely a function of the particular materials that need to be processed, the exact storage and output requirements of the task, and the particular skills/knowledge possessed by the participant (e.g., Macken, Taylor, & Jones, 2015).

While the present experiment provides an important step toward understanding the strategies that participants use in various WM tasks, there are several limitations that need to be acknowledged. Foremost, in designing our tasks we needed to control task parameters so that: 1) tasks would be easy to administer to large groups of participants, and 2) we could make comparisons across tasks that differed according to specific parameters. In some cases, this led the tasks to diverge slightly from similar tasks used in past literature. This was true for the free recall task, which in atypical fashion used repeated presentations of the same pool of items, and for the delayed serial recall task, which used a larger stimulus set (9 items) than is the mode. A particularly relevant deviation, we think, occurred with the Operation Span task. Operation Span tasks are usually subject-paced, with new memory items given directly after the solution to the intervening math problem is given. However, our Operation Span task could not be subject-paced because we administered the task to groups of participants who may have varied in their speed at solving math problems. We therefore opted to allow participants a fixed amount of time to solve the math problem. From the perspective of understanding strategy selection, this could have been problematic because those who solved the math problems more quickly (i.e. early in the allotted time period) would have had extra time to engage in active strategy selection, and conversely, participants who did not have enough time to solve the math problem may have been forced to adopt passive strategies. Future studies testing participants individually may benefit from using subject-specific timing for Operation Span and other tasks.

Yet another issue of note is related to the stimulus sets we deployed. We used partially “open” pools of stimuli – i.e., we varied the specific subset of words presented from one trial to the next within a task. This practice can be contrasted with studies using “closed” sets of items (i.e.,using the same limited set of words on each trial of a task) to isolate short-term processes from semantic and long term-memory phenomena (see Baddeley, 2012). While the use of open vs. closed sets might impact strategy choice in some contexts, our chi-square analysis (see Table 2) showed that strategy distributions for tasks where the stimulus pool was more open (tasks where only 6 items out of 10 possible items were sampled across trials; ISR, IR) did not significantly differ from those using a more closed set (tasks where 9 of 10 possible items were sampled across trials; DSR, FR). Hence, where evidence is available in the present study the patterns don’t indicate that variation in strategy choice was a function of the degree to which stimulus sets were open or closed.

Another limitation of the present study stems from reliance on a self-reported strategy questionnaire. Despite the detailed description of strategies that we provided, it is likely that there was variability in participants’ interpretations of the subjective and introspective experiences associated with a given strategy, and in their interpretations of the specific language of the questionnaire in relation to those subjective experiences. Since we only have their final self-reports, we cannot readily assess the degree to which subjects reliably understood the described strategies and whether they uniformly associated these descriptions with the strategic approaches we intended to index. Nevertheless, the higher rate of use of the Checklist strategy in association with the Missing Item task does provide some evidence that participants were indeed trying to make appropriate use of the questionnaire. A somewhat related concern is that our strategy labels (and descriptions) may have been too coarse to capture nuanced, but important, differences within strategic categories. For example, Lehmann and Hasselhorn (2007) have made the argument that within the broad category of “rehearsal” there may be a distinction between “labeling” (merely saying the word), “single-word rehearsal” (rehearsing each word one at a time), and “cumulative rehearsal” (rehearsing at least two words in succession). As rehearsal was our most commonly reported strategy, it is possible that participants were actually engaging in different types of rehearsal. These distinctions may have been helpful in understanding the strategies employed by individuals, especially in the subtyping analysis.

Additionally, it is important to point out that participants may have been biased in their reporting after seeing the strategy list for the first time. Indeed, having been exposed to the questionnaire after completion of the first task may have encouraged participants to select strategies that they would not have considered had they not seen the option on the questionnaire. Yet another possible source of bias in reporting may have emerged because participants completed several WM tasks within a single session, and experience with the demands of one task may have influenced subsequent tasks. These sources of bias were partially attenuated by counterbalancing task order. To further evaluate concerns about how the chronology of the tasks within a session may influence strategy reports, for each task separately, we compared the distribution of reported strategies in the subsample of participants that started the experiment with a given task to the distribution in those who completed the same task later in the session. We did not find robust differences within any of the tasks according to the task’s relative position in the session, making it somewhat less likely that factors associated with the order of tasks within a session influenced the strategy reports.

Some additional caveats we should acknowledge are tied to the use of Chi-square tests and the assumptions associated with the conduct of such tests. Chi-square tests assume that frequencies across cells are independent of one another, and this assumption was not met in the analyses that considered more than one task per participant (because a given participants’ responses were included in multiple cells). Luckily, the fact that strategy distributions were similar for those participants who completed a given task at the start of the session (with no violation of the independence assumption) to the entire session data suggests that the violation of this assumption did not strongly impact the outcomes. However, analysis of only the data from the first task encountered dramatically reduces statistical power and reduces the expected count of several cells below 5 occurrences, which violates another common rule in application of the Chi-square statistic. Therefore, in the present study, it is difficult to accommodate the assumption of independence while also meeting guidelines for expected cell counts. While the Chi-square results should be interpreted with these limitations in mind, concerns regarding these assumptions are, to some extent, lessened by the addition of the subtyping analyses, which also supported the conclusion that strategy differs between individuals in the same task and within an individual between tasks.

While there are some clear caveats to be considered, this is the first study to explicitly probe strategy use across a variety of WM tasks and to describe some differences in strategy use across tasks and across individuals. We hope that these data will serve as a useful reference to those wishing to understand and predict strategy variation across these tasks, and that the findings will also encourage more in depth contemplation of the implications of differences in strategy use across WM tasks. Ultimately, the findings suggest a need to be more circumspect in the selection of particular WM tasks for use in research, more thoughtful in the interpretation of results emerging from studies using these tasks, and more careful in drawing comparisons across studies using different tasks.

Supplementary Material

13421_2016_608_MOESM1_ESM

Acknowledgments

This project was supported in part by funding from the National Institute on Drug Abuse, NIH (DA031436) and the Office of Naval Research (ONR N00014-11-1-0399). We would like to thank the many research assistants and students who assisted with execution of the study, especially Naomi Streeter, Danielle Adinolfi and Suwanthna Reel.

Footnotes

1

In Chein & Morrison, (2010) participants engaged in complex span tasks for a total of ten hours. Strategy use was not discussed with participants during training, but following training participants were asked to detail the strategies they engaged. Participants’ time-on-task was substantially more than a typical single session study of WM, and they reported a rich and varied set of strategies that were very informative in the design of the present study.

2

We ran the analyses both grouping “none” with “other” and excluding “none” and the results did not qualitatively change.

References

  1. Baddeley A. Working memory: Looking back and looking forward. Nature Reviews Neuroscience. 2003;4(10):829–839. doi: 10.1038/nrn1201. [DOI] [PubMed] [Google Scholar]
  2. Baddeley A. Working memory: theories, models, and controversies. Annual review of psychology. 2012;63:1–29. doi: 10.1146/annurev-psych-120710-100422. [DOI] [PubMed] [Google Scholar]
  3. Baddeley A, Hitch GJ. Working Memory. The psychology of learning and motivation. 1974;8:47–89. [Google Scholar]
  4. Bailey H, Dunlosky J, Kane MJ. Why does working memory span predict complex cognition? Testing the strategy affordance hypothesis. Mem Cognit. 2008;36(8):1383–1390. doi: 10.3758/MC.36.8.1383. [DOI] [PubMed] [Google Scholar]
  5. Bailey H, Dunlosky J, Kane MJ. Contribution of strategy use to performance on complex and simple span tasks. Mem Cognit. 2011;39(3):447–461. doi: 10.3758/s13421-010-0034-3. [DOI] [PubMed] [Google Scholar]
  6. Beaman CP, Jones DM. Role of serial order in the irrelevant speech effect: Tests of the changing-state hypothesis. Journal of Experimental Psychology-Learning Memory and Cognition. 1997;23(2):459–471. [Google Scholar]
  7. Beaman CP, Jones DM. Irrelevant sound disrupts order information in free recall as in serial recall. Quarterly Journal of Experimental Psychology Section a-Human Experimental Psychology. 1998;51(3):615–636. doi: 10.1080/713755774. [DOI] [PubMed] [Google Scholar]
  8. Bor D, Cumming N, Scott CE, Owen AM. Prefrontal cortical involvement in verbal encoding strategies. Eur J Neurosci. 2004;19(12):3365–3370. doi: 10.1111/j.1460-9568.2004.03438.x. [DOI] [PubMed] [Google Scholar]
  9. Bor D, Owen AM. A common prefrontal-parietal network for mnemonic and mathematical recoding strategies within working memory. Cerebral Cortex. 2007;17(4):778–786. doi: 10.1093/cercor/bhk035. [DOI] [PubMed] [Google Scholar]
  10. Broadway JM, Engle RW. Validating running memory span: Measurement of working memory capacity and links with fluid intelligence. Behav Res Methods. 2010;42(2):563–570. doi: 10.3758/BRM.42.2.563. [DOI] [PubMed] [Google Scholar]
  11. Bunting M, Cowan N, Saults JS. How does running memory span work? Q J Exp Psychol (Hove) 2006;59(10):1691–1700. doi: 10.1080/17470210600848402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Buschke H. Relative Retention in Immediate Memory Determined by the Missing Scan Method. Nature. 1963;200:1129–1130. doi: 10.1038/2001129b0. [DOI] [PubMed] [Google Scholar]
  13. Carretti B, Borella E, De Beni R. Does strategic memory training improve the working memory performance of younger and older adults? Exp Psychol. 2007;54(4):311–320. doi: 10.1027/1618-3169.54.4.311. [DOI] [PubMed] [Google Scholar]
  14. Chein JM, Fiez JA. Dissociation of verbal working memory system components using a delayed serial recall task. Cerebral Cortex. 2001;11(11):1003–1014. doi: 10.1093/cercor/11.11.1003. [DOI] [PubMed] [Google Scholar]
  15. Chein JM, Moore AB, Conway ARA. Domain-general mechanisms of complex working memory span. Neuroimage. 2011;54(1):550–559. doi: 10.1016/j.neuroimage.2010.07.067. [DOI] [PubMed] [Google Scholar]
  16. Chein JM, Morrison AB. Expanding the mind’s workspace: Training and transfer effects with a complex working memory span task. Psychon Bull Rev. 2010;17(2):193–199. doi: 10.3758/PBR.17.2.193. [DOI] [PubMed] [Google Scholar]
  17. Conway AR, Kane MJ, Bunting MF, Hambrick DZ, Wilhelm O, Engle RW. Working memory span tasks: A methodological review and user’s guide. Psychon Bull Rev. 2005;12(5):769–786. doi: 10.3758/bf03196772. [DOI] [PubMed] [Google Scholar]
  18. Cowan N. The magical number 4 in short-term memory: a reconsideration of mental storage capacity. Behav Brain Sci. 2001;24(1):87–114. doi: 10.1017/s0140525x01003922. discussion 114–185. [DOI] [PubMed] [Google Scholar]
  19. Cowan N, Elliott EM, Scott Saults J, Morey CC, Mattox S, Hismjatullina A, Conway AR. On the capacity of attention: its estimation and its role in working memory and cognitive aptitudes. Cogn Psychol. 2005;51(1):42–100. doi: 10.1016/j.cogpsych.2004.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. D’Esposito M, Postle BR. The cognitive neuroscience of working memory. Annual Review of Psychology. 2015;66:115–142. doi: 10.1146/annurev-psych-010814-015031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Daneman M, Carpenter PA. Individual-Differences in Working Memory and Reading. Journal of Verbal Learning and Verbal Behavior. 1980;19(4):450–466. [Google Scholar]
  22. Dunlosky J, Kane MJ. The contributions of strategy use to working memory span: A comparison of strategy assessment methods. Q J Exp Psychol (Hove) 2007;60(9):1227–1245. doi: 10.1080/17470210600926075. [DOI] [PubMed] [Google Scholar]
  23. Engle RW, Cantor J, Carullo JJ. Individual-Differences in Working Memory and Comprehension - a Test of 4 Hypotheses. Journal of Experimental Psychology-Learning Memory and Cognition. 1992;18(5):972–992. doi: 10.1037//0278-7393.18.5.972. [DOI] [PubMed] [Google Scholar]
  24. Engle RW, Tuholski SW, Laughlin JE, Conway AR. Working memory, short-term memory, and general fluid intelligence: a latent-variable approach. J Exp Psychol Gen. 1999;128(3):309–331. doi: 10.1037//0096-3445.128.3.309. [DOI] [PubMed] [Google Scholar]
  25. Farrell S. Mixed-list phonological similarity effects in delayed serial recall. Journal of Memory and Language. 2006;55(4):587–600. [Google Scholar]
  26. Friedman NP, Miyake A. The reading span test and its predictive power for reading comprehension ability. Journal of Memory and Language. 2004;51(1):136–158. [Google Scholar]
  27. Henson RNA, Shallice T, Gorno-Tempini ML, Dolan RJ. Face repetition effects in implicit and explicit memory tests as measured by fMRI. Cerebral Cortex. 2002;12(2):178–186. doi: 10.1093/cercor/12.2.178. [DOI] [PubMed] [Google Scholar]
  28. Hertzog C, McGuire CL, Lineweaver TT. Aging, attributions, perceived control, and strategy use in a free recall task. Aging Neuropsychology and Cognition. 1998;5(2):85–106. [Google Scholar]
  29. Hurlstone MJ, Hitch GJ, Baddeley AD. Memory for serial order across domains: An overview of the literature and directions for future research. Psychol Bull. 2014;140(2):339–373. doi: 10.1037/a0034221. [DOI] [PubMed] [Google Scholar]
  30. Jones DM, Macken WJ. Irrelevant Tones Produce an Irrelevant Speech Effect -Implications for Phonological Coding in Working Memory. Journal of Experimental Psychology-Learning Memory and Cognition. 1993;19(2):369–381. [Google Scholar]
  31. Jonides J, Lewis RL, Nee DE, Lustig C, Berman MG, Moore KS. The mind and brain of short-term memory. Annual Review of Psychology. 2008;59:193–224. doi: 10.1146/annurev.psych.59.103006.093615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Kane MJ, Engle RW. Working-memory capacity and the control of attention: The contributions of goal neglect, response competition, and task set to Stroop interference. Journal of Experimental Psychology-General. 2003;132(1):47–70. doi: 10.1037/0096-3445.132.1.47. [DOI] [PubMed] [Google Scholar]
  33. Kane MJ, Hambrick DZ, Tuholski SW, Wilhelm O, Payne TW, Engle RW. The generality of working memory capacity: a latent-variable approach to verbal and visuospatial memory span and reasoning. J Exp Psychol Gen. 2004;133(2):189–217. doi: 10.1037/0096-3445.133.2.189. [DOI] [PubMed] [Google Scholar]
  34. Lehmann M, Hasselhorn M. Variable memory strategy use in children’s adaptive intratask learning behavior: developmental changes and working memory influences in free recall. Child Dev. 2007;78(4):1068–1082. doi: 10.1111/j.1467-8624.2007.01053.x. [DOI] [PubMed] [Google Scholar]
  35. Logie RH. Spatial and visual working memory: A mental workspace. Psychology of Learning and Motivation: Advances in Research and Theory: Cognitve Vision, Vol 42. 2003;42:37–78. [Google Scholar]
  36. Logie RH, DellaSala S, Laiacona M, Chalmers P, Wynn V. Group aggregates and individual reliability: The case of verbal short-term memory. Mem Cognit. 1996;24(3):305–321. doi: 10.3758/bf03213295. [DOI] [PubMed] [Google Scholar]
  37. Luck SJ, Vogel EK. Visual working memory capacity: from psychophysics and neurobiology to individual differences. Trends in Cognitive Sciences. 2013;17(8):391–400. doi: 10.1016/j.tics.2013.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Macken B, Taylor J, Jones D. Limitless capacity: a dynamic object-oriented approach to short-term memory. Frontiers in psychology. 2015;6 doi: 10.3389/fpsyg.2015.00293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Marsh JE, Sorqvist P, Hodgetts HM, Beaman CP, Jones DM. Distraction Control Processes in Free Recall: Benefits and Costs to Performance. Journal of Experimental Psychology-Learning Memory and Cognition. 2015;41(1):118–133. doi: 10.1037/a0037779. [DOI] [PubMed] [Google Scholar]
  40. McNamara DS, Scott JL. Working memory capacity and strategy use. Mem Cognit. 2001;29(1):10–17. doi: 10.3758/bf03195736. [DOI] [PubMed] [Google Scholar]
  41. Morrison AB, Conway ARA, Chein JM. Primacy and recency effects as indices of the focus of attention. Frontiers in Human Neuroscience. 2014;8 doi: 10.3389/fnhum.2014.00006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Nee DE, Brown JW, Askren MK, Berman MG, Demiralp E, Krawitz A, Jonides J. A Meta-analysis of Executive Components of Working Memory. Cerebral Cortex. 2013;23(2):264–282. doi: 10.1093/cercor/bhs007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Neely CB, LeCompte DC. The importance of semantic similarity to the irrelevant speech effect. Mem Cognit. 1999;27(1):37–44. doi: 10.3758/bf03201211. [DOI] [PubMed] [Google Scholar]
  44. Oberauer K. Understanding serial position curves in short-term recognition and recall. Journal of Memory and Language. 2003;49(4):469–483. [Google Scholar]
  45. Oberauer K, Lewandowsky S. Forgetting in immediate serial recall: decay, temporal distinctiveness, or interference? Psychol Rev. 2008;115(3):544–576. doi: 10.1037/0033-295X.115.3.544. [DOI] [PubMed] [Google Scholar]
  46. Oberauer K, Lewandowsky S, Farrell S, Jarrold C, Greaves M. Modeling working memory: An interference model of complex span. Psychon Bull Rev. 2012;19(5):779–819. doi: 10.3758/s13423-012-0272-4. [DOI] [PubMed] [Google Scholar]
  47. Pollack I, Johnson LB, Knaff PR. Running Memory Span. Journal of Experimental Psychology. 1959;57(3):137–146. doi: 10.1037/h0046137. [DOI] [PubMed] [Google Scholar]
  48. Redick TS, Broadway JM, Meier ME, Kuriakose PS, Unsworth N, Kane MJ, Engle RW. Measuring Working Memory Capacity With Automated Complex Span Tasks. European Journal of Psychological Assessment. 2012;28(3):164–171. [Google Scholar]
  49. Richardson JTE. The availability and effectiveness of reported mediators in associative learning: A historical review and an experimental investigation. Psychon Bull Rev. 1998;5(4):597–614. [Google Scholar]
  50. Ricker TJ, Cowan N. Differences Between Presentation Methods in Working Memory Procedures: A Matter of Working Memory Consolidation. Journal of Experimental Psychology-Learning Memory and Cognition. 2014;40(2):417–428. doi: 10.1037/a0034301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Rubinov M, Sporns O. Complex network measures of brain connectivity: Uses and interpretations. Neuroimage. 2010;52(3):1059–1069. doi: 10.1016/j.neuroimage.2009.10.003. [DOI] [PubMed] [Google Scholar]
  52. St Clair-Thompson H, Stevens R, Hunt A, Bolder E. Improving children’s working memory and classroom performance. Educational Psychology. 2010;30(2):203–219. [Google Scholar]
  53. Turley-Ames KJ, Whitfield MM. Strategy training and working memory task performance. Journal of Memory and Language. 2003;49(4):446–468. [Google Scholar]
  54. Unsworth N, Fukuda K, Awh E, Vogel EK. Working memory and fluid intelligence: Capacity, attention control, and secondary memory retrieval. Cognitive Psychology. 2014;71:1–26. doi: 10.1016/j.cogpsych.2014.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Unsworth N, Heitz RP, Schrock JC, Engle RW. An automated version of the operation span task. Behav Res Methods. 2005;37(3):498–505. doi: 10.3758/bf03192720. [DOI] [PubMed] [Google Scholar]
  56. Unsworth N, Spillers GJ, Brewer GA. The contributions of primary and secondary memory to working memory capacity: an individual differences analysis of immediate free recall. J Exp Psychol Learn Mem Cogn. 2010;36(1):240–247. doi: 10.1037/a0017739. [DOI] [PubMed] [Google Scholar]
  57. Wager TD, Smith EE. Neuroimaging studies of working memory: A meta-analysis. Cognitive Affective & Behavioral Neuroscience. 2003;3(4):255–274. doi: 10.3758/cabn.3.4.255. [DOI] [PubMed] [Google Scholar]
  58. Wilson MD. The MRC Psycholinguistic Database: Machine Readable Dictionary. Behavioural Research Methods, Instruments and Computers. 1988;20:6–11. [Google Scholar]
  59. Zanto TP, Clapp WC, Rubens MT, Karlsson J, Gazzaley A. Expectations of Task Demands Dissociate Working Memory and Long-Term Memory Systems. Cereb. Cortex. 2016;26(3):1176–1186. doi: 10.1093/cercor/bhu307. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

13421_2016_608_MOESM1_ESM

RESOURCES