The Central Executive as a Search Process: Priming Exploration and Exploitation across Domains

Thomas T Hills; Peter M Todd; Robert L Goldstone

doi:10.1037/a0020666

. Author manuscript; available in PMC: 2011 Nov 1.

Published in final edited form as: J Exp Psychol Gen. 2010 Nov;139(4):590–609. doi: 10.1037/a0020666

The Central Executive as a Search Process: Priming Exploration and Exploitation across Domains

Thomas T Hills ¹, Peter M Todd ^2,^3,⁴, Robert L Goldstone ^2,³

PMCID: PMC2974337 NIHMSID: NIHMS222239 PMID: 21038983

Abstract

The trade-off between exploration and exploitation is common to a wide variety of problems involving search in space and mind. The prevalence of this trade-off and its neurological underpinnings led us to propose domain-general cognitive search processes (Hills, Todd, & Goldstone, 2008). Here, we propose further that these are consistent with the idea of a central executive search process that combines goal-handling across subgoal hierarchies. The present study investigates three aspects of this proposal. First, the existence of a unitary central executive search process should allow priming from one search task to another, and at multiple hierarchical levels. We confirm this by showing cross-domain priming from a spatial search task to two different cognitive levels within a lexical search task. Second, given the neural basis of the proposed generalized cognitive search process and the evidence that the central executive is primarily engaged during complex tasks, we hypothesize that priming should require ‘search’ in the sense of a self-regulated making and testing of sequential predictions about the world. This was confirmed by showing that when participants were allowed to collect spatial resources without searching for them, no priming occurred. Finally, we provide a mechanism for the underlying search process and investigate three alternative hypotheses for subgoal hierarchies using the Central Executive as a Search Process model (CESP). CESP envisions the central executive as having both emergent and unitary processes, with one of its roles being a generalized cognitive search process that navigates goal hierarchies by mediating persistence on and switching between subgoals.

Keywords: working memory, central executive, foraging, search, priming, subgoal hierarchy

Problem solving is often characterized as a search process, involving a trade-off between exploiting old solutions and exploring new ones. Such trade-offs influence a wide range of everyday decisions. Problems like buying and selling stocks, staying in or abandoning relationships, gambling further or cutting your losses, and having your regular lunch or trying “the special” are all examples of a decision between persisting with past actions or trying something different. Problems such as these are similar to animal foraging problems, in that they require a search process that can resolve the exploration-exploitation trade-off—choosing between staying in, and hence exploiting, locations where resources have been found in the past or switching to other locations, where the quality and quantity of resources may be less predictable. They also have much in common with characterizations of central executive processing, whose goal-handling and attention-focusing properties have been a keystone in the working memory literature (Baddeley, 1996; Miyake & Shah, 1999; Hazy, Frank, & O'Reilly, 2006). Tasks that tap into executive processing, like the Wisconsin Card Sorting Task and the Verbal Fluency Task, do so in part by inviting participants to mediate this trade-off between persisting with one line of thinking or switching to another (e.g., Berg, 1948; Troyer, Moscovitch, Winocur, Alexander, & Stuss, 1998). As with so many problems in our everyday experience, finding a good solution involves being able to do both.

The widespread discussion of the exploration-exploitation trade-off in numerous fields reflects an interesting opposition in the cognition literature. On the one hand, specialized mechanisms have been hypothesized for search, problem solving, and decision-making in a variety of domains, following arguments regarding the advantages of specifically tuned over general purpose designs. This has further led to suggestions that the mind incorporates numerous autonomous and domain-specific neural modules (Cosmides & Tooby, 1994; Barrett & Kurzban, 2006; also see Samuels, 1998; Fodor, 1983), or tools in an “adaptive toolbox” (Gigerenzer, Todd, & the ABC Research Group, 1999), each of which is designed to handle a specific class of problems. An extended version of this argument is that the mind lacks a central executive, and that cognitive control is principally an emergent process of competing subsystems, with conflict resolution in the absence of a metacognitive control system, as in, for example, Barnard's (1985) model of Interacting Cognitive Subsystems and Anderson's (1993) ACT-R framework.

On the other hand, the underlying structure of many problems, regardless of domain, can hinge on a decision of when to maintain or shift strategies and attentional resources (e.g., Robbins, 1952). This suggests the usefulness of a domain-general persist-or-switch process, which mediates the needed trade-off between exploitation and exploration. Evidence is accumulating that a central executive in working memory may do exactly this, mediating the focus of attention between competing goals in complex tasks (Hasher & Zacks, 1988; Baddeley, 1996; Engle & Kane, 2004). As we describe below, there is also evidence that some neural capacities are devoted to this generic goal maintenance and attention shifting across domains, and may in fact represent a general purpose search architecture. Though domain-specific mechanisms certainly exist, solving search-related problems—where finding solutions may require aborting non-self-terminating subgoals—may often be a matter of supplementing a domain-general search process with domain-specific information.

In previous work, we found evidence supporting the existence of one such generalized cognitive search process. We did this by showing that experience with a visuo-spatial search task could alter search behavior in a lexical search task, possibly by priming a domain general search process (Hills, Todd, & Goldstone, 2008). Participants were first asked to find as many resource tokens as possible by actively searching in a virtual world. The tokens were distributed either diffusely or in clumps, with the consequence that participants gave up quickly on a given area and moved on, or perseverated on local resource patches, respectively. Following this spatial search, participants were asked to form a total of 30 words over a series of letter sets, using one letter set at a time. When they felt that they had sufficiently exhausted the word possibilities of one letter set, they could move to another letter set. Participants who experienced clustered spatial resources stayed searching longer in each letter set than participants who experienced diffuse spatial resources. The results indicated that tendencies acquired in one task to explore versus exploit an environment spontaneously transferred to a superficially dissimilar task.

In this paper, we provide a mechanism accounting for the generality of the cognitive search process described above, and we do so by combining evidence for a central executive process for goal persistence with a model of problem solving using subgoal hierarchies. As we describe in more detail below, a search architecture based on these two elements combines the goal-handling abilities of the central executive with our understanding of the shared ecological and neural basis of a generalized cognitive search process. By this account, the central executive is responsible for altering subgoals in response to either changing task demands or diminishing returns on subgoal performance. Thus, our notion of the central executive attempts to extend our understanding of working memory by pointing out that many of the everyday problems we contend with share a common abstract structure: one that involves knowing when to abandon one action sequence and initiate another.

In what follows, we first provide the theoretical basis for conceiving of the central executive as a search process. We do this by combining two lines of evidence. The first is the biological evidence for a common underlying search process in the brain, which in turn supports a unitary, domain-general central executive and also provides a putative mechanism for its role as a process of subgoal persistence in cognitive control. The second line of evidence, following from the first, indicates how the central executive mediates goal persistence via inhibition and adaptive transitions of attention. Following this, we present two experiments that test predictions regarding priming at multiple hierarchical levels within a single task, and the necessity of “search” as opposed to just “movement” for priming behavior across domains. Search is defined here specifically as the self-regulation of transitions between exploratory and exploitative actions to achieve higher-level goals. Finally, we formalize a model of the central executive as a search process (CESP), and test three alternative hypotheses regarding how the navigation of subgoal hierarchies can produce the observed human behavior patterns.

The Biological Basis of a Domain-general Search Process

“Regulating the balance between exploitation and exploration is a fundamental need for adaptive behavior in a complex and changing world” (Cohen, McClure, & Yu, 2007, p. 934). This statement summarizes what we feel is one of the most important selective forces operating in the evolution of cognition. It encompasses both the ability to detect—via exploration—the resource contingencies available in different environments, and then to pursue—via exploitation—effective behavioral strategies that take advantage of those contingencies. This regulatory process is so central to adaptive success that it is found at both evolutionary and ecological (i.e., cognitive) time scales (Badyaev, 2005; Bedau & Packard, 2003; Baldwin, 1896; Stephens, 1991).

Taking a comparative approach to investigating the biological origins of the exploitation/exploration trade-off in cognitive control, Hills (2006) presented evidence for an early evolutionary origin of the neuromolecular architectures involved in this control. He also showed that, in humans, this modulation appears to have taken on a domain-general form that is common across many tasks, controlling both external and internal search.

The evidence for an early evolutionary origin stems from the fact that the relevant neural architectures controlling food- and foraging-related behaviors almost universally involve dopaminergic modulations of glutamatergic synaptic pathways, which are shared across a wide variety of species, ranging from nematodes to mammals (e.g., Hills, Brockie, & Maricq, 2004; Due, Jing, & Klaudiusz, 2004; Baumann, Dames, Kühnel, & Walz, 2002; Anderson, Braestrup, & Randrup, 1975; Szczypka et al., 1999). The domain generality of dopaminergic mechanisms in executive processing in humans is supported by their involvement in both spatial and more abstract goal-directed cognition (Salas, Broglio, & Rodriquez, 2003; Houk, Davis, & Beiser, 1995; Reiner, Medina, & Veenman, 1998), numerous pathologies of goal-directed behavior (see Nieoullon, 2002; Hills, 2006), and strong evidence that deficits in dopaminergic processing account for general age-related deficits in executive function (e.g., Volkow, Gur, Wang, Fowler, Moberg, et al., 1998; Bäckman & Farde, 2005).

The fact that dopamine has often been called a “reward” or “novelty” detector is an indication of the field's appreciation of this generality. Novelty and rewards are domain-general classifiers. They are so general that the dopaminergic signals that indicate them can be reassigned from unconditioned to conditioned stimuli (Ljungberg, Apicella, & Schultz, 1992)—neutral stimuli can come to activate reward detectors by being associated with rewards. Thus, detecting reward contingencies can mean detecting different kinds of opportunities for exploiting available resources.

Support for the domain generality of dopaminergic processing also stems from the antagonistic role of tonic D1and phasic D2 dopamine (DA) receptor-mediated effects (Grace, 1991; Grace, 2000; Cohen, Braver, & Brown, 2002; Goto, Otani, & Grace, 2007). The current hypothesis is that while phasic bursts of DA update localized contents of working memory by gating new information to the prefrontal cortex, tonic dopaminergic activity is a more global inhibitor of phasic activity, down-regulating sensitivity to competing stimuli. Changes in tonic activity also occur over a slower-time scale (Schultz, 2007). This has the consequence that tonic dopamine levels offer a general mechanism for mediating the tendency to exploit the existing foci of attention, or transfer attention to other potential targets, no matter what the domain.

Equally compelling biological evidence for domain-general regulation of exploration and exploitation comes from the widely distributed cortical projections of the locus coeruleus-norepinephrine (LC-NE) system (Aston-Jones & Cohen, 2005). Like the phasic-tonic account of dopaminergic processing, the LC-NE system operates along a similar principle of local versus global mediation of attentional focus. The activation of the LC is considered to follow the course of positive and negative reinforcements from the environment—and to respectively induce exploitative and exploratory behaviors in response (e.g., Cohen et al., 2007; Aston-Jones & Cohen, 2005; McClure, Gilzenrat, & Cohen, 2006).

In summary, the biological arguments for domain-general cognitive search processes stem from at least two separate, but potentially integrated, neuromodulatory systems. Both appear to control processing between exploitative and exploratory behavior, and they may do so by helping to maintain or shift attention between competing task demands. Though there has been limited investigation of the evolutionary origins of the LC-NE system, in the case of dopamine the evidence points predominantly to spatial search as one of its earliest domains of application. Thus, mechanisms devoted to the mediation of exploration and exploitation of goals may have originated to facilitate spatial search behavior and possibly other search behaviors that have similar cognitive requirements. This suggests three possibilities. Either this search control mechanism is now duplicated (exapted) for multiple competing and independent subsystems, or it is instantiated as a single domain-general search process, or both. If either of the latter two is true, then we should be able to prime search strategies across domains, for example from a visuo-spatial domain to a lexical domain, as if we were priming a unitary central executive process.

The Central Executive as a Search Process

Although a diversity of ideas have proliferated around Miller, Galanter, and Pribram's (1960) initial use of the concept of working memory, it is generally accepted that working memory comprises that part of human cognition that handles activation of information and execution of plans related to the achievement of current goals (see Miyake & Shah, 1999). In seminal work, Baddeley and Hitch (1974) argued that this conceptualization of working memory could be divided into two subcomponents: a set of slave systems, which handle the storage of domain-specific information (e.g., the visuo-spatial sketchpad and the phonological loop); and a central executive. Baddely (1986) later proposed that the central executive might be consistent with the functioning of Norman and Shallice's (1986) supervisory attentional system, guiding and updating behavior adaptively, appropriate to the current context. More recently, the central executive has become associated with the cognitive control of attention and with the ability to maintain goals and processing in working memory in the face of interference while handling complex tasks (e.g., Cowan, 2005; Baddeley, 2007; Engle, Kane, & Tuholski, 1999; Oberauer, Süss, Wilhelm, & Sander, 2008).

Our definition of a domain-general cognitive search process is a potential component of the more expansive central executive processes proposed above. The biological evidence suggests a central executive that is involved with domain-general high-level cognitive processing, devoted to mediating attentional processes to achieve goals. We further conceive of this central executive process as searching through high-dimensional problem spaces, because reducing the difference between one's present state and one's goal state often involves a process of dynamically switching between subgoals—involving the exploration and exploitation of alternatives. Furthermore, on complex problems where an executive control of attention is required, the central executive can work to achieve goals by perseverating on subgoals associated with particular actions, and choosing new subgoals when prior subgoals do not lead to adequate progress. We recognize that the central executive as proposed by some may be broader or narrower in purpose than we claim here. Our claim is that the central executive as a general search process, one that searches by navigating subgoal hierarchies, is a minimal functional requirement of the central executive.

Search through subgoal hierarchies

If the central executive instantiates a search process, then it needs an environment over which to forage. One proposed representation of this search environment is in terms of subgoal hierarchies (e.g., Botvinick, 2008; Braver & Bongiolatti, 2002). Subgoal (or action) hierarchies and their corresponding reward structures are appropriate natural environments for searching by means of a cognitive process that mediates between persistence and switching. Miller et al.'s (1960) initial description of a “working memory” was made with respect to precisely these kinds of hierarchically structured plans of action. Moreover, hierarchically structured actions are employed by numerous cognitive architecture models, such as the General Problem Solver (Newell & Simon, 1972), ACT-R (Anderson, 1993), and SOAR (Laird, Newell, & Rosenbloom, 1987). Thus, the central executive in working memory may achieve goals not by searching through literal space, but by searching spaces of subgoals and their associated actions, which in turn facilitate movement in either external or internal domains.

Figure 1 presents one possible subgoal hierarchy for the lexical search task we investigate in Experiments 1 and 2. The aim is to discover English words that can be made from a series of scrambled letter sets, with the primary goal being to find a total of 30 such words across multiple letter sets. The subgoals represent possible methods for achieving the primary goal—including searching within a letter set and switching between letter sets. As with many real world tasks, there are multiple possible trajectories through a tree of subgoals. The role of the search process is to choose subgoals such that an appropriate trajectory can be found to achieve the top-level goal.

One possible subgoal hierarchy for the lexical search task. The primary goal is to find 30 words across multiple letter sets. Branches of the tree represent possible subgoals for activation in working memory. Individuals start with the top level goal and make decisions between alternative subgoals to search the problem space.

Subgoal hierarchies like Figure 1 represent hypotheses about goal structures, through which possible courses of action can be found. Like production rules in ACT-R and SOAR, alternative hierarchies can be tested experimentally to investigate the actual goal structure of the task environment. As we demonstrate in more detail with the CESP model simulation, formalizing and testing alternative hierarchies is a way to uncover the goal spaces utilized to solve a given problem.

Two open questions still remain about the role of the central executive as a search process. First, what is the evidence that it plays a role in mediating switching between subgoals, and second, is the central executive really a unitary domain-general construct, or is it an emergent property of competing subsystems?

The central executive, inhibition, and task switching

“The ability to switch flexibly between tasks is the pinnacle of human cognition and the hallmark of executive control” (Logan, 2004, p. 3). This quote is fundamentally similar to that by Cohen et al. (2007) above, both referring to a cognitive ability to transfer attention flexibly and adaptively in response to task demands. The flip side of this perspective is to say that the pinnacle of human cognition is the ability to maintain attention in the face of distraction—that is, appropriate switching also implies focusing and not switching, at least some of the time. What evidence is there that a central executive mediates the control of attention between competing goals?

Some of the most compelling evidence comes from studies demonstrating that working memory capacity is strongly correlated with the ability to maintain focused attention. In an elegant study of the cocktail party phenomenon, Conway, Cowan, and Bunting (2001) first divided participants into two groups based on their operation span—a test of memory span while performing a distracting task. Participants were then presented with a dichotic listening task in which they were asked to shadow (repeat) words heard in the right ear, while an irrelevant message was played in the left ear. Unbeknownst to participants, the message in the left ear also contained the participant's name. When later asked if they had heard their name, participants with lower operation span were significantly more likely to have heard their own name—being less able to control their attention to avoid distraction—than those with high span.

In a study of behavioral inhibition, Unsworth, Heitz, and Engle (2004) found a similar result when participants were asked to inhibit a visual looking response in an anti-saccade task. Participants with low spans were less able to inhibit looking at the target than participants with high spans. In both the Conway et al. (2001) and the Unsworth et al. (2004) studies, a course of action associated with a specific goal is pitted against actions associated with attending to distracting stimuli—distracting stimuli representing competing subgoals. Maintenance of one subgoal requires not switching to the other. Having a higher working memory span could therefore be associated with greater ability to maintain attention to the items being kept in memory. Numerous studies have directly investigated this sustained-attention/switching hypothesis of the central executive, and have found consistent evidence in support of it (Hasher & Zacks, 1988; Kane & Engle, 2000; Rosen & Engle, 1998; Kane, Bleckley, Conway, & Engle, 2001; Kane & Engle, 2003; Hasher, Lustig, & Zacks, 2008; also see Braver, Gray, & Burgess, 2008 for a similar account based on proactive and reactive control).

In many problems, the ability to regulate the maintenance and transfer of attention between subgoals is critical to successful outcomes. The results just reviewed support the idea that this self-regulated switching process is a key component of the central executive.

Central executive or executive committee

The second problem to be addressed is whether or not the central executive is an emergent property of dynamic, interactive, competing task demands—what Baddeley (1996) called an executive committee—or is it a unitary, domain-general system that serves to modulate attention, regardless of the competing tasks? In other words, is the control of attention the outcome of competition decided only by the opponents (the committee), or decided in part by a referee (the executive)? Both perspectives are well represented in the literature.

Numerous cognitive architecture models produce central executive functions by processing interactions among competing subsystems. For example, the model of Interactive Cognitive Subsystems (Barnard, 1985), ACT-R (Lovett, Reder, & Lebiere, 1999), and the Executive-Process/Interactive-Control model (Kieras, Meyer, Mueller, & Seymour, 1999) all use competition between domain-specific subsystems to determine the outcome of cognitive control—there is no true “central executive.” On the other hand, systems such as the Embedded Processes Model (Cowan, 2005) and Executive Attention Theory (Engle & Kane, 2004; Kane, Conway, Hambrick, & Engle, 2008) explicitly posit a central executive that is shared across tasks to schedule subsystem activation.

Still other proposals, such as O'Reilly and Frank's (2006) Pre-frontal Cortex and Basal Ganglia Model and similar neural based connectionist models (e.g., Braver & Cohen, 1999; Durstewitz, Kelc, & Güntürkün, 1999), sit on the fence with respect to the unitary nature of the central executive. For example, they often assume a shared set of neural subregions (e.g., the thalamus, hippocampus, striatum, and prefrontal cortex) or neuromodulatory control mechanisms (e.g., dopamine) across tasks, but the goals and their associated behaviors compete for activation in the prefrontal cortex without a higher-level arbiter. They also typically remain agnostic as to whether there is a global mechanism for mediating exploration and exploitation across tasks (e.g., see Munakata, Morton, & O'Reilly, 2008; Braver et al., 2008; cf. Rougier, Noelle, Braver, Cohen, & O'Reilly, 2005; also see Daw, O'Doherty, Dayan, Seymour, & Dolan, 2006).

One possible resolution is provided by phasic-tonic accounts of neuromodulation. That is, phasic-tonic dopamine or norepinephrine models of cognitive control are potentially consistent with a shared, domain-general mechanism for mediating exploratory/exploiting behavioral patterns, in a way that integrates the executive and committee views of search control (see Cohen et al., 2002; Aston-Jones & Cohen, 2005). The phasic-tonic accounts provide a means for a self-organized, committee system of goal-goal competition (via local phasic activity) mediated by a domain-general mechanism for increasing or reducing goal perseveration (via tonic activity). To the extent that processes like the DA and NE mechanisms are true global modulators of attentional control, we should be able to observe an effect of exploration/exploitation priming across domain-specific tasks, and at multiple hierarchical levels within those tasks. Experiment 1 examines this possibility.

Experiment 1

The first experiment was designed to investigate priming at multiple levels in a hierarchical problem solving task. Each participant worked through a series of three phases. The first phase was an initial training session with the lexical search task. In the second phase, participants were randomly assigned to either a clustered or diffuse treatment condition of a spatial search task. In the third phase, participants returned to the lexical search task.

In our earlier cross-domain priming study (Hills et al., 2008), the letter sets were presented in a fixed order, which added order effect confounds to the investigation of any priming within letter sets. To overcome this problem—and to establish the robustness of the earlier priming results—each participant in the present experiment was assigned an independently randomized sequence of letter sets.

The abstract lexical search task presented participants with two simultaneous search decisions: how to search for the next word within a letter set, and when to leave one letter set for the next letter set. A wait time of 15 seconds was imposed between the participant's expressed desire (via an on-screen button click) to move to the next letter set and the next letter set's presentation. This prevented participants from simply moving rapidly over multiple sets. By imposing this transition cost or “travel time,” the abstract lexical search problem becomes analogous to the typical patch-search paradigm in the animal foraging literature, in which animals must decide between foraging in one patch or taking the time to travel to the next patch (e.g., Charnov, 1976; Hills & Adler, 2001). In both cases, “travel” (between patches or between letter sets) is costly in the sense that resources cannot be acquired while traveling.

More broadly, the commonality in both our spatial and lexical search tasks is that participants must choose between continuing to try to harvest resources (particular pixels on the screen or words in the letter set) in their current “location” (screen region or letter set) or “moving” to and then checking a new location. See Table 1 for a fuller description of the analogy between the search tasks. The basis for this analogy may seem highly abstract, but if people have a general cognitive search process dedicated to making decisions about whether to exploit a current resource patch or explore for new patches, then they are implicitly making a common decision under both tasks about when to switch the “location” that is currently being searched. In terms of subgoal hierarchies, the common element is the decision to stick with a specific subgoal or to switch between subgoals.

Table 1.

Analogies between spatial and lexical search tasks

Task	Spatial search	Lexical search
Resource	Pixels that change color	Words hidden in letter sets
Resource patches	Resource-filled regions of the screen	Level 1: Sets of letters Level 2: Sources of solutions within a letter set
Cost of transitioning between patches	Time needed to travel between patches	Level 1: 15 seconds between letter sets Level 2: Time to retrieve a letter sample from a new source
Exploration strategy	Move to unexplored screen regions and check for resources	Level 1: Give up on a current letter set and receive a new set Level 2: Give up on current solution source and transfer attention to another
Exploitation strategy	Remain in spatial region	Level 1: Keep trying to find words in current letter set Level 2: Keep using the same solution source

Open in a new tab

We can obtain evidence for a common underlying process governing search across domains by examining the nature of cross-task priming. If participants in the diffuse treatment of the spatial search task are generally primed to explore (i.e., switch between subgoals more often) more than exploit (i.e., switch less often), then we predict that when they are transferred to the lexical search task, they will shift letter sets more frequently (exploring more). Conversely, if participants searching through clustered resource patches in the foraging task are generally primed to exploit resource patches, then we predict that they will shift letter sets relatively infrequently (exploiting each letter set for a longer period of time).

A further prediction of a common search process is that within letter set word transitions will also be affected by the priming. We formalize these predictions explicitly when we present the CESP model (see below), but here we lay out their general predictions. First, it may be that participants only ever sample from the letter set, without regard to their previous solutions. Here the terminal subgoals are to sample from the letter set, compose a test word from the sampled letters, and then try to match the test word to the known lexicon. In this case, similarity between adjacent solutions should not differ between the conditions, because the test word production process always starts from the same location (with the letter set itself). Note that we define the test word as a product of working memory manipulations; it may or may not be a valid solution. In the same way that a chess player explores different possible solutions to a chess problem before actually selecting one, a test word represents a possible solution in the current letter set.

A second hypothesis is that because clustered participants exploit resource patches for longer, they will also exploit previous solutions for longer. If this is true, the terminal subgoals are to modify the previous solution to compose a test word, and then check it against the lexicon. Participants in the clustered treatment should then generate solutions that are more similar to their previous solution than participants in the diffuse treatment.

A third hypothesis is that clustered participants perseverate longer on the process of iteratively manipulating test words. Here the terminal subgoals are to modify the most recent test word and check it against the lexicon. Now clustered participants may produce words that are less similar to their previous solution, because they make more manipulations to the previous solution before switching to another subgoal.

We argue that the outcome of the priming between solutions is dependent on the identity of the terminal subgoals in the subgoal hierarchy—and more generally, on the structure of the solution space and the strategies associated with navigating it. In the present case, the three hypotheses represent three different sources of subgoal perseveration: Hypothesis 1) always sample from the letter set, Hypothesis 2) modify the previous solution from the current letter set, when available or Hypothesis 3) modify the previous test word from this letter set, when available. Following the experiments, we formalize and simulate these hypotheses in the CESP model.

Methods

Participants

75 students from Indiana University participated in the experiment. Participants received course credit for participation. We established a completion criterion of at least 20 words found in the third word puzzle phase within the allotted time of approximately 1 hour (minus the time for the lexical search training, maze training, and spatial search task). Participants were informed that their goal was to find at least 30 words. Six participants failed to meet the above criterion.¹ Two participants also stayed longer than 5 standard deviations beyond the mean staying time in one or more letter sets, more than 200 seconds longer then the next closest participant, and these participants were also removed from the analysis. Participants were randomly assigned to the two separate treatment groups by the computer. After removing the outliers, this yielded 35 participants in the clustered treatment and 32 participants in the diffuse treatment.

Apparatus and procedure

Participants were seated in front of a computer and asked to follow written instructions that appeared on the screen. Instructions guided participants through four activities, beginning with a training session in the abstract lexical search task, followed by a maze training, a spatial search task, and then a final session in the abstract lexical search task. The initial training session in the lexical task and maze training were to familiarize the participants with the controls for the separate tasks.

Lexical search task

Participants were asked to find one or more words made up of at least four letters from each of a sequence of letter sets containing 6 letters, four consonants and two vowels (e.g., the letter set SULMPA can be used to form, among other words, SLAP and PLUM). Plurals and proper names were disallowed. Once a letter set was displayed, participants could type in as many words as they wanted, or click on a button to move to the next set. Letter sets were constructed randomly using only the twenty most common letters in English (i.e., excluding K, V, X, Z, J, and Q). Previous work has shown that participants in this task are sensitive to letter frequency, which is associated with number of possible solutions (Wilke, 2006), and we did not want there to be obvious cues to the ‘patch size’ for each letter set (i.e. the number of possible words present in a letter set). 18 letter sets were constructed. All participants saw the same four letter sets during the training phase. The remaining 14 letter sets were randomized across subjects. Immediately after entering a word, participants were given on-screen feedback as to whether it was correct (i.e., an English word, not plural, and formed from the appropriate letters) or incorrect. There were on average 14.7 (SD = 5.5) valid solution words per letter set, judged according to the wordsmith.org anagram dictionary. Participants could leave a letter set at any time but had to wait fifteen seconds before the next letter set was shown. After leaving a letter set, participants could not visit it again. All participants used in this study found the required number of words before seeing all the letter sets. Participants received instructions and training on one letter set before moving to the pretest session. In the pretest, participants were asked to find words in three letter sets, with no directions regarding how many words to find before moving on to the next letter set. The pretest session ended when participants left the last of these three letter sets. In the post-test lexical search session (following the spatial search treatment), participants were told that they needed to find a total of 30 correct words across any number of letter sets to finish the experiment. They were also told that they could spend as much time as they liked on any given letter set, but that they should allocate their time appropriately so as not to stay too long or too short in any given letter set. Analyses were performed on correct solutions only.

Spatial search task

In the spatial search task, participants controlled the movement of a foraging icon using the ‘J’ and ‘L’ keys representing ‘turn counterclockwise’ and ‘turn clockwise’, respectively.² The keys initiated turns of 35 degrees per step, and forward speed was approximately 20 pixels (steps) per second, which was constant through the trial. To familiarize participants with the controls, participants first had to complete a maze training task, involving the navigation of a two-dimensional maze. Following completion of the maze, participants entered the foraging treatment. Here participants saw an initially blank 200 × 200 pixel field (which wrapped around at the edges) with their icon in the center. They were told to move the icon to find as many hidden “resource” pixels as they could in the allotted time. Time was indicated by a sweeping clock-hand in the upper-right corner of the screen (clock units indicated remaining time steps, 20 per second)—this was meant to provide a sense that time in the trail was passing without providing explicit information about when the trial would end. Participants were randomly assigned to one of two resource distributions, ‘clustered’ or ‘diffuse,’ consisting of 3124 resource pixels within 4 diamond-shaped patches of 781 contiguous pixels each or 700 patches of 5 pixels each, respectively. Patches were assigned random locations. Figure 2 shows example resource distributions for the two spatial search treatments (above) and typical participant foraging paths for each treatment (below). Resource pixels were not visible to participants until they were encountered, nor were the participants' paths visible at any time. Once a resource pixel had been found by a participant (by moving over it) it was shown in green and remained visible on the screen for the remaining duration of the trial. A participant searched through five spatial search trials for 2 minutes each, with a different random arrangement of patch locations in each trial.

A. Examples of clustered and diffuse resource distributions. Black pixels represent resources (not seen by participants until they pass over them in Experiment 1, or seen at the beginning of the task in Experiment 2). B. Example paths for single participants in clustered and diffuse treatments. Grey circles are positioned over the pixels where participants found a resource.

Results and Discussion

Overall, the results replicate the pattern found in Hills et al. (2008). The average turning angle for the clustered treatment group was 20.1 degrees and for the diffuse treatment group 11.5 degrees per 0.2 seconds (t(65) = 4.27, p < 0.001). Participants who were in the clustered spatial search treatment stayed longer in each letter set in the abstract lexical search task than did participants exposed to the diffuse spatial search treatment (clustered mean = 85.6 seconds, SD = 28.32, diffuse mean = 70.0 seconds, SD = 25.8, t(65) = 2.39, p < 0.05). Figure 3A presents this pattern in terms of time spent in the first seven letter sets in the posttest (after spatial search) minus the mean time spent across the three letter sets in the pretest, (more than 80% of the participants reached the 7th letter set).

Priming results from Experiment 1. A. Time spent in each letter set, for each treatment group, across the first 7 letter sets. B. The bigram similarity between successive solutions within a letter set, for each treatment group. Error bars are standard error of the mean.

Our dependent measure is the difference in time spent in each sequence between the posttest and the pretest. To control for the possibility that individuals who stay varying amounts of time in a letter set may be more or less susceptible to the spatial foraging manipulation, we used the pretest time, the position in the sequence of letter sets (i.e., first letter set, second, third, etc.), and the spatial distribution treatment as independent variables in an unbalanced equivalent of the repeated measures analyses of variance (i.e., the linear mixed effects model, Laird & Ware, 1982). The unbalanced test is needed because participants submitted differing numbers of solutions for each letter set. The main effects of pretest time and sequence position were both significant (F(1, 64) = 38.2, p < 0.001; F(1, 512) = 24.51, p < 0.001, respectively).⁴ The treatment effect was also significant (F(1, 64)=6.68, p = 0.01), with participants from the clustered spatial condition staying significantly longer in the letter sets than participants in the diffuse spatial condition. For completeness, we find the same results for treatment even if the dependent measure is absolute posttest time with treatment as the only predictor (F(1, 65)=6.05, p = 0.02).

Including the amount of resources found in the spatial search task had no effect on this result and was not itself a significant predictor of time spent in the letter set sequences (p > 0.1). The two groups did not differ in the number of sequences visited (mean for both groups = 9.22 sequences, SD = 2.0, p > 0.05), but did differ in the time taken between word submissions (clustered mean = 17.31 sec, SD = 4.56, diffuse mean = 15.00 sec, SD = 5.03, t(65) = 1.96, p = 0.05), and the total time taken to complete the task (clustered mean = 821 sec, SD = 254, diffuse mean = 670 sec, SD = 180, t(65) = -2.8, p < 0.01).

In Figure 3A, the fact that the posttest-pretest differences are greater for the participants in the clustered than the diffuse conditions indicates that clustered participants stayed longer in letter sets (in the posttest) following the spatial search treatment. Diffuse treatment participants, on the other hand, may have stayed a shorter period of time (at least for the first sequence, though this effect gets weaker for later letter sets). The gradual decline in time spent in the letter sets is picked up in the sequence position effect, with participants eventually staying for shorter durations in each letter set as the number of letter sets they visit increases. Principally, the treatment effect demonstrates that the distribution of resources in a spatial search task can influence the search for resources in a subsequent task, even if the environment in which the subsequent search is performed is radically different from the spatial search task and only involves space in a metaphorical manner. Moreover, the transfer of search tendencies across two domains, typically taken to represent independent subdomains in working memory (Baddeley, 1996; Logie, Zucco, & Baddeley, 1990), suggests that the transferred process is associated with a central executive process and this processing is domain general, i.e., shared across tasks.

A further test of this domain generality is whether or not the observed search priming effect at the level of the letter sets can also be witnessed in the search within a letter set. A participant's solution sequence within a letter set can be described as a path through a similarity space, where distances between words correspond to the similarity measure. A similarity distance measure has been developed in earlier research with anagrams, which found that the anagram solutions of non-experts are generated using serial-hypothesis testing in which letters are sampled from the letter set and manipulated to produce successive hypotheses (Mayzner, Tresselt, & Helbock, 1964; Mendelsohn, 1976; Novick & Sherman, 2008). In this case, the distance from the letter set to the solution is assumed to be dependent on the number of manipulations required to reach the solution from the letter set. The manipulation distance is inversely related to the bigram similarity—that is, the number of bigrams shared between two words or sets of letters. The assumption is that two words that are more similar to one another, in terms of sharing more bigrams, require fewer manipulations in working memory to move from one to the next. For example, BOAT and BOLT share 3 bigrams (start-B, BO, and T-end), for a bigram similarity of 3. This similarity measure is supported by the observation that greater bigram similarity between the letter set and the solution decreases the time to find that solution (Gavurin & Zangrillo, 1975).

Based on these results, we used bigram similarity to construct a search representation for anagram solutions. Because we use anagrams with multiple solutions of varying size, our measure of bigram similarity is the ratio of shared bigrams to the number of total bigrams in the larger word (or letter set). Our assumption is that participants are searching locally in word solution space and that this will be detectable in the bigram similarity between successive solutions. Just as foragers often search near where they have found resources in the past, so participants trying to come up with words from a letter set may search for words close to where they have found other solutions in the recent past. We argue that this nearness can be detected in the participant's solutions, and that it can provide information about where participants are perseverating in their goal structure. It may represent perseveration on a prior solution, the original letter set, or the process of iteratively altering a test word. To test among these alternative hypotheses, we computed the shared bigram similarity between successive submitted words, and the shared bigram similarity between each submitted word and the letter set itself.

Figure 4A provides a visual representation of all participants', in both conditions, sequential solutions in the letter set NSBDOE. Nodes represent solutions and arrows between nodes represent the production of one solution following another. Darker lines indicate that more participants made a particular solution-solution sequence. Larger node sizes indicate that more participants produced that solution before moving on to the next letter set. From this figure, one can see that words that follow one another tend to share similar letter arrangements. For example, BONE following DONE was a common transition, as was DOSE following NOSE (both pairs share three bigrams). However, transitions between less similar words are more rare (e.g., pairs like DENS- BOND and SNOB-BEND share no bigrams and zero links in Figure 4A).

A. A visual depiction of the between word transitions produced by all participants in the letter set NSBDOE. Nodes represent solutions and links between nodes represent transitions between words, with the arrow showing which word came second. Node size is proportional to the number of participants who provided that solution for this letter set. Link thickness is proportional to the number of participants who made that transition. For visual clarity, only transitions that took place more than twice are represented with a link. B. The bigram similarity of the present solution to previous (N-1) and penultimate (N-2) solutions and to the original letter set. Error bars are standard error of the mean.

Figure 4B presents the bigram similarity between the present solution (N) and previous solution (N-1), penultimate solution (N-2), and the current letter set, averaged first within participants, and then across participants, for all solutions in both treatment conditions. A paired t-test reveals that the previous solution (N-1) is more similar to the present solution than either the penultimate solution (t(66) = 5.42, p < 0.001) or the letter set (t(66)=9.44, p < 0.001). Because the letter set is usually larger than the words produced from it, we can statistically control for this bias by computing the bigram similarity ratio as shared bigrams over the number of bigrams in the smaller word. Doing this, however, does not alter the above results; the previous solution still shares the highest bigram similarity with the current solution (as above, t(66) = 5.152, p < 0.001, and t(66) = 12.97, p < 0.001, respectively).

The finding that solutions are most similar to the solutions that precede them is consistent with the hypothesis that participants are using the previous correct submission as a starting point for the next exploratory search (Hypothesis 2 and 3), and inconsistent with the hypothesis that they are only using the letter set (Hypothesis 1). If they were predominantly using the letter set, then the average similarity between words should be approximately equal without regard to its order in production. Instead, the evidence suggests that participants use prior solutions as waypoints for further search into the solution space.

To evaluate the effect of the two spatial environment conditions on the between solution similarity, we calculated the mean similarity between successive solutions generated by each participant, averaged separately over participants in the two treatment groups. Figure 3B shows that the mean bigram similarity for successive solutions in the clustered group (M = 0.35, SD = 0.07) was significantly lower than that for the diffuse group (M = 0.39, SD =0.06; t(65) = 2.41, p = 0.02). Using a linear mixed effects model, we also found a significant effect of treatment condition on bigram similarity (t(65)=2.28, p = 0.03), and this holds even after controlling for the number of words submitted per letter set (t(65)=2.42, p = 0.02). This suggests that experience searching in a particular spatial distribution also affects within-letter set word search in terms of solution similarity. However, the direction of the result supports Hypothesis 3: The diffuse spatial resource condition led participants to produce words that were more similar to the preceding solution than did the clustered treatment condition (we found evidence for the same pattern in the data from Hills et al., 2008, data not shown).

Participants who experienced clustered resources do not appear to perseverate more on their previous solution—which would have increased their between-word similarity—but may perseverate more on the subgoal of iteratively manipulating the test word. When we present the CESP model, we demonstrate how search in subgoal hierarchies can generate the above patterns.

Experiment 2

Experiment 1 established that the spatial search task was sufficient to induce priming at multiple levels in the lexical search task. However, in experiment 1, three alternative hypotheses for the priming are possible: 1) the priming may be elicited by motor perseveration, because the clustered condition required individuals to initiate multiple tight turns in order to stay in one location, or 2) the priming may be a consequence of a high level schema associated with clustered or diffuse resource distributions irrespective of search strategies, or 3) the priming may be mediated by an implicit search mechanism, that calibrates a tendency for exploratory or exploitative behavior, and thus mediates the maintenance and transfer of attention between subgoals.

For the motor hypothesis, perseveration at a motor level may initiate a similar kind of perseveration on cognitive processes, through the continuation of behavioral tendencies. For the schema hypothesis, simply harvesting from a resource distribution may elicit a particular pattern of expectations (potentially metaphorical) about how resources should be distributed in future environments—and this carries over from one task to the next. Only the implicit search mechanism requires an overt self-regulating search process—the other two hypotheses only require that individuals harvest from resource distributions.

This distinction between the need for effortful self-regulation or simply following overt cues provided by the environment is also important with respect to task switching. In numerous studies, task switching costs associated with a potential central executive have been found under effortful cognitive control conditions involving self-regulation in the absence of overt cues (Logie et al., 1990; Baddeley et al., 1998; Logan, 2004). However, others have found less convincing evidence for a relation between task switching and potential executive processes, such as working memory span, and have called for a more detailed analysis of how different tasks may interfere with one another (Oberauer et al., 2008; Hale, Myerson, Emery, Lawrence, & Dufault, 2008). One compelling possibility is that central executive and associated working memory processes are only required when self-regulation is required to manage task switching, i.e., in the absence of overt cues.

In the second experiment, we wanted to determine whether or not an implicit search process is required in the spatial search task in order to elicit priming in the lexical search task. To this end, we replicated the experiment from Hills et al. (2008) except that the full resource distribution was made visible to participants at the beginning of the task. Consequently, if priming is caused simply by the behavior of harvesting resources by either a scattered or concentrated method (moving to them and passing over them), then we should find priming in this visible-resources setting as well. However, if the priming requires modulating an implicit executive search process, then priming should not occur when all resources are visible and no exploration or location prediction is required.