Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Sep 1.
Published in final edited form as: Cogn Sci. 2010 Sep 1;34(7):1287–1314. doi: 10.1111/j.1551-6709.2010.01130.x

Knowledge as process: Contextually-cued attention and early word learning

Linda B Smith 1, Eliana Colunga 2, Hanako Yoshida 3
PMCID: PMC2992382  NIHMSID: NIHMS224213  PMID: 21116438

Abstract

Learning depends on attention. The processes that cue attention in the moment dynamically integrate learned regularities and immediate contextual cues. This paper reviews the extensive literature on cued attention and attentional learning in the adult literature and proposes that these fundamental processes are likely significant mechanisms of change in cognitive development. The value of this idea is illustrated using phenomena in children novel word learning.

Keywords: cued attention, development, noun learning


In her introduction to the 1990 special issue of Cognitive Science, Rochel Gelman asked “How is it that our young attend to inputs that will support the development of concepts they share with their elders?” Gelman posed the question in terms of attention, but the answers offered in that volume were not about attention. Instead, they were about innate knowledge structures, so-called “first” or “core” principles that guide learning in specific knowledge domains. In the years since 1990, inspired in part by the highly influential papers in that special issue, there have been many demonstrations of the remarkable knowledge that quite young children bring to bear on learning. But there have been very modest advances in understanding the processes and mechanisms through which that knowledge is realized and applied to aid learning. Accordingly, in this paper we return to Gelman's original question: How do children select the right information for learning?

We consider this question in the context of how children learn words, one of the topics also of central interest in the original special issue. We begin by reviewing well-documented and general mechanisms of attentional learning from the adult literature from the perspective of their relevance as mechanisms of cognitive development. We then ask – mostly without direct empirical evidence on the answer – what role these known mechanism might play in early word learning. Our main goal is to encourage researchers to pursue these mechanisms as significant contributors to word learning and to cognitive development more generally. This --- taking well-documented mechanisms from one area of research and asking if they might apply in another, moving the field toward a more unified understanding of what might seem at first unrelated domains --should not be contentious. However, the literature on early word learning is highly contentious (Booth and Waxman, 2002; Cimpian & Markman, 2005; Smith & Samuelson, 2006) and not in a productive way, as the explanations pitted against each other are not empirically resolvable in a straight-forward way.

One word-learning principle, mutual exclusivity, discussed in the original special issue may help illuminate the problem. The phenomenon, which is not at issue, is this: Given a known thing with a known name (e.g., a cup) and novel thing and told a novel name (“Where is the rif?”) children take the novel word to refer to the novel object and not the known one. Markman (1990, page 66) explained this behavior as follows: “children constrain word meaning by assuming at first that that words are mutually exclusive –that each object can have one and only one label.” This description summarizes the phenomenon in terms of the macro-level construct of “assumptions.” This construct, of course, may be unpacked into a number of micro-level processes, including, as we will propose here, cue competitions. An account in terms of cue competitions and attention is not in opposition to an account in terms of assumptions because the two kinds of explanations are at fundamentally different levels of analysis that answer different questions. The mutual exclusivity assumption as proposed by Markman is a statement about an operating characteristic of the child's cognitive system that facilitates word learning. A proposal about cue competitions is a proposal about the more micro-level mechanisms that may give rise to that operating characteristic. An analogy help: One possible account of why someone just ate a cookie is that they are hungry; another possible account is that they ate the cookie because of low levels of leptin and the release of ghrelin. The second account might be wrong, and the first might well be right; but it the second is not in opposition to the first in any sensible way. Moreover, a choice between the two cannot be made by decided according to which one better accounts for the macro-level behavior of eating the cookie. Instead, the relevance of leptin and ghrelin to cookie eating must be decided in terms of micro-level processes about which the macro-level construct of hunger makes no predictions.

In what follows, we consider Gelman's original question of how children might know to attend to the right properties for learning by considering contemporary evidence on attentional mechanisms and then asking whether these mechanisms might play a role in enabling children to attend to the right information for learning words. The evidence suggestive of a role for cued attention in word learning is primarily at a macro level. Accordingly, we next discuss the kind of micro-level studies needed to pursue these proposed mechanisms. We do not consider macro-level explanations of children's early word learning as competing hypotheses to the proposals about cued-attention. However, we conclude with a reconsideration of the contention in the early word learning and what is (and is not) at stake.

Cued Attention

Attention is a construct that is so widely used in psychological theory that William James (1890) lamented “everyone knows what attention is” with the subtext that everyone may know but no one agrees. Within developmental psychology, “attention” is currently studied with respect to several different (but potentially deeply related, see Posner & Rothbart, 2007) phenomena, including sustained attention (e.g. Miller et al, 2009; Richards, 2004; 2008), attentional switching and disengagement (e.g. Posner et al, 1998; Blago & Columbo, 2006; Richards, 2008), executive control (e.g.,Chatham, Frank & Munakata, 2009; Diamond, 2006; Hanania & Smith, in press), and joint attention among social partners (e.g., Grossman & Farroni, 2009; Hirotani et al, 2009). There are very few developmental studies specifically concerned with contextually-cued attention (e.g., Wu & Kirkham, in press, Smith & Chatterjee, 2008; Goldberg, Maurer, & Lewis, 2001). Although we will briefly consider how cued-attention might be related to other forms of attention in children at conclusion, we focus on contextually-cued attention in this paper precisely because so little is known about its development despite its apparent ubiquity in sensory, perceptual, and cognitive processing in adults. Briefly, the well-documented fact is this: cues that have been probabilistically associated with some stimulus in the past enhance detection, processing and learning about that stimulus (e.g. Chun & Jiang, 1998; Brady & Chun, 2007). In this section, we briefly summarize the very broad literature that supports these conclusions, noting how these processes capture regularities in the input, protect past learning and guide future learning.

Predictive cues

The attentional consequences of learned associations between cues and the stimuli they predict has been well known since the work of MacIntosh (1975) and Rescorla and Wagner (1972). Originally in the context of classical conditioning but more recently also in the broader domain of associative learning (see Pearce and Bouton, 2001; Kruschke and Blair, 2000; Chapman & Robbins, 1990; see also, Ramscar, et al, in press), these studies present learners with cues that predict specific outcomes. The results show that what is learned about the relation between those cues and outcomes depends on the cues present in the task, their relative salience and the learner's history of experiences with those cues in predicting outcomes. Critically, these cue-attention effects are not about single cues associated with single attentional outcomes but rather are about the consortium of cues present in the moment and all of their predictive histories.

Three illustrative phenomena are overshadowing, blocking and latent inhibition (Kamin, 1968; Lubow & Moore, 1959; Mackintosh, 1975; Rescorla & Wagner, 1972). The relevant task structures are shown in Table 1. Overshadowing refers to the situation in which two cues are presented together and jointly predict some outcome (e.g., in a category learning study, two symptoms might predict some disease, or in classical conditioning, a tone and a light might predict shock). The strength of association of each cue to the outcome depends on their relative salience. However, it is not simply that the most salient cue is learned better, rather salience differences are exaggerated in that the more salient cue may “overshadow” the less salient cue with little or no learning at all about the less salient cue (Kamin, 1969, Mackintosh, 1976; Rescorla-Wagner, 1972 Kruschke, 2001; Grossberg, 1982). Blocking refers to the case in which the greater salience of one cue is not due to intrinsic salience but is due, instead, to past learning. If some cue regularly predicts some outcome and then, subsequently, a second cue is made redundant with first and so also predicts that outcome, there is little or no learning about the second cue: learning is blocked by the first predictive cue (Bott, Hoffman & Murphy, 2007; Kruschke & Blair, 2000; Kamin, 1968; Shanks, 1985), as if the first cue is more “salient” and thus overshadowing in this predictive context. Latent inhibition, like blocking, also makes the point that salience is a product of learning and predictive strength. But here the phenomenon is learned irrelevance: If a cue is first varied independently of the outcome so that it is not predictive at all but then it is made perfectly predictive, it will be hard to learn and “overshadowed” by other cues (e.g., Lubow, 1993; Lubow & Kaplan, 1997; Mackintosh & Turner, 1971).

Table 1.

Cue interactions in associative learning The letters A,B,C indicate cues and the letters X and Y indicate predicted outcome for the first phase of learning, the subsequent second phase of learning. What is learned after the second phase indicated in the third column. Bold letters indicate more salient cues (either through learning or intrinsic salience).

Cue Interaction First learning phase Second learning phase Learning Outcome
Overshadowing AB→X A→X
Blocking A→X AB→X A→X
Latent inhibition B→ AB→X A→X
Highlighting AB→X AC→Y C→Y
Mutual exclusivity A→X AB→Y B→Y

These phenomena make three key points about the cues in cued attention: (1) cue strength is determined by predictive power, not mere co-occurence, (2) cue strength depends on the ordered history of predictability, not batch statistics, and (3) individual cues interact such that one cannot simply predict whether some cue will be learned by considering it alone, one must instead know its history and the history of the other cues in the learning environment. These phenomena are evident in many adult statistical learning tasks (e.g., Cheng & Holyoak, 1995; Kruschke & Blair, 2000; Kruschke, 2001; Kruschke & Johansen, 1999; Ramscar et al, in press) and as Ellis (2006) concluded, their ubiquity implies that human learning about predictive relations is bounded by basic mechansisms of cued attention (see all Yu & Smith, under review, Yoshida, under revision).

Highlighting is a higher-level phenomenon that may be understood as a product of blocking and overshadowing and it is a particularly robust phenomenon in adult associative learning (Kruschke, 1996, 2005; Ramscar et al, in press). It also demonstrates how cued attention both protects past learning and guides new learning. Highlighting emerges when adults first learn one set of predictive cues to task relevant information and then later are exposed to an overlapping set of new and old predictive cues that predict the relevance of different information. The task structure that leads to highlighting is also provided in Table 1. Learners are first exposed to a conjunctive cue (A + B) that predicts an outcome (X), and then are presented with a new conjunctive cue that contains one old component (A) plus a new one (C) that predicts a new outcome (Y). The key result concerns learning during the second phase: learners associate the new cue with the new outcome more than they associate the old cue with the new outcome. For example, if the learner is first taught that RED and SQUARE predict category X and then is taught that RED and CIRCLE predict category Y, the learner does not learn about the relation of RED and category Y, but rather appears to selectively attend only to the novel cue, CIRCLE, and to learn that CIRCLE predicts category Y. In brief, novel cues are associated with novel outcomes. By one explanation, this derives from the rapid (and automatically driven) shift of attention away from the previously learned cue (RED and SQUARE) in the context of the new outcome, so that the new cue (CIRCLE) becomes attentionally highlighted and strongly associated with the new outcome (e.g., Kruschke, 1996, 2005). This attention-shifting account has also been supported by eye-tracking results (Kruschke et al, 2005).

Highlighting, as well as blocking and overshadowing may be understood in terms of competitions among predictive cues, as if cues fight for associations with outcomes, so that once an association is established it protects itself by inhibiting the formation of new associations to the same predicted outcome. There are many reasons to think that these kinds of cue competitions should play a role in cognitive development, including the robustness of the phenomena in adults across a variety of task contexts. There is also one phenomenon in children's learning that shares an at least surface similarity to cue competition effects in associative learning: mutual-exclusivity (Markman, 1989; see also Hollich, Hirsh- Pasek, Tucker, & Golinkoff, 2000; Halberda, 2009).

As illustrated in Table 1, the task structure that yields mutual exclusivity looks very much like that of blocking or highlighting: a first-learned association (the word “cup” to the referent cup) is followed by a subsequent learning task with a new cue (the novel word) and outcome (the novel object). Consistent with this proposal, several models (e.g., Regier, 2005; Mayor & Plunkett, 2010) have shown how mutual exclusivity might emerge in cue interactions in associative learning. Also consistent with this proposal are studies tracking moment-to-moment eye gaze direction in infants (Halberda, 2009); the attention shifting by infants in these studies resemble those of adults in highlighting experiments (Kruschke, et al, 2005).

Lots of cues

Blocking and highlighting emerge in experimental studies in which researchers manipulate at most 2 to 3 cues and outcomes. The world, however, presents many probabalistically associated cues and outcomes yielding potentially complex patterns of predictability, with cues predicting other cues as well as potentially multiple outcomes. Importantly, such a large “data-set” of associations are also likely to have considerable latent structure, higher order regularities that might support deep and meaningful generalizations that go beyond specific cues and outcomes. Several connectionist models have sought to understand the structure of these higher order regularities (see, for example, Colunga & Smith, 2005; Colunga, Smith & Gassser, 2009; Rogers & McClelland, 2003; Kruschke, 1990, 1992, 2001). In one study, Colunga and Smith (2005) examined how the statistical regularities among perceptual properties of instances of basic level noun categories might create higher order partitions (object, substance) and also direct attention to relevant properties for categorizing artifacts (shape) and substances (material). To this end, they fed a connectionist net perceptual features (e.g., solidity, material, color, shape) that adults said characterized 300 noun categories commonly known by 2 year olds. From this input, the network acquired generalized cued-attention effects that worked even when presented with novel entities; specifically, in making decisions about novel categories, the network weighted shape more in the context of solidity and weighted material in the context of nonsolidity. This result tells us that the corpus of early learned nouns and the correlated proeprties of the objects to which those nouns refer contain useable cue-outcome regularities of the kind that could train attention.

Several theoretical and empirical analyses suggest that systems of associations often present a form of coherent covariation (Rogers & McClelland, 2003) across cues and outcomes. The importance of coherent covariation to human learning has been demonstrated in experimental tasks showing that adults (and infants, Younger and Cohen, 1986) are more likely to attend to and learn about features that co-vary than those that merely co-occur (e.g., Kruschke & Blair, 2000; Medin & Schafer, 1978; Medin, Altom, Edelson, & Freko, 1982). Other theorists have also pointed to what they call the systematicity (Billman & Heit, 1989) of associations; cues that probabalistically predict an outcome, also often predict each other, and the systematicity across associations among cues, among outcomes, and among cues and outcoms matters in learning (e.g., Billman & Knutson, 1996; Goldstone, 1998).

Yoshida and Smith (2003b, 2005; see also, Sloutsky & Fisher, 2008) proposed a cued-attention framework for thinking about the interactive effects of redundant cues that builds on the idea of interactive activation (see Goldstone, 1998; Billman, 1996; Billman & Knutson, 1996; Medin, Altom, Edelson, & Freko, 1982; O'Reilly, 2001). The proposal is illustrated in Figure 1 using the simple case of just three correlated cues: the correlation between cues a and b are learned either in the context of a third redundant cue, c, or without that third correlated cue. Experimental studies show that the learned connection between a and b is stronger if acquired in the context of c, which correlates with both a and b, than without that redundant correlation (Yoshida & Smith, 2005; Billman & Knutson, 1996). Moreover, the stronger associative link between a and b remains even when the redundant cue, c, is removed. Of course, in real world learning, there may be many more than 3 correlated cues and much more complex patterns of correlation. Through mutually reinforcing correlations such as these, a system of many correlated cues may lead to what have sometimes been called “gang effects:” highly interconnected and dense patterns of associative links that give rise to patterns of internal activation that, as a result, are not tightly dependent on any one cue (e.g., Billman & Knutson, 1996; O'Reilly, 2001; Yoshida & Smith, 2003a,b; Goldstone, 1998). Instead, the interactive activation among co-varying cues can lead to snowball effects in which the joint activation of cues is stronger than the sum of the individually contributing associations.

Figure 1.

Figure 1

Redundant correlations in the learning environment lead to stronger to stronger associations.

One developmental result raised by Gelman (1990) in the original special issue, “illusory projections,” may be understood in terms of such gang effects (see Rogers & McClelland, 2003 for a more detailed account). Gelman reported that preschool children project properties onto an instance of a category that are not perceptually there; children say that they saw feet on and eyed yet ball-like and footless thing if they were told that the thing could could move on its own. Figure 2 provides an example of how this might work within the cued attention framework. On the left is a hypothetical description of overlapping correlations among a set of perceptual properties: things with eyes tend also to have feet, to move on their own, and so forth. Because of the overlapping correlations among all these properties, each will serve as a context cue that predicts and “primes” attention to the others (see Yoshida & Smith, 2003b) thereby potentially causing an individual to more readily detect or “see” a property in an ambiguous stimulus. Thus on the right of Figure 2 is a hypothetical description of the cluster of cues input to the system – eyed, furry, moving on its own, but footless. By hypothesis, fur, eyes, and movement will all increase attention to each other and, given the ambiguous stimulus, lead to the perception of feet.

Figure 2.

Figure 2

An illustration of how redundant and overlapping correlations can lead to pattern completion and illusory projections.

Enhanced processing of predicted outcomes

Cues predict outcomes, and in the present use of the term, an outcome is any stimulus event that is predicted (it could be another cue or the specifically task-relevant target information). Considerable and growing evidence suggests that cues that predict outcomes also enhance detection and processing of the predicted event. The relevant experiments often involve search or detection tasks in which participants are asked to detect some target– a shape or color – typically in a crowded field of competing distractors (Chun & Jiang, 1998; Jiang, Olson & Chun, 2000; Gölcu & Gilbert, 2009). These studies have revealed strong effects of repeating arrays. Arrays that have been seen before show enhanced detection of the target, often with just one pre-exposure but increasing with repetition. This phenomenon is usually explained as a form of cued attention in which the array as a whole cues attention to a specific target at a specific location (Biederman, 1972, Scheider & Shiffrin, 1977, Shiffrin & Scheider, 1977; Boyce, Pollatsek & Rayner, 1989; Lewicki, Hill & Czyzewska, 1992).

The experimental evidence also shows that in these tasks, contextual cues and their attentional effects emerge without awareness (Lewicki, Hill & Czyzewska, 1992, 1997; Chun and Jiang, 1998; Jiang & Chun, 2001, 2003; Jiang & Leung, 2005; Shanks, Channon, Wilkinson & Curran, 2006; Olson & Chun, 2002) and result both in rapid shifting of attention to a location (e.g., Jiang & Chun, 2001; Summerfield et al, 2006; Clohessy, Posner & Rothbart, 2001; Chun and Turke-Browne, 2007) and also to the enhanced processing of particular stimulus features (Maunsell & Treue, 2006; Bichot, Rossi, & Desmione, 2005; Kruschke, 1996; Rossi & Paradiso, 1995). The growing neuroscience evidence on cued attention also indicates that contextually-cued enhancements of stimulus processing are pervasive across early sensory processing and higher-level perceptual and cognitive systems (Beck & Kastner, 2009; Gilbert, Ito, Kapadia & Westheimer, 2000; Pessoa, Kastner & Ungerleider, 2003; Kovács, 1996). For example, neurophysiological studies have shown that cue-target associations enhance base-line firing rates (Chelazzi, et al, 1993; 1998; Kastner et al, 1999) at early and middle levels in the visual system and alter neuronal tuning of target properties (Spitzer, et al, 1988; Yeshurun & Carrarasco, 1998; Treue & Matinez, 1999; Williford & Maunsell, 2006). Context cued enhancements of processing have also been shown to play a role in decision making and in integrating sensory information across systems (Reynolds & Heeger, 2009; Boynton, 2009; Bichot et al, 1996; Gold & Shadlen, 2007).

Contemporary theories of these effects, often based on analyses of neural patterns of excitation, share a great deal with more classical explanations (e.g., MacIntosh,1975; Rescorla & Wagner, 1972) of cued attention that emphasize prediction (or pre-activation, e.g., Summerfield et al, 2006) and competition (e.g., Duncan,1996; Desmione & Duncan, 1995). For example, the biased-competition theory of selective attention (see Duncan,1996; Desmione & Duncan, 1995; Beck & Kastner, 2009) begins with the starting assumption that competition characterizes representational processes at the sensory, motor, cortical and subcortical levels. In general, activation of a representation of some property, event or object is at the expense of other complementary representations. Selection, or attention, then, occurs by biasing (e.g., priming) some representations in favor of others or by inhibiting competing repetitions. The presence of contextual cues previously associated with some representation (at any of these levels) is thus thought to bias the competition. Multiple activated cues compete, and thus also interfere with each other, with stronger cues inhibiting weaker ones in a manner common to lateral inhibition models (von Bekesy, 1969; Walley & Weiden, 1973; de Zubicaray & McMahon, 2009; Ludwig, Gilchrist & McSorley, 2005). On these grounds, some have suggested that attention and selection is fundamentally a form of lateral inhibition in which the degree of activation of one representation inhibits that of nearby competitors (see Duncan, 1996; Beck & Kastner, 2009).

Here, then, is what we know about cued attention: Adults readily and unconsciously learn cues that predict other sensory events (outcomes) that are relevant in tasks. These predictive cues interact –both by supporting activation of the predicted event when they are correlated and also through competition that depends in fine-grained ways on the relative strengths of these cues and their history of prediction. These interactions are such that (1) early learned predictive relations tend to be preserved; (2) attention is systematically shifted to novel cues in the context of novel outcomes; and (3) the coherent covariation in large systems of cues and outcomes can capture latent structure that organizes attention in meaningful ways. Finally, predicted sensory events are processed faster and tuned more sharply than unpredicted events. All this suggests that cued attention is a basic mechanism that is likely to play a contributing role in many knowledge domains.

Cued Attention as Developmental Process

Cued attention is also a mechanism of change and one that seems capable of driving considerable change in the cognitive systems. This is because cued attention is a single mechanism that aggregates knowledge, is a repository of knowledge, and that guides learning and thereby driving the acquisition of new knowledge. In brief, attentional learning is a self-organizing process that builds on itself, becoming more directed, more knowledge driven, and potentially more domain specific as a consequence of its own activity. Figure 3 builds on a prior proposal by Colunga & Smith (2008) about how cued attention gathers, integrates and applies information over nested time scales. The three separate boxes on the left illustrate attentional processes at three time scales. The large box represents long-term associations among the many cues and outcomes in the learning environment. The middle –box indicates the task context and the in-task dynamics of prediction and pre-activation of cues and outcomes (see Samuelson & Smith, 2001; Samuelson, Schutte & Horst, in press). Finally, there are the processes of interaction and competition driven by the current task and stimuli. As illustrated on the right side of the figure, these are nested processes: In-the-moment attention depends on task context as modulated by long-term associations; in-the-moment attention also adds to those-long term associations.

Figure 3.

Figure 3

The nested times scales of attentional learning. The three separate boxes on the left illustrate attentional processes at three time scales. The large box represents long-term associations among the many cues and outcomes in the learning environment. The middle –box indicates the task context and the in-task dynamics of prediction and pre-activation of cues and outcomes. The small box indicates processes of interaction and competition to a momentary stimulus. As illustrated on the right side of the figure, these are nested processes: In-the-moment attention depends on task context as modulated by long-term associations; in-the-moment attention also adds to those-long term associations.

These nested interactions mean that attention can be biased in different ways in different contexts. Context cues that co-occur with (and define) specific tasks will come with repeated experience to shift attention to the task relevant information. Figure 4 represents this idea in terms of a series of context sensitive salience maps, with the relative salience of regions in the perceptual field indicated by the relative darkness of the locations. The idea is this: Because the history of associated cues increases and decreases the relative activation of features and task targets in the perceptual field, the salience map will change with changes in contextual cues. This means potentially dramatic shifts in the detection, selection and processing of stimulus events in different domains of expertise – word learning, or quantity judgments or spatial reasoning. These domains have associated contextual cues that –given sufficient experience– may structure the salience maps in consequentially different ways. Being smart in a domain may reflect (in part) predicting what information is relevant in that domain. For learners who have sufficient experiences in different domains, attention will nimbly dance about from context to context, enabling those learners to attend to just the right sort of information for that domain. In sum, what we know about cued-attention and attentional learning suggests that these mechanisms will play a role across all domains of cognition and their self-changing nature will create domain specific competencies. We consider next one domain in which these mechanisms may be at work.

Figure 4.

Figure 4

Contextually changing salience maps. Relative salience is indicated by the relative darkness of the locations. By hypothesis, these change with changes in context cues that predict sensory events in the field.

Children's Novel Word Generalizations

Many well-controlled studies show that young children need to hear only a single object name to systematically generalize that name to new instances in ways that seem right to adults (e.g., Golinkoff, Mervis, & Hirsh-Pasek, 1994; Markman, 1989; Smith, 1995; Waxman & Markow, 1995). Moreover, children generalize names for different kinds of things by different kinds of similarities, shifting attention to the right properties for each kind of category. Thus, for the task of learning common nouns, young children have solved Gelman's problem: they know what properties to attend to so as to form categories “shared with their elders.”

In the most common form of the novel word generalization task, the child is shown a single novel entity, told its name (e.g., This is the toma) and then asked what other things have the same name (e.g., Where is the toma here?) Many experiments have examined three kinds of entities (examples are shown in Figure 5) and found three different patterns of generalization. Given objects with features typical of animates (e.g., eyes or legs), children extend the name narrowly to things that are similar in multiple properties. Given a solid inanimate artifact-like thing, children extend the name broadly to all things that match in shape. Given a nonsolid substance, children extend the name by material. These are highly reliable and replicable results – obtained by many researchers – and in their broad outline characteristic of children learning a variety of languages (e.g., Jones, Smith, & Landau, 1991; Kobayashi, 1998; Jones & Smith, 2002; Yoshida & Smith, 2001; Markman, 1989; Booth & Waxman, 2002; Gathercole & Min, 1997; Imai & Gentner, 1997; Landau, Smith, & Jones, 1988, 1992, 1998; Soja, Carey, & Spelke, 1991; see also Gelman & Coley, 1991; Keil, 1994).

Figure 5.

Figure 5

Scatterplots of individual children's performances in the Novel Noun Generalization task as a function of the number of different kinds of nouns in their productive vocabularies. Individual data are from: Smith et al 1992; 2002;, Yoshida & Smith, 2003; Jones & Smith, 1998, 2002; and unpublished data.

Cued attention may a play a role in these generalizations in the following way: Artifacts, animals, substances present different features (angularity, eyes, nonsolidity) and these features co-occur with different words (quantifiers, verbs, adjectives, etc.). The features and co-occurring words are thus potential context cues that could shift attention in systematic ways to the relevant properties for the particular kind of category --- to multiple properties for animals to shape for artifacts, and to material for substances. And, indeed, the literature is filled with experimental demonstrations of these effects (Ward et al, 1991, Jones & Smith, 1998; Waxman & Booth, 2002; Soja, 1994; Samuelson, et al 2008; Gathercole, Cramer, Somerville, & Haar, 1995; McPherson, 1991; Soja, 1994; Yoshida & Smith, 2005; Yoshida et al., 2001; Colunga & Smith, 2004; Colunga, 2006).

These perceptual- and linguistic-context effects on children's novel noun generalizations also increase with age and language learning, just as they should if children are learning relevant contextual cues (Landau, Smith, & Jones, 1988; Soja, Carey, & Spelke, 1991; Jones, Smith, & Landau, 1991; Samuelson & Smith, 1999, 2000; Smith, 1995). Figure 5 presents a summary of the developmental pattern. The figure shows scatterplots of individual children's novel noun generalizations for solid artifactual things, things with eyes, and nonsolid things from a variety of different experiments (with many different unique stimuli and task structures) conducted in our laboratories over the years. Each individual child's generalizations by the category relevant property (shape for solid things, multiple similarities for eyed things, material for nonsolid things) is shown as function of the number of artifact, animal and substance names in the individual child's vocabulary. The figures show that the systematicity of these novel noun generalizations increases with early noun learning. Thus, in the broad view, the developmental pattern of children's novel noun generalizations fits what might be expected if the underlying mechanism was cued attention: Children learn cues for task relevant properties and after sufficient experience, those cues come to shift attention in task appropriate ways.

Cross-linguistic differences

By hypothesis, the learning environment presents clusters of perceptual cues (e.g., eyes, legs, mouths, body shapes in the case of animates) and clusters of linguistic cues (e.g., the words “wants,” “is happy,” “mad” and “hungry”) that are associated with each other and that predict the relevant similarities for categorizing different kinds. If perceptual and linguistic cues are both part of the same attentional cuing system they should interact, and given coherent covariation, should reinforce each other. The “natural” experiment that provides evidence is the comparison of children learning different languages.

Systematicity

The novel word generalizations of children learning English and Japanese have been examined in a number of studies (Imai & Gentner, 1997; Yoshida & Smith, 2001; 2003; 2005). Both languages provide many linguistic cues that correlate with artifacts (and attention to shape), animals (and attention to multiple similarities) and substances (and attention to material). However, English arguably provides more systematic and coherently co-varying cues distinguishing objects and substances whereas Japanese arguably provides more coherently co-varying cues distinguishing animates and inanimates (see Yoshida & Smith, 2003, for a discussion).

In particular, the count-mass distinction in English partitions all nouns into discrete countable entities (objects and animals) or masses (substances) and the various linguistic markers of this distinction (determiners, plural) are strongly correlated with the perceptual cues (and particularly solidity) that predict categorization by shape. Moreover, the predictive relations between linguistic cues and perceptual cues are particularly strong in the 300 English nouns that children normatively learn by 2 ½ years (see Samuelson & Smith, 1999; Smith, Colunga & Yoshida, 2003; Colunga & Smith, 2005). By hypothesis, these linguistic cues should augment attention to shape for solid artifactual things and to material for nonsolid substances. Because these linguistic cues lump object and animal categories together, they might also weaken the predictability of the distinction between artifact categories as shape-based categories and animal categories as organized by multiple properties.

Japanese, unlike English, makes no systematic distinction between count and mass nouns (and has no English-like plural). Therefore there is less redundancy in the cue-category correlations with respect to object and substance categories. However, Japanese, offers more systematic cues with respect to animates and inanimates than does English. As just one example, every time a Japanese speaker refers to the location of an entity, in frames as ubiquitous as There is a____, they must mark the entity as animate or inanimate (ga koko ni iru vs. __ga koko ni aru, respectively). In this, as well as other ways, Japanese, relative to English, adds extra and systematic cues that correlate with perceptual cues predictive of animal versus non-animal category organizations.

Under the cued-attention framework, the coherent variation of linguistic and perceptual cues within the two languages should create measurable cross-linguistic differences in the noun generalizations of children in the artificial word-learning task. For children learning both languages, solidity predicts attention to shape, nonsolidity predicts attention to material, and features such as eyes and feet predict attention to multiple similarities. But for children learning English, the solidity-nonsolidty cues co-vary and are supported by linguistic cues. For children learning Japanese, eyed-noneyed cues correlate with pervasive linguistic contrasts. Thus, there should be stronger, earlier and sharper distinctions in novel noun generalizations for solid versus nonsolids for English speaking children than for Japanese speaking children and stronger, earlier and sharper distinctions between eyed and non-eyed stimuli for children learning Japanese. Experimental studies have documented these differences (Imai & Gentner, 1997; Yoshida & Smith, 2003).

Gang effects

Yoshida and Smith (2005) provided an experimental test of the cued-attention account of these cross-linguistic differences in a 4-week training experiment that taught monolingual Japanese children redundant linguistic cues analogous to count-mass cues in English. The experiment used a 2 × 2 design: linguistic cues versus no linguistic cues correlated with category organization and solidity during training and the presence or absence of those linguistic cues at test. The key results are these: Children who were trained with correlated linguistic and perceptual cues outperformed those who were not so trained, attending to the shape of solid things and the material of nonsolid thing and did so even when they were tested with totally novel entities and even when the trained linguistic cue was not present at test. That is, learning the links between solidity and nonsolidity and shape and material in the context of redundant linguistic cues made the perceptual cue-outcome associations stronger.

Illusory Projections

In a related study, Yoshida and Smith (2003) presented English and Japanese children with novel entities like that in Figure 6. The protrusions are ambiguous; depending on context, both Japanese- and English-speaking children could see them as legs or as wires. However, when presented in a neutral context, Japanese-speaking children were more likely to see them as leg like and to generalize a novel name for the exemplar by multiple similarities; English-speaking children were more likely to see them as wires and to generalize a novel name for the exemplar by shape. In brief, Japanese-speaking children showed an enhanced sensitivity to subtle cues vaguely suggestive of animacy, a sensitivity that may be made in a history of redundant linguistic-perceptual cues that reinforce attention to each other and in so-doing prime the relevant representations for animal-like features.

Figure 6.

Figure 6

An ambiguous object: the protrusions may be seen as wires or as limbs.

These cross-linguistic differences indicate that perceptual and linguistic cues interact in a single system that organizes attention in novel word generalization tasks. The findings also suggest that cued attention might be one mechanism behind Whorfian effects in which speakers of different languages are found to be particularly sensitive to and attentive to relations and properties that are lexicalized in their language (see Bowerman, 1996; Boroditsky, 2001; Gathercole & Min, 1997; Choi & Bowerman, 1991; Gentner & Broditsky, 2001; Levinson, 2003).

Competition and prediction

One attentional pathway

One early study by Smith, Jones and Landau (1992) using the novel-noun generalization task directly examined cue-competitions by pitting cues against each other. The study examined sentence frames associated with naming objects (This is a riff) and frames associated with labeling properties (This is a riff one) asking under what contexts the count noun frame would result in increased attention to shape and in what contexts the adjective frame would result in increased attention to color. Children know more nouns and know them earlier than they do adjectives so the noun frame might be expected to be stronger attentional cue than the adjective frame. Particularly relevant to possibly underlying mechanisms of cued-attention, Smith et al examined the effects of noun and adjective frames when the colors of the labeled things were dull (olive green), intrinsically salient (glittery silver-gold), or very salient (glittery silver-gold under a spot light).

The results provide strong support for a competition among cues that interacts with other stimulus driven (exogenous) pulls on attention. Given dull colors, children generalized novel words presented in both the noun and the adjective frames by shape. Given glittery colors without a spotlight, children generalized novel words in a noun frame by shape but in an adjective frame by color. Apparently the association between the noun frame and shape is strong enough to overcome competition from glitter but the pull from glitter helps the weaker adjective frame in guiding attention to the property. Finally, when given glittery colors under a spotlight, children often generalized the word –even in the count noun frame –by color. Children's performance strongly suggests a winner-take-all competition for attention in which learned cues and other pulls on attention directly interact.

These results also llustrate an important point about attention: It forces a decision and there is but one pathway and one decision. The single pathway means that attention must integrate information – from past learning, from task contexts, from immediate input – into a single attentional response. Because attention integrates multiple sources of information in the moment, it will also be flexible and adaptive to the idioscyncracies of specific tasks. The results also underscore the idea that words, whatever else they are, are also cues for attention and play a role in guiding in-the-moment attention.

Cue strength

Colunga, Smith and Gasser (2009) offered an analysis of how the structure of the Spanish and English count-mass system may yield different cue strengths and as consequence different patterns of cue interactions. The relevant difference between the two languages can be illustrated by thinking about how speakers can talk about a block of wood. An English-speaker talking about a wooden block might say “a block” if they are talking about its shape or “some wood” if they are talking about its substance. Further, an English speaker cannot say “some block” or “a wood” because “block” is a count noun and “wood ” is a mass noun. In contrast, a Spanish-speaker, talking about the wooden block, could say “un bloque” (a block) or “una madera” (a wood) when talking about the coherent bounded and shaped thing but would say “algo de madera” (some wood) when talking about the substance irrespective of its shape. That is, count-mass quantifiers are used more flexibly across nouns in Spanish than English to signal different task relevant construals of the entity (Iannucci, 1952). Note, English has some nouns that work like madera in Spanish: “a muffin” predicts the context relevancy of muffin shape, but “some muffin” predicts the context relevancy of muffin substance. But such nouns are not common in English whereas in Spanish, in principle, all nouns work this way, and in everyday speech many more nouns are used in both count and mass frames than in English (Gathercole 1997; Gathercole et al. 2000; Iannucci 1952). In brief, in Spanish, count-noun syntax is more predictive of attention to shape or material than the noun (madera) itself or the solidity of the object. In English, syntax, the noun, and solidity are (not perfectly but) more strongly correlated with each other, and equally predictive of whether the relevant dimension is shape or material.

In light of these differences, Colunga et al hypothesized that Spanish count-mass syntax should overshadow other cues –solidity, the noun – in children's novel noun generalizations. In the experiment, monolingual Spanish-speaking children and monolingual English speaking children were presented with a novel solid entity named with either mass or count nouns, e.g., “a toma” or “some toma” and then tested with the two syntactic frames. The syntactic frame had a stronger effect on the Spanish-speaking children's noun generalizations than on the English-speaking children's noun generalizations. In other words, Spanish-speaking children learn to attend to mass-count syntax as a way to disambiguate nouns that sometimes refer to shape and sometimes refer to material, and in the context of mass syntax they attend to material even when the items are solid. These results fit the idea that children learn predictive cues and that stronger cues inhibit weaker ones. They also emphasize how individual cues reside in a system of cues and that it is the system of interacting cues that determine attention.

Protecting past learning

Cue competition privileges old learning as stronger cues inhibit new ones and force attention elsewhere. Yoshida & Hanania (2007) provided an analysis of adjective learning that illustrates how this aspect of cued attention could play a positive role in word learning. The motivating question for their analysis was how, in the context of both a novel adjective and a known noun (e.g., in the context of “a stoof elephant”), attention could be properly shifted to a property (such as texture) rather than the shape of the thing. If the word “elephant” is a strong cue for attention to elephant shape, how does the child manage to learn novel adjectives? The answer offered by Yoshida and Hanania is much like the mutual-exclusivity account (Markman, 1989) but is in terms of a cued-attention mechanism rather than children's knowledge about how different kinds of words link to different kinds of meanings.

Their experiments were based on a prior study by Mintz and Gleitman (2002) that showed that the explicit mention of the noun (e.g., the stoof elephant) helps children learn the novel adjective relative to conditions in which the noun is not explicitly spoken (e.g., a stoof one). Yoshida and Hanania proposed that the role of the noun could be understood as a kind of attentional highlighting via competition. In brief, their cued-attention explanation of the explicit mention of the noun is this: Children usually experience the word elephant in the context of elephant-shaped things with typical elephant textures (e.g., ROUGH). In these novel adjective-learning experiments, the named object is an elephant-shaped elephant with a totally novel texture (e.g., with tiny holes punched throughout). Thus, in the context of stoof elephant, the known cue-outcome (elephant-elephant shape) shifts attention to the novel cue – novel outcome (stoof-holey texture). In the context of stoof one, there is no conjunctive cue containing both a known and a novel cue and thus less competition which in this case means less directed attention to the novel property. Yoshida and Hanania provided indirect support for this account by showing that the mere conjunction of words (in an unordered list format, e.g., elephant, stoof, red) rather than in a sentence format in which stoof modifies the known noun elephant) was sufficient for the novel-word to novel-property mapping and by showing that the attentional shift away from shape to the right property, depends on the number of competitive cues (stoof in the context red and elephant leads to stronger effects than stoof in the context of elephant alone).

Competitive processes are ubiquitous in the sensory and cognitive system (see Beck & Kastner, 2008). They are at the core of current theories of lexical access and online sentence processing (e.g., Bowers, Davis & Hanley, 2005; Davis & Lupker, 2006). It seems likely that they play an important role in lexical development as well (Hollich, Hirsh- Pasek, Tucker, & Golinkoff, 2000; Halberda, 2009; Horst, Scott & Pollard, in press) and, as suggested by Yoshida and Hanania (2007; under revision), competitive attentional processes may be particularly important in early development because by protecting already learned associations, they guide learning about novel cues and outcomes and in this way may leverage little-bits of learning into strong forces that effectively speed up learning.

The evidence on children's novel noun generalizations show that context –both linguistic and perceptual –cues children to form categories on the basis of different properties and to select one object over another as the referent of some word. The contextual cueing effects that characterize these early word generalizations share a number of (at least surface) similarities to cued attention – the role of coherent covariation, competition with other pulls on attention, attention shifting to protect past learning. However, none of these early word learning experiments unambiguously show that these are attentional effects. They cannot because the measures of performance are at a macro level – generalizating names to things. To show that mechanisms of attentional learning play a role in these phenomena, we need measures at finer levels of resolution.

Going Micro

Our main thesis is that cued-attention is a potentially powerful developmental mechanism and that early word learning –and particularly the phenomena associated with children's smart novel noun generalizations –would be a fertile domain in which to investigate this idea. However, the granularity of the phenomena studied under cued-attention –rapid attentional shifts, rapid detection, enhanced processing and discrimination – and those studied in most novel word learning experiments – generalization of an object name to a new instance – are not comparable. What is needed are finer-grained measures of component behaviors –orienting, disengagement, detection, and discrimination – in the context of words and correlated object properties. One might ask, for example, whether a count noun sentence frame shifts primes detection of an odd shape but not an odd texture in a visual search task or whether a count noun sentence frame enhances shape discriminations over texture discrimination.

Recent studies of early word learning using moment to moment tracking of eye gaze suggests the new insights to emerge from analyzing attention and word learning at a finer temporal scale. For example, recent studies using this methodology suggest that older (24 month old) and younger (18 month old) word learners may both map words to referents but that older learners more rapidly process the word and object and more rapidly shift attention to the right object (Fernald et al, 2008). Other studies manipulating the sound properties of the words have shown in the time course of looking two potential referents the time course of competition and its resolution depend on the similarity of cues (Swingley & Aslin, 2007). Another recent study showed that time course of looking to the named referent closely tracked the co-occurrence probabilities of words, referents and distracters (Vouloumanos & Werker, 2009; Fernald, Thorpe & Marchman, 2010; Yu & Smith, in press). These finer-grained methods might also be used to directly test the idea that learned clusters of coherently varying cues organize orienting to, detecting, and processing of predicted properties in ways nimbly guide attention to the right components of a scene for word learning.

We also need studies on cued-attention outside of the domain of early word learning, studies that investigate the processes and mechanisms in adult literature. At present there are very few studies (Kirkham, Slemmer & Johnson, 2002; Richards, 2005; Reid, Striano, Kaufman & Johnson, 2004: Sloutsky & Robinson, 2008; Wu & Kirkham, in press) and no active area of concentrated research effort. The studies that do exist suggest that cued attention will not work exactly the same way in children as in adults. At present, we know very little about attentional learning and cued-attention in young children. One distinction in the study of attentional processes that may be particularly relevant is that between exogenous and endogenous cueing (see Colombo, 2001; Smith & Chatterjee, 2008; Snyder & Munakata, 2008, 2010; McGoldberg, Maurer & Lewis, 2001). Exogenous attention refers to the quick capture of attention by salient stimuli, such as the flashing of lights. Endogenous attention is attention directed by associated cues. Many studies of endogenous versus exogenous cuing use the covert-orienting paradigm (Jonides, 1980; Posner & Raichle, 1994): a cue directs attention to the target information either exogenously (a flashing light) or endogenously (by a previous association) and either correctly or incorrectly. There is considerable development in the dynamics of both exogenous and endogenous cueing and in disengaging attention when miscued (see Columbo, 2001; Maurer et al, 2001. However, advances in endogenous cuing appear to lag behind exogenous cuing particularly in the over-riding of misleading cues and in switching attention given a new cue (Cepeda et al, 2001; Snyder & Munakata, 2010). By one proposal the major developmental changes lie in the processes that resolve cue competition (Snyder & Munakata 2008; 2010). Two recent studies of preschoolers in attention switching tasks suggest that experimentally training clusters of different cues for targets (Sloutsky & Fisher, 2008) or training more abstract cues (Snyder & Munkata, 2010) enhances switching, a result that may link cued-attention, word-learning, and executive control of attention.

In brief, internally directed attention that smartly moves to the right information for learning is likely to have its own compelling developmental story with successful attention dependent on one's history with a particular set of cues and overlapping mechanisms of activation, pre-activation, cue interactions, and competition. We need systematic and fine-grained studies of the development of cued attention from infancy through the early word-learning period and into childhood.

Knowledge as process

The systematicity with which young children learn the concepts they share with their elders, and they systematicity with which they generalize a newly learned name in different ways for different kinds, clearly demonstrates that they have knowledge. That is not in doubt; the contentious question is the nature of the knowledge. Macro-level accounts of this knowledge have been in terms of propositions and core principles about kinds of categories and kinds of word meanings, knowledge of the sort proposed in the 1990 special issue (see also Soja et al., 1992; Hall, 1996; Booth & Waxman, 2002; Deisendruck & Bloom, 2003; Dewar & Xu, 2009). These explanations are based on the traditional view of cognition as sharply divided into knowledge and process and competence and competence and performance. Thus, in terms of this traditional framework, the two levels of knowledge an process illustrated on the left of Figure 7, were separable and one could sensibly ask of children's performance in any word learning task whether their performance reflected knowledge about how words map to meaning (competence) or whether it reflected some performance variable. One example of this are the various debates as to whether the shape bias emerges because of conceptual knowledge about how nouns map to categories (e.g, Booth and Waxman, 2002; Cimpian & Markman, 2005) or whether it emerges because of attentional learning (Smith & Samuelson, 2006). This traditional explanation would seem to suggest that one could pit conceptual explanations against explanations in terms of (only) real time processing. This framing of the question – concepts versus attention --is problematic on its own terms. In any real time task, conceptual knowledge requires supporting processes as well – the perceptual, attention, memory, and decision processes. Whatever can be explained by the account on the right (the only account) can necessarily be explained by the account on the left: Whatever can be explained by A (process) can always also be explained by A+B (process plus propositions).

Figure 7.

Figure 7

An illustration of two views of cognition: On the left knowledge representations are connected to real time task performance as separable aspects of cognition. One the left, processes (which through their operating characteristics and history instantiate knowledge) generate performance in real time tasks. Within this framework, beliefs, concepts, and propositional knowledge are higher level theoretical descriptions of the knowledge embedded in these processes.

The framing of the contending explanations in terms of concepts versus processes such as attention is also more profoundly misguided. Modern-day understanding of neural processes makes clear that knowledge has no existence outside of process and that the relevant processes often encompass many different systems and time-scales (see Barsalou, 2009). What we call knowledge is an abstraction over many underlying processes. Although these higher-level abstractions may summarize meaningful regularities in the operating characteristics of the system as a whole, we also need to go underneath them –and understand the finer-grained processes from which they are made. “Word learning rules” and “concepts” may be to attentional processes as “hunger” is to hormones: not a separate mechanism but a port manteau abstraction that includes attentional learning (as well as other processes). Clearly, one goal of science is to unpack these carry-all abstractions. Cued-attention is a particularly intriguing potential mechanism from this perspective as it is one that melds competence and performance, knowledge and process, perception and conception. It does so because cued attention is a process, operating in real time, strongly influenced by the momentary input, with effects at the sensory and perceptual level but also driven by the rich data-structure of predictive relations between cues and outcomes in a lifetime of experiences. As such, attention is a process that aggregates, acquires and applies knowledge.

Acknowledgements

The research summarized in this paper was supported by NIMH grant R01MH60200, and NICHD grants R01HD 28675 to Linda Smith and NICHD 1R01HD058620-01 to Hanako Yoshida.

References

  1. Barsalou LW. Simulation, situated conceptualization, and prediction. Philosophical Transactions of Royal Society B. 2009;364:1281–1289. doi: 10.1098/rstb.2008.0319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Beck DM, Kastner S. Top-down and bottom-up mechanisms in biasing competition in the human brain. Vision Research. 49:1154–1165. doi: 10.1016/j.visres.2008.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bichot NP, Rossi AF. Parallel and serial neural mechanisms for visual search in Macaque Area V4. Science. 2005;308(5721):529–534. doi: 10.1126/science.1109676. [DOI] [PubMed] [Google Scholar]
  4. Bichot NP, et al. Visual feature selectivity in frontal eye fields induced by experience in mature macaques. Nature. 1996;381:697–699. doi: 10.1038/381697a0. [DOI] [PubMed] [Google Scholar]
  5. Biederman I. Perceiving real-world scenes. Science. 1972;177(4043):77–80. doi: 10.1126/science.177.4043.77. [DOI] [PubMed] [Google Scholar]
  6. Billman D, Heit E. Observational learning from internal feedback: A simulation of an adaptive learning method. Cognitive Science. 1989;12:587–625. [Google Scholar]
  7. Billman D, Knutson J. Unsupervised concept learning and value systematicity: A complex whole aids learning the parts. Journal of Experimental Psychology: Learning, Memory, and Cognition. 1996;22(2):458–475. doi: 10.1037/0278-7393.22.2.458. [DOI] [PubMed] [Google Scholar]
  8. Blaga OM, Colombo J. Visual processing and infant ocular latencies in the overlap paradigm. Developmental Psychology. 2006;42(6):1069–1076. doi: 10.1037/0012-1649.42.6.1069. [DOI] [PubMed] [Google Scholar]
  9. Booth AE, Waxman SR. Word learning is `smart': evidence that conceptual information affects preschoolers' extension of novel words. Cognition. 2002;84:B11–B22. doi: 10.1016/s0010-0277(02)00015-x. [DOI] [PubMed] [Google Scholar]
  10. Bowerman M. Cognitive versus linguistic determinants. In: Gumperz JJ, Levinson SC, editors. Rethinking Linguistic Relativity. Cambridge University Press; Cambridge: 1996. pp. 145–86. [Google Scholar]
  11. Bowers JS, Davis CJ, Hanley DA. Interfering neighbours: The impact of novel word learning on the identification of visually similar words. Cognition. 2005;97:B45–B54. doi: 10.1016/j.cognition.2005.02.002. [DOI] [PubMed] [Google Scholar]
  12. Boroditsky L. Does language shape thought? English and Mandarin speakers' conceptions of time. Cognitive Psychology. 2001;43:1–22. doi: 10.1006/cogp.2001.0748. [DOI] [PubMed] [Google Scholar]
  13. Boyce SJ, Pollatsek A, Rayner K. Effect of background information on object identification. Journal of Experimental Psychology: Human Perception and Performance. 1989;15(3):556–566. doi: 10.1037//0096-1523.15.3.556. [DOI] [PubMed] [Google Scholar]
  14. Boynton GM. A framework for describing the effects ofattention on visual responses. Vision Research. 2009;49:1129–1143. doi: 10.1016/j.visres.2008.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Brady TF, Chun MM. Spatial constraints on learning in visual search: Modeling contextual cuing. Journal of Experimental Psychology. 2007;33:798–815. doi: 10.1037/0096-1523.33.4.798. [DOI] [PubMed] [Google Scholar]
  16. Campbell SB, von Stauffenberg C. Delay and inhibition as early predictors of ADHD symptoms in third grade. Journal of Abnormal Child Psychology: An Official Publication of the International Society for Research in Child and Adolescent Psychopathology. 2009;37(1):1–15. doi: 10.1007/s10802-008-9270-4. [DOI] [PubMed] [Google Scholar]
  17. Carey S, Bartlett E. Acquiring a single new word. Papers and Reports on Child Language Development. 1978;15:17–29. [Google Scholar]
  18. Cepeda NJ, Kramer AF, de Sather J. Changes in executive control across the life span: Examination of task switching performance. Developmental Psychology. 2001;37:715–730. [PubMed] [Google Scholar]
  19. Chapman GB, Robbins SJ. Cue interaction in human contingency judgment. Memory & Cognition. 1990;18(5):537–545. doi: 10.3758/bf03198486. [DOI] [PubMed] [Google Scholar]
  20. Chatham CH, Frank MJ, Munakata Y. Pupillometric and behavioral markers of a developmental shift in the temporal dynamics of cognitive control. PNAS Proceedings of the National Academy of Sciences of the United States of America. 2009;106(14):5529–5533. doi: 10.1073/pnas.0810002106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Chelazzi L, et al. Responses of neurons in inferior temporal cortex during memory-guided visual search. Journal of Neurophysiology. 1998;80:2918–2940. doi: 10.1152/jn.1998.80.6.2918. [DOI] [PubMed] [Google Scholar]
  22. Chelazzi L, et al. A neural basis for visual search in inferior temporal cortex. Nature. 1993;363:345–347. doi: 10.1038/363345a0. [DOI] [PubMed] [Google Scholar]
  23. Chiang W, Wynn K. Infants' tracking of objects and collections. Cognition. 2000;77:169–195. doi: 10.1016/s0010-0277(00)00091-3. [DOI] [PubMed] [Google Scholar]
  24. Choi S, Bowerman M. Learning to express motion events in English and Korean: the influence of language-specific lexicalization patterns. Cognition. 1991;41:83–121. doi: 10.1016/0010-0277(91)90033-z. [DOI] [PubMed] [Google Scholar]
  25. Chun MM, Jiang Y. Contextual cueing: Implicit learning and memory of visual context guides spatial attention. Cognitive Psychology. 1998;36(1):28–71. doi: 10.1006/cogp.1998.0681. [DOI] [PubMed] [Google Scholar]
  26. Cimpian A, Markman E. The absence of a shape bias in children's word learning. Developmental Psychology. 2005;41:1003–1019. doi: 10.1037/0012-1649.41.6.1003. [DOI] [PubMed] [Google Scholar]
  27. Cleveland A, Schug M, Striano T. Joint attention and object learning in 5- and 7-month old infants. Infant and Child Development. 2007;16(3):295–306. [Google Scholar]
  28. Clohessy AB, Posner MI, Rothbart MK. Development of the functional visual field. Acta Psychologica. 2001;106:51–68. doi: 10.1016/s0001-6918(00)00026-3. [DOI] [PubMed] [Google Scholar]
  29. Colombo J. The development of visual attention in infancy. In: Fiske ST, Schacter DL, Zahn-Waxler C, editors. Annual review of psychology. Annual Reviews; Palo Alto, CA: 2001. pp. 337–367. [DOI] [PubMed] [Google Scholar]
  30. Colunga E, Smith LB. Dumb mechanisms make smart concepts. Proceedings of the Annual Conference of the Cognitive Science Society. 2004;26:239–244. [Google Scholar]
  31. Colunga E. The effect of priming on preschooler's extensions of novel words: How far can “dumb” processes go?. 30th Annual Boston University Conference on Language Development.2006. pp. 96–106. [Google Scholar]
  32. Colunga E, Smith LB. From the lexicon to expectations about kinds: A role for associative learning. Psychological Review. 2005;112(2) doi: 10.1037/0033-295X.112.2.347. [DOI] [PubMed] [Google Scholar]
  33. Colunga E, Smith LB. Flexibility and variability: Essential to human cognition and the study of human cognition. New Ideas in Psychology. 2008;26:174–192. [Google Scholar]
  34. Colunga E, Smith LB, Gasser M. Correlation versus prediction in children's word learning: Cross-linguistic evidence and simulations. Language and Cognition. 2009;1(2):197–217. doi: 10.1515/LANGCOG.2009.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Cusack R, Mitchell DJ, Duncan J. Discrete object representation, attention switching, and task difficulty in the parietal lobe. Journal of Cognitive Neuroscience. 2010;22(1):32–47. doi: 10.1162/jocn.2009.21194. [DOI] [PubMed] [Google Scholar]
  36. Davis CJ, Lupker SJ. Masked inhibitory priming in English: Evidence for lexical inhibition. Journal of Experimental Psychology: Human Perception and Performance. 2006;32(3):668–687. doi: 10.1037/0096-1523.32.3.668. [DOI] [PubMed] [Google Scholar]
  37. de Zubicaray GI, McMahon KL. Auditory context effects in picture naming investigated with event-related fMRI. Cognitive, Affective & Behavioral Neuroscience. 2009;9(3):260–269. doi: 10.3758/CABN.9.3.260. [DOI] [PubMed] [Google Scholar]
  38. Desimone R, Duncan J. Neural mechanisms of selective visual attention. Annual Reviews of Neuroscience. 1995;18:193–222. doi: 10.1146/annurev.ne.18.030195.001205. [DOI] [PubMed] [Google Scholar]
  39. Dewar K, Xu F. Do early nouns refer to kinds or distinct shapes? Evidence from 10-month-old infants. Psychological Science. doi: 10.1111/j.1467-9280.2009.02278.x. in press. [DOI] [PubMed] [Google Scholar]
  40. Diamond A. The early development of executive functions. In: Bialystok E, Craik FIM, editors. Lifespan cognition: Mechanisms of change. Oxford University Press; New York, NY, US: 2006. pp. 70–95. [Google Scholar]
  41. Diesendruck G, Bloom P. How specific is the shape bias? Child Development. 2003;74:168–178. doi: 10.1111/1467-8624.00528. [DOI] [PubMed] [Google Scholar]
  42. Duncan J. Cooperating brain systems in selective perception and action. In: Inui T, McClelland JL, editors. Attention and performance XVI. The MIT Press; Cambridge: 1996. pp. 549–578. [Google Scholar]
  43. Fernald A, Zangl R, Portillo AL, Marchman VA. Looking while listening: Using eye movements to monitor spoken language comprehension by infants and young children. In: Sekerina I, Fernández E, Clahsen H, editors. Language processing in children. Benjamins; Amsterdam: 2008. [Google Scholar]
  44. Fernald A, Thorpe K, Marchman V. Blue car, red car: Developing efficiency in online interpretation of adjective-noun phrases. Cognitive Psychology. 2010;60:190–217. doi: 10.1016/j.cogpsych.2009.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Franco F, Gagliano A. Toddlers' pointing when joint attention is obstructed. First Language. 2001;21(63):289–321. [Google Scholar]
  46. Freeth M, Ropar D, Chapman P, Mitchell P. The eye gaze direction of an observed person can bias perception, memory, and attention in adolescents with and without autism spectrum disorder. Journal of Experimental Child Psychology. 2010;105(1–2):20–37. doi: 10.1016/j.jecp.2009.10.001. [DOI] [PubMed] [Google Scholar]
  47. Gathercole V, Cramer L, Somerville S, Jansen op de Haar M. Ontological categories and function: Acquisition of new names. Cognitive Development. 1995;10:225–251. [Google Scholar]
  48. Gathercole V, Thomas EM, Evans D. What's in a noun? Welsh-, English-, and Spanish-speaking children see it differently. First Language. 2000;20:55–90. [Google Scholar]
  49. Gathercole V, Min H. Word meaning biases or language-specific effects? Evidence from English, Spanish and Korean. First Language. 1997;17(49 Pt 1):31–56. [Google Scholar]
  50. Gelman SA, Coley JD. Language and categorization: The acquisition of natural kind terms. In: Gelman SA, Byrnes JP, editors. Perspectives on language and thought: Interrelations in development. Cambridge University Press; New York, NY: 1991. [Google Scholar]
  51. Gelman R. First principles organize attention to and learning about relevant data: Number and animate-inanimate distinction as examples. Cognitive Science. 1990;14:79–106. [Google Scholar]
  52. Gentner D, Boroditsky L. Individuation, relational relativity and early word learning. In: Bowerman M, Levinson S, editors. Language acquisition and conceptual development. Cambridge University Press; Cambridge, England: 2001. [Google Scholar]
  53. Gilbert C, Ito M, Kapadia M, Westheimer G. Interactions between attention, context and learning in primary visual cortex. Vision Research. 2000;40:1217–1226. doi: 10.1016/s0042-6989(99)00234-5. [DOI] [PubMed] [Google Scholar]
  54. Gold JI, Shadlen MN. The neural basis of decision making. Annual Review Neuroscience. 2007;30:535–574. doi: 10.1146/annurev.neuro.29.051605.113038. [DOI] [PubMed] [Google Scholar]
  55. Goldstone RL. Perceptual learning. Annual Review of Psychology. 1998;49:585–612. doi: 10.1146/annurev.psych.49.1.585. [DOI] [PubMed] [Google Scholar]
  56. Goldberg M, Maurer D, Lewis T. Developmental changes in attention: the effects of endogenous cueing and of distractors. Developmental Science. 2001;4:209–219. [Google Scholar]
  57. Golinkoff RM, Mervis CB, Hirsh-Pasek K. Early object labels: The case for a developmental lexical principles framework. Journal of Child Language. 1994;21(1):125–155. doi: 10.1017/s0305000900008692. [DOI] [PubMed] [Google Scholar]
  58. Grossberg S. Processing of expected and unexpected events during conditioning and attention: A psychophysiological theory. Psychological Review. 1982;89(5):529–572. [PubMed] [Google Scholar]
  59. Grossmann T, Farroni T. Decoding social signals in the infant brain: A look at eye gaze perception. In: de Haan M, Gunnar MR, editors. Handbook of developmental social neuroscience. Guilford Press; New York, NY, US: 2009. pp. 87–106. [Google Scholar]
  60. Hall DG. Naming solids and non-solids: children's default construals. Cognitive Development. 1996;11:229–264. [Google Scholar]
  61. Hanania R, Smith LB. Selective attention and attention switching: towards a unified developmental approach. Developmental Science. doi: 10.1111/j.1467-7687.2009.00921.x. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Hirotani M, Stets M, Striano T, Friederici AD. Joint attention helps infants learn new words: Event-related potential evidence. NeuroReport: For Rapid Communication of Neuroscience Research. 2009;20(6):600–605. doi: 10.1097/WNR.0b013e32832a0a7c. [DOI] [PubMed] [Google Scholar]
  63. Horst J, Scott E, Pollard J. The role of competition in word learning via referent selection. Developmental Science. doi: 10.1111/j.1467-7687.2009.00926.x. in press. [DOI] [PubMed] [Google Scholar]
  64. Imai M, Gentner D. A cross-linguistic study of early word meaning: Universal ontology and linguistic influence. Cognition. 1997;62:169–200. doi: 10.1016/s0010-0277(96)00784-6. [DOI] [PubMed] [Google Scholar]
  65. Imai M, Gentner D. A cross-linguistic study of early word meaning: Universal ontology and linguistic influence. Cognition. 1997;62:169–200. doi: 10.1016/s0010-0277(96)00784-6. [DOI] [PubMed] [Google Scholar]
  66. Imai M, Mazuka R. Revisiting language universals and linguistic relativity: Language-relative construal of individuation constrained by universal ontology. Cognitive Science. doi: 10.1080/15326900701326436. in press. [DOI] [PubMed] [Google Scholar]
  67. Jiang Y, Chun MM. Contextual cueing: Reciprocal influences between attention and implicit learning. In: Jiménez L, editor. Attention and implicit learning. John Benjamins Publishing Company; Amsterdam, Netherlands: 2003. pp. 277–296. [Google Scholar]
  68. Jiang Y, Chun MM. Asymmetric object substitution masking. Journal of Experimental Psychology: Human Perception and Performance. 2001;27(4):895–918. [PubMed] [Google Scholar]
  69. Jiang Y, Leung AW. Implicit learning of ignored visual context. Psychonomic Bulletin & Review. 2005;12(1):100–106. doi: 10.3758/bf03196353. [DOI] [PubMed] [Google Scholar]
  70. Jones SS, Smith LB, Landau KB. Object properties and knowledge in early lexical learning. Child Development. 1991;62:499–516. [PubMed] [Google Scholar]
  71. Jiang Y, Olson IR, Chun MM. Organization of visual short-term memory. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2000;26(3):683–702. doi: 10.1037//0278-7393.26.3.683. [DOI] [PubMed] [Google Scholar]
  72. Jones SS, Smith LB. How children know the relevant properties for generalizing object names. Developmental Science. 2002;5:219–232. [Google Scholar]
  73. Jones SS, Smith LB. How children name objects with shoes. Cognitive Development. 1998;13:323–334. [Google Scholar]
  74. Jones SS, Smith LB, Landau B. Object properties and knowledge in early lexical learning. Child Development. 1991;62:499–516. [PubMed] [Google Scholar]
  75. Jonides J. Towards a model of the mind's eye's movement. Canadian Journal of Psychology. 1980;34:103–112. doi: 10.1037/h0081031. [DOI] [PubMed] [Google Scholar]
  76. Kamin LJ. `Attention-like' processes in classical conditioning. In: Jones MR, editor. Miami symposium on the prediction of behavior: Aversive stimulation. University of Miami Press; Coral Gagbles, FL: 1968. pp. 9–33. [Google Scholar]
  77. Kamin LJ. Predictability, surprise, attention, and conditioning. In: Campbell BA, Church RM, editors. Punishment. Appleton-Century-Crofts; New York: 1969. pp. 279–296. [Google Scholar]
  78. Kastner S, et al. Increased activity in human visual cortex during directed attention in the absence of visual stimulation. Neuron. 1999;22:751–761. doi: 10.1016/s0896-6273(00)80734-5. [DOI] [PubMed] [Google Scholar]
  79. Katz N, Baker E, Macnamara J. What's in a name? A study of how children learn common and proper names. Child Development. 1974;45(2):469–473. [Google Scholar]
  80. Keil F. The birth and nurturance of concepts by domains: The origins of concepts of living things. In: Hirschfeld L, Gelman S, editors. Mapping the mind: Domain specificity in cognition and culture. Cambridge University Press; New York, NY, US: 1994. pp. 234–254. [Google Scholar]
  81. Kirkham NZ, Slemmer JA, Johnson SP. Visual statistical learning in infancy: evidence for a domain general learning mechanism. Cognition. 2002;83:B35–B42. doi: 10.1016/s0010-0277(02)00004-5. [DOI] [PubMed] [Google Scholar]
  82. Kobayashi H. How 2-year-old children learn novel part names of unfamiliar objects. Cognition. 1998;68:B41–B51. doi: 10.1016/s0010-0277(98)00044-4. [DOI] [PubMed] [Google Scholar]
  83. Kruschke JK. ALCOVE: An exemplar-based connectionist model of category learning. Psychological Review. 1992;99:22–44. doi: 10.1037/0033-295x.99.1.22. [DOI] [PubMed] [Google Scholar]
  84. Kruschke JK. Base rates in category learning. Journal of Experimental Psychology: Learning, Memory and Cognition. 1996;22:3–26. doi: 10.1037//0278-7393.22.1.3. [DOI] [PubMed] [Google Scholar]
  85. Kruschke JK. Toward a unified model of attention in associative learning. Journal of Mathematical Psychology. 2001;45(6):812–863. [Google Scholar]
  86. Kruschke JK. Learning involves attention. In: Houghton G, editor. Connectionist models in cognitive psychology. Psychology Press; Hove, East Sussex, UK: 2005. pp. 113–140. Ch. 4. [Google Scholar]
  87. Kruschke JK, Blair NJ. Blocking and backward blocking involve learned inattention. Psycholonomic Bulletin & Review. 2000;7(4):636–645. doi: 10.3758/bf03213001. [DOI] [PubMed] [Google Scholar]
  88. Kruschke JK, Kappenman ES, Hetrick WP. Eye gaze and individual differences consistent with learned attention in associative blocking and highlighting. Journal of Experimental Psychology: Learning, Memory and Cognition. 2005;31:830–845. doi: 10.1037/0278-7393.31.5.830. [DOI] [PubMed] [Google Scholar]
  89. Landau B, Smith LB, Jones SS. The importance of shape in early lexical learning. Cognitive Development. 1988;3:299–321. [Google Scholar]
  90. Landau KB, Smith LB, Jones S. Object perception and object naming in early development. Trends in Cognitive Science. 1998;2:19–24. doi: 10.1016/s1364-6613(97)01111-x. [DOI] [PubMed] [Google Scholar]
  91. Levinson SC. Space in Language and Cognition: Explorations in Cognitive Diversity. Cambridge University Press; 2003. [Google Scholar]
  92. Lewicki P, Hill T, Czyzewska M. Nonconscious acquisition of knowledge. American Psycholgoist. 1992;47(6):796–801. doi: 10.1037//0003-066x.47.6.796. [DOI] [PubMed] [Google Scholar]
  93. Lewicki P, Hill T, Czyzewska M. Hidden covariation detection: A fundamental and ubiquitous phenomenon. Journal of Experimental Psychology: Learning, Memory, and Cognition. 1997;23(1):221–228. doi: 10.1037//0278-7393.23.1.221. [DOI] [PubMed] [Google Scholar]
  94. Liechty J, Pieters R, Wedel M. Global and local covert visual attention: Evidence from a Bayesian hidden Markov model. Psychometrika. 2003;68(4):519–542. [Google Scholar]
  95. Ludwig CJH, Gilchrist ID, McSorley E. The remote distractor effect in saccade programming: Channel interactions and lateral inhibition. Vision Research. 2005;45(9):1177–1190. doi: 10.1016/j.visres.2004.10.019. [DOI] [PubMed] [Google Scholar]
  96. Lubow RE, Moore AU. Latent inhibition: The effect of nonreinforced preexposure to the conditioned stimulus. Journal of Comparative and Physiological Psychology. 1959;52:415–419. doi: 10.1037/h0046700. [DOI] [PubMed] [Google Scholar]
  97. Markman EM. MIT Series in learning, development, and conceptual change. MA MIT Press; Cambridge: 1989. Categorization and naming in children: Problems of induction. [Google Scholar]
  98. Markman EM. Constraints children place on word meaning. Cognitive Science. 1990;14:57–77. [Google Scholar]
  99. Mackintosh NJ. A Theory of attention: Variations in the associability of stimuli with reinforcement. Psychological Review. 1975;82:276–298. [Google Scholar]
  100. Mackintosh NJ. Overshadowing and stimulus intensity. Animal Learning & Behavior. 1976;4:186–192. doi: 10.3758/bf03214033. [DOI] [PubMed] [Google Scholar]
  101. Mackintosh NJ, Turner C. Blocking as a function of novelty of CS and predictability of UCS. Quarterly Journal of Experimental Psychology. 1971;23:359–366. doi: 10.1080/14640747108400245. [DOI] [PubMed] [Google Scholar]
  102. MacIntosh NJ. Selective attention in animal discrimination learning. Psychological Bulletin. 1965;64:125–150. doi: 10.1037/h0022347. [DOI] [PubMed] [Google Scholar]
  103. Maunsell JHR, Treue S. Feature-based attention in visual cortex. Trends in Neuroscience. 2006;29(6):317–322. doi: 10.1016/j.tins.2006.04.001. [DOI] [PubMed] [Google Scholar]
  104. Mayor J, Plunkett K. A neurocomputational account of taxonomic responding and fast mapping in early word learning. Psychological Review. 2010;117(1):1–31. doi: 10.1037/a0018130. [DOI] [PubMed] [Google Scholar]
  105. McClelland JL, Rogers TT. The parallel distributed processing approach to semantic cognition. Nature Review Neuroscience. 2003;4(4):310–322. doi: 10.1038/nrn1076. [DOI] [PubMed] [Google Scholar]
  106. McHugh L, Reed P. Age trends in stimulus overselectivity. Journal of the Experimental Analysis of Behavior. 2007;88(3):369–380. doi: 10.1901/jeab.2007.88-369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. McPherson L. A little goes a long way: Evidence for a perceptual basis of learning for the noun categories COUNT and MASS. Journal of Child Language. 1991;18:315–338. doi: 10.1017/s0305000900011089. [DOI] [PubMed] [Google Scholar]
  108. Medin DL, Schaffer MM. Context theory of classification learning. Psychological Review. 1978;85(3):207–238. [Google Scholar]
  109. Medin DL, Altom MW, Edelson SM, Freko D. Correlated symptoms and simulated medical classification. Journal of Experimental Psychology: Learning, Memory, & Cognitive. 1982;8(1):37–50. doi: 10.1037//0278-7393.8.1.37. [DOI] [PubMed] [Google Scholar]
  110. McVay JC, Kane MJ. Conducting the train of thought: Working memory capacity, goal neglect, and mind wandering in an executive-control task. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2009;35(1):196–204. doi: 10.1037/a0014104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Miller JL, Ables EM, King AP, West MJ. Different patterns of contingent stimulation differentially affect attention span in prelinguistic infants. Infant Behavior & Development. 2009;32(3):254–261. doi: 10.1016/j.infbeh.2009.02.003. [DOI] [PubMed] [Google Scholar]
  112. Mintz TH, Gleitman LR. Adjectives really do modify nouns: The incremental and restricted nature of early adjective acquisition. Cognition. 2002;84(3):267–293. doi: 10.1016/s0010-0277(02)00047-1. [DOI] [PubMed] [Google Scholar]
  113. Nosofsky RM. Attention, similarity, and the identification-categorization relationship. Journal of Experimental Psychology. 1986;115:39–57. doi: 10.1037//0096-3445.115.1.39. [DOI] [PubMed] [Google Scholar]
  114. O'Reilly RC. Generalization in interactive networks: The benefits of inhibitory competition and Hebbian learning. Neural Computation. 2001;13:1199–1242. doi: 10.1162/08997660152002834. [DOI] [PubMed] [Google Scholar]
  115. Olson IR, Chun MM. Perceptual constraints on implicit learning of spatial context. Visual Cognition. 2002;9(3):273–302. [Google Scholar]
  116. Pessoa L, Kastner S, Ungerleider LG. Neuroimaging studies of attention: From modulation of sensory processing to top-down control. The Journal of Neuroscience. 2003;23(10):3990–3998. doi: 10.1523/JNEUROSCI.23-10-03990.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Pfeiffer R, Scheier C. Understanding intelligence. MIT Press; Cambridge, MA: 1999. [Google Scholar]
  118. Poole BJ, Kane MJ. Working-memory capacity predicts the executive control of visual search among distractors: The influence of sustained and selective attention. The Quarterly Journal of Experimental Psychology. 2009;62(7):1430–1454. doi: 10.1080/17470210802479329. [DOI] [PubMed] [Google Scholar]
  119. Posner M, Raichle M. Images of Mind. W. H. Freeman and Company; New York: 1994. [Google Scholar]
  120. Posner MI, Rothbart MK, Thomas-Thrapp L, Gerardi G. The development of orienting to locations and objects. In: Wright RD, editor. Visual attention. Oxford University Press; New York, NY, US: 1998. pp. 269–288. [Google Scholar]
  121. Posner M, Rothbart MK. Research on attention as a model for the integration of psychological science. Annual Review of Psychology. 2007;58:1–23. doi: 10.1146/annurev.psych.58.110405.085516. [DOI] [PubMed] [Google Scholar]
  122. Ramscar M, Yarlett D, Dye M, Denny K, Thorpe K. Feature-Label-Order Effects And Their Implications For Symbolic Learning. Cognitive Science. doi: 10.1111/j.1551-6709.2009.01092.x. in press. [DOI] [PubMed] [Google Scholar]
  123. Regier T. The emergence of words: Attentional learning in form and meaning. Cognitive Science: A Multidisciplinary Journal. 2005;29(6):819–865. doi: 10.1207/s15516709cog0000_31. [DOI] [PubMed] [Google Scholar]
  124. Reid VM, Striano T, Kaufman J, Johnson MH. Eye gaze cueing facilitates neural processing of objects in 4-month-old infants. NeuroReport: For Rapid Communication of Neuroscience Research. 2004;15(16):2553–255. doi: 10.1097/00001756-200411150-00025. [DOI] [PubMed] [Google Scholar]
  125. Rescorla RA, Wagner AR. In: A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement, Classical Conditioning II. Black AH, Prokasy WF, editors. Appleton-Century-Crofts; 1972. pp. 64–99. [Google Scholar]
  126. Reynolds JH, Heeger DJ. The normalization model of attention. Neuron. 2009;61:168–185. doi: 10.1016/j.neuron.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  127. Richards JE. Localizing cortical sources of event-related potentials in infants' covert orienting. Developmental Science. 2005;8(3):255–278. doi: 10.1111/j.1467-7687.2005.00414.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  128. Richards JE. Attention in young infants: A developmental psychophysiological perspective. In: Nelson CA, Luciana M, editors. Handbook of developmental cognitive neuroscience. 2nd ed. MIT Press; Cambridge, MA, US: 2008. pp. 479–497. [Google Scholar]
  129. Rogers TT, McClelland JL. Semantic Cognition: A Parallel Distributed Processing Approach. MIT Press; Cambridge, MA: 2004. [DOI] [PubMed] [Google Scholar]
  130. Rossi AF, Paradiso MA. Feature-specific effects of selective visual attention. Vision Res. 1995;35(5):621–634. doi: 10.1016/0042-6989(94)00156-g. [DOI] [PubMed] [Google Scholar]
  131. Samuelson LK, Smith LB. Early noun vocabularies: Do ontology, category structure, and syntax correspond? Cognition. 1999;73(1):1–33. doi: 10.1016/s0010-0277(99)00034-7. [DOI] [PubMed] [Google Scholar]
  132. Samuelson LK, Smith LB. Children's attention to rigid and deformable shape in naming and no naming tasks. Child Development. 2000;71(6):1555–1570. doi: 10.1111/1467-8624.00248. [DOI] [PubMed] [Google Scholar]
  133. Samuelson LK, Schutte AR, Horst JS. The dynamic nature of knowledge: Insights from a dynamic field model of children's novel noun generalizations. Cognition. doi: 10.1016/j.cognition.2008.10.017. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  134. Samuelson LK, Horst JS. Confronting complexity: Insights from the details of behavior over multiple timescales. Developmental Science. 2008;11:209–215. doi: 10.1111/j.1467-7687.2007.00667.x. [DOI] [PubMed] [Google Scholar]
  135. Samuelson LK, Horst JS. Dynamic noun generalization: Moment-to-momentinteractions shape children's naming biases. Infancy. 2007;11:97–110. [Google Scholar]
  136. Samuelson LK, Smith LB. Grounding development in cognitive processes. Child Development. 2000a;71:98–106. doi: 10.1111/1467-8624.00123. [DOI] [PubMed] [Google Scholar]
  137. Samuelson LK, Horst JS, Schutte AR, Dobbertin B. Rigid thinking about deformables: Do children sometimes overgeneralize the shape bias? Journal of Child Language. 2008;35:559–589. doi: 10.1017/S0305000908008672. [DOI] [PMC free article] [PubMed] [Google Scholar]
  138. Samuelson L. Statistical regularities in vocabulary guide language acquisition in connectionist models and 15–20-month-olds. Developmental Psychology. 2002;38:1016–1037. doi: 10.1037//0012-1649.38.6.1016. [DOI] [PubMed] [Google Scholar]
  139. Schneider W, Shiffrin RM. Controlled and automatic human information processing: I. Detection, search, and attention. Psychological Review. 1977;84(1):1–66. [Google Scholar]
  140. Shallice T, Stuss DT, Alexander MP, Picton TW, Derkzen D. The multiple dimensions of sustained attention. Cortex. 2008;44(7):794–805. doi: 10.1016/j.cortex.2007.04.002. [DOI] [PubMed] [Google Scholar]
  141. Shanks DR. Forward and backward blocking in human contigency judgement. Quarterly Journal of Experimental Psychology. 1985;37B:1–21. [Google Scholar]
  142. Shanks DR, Channon S, Wilkinson L, Curran HV. Disruption of sequential priming in organic and pharmacalogical amnesia: A role for the medial temporal lobes in implicit contextual learning. Neuropsychopharmacology. 2006;31(8):1768–1776. doi: 10.1038/sj.npp.1300935. [DOI] [PubMed] [Google Scholar]
  143. Shiffrin RM, Schneider W. Controlled and automatic human information processing: II. Perceptual learning, auotmatic attending, and a general theory. Psychological Review. 1977;84(2):127–190. [Google Scholar]
  144. Sloutsky VM, Fisher AV. Attentional learning and flexible induction: How mundane mechanisms give rise to smart behaviors. Child Development. 2008;79:639–651. doi: 10.1111/j.1467-8624.2008.01148.x. [DOI] [PubMed] [Google Scholar]
  145. Sloutsky VM, Robinson CW. The role of words and sounds in infants' visual processing: From overshadowing to attentional tuning. Cognitive Science: A Multidisciplinary Journal. 2008;32(2):342–365. doi: 10.1080/03640210701863495. [DOI] [PubMed] [Google Scholar]
  146. Smith SE, Chatterjee A. Visuospatial attention in children. Neurological Review. 2008;65:1284–1288. doi: 10.1001/archneur.65.10.1284. [DOI] [PubMed] [Google Scholar]
  147. Smith LB. Self-organizing processes in learning to learn words: Development is not induction. In: Nelson C, editor. Basic and applied perspectives on learning, cognition, and development. 1. Vol. 28. Lawrence Erlbaum Associates; Marwah, NJ: 1995. 32 pp. [Google Scholar]
  148. Smith LB, Jones S, Landau B, Gershkoff-Stowe L, Samuelson L. Object name learning provides on-the-job training for attention. Psychological Science. 2002;13(1):13–19. doi: 10.1111/1467-9280.00403. [DOI] [PubMed] [Google Scholar]
  149. Smith LB, Colunga E, Yoshida H. Making an ontology: Cross-linguistic evidence. In: Rakison D, Oakes L, editors. Early category and concept development Making sense of the blooming, buzzing confusion. Oxford University Press; London: 2003. pp. 275–302. [Google Scholar]
  150. Smith LB. The Minnesota Symposia on Child Psychology, Volume 28, Basic and144 Applied Perspectives On Learning, Cognition, and Development. Lawrence Erlbaum Associates; Mahwah, NJ: 1995. Self-organizing processes in learning to learn words: Development is not induction; pp. 1–32. [Google Scholar]
  151. Smith LB. Learning how to learn words: An associative crane. In: Golinkoff RM, et al., editors. Becoming a word learner: A debate on lexical acquisition. Counterpoints: Cognition, memory, and language. Oxford University Press; London: 2000. [Google Scholar]
  152. Smith LB, Colunga E, Yoshida H. Making ontology: Cross-linguistic evidence. In: Oakes L, Rakison D, editors. Category development. 2003. [Google Scholar]
  153. Smith LB, Jones SS, Landau B. Count nouns, adjectives, andperceptual properties in children's novel word interpretation. DevelopmentalPsychology. 1992;28:273–86. [Google Scholar]
  154. Smith LB, Jones SS, Landau B, Gershkoff-Stowe L, Samuelson L. Object name learning provides on-the-job training for attention. Psychological Science. 2002;13:13–19. doi: 10.1111/1467-9280.00403. [DOI] [PubMed] [Google Scholar]
  155. Smith LB, Gasser M. The development of embodied cognition: six lessons from babies. Artificial. Life. 2005;11:13–39. doi: 10.1162/1064546053278973. [DOI] [PubMed] [Google Scholar]
  156. Smith LB, Samuelson L. An Attentional Learning Account of the Shape Bias: Reply to Cimpian and Markman (2005) and Booth, Waxman, and Huang (2005) Developmental Psychology. 2006;42:1339–1343. doi: 10.1037/0012-1649.42.6.1339. [DOI] [PubMed] [Google Scholar]
  157. Smolensky P. On the proper treatment of connectionism. The Behavioral and Brain Sciences, Il. 1988 [Google Scholar]
  158. Snyder HR, Munakata Y. Becoming self-directed: Abstract representations support endogenous flexibility in children. Cognition. 2010 doi: 10.1016/j.cognition.2010.04.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  159. Snyder HR, Munakata Y. So many options, so little time: The roles of association and competition in underdetermined responding. Psychological Bulletin & Review. 2008;15:1083–1088. doi: 10.3758/PBR.15.6.1083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  160. Soja NN. Inferences about the meaning of nouns; the relationship between perception and syntax. Cognitive Development. 1992;7:29–45. [Google Scholar]
  161. Soja NN. Evidence for a distinct kind of noun. Cognition. 1994;51(3):267–284. doi: 10.1016/0010-0277(94)90056-6. [DOI] [PubMed] [Google Scholar]
  162. Soja NN, Carey S, Spelke ES. Ontological categories guide young children's inductions of word meanings: Object terms and substance terms. Cognition. 1991;38(2):179–211. doi: 10.1016/0010-0277(91)90051-5. [DOI] [PubMed] [Google Scholar]
  163. Spitzer H, et al. Increased attention enhances both behavioral and neuronal performance. Science. 1988;240:338–340ß. doi: 10.1126/science.3353728. [DOI] [PubMed] [Google Scholar]
  164. Swingley D, Aslin RN. Lexical competition in young children's word learning. Cognitive Psychology. 2007;54:99–132. doi: 10.1016/j.cogpsych.2006.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  165. Treue S, Martinez Trujillo JC. Feature-based attention influences motion processing gain in macaque visual cortex. Nature. 1999;399:575–579. doi: 10.1038/21176. [DOI] [PubMed] [Google Scholar]
  166. Vouloumanos A, Werker JF. Infants' learning of novel words in a stochastic environment. Developmental Psychology. 2009;45:1611–1617. doi: 10.1037/a0016134. [DOI] [PubMed] [Google Scholar]
  167. Walley RE, Weiden TD. Lateral inhibition and cognitive masking: A neuropsychological theory of attention. Psychological Review. 1973;80(4):284–302. doi: 10.1037/h0035007. [DOI] [PubMed] [Google Scholar]
  168. Ward TB, Becker AH, Hass SD, Vela E. Attribute availability and the shape bias in children's category generalization. Cognitive Development. 1991;6(2):143–167. [Google Scholar]
  169. Waxman SR, Markow DB. Words as invitations to form categories. Cognitive Psychology. 1995;29(3):257–302. doi: 10.1006/cogp.1995.1016. [DOI] [PubMed] [Google Scholar]
  170. Williford T, Maunsell JH. Effects of spatial attention on contrast response functions in macaque area V4. J Neurophysiol. 2006;96:40–54. doi: 10.1152/jn.01207.2005. [DOI] [PubMed] [Google Scholar]
  171. Wu R, Kirkham N. No two cues are alike: Depth of learning during infancy is dependent on what orients attention. Journal of Experimental Child Psychology. doi: 10.1016/j.jecp.2010.04.014. in press. [DOI] [PubMed] [Google Scholar]
  172. Yamamoto K, Keil FC. The acquisition of Japanese numerical classifiers—Linkages between grammatical forms and conceptual categories. Journal of East Asian Linguistics. 2000;9:379–409. [Google Scholar]
  173. Yeshurun Y, Carrasco M. Attention improves or impairs visual performance by enhancing spatial resolution. Nature. 1998;396:72–75. doi: 10.1038/23936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  174. Yoshida H, Smith LB. Shifting ontological boundaries: How Japanese- and English- speaking children generalize names for animals and artifact. Developmental Science. 2003;6(1):1–34. [Google Scholar]
  175. Yoshida H, Hanania R. Attentional highlighting as a mechanism behind early word learning. Twenty-ninth Annual Conference of the Cognitive Science Society; Nashville, Tennessee. 2007. [Google Scholar]
  176. Yoshida H, Smith LB. Linguistic cues enhance the learning of perceptual cues. Psychological Science. 2005;16(2):90–95. doi: 10.1111/j.0956-7976.2005.00787.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  177. Yoshida H, Smith LB. Early noun lexicons in English and Japanese Cognition. 2001;82:63–74. doi: 10.1016/s0010-0277(01)00153-6. [DOI] [PubMed] [Google Scholar]
  178. Yoshida H, Swanson J, Drake C, Gudel L. Japanese- and English-speaking children's use of linguistic cues to animacy in category formation. Biennial meeting of the Society for Research on Child Development; Minneapolis, MN. Apr, 2001. [Google Scholar]
  179. Yoshida H, Smith LB. Correlation, concepts and cross-linguistic differences. Developmental Science. 2003;6(1):30–34. [Google Scholar]
  180. Younger BA, Cohen LB. Infant perception of correlations among attributes. Child Development. 1983;54:858–867. [PubMed] [Google Scholar]
  181. Yu C, Smith LB. What you learn is what you see: Using eye movements to study infant cross-situational word learning. Developmental Science. doi: 10.1111/j.1467-7687.2010.00958.x. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES