Abstract
Little is known about the brain mechanisms involved in word learning during infancy and in second language acquisition, or about the way these new words become stable representations that sustain language processing. In several studies we have adopted the human simulation perspective, studying the effects of brain lesions and combining different neuroimaging techniques such as event-related potentials and functional magnetic resonance imaging in order to examine the language learning (LL) process. In the present article, we review this evidence, focusing on how different brain signatures relate to (i) the extraction of words from speech, (ii) the discovery of their embedded grammatical structure, and (iii) the inference of meaning from verbal contexts, and on how these signatures can inform us about the cognitive mechanisms underlying the learning process. We compile these findings and frame them into an integrative neurophysiological model that tries to delineate the major neural networks that might be involved in the initial stages of LL. Finally, we propose that LL simulations can help us to understand natural language processing and how recovery from language disorders in infants and adults can be accomplished.
Keywords: language learning, speech segmentation, contextual learning, functional connectivity, ERPs, fMRI
1. Introduction
Suppose an adult and child human arrive on Mars and discover that there are Martians who seem to speak a language to one another. If the adult and child human stay on Mars for several years and try to learn this language, what do you think will be the outcome? (Gleitman & Newport 1997, p. 22)
The quoted problem introduced by Gleitman and Newport illustrates two crucial aspects of language learning (LL). First, although infants learn their first language as part of their cognitive development, adults are also faced with this challenge when learning a second language. Second and most important, the differences and commonalities existing between the acquisition process in infants and adults might shed some light on the learning machinery necessary for mastering a new language.
Despite this fact, infant and adult language acquisition and learning processes1 have rarely been compared. It is clear that important factors that differ between the two populations may affect the way learning takes place. Infants, for instance, are making sense of an entire world while developing other cognitive functions in parallel with language (Diamond 2002). Moreover, this development in infants is accompanied by different rates of brain maturation and myelinization in different regions, which constrains cognitive functions (Casey et al. 2000; Uylings 2006). These maturational factors add to other differences, such as the implicit (non-instructed) nature of first language acquisition compared with the frequently explicit training in adult LL. Nevertheless, aside from these factors, some core aspects of language acquisition might be shared between the two populations. Thus, we believe that cross-talk between the cognitive neuroscience of LL in adults and infants is necessary and can be very fruitful, because the two fields can bring valuable information to each other, as will be outlined here. With that aim in mind, complementary neurobiological and developmental perspectives are required. Non-invasive recording of brain activity (e.g. using event-related brain potentials (ERPs) and structural and functional magnetic resonance imaging (fMRI)) and the study of the developmental changes that occur in the course of word learning (at the structural and functional brain levels) will greatly help to fill in the missing pieces of information about LL mechanisms in infants and adults (Spelke 2002).
The present paper is divided into four parts. First, we introduce the relevant factors that have been highlighted in the two fields in an attempt to show how they are interrelated. Second, we review the different studies of adults and infants devoted to early segmentation and word recognition processes along with the mechanisms involved in the extraction of rules embedded in those words. In the third section, we present the problem of inferring the meaning of a new word from a verbal context. In the fourth section, we present an integrative proposal of the main neurophysiological mechanisms involved in LL and the interface of LL with different cognitive processes.
2. Relevant factors in first and second language acquisition
(a). First language acquisition
The last 20 years have witnessed an enormous amount of research on LL and cognitive development, mostly coming from behavioural infant studies (Saffran et al. 2006; Kuhl & Rivera-Gaxiola 2008). Although our knowledge of how infants are able to master a new language is rapidly increasing, the problem is so complex and multifaceted that the current state of the art is still far from providing a clear picture of the exact learning mechanisms involved in this rich period of life (see Gaskell & Ellis, this issue). When considering this ‘complex problem’, we need to take into account at least three important aspects: (i) the input of the learning process, (ii) the cognitive mechanisms involved in learning, and (iii) their developmental time course.
When considering the input level, several authors underscore the distinction between learning words (lexical knowledge) and rules or constraints (grammatical knowledge). The capacity to develop grammatical abilities appears even in circumstances of impoverished input or impaired intellectual abilities (e.g. in Williams syndrome). However, Martens et al. (2008), in a recent review, contradicted this idea and suggested that individuals with Williams syndrome show, in some cases, both typical (but delayed) and atypical grammatical abilities. In particular, the atypical abilities were encountered in tasks measuring complex skills, such as morphosyntactic and semantic integration. In any case, the general agreement on the independence of grammatical development from other cognitive abilities has led generative linguists to postulate the existence of a very powerful innate LL device for accomplishing this highly demanding task (Chomsky 2002). Thus, from the same input, the cognitive system is able to extract two different types of information for language development, namely words and rules. From the generative point of view, this is accomplished using a predefined innate LL device, whereas input-based traditions hold that learning is accomplished by exploiting the internal characteristics of the input (Ellis 2008).
Thus, although this generative grammar account has received several criticisms, some core ideas in this view remain important at the level of cognitive processes, in the sense that, whatever learning mechanisms we are genetically endowed with, we are able to accomplish this complex enterprise when faced with the appropriate triggering inputs. The specification at the biological level of the genetic regulatory, expressive and epigenetic mechanisms that allow humans to accomplish this task might be one of the main fields of research in the future.2 An unresolved issue is to what extent infants are able to learn a language because they are equipped with a very powerful general-purpose learning mechanism or with a language-specific acquisition device (Elman et al. 1996; Bates et al. 1998; Hauser & Bever 2008).
Some researchers have already provided alternative hypotheses to the existence of specific LL mechanisms (Golinkoff et al. 2000). For example, it has been proposed that general principles of associative learning can explain many of the LL characteristics (Ellis 2008). In the same vein, a general-purpose statistical learning mechanism has been proposed to underlie a great deal of the LL phenomena (Smith 2000). Other authors have considered a broader view of the infant word-learning process, which might require the interaction of more general capacities, including conceptual and theory of mind capacities and grammatical knowledge (Bloom 2000). Similarly, an influential current view emphasizes the unavoidable fact that infants are social agents and that LL might be considered fundamentally a social activity (Tomasello 2003). This theoretical position downplays the validity of the Quinean dilemma (Quine 1960), which points out the inherent ambiguity of the meaning conveyed between two persons in a learning context (multiple meanings could always be mapped onto a new word). Because social interaction can disambiguate the referential intentions of the speaker, infants would not require the specific internal language constraints that have been postulated to explain how the meaning of a new word is learned (Markman 1989).
In fact, the existence and availability of multiple perceptual, linguistic and social cues in the learning environment of infants have led some researchers to consider that a simple blind association mechanism might not be enough and that attention is required for focusing, selecting and integrating the incoming speech and the multiple cues provided by the environment. Several convergent ideas on the importance of attention to social cues (eye gaze, pointing behaviour and joint attention) highlight the importance of considering social interaction as a key aspect when learning a language (Baldwin 1991). Notice that these new social-learning perspectives emphasize the richness of the initial infant learning environment and the need for powerful cognitive capacities to aid the processing and integration of this information. For example, some important concepts in this social pragmatic perspective, like social understanding or theory of mind, might require the involvement of high-level cognitive processes.
In a similar vein, Gentner & Namy (2004) have proposed another domain-general mechanism involved in early word learning: the comparison process. This mechanism, based on research on analogy and similarity, allows the storage and comparison of multiple similar experiences, highlighting their commonalities and inherent abstract relations (even when exposures are separated in time). Thus, this process might help infants to infer and discover the meaning of newly encountered words. The observed commonalities between two different experiences (e.g. imagine a child or an adult hearing the same label applied to different entities, as would be the case for the word ‘animal’) might trigger the comparison process, bypassing surface commonalities (the label ‘animal’) and extracting deeper abstract or conceptual relations (the concept of animal applies to different entities). Analogical reasoning might underlie these comparison processes. To what degree this comparison process is responsible for the acquisition of grammatical rules is still an open question (Gomez & Maye 2005).
Finally, at the developmental level, it is very important to observe the changes in language acquisition mechanisms across the lifespan. One of the tenets of the emergentist coalition model (ECM) (Hollich et al. 2000) is that infants might use a coalition of available learning cues in the environment (perceptual, social and linguistic) in order to learn a language. The interesting aspect of this account is that the weighting of these cues and their involvement in the word-learning journey might change over time (developmental perspective). For example, infants at the early stages of learning might depend more on perceptual and salient attributes in the environment and less on linguistic properties. With maturation, and after the completion of the first word-learning milestone, infants' attention might turn more towards linguistic aspects in order to boost their learning resources. Similarly, some developmental theories of lexical recognition and literacy (Fowler 1991; Walley 1993) have proposed that phoneme representations undergo gradual but substantial changes during childhood language development. This position is interesting because it emphasizes how the representations that sustain phonological and lexical information interact with each other and are transformed during language development. During this process, the initial phonological representation might consist of an implicit perceptual unit used for basic speech representation, which is afterwards transformed into an explicit cognitive representation that can be used for reading tasks. More specifically, in this model the initial lexical representations in early childhood might be holistic (most probably based on units larger than phonemic segments, such as syllables or overall acoustic shape) and, as the child's lexicon grows, these representations might become more refined and based on phonological segments (Walley 1993; Metsala & Walley 1998). This transformation might be partially driven by infant vocabulary growth, and therefore by the infants' increasing need to distinguish target words in the lexicon more rapidly (Metsala & Walley 1998).
(b). Second language acquisition
Although an important subset of language studies has focused on infants' language development, it is also true that most adults learn more than one language in their lifetime. Besides the emotional circumstances that will inevitably affect this learning process (Klein 1996), the crucial question is whether the large learning plasticity that infants display while acquiring their native language during the first 2 years of life is maintained at all throughout the lifespan. The idea that brain plasticity is largely reduced in adolescents and adults led some authors to propose the existence of critical periods or sensitive time windows for acquiring native-like competence in a second language, especially with regard to phonological and morphosyntactic aspects (see for a review, Birdsong 2006).3 In fact, this early idea about the limits on adult brain plasticity might explain the separation of the infant and adult learning research traditions. First language acquisition has always been considered a rather singular implicit process: its development is circumscribed to a very narrow time window (the first 2–4 years) and characterized by a large amount of brain plasticity that allows infants to master one or multiple languages in a relatively short period of time. In contrast, second language acquisition in adulthood has always been characterized as a non-automatic, explicit and effortful process, clearly modulated by motivational and emotional factors, and carried out by a rather crystallized cognitive system with no conceptual changes required.
Surprisingly, some studies of ultimate attainment have shown that a substantial number of second language learners attain near-native performance even in phonology and syntax (Flege 1987; White & Genesee 1996; Bongaerts 1999; Hyltenstam & Abrahamsson 2000; Birdsong 2006). Birdsong (1999) has estimated that between 5 and 15 per cent of learners attain near-native performance, while Montrul & Slabakova (2003) and White & Genesee (1996) reported larger estimates, ranging from 20 to 30 per cent. Interestingly, Coppieters (1987) suggested, in view of the results from extensive interviews, that native and near-native speakers of French appeared to be comparable in terms of language use and proficiency, although the two groups clearly diverged in their interpretation of sentences involving basic grammatical contrasts. All these data cast some doubt on a rigid interpretation of the sensitive time window hypothesis (see, for a critical view of the critical period hypothesis in language, Seidenberg & Zevin 2006). In a similar vein, during the last decade, the concepts of neural plasticity, neurogenesis and brain repair have been carefully redefined, and the emergent picture of the adult learning brain is more dynamic, open and encouraging than previous views suggested (Buonomano & Merzenich 1998; Bruer 2003; De Felipe 2006). However, it is important to bear in mind that similar performance levels (e.g. when comparing near-native second language learners and native speakers, or even infant and adult learning rates) do not directly indicate the involvement of the same cognitive resources or processes. The relationship between performance and cognitive processes is always complex, and it is at this point that complementary information from functional brain imaging and connectivity would be particularly helpful.
Complementarily, at the level of cognitive processing, second language research has also focused on the identification of individual differences in cognitive abilities related to second LL. For instance, four main abilities have been pointed out by several authors: (i) phonemic encoding ability, (ii) grammatical sensitivity, (iii) inductive LL ability, and (iv) associative memory (Carroll 1993). Another aspect that has been highlighted by other authors is cognitive control, a central aspect of bilingualism (Rodriguez-Fornells et al. 2006). In particular, Bialystok & Sharwood-Smith (1985) introduced the distinction between knowledge and control, the latter referring to the speed with which control over a second language is acquired. This distinction is analogous to the explicit–implicit difference in the sense that the slow, effortful, attention-demanding, error-prone and feedback-dependent initial process should progressively be replaced by a non-conscious, easier, automatic, fast, errorless and feedback-independent performance (DeKeyser 1997). This skilled learning process resembles the procedural learning observed in other domains (e.g. motor learning). In addition, individual differences in working memory have also been explored carefully in relation to non-word repetition and further vocabulary learning (Baddeley et al. 1998).
Finally, after this general overview, one may return to the question of the degree of overlap between the learning mechanisms involved in infant and adult LL. In this sense, one can conceive of applying a research programme similar to the one instigated by the ECM (Hollich et al. 2000), with the aim of contrasting the interaction of multiple available cues and learning mechanisms over different stages of acquisition in infants and adults. In this respect, the human simulation paradigm (Gillette et al. 1999; Snedeker & Gleitman 2004) provides an interesting framework for interrelating infant and adult learning. These experiments are conceived as ‘simulations’ in which an adult learner is exposed to information of the kind naturally received by the infant learner being simulated. The underlying objective is to observe how well the adult simulation emulates real child learning. Parallel findings in infants and adult second language learners would rule out the possibility that the effects observed are due to limitations of immature cognitive mechanisms during the period of life in which infants are evaluated (Gillette et al. 1999). From this perspective, only a clear description of the cognitive resources of infants and adults, complemented with a developmental viewpoint and the input circumstances that trigger them, will allow determination of the exact learning mechanisms involved in the mastery of language. With this aim, we have adopted a similar approach in the experiments reviewed here.
3. Speech segmentation and word recognition
(a). Speech segmentation in infants using statistical learning
One of the first mandatory stages that infants and second language learners encounter when acquiring a language is to identify (segment) the units (words) that compose the speech signal (the segmentation problem). The difficulty in segmenting the speech signal into words is accentuated by the lack of clearly marked word boundaries. It is not until a certain degree of familiarity with the language is gained that learners begin to recognize possible words in the speech input. Eventually, parsing the speech stream becomes possible by exploiting different sources of information. Once these units are identified, they have to be mapped onto conceptual representations (the word-to-world mapping problem). The present section is devoted to understanding the first process: how language learners are able to segment a new language.
Behavioural studies have pointed out the importance of a variety of different types of cues exploited by infants during speech segmentation tasks (Jusczyk 1999; Kuhl 2004). For example, it has been shown that, among other cues embedded in the speech signal, both infants (as early as 8 months) and adults are sensitive to the distribution of phonological and acoustic regularities and can exploit this type of information to segment the continuous speech signal into word-like units (Saffran et al. 1996). This important learning mechanism has been termed statistical learning. In more detail, learners are sensitive to the fact that low transitional probabilities are found at word boundaries (low likelihood of one syllable following another), whereas high transitional probabilities are found within words. Statistical learning is considered a domain-general learning mechanism implicated not only in speech segmentation (Saffran et al. 1996) but also in diverse sequential learning situations such as artificial grammars, tone sequences or visual patterns (see Saffran et al. 2006).
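To make this computation concrete, the following minimal sketch (Python; the syllable inventory and word set are invented for illustration and are not taken from any of the studies reviewed here) estimates forward transitional probabilities from a syllable stream and places candidate word boundaries at local dips in those probabilities.

```python
import random
from collections import Counter

def transitional_probabilities(syllables):
    """Estimate forward transitional probabilities P(next | current)
    from a sequence of syllables (the concatenated speech stream)."""
    pair_counts = Counter(zip(syllables, syllables[1:]))
    first_counts = Counter(syllables[:-1])
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}

def candidate_boundaries(syllables, tps):
    """Place a boundary wherever the transitional probability dips below
    both of its neighbours (a local minimum), the statistical cue that
    learners are assumed to exploit."""
    values = [tps[pair] for pair in zip(syllables, syllables[1:])]
    return [i + 1 for i in range(1, len(values) - 1)
            if values[i] < values[i - 1] and values[i] < values[i + 1]]

# Toy language: four trisyllabic words built from a pool of 12 syllables,
# concatenated in random order without pauses.
words = [("ta", "pi", "ro"), ("go", "la", "bu"),
         ("da", "ko", "se"), ("mi", "nu", "fe")]
stream = [syl for word in random.choices(words, k=300) for syl in word]

tps = transitional_probabilities(stream)
# Within-word transitions approach 1.0; transitions across word
# boundaries approach 0.25 (four possible following words).
print(sorted(set(round(p, 2) for p in tps.values())))
print(candidate_boundaries(stream, tps)[:8])   # indices of inferred word onsets
```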
Notably, the majority of studies dealing with statistical learning and speech segmentation have not addressed the important question of the nature of the output of the speech segmentation process. Concerning this issue, Saffran (2001) proposed that the ‘representations emerging from statistical learning may serve as candidate lexical items for infants, available for integration into the native language’ (p. 9). In this study, the author demonstrated that infants processed newly segmented words differently from words that had not been segmented (non-words) when they were presented at the end of meaningful sentences. Furthermore, Graf et al. (2007) showed that 17-month-olds had an advantage in mapping segmented words to new meanings compared with non-words. Overall, these two studies argue in favour of the existence of a special proto-lexical status attained by newly segmented words. Conceptual information can then be linked to the proto-lexical traces already stored, making the mapping-to-meaning process easier.
(b). Time-course studies of speech segmentation
Because it is difficult to measure learning processes directly during speech segmentation relying exclusively on behavioural measures, ERPs have recently been used to investigate the time course of speech segmentation in adult listeners (Sanders et al. 2002; Cunillera et al. 2006, 2008). Owing to their excellent time resolution, ERP and magnetoencephalographic (MEG) techniques can inform us about covert online processing, which is particularly relevant when studying fast learning processes.
In the first speech segmentation study of this kind, Sanders et al. (2002) reported that previously taught nonsense words, when segmented from a continuous speech stream, elicited larger N100 and N400 amplitudes after training, a finding that is in agreement with the interpretation of the N400 as an index of lexical search. The N400 component has classically been related to lexical and semantic processing (Kutas & Federmeier 2000). In this regard, this study adds to the previous evidence for the proto-lexical status of newly segmented words in speech segmentation studies (Saffran 2001; Graf et al. 2007).
In a similar study (Cunillera et al. 2006), we focused directly on the online segmentation process, in which nonsense words were discovered during exposure to 8 min continuous auditory streams without any previous training (figure 1a). By tracking the variation in the N400 and N100 components through the learning process, we were able to observe how quickly listeners were able to segment words in a new language. By analysing the learning process in 2 min blocks, we observed that frontocentral differences in the N400 component arose during the second minute of exposure to the language stream compared with a baseline condition in which syllables were presented in random order, thus ensuring that statistical learning was not possible (see figure 1b). The maximum effect appeared between 2 and 4 min of exposure (figure 1c; see also De Diego-Balaguer et al. 2007; Cunillera et al. 2009).
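As a minimal illustration of this kind of block-wise analysis (a sketch, not the actual pipeline used in the study), the code below epochs a continuous single-electrode signal around word onsets and computes the baseline-corrected mean amplitude in the 350–550 ms N400 window separately for each 2 min block; the sampling rate, array names and simulated data are assumptions made purely for the example.

```python
import numpy as np

def n400_by_block(eeg, onsets, sfreq=500, block_s=120,
                  win=(0.350, 0.550), baseline=(-0.100, 0.0)):
    """Baseline-corrected mean amplitude in the N400 window (350-550 ms
    after word onset), averaged separately for each 2 min block.

    eeg    : 1-D array, continuous signal from one electrode (e.g. Cz)
    onsets : sample indices of word onsets within the continuous stream
    """
    blocks = {}
    for onset in onsets:
        block = int(onset // (block_s * sfreq))                    # which 2 min block
        b0, b1 = onset + int(baseline[0] * sfreq), onset + int(baseline[1] * sfreq)
        w0, w1 = onset + int(win[0] * sfreq), onset + int(win[1] * sfreq)
        if b0 < 0 or w1 > len(eeg):
            continue                                               # skip truncated epochs
        amplitude = eeg[w0:w1].mean() - eeg[b0:b1].mean()          # baseline correction
        blocks.setdefault(block, []).append(amplitude)
    return {b: float(np.mean(a)) for b, a in sorted(blocks.items())}

# Simulated 8 min recording at 500 Hz with word onsets roughly every 0.84 s;
# contrasting these block means between language and random streams would
# mirror the word versus non-word comparison described above.
rng = np.random.default_rng(0)
eeg = rng.normal(scale=10.0, size=8 * 60 * 500)
onsets = np.arange(500, len(eeg) - 500, 420)
print(n400_by_block(eeg, onsets))
```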
Figure 1.
(a) Language and random streams used in the speech segmentation studies. The language streams were created by randomly concatenating the four words (built by combining an initial pool of 12 syllables); in the random streams, the same syllables were presented in random order. (b) ERP averages for the language (words, solid line) versus random (non-words, dashed line) streams at a central location (Cz). Notice the increase in negativity for words between 350 and 550 ms (adapted from Cunillera et al. 2006). Right side: topographical isovoltage map showing the frontocentral distribution of the word minus non-word N400 effect (maximum and minimum values +0.6/−0.6 µV). (c) Time course of the word (solid line) versus non-word (dashed line) N400 effect across the 8 min learning phase (adapted from Cunillera et al. 2009). In the second, third and fourth 2 min blocks, the difference between words and non-words was significant, this effect being largest during the second block (n = 15 participants, second block, F(1,14) = 17.6, p < 0.001; mean amplitude 350–550 ms, omnibus repeated-measures ANOVA applied at 15 electrode locations).
Figure 2.
(a) Examples of the materials used in the rule-learning experiment (De Diego-Balaguer et al. 2007). (b) ERP averages comparing the third (solid line) and first (dashed line) minute of language exposure. An increase in the amplitude of the P2 component was observed from the third minute. In the middle, the correlation between rule-learning performance and the amplitude of the P2 component in the third minute is plotted. On the right, the difference waveform (third minute minus first minute) showed a right frontal distribution for the P2 (maximum and minimum values +0.4/−0.5 µV; mean value between 140 and 190 ms) (adapted from De Diego-Balaguer et al. 2007). (c) On the left, the increased P2 effect found when comparing initially stressed words (solid line) with stressed non-words (dashed line) (random streams). On the right, topography of the P2 component showing the same right frontal distribution (maximum and minimum values +0.4/−0.4 µV) (adapted from Cunillera et al. 2006).
Similar results have been observed for infants' word-learning skills, where an ERP negativity modulation with a frontal distribution was also found in 14-month-old infants performing a fast object–word mapping learning task (Friedrich & Friederici 2008). Likewise, increased frontal and sustained negativities were observed by Conboy & Mills (2006) for known versus unknown words. Moreover, in a recent ERP training study, Mills et al. (2005) showed that 20-month-olds had a larger N400 to trained words paired with an object compared with untrained words. In adults, the involvement of the N400 in fast word learning has also been reported in other studies addressing different aspects of lexical acquisition (McLaughlin et al. 2004; Perfetti et al. 2005; De Diego-Balaguer et al. 2007; Mestres-Missé et al. 2007; Mueller et al. 2008a).
Thus, overall, in adult and infant studies, negative polarity increases in the range of 200–500 ms appear to reflect the word-learning process. But what type of cognitive mechanisms are these ERP modulations reflecting? In the Cunillera et al. (2006) study, the N400 modulation was taken to reflect learners' ability to extract the co-occurrence statistics found within a language. Based on their results with streams of tones built in analogy to Cunillera et al. (2006), Abla et al. (2008) proposed that the N400 modulation reflects the computation of transitional probabilities, whereas the N1 does not vary as a function of this factor. However, an alternative view relates the N400 amplitude modulation to the progressive strengthening of a proto-lexical memory trace for the repeatedly encountered new word. The results of a recent study support this interpretation, because this progressive amplitude modulation appears as a function of exposure even when segmentation based on statistical information is not necessary, as words are pre-segmented by subtle pauses (see figure 2a; see also De Diego-Balaguer et al. 2007). In contrast, the N1 is attenuated when pauses are inserted, even in random streams, and enhanced for the detection of boundaries based on transitional probabilities (De Diego-Balaguer et al. 2008a), suggesting that it is this component that is likely to reflect the computation of transitional probabilities.
These electrophysiological studies have also shown that these learning-related modulations in the N400 range differ from the classical lexical-semantic N400 effects. While, for the latter, the amplitude of the component decreases when lexical-semantic integration demands are reduced (e.g. with repetition or semantic contextual priming), the learning-related N400 shows the opposite pattern, with progressive amplitude enhancement as a function of increased exposure to the new word (Mueller et al. 2008a). In addition, the topographic distribution of this learning-related N400 component is more frontocentral, whereas the classical semantic N400 typically shows a right centro-parietal distribution (Kutas & Federmeier 2000). These differences in amplitude modulation and topography indicate that, although these variations are observed in similar latency windows, they might not share the same cognitive processes and neural generators. Longitudinal word-learning designs will allow disentangling these differences and determining the cognitive processes shared by word processing and the long-term consolidation of newly acquired words. Several studies have already shown interesting effects of long-term training of new words as well as the effect of sleep on their consolidation (see Gaskell & Dumay 2003; Cornelissen et al. 2004; Grönholm et al. 2005; Dumay & Gaskell 2007; Tamminen & Gaskell 2008; Davis et al. 2009).
Finally, because speech segmentation to bootstrap linguistic regularities could be considered a crucial step in the chain of language development processes, it might be possible to observe long-term effects of speech segmentation capacities on language development. However, studies on this issue are scarce. Newman et al. (2006) recently observed that infants' performance on speech segmentation tasks before 12 months of age was related to expressive vocabulary at 24 months. In addition, those children who were able to segment words from fluent speech scored higher on language measures but did not differ in general intelligence. Moreover, early grammar development might be affected by this process as well, if words have to be segmented before the structural dependencies between words can be extracted (De Diego-Balaguer et al. 2007). If the two types of information can be extracted in parallel, or if the computations needed are different, segmentation and grammar acquisition might not show a dependency and may not appear sequentially, one type of knowledge after the other. Overall, these results suggest that speech segmentation abilities might be an important prerequisite for successful language development. In addition, Newman et al. (2006) emphasize the importance of evaluating prelinguistic skills and cognitive development using longitudinal studies.
(c). The role of attention in rule learning and speech segmentation using multiple cues
In natural language, multiple cues converge at the same time and can be exploited in order to segment the acoustic signal into word-like units (Hollich et al. 2000). Because a single cue in isolation is often not fully reliable, the combination of multiple probabilistic cues could facilitate initial LL in infants and adults. Indeed, during language development, children are very sensitive to metrical sequences, which can help them extract regularities for grammatical acquisition (Jusczyk 1999). In an influential study (Peña et al. 2002), the importance of prosodic information in triggering the appropriate computations for the extraction of rules from language was demonstrated experimentally. The authors used a simplified artificial language containing words that shared a structural dependency: the first syllable of a word determined its final syllable (e.g. paliku, paseku, paroku), similar to some morphological rules (e.g. unbelievable, untreatable, unbearable) (see figure 2a). They compared performance when the same material was presented continuously or with subliminal 25 ms pauses between words. While participants were always able to extract the specific words composing the language, they were only able to generalize the embedded rule when the prosodic cue provided by the pauses was present. This interesting result raises an exciting follow-up question concerning how this prosodic information triggers the appropriate computations, or, in other words, how this information changes the way the speech signal is treated.
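To make this manipulation concrete, the following sketch (Python; the specific frames, middle syllables and pause marker are invented for illustration rather than taken from the actual stimulus set) generates AxC words in which the initial syllable predicts the final one, optionally separated by a marker standing in for the 25 ms pauses, and checks whether a test item respects the frame rule even when its middle syllable is novel.

```python
import random

# Hypothetical A_C frames: the initial syllable predicts the final syllable,
# while the middle syllable varies freely (cf. paliku, paseku, paroku).
frames = [("pa", "ku"), ("be", "ga"), ("ta", "fo")]
middles = ["li", "se", "ro"]

def make_stream(n_words=100, pauses=False):
    """Concatenate AxC words; insert a pause marker ('|', standing in for the
    25 ms silent gap) between words when the prosodic cue is available."""
    words = [a + random.choice(middles) + c
             for a, c in random.choices(frames, k=n_words)]
    return ("|" if pauses else "").join(words)

def respects_rule(test_word):
    """A generalization item is rule-consistent if some frame's initial and
    final syllables bracket it, whatever the middle syllable."""
    return any(test_word.startswith(a) and test_word.endswith(c)
               for a, c in frames)

print(make_stream(4, pauses=True))        # e.g. 'paliku|besega|tarofo|paseku'
print(respects_rule("pa" + "zu" + "ku"))  # novel middle, frame respected -> True
print(respects_rule("pa" + "zu" + "ga"))  # frame violated -> False
```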
In a recent study (De Diego-Balaguer et al. 2007), we used artificial languages like those in the Peña et al. study (with pauses) in order to tease apart the electrophysiological responses related to word learning from those associated with the extraction of rules. During the course of learning, adult volunteers exhibited a progressive positive deflection peaking around 200 ms after word onset (P2 modulation) that correlated positively with listeners' ability to generalize the rules embedded in the language. Comparing the variations of the ERP components through the learning process, we observed that this P2 effect was clearly dissociated from the N400 modulation, which appeared earlier in the learning phase (figure 2b).
These P2 results were interpreted in terms of the effect of attention in biasing LL processes. This attentional bias was initially proposed by Gleitman & Warner (1982) and Echols & Newport (1992), who considered that infants might exploit certain perceptual or attentional processes that allow them to extract salient elements from the stream of language, leaving some elements unattended and thereby reducing the scope of the segmentation and word-learning problem. In adults, Ellis (2008) has proposed that in second LL, and particularly for the acquisition of grammatical relations, attention is tuned to enhance the perception of the relevant information. While similar structural patterns may help transfer from L1 to L2, interference will be observed when the new language requires a different allocation of attention to structural relations. In support of the relation between this attentional bias and the P2 modulation, several studies have shown enhancement of this component for salient stimuli that cued the selection of relevant information (Luck & Hillyard 1994). In the same vein, the P2 modulation was also observed when multiple cues (stress and statistical information) were used and integrated for word segmentation in the same artificial languages described in the previous section (Cunillera et al. 2006, 2008; figure 2c).
This point is crucial if we consider that a number of studies have shown that successful extraction of the underlying structural relations requires the presence of cues that can capture learners' attention. Attention might help to select the relevant information that has to be clustered. These cues include, for example, the presentation of clearly segmented words (Gomez & Maye 2005) and the salience of the syllables carrying the critical rule information, achieved either through their boundary position (Endress et al. 2005) or by increasing the variability of the irrelevant information (Onnis et al. 2008). Similarly, in our studies, prosodic information (word stress for segmentation and pauses for rule learning) could act as task-relevant salient information that helps to capture attentional resources. As proposed by Mueller et al. (2008b), prosody may guide learning by helping to focus on the relevant units. This is consistent with our data from patients with a neurodegenerative disorder (Huntington's disease (HD)), showing that performance on different neuropsychological tests of executive function correlated with rule-learning performance but not with word learning when the presented words were pre-segmented (De Diego-Balaguer et al. 2008a,b).
In summary, taking the different studies together, attention seems to play a crucial role in the segmentation and rule-learning processes (Toro et al. 2005; Pacton & Perruchet 2008). Several accounts have proposed that, given the power and usefulness of statistical learning, the same computations applied to the speech input can allow learners to segment speech and also to extract the rules governing grammatical relations (Pacton & Perruchet 2008; Ellis 2008). These proposals agree on the key role of attention in focusing on the relevant units (words, clauses, phrases, grammatical categories) over which the calculations have to be applied. However, the difference here might not reside in the distinction between words and rules but rather in the use of multiple cues to be integrated versus the use of only one type of information, which is in agreement with the ideas underlying the hybrid model of Hollich et al. (2000).
4. Inferring the meaning of new words using contextual learning
(a). Contextual learning in second language research
In a second series of studies, we aimed to simulate the mapping of existing meanings onto new words using information from verbal contexts. This type of learning via guessing is a powerful mechanism that permits the discovery of the meaning of new words throughout the lifespan. This is the case for both first and second LL, provided learners encounter the appropriate conditions (Nation 2001). Indeed, contextual learning can be considered an example of how a general learning mechanism, inductive reasoning, is required for the purpose of LL.
It has been estimated that students in the middle grades encounter between 16 000 and 24 000 new words (Nagy et al. 1987; Nation 2001). Although estimates of the number of words learned per day differ across authors, a typical child (e.g. 8–10 years of age) might have to learn about six to 12 new words per day. Although it is assumed that many of these words are learned using contextual information during reading (Durkin 1979), some studies have shown that this ability to derive the meanings of new words from context might be more difficult to attain than initially thought (Carnine et al. 2008; see for critical reviews Carver & Leibert 1995; Landauer & Dumais 1997; Laufer 2003). In order to extract the correct meaning, learners should selectively focus their attention on the relevant conceptual information in the context to correctly guess the meaning of the new word. This ability may depend on individual differences in verbal reasoning and working memory, which might help to ‘pick up’ those relevant aspects of the context that could provide clues to the meaning of the new word (Sternberg 1987; van Daalen-Kapteijns et al. 2001). Interestingly, Chaffin et al. (2001) studied the inference of meaning from context using eye-movement recording. These authors found that reading times for informative regions of the context were longer than for neutral or non-informative ones. This result implies that participants were able to quickly adapt their reading strategy depending on the relevance of the information for contextual word learning.
Some authors have proposed that the way in which different studies on child language evaluated the degree of learning of new words may not be appropriate, and that children accrue only partial semantic knowledge of new words, which is then filled in and completed with additional exposures (Landauer & Dumais 1997; McGregor et al. 2002). In children, this ‘slow mapping’ process is thought to operate between several new words and their corresponding referents and meanings. Although this idea contrasts with the well-known fast mapping process (Carey & Bartlett 1978), slow mapping may be triggered subsequently to fast mapping. Initially, a fragile new word representation might be created in the lexicon and the child could begin to hypothesize about its meaning, updating this semantic representation until it perfectly maps the relationship between the word, the referent and its related concepts.
From this perspective, word learning is considered an incremental process in which word representations are progressively developed and refined over time through multiple exposures (Bloom 2000; McGregor et al. 2002). Interestingly, this incremental learning process should be susceptible to the learning and forgetting of various semantic attributes, in the sense that further encounters with a new word might lead to the rejection of some of the false conceptual attributes initially guessed or to the attachment of new ones (Werner & Kaplan 1952; McKeown 1985). This initial grasp of the meaning of a novel word might aid its placement in semantic space by indicating its similarity to already existing and established lexical entries. Then, readers might selectively allocate their processing resources to each region of the sentence depending on the semantic hypothesis generated and the amount of pertinent information available in that region (see Morris & Williams 2003).
(b). Time-course analysis of contextual learning
Contextual learning tasks provide a good measure of the ability to extract particular components of a word's meaning as well as the ability to differentially select the right set of semantic components from the vast amount of potential relational information across the sentences. Nagy and Gentner introduced a beautiful figurative analogy in their contextual learning study (second experiment): ‘This experiment might be likened to dipping a magnet into a mixture of iron filings and sand: the iron filings should stick and the sand should fall off’ (Nagy & Gentner 1990, p. 188). Similarly, the appropriate semantic features of the context have to be attached to the magnet, and which features stick might largely depend on the grammatical class of the new word.
To study how word meaning is determined online during reading, we devised a word-learning task that mirrored a reading situation in which three-sentence contexts constrained or did not constrain the meaning of a new word (Mestres-Missé et al. 2007) (see table 1). In constrained coherent contexts (new word meaning condition), we created a learning context in which the three sentences referred to the same meaning, whereas in the other condition (new word no-meaning), the sentences referred to different concepts and participants could not infer the meaning of the new word. In the self-paced reading and ERP experiments, we observed a very similar pattern of gradual acquisition of the meaning of the new word (see figure 3a,b). While clear differences were observed in the first sentence between the new-word conditions and a control real-word condition, this difference gradually disappeared across the next two sentences for the new word meaning condition. New words in coherent contexts showed a gradual increase of the N400 that was not observed for new words embedded in incoherent contexts. Interestingly, at the end of the second sentence, a significant difference was already present in the N400 range between new words in coherent and incoherent contexts (figure 3b). This pattern of reading times is also in consonance with other studies in which readers tend not to spend more time on a particular noun when it refers to a previously, though not explicitly, mentioned concept in the sentence (O'Brien et al. 1988). In this regard, the lack of differences between the new word meaning and real-word conditions at the end of the third sentence is in agreement with the idea that the concept had already been inferred at that position.
Table 1.
Meaning inference task (Mestres-Missé et al. 2007). Participants were required to discover the meaning of a new word presented at the end of each of three successively presented sentences (eight-word sentences). In the meaningful condition, the meaning of the new word was readily apparent, whereas in the meaningless condition no meaning could be mapped onto the novel word, as each of the three sentences referred to a different concept. To control for repetition effects across sentences, real words were used at the end of the sentences in the real word condition. Upon completion of the third sentence, participants were required to provide the meaning of the new word if possible (sentences were translated from Spanish keeping the word order).
new word meaningful
Mario always forgets where he leaves the lankey |
It was expensive the repair of the lankey |
I punctured again the wheel of the lankey |
new word meaningless
I have bought the tickets for the garty |
On the construction-site you must wear a garty |
Everyday I buy two loaves of fresh garty |
real word condition
She likes people with nice and clean teeth |
In a fight Mary had broken two teeth |
After a meal you should brush your teeth |
Figure 3.
Meaning inference experiment (adapted from Mestres-Missé et al. 2007). (a) Pattern of reading times across the eight-word sentences in each of the three successive sentence presentations (table 1). (b) ERP results showing the differences between the three conditions at a parietal electrode. Notice that while no differences were observed between new words in meaningful and non-meaningful contexts in the first sentence, the pattern diverged completely at the end of the third sentence, when the meaning of the new word had been discovered in meaningful contexts. Black, real word; green, new word meaning; red, new word no meaning. (c) Results of the semantic priming experiment in which the new word was paired with its corresponding word meaning (dashed line, related; solid line, unrelated). A similar N400 semantic priming effect is observed but with a different topographical distribution (as shown in the difference waveforms in (iii): dashed line, new words; solid line, real words; maximum and minimum values in the left isovoltage map, +0.46/−2.92 µV; right map, +0.33/−2.36 µV).
In summary, these experiments can be interpreted in accordance with the incremental learning idea that the semantic attributes of the new word are gradually built into semantic space.4 Along the same lines, Borovsky et al. (2008) studied ERP responses to subsequent plausible and implausible usages of a new word that had previously been presented in a single-sentence exposure (weakly or strongly constraining). A reduction of the N400 was observed when the new word was introduced in a new plausible sentence compared with an implausible one, but only when the pre-exposure was in a strongly constraining sentence. These results show that a single exposure to a novel word and its context might be enough to influence the ability to judge the word's appropriate usage in subsequent contexts, but only if the first context in which the novel word was encountered is highly informative.
In agreement with this idea, we further observed that the association of the new word with its meaning showed N400 semantic priming effects (see figure 3c; see similar N400 priming findings in Perfetti et al. 2005). Interestingly, the latency of this priming effect was delayed (by about 150 ms) compared with the priming effect for real words, and the distribution of the N400 effect was more frontocentral than that observed for real words (figure 3c, right). One explanation for these differences could be that the new conceptual relations attached to the newly learned word are still weak and, therefore, an increase in cognitive control might be required to guide semantic knowledge retrieval and selection (Krashen 1982; Bialystok & Sharwood-Smith 1985; Kroll & Stewart 1994; Bialystok 2001; Rodriguez-Fornells et al. 2006).
5. A functional neuroanatomic model of LL: an integrative account
The present section proposes an integrative functional neuroanatomic account of word learning (figure 4) based on three main ideas: (i) the dual-stream model of language processing (Hickok & Poeppel 2000), (ii) the role of the medial temporal lobe (MTL) structures in the initial storage of information and its further consolidation (Nadel & Moscovitch 1997), and (iii) the recruitment of several brain regions involved in cognitive control processes during second LL (Krashen 1982). The model is partially based on previous neuroimaging data gathered using the LL paradigms described in the previous sections (mostly speech segmentation and contextual word learning). Other parts of the model, especially the neuroanatomical interconnectivity of the different regions involved in LL, are based on previous studies in which these issues have been addressed (mostly neuroanatomy, neurophysiology, diffusion tensor imaging (DTI) and intraoperative electrical stimulation studies in humans and other species). In this regard, parts of the model remain speculative and have to be considered as heuristics that can stimulate and guide future research on LL and its functional and structural connectivity.
Figure 4.
The main interconnectivity between the three LL streams proposed in the text. The dorsal route (green) begins with the phonological representation of the new word (NW) processed in the STG and pSTG, which is channelled to the dorsal rehearsal and processing route. The ventral meaning inference interface (yellow) starts from contextual information in the learning context (linguistic and extralinguistic) and recruits neural regions involved in storing and retrieving conceptual information (black dots representing conceptual nodes) via the inferior, medial and anterior temporal regions, richly interconnected via the uncinate fasciculus with the ventral IFG (inferior frontal gyrus). Finally, the NW and its corresponding context might trigger MTL-dependent storage processes (episodic-lexical interface). With repeated exposures to the NW, and after its meaning is attached (via direct association or inference from the ventral route), this new episodic NW representation might be stored in the mental lexicon (word-form level), independently of MTL-rehearsal mechanisms. Several regions involved in cognitive control, inductive reasoning and motivation (feedback related) might be selectively triggered depending on the demands of the LL situation (e.g. MFG, striatum-thalamic loop). NW, new word; L1, lexical trace; MTL, medial temporal lobe; aITG, anterior inferior temporal gyrus; MTG, posterior middle temporal gyrus; aTP, anterior temporal pole; vIFG, ventral inferior frontal gyrus; MFG, middle frontal gyrus; vPMC, ventral premotor cortex; pIFG, posterior IFG; SMG, supramarginal gyrus; pSTG, posterior superior temporal gyrus. Thicker lines represent principal connections between the different LL streams. Green box, dorsal audio-motor interface; pink, episodic-lexical interface; yellow, ventral meaning inference interface; unfilled, shared cognitive mechanisms.
As depicted in figure 4, a learning context is considered to be an experience in which a learner is exposed to one or more unknown items (together with multiple linguistic and extra-linguistic cues). This information, or parts of it, could be encountered repeatedly. From this context, multiple parallel processing streams might be triggered for (i) encoding the phonological representation of the new item, (ii) briefly retaining this representation and manipulating it in short-term memory, (iii) long-term consolidation of this representation, and (iv) attaching conceptual representations to this form or inferring the meaning conveyed in the learning context.
In the following sections, we briefly outline the importance of three streams in LL: (i) the dorsal audio-motor interface, (ii) the ventral meaning integration interface, and (iii) the episodic-lexical interface. We also discuss the involvement of other cognitive functions (e.g. attention, maintenance and manipulation of information, and inductive reasoning) that can support these processing streams in several LL situations.
(a). Dorsal audio-motor interface
This pathway is involved in mapping sounds onto articulatory-based representations (Hickok & Poeppel 2000, 2004, 2007; Scott et al. 2000; Wise et al. 2001; Scott & Wise 2004). It engages the left posterior temporal regions and the parieto-temporal boundary, as well as the frontal regions sustaining motor speech representations (figure 4). In relation to LL, Hickok & Poeppel (2007) have recently put forward the hypothesis that this interface might be involved in the acquisition of new vocabulary. This process might involve generating a new sensory representation of the novel word and keeping this auditory representation in an active state (i.e. phonological short-term memory, Buchsbaum et al. 2005). At the same time, this newly created trace might guide the production of motor articulatory sequences. This proposal converges with the motor theory of speech perception (Liberman & Mattingly 1985). Several recent studies have demonstrated the involvement of several areas of this dorsal interface in learning new phonological contrasts (Golestani & Zatorre 2004; Golestani et al. 2007; Wong et al. 2007; Mei et al. 2008).
The first evidence in favour of the involvement of this dorsal interface in speech segmentation was published by McNealy et al. (2006). The authors showed that a left frontotemporal network is activated during the online speech segmentation process (see figure 5a,b). In particular, while participants listened to the language streams, activation of the superior temporal gyrus (STG) was found to increase over time, along with the recruitment of the premotor cortex (PMC) and other regions of the dorsal pathway. In order to gain more insight into the involvement of the audio-motor interface, we recently investigated the specific role of the PMC in speech segmentation. Using the same type of language and random streams as in the ERP experiments (figure 1a), we observed activation of the PMC in the language conditions during speech segmentation (see figure 5c). Moreover, we showed that performance in the word-segmentation task correlated with activation in the left PMC only during the first 2 min of learning (Cunillera et al. 2009) (see figure 5d). This result converges with the selective involvement of the PMC at the early stages of implicit motor sequence learning (Doyon et al. 2002) and with the involvement of the premotor and motor cortex in passive speech perception (Fadiga et al. 2002; Wilson et al. 2004; Meister et al. 2007).
Figure 5.
(a,b) Activation in temporal, frontal and parietal cortices when participants listened to random artificial streams (a) and language streams (b), compared with a resting baseline (adapted from McNealy et al. 2006; with permission from the authors). (c) Sagittal activation maps of a similar experiment (n = 13 participants) in which language (red) and random (blue) streams were compared against off-periods during the first 2 min of exposure to the streams. The composition of the language and random streams followed the same structure as shown in figure 1a. Language and random streams were presented in eight 30 s task blocks (on-periods) interleaved with off-periods. An increase of activation was noticeable in the posterior superior temporal region, middle frontal gyrus and ventral premotor area (BA 6) in the language condition (language and random conditions against off-periods) (adapted from Cunillera et al. 2009). (d) Haemodynamic responses for a selected region of interest in the PMC (BA 6, centre −56, 0, 44). A significant increase of activation is noticeable during the first 2 min of exposure (L1) when compared with the next 2 min block (L2) or the random blocks (R1, R2). On the right, the correlation between the percentage of words recognized at the end of exposure and the increase in activation in the ventral PMC (first language block) is depicted.
As stated by Warren et al. (2005), a possible mechanism explaining these results would be that a template-matching algorithm in the posterior STG allows the detection of coincidences between stored auditory memory representations or ‘phonological templates’ derived from previous exposures and the new incoming stimuli. The output of this process would be an ordered sequence of auditory representations that could be forwarded to the PMC. Based on previous studies using DTI (Catani et al. 2005; Saur et al. 2008) and intraoperative direct stimulation (Duffau 2008), the best candidate to support this communication between the posterior STG and the PMC might be the superior longitudinal fasciculus, most probably its lateral part, which runs parallel to the arcuate fasciculus and passes through the supramarginal gyrus (Catani et al. 2005; Duffau 2008).5 In the premotor areas, the encoded sequence of sounds might be mapped onto a sequence of articulatory gestures, which would keep the segmented words active through a rehearsal mechanism. The importance of the rehearsal component of verbal working memory is also evident when participants are asked to learn the same type of artificial language streams while performing an articulatory suppression task. In this condition, which impedes the use of rehearsal, segmentation is blocked when compared with a simple auditory interference condition (Lopez-Barroso et al. 2009). This rehearsal mechanism might be crucial in the first stages of learning unfamiliar new words (Baddeley et al. 1998), reflecting the involvement of the rehearsal component of phonological working memory when segmenting possible words. Indeed, Scott & Wise (2004) explicitly suggested that this dorsal stream might be very important in language acquisition, highlighting the important role of rehearsal during new word learning.
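The template-matching idea can be illustrated with a deliberately simplified sketch (Python; the one-dimensional 'feature' signal, template length and threshold are all invented for the example and are not meant as a model of STG physiology): a stored template is slid along the incoming signal, and positions where the normalized correlation exceeds a threshold are flagged as coincidences.

```python
import numpy as np

def template_matches(signal, template, threshold=0.8):
    """Slide a stored 'phonological template' along the incoming signal and
    return the positions where the Pearson correlation between the template
    and the local segment exceeds the threshold (a toy coincidence detector)."""
    t = (template - template.mean()) / template.std()
    n = len(template)
    hits = []
    for i in range(len(signal) - n + 1):
        segment = signal[i:i + n]
        if segment.std() == 0:
            continue
        r = np.dot(t, (segment - segment.mean()) / segment.std()) / n
        if r > threshold:
            hits.append(i)
    return hits

# Toy demonstration: a 30-sample 'word' template embedded twice in noise.
rng = np.random.default_rng(1)
template = rng.normal(size=30)
signal = rng.normal(size=300)
signal[40:70] += 3 * template
signal[200:230] += 3 * template
print(template_matches(signal, template))   # positions at or near 40 and 200
```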
Using similar ideas as in feed-forward motor control models (Desmurget & Grafton 2000), this interface could allow the creation of an internal efference copy of the possible representation of the articulated word, which might be very useful to the language processing system for prediction purposes. This feed-forward prediction mechanism could be triggered when learning new words, with the PMC acting as a fine-tuning ‘top-down’ mechanism that regulates the template-matching process engaged in the posterior STG. Because of the implication of the premotor and the inferior frontal cortex in action selection, the existence of this type of predictive representation is very plausible as a monitoring device that checks the output before sending it to the articulators. Similar proposals have also been raised in language production models (Dell & Reich 1981; Dell 1986) and in implicit motor skill learning (Poldrack & Willingham 2006). Notice that tracking the predictability of upcoming attended stimuli might also be very important in speech segmentation. In fact, it has been shown that the PMC responds to the prediction of auditory events in a structured sequence (Gelfand & Bookheimer 2003; Schubotz et al. 2003).
It is worth considering to what degree this auditory-motor learning network could also be involved in language acquisition in infants (Doupe & Kuhl 1999; Warren et al. 2005). Indeed, this auditory-motor interface is postulated to be very important in speech development, because speaking inherently requires fine-tuned motor learning. Language perception and production in the developing infant brain require a specific tuning to the language sounds encountered during the first year of life. The first words imitated by a child are guided by the ‘gestural’ features of the sound, i.e. by the actions of the mouth rather than by the sound's acoustic features (Studdert-Kennedy 1987). Importantly, the outlined auditory-motor pathway must also be related to the brain network subserving imitation of simple movements (Iacoboni et al. 1999). In fact, the ability to mimic sounds seems essential for language development. This idea has been revitalized by the discovery of mirror neurons, recorded in macaques in the homologue of the ventral PMC region (including the superior part of Broca's region) and in humans (Rizzolatti & Arbib 1998). These specific audiovisual mirror neurons discharge not only when performing and observing a specific action, but also when hearing a sound representative of the observed action (Kohler et al. 2002). Mirror neurons also provide a mechanism for integrating perception and action at the neuronal level, which, at the same time, might contribute to various developmental processes such as the imitative behaviour of infants (and the necessity to integrate perceived and performed actions; Meltzoff & Decety 2003) and communicative acts (Rizzolatti & Arbib 1998).
(b). Ventral meaning integration interface
The involvement of the meaning interface in LL has a broader scope than the previous interface, in the sense that it might be triggered even in contexts in which there are no specific new word tokens to be learned. Let us imagine that in the meaning-inference experiment we performed (Mestres-Missé et al. 2007; see table 1), instead of a new word at the end of a sentence we had presented an abstract figure. Participants would have engaged in the process of meaning inference regardless of the nature of the item presented.
We believe that this interface is an endowed mechanism set up to infer meaning by exploiting internal and external sources of information. This mechanism might be triggered when a specific demand is present to infer the meaning of a situation, new word, discourse, etc., or when conflicting conceptual information exists that requires a specific solution. As a self-triggered learning mechanism, internal reinforcement might depend on the capacity to reduce conflict and uncertainty through the correct inference of the meaning of a specific learning context (Berlyne 1960). Notice that this approach is similar to ideas previously presented in infant learning in relation to the use of multiple extralinguistic and pragmatic cues in order to disambiguate LL (Bloom 2000; Hollich et al. 2000; Tomasello 2003). Thus, the intrinsic drive towards meaning integration might require the participation of multiple cognitive processes related to semantic and conceptual analysis of the information conveyed in the learning context. A similar mechanism might be required in ambiguous communication environments where full syntactic parsing is not possible (Ferreira 2003) or when processing sentences that contain semantically ambiguous words (Rodd et al. 2005).6
The existence of this interface is in agreement with the proposal of the ventral language stream by Hickok & Poeppel (2007). This stream would be selectively activated in meaningful learning contexts and might involve non-traditional perisylvian regions, comprising the medial, inferior and anterior temporal cortex (i.e. STG and sulcus, angular gyrus, the posterior inferior and middle temporal gyrus (MTG), anterior temporal pole) and the ventral inferior frontal gyrus (vIFG, triangular region, BA 45), as well as more orbital, medial and ventrolateral aspects of the prefrontal cortex. The interconnectivity within this network and with other brain regions might be mediated by the inferior longitudinal fasciculus, the inferior fronto-occipital fasciculus (occipito-temporal, sub-insular and frontal branches) and the uncinate fasciculus, which connects the anterior temporal and ventral prefrontal regions (Duffau et al. 2005). These pathways relay important information from occipital visual regions related to object processing and most probably are involved in the link between object representations and lexical labels (Mummery et al. 1999).7 However, a recent DTI study alternatively suggested that the white-matter ventral pathway that crosses through the external capsule, conveying information from two different pathways, the middle longitudinal fasciculus and the inferior longitudinal fasciculus, might be implicated in the iterative exchange of information between the middle and inferior temporal regions and the ventral prefrontal cortex required for accessing lexical, semantic and conceptual representations (Saur et al. 2008). Indeed, these authors hypothesize that this pathway might be crucial for infants to derive meaning and construct conceptual knowledge.
In relation to the functional role of the prefrontal regions in semantic processing, there is agreement in the literature about the role of these regions in selection or controlled semantic retrieval (Badre & Wagner 2002; Gold et al. 2006; Thompson-Schill et al. 2006). For example, the anterior left IFG (pars triangularis, BA 45, and pars orbitalis, BA 47) has been associated with elaborate semantic processing (Petersen et al. 1988; Demb et al. 1995; Ferstl et al. 2008) and, alternatively, with selection from competing semantic features (Thompson-Schill et al. 1997).
The meaning inference paradigm (table 1) provides a good opportunity to evaluate the implication of this interface in LL. In Mestres-Missé et al. (2008), we observed that inferring new word meanings from a verbal context selectively recruited several brain regions of this stream, most importantly the left anterior IFG and the left MTG. In addition, the right and left parahippocampal gyri, the anterior cingulate gyrus and several subcortical structures (e.g. bilateral thalamus, bilateral caudate and left putamen) were also activated (see figure 6a,b).
Figure 6.
(a) Axial, sagittal and coronal views of the direct contrast (n = 12 participants) between the new word inference conditions (meaning and no-meaning contexts) and the real-word conditions. Notice the recruitment of the ventral meaning interface, specifically the left MTG and IFG (BA 45), as well as the parahippocampal gyrus and various subcortical structures (caudate, left putamen and thalamus). (b) Coronal view of the group-average comparison between meaningful (M+) and meaningless (M−) contexts. Notice the larger activation of the left anterior parahippocampal gyrus and the left thalamus in meaningful contexts (adapted from Mestres-Missé et al. 2008).
The large activation observed in the MTG might reflect the activation of stored conceptual information that is necessary to infer the meaning of the new word from the context. This idea of access to semantic information in the MTG is in agreement with the proposal that this region might be a supramodal semantic processing region (Vandenberghe et al. 1996; Price 2000; Lindenberg & Scheef 2007; Patterson et al. 2007) involved in (i) the storage of long-term conceptual knowledge, segregated from word-form representations (Petersen et al. 1988; Martin & Chao 2001), (ii) lexical-semantic processing (Damasio et al. 1996; Ferstl & von Cramon 2001; Keller et al. 2001; Baumgaertner et al. 2002; Dronkers et al. 2004; Indefrey & Levelt 2004), (iii) the activation of visual forms and word meanings (Howard et al. 1992; Pugh et al. 1996; Hagoort et al. 1999), and (iv) increased semantic integration demands (Damasio et al. 1996; Just et al. 1996; Ni et al. 2000; Baumgaertner et al. 2002). In the novel word conditions, a set of initial candidate semantic features might be activated based on the information conveyed by the first sentence, and this primed semantic space might be narrowed down during the second and third sentences or further exposures to the new word in different contexts. This process of zooming into the semantic space of the new word is mediated by the interplay between the MTG and the ventral IFG, the latter being involved in guiding semantic selection/retrieval processes via top-down modulations (Badre & Wagner 2002; Rodd et al. 2005; Gold et al. 2006). However, this mechanism of semantic selection and retrieval might require the monitoring of conflict between candidate meanings or lexical items pre-activated in the semantic network and the final selection of the best fitting candidate concept. This monitoring and selection of the final lexical candidate is probably also mediated by the anterior cingulate–striatum–thalamic loop (see below).
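For illustration only, this ‘zooming in’ on the semantic space of a new word across successive contexts can be sketched as an intersection of candidate feature sets. The mini-lexicon, the features and the context cues below are hypothetical and do not correspond to the materials or the computational claims of Mestres-Missé et al. (2008).

```python
# Illustrative sketch only (hypothetical feature sets, not the authors' model):
# each context sentence activates a set of candidate conceptual features; only
# concepts compatible with every context survive, narrowing the primed space.
from functools import reduce

def candidates_for(context_features, lexicon):
    """Return stored concepts whose features are compatible with one context."""
    return {word for word, feats in lexicon.items() if context_features <= feats}

def infer_meaning(contexts, lexicon):
    """Intersect candidate sets sentence by sentence (a 'zooming-in' process)."""
    candidate_sets = [candidates_for(c, lexicon) for c in contexts]
    # a final selection step (cf. the vIFG / cingulate-striatal monitoring
    # described above) would pick the single best-fitting survivor
    return reduce(set.__and__, candidate_sets)

# Hypothetical mini-lexicon of stored concepts and their features
lexicon = {
    "apple":  {"edible", "round", "grows_on_trees"},
    "ball":   {"round", "toy"},
    "cherry": {"edible", "round", "grows_on_trees", "small"},
}
# Features suggested by three successive sentences containing the new word
contexts = [{"round"}, {"edible"}, {"grows_on_trees", "small"}]
print(infer_meaning(contexts, lexicon))   # -> {'cherry'}
```

A real selection process would of course involve graded activation and competition among candidates rather than a strict set intersection, which is precisely where the top-down modulation attributed to the vIFG would come into play.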
Finally, inductive reasoning might also be an important process aiding this ventral stream of meaning integration. As proposed by Gentner & Namy (2004), analogical reasoning might underlie the comparison processes that are triggered when multiple experiences with specific commonalities and inherent abstract relations are presented in LL. As outlined in figure 4, we have included the middle frontal gyrus (MFG) (BA 10) and more posterior dorsolateral regions (encompassing BA 46 and part of BA 9) as non-specific regions aiding LL processes, which could also be activated, for example, during rule learning (Goel & Dolan 2000; Strange et al. 2001) or when it is necessary to infer the abstract relations between certain elements. Notice that this region was also activated in the speech segmentation studies (figure 5a,b) (see also McNealy et al. 2006) and, therefore, it might provide a high-level cognitive structure involved in inductive and abstract reasoning, fluid intellectual processes and more general problem solving (Gray et al. 2003). These regions might also be involved in the integration of relationships among various stimulus dimensions (e.g. analogy, induction, controlled episodic retrieval processes), which have been maintained and accrued in the working memory system (at more posterior prefrontal regions, see Chein et al. 2003). Indeed, inductive reasoning shows larger activation in dorsolateral regions when compared to deductive reasoning (Goel & Dolan 2004). Corroborating this idea, in a recent study on meaning inference of verbs and nouns (Mestres-Missé et al. in press), we observed that those participants who inferred more meanings from the context showed larger activation in these left anterior prefrontal regions (BA 10/46) (figure 7a).
Figure 7.
(a) Brain areas differentially activated in ‘good’ versus ‘bad’ learners in a meaning-inference paradigm (adapted from Mestres-Missé et al. in press). Eight participants were assigned to the ‘good’ learners group (85.9 ± 8.3%), achieving better meaning recognition percentages than the eight participants assigned to the ‘bad’ learners group (60 ± 3.7%). Notice the increase in activation observed at the anterior medial prefrontal cortex (BA 10) and more posterior sites (BA 46) in good learners (also at the left putamen and right thalamus). (b) From the same experiment, and directly comparing the second versus the first presentation of the new nouns, scatterplots of the correlation between the percentage of meaning recognition for this condition and the beta values of this contrast at the left hippocampus (−20, −20, −16; r = 0.63; p < 0.002) and right hippocampus (32, −12, −20; r = 0.70; p < 0.0001) (n = 21 participants).
(c). Episodic-lexical interface
One important aspect of word learning is that the meaning of a new word can be guessed or mapped with very few presentations (Carey & Bartlett 1978). This fast mapping process has also been observed for learning facts and is present in infants and adults (Markson & Bloom 1997), which speaks in favour of a clear preservation of these mechanisms across the lifespan. In contrast to classic associative learning mechanisms that require multiple exposures, this type of fast (single-trial) learning has traditionally been assigned to declarative memory, which relies on the MTL region, including the hippocampus and the parahippocampal, entorhinal and perirhinal cortices. Owing to the rich connections existing in non-human primates between the superior temporal regions and the polysensory MTL (Lavenex et al. 2002) (see figure 4), the participation of these structures in the initial fast mapping of new words and their meanings is very plausible. The involvement of the MTL region in the acquisition of new lexical knowledge has also been proposed in the declarative/procedural model by Ullman (2001). Furthermore, the implication of the MTL structures and, in particular, the hippocampus in processing novelty, and specifically novel verbal stimuli, is well known (Saykin et al. 1999; Strange et al. 1999).
These structures have traditionally been considered to be implicated in the storage of episodic memories, although the mechanisms involved in the long-term consolidation of these memory traces have been under dispute (McClelland et al. 1995; Nadel & Moscovitch 1997; Squire et al. 2004). An interesting aspect to consider is to what degree new words, which will in the long run become integrated and represented in the mental lexicon (L1 or L2) with the corresponding links to conceptual representations, will require the same type of learning processes as other types of episodic traces (e.g. autobiographical events). For example, it has been proposed that during speech perception, detailed episodic traces of spoken words and non-words are created and remembered for considerable periods (Goldinger 1998). Moreover, the famous patient H.M., who presented anterograde amnesia following bilateral removal of the hippocampus together with the entorhinal and part of the perirhinal cortices, was impaired in the acquisition of new lexical information after the lesion (e.g. the word ‘xerox’, Gabrieli et al. 1988), but not in general lexical and grammatical processing tasks (Kensinger et al. 2001). Based on this study and similar ones, it has been proposed that the MTL region is involved in the acquisition of new semantic information (Bayley & Squire 2005).
However, this interpretation is in conflict with the study reported by Vargha-Khadem et al. (1997) and with more recent studies in which it was observed that H.M. did in fact acquire new factual information (e.g. names of people who became famous after the onset of his amnesia) (O'Kane et al. 2004). In the study of Vargha-Khadem et al., three young amnesic patients with selective hippocampal damage early in life were examined and, despite their episodic memory deficits, these patients showed nearly intact language competence, literacy and general factual knowledge. The authors proposed that the hippocampus plays a selective role in the creation of episodic memories, whereas the surrounding cortical areas (entorhinal, perirhinal and parahippocampal gyrus) could support the acquisition of new lexical and semantic information, even in the absence of the hippocampal area (see also Bayley & Squire 2005; De Haan et al. 2006). In favour of this differential participation of the MTL structures in semantic and episodic memory, Davies et al. (2004) showed that a group of patients with semantic memory impairments (semantic dementia) had reduced volume in the left anterior temporal pole, perirhinal cortex and anterior entorhinal cortex, while patients with Alzheimer's disease showed marked reductions in the hippocampus and the posterior entorhinal cortex.
In the model presented in figure 4, we adopt the idea presented by other authors that the initial binding of new word representations into a memory trace should be MTL dependent (Ullman 2001; Squire et al. 2004; Gaskell & Davis this issue). However, following Nadel & Moscovitch (1997), we propose that the long-term consolidation process would be different for episodic (context-dependent) and linguistic (context-free) traces. In the case of episodic events, the additional rehearsal of a specific trace (which could be achieved exclusively through pure neocortical traces or by relying on MTL-neocortical synaptic connections) will enhance the spatio-temporal context of the event or trace. In contrast, further encounters with a newly acquired word and its rehearsal will gradually weaken the spatio-temporal context trace in which the new word was presented and will enhance its associations to conceptual information (becoming a context-free trace). The differences in both the rehearsal and the long-term consolidation processes will be due to the specific requirements of the linguistic traces. While the efficiency of a consolidation process in episodic autobiographical memory depends on the capacity to frame a trace as accurately as possible into a specific context, for a new word trace the emphasis will be on the association of the new word with its meaning or the conceptual features that define it.
This trade-off in the MTL system between context rehearsal and conceptual rehearsal might explain, in the long run, the apparent division of declarative memory into episodic and semantic memory in humans. It is worth mentioning that a similar idea on how memory representations might change with the passage of time, becoming more semantic/fact-like (context free) and less episodic (context dependent), has been proposed by Bayley et al. (2003). The authors proposed this idea in order to explain how autobiographical memories become independent of MTL structures, as well as some observations in amnesic patients in whom autobiographical memories became integrated into a type of semantic-like representation, totally disconnected from their original spatio-temporal context (see also Cermak 1984).
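As a toy numerical illustration of this proposed trade-off (with arbitrary rates and starting values, not a fitted or empirically grounded model), the sketch below simply shows how repeated encounters could push a word trace towards a context-free, concept-bound representation, while rehearsal of an episodic trace does the opposite.

```python
# Toy illustration of the proposed trade-off; all rates and starting values are
# arbitrary assumptions. With each new encounter, a word trace strengthens its
# link to conceptual features while its binding to the original spatio-temporal
# context fades; rehearsal of an episodic trace shows the opposite pattern.

def consolidate(trace_type, encounters=10, rate=0.3):
    context_binding, concept_binding = 0.5, 0.1    # initial MTL-dependent trace
    for _ in range(encounters):
        if trace_type == "word":                   # re-encountered in ever-new contexts
            context_binding *= (1 - rate)                      # context fades
            concept_binding += rate * (1 - concept_binding)    # meaning accrues
        else:                                      # episodic trace rehearsed as an event
            context_binding += rate * (1 - context_binding)    # context sharpens
            concept_binding *= (1 - rate)                      # little conceptual gain
    return round(context_binding, 2), round(concept_binding, 2)

print("word trace     (context, concept):", consolidate("word"))      # -> (0.01, 0.97)
print("episodic trace (context, concept):", consolidate("episodic"))  # -> (0.99, 0.0)
```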
Considering this proposal and previous neuroanatomical differences in semantic and episodic memory (Vargha-Khadem et al. 1997; Kensinger et al. 2001; Davies et al. 2004; O'Kane et al. 2004; Bayley & Squire 2005; De Haan et al. 2006), a prediction from our LL model would be that the hippocampus and the entorhinal cortex (posterior sites) might be more involved in the initial stages of word learning, while further exposures and consolidation processes would recruit more anterior entorhinal sites and perirhinal and parahippocampal MTL regions. This time-dependent recruitment of the regions encompassing the MTL system would support the hypothesis of a trade-off in the MTL system between context rehearsal and conceptual rehearsal when storing new words and their meanings. Furthermore, this idea is also congruent with the somewhat preserved ability to store new factual information in amnesic patients, at least when sufficient exposure and repetition of this information are available (as might be the case for famous people or events) (O'Kane et al. 2004). As suggested by several authors, the storage of this new semantic information might depend on preserved parahippocampal and perirhinal cortex (Vargha-Khadem et al. 1997), which might allow a more gradual and slow storage of context-free information. Indeed, discovering the meaning of a new word might depend on the selection of the common conceptual attributes present in different contexts (learning experiences), and therefore these spared regions in the MTL (parahippocampal and perirhinal cortices) might be responsible for this type of slow and more gradual learning process. Interestingly, when damage to the MTL region extends to the perirhinal and partially to the parahippocampal cortices, no evidence of learning of new semantic information has been found in a densely amnesic patient (Bayley & Squire 2002, 2005).8
Several studies related to word learning have already provided compelling evidence for the involvement of the MTL regions (hippocampus and parahippocampus) in word learning (Breitenstein et al. 2005; Mestres-Missé et al. 2008; Davis et al. 2009). In our previous study on incremental word learning through contextual information, we observed larger activation in the anterior portion of the parahippocampal gyrus for meaningful contexts when compared to the incoherent condition. The involvement of this region in this type of more incremental and semantically based word learning is in agreement with the ideas exposed above (Vargha-Khadem et al. 1997; O'Kane et al. 2004; Bayley & Squire 2005). In contrast, larger activation was found in the incoherent condition (where no meaning could be derived for the new word) in the left posterior parahippocampal gyrus (Mestres-Missé et al. 2008; see figure 6b). We interpreted this dissociation between the anterior and posterior parahippocampus in line with the proposal that anterior regions within the medial temporal lobe are predominantly involved in encoding, whereas posterior regions subserve retrieval (Lepage et al. 1998; Saykin et al. 1999). Furthermore, in a more recent study using a larger sample, we observed a clear correlation between successful meaning extraction of new words and hippocampal activation (Mestres-Missé et al. in press) (figure 7b). Participants with larger word-discovery rates showed larger activations in the right and left hippocampal regions. These data and previous studies (Breitenstein et al. 2005; Davis et al. 2009) confirm the implication of the hippocampus in the initial stages of learning a new word. Interestingly, further studies might be required in order to understand to what degree these individual differences observed in word learning also depend on the underlying white-matter pathways that interconnect these MTL regions. For example, in a recent behavioural-DTI study, Fuentemilla et al. (2009) showed that individual differences in the recognition and recall of previously presented words were associated with the microstructure of the inferior longitudinal fasciculus, the major white-matter connectivity pathway of the MTL, extending from the ventral and lateral temporal regions to the posterior parahippocampal gyrus (Schmahmann et al. 2007).
(d). Interaction between the three LL streams of processing
It is interesting to consider to what degree the three interfaces might interact with one another and what learning circumstances might trigger their cooperation. For example, an open question (see §3) is what might happen when a new word is learned (e.g. as an output of speech segmentation) but has not been associated with any meaning. The status of this trace will depend on the internal connections between the posterior STG, the phonological-articulatory trace (PMC) and the MTL. In fact, lexical representations might be stored in the brain separately from the MTL structures, and the new word might interact with the existing representations (phonological similarities, etc.) (Clay et al. 2007).
The interrelation between the meaning interface and the episodic-lexical interface also deserves some attention. For example, it is possible that the engagement of inferential reasoning in anterior prefrontal regions might be inversely related to the activation of MTL regions. The process of inference does not require intact storage of the elements presented because it privileges the similarities, not the uniqueness, of the items encoded. For example, Goel & Dolan (2000) reported an inverse correlation between medial frontal and hippocampal activation during inductive rule learning, when encoding new animal tokens. The balance between the MTL and MFG regions could explain the trade-off between processing similarities and inferring rules versus encoding specific tokens or words. Indeed, individual differences in LL could depend on how strongly a learner weights one stream of processing over the other.
In this regard, sentence comprehension processes create a type of gist semantic representation that is independent of the specific verbatim lexical representations (Bransford & Franks 1971; Brewer 1977; Graesser et al. 1994). Models of discourse processing assume that we understand language not only by using local word-level information but also by attending to the wider meaning of the sentential context (Singer 1994). This idea is further supported by memory models that provide evidence for the creation of verbatim and gist-type representations, which might further influence true and false memory recall processes (Brainerd & Reyna 2002). Thus, the meaning interface is an ideal system for creating these gist-semantic representations, which might be integrated and used in anterior frontal regions to infer common relationships and abstract ideas.
A last point when discussing the participation of the MTL structures is their rich interconnectivity with medial diencephalic, ventral striatal and midbrain reward-processing circuits. This interconnectivity between the MTL and this emotion–motivational network could act as a feedback rewarding mechanism and might explain the role of emotion and motivation in LL. This is a very interesting avenue for future research, especially for contrasting feedback-extrinsic learning in adults versus less feedback-dependent learning in infants (Tricomi et al. 2006).
(e). The integrative role of the basal ganglia
Aside from the direct connections between the three streams, basal ganglia structures occupy a privileged position to hold an integrative function between the different LL streams. The striatum (caudate and putamen) acts as a funnel, receiving inputs from different neocortical areas responsible for distinct cognitive functions and sending its outputs back to the cortex through the thalamus, forming different functional loops (Middleton & Strick 2000). There is evidence indicating that different substructures of the basal ganglia, including the thalamus, are implicated in executive functions, attention (Couette et al. 2008), and storage and rehearsal in verbal working memory (Chang et al. 2007). These functions are important in the course of learning, and the role of these subcortical structures, particularly in the acquisition of sequences and in categorization, is well documented (Seger 2006). Because language is sequential by nature, a preponderant subcortical role has been proposed for the acquisition of the different types of information conveyed by language (Lieberman 2000).
Concerning the different LL streams introduced in this section, areas of the dorsal audio-motor stream (SMG, vPMC and vIFG), the ventral meaning stream (middle and inferior temporal gyri, and vIFG) and the episodic-lexical interface project to the caudate nucleus and anterior putamen. The PMC, also involved in the audio-motor stream, projects to the medial putamen (Leh et al. 2007). Recent evidence indicates that circuits involving the striatum show a great deal of interaction (Yin & Knowlton 2006), reinforcing the idea that this structure may have a key position for integrating inputs from different streams. For this reason, several authors have proposed that the basal ganglia (and thalamus) are engaged in modulating, gating and controlling information flow, leading to the selection of appropriate items or behaviours (Haber & Calzavara 2009). For example, in the lexical-semantic interface, they might be responsible for the selection of adequate semantic-lexical items as well as for the inhibition of other candidates (Crosson et al. 2003). In our word-learning study with contextual information (figures 6 and 8), the direct comparison between meaningful and meaningless words yielded differences in the left thalamus. Moreover, a strong correlation was found between activation in this region and the percentage of correct meanings reported (see figure 8). The critical role of subcortical structures in the extraction of meaning from context has also been confirmed by the impairment shown by Huntington's disease (HD) patients with early striatal degeneration (Nogueira Teixeira et al. 2008).
Figure 8.
BOLD activation in the left thalamus when directly comparing meaningful and meaningless contexts (adapted from Mestres-Missé et al. 2008). Three-dimensional view of the thalamus in which the original t-value map of the contrast is surface-projected onto the thalamus-derived template (radiological convention; the left-hemisphere thalamus is projected onto the right side of the brain). On the right, scatter plot of the beta values at the left thalamus (MNI coordinates: 8, 28, 8) against the efficiency of meaning extraction for meaningful contexts (r = 0.799; p < 0.002; n = 12).
In word segmentation and rule extraction, the basal ganglia might be important for maintaining the attentional bias towards the relevant information, as mentioned in §3c. This general cognitive control role also fits well with the results from the extraction of other types of information from speech. HD patients also showed LL difficulties, particularly for the acquisition of the rules embedded in the speech stream (De Diego-Balaguer et al. 2008b). This is also consistent with the idea that these cortico-subcortical circuits are necessary whenever control is required, for example during learning or when ambiguity or violations are present in language (Friederici & Kotz 2003; Wahl et al. 2008). However, more research is needed to clearly disentangle the role of the different structures (striatum versus thalamus) within basal ganglia function.
6. Concluding remarks
The present word-learning simulations were designed to zoom into LL processes that are difficult to observe under natural or more ecological learning conditions. With that purpose, we concentrated on two problems: the isolation of words and rules in continuous speech (speech segmentation) and the mapping of new words onto existing meanings (inferring meaning from verbal contexts). Both problems were evaluated using complementary behavioural and neuroimaging techniques. We further developed an integrative functional connectivity model of three neurophysiological streams of processing in LL experiences.
We proposed the existence of (i) an audio-premotor interface, which is conceived as an internal audio-motor simulator that could be very important in the initial learning of new word phonological forms, (ii) a meaning integration interface, which is envisioned as a mechanism involved in inferring meaning using multiple internal and external cues, and (iii) an episodic-lexical interface, which is in charge of the fast mapping of new words onto specific contexts and the long-term consolidation of these new traces into the lexicon(s). The interaction between these systems is modulated by common cognitive control and other high-level functions, supported by the middle prefrontal cortex (involved in inductive reasoning), the striatum–thalamus subcortical circuits (involved in the coordination of the different streams) and the reward/motivation and feedback processing system. The specific study of these neural circuits and their connectivity, as well as their developmental milestones, might help to understand the sensitive time-windows of each of these LL streams. This information will clarify to what degree adults are able to use these mechanisms for learning new languages. These streams are not proposed to be specific to LL, as they might also subserve learning in many other domains, such as music learning.
One important caveat to consider is that a reliable cognitive neuroscience account of adult and infant LL should not be directed exclusively at delineating the neural networks that support these processes. The crucial combination of online techniques (e.g. EEG, oscillatory activity, MEG) provides more subtle measures that allow researchers to understand the dynamics of these networks and the covert cognitive processes involved in learning. The combination of these tools will allow researchers to test empirical questions derived from new integrative models and accounts of the complex language-learning problem. Moreover, LL simulation experiments might also help to understand language processing in general; in fact, one can envision them as simulation devices or windows into the inner structure of language (see Gaskell & Dumay 2003; Clay et al. 2007; Mestres-Missé et al. 2009).
At the end of the word-learning journey, we will be better prepared to predict how successful an infant and an adult would be when faced with an unknown language. The study of the neural networks and their functional connectivity might have clear implications for the clinical neurorehabilitation of infant and adult language disorders (Cornelissen et al. 2003). Several studies have already evaluated the effect of language training using different neuroimaging tools in anomic patients (Cornelissen et al. 2003; Grönholm et al. 2007).
Finally, brain plasticity in the adult brain is a new challenge for cognitive neuroscience. Although we learned during the last century that brain plasticity is largely reduced in adults, it is also true that new discoveries about neurogenesis and epigenetics in adults open the possibility that some claims about LL will slowly change. An intriguing question is the relationship between LL, brain plasticity and bilingualism. As LL can be considered one of the most demanding cognitive tasks humans can face in a short period of time, it might be interesting to know the impact of LL on brain plasticity and neurogenesis in the adult brain, as well as its long-term effects in terms of neuroprotective mechanisms (Bialystok et al. 2007). What might remain true for the coming years is that research on brain plasticity and neuroscience might change the way we envisage the human learning brain.
Acknowledgements
This research has been supported by grants from the Spanish Government (Ministerio de Ciencia e Innovación, MCIN) to A.R.F. (BSO2002-01211/SEJ2005-06067/PSIC and the Ramon y Cajal program), a predoctoral grant (FPI) to A.M.M. and a postdoctoral grant to R.D.B. We would like to acknowledge the contributions and helpful comments of the following colleagues during the realization of the word-learning experiments presented in this review: A.C. Bachoud-Levi, E. Camara, L. Fuentemilla, T. Gomila, M. Laine, D. Lopez-Barroso, J. Marco-Pallares, T.F. Münte and J.M. Toro.
Endnotes
One contribution of 11 to a Theme Issue ‘Word learning and lexical development across the lifespan’.
1. Krashen (1982) distinguished two independent ways of developing language abilities, i.e. acquisition and learning. Language acquisition is envisioned as a long-term process by which infants and adults store information in the brain unconsciously, being suitable for oral as well as for written language. This acquisition process for second LL would rely on similar mechanisms to the ones involved in first language acquisition. A typical example of this process might be the amount of knowledge that a learner is able to pick up when, for example, an adult visits a foreign country and is exposed for 1 month to a new language. In contrast, LL is considered a conscious process of knowledge development, a process that most of the time is supervised and susceptible to error correction. An example of this type of learning would be the classic, supervised learning of the grammar rules of a new language. With regard to second LL, an interesting idea dealing with cognitive control processes is also presented by Krashen (1982), who states that the knowledge of a language might act as an editor process (self-monitor). Thus, in order to produce a sentence in another language, first the unconscious ‘acquired knowledge’ would come up, and afterwards the editor (learned knowledge) would be used to correct (self-correct) this sentence. A noteworthy claim of this framework is that the ‘learned knowledge’ might only be useful as a self-monitoring process, a time-consuming process which needs to be self-triggered, but without directly determining the fluency of the speaker. In this regard, LL might never become equal to language acquisition.
2. Bates et al. (1998) presented an emergentist approach as a possible solution to explain the origins of language and grammar and, ultimately, as a solution to the nature–nurture controversy. In this approach, innateness is defined as the amount of information in a complex outcome that was contributed by the genes (considering differences in their expression and complex regulation in response to specific environmental signals). Following Elman et al. (1996), three levels of innateness are defined, ordered according to the amount of information that must be contributed by the genes at each level (from strong to weak): (i) representational constraints: the innate structuring of the neural representations that constitute ‘knowledge’, which refers to synaptic connectivity at the cortical level; (ii) architectural constraints: the innate structuring of the information-processing system that must acquire and/or sustain these representations, specified as the number of units, layers and types of connectivity between units, etc.; and (iii) temporal constraints: the innate constraints on the timing of developmental events, which refer to the number of cell divisions that take place in neurogenesis, the spatio-temporal waves of synaptic growth and pruning, and the relative differences in timing between subsystems (vision, audition, etc.). Bates et al. propose that only the last two levels (architectural and temporal constraints), which require much less genetically specified information, might be sufficient to provide an emergent solution to the nature–nurture controversy. Considering this approach, language would have emerged from the interaction between the architectural constraints placed by genes (nature) and the specific situations that an organism encounters in the world (nurture).
3. The idea that children and infants might be able to learn a language better than adults was also proposed early on by Penfield & Roberts (1959).
4. The topography of the difference waveform between meaningful and meaningless new words at the third sentence showed a right parieto-central distribution, which might fit with the standard interpretation that we are observing a modulation of the N400 component (Kutas & Federmeier 2000). However, it is noticeable in the figure that this difference spans a larger time window (from 200 until 700 ms) and could also be related to a superimposed long-lasting positivity for meaningless new words or a long-lasting negativity for meaningful new words. The increased positivity for the meaningless new word condition could indicate a larger effort devoted to unravelling the meaning of the new word, reflected in P3-family components (which is in agreement with the long reading times in the self-paced reading experiment; see Otten et al. 2007). In a similar vein, long-lasting frontal-central negativities might be expected in this situation, considering that in the new word conditions participants faced larger working memory demands than in the real-word condition in order to assemble the different selected semantic aspects needed to derive the meaning of the new word. Long-lasting negativities have previously been observed with increased working memory demands required for syntactic reanalysis (see Munte et al. 1998).
5. There is some controversy about the involvement of the arcuate fasciculus and the superior longitudinal fasciculus in the connection between the posterior STG and the premotor and nearby inferior frontal regions (mostly dorsal and opercular subregions). Schmahmann et al. (2007), using diffusion spectrum imaging (DSI) in post-mortem monkeys, stated that the most likely connection between these regions might be supported by the fibres crossing the extreme capsule, in between the claustrum and the insula. This connection might be complemented by the middle longitudinal fasciculus, which would wire the inferior parietal lobe (near the angular gyrus) to the auditory association areas (STG) and the multimodal upper bank of the superior temporal sulcus. The authors suggest that both systems in monkeys might correspond to a possible precursor of a language comprehension system. This proposal diverges radically from classical views in which the communication between these regions is mediated by the arcuate and superior longitudinal fasciculi (Catani et al. 2005; Saur et al. 2008). These inconsistencies between DTI and DSI data in humans and monkeys could be clarified in the future using comparative DTI–DSI studies across species. For example, in a recent and interesting study, Rilling et al. (2008) found that the projection of one of the branches of the arcuate fasciculus to the middle and inferior temporal lobe, which is clearly present in humans (see Catani et al. 2005), was not evident in non-human primates. Rilling and colleagues proposed that the larger increase in white-matter volume in the frontal and temporal lobes might be related to the evolution of language processing and the transmission of word-meaning information in humans when compared to non-human primates (Rilling & Seligman 2002; Glasser & Rilling 2008).
6. A very similar idea in discourse processing has been proposed by Graesser et al. (1994): the search-after-meaning principle. It states that comprehenders will always attempt (effortfully) to construct meaning out of text, social interactions and perceptual input.
7. There is still disagreement about the crucial areas involved in semantic and conceptual processing and about how semantic-conceptual information is stored and represented in the brain (see Thompson-Schill et al. 2006; Wise & Price 2006; Patterson et al. 2007). At least four different proposals exist for the involvement of the temporo-parietal cortex in the partial storage of these representations: (i) the STG/STS bilaterally, with posterior sites being more involved in form processing and anterior sites in meaning (Scott & Johnsrude 2003); (ii) the left pSTG (including the junction between the left pSTG and the inferior parietal lobe) and the left MTG (Mummery et al. 1999; Binder et al. 2000; Dronkers et al. 2004; Vigneau et al. 2006; Lindenberg & Scheef 2007); (iii) the left pMTG and ITG regions (Luders et al. 1991; Nobre et al. 1994; Hickok & Poeppel 2004); and (iv) the left anterior temporal pole, which is crucially affected in semantic dementia (Patterson et al. 2007; Ferstl et al. 2008).
8. In a recent study, Bayley et al. (2008) evaluated two patients with large MTL lesions extending to the fusiform and insular cortices (the same patients evaluated in Bayley & Squire 2005). In this new study, using different tasks, some residual evidence of new factual learning was detected. This result indicates that, in the case of large MTL lesions, slow and gradual semantic learning is still possible using other neocortical regions, an idea in favour of previous computational proposals (McClelland et al. 1995).
References
- Abla D., Katahira K., Okanoya K.2008On-line assessment of statistical learning by event-related potentials. J. Cognit. Neurosci. 20, 952–964 (doi:10.1162/jocn.2008.20058) [DOI] [PubMed] [Google Scholar]
- Baddeley A., Gathercole S., Papagno C.1998The phonological loop as a language learning device. Psychol. Rev. 105, 158–173 (doi:10.1037/0033-295X.105.1.158) [DOI] [PubMed] [Google Scholar]
- Badre D., Wagner A. D.2002Semantic, retrieval, mnemonic control and prefrontal cortex. Behav. Cogn. Neurosci. Rev. 1, 206–218 (doi:10.1177/1534582302001003002) [DOI] [PubMed] [Google Scholar]
- Baldwin D. A.1991Infant contributions to the achievement of joint reference. Child Dev. 62, 875–890 (doi:10.2307/1131140) [PubMed] [Google Scholar]
- Bates E., Elman J. L., Johnson M., Karmiloff-Smith A., Parisi D., Plunkett K.1998Innateness and emergentism. In A Companion to Cognitive Science (eds Bechtel W., Graham G.), pp. 590–601 Oxford: Basil Blackwell [Google Scholar]
- Baumgaertner A., Weiller C., Buchel C.2002Event-related fMRI reveals cortical sites involved in contextual sentence integration. Neuroimage 16, 736–745 (doi:10.1006/nimg.2002.1134) [DOI] [PubMed] [Google Scholar]
- Bayley P. J., Squire L. R.2002Medial temporal lobe amnesia: gradual acquisition of factual information by nondeclarative memory. J. Neurosci. 22, 5741–5748 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bayley P. J., Squire L. R.2005Failure to acquire new semantic knowledge in patients with large medial temporal lobe lesions. Hippocampus 15, 273–280 (doi:10.1002/hipo.20057) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bayley P. J., Hopkins R. O., Squire L. R.2003Successful recollection of remote autobiographical memories by amnesic patients with medial temporal lobe lesions. Neuron 38, 135–144 (doi:10.1016/S0896-6273(03)00156-9) [DOI] [PubMed] [Google Scholar]
- Bayley P. J., O'Reilly R. C., Curran T., Squire L. R.2008New semantic learning in patients with large medial temporal lobe lesions. Hippocampus 18, 575–583 (doi:10.1002/hipo.20417) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berlyne D. E.1960Conflict, arousal and curiosity New York, NY: McGraw-Hill [Google Scholar]
- Bialystok E.2001Bilingualism in development: language, literacy, and cognition New York, NY: Cambridge University Press [Google Scholar]
- Bialystok E., Sharwood-Smith M.1985Interlanguage is not a state of mind, an evaluation of the construct for second language acquisition. Appl. Linguist. 6, 101–117 (doi:10.1093/applin/6.2.101) [Google Scholar]
- Bialystok E., Craik F. I., Freedman M.2007Bilingualism as a protection against the onset of symptoms of dementia. Neuropsychologia 45, 459–464 (doi:10.1016/j.neuropsychologia.2006.10.009) [DOI] [PubMed] [Google Scholar]
- Binder J. R., Frost J. A., Hammeke T. A., Bellgowan P. S., Springer J. A., Kaufman J. N., Possing E. T.2000Human temporal lobe activation by speech and nonspeech sounds. Cereb. Cortex 10, 512–528 (doi:10.1093/cercor/10.5.512) [DOI] [PubMed] [Google Scholar]
- Birdsong D.1999Second language acquisition and the critical period hypothesis Mahwah, NJ: Elbaum [Google Scholar]
- Birdsong D.2006Age and second language acquisition and processing: a selective overview. Lang. Learn. 56, 9–49 (doi:10.1111/j.1467-9922.2006.00353.x) [Google Scholar]
- Bloom P.2000How children learn the meanings of words Cambridge, MA: MIT Press [Google Scholar]
- Bongaerts T.1999Ultimate attainment in L2 pronunciation: the case of very advanced late learners. In Second language acquisition and the critical period hypothesis (ed. Birdsong D.), Mahwah, NJ: Erlbaum [Google Scholar]
- Borovsky A., Elman J., Kutas M.2008Getting the gist is not enough: an ERP investigation of word learning from context. In Proceedings of the 29th Annual Conference of the Cognitive Science Society (eds McNamara D. C., Trafton J. G.), pp. 119–124 Austin, TX: Cognitive Science Society [Google Scholar]
- Brainerd C. J., Reyna V. F.2002Fuzzy-trace theory and false memory. Curr. Dir. Psychol. Sci. 11, 164–169 (doi:10.1111/1467-8721.00192) [Google Scholar]
- Bransford J. D., Franks J. J.1971The abstraction of linguistic ideas. Cogn. Psychol. 2, 331–350 (doi:10.1016/0010-0285(71)90019-3) [Google Scholar]
- Breitenstein C., Jansen A., Deppe M., Foerster A. F., Sommer J., Wolbers T., Knecht S.2005Hippocampus activity differentiates good from poor learners of a novel lexicon. Neuroimage 25, 958–968 (doi:10.1016/j.neuroimage.2004.12.019) [DOI] [PubMed] [Google Scholar]
- Brewer W. F.1977Memory for the pragmatic implications of sentences. Mem. Cognit. 5, 673–678 [DOI] [PubMed] [Google Scholar]
- Bruer J.2003The myth of the first three years: a new understanding of early brain development and lifelong learning New York, NY: The Free Press [Google Scholar]
- Buchsbaum B. R., Olsen R. K., Koch P., Berman K. F.2005Human dorsal and ventral auditory streams subserve rehearsal-based and echoic processes during verbal working memory. Neuron 48, 687–697 (doi:10.1016/j.neuron.2005.09.029) [DOI] [PubMed] [Google Scholar]
- Buonomano D. V., Merzenich M. M.1998Cortical plasticity: from synapses to maps. Annu. Rev. Neurosci. 21, 149–186 (doi:10.1146/annurev.neuro.21.1.149) [DOI] [PubMed] [Google Scholar]
- Carey S., Barlett E.1978Acquiring a single new word. Pap. R. Child 15, 17–29 [Google Scholar]
- Carnine D., Kameenui E. J., Coyle G.2008Utilization of contextual information in determining the meaning of unfamiliar words. Read. Res. Quart. 19, 188–204 (doi:10.2307/747362) [Google Scholar]
- Carroll J. B.1993Human cognitive abilities: a survey of factor-analytic studies Cambridge, UK: Cambridge University Press [Google Scholar]
- Carver R. P., Leibert R. E.1995The effect of reading library books at different levels of difficulty upon gain in reading. Read. Res. Quart. 30, 26–48 (doi:10.2307/747743) [Google Scholar]
- Casey B. J., Giedd J. N., Thomas K. M.2000Structural and functional brain development and its relation to cognitive development. Biol. Psychol. 54, 241–257 (doi:10.1016/S0301-0511(00)00058-2) [DOI] [PubMed] [Google Scholar]
- Catani M., Jones D. K., ffytche D. H.2005Perisylvian language networks of the human brain. Ann. Neurol. 57, 8–16 (doi:10.1002/ana.20319) [DOI] [PubMed] [Google Scholar]
- Cermak L. S.1984The episodic-semantic distinction in amnesia. In The neuropsychology of memory (eds Squire L. R., Butters N.), pp. 55–62 New York, NY: Guilford Press [Google Scholar]
- Chaffin R., Morris R. K., Seely R. E.2001Learning new word meanings from context: a study of eye movements. J. Exp. Psychol. Learn. Mem. Cogn. 27, 225–235 (doi:10.1037/0278-7393.27.1.225) [PubMed] [Google Scholar]
- Chang C., Crottaz-Herbette S., Menon V.2007Temporal dynamics of basal ganglia response and connectivity during verbal working memory. Neuroimage 34, 1253–1269 (doi:10.1016/j.neuroimage.2006.08.056) [DOI] [PubMed] [Google Scholar]
- Chein J. M., Ravizza S. M., Fiez J. A.2003Using neuroimaging to evaluate models of working memory and their implications for language processing. J. Neurolinguist. 16, 315–339 (doi:10.1016/S0911-6044(03)00021-6) [Google Scholar]
- Chomsky N.2002On nature and language Cambridge, UK: Cambridge University Press [Google Scholar]
- Clay F., Bowers J. S., Davis C. J., Hanley D. A.2007Teaching adults new words: the role of practice and consolidation. J. Exp. Psychol. Learn. Mem. Cogn. 33, 970–976 (doi:10.1037/0278-7393.33.5.970) [DOI] [PubMed] [Google Scholar]
- Conboy B. T., Mills D. L.2006Two languages, one developing brain: event-related potentials to words in bilingual toddlers. Dev. Sci. 9, F1–F12 (doi:10.1111/j.1467-7687.2005.00453.x) [DOI] [PubMed] [Google Scholar]
- Coppieters R.1987Competence differences between native and near-native speakers. Language 63, 544–573 (doi:10.2307/415005) [Google Scholar]
- Cornelissen K., Laine M., Tarkiainen A., Jarvensivu T., Martin N., Salmelin R.2003Adult brain plasticity elicited by anomia treatment. J. Cogn. Neurosci. 15, 444–461 (doi:10.1162/089892903321593153) [DOI] [PubMed] [Google Scholar]
- Cornelissen K., Laine M., Renvall K., Saarinen T., Martin N., Salmelin R.2004Learning new names for new objects: cortical effects as measured by magnetoencephalography. Brain Lang. 89, 617–622 (doi:10.1016/j.bandl.2003.12.007) [DOI] [PubMed] [Google Scholar]
- Couette M., Bachoud-Levi A. C., Brugieres P., Sieroff E., Bartolomeo P.2008Orienting of spatial attention in Huntington's disease. Neuropsychologia 46, 1391–1400 (doi:10.1016/j.neuropsychologia.2007.12.017) [DOI] [PubMed] [Google Scholar]
- Crosson B., et al. 2003Left and right basal ganglia and frontal activity during language generation: contributions to lexical, semantic, and phonological processes. J. Int. Neuropsychol. Soc. 9, 1061–1077 [DOI] [PubMed] [Google Scholar]
- Cunillera T., Toro J. M., Sebastian-Galles N., Rodriguez-Fornells A.2006The effects of stress and statistical cues on continuous speech segmentation: an event-related brain potential study. Brain Res. 1123, 168–178 (doi:10.1016/j.brainres.2006.09.046) [DOI] [PubMed] [Google Scholar]
- Cunillera T., Gomila A., Rodriguez-Fornells A.2008Beneficial effects of word final stress in segmenting a new language: evidence from ERPs. BMC Neurosci. 9, 23 (doi:10.1186/1471-2202-9-23) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cunillera T., Camara E., Toro J. M., Marco-Pallares J., Sebastian-Galles N., Ortiz H., Pujol J., Rodriguez-Fornells A.2009Time course and functional neuroanatomy of speech segmentation in adults. 48, 541–553 [DOI] [PubMed] [Google Scholar]
- Damasio H., Grabowski T. J., Tranel D., Hichwa R. D., Damasio A. R.1996A neural basis for lexical retrieval. Nature 380, 499–505 (doi:10.1038/380499a0) [DOI] [PubMed] [Google Scholar]
- Davies R. R., Graham K. S., Xuereb J. H., Williams G. B., Hodges J. R.2004The human perirhinal cortex and semantic memory. Eur. J. Neurosci. 20, 2441–2446 (doi:10.1111/j.1460-9568.2004.03710.x) [DOI] [PubMed] [Google Scholar]
- Davis M. H., Di Betta A. M., Macdonald M. J., Gaskell M. G.2009Learning and consolidation of novel spoken words. J. Cogn. Neurosci. 21, 803–820 (doi:10.1162/jocn.2009.21059) [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Diego-Balaguer R., Toro J. M., Rodriguez-Fornells A., Bachoud-Levi A. C.2007Different neurophysiological mechanisms underlying word and rule extraction from speech. PLoS ONE 2, e1175 (doi:10.1371/journal.pone.0001175) [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Diego-Balaguer R., Andre M., Rodriguez-Fornells A., Bachoud-Levi A. C.2008aDecoding speech to extract the rules of language. In Proc. of the 15th Meeting of the European Society of Cognitive Psychology, Marseille, France [Google Scholar]
- De Diego-Balaguer R., Couette M., Dolbeau G., Durr A., Youssov K., Bachoud-Levi A. C.2008bStriatal degeneration impairs language learning: evidence from Huntington's disease. Brain 131, 2870–2881 (doi:10.1093/brain/awn242) [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Felipe J.2006Brain plasticity and mental processes: Cajal again. Nat. Rev. Neurosci. 7, 811–817 (doi:10.1038/nrn2005) [DOI] [PubMed] [Google Scholar]
- De Haan M., Mishkin M., Baldeweg T., Vargha-Khadem F.2006Human memory development and its dysfunction after early hippocampal injury. Trends Neurosci. 29, 374–381 [DOI] [PubMed] [Google Scholar]
- DeKeyser R. M.1997Beyond explicit rule learning: automatizing second language morphosyntax. Stud. Second Lang. Acq. 19, 195–221 [Google Scholar]
- Dell G. S.1986A spreading-activation theory of retrieval in sentence production. Psychol. Rev. 93, 283–321 (doi:10.1037/0033-295X.93.3.283) [PubMed] [Google Scholar]
- Dell G. S., Reich P. A.1981Stages in sentence production: an analysis of speech error data. J. of Verbal Learning and Verbal Behavior 20, 611–629 [Google Scholar]
- Demb J. B., Desmond J. E., Wagner A. D., Vaidya C. J., Glover G. H., Gabrieli J. D.1995Semantic encoding and retrieval in the left inferior prefrontal cortex: a functional MRI study of task difficulty and process specificity. J. Neurosci. 15, 5870–5878 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Desmurget M., Grafton S.2000Forward modeling allows feedback control for fast reaching movements. Trends Cogn. Sci. 4, 423–431 (doi:10.1016/S1364-6613(00)01537-0) [DOI] [PubMed] [Google Scholar]
- Diamond A.2002Normal development of prefrontal cortex from birth to young adulthood: cognitive functions, anatomy, and biochemistry. In Principles of frontal lobe function (eds Stuss D. T., Knight R. T.), pp. 466–503 New York, NY: Oxford University Press [Google Scholar]
- Doupe A. J., Kuhl P. K.1999Birdsong and human speech: common themes and mechanisms. Annu. Rev. Neurosci. 22, 567–631 (doi:10.1146/annurev.neuro.22.1.567) [DOI] [PubMed] [Google Scholar]
- Doyon J., Song A. W., Karni A., Lalonde F., Adams M. M., Ungerleider L. G.2002Experience-dependent changes in cerebellar contributions to motor sequence learning. Proc. Natl. Acad. Sci. USA 99, 1017–1022 (doi:10.1073/pnas.022615199) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dronkers N. F., Wilkins D. P., Van Valin R. D., Redfern B. B., Jaeger J. J.2004Lesion analysis of the brain areas involved in language comprehension. Cognition 92, 145–177 (doi:10.1016/j.cognition.2003.11.002) [DOI] [PubMed] [Google Scholar]
- Duffau H.2008The anatomo-functional connectivity of language revisited. New insights provided by electrostimulation and tractography. Neuropsychologia 46, 927–934 (doi:10.1016/j.neuropsychologia.2007.10.025) [DOI] [PubMed] [Google Scholar]
- Duffau H., Gatignol P., Mandonnet E., Peruzzi P., Tzourio-Mazoyer N., Capelle L.2005New insights into the anatomo-functional connectivity of the semantic system: a study using cortico-subcortical electrostimulations. Brain 128, 797–810 (doi:10.1093/brain/awh423) [DOI] [PubMed] [Google Scholar]
- Dumay N., Gaskell M. G.2007Sleep-associated changes in the mental representation of spoken words. Psychol. Sci. 18, 35–39 (doi:10.1111/j.1467-9280.2007.01845.x) [DOI] [PubMed] [Google Scholar]
- Durkin D.1979What classroom observations reveal about reading comprehension. Read. Res. Quart. 14, 518–544 [Google Scholar]
- Echols C. H., Newport E. L.1992The role of stress and position in determining first words. Lang. Acquis. 2, 189–220 (doi:10.1207/s15327817la0203_1) [Google Scholar]
- Ellis N.2008Usage-based and form-focused language acquisition: the associative learning of constructions, learned-attention and the limited L2 endstate. In Handbook of cognitive linguistics and second language acquisition, pp. 372–405 New York, NY: Taylor & Francis [Google Scholar]
- Elman J. L., Bates E., Johnson M., Karmiloff-Smith A., Parisi D., Plunkett K.1996Rethinking innateness: a connectionist perspective on development Cambridge, MA: MIT Press [Google Scholar]
- Endress A. D., Scholl B. J., Mehler J.2005The role of salience in the extraction of algebraic rules. J. Exp. Psychol. Gen. 134, 406–419 (doi:10.1037/0096-3445.134.3.406) [DOI] [PubMed] [Google Scholar]
- Fadiga L., Craighero L., Buccino G., Rizzolatti G.2002Speech listening specifically modulates the excitability of tongue muscles: a TMS study. Eur. J. Neurosci. 15, 399–402 (doi:10.1046/j.0953-816x.2001.01874.x) [DOI] [PubMed] [Google Scholar]
- Ferreira F.2003The misinterpretation of noncanonical sentences. Cogn. Psychol. 47, 164–203 (doi:10.1016/S0010-0285(03)00005-7) [DOI] [PubMed] [Google Scholar]
- Ferstl E. C., von Cramon D. Y.2001The role of coherence and cohesion in text comprehension: an event-related fMRI study. Cogn. Brain Res. 11, 325–340 (doi:10.1016/S0926-6410(01)00007-6) [DOI] [PubMed] [Google Scholar]
- Ferstl E. C., Neumann J., Bogler C., von Cramon D. Y.2008The extended language network: a meta-analysis of neuroimaging studies on text comprehension. Hum. Brain Mapp. 29, 581–593 (doi:10.1002/hbm.20422) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flege J. E.1987A critical period for learning to pronounce foreign languages? Appl. Linguist. 8, 162–177 (doi:10.1093/applin/8.2.162) [Google Scholar]
- Fowler A. E.1991How early phonological development might set the stage for phoneme awareness. In Phonological processes in literacy: a tribute to Isabelle Y. Liberman (eds Brady S. A., Shankweiler D. P.), pp. 97–117 Hillsdale, NJ: Erlbaum [Google Scholar]
- Friederici A. D., Kotz S. A.2003The brain basis of syntactic processes: functional imaging and lesion studies. Neuroimage 20, S8–S17 (doi:10.1016/j.neuroimage.2003.09.003) [DOI] [PubMed] [Google Scholar]
- Friedrich M., Friederici A. D.2008Neurophysiological correlates of online word learning in 14-month-old infants. Neuroreport 19, 1757–1761 (doi:10.1097/WNR.0b013e328318f014) [DOI] [PubMed] [Google Scholar]
- Fuentemilla L., Camara E., Münte T. F., Krämer U. M., Cunillera T., Marco-Pallares J., Tempelmann K., Rodriguez-Fornells A.2009Individual differences in true and false memory retrieval are related to white matter brain microstructure. J. Neurosci. 29, 8698–8703 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gabrieli J. D., Cohen N. J., Corkin S.1988The impaired learning of semantic knowledge following bilateral medial temporal-lobe resection. Brain Cogn. 7, 157–177 (doi:10.1016/0278-2626(88)90027-9) [DOI] [PubMed] [Google Scholar]
- Gaskell M. G., Dumay N.2003Lexical competition and the acquisition of novel words. Cognition 89, 105–132 (doi:10.1016/S0010-0277(03)00070-2) [DOI] [PubMed] [Google Scholar]
- Gelfand J. R., Bookheimer S. Y.2003Dissociating neural mechanisms of temporal sequencing and processing phonemes. Neuron 38, 831–842 (doi:10.1016/S0896-6273(03)00285-X) [DOI] [PubMed] [Google Scholar]
- Gentner D., Namy L. L.2004The role of comparison in children's early word learning. In Weaving a lexicon (eds Hall D. G., Waxman S. R.), pp. 533–568 Cambridge, MA: MIT Press [Google Scholar]
- Gillette J., Gleitman H., Gleitman L., Lederer A.1999Human simulations of vocabulary learning. Cognition 73, 135–176 (doi:10.1016/S0010-0277(99)00036-0) [DOI] [PubMed] [Google Scholar]
- Glasser M. F., Rilling J. K.2008DTI tractography of the human brain's language pathways. Cereb. Cortex 18, 2471–2482 (doi:10.1093/cercor/bhn011) [DOI] [PubMed] [Google Scholar]
- Gleitman L., Newport E. L.1997The invention of language by children: environmental and biological influences on the acquisition of language. In Language: an invitation to cognitive science, 2nd edn., vol. 1 (eds Gleitman L., Liberman M.), pp. 1–24 Cambridge, MA: The MIT Press [Google Scholar]
- Gleitman L., Wanner E.1982The state of the state of the art. In Language acquisition: the state of the art (eds Wanner E., Gleitman L.), pp. 3–48 Cambridge, UK: Cambridge University Press [Google Scholar]
- Goel V., Dolan R. J.2000Anatomical segregation of component processes in an inductive inference task. J. Cogn. Neurosci. 12, 110–119 (doi:10.1162/08989290051137639) [DOI] [PubMed] [Google Scholar]
- Goel V., Dolan R. J.2004Differential involvement of left prefrontal cortex in inductive and deductive reasoning. Cognition 93, B109–B121 (doi:10.1016/j.cognition.2004.03.001) [DOI] [PubMed] [Google Scholar]
- Gold B. T., Balota D. A., Jones S. J., Powell D. K., Smith C. D., Andersen A. H.2006Dissociation of automatic and strategic lexical-semantics: functional magnetic resonance imaging evidence for differing roles of multiple frontotemporal regions. J. Neurosci. 26, 6523–6532 (doi:10.1523/JNEUROSCI.0808-06.2006) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldinger S. D.1998Echoes of echoes? An episodic theory of lexical access. Psychol. Rev. 105, 251–279 (doi:10.1037/0033-295X.105.2.251) [DOI] [PubMed] [Google Scholar]
- Golestani N., Zatorre R. J.2004Learning new sounds of speech: reallocation of neural substrates. Neuroimage 21, 494–506 (doi:10.1016/j.neuroimage.2003.09.071) [DOI] [PubMed] [Google Scholar]
- Golestani N., Molko N., Dehaene S., LeBihan D., Pallier C.2007Brain structure predicts the learning of foreign speech sounds. Cereb. Cortex 17, 575–582 (doi:10.1093/cercor/bhk001) [DOI] [PubMed] [Google Scholar]
- Golinkoff R. M., Hirsh-Pasek K., Smith L. B., Woodward A. L., Akhtar N., Tomasello M., Hollich G.2000Becoming a word learner: a debate on lexical acquisition New York, NY: Oxford University Press [Google Scholar]
- Gomez R., Maye J.2005The developmental trajectory of nonadjacent dependency learning. Infancy 7, 183–206 (doi:10.1207/s15327078in0702_4) [DOI] [PubMed] [Google Scholar]
- Graesser A. C., Singer M., Trabasso T.1994Constructing inferences during narrative text comprehension. Psychol. Rev. 101, 371–395 (doi:10.1037/0033-295X.101.3.371) [DOI] [PubMed] [Google Scholar]
- Graf E. K., Evans J. L., Alibali M. W., Saffran J. R.2007Can infants map meaning to newly segmented words? Statistical segmentation and word learning. Psychol. Sci. 18, 254–260 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gray J. R., Chabris C. F., Braver T. S.2003Neural mechanisms of general fluid intelligence. Nat. Neurosci. 6, 316–322 (doi:10.1038/nn1014) [DOI] [PubMed] [Google Scholar]
- Grönholm P., Rinne J. O., Vorobyev V. A., Laine M.2005Naming of newly learned objects: A PET activation study. Cogn. Brain Res. 25, 359–371 (doi:10.1016/j.cogbrainres.2005.06.010) [DOI] [PubMed] [Google Scholar]
- Grönholm P., Rinne J. O., Vorobyev V. A., Laine M.2007Neural correlates of naming newly learned objects in MCI. Neuropsychologia 45, 2355–2368 (doi:10.1016/j.neuropsychologia.2007.02.003) [DOI] [PubMed] [Google Scholar]
- Haber S. N., Calzavara R.2009The cortico-basal ganglia integrative network: the role of the thalamus. Brain Res. Bull. 78, 69–74 (doi:10.1016/j.brainresbull.2008.09.013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hagoort P., Indefrey P., Brown C., Herzog H., Steinmetz H., Seitz R. J.1999The neural circuitry involved in the reading of German words and pseudowords: a PET study. J. Cogn. Neurosci. 11, 383–398 (doi:10.1162/089892999563490) [DOI] [PubMed] [Google Scholar]
- Hauser M. D., Bever T.2008Behavior. A biolinguistic agenda. Science 322, 1057–1059 (doi:10.1126/science.1167437) [DOI] [PubMed] [Google Scholar]
- Hickok G., Poeppel D.2000Towards a functional neuroanatomy of speech perception. Trends Cogn. Sci. 4, 131–138 (doi:10.1016/S1364-6613(00)01463-7) [DOI] [PubMed] [Google Scholar]
- Hickok G., Poeppel D.2004Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language. Cognition 92, 67–99 (doi:10.1016/j.cognition.2003.10.011) [DOI] [PubMed] [Google Scholar]
- Hickok G., Poeppel D.2007The cortical organization of speech processing. Nat. Rev. Neurosci. 8, 393–402 (doi:10.1038/nrn2113) [DOI] [PubMed] [Google Scholar]
- Hollich G. J., Hirsh-Pasek K., Golinkoff R. M., Brand R. J., Brown E., Chung H. L., Hennon E., Rocroi C.2000Breaking the language barrier: an emergentist coalition model for the origins of word learning. Monogr. Soc. Res. Child Dev. 65, 1–123 [PubMed] [Google Scholar]
- Howard D., Patterson K., Wise R., Brown W. D., Friston K., Weiller C., Frackowiak R.1992The cortical localization of the lexicons. Positron emission tomography evidence. Brain 115, 1769–1782 (doi:10.1093/brain/115.6.1769) [DOI] [PubMed] [Google Scholar]
- Hulten A., Vihla M., Laine M., Salmelin R.2009Accessing newly learned names and meanings in the native language. Hum. Brain Mapp. 30, 976–989 (doi:10.1002/hbm.20561) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hyltenstam K., Abrahamsson N.2000Who can become native-like in a second language? All, some, or none? On the maturational constraints controversy in second language acquisition. Stud. Linguist. 54, 150–166 (doi:10.1111/1467-9582.00056) [Google Scholar]
- Iacoboni M., Woods R. P., Brass M., Bekkering H., Mazziotta J. C., Rizzolatti G.1999Cortical mechanisms of human imitation. Science 286, 2526–2528 (doi:10.1126/science.286.5449.2526) [DOI] [PubMed] [Google Scholar]
- Indefrey P., Levelt W. J.2004The spatial and temporal signatures of word production components. Cognition 92, 101–144 (doi:10.1016/j.cognition.2002.06.001) [DOI] [PubMed] [Google Scholar]
- Jusczyk P. W.1999How infants begin to extract words from speech. Trends Cogn. Sci. 3, 323–328 (doi:10.1016/S1364-6613(99)01363-7) [DOI] [PubMed] [Google Scholar]
- Just M. A., Carpenter P. A., Keller T. A., Eddy W. F., Thulborn K. R.1996Brain activation modulated by sentence comprehension. Science 274, 114–116 (doi:10.1126/science.274.5284.114) [DOI] [PubMed] [Google Scholar]
- Keller T. A., Carpenter P. A., Just M. A.2001The neural bases of sentence comprehension: a fMRI examination of syntactic and lexical processing. Cereb. Cortex 11, 223–237 (doi:10.1093/cercor/11.3.223) [DOI] [PubMed] [Google Scholar]
- Kensinger E. A., Ullman M. T., Corkin S.2001Bilateral medial temporal lobe damage does not affect lexical or grammatical processing: evidence from amnesic patient H. M. Hippocampus 11, 347–360 (doi:10.1002/hipo.1049) [DOI] [PubMed] [Google Scholar]
- Klein W.1996Language acquisition at different ages. In The lifespan development of individuals: behavioral, neurobiological, and psychosocial perspectives (ed. Magnusson D.), pp. 244–264 Cambridge, UK: Cambridge University Press [Google Scholar]
- Kohler E., Keysers C., Umilta M. A., Fogassi L., Gallese V., Rizzolatti G.2002Hearing sounds, understanding actions: action representation in mirror neurons. Science 297, 846–848 (doi:10.1126/science.1070311) [DOI] [PubMed] [Google Scholar]
- Krashen S. J.1982Principles and practice in second language acquisition New York, NY: Pergamon [Google Scholar]
- Kroll J. F., Stewart E.1994Category interference in translation and picture naming: evidence for asymmetric connections between bilingual memory representations. J. Mem. Lang. 33, 149–174 (doi:10.1006/jmla.1994.1008) [Google Scholar]
- Kuhl P. K.2004Early language acquisition: cracking the speech code. Nat. Rev. Neurosci. 5, 831–843 (doi:10.1038/nrn1533) [DOI] [PubMed] [Google Scholar]
- Kuhl P., Rivera-Gaxiola M.2008Neural substrates of language acquisition. Annu. Rev. Neurosci. 31, 511–534 (doi:10.1146/annurev.neuro.30.051606.094321) [DOI] [PubMed] [Google Scholar]
- Kutas M., Federmeier K. D.2000Electrophysiology reveals semantic memory use in language comprehension. Trends Cogn. Sci. 4, 463–470 (doi:10.1016/S1364-6613(00)01560-6) [DOI] [PubMed] [Google Scholar]
- Landauer T. K., Dumais S. T.1997A solution to Plato's problem: the Latent Semantic Analysis theory of acquisition, induction and representation of knowledge. Psychol. Rev. 104, 211–240 (doi:10.1037/0033-295X.104.2.211) [Google Scholar]
- Laufer B.2003Vocabulary acquisition in a second language: do learners really acquire most vocabulary by reading? Some empirical evidence. Can. Mod. Lang. Rev. 59, 1710 [Google Scholar]
- Lavenex P., Suzuki W. A., Amaral D. G.2002Perirhinal and parahippocampal cortices of the macaque monkey: projections to the neocortex. J. Comp. Neurol. 447, 394–420 (doi:10.1002/cne.10243) [DOI] [PubMed] [Google Scholar]
- Leh S. E., Ptito A., Chakravarty M. M., Strafella A. P.2007Fronto-striatal connections in the human brain: a probabilistic diffusion tractography study. Neurosci. Lett. 419, 113–118 (doi:10.1016/j.neulet.2007.04.049) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lepage M., Habib R., Tulving E.1998Hippocampal PET activations of memory encoding and retrieval: the HIPER model. Hippocampus 8, 313–322 (doi:10.1002/(SICI)1098-1063(1998)8:4<313::AID-HIPO1>3.0.CO;2-I) [DOI] [PubMed] [Google Scholar]
- Liberman A. M., Mattingly I. G.1985The motor theory of speech perception revised. Cognition 21, 1–36 (doi:10.1016/0010-0277(85)90021-6) [DOI] [PubMed] [Google Scholar]
- Lieberman P.2000Human language and our reptilian brain: the subcortical bases of speech, syntax and thought Cambridge, MA: Harvard University Press [DOI] [PubMed] [Google Scholar]
- Lindenberg R., Scheef L.2007Supramodal language comprehension: role of the left temporal lobe for listening and reading. Neuropsychologia 45, 2407–2415 (doi:10.1016/j.neuropsychologia.2007.02.008) [DOI] [PubMed] [Google Scholar]
- Lopez-Barroso D., De Diego-Balaguer R., Cunillera T., Camara E., Rodríguez-Fornells A.2009Working memory effects on rule learning and speech segmentation: a combined behavioural-DTI study. Hum. Brain Mapp. 47, 96 [Google Scholar]
- Luck S. J., Hillyard S. A.1994Spatial filtering during visual search: evidence from human electrophysiology. J. Exp. Psychol. Human 20, 1000–1014 (doi:10.1037/0096-1523.20.5.1000) [DOI] [PubMed] [Google Scholar]
- Luders H., Lesser R. P., Hahn J., Dinner D. S., Morris H. H., Wyllie E., Godoy J.1991Basal temporal language area. Brain 114, 743–754 (doi:10.1093/brain/114.2.743) [DOI] [PubMed] [Google Scholar]
- Markman E. M.1989Categorization and naming in children Cambridge, MA: MIT Press [Google Scholar]
- Markson L., Bloom P.1997Evidence against a dedicated system for word learning in children. Nature 385, 813–815 (doi:10.1038/385813a0) [DOI] [PubMed] [Google Scholar]
- Martens M. A., Wilson S. J., Reutens D. C.2008Research review: Williams Syndrome: a critical review of the cognitive, behavioral, and neuroanatomical phenotype. J. Child Psychol. Psychiatry 49, 576–608 (doi:10.1111/j.1469-7610.2008.01887.x) [DOI] [PubMed] [Google Scholar]
- Martin A., Chao L. L.2001Semantic memory and the brain: structure and processes. Curr. Opin. Neurobiol. 11, 194–201 (doi:10.1016/S0959-4388(00)00196-3) [DOI] [PubMed] [Google Scholar]
- McClelland J. L., McNaughton B. L., O'Reilly R. C.1995Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. Psychol. Rev. 102, 419–457 (doi:10.1037/0033-295X.102.3.419) [DOI] [PubMed] [Google Scholar]
- McGregor K. K., Friedman R. M., Reilly R. M., Newman R. M.2002Semantic representation and naming in young children. J. Speech Lang. Hear. Res. 45, 332–346 (doi:10.1044/1092-4388(2002/026)) [DOI] [PubMed] [Google Scholar]
- McKeown M.1985The acquisition of word meaning from context by children of high and low ability. Read. Res. Quart. 20, 482–496 (doi:10.2307/747855) [Google Scholar]
- McLaughlin J., Osterhout L., Kim A.2004Neural correlates of second-language word learning: minimal instruction produces rapid change. Nat. Neurosci. 7, 703–704 (doi:10.1038/nn1264) [DOI] [PubMed] [Google Scholar]
- McNealy K., Mazziotta J. C., Dapretto M.2006Cracking the language code: neural mechanisms underlying speech parsing. J. Neurosci. 26, 7629–7639 (doi:10.1523/JNEUROSCI.5501-05.2006) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mei L., Chen C., Xue G., He Q., Li T., Xue F., Yang Q., Dong Q.2008Neural predictors of auditory word learning. Neuroreport 19, 215–219 (doi:10.1097/WNR.0b013e3282f46ea9) [DOI] [PubMed] [Google Scholar]
- Meister I. G., Wilson S. M., Deblieck C., Wu A. D., Iacoboni M.2007The essential role of premotor cortex in speech perception. Curr. Biol. 17, 1692–1696 (doi:10.1016/j.cub.2007.08.064) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meltzoff A. N., Decety J.2003What imitation tells us about social cognition: a rapprochement between developmental psychology and cognitive neuroscience. Phil. Trans. R. Soc. B 358, 491–500 (doi:10.1098/rstb.2002.1261) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mestres-Missé A., Rodriguez-Fornells A., Munte T. F.2007Watching the brain during meaning acquisition. Cereb. Cortex 17, 1858–1866 (doi:10.1093/cercor/bhl094) [DOI] [PubMed] [Google Scholar]
- Mestres-Missé A., Camara E., Rodriguez-Fornells A., Rotte M., Munte T. F.2008Functional neuroanatomy of meaning acquisition from context. J. Cogn. Neurosci. 20, 2153–2166 (doi:10.1162/jocn.2008.20150) [DOI] [PubMed] [Google Scholar]
- Mestres-Missé A., Rodriguez-Fornells A., Münte T. F.In press Neural differences in the mapping of verb and noun concepts onto novel words. Neuroimage. (doi:10.1016/j.neuroimage.2009.10.018) [DOI] [PubMed] [Google Scholar]
- Mestres-Missé A., Munte T. F., Rodriguez-Fornells A.2009Functional neuroanatomy of contextual acquisition of concrete and abstract words. J. Cogn. Neurosci. 21, 2154–2171 (doi:10.1162/jocn.2008.21171) [DOI] [PubMed] [Google Scholar]
- Metsala J. L., Walley A. C.1998Spoken vocabulary growth and the segmental restructuring of lexical representations: precursors to phonemic awareness and early reading ability. In Word recognition in beginning literacy (eds Metsala J. L., Ehri L. C.), pp. 89–120 Mahwah, NJ: Lawrence Erlbaum [Google Scholar]
- Middleton F. A., Strick P. L.2000Basal ganglia output and cognition: evidence from anatomical, behavioral, and clinical studies. Brain Cogn. 42, 183–200 (doi:10.1006/brcg.1999.1099) [DOI] [PubMed] [Google Scholar]
- Mills D. L., Plunkett K., Prat C., Schafer G.2005Watching the infant brain learn words: effects of vocabulary size and experience. Cogn. Dev. 20, 19–31 (doi:10.1016/j.cogdev.2004.07.001) [Google Scholar]
- Montrul S., Slabakova R.2003Competence similarities between native and near-native speakers. An investigation of the preterite-imperfect contrast in Spanish. Stud. Second Lang. Acq. 25, 351–398 [Google Scholar]
- Morris R. K., Williams R. S.2003Bridging the gap between old and new: eye movements and vocabulary acquisition in reading. In The Mind's eye: cognitive and applied aspects of eye movement research (eds Hyona J., Radach R., Deubel H.), pp. 235–252 Amsterdam: Elsevier Science [Google Scholar]
- Mueller J. L., Bahlmann J., Friederici A. D.2008aThe role of pause cues in language learning: the emergence of event-related potentials related to sequence processing. J. Cogn. Neurosci. 20, 892–905 (doi:10.1162/jocn.2008.20511) [DOI] [PubMed] [Google Scholar]
- Mueller J. L., Girgsdies S., Friederici A. D.2008bThe impact of semantic-free second-language training on ERPs during case processing. Neurosci. Lett. 443, 77–81 (doi:10.1016/j.neulet.2008.07.054) [DOI] [PubMed] [Google Scholar]
- Mummery C. J., Patterson K., Wise R. J., Vandenberghe R., Price C. J., Hodges J. R.1999Disrupted temporal lobe connections in semantic dementia. Brain 122, 61–73 (doi:10.1093/brain/122.1.61) [DOI] [PubMed] [Google Scholar]
- Munte T. F., Schiltz K., Kutas M.1998When temporal terms belie conceptual order. Nature 395, 71–73 (doi:10.1038/25731) [DOI] [PubMed] [Google Scholar]
- Nadel L., Moscovitch M.1997Memory consolidation, retrograde amnesia and the hippocampal complex. Curr. Opin. Neurobiol. 7, 217–227 (doi:10.1016/S0959-4388(97)80010-4) [DOI] [PubMed] [Google Scholar]
- Nagy W. E., Gentner D.1990Semantic constraints on lexical categories. Lang. Cogn. Proc. 5, 169–201 (doi:10.1080/01690969008402104) [Google Scholar]
- Nagy W. E., Anderson R. C., Herman P. A.1987Learning word meanings from context during normal reading. Am. Educ. Res. J. 24, 237–270 [Google Scholar]
- Nation I. S. P.2001Learning vocabulary in another language Cambridge, UK: Cambridge University Press [Google Scholar]
- Newman R., Ratner N. B., Jusczyk A. M., Jusczyk P. W., Dow K. A.2006Infants' early ability to segment the conversational speech signal predicts later language development: a retrospective analysis. Dev. Psychol. 42, 643–655 (doi:10.1037/0012-1649.42.4.643) [DOI] [PubMed] [Google Scholar]
- Ni W., Constable R. T., Mencl W. E., Pugh K. R., Fulbright R. K., Shaywitz S. E., Shaywitz B. A., Gore J. C., Shankweiler D.2000An event-related neuroimaging study distinguishing form and content in sentence processing. J. Cogn. Neurosci. 12, 120–133 (doi:10.1162/08989290051137648) [DOI] [PubMed] [Google Scholar]
- Nobre A. C., Allison T., McCarthy G.1994Word recognition in the human inferior temporal lobe. Nature 372, 260–263 (doi:10.1038/372260a0) [DOI] [PubMed] [Google Scholar]
- Nogueira Teixeira E., De Diego-Balaguer R., Mestres-Missé A., Rodriguez-Fornells A., Bachoud-Lévi A. C.2008Word learning abilities in Huntington's disease. J. Neurol. Neurosurg. Psychiatr. 79, A10 [Google Scholar]
- O'Brien E. J., Shank D. M., Myers J. L., Rayner K.1988Elaborative inferences during reading: do they occur on-line? J. Exp. Psychol. Learn. 14, 410–420 (doi:10.1037/0278-7393.14.3.410) [DOI] [PubMed] [Google Scholar]
- O'Kane G., Kensinger E. A., Corkin S.2004Evidence for semantic learning in profound amnesia: an investigation with patient H.M. Hippocampus 14, 417–425 (doi:10.1002/hipo.20005) [DOI] [PubMed] [Google Scholar]
- Onnis L., Waterfall H. R., Edelman S.2008Learn locally, act globally: learning language from variation set cues. Cognition 109, 423–430 (doi:10.1016/j.cognition.2008.10.004) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Otten M., Nieuwland M. S., Van Berkum J. J.2007Great expectations: specific lexical anticipation influences the processing of spoken language. BMC Neurosci. 8, 89 (doi:10.1186/1471-2202-8-89) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pacton S., Perruchet P.2008An attention-based associative account of adjacent and nonadjacent dependency learning. J. Exp. Psychol. Learn. Mem. Cogn. 34, 80–96 [DOI] [PubMed] [Google Scholar]
- Patterson K., Nestor P. J., Rogers T. T.2007Where do you know what you know? The representation of semantic knowledge in the human brain. Nat. Rev. Neurosci. 8, 976–987 [DOI] [PubMed] [Google Scholar]
- Peña M., Bonatti L. L., Nespor M., Mehler J.2002Signal-driven computations in speech processing. Science 298, 604–607 (doi:10.1126/science.1072901) [DOI] [PubMed] [Google Scholar]
- Penfield W., Roberts L.1959Speech and brain mechanisms New York, NY: Athenaeum [Google Scholar]
- Perfetti C. A., Wlotko E. W., Hart L. A.2005Word learning and individual differences in word learning reflected in event-related potentials. J. Exp. Psychol. Learn. 31, 1281–1292 (doi:10.1037/0278-7393.31.6.1281) [DOI] [PubMed] [Google Scholar]
- Petersen S. E., Fox P. T., Posner M. I., Mintun M., Raichle M. E.1988Positron emission tomographic studies of the cortical anatomy of single-word processing. Nature 331, 585–589 (doi:10.1038/331585a0) [DOI] [PubMed] [Google Scholar]
- Poldrack R. A., Willingham D. T.2006Skill learning. In Handbook of functional neuroimaging of cognition (eds Cabeza R., Kingstone A.), pp. 113–148 Cambridge, MA: MIT Press [Google Scholar]
- Price C. J.2000The anatomy of language: contributions from functional neuroimaging. J. Anat. 197, 335–359 (doi:10.1046/j.1469-7580.2000.19730335.x) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pugh K. R., et al. 1996Cerebral organization of component processes in reading. Brain 119, 1221–1238 (doi:10.1093/brain/119.4.1221) [DOI] [PubMed] [Google Scholar]
- Quine W. V. O.1960Word and object Cambridge, MA: MIT Press [Google Scholar]
- Rilling J. K., Seligman R. A.2002A quantitative morphometric comparative analysis of the primate temporal lobe. J. Hum. Evol. 42, 505–533 (doi:10.1006/jhev.2001.0537) [DOI] [PubMed] [Google Scholar]
- Rilling J. K., Glasser M. F., Preuss T. M., Ma X., Zhao T., Hu X., Behrens T. E.2008The evolution of the arcuate fasciculus revealed with comparative DTI. Nat. Neurosci. 11, 426–428 (doi:10.1038/nn2072) [DOI] [PubMed] [Google Scholar]
- Rizzolatti G., Arbib M. A.1998Language within our grasp. Trends Neurosci. 21, 188–194 (doi:10.1016/S0166-2236(98)01260-0) [DOI] [PubMed] [Google Scholar]
- Rodd J. M., Davis M. H., Johnsrude I. S.2005The neural mechanisms of speech comprehension: fMRI studies of semantic ambiguity. Cereb. Cortex 15, 1261–1269 (doi:10.1093/cercor/bhi009) [DOI] [PubMed] [Google Scholar]
- Rodriguez-Fornells A., De Diego-Balaguer R., Munte T. F.2006Executive functions in bilingual language processing. Lang. Learn. 56, 133–190 (doi:10.1111/j.1467-9922.2006.00359.x) [Google Scholar]
- Saffran J. R.2001Words in a sea of sounds: the output of infant statistical learning. Cognition 81, 149–169 (doi:10.1016/S0010-0277(01)00132-9) [DOI] [PubMed] [Google Scholar]
- Saffran J. R., Aslin R. N., Newport E. L.1996Statistical learning by 8-month-old infants. Science 274, 1926–1928 (doi:10.1126/science.274.5294.1926) [DOI] [PubMed] [Google Scholar]
- Saffran J. R., Werker J. F., Werner L. A.2006The infant's auditory world: hearing, speech, and the beginnings of language. In Handbook of child psychology: vol. 2, cognition, perception and language (eds Damon W., Lerner R. M.), pp. 58–108 New York, NY: Wiley [Google Scholar]
- Sanders L. D., Newport E. L., Neville H. J.2002Segmenting nonsense: an event-related potential index of perceived onsets in continuous speech. Nat. Neurosci. 5, 700–703 (doi:10.1038/nn873) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saur D., et al. 2008Ventral and dorsal pathways for language. Proc. Natl. Acad. Sci. USA 105, 18 035–18 040 (doi:10.1073/pnas.0805234105) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saykin A. J., et al. 1999Functional differentiation of medial temporal and frontal regions involved in processing novel and familiar words: an fMRI study. Brain 122, 1963–1971 (doi:10.1093/brain/122.10.1963) [DOI] [PubMed] [Google Scholar]
- Schmahmann J. D., Pandya D. N., Wang R., Dai G., D'Arceuil H. E., de Crespigny A. J., Wedeen V. J.2007Association fibre pathways of the brain: parallel observations from diffusion spectrum imaging and autoradiography. Brain 130, 630–653 (doi:10.1093/brain/awl359) [DOI] [PubMed] [Google Scholar]
- Schubotz R. I., von Cramon D. Y., Lohmann G.2003Auditory what, where, and when: a sensory somatotopy in lateral premotor cortex. Neuroimage 20, 173–185 (doi:10.1016/S1053-8119(03)00218-0) [DOI] [PubMed] [Google Scholar]
- Scott S. K., Johnsrude I. S.2003The neuroanatomical and functional organization of speech perception. Trends Neurosci. 26, 100–107 (doi:10.1016/S0166-2236(02)00037-1) [DOI] [PubMed] [Google Scholar]
- Scott S. K., Wise R. J.2004The functional neuroanatomy of prelexical processing in speech perception. Cognition 92, 13–45 (doi:10.1016/j.cognition.2002.12.002) [DOI] [PubMed] [Google Scholar]
- Scott S. K., Blank C. C., Rosen S., Wise R. J.2000Identification of a pathway for intelligible speech in the left temporal lobe. Brain 123, 2400–2406 (doi:10.1093/brain/123.12.2400) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seger C. A.2006The basal ganglia in human learning. Neuroscientist 12, 285–290 (doi:10.1177/1073858405285632) [DOI] [PubMed] [Google Scholar]
- Seidenberg M. S., Zevin J. D.2006Connectionist models in developmental cognitive neuroscience: critical periods and the paradox of success. In Processes of change in brain and cognitive development: attention and performance XXI (eds Munakata Y., Johnson M.). Oxford, UK: Oxford University Press [Google Scholar]
- Singer M.1994Discourse inference processes. In Handbook of psycholinguistics (ed. Gernsbacher M. A.), pp. 479–510 San Diego: Academic Press [Google Scholar]
- Smith L. B.2000Learning how to learn words: an associative crane. In Becoming a word learner: a debate on lexical acquisition (ed. Golinkoff R. M., et al.), New York, NY: Oxford University Press [Google Scholar]
- Snedeker J., Gleitman L.2004Why it is hard to label our concepts. In Weaving a lexicon (eds Hall G., Waxman S.), Cambridge, MA: MIT Press [Google Scholar]
- Spelke E. S.2002Developmental neuroimaging: a developmental psychologist looks ahead. Dev. Sci. 5, 392–396 (doi:10.1111/1467-7687.00378) [Google Scholar]
- Squire L. R., Stark C. E., Clark R. E.2004The medial temporal lobe. Annu. Rev. Neurosci. 27, 279–306 (doi:10.1146/annurev.neuro.27.070203.144130) [DOI] [PubMed] [Google Scholar]
- Sternberg R. J.1987Most vocabulary is learned from context. In The nature of vocabulary acquisition (eds McKeown M. G., Curtis M. E.), Hillsdale, NJ: Lawrence Erlbaum Associates [Google Scholar]
- Strange B. A., Fletcher P. C., Henson R. N., Friston K. J., Dolan R. J.1999Segregating the functions of human hippocampus. Proc. Natl. Acad. Sci. USA 96, 4034–4039 (doi:10.1073/pnas.96.7.4034) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strange B. A., Henson R. N., Friston K. J., Dolan R. J.2001Anterior prefrontal cortex mediates rule learning in humans. Cereb. Cortex 11, 1040–1046 (doi:10.1093/cercor/11.11.1040) [DOI] [PubMed] [Google Scholar]
- Studdert-Kennedy M.1987The phoneme as a perceptuomotor structure. In Language perception and production (eds Allport D., et al.), pp. 67–84 London, UK: Academic Press [Google Scholar]
- Tamminen J., Gaskell M. G.2008Newly learned spoken words show long-term lexical competition effects. Q. J. Exp. Psychol. 61, 361–371 [DOI] [PubMed] [Google Scholar]
- Thompson-Schill S. L., Kan I. P., Oliver R. T.2006Functional neuroimaging of semantic memory. In Handbook of functional neuroimaging of cognition (eds Cabeza R., Kingstone A.), pp. 149–190 Cambridge, MA: MIT Press [Google Scholar]
- Thompson-Schill S. L., D'Esposito M., Aguirre G. K., Farah M. J.1997Role of left inferior prefrontal cortex in retrieval of semantic knowledge: a reevaluation. Proc. Natl. Acad. Sci. USA 94, 14 792–14 797 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tomasello M.2003Constructing a language: a usage-based theory of language acquisition Cambridge, MA: Harvard University Press [Google Scholar]
- Toro J. M., Sinnett S., Soto-Faraco S.2005Speech segmentation by statistical learning depends on attention. Cognition 97, B25–B34 (doi:10.1016/j.cognition.2005.01.006) [DOI] [PubMed] [Google Scholar]
- Tricomi E., Delgado M. R., McCandliss B. D., McClelland J. L., Fiez J. A.2006Performance feedback drives caudate activation in a phonological learning task. J. Cogn. Neurosci. 18, 1029–1043 (doi:10.1162/jocn.2006.18.6.1029) [DOI] [PubMed] [Google Scholar]
- Ullman M. T.2001The neural basis of lexicon and grammar in first and second language: the declarative/procedural model. Biling. Lang. Cogn. 4, 105–122 [Google Scholar]
- Uylings H. B. M.2006Development of the human cortex and the concept of ‘critical’ or ‘sensitive’ periods. In The cognitive neuroscience of second language acquisition (eds Gullberg M., Indefrey P.), pp. 59–90 Oxford, UK: Blackwell Publishing [Google Scholar]
- van Daalen-Kapteijns M., Elshout-Mohr M., de Glopper K.2001Deriving the meaning of unknown words from multiple contexts. Lang. Learn. 51, 145–181 (doi:10.1111/0023-8333.00150) [Google Scholar]
- Vandenberghe R., Price C., Wise R., Josephs O., Frackowiak R. S.1996Functional anatomy of a common semantic system for words and pictures. Nature 383, 254–256 (doi:10.1038/383254a0) [DOI] [PubMed] [Google Scholar]
- Vargha-Khadem F., Gadian D. G., Watkins K. E., Connelly A., Van Paesschen W., Mishkin M.1997Differential effects of early hippocampal pathology on episodic and semantic memory. Science 277, 376–380 (doi:10.1126/science.277.5324.376) [DOI] [PubMed] [Google Scholar]
- Vigneau M., Beaucousin V., Herve P. Y., Duffau H., Crivello F., Houde O., Mazoyer B., Tzourio-Mazoyer N.2006Meta-analyzing left hemisphere language areas: phonology, semantics, and sentence processing. Neuroimage 30, 1414–1432 (doi:10.1016/j.neuroimage.2005.11.002) [DOI] [PubMed] [Google Scholar]
- Wahl M., Marzinzik F., Friederici A. D., Hahne A., Kupsch A., Schneider G. H., Saddy D., Curio G., Klostermann F.2008The human thalamus processes syntactic and semantic language violations. Neuron 59, 695–707 (doi:10.1016/j.neuron.2008.07.011) [DOI] [PubMed] [Google Scholar]
- Walley A. C.1993The role of vocabulary development in children's spoken word recognition and segmentation ability. Dev. Rev. 13, 286–350 (doi:10.1006/drev.1993.1015) [Google Scholar]
- Warren J. E., Wise R. J. S., Warren J. D.2005Sounds do-able: auditory-motor transformations and the posterior temporal plane. Trends Neurosci. 28, 636–643 [DOI] [PubMed] [Google Scholar]
- Werner H., Kaplan E.1952The acquisition of word meanings: a developmental study. Monogr. Soc. Res. Child Dev. 15 [Google Scholar]
- White L., Genesee F.1996How native is near-native? The issue of ultimate attainment in adult second language acquisition. Second Lang. Res. 12, 238–265 [Google Scholar]
- Wilson S. M., Saygin A. P., Sereno M. I., Iacoboni M.2004Listening to speech activates motor areas involved in speech production. Nat. Neurosci. 7, 701–702 (doi:10.1038/nn1263) [DOI] [PubMed] [Google Scholar]
- Wise R. J., Price C.2006Functional neuroimaging of language. In Handbook of functional neuroimaging of cognition (eds Cabeza R., Kingstone A.), pp. 191–228 Cambridge, MA: MIT Press [Google Scholar]
- Wise R. J., Scott S. K., Blank S. C., Mummery C. J., Murphy K., Warburton E. A.2001Separate neural subsystems within ‘Wernicke's area’. Brain 124, 83–95 (doi:10.1093/brain/124.1.83) [DOI] [PubMed] [Google Scholar]
- Wong P. C., Perrachione T. K., Parrish T. B.2007Neural characteristics of successful and less successful speech and word learning in adults. Hum. Brain Mapp. 28, 995–1006 (doi:10.1002/hbm.20330) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yin H. H., Knowlton B. J.2006The role of the basal ganglia in habit formation. Nat. Rev. Neurosci. 7, 464–476 (doi:10.1038/nrn1919) [DOI] [PubMed] [Google Scholar]