Abstract
Both implicit learning and statistical learning focus on the ability of learners to pick up on patterns in the environment. It has been suggested that these two lines of research may be combined into a single construct of “implicit statistical learning.” However, by comparing the neural processes that give rise to implicit versus statistical learning, we may determine the extent to which these two learning paradigms do indeed describe the same core mechanisms. In this review, we describe current knowledge about neural mechanisms underlying both implicit learning and statistical learning, highlighting converging findings between these two literatures. A common thread across all paradigms is that learning is supported by interactions between the declarative and nondeclarative memory systems of the brain. We conclude by discussing several outstanding research questions and future directions for each of these two research fields. Moving forward, we suggest that the two literatures may interface by defining learning according to experimental paradigm, with “implicit learning” reserved as a specific term to denote learning without awareness, which may potentially occur across all paradigms. By continuing to align these two strands of research, we will be in a better position to characterize the neural bases of both implicit and statistical learning, ultimately improving our understanding of core mechanisms that underlie a wide variety of human cognitive abilities.
Keywords: Implicit learning, Statistical learning, Neuroimaging, fMRI, EEG, Neuroscience, Neuropsychology, Neural basis
1. Introduction
The ability to detect patterns in the environment is central to many aspects of human cognition, ranging from perception to language, to decision-making, to the enjoyment of avant-garde music. Understanding this ability has been the focus of two traditionally distinct lines of research, centered on “implicit learning” (Reber, 1967) and “statistical learning” (Saffran, Aslin, & Newport, 1996). Implicit learning is defined as “the capacity to learn without awareness of the products of learning” (Frensch & Runger, 2003, p. 14). In contrast, statistical learning is more broadly defined as the ability to extract the statistical properties of sensory input across time or space (e.g., Frost, Armstrong, Siegelman, & Christiansen, 2015; Schapiro & Turk-Browne, 2015; Siegelman, Bogaerts, & Frost, 2017). Although these two lines of research use different tasks and procedures, they also share many commonalities and may in fact encapsulate the same underlying memory phenomena, as has been previously discussed (Christiansen, 2018; Perruchet & Pacton, 2006). Both implicit learning and statistical learning involve the extraction of structure from input and are generally thought to occur incidentally, through exposure to positive examples, without instruction, and without intention to learn. Both types of learning also then influence behavior in similar ways, for example, by making it easier to hear phoneme or word boundaries (statistical learning) or produce practiced motor sequences (implicit sequence learning). It has recently been suggested that these two literatures may be united under a single framework of “implicit statistical learning” (e.g., Rebuschat, 2015).
At this key juncture of increasing crosstalk and interaction between these two lines of research, working toward a better understanding of the neural bases of these learning phenomena is critical for continued progress in both these fields. By comparing the neural processes that give rise to implicit versus statistical learning, we may determine the extent to which these two learning paradigms do indeed describe the same core mechanisms. Deepening our understanding of the neural mechanisms of implicit and statistical learning will also continue to provide key insights into both these learning phenomena at a theoretical level, providing biological constraints to narrow the space of possible models that may account for observed learning behavior. For example, past research has shown that over the course of a typical paradigm, learning may occur in different ways, mediated by different neural memory systems (e.g., Poldrack et al., 2001, cf. Reber, 2013). Neural evidence can thereby provide a more accurate, nuanced view of how qualitatively different mechanisms underlying learning may wax and wane over time. Finally and more generally, neural measures can often provide a more sensitive index of processing than behavioral data alone and allow for the investigation of populations in which a behavioral response may be difficult to acquire (e.g., infants, certain groups of patients). By obviating the need for behavioral responses, data from neuroscience can enable us to address a wider variety of research questions. In sum, articulating the neural contributions underlying implicit and statistical learning is critical to reaching a deeper understanding of these learning mechanisms and provides a view of these capacities that could not be achieved by considering behavioral data alone.
With these goals in mind, this article reviews what we currently know about how the brain accomplishes implicit learning and statistical learning tasks. We first provide a brief primer on the memory systems of the brain. We then outline the concepts, research foci, and experimental paradigms that have been used in the two fields of implicit learning and statistical learning. Next, we review findings that shed light on how learning occurs at the neural level in these different paradigms. While we describe neuroscience studies on “implicit learning” and “statistical learning” in separate sections, this distinction is designed to reflect the separate historical trajectories and different paradigms used by each field, rather than to necessarily argue for a strong delineation between these two constructs at a mechanistic level. We conclude by considering key commonalities in the neural bases underlying learning among these different paradigms and possible future research directions to drive continued progress in both fields.
1.1. Memory systems of the brain
All types of learning, including implicit learning and statistical learning, can be understood in terms of the underlying contributions made by the two basic memory systems of the brain. Although memory performance may be mediated predominantly by one memory system alone, often performance is a function of interactions between two. Declarative memory refers to the recall and recognition of facts and events and depends on the medial temporal lobe (MTL) and various cortical regions, particularly the prefrontal cortex and the parietal lobes (Gabrieli, 1998; Scoville & Milner, 1957; Squire, 2004; Squire & Zola, 1996). The MTL consists of the hippocampus and adjacent perirhinal, entorhinal, and parahippocampal cortices (Squire, Stark, & Clark, 2004). In contrast, facilitated processing that occurs independently of recognition or recall is referred to as nondeclarative memory (Squire & Zola, 1996) and does not require the MTL system. Nondeclarative learning capacities are heterogenous and include skills, habits, procedures, priming, and simple types of memory like habituation and sensitization, and as such do not rely on a single, coherent memory system (Reber, 2013). Rather, nondeclarative memory reflects an accumulation of changes that take place directly within the neural circuits that were activated during the initial learning process, in processing areas such as the basal ganglia (which contains the striatum, which in turn contains the caudate and putamen, among other structures), cerebellum, and neocortex (e.g., Eichenbaum & Cohen, 2001; Gabrieli, 1998; McClelland, McNaughton, & O’Reilly, 1995; Reber, 2013; Squire, 2004; Squire & Zola, 1996). For example, perceptual repetition priming—which refers to facilitation in processing a repeated stimulus—is driven by changes in sensory cortical areas that are directly involved in initial perception (Squire & Zola, 1996). The lack of hippocampal involvement, which allows for the flexible relational storage of information (Eichenbaum & Cohen, 2001), may explain why perceptual priming effects are often found to be highly stimulus specific.
Seminal evidence for the existence of these two separate memory systems came from Scoville and Milner’s (1957) observations of a hippocampal amnesic patient known as H.M. Following bilateral temporal lobe resection for intractable epilepsy, H.M. showed complete loss of declarative memory for events subsequent to his surgery, together with partial retrograde amnesia for 3 years leading up to his operation. Despite this dramatic impairment in his ability to form memories for new episodes and facts, H.M. showed preserved learning on other memory tests, such as mirror drawing and the Tower of Hanoi puzzle (Cohen, Eichenbaum, Deacedo, & Corkin, 1985; Corkin, 1968; Milner, Corkin, & Teuber, 1968). Subsequent research substantiated these early observations, demonstrating that amnesic patients can achieve normal performance on many other memory tasks, such as artificial grammar learning, prototype learning, probabilistic category learning, perceptuomotor skill learning, and perceptual priming (e.g., Cohen & Squire, 1980; Goshen-Gottstein, Moscovitch, & Melo, 2000; Graf & Schacter, 1985; Keane et al., 1997; Knowlton & Squire, 1993, 1994, 1996; Nissen & Bullemer, 1987; Reber & Squire, 1994, 1998). These types of learning do not require a normally functioning hippocampus and came to be known collectively as nondeclarative memory (Squire & Zola, 1996), as performance on these tasks demonstrates memory storage while not requiring the conscious retrieval of any knowledge. Thus, declarative and nondeclarative memory can be dissociated on the basis of both conscious awareness and reliance on the MTL system, although there is not a perfect one-to-one correspondence between awareness and MTL dependence (Hannula & Greene, 2012; Henke, 2010).
1.2. Competition between memory systems
As we will see, a common theme that emerges across implicit learning and statistical learning paradigms is that there is frequently interaction or competition between the declarative and nondeclarative memory systems of the brain. This is particularly evident in implicit learning research, which, relative to the statistical learning literature, has traditionally placed a much stronger emphasis on characterizing the neural basis of learning. Even in paradigms that have been specifically designed to isolate “implicit learning” per se, healthy learners completing these tasks may show behavioral evidence of having acquired both declarative and nondeclarative memory (e.g., Perruchet, Gallego, & Savy, 1990; Perruchet & Pacteau, 1990; Servan-Schreiber & Anderson, 1990). The finding that declarative and nondeclarative memory systems are often both active during learning, and may compete with one another, also appears across findings from the statistical learning literature. Thus, investigating interactions between implicit and explicit memory, rather than how each system functions in isolation, is critical to understanding how humans actually learn.
Equipped with a basic understanding of the two dissociable memory systems of the brain, we now turn our attention to reviewing the conceptual definitions, research foci, and experimental paradigms used in the two parallel literatures of implicit learning and statistical learning.
1.3. Overview of implicit learning and statistical learning fields: Research foci and experimental paradigms
A strong focus of implicit learning studies is on the nature of the representations acquired during learning (i.e., whether conscious or unconscious). Paradigms used to study implicit learning in a laboratory setting include the artificial grammar learning task (AGL; Reber, 1967), the serial reaction time task (SRTT; Nissen & Bullemer, 1987), and the weather prediction task (Knowlton, Squire, & Gluck, 1994). Learning in all these tasks is typically measured indirectly, without making direct reference to prior studied items, and the knowledge supporting performance on these tasks has generally been assumed to be unconscious (Reber, 1989).
For example, the AGL task—the hallmark paradigm of implicit learning research (Reber, 1967)—requires participants to memorize strings of letters generated by a finite-state artificial grammar. After exposure to numerous letter strings, participants are told that the strings were based on specific rules, and then asked to categorize new letter strings as either grammatical or ungrammatical. Despite having little verbalizable knowledge about the underlying rules, participants are typically able to categorize new strings at above-chance levels (Reber, 1967, 1976).
Similarly, in the SRTT, learning can occur in the absence of awareness of the underlying sequence (Destrebecqz & Cleeremans, 2001). In the canonical version of this task, participants respond to sequential visual cues that appear in one of four locations by pressing one of four corresponding response buttons. After each response, the next cue appears in a new location after a brief delay. Unbeknownst to participants, there are both sequential trials, which follow a repeating sequence, and random trials, in which the visual cues no longer follow a repeating pattern. As participants learn the hidden pattern, their response time to the repeating cues gradually decreases over and above the general response time reduction that occurs to randomly ordered cues as a function of task practice effects.
Another standard implicit learning task is the weather prediction task (Knowlton et al., 1994). This task involves learning of associations between combinations of stimuli and outcomes and is also generally assumed to occur without requiring the involvement of the declarative memory system. The stimuli in this task consist of a set of cards. Each card contains a unique geometric pattern and is associated with one of two possible outcomes (sun or rain) with a fixed probability. On each trial, participants are presented up to four cards and asked to predict the weather outcome. The actual weather outcome is probabilistically determined by the individual cards, whereby each card is a partially accurate predictor of the weather. With each trial, learners receive feedback and become increasingly accurate at predicting the weather. This improvement can occur even though learners may have little explicit knowledge of the information they are using to improve their judgments.
In contrast to the field of implicit learning, statistical learning researchers have generally not focused on the nature of the representations acquired during learning (i.e., whether conscious or unconscious), but rather on the finding that learning appears to occur incidentally—without instruction to detect patterns. Statistical learning is most commonly studied within the context of speech segmentation, using the artificial speech segmentation paradigm (Saffran, Newport, & Aslin, 1996; Saffran, Newport, Aslin, Tunick, & Barrueco, 1997; Saffran et al., 1996). In this task, learners are exposed to a continuous speech stream made up of repeating three-syllable nonsense “words” (e.g., bupada + babupu + tutibu → bupadababupututibu…; Saffran et al., 1996, 1996, 1997). Learners must become sensitive to the co-occurrence statistics between neighboring syllables—which are higher within words than across word boundaries—in order to discover the underlying “word” units. After exposure, learning in adult participants is then typically tested using a forced-choice recognition measure, requiring the discrimination of words from the exposure steam (e.g., bupada) and nonword foil items (e.g., pubati). If performance on this forced-choice measure is above chance, statistical learning is inferred. Reaction times on a speeded target detection task have also been used, which measure the effects of learning on online processing (e.g., Batterink, Reber, Neville, & Paller, 2015; Batterink, Reber, & Paller, 2015; Franco, Eberlen, Destrebecqz, Cleeremans, & Bertels, 2015; Kim, Seitz, Feenstra, & Shams, 2009; Turk-Browne, Scholl, Chun, & Johnson, 2009). Statistical learning is also studied within the visual domain, using an analogous paradigm in which participants view a sequence of images or abstract shapes organized into repeating triplets (e.g., Fiser & Aslin, 2002; Turk-Browne, Jungé, & Scholl, 2005).
In sum, the paradigms used to study implicit learning and statistical learning appear to focus on the same core mechanism—the detection of patterns governing elements in complex stimulus domains (cf. Reber, in Rebuschat, 2015). Although these tasks differ in notable ways—including demands on perceptual and motor processing, use of auditory versus visual stimuli, and engagement of linguistic versus nonlinguistic processing—“learning” in all cases is defined as the participants’ ability to extract regularities in input they are exposed to, without any instruction to do so. The overall similarities between the paradigms raise the possibility that similar neural mechanisms and substrates may be involved. In the next section, we review neuroimaging studies of healthy learners in order to characterize the specific brain areas activated in these different learning paradigms, providing a valuable perspective on the extent to which these different paradigms in these two literatures reflect similar neural processes.
1.4. Implicit learning paradigms
1.4.1. Artificial grammar learning (AGL)
Neuroimaging studies have implicated a number of different regions in AGL, including prefrontal cortical areas (most commonly left inferior frontal regions including BA 44/45), parietal areas, and the basal ganglia (Forkstam, Hagoort, Fernandez, Ingvar, & Petersson, 2006; Lieberman, Chang, Chiao, Bookheimer, & Knowlton, 2004; Petersson, Folia, & Hagoort, 2012; Petersson, Forkstam, & Ingvar, 2004; Skosnik, Gitelman, Parrish, Mesulam, & Reber, 2002). Different authors have focused on different neural regions when interpreting these widespread activation patterns. For example, activation of left inferior frontal gyrus or “Broca’s area” has been interpreted as reflecting this region’s role in linguistic processing, online sequence processing, and sequence learning (Forkstam et al., 2006; Petersson et al., 2004, 2012). Skosnik et al. (2002) found distinct neural correlates in posterior parietal areas for grammaticality judgments contrasted with letter string recognition. This activity was interpreted as the letter strings becoming more word form like as learning proceeded.
Behavioral evidence in amnesic patients and controls suggests that performance on the AGL task can be supported by either exemplar-specific “chunk” knowledge that can in some cases be explicit (i.e., the bigrams and trigrams shared between the strings presented at training and at test) or by abstract grammatical rule knowledge (Knowlton & Squire, 1996). Neuroimaging evidence supports this notion of different explicit and implicit sources of knowledge on this task, and additionally suggests that these different sources of knowledge are supported by different neural substrates, with competitive interactions occurring between these memory systems. For example, Lieberman et al. (2004) found that test items with higher chunk strength elicited greater hippocampal activation relative to low-chunk-strength items. In contrast, processing of grammatical compared to ungrammatical strings (controlling for chunk strength) was associated with increased caudate activation. Furthermore, caudate and hippocampal activations were strongly negatively correlated (r = −0.87), suggesting a competitive relationship between basal ganglia and medial temporal areas. Petersson et al. (2012) also reported strong MTL deactivation, including the hippocampus, during classification of grammatical items. Again, this activation was dissociated from chunk strength. This finding converges with the negative correlation between hippocampal and caudate activation reported by Lieberman and colleagues, and further supports the idea that knowledge of chunk strength—mediated by the hippocampus—and knowledge of abstract grammar rules—mediated by the caudate—actively compete with one another during grammatical classification.
Although neuroimaging studies commonly find that the caudate is activated in AGL, patients with basal ganglia dysfunction due to Parkinson’s disease or Huntington’s disease show intact AGL performance (Knowlton et al., 1996; Meulemans, Peigneux, & Van der Linden, 1998; Reber & Squire, 1999; Witt, Nuhsman, & Deuschl, 2002). These findings suggest that the caudate is not critically involved in AGL. Nonetheless, one possibility is that the basal ganglia contribute to AGL in healthy participants, but that patients with impaired basal ganglia functioning may rely on their intact MTL memory system in order to achieve normal learning on this task (Moody, Bookheimer, Vanek, & Knowlton, 2004). Future studies will be needed in order to conclusively resolve this issue.
1.4.2. Serial reaction time task (SRTT)
In neuroimaging studies, learning of trained sequences relative to untrained sequences is associated with activation of the striatum as well as cortical areas related to motor planning (Daselaar, Rombouts, Veltman, Raaijmakers, & Jonker, 2003; Doyon et al., 1996; Grafton, Hazeltine, & Ivry, 1995; Hazeltine, Grafton, & Ivry, 1997; Rauch et al., 1995, 1997; Peigneux et al., 2000; Reiss et al., 2005; Seidler et al., 2005). In addition, early stages of exposure to the sequence are also often accompanied by hippocampal activation, which decreases gradually over the course of training (Albouy et al., 2008; Fletcher et al., 2005; Grafton et al., 1995; Rieckmann, Fischer, & Büackman, 2010; Schendan, Searl, Melorse, & Stern, 2003). Learning success is positively related to activation in the striatum (Garraux et al., 2007; Peigneux et al., 2000; Rauch et al., 1997; Reiss et al., 2005) and to deactivation in the MTL (Albouy et al., 2008; Rieckmann et al., 2010).
Two explanations have been proposed to account for the time course of involvement of the MTL in sequence learning (Rieckmann et al., 2010). One suggestion is that the MTL supports rapid initial acquisition of higher order associations in complex sequences (Doeller, Opitz, Krick, Mecklinger, & Reith, 2005; Fletcher et al., 2005; Schendan et al., 2003). This idea is supported by studies showing that the MTL seems to be especially recruited in complex versions of the SRTT, in which participants are forced to rely on second-order relationships (Curran, 1997; Schendan et al., 2003; Shanks, Channon, Wilkinson, & Curran, 2006). The MTL may be involved in making predictions about possible outcomes (Bornstein & Daw, 2012, 2013); as learning progresses, the associations between stimuli become more predictable, leading to concomitant decreases in MTL activation. An alternative explanation is that learning-related disengagement of the MTL is related to competition or functional suppression between the striatal and MTL memory systems, consistent with findings from other implicit learning tasks. According to this view, greater expertise is associated with increased involvement of the striatum and decreased reliance on the MTL system. This competition idea is supported by several studies showing that SRTT learning is enhanced when declarative memory function or top-down processing is disrupted (Brown & Robertson, 2007a,b; Galea et al., 2009; Nemeth, Janacsek, Polner, & Kovacs, 2013; Virag et al., 2015).
Recently, P. J. Reber and colleagues (Gobel, Parrish, & Reber, 2011; Sanchez, Gobel, & Reber, 2010; Sanchez & Reber, 2013) developed a modified version of the SRTT, the Serial Interception Sequence Learning (SISL) task. The SISL task may better encapsulate implicit learning compared to the standard SRTT, which is commonly accompanied by evidence of explicit knowledge in healthy participants (Reber & Squire, 1994, 1998; Willingham, Greeley, & Bardone, 1993). Although the SRTT requires participants to respond to the onset of stationary cues that appear at fixed intervals, the SISL task involves making responses at a much more rapid pace, which must be precisely timed to moving cues. The SISL task is more challenging and requires participants to continually engage in task performance, potentially leaving them with fewer resources to identify or memorize the repeating sequence. The idea that response speed makes learning more likely to be implicit is supported by the finding that decreasing the interval between the response and the next cue in the standard version of the SRTT selectively impairs explicit sequence learning (Destrebecqz & Cleeremans, 2001).
The SISL paradigm has been shown to produce robust implicit learning with low levels of conscious sequence knowledge even in healthy participants (Sanchez et al., 2010). In addition, providing participants with full explicit knowledge of the repeating sequence does not impact sequence learning, consistent with the idea that explicit knowledge does not normally contribute to performance on this task (Sanchez & P. J. Reber, 2013). Interestingly, while performance of known sequences in the SISL paradigm is associated with increased activity in the basal ganglia—pointing to the importance of this structure across sequence learning tasks—it does not modulate hippocampal activity (Gobel et al., 2011). As MTL activation is typically observed in the standard SRTT, this finding tentatively suggests that competition between the basal ganglia and MTL systems may not occur during isolated implicit learning. It also raises the possibility that MTL activation during the standard SRTT may reflect the acquisition of parallel explicit knowledge of the sequence that does not actually contribute to implicit learning.
1.4.3. Probabilistic category learning
Neuroimaging studies have demonstrated activation of the basal ganglia, in particular the striatum, during probabilistic category learning (Aron et al., 2004; Moody et al., 2004; Poldrack, Prabhakaran, Seger, & Gabrieli, 1999; Poldrack et al., 2001). In addition, activity was also observed in a broad network of prefrontal, parietal, and occipitotemporal cortical regions. Mirroring findings from other implicit learning tasks, neuroimaging studies of probabilistic category learning also consistently find a decrease in MTL activation versus baseline (Aron et al., 2004; Moody et al., 2004; Poldrack et al., 1999, 2001). This finding again suggests a competitive interaction between these two memory systems.
Particularly relevant to the competition idea is the study by Poldrack et al. (2001), who used fMRI to compare neural activation during probabilistic category learning under two different conditions. In the version designed to engage implicit memory (i.e., the typical version of the task), subjects learned on the basis of trial-by-trial feedback; in contrast, in a modified version of the task designed to engage explicit memory, subjects learned on the basis of category labels, without making a categorization decision. Relative to the standard task version, the “explicit” version resulted in reduced activation of the basal ganglia and increased activation of the MTL, indicating that engagement of these two memory systems is modulated by whether the task encourages use of implicit or explicit memory strategies. In addition, activity in the MTL correlated negatively with activity in the striatum across learners. Furthermore, MTL and striatum showed reciprocal changes over time; the MTL was initially active and the striatum inactive, but as learning progressed, the striatum quickly became activated while the MTL became deactivated. These findings suggest that the MTL plays a role in learning during early stages of training, but that the basal ganglia subsequently “take over” learning, leading to disengagement of the MTL. A similar pattern of results was found by Foerde, Knowlton, and Poldrack (2006), who compared probabilistic category learning under dual-task versus single-task conditions. Equivalent levels of learning were found under the two conditions, but distraction by a secondary task reduced the involvement of the MTL and increased the involvement of the striatum. These results again point to a competitive interaction between memory systems during probabilistic category learning, as has been shown for other implicit learning tasks.
1.4.4. Statistical learning paradigms
We will now turn our attention to what is known about the neural basis of statistical learning. In contrast to the implicit learning literature, understanding the neural bases of learning has not been a central focus of statistical learning research. In addition, the field of statistical learning as a whole is younger, as can be seen readily by comparing the publication years for the two seminal papers of each field (Reber, 1967; Saffran, Aslin, et al., 1996; Saffran, Newport, et al., 1996). Thus, whereas there is a rich and decades-old literature on the neural mechanisms underlying implicit learning, parallel topics in the area of statistical learning have only recently come under investigation. Nonetheless, recent findings from statistical learning seem to converge with previous results from implicit learning, suggesting additional parallels between these two literatures.
A handful of neuroimaging studies have investigated the neural basis of this type of statistical learning, contrasting activation evoked by speech streams with repeating words with that evoked by random syllable streams (Cunillera et al., 2009; Karuza et al., 2013; McNealy, Mazziotta, & Dapretto, 2006). These studies have consistently found that statistical regularities produce enhanced activation of higher level auditory networks, in regions that have been previously implicated in auditory and/or linguistic processing, such as the left superior temporal gyrus and the left inferior frontal gyrus (Broca’s area; BA 44/45), extending to premotor cortex (BA 6; McNealy et al., 2006; Cunillera et al., 2009; Karuza et al., 2013). In addition, activation of left IFG/ventral PMC correlates with behavioral performance on postexposure tests of statistical learning (Cunillera et al., 2009; Karuza et al., 2013).
Thus, statistical learning underlying speech segmentation seems to be at least partially supported by unimodal auditory cortical regions.
In the visual domain, at least one neuroimaging study has revealed analogous results to what has been found using the speech segmentation paradigm, showing that processing of visual statistical structure is associated with enhanced activation in high-level visual networks (Turk-Browne et al., 2009). Participants viewed sequences of abstract shapes, which were either organized into repeating triplets or randomly ordered. Compared to random blocks, blocks containing visual regularities yielded greater activation in category-specific visual regions, namely object-selective lateral occipital cortex and word-selective ventral occipitotemporal cortex.
Taken together, these results have led to the proposal that statistical learning is largely supported by modality-specific learning mechanisms (Frost et al., 2015). According to this view, local computations are performed in separate neural networks in different cortical areas (e.g., visual, auditory, and somatosensory), each of which operates on modality-specific representations and is governed by its own set of constraints. For example, because auditory information unfolds over time, the auditory cortex displays greater sensitivity to temporal information. In contrast, relationships between items in the visual domain can be processed simultaneously, and thus, the visual cortex displays enhanced sensitivity to spatial information but lower sensitivity to temporal information. This model of statistical learning is supported by behavioral evidence showing a lack of correlation within individuals across different types of statistical learning tasks (e.g., syllables versus geometric shapes; Siegelman & Frost, 2015). This idea of modality specificity also echoes what has been demonstrated for different types of implicit learning—namely, that implicit learning results from experience directly shaping neural circuits within specific sensory cortical areas (Reber, 2013).
Although there appear to be important modality-specific contributions to statistical learning, some neural regions may play a domain-general role. For example, while the left IFG (i.e., “Broca’s area”) has been previously implicated in statistical learning of structured speech as described previously (Cunillera et al., 2009; Karuza et al., 2013; McNealy et al., 2006), it may also contribute to other types of statistical learning. In the visual domain, activation of the left IFG was found to correlate with behavioral familiarity with shape triplets (Turk-Browne et al., 2009). In addition, in a nonlinguistic auditory statistical learning paradigm using tone stimuli, left inferior frontal cortex showed enhanced activation to structured tone sequences, but not to random sequences of tones (Abla & Okanoya, 2008). Although the left IFG appears to contain subregions that are truly language specific, other subregions within this general area may contribute to domain-general processes, such as the processing and integration of sequential information (Fedorenko, Duncan, & Kanwisher, 2012). Interestingly, processing of artificial grammar rules in the AGL paradigm is also associated with left IFG activation (Forkstam et al., 2006; Petersson et al., 2004, 2012), suggesting a shared neural basis for the processing of structure in AGL and statistical learning tasks.
Neuroimaging evidence also indicates that there are contributions to statistical learning from domain-general memory systems, notably the striatum and the MTL systems, just as has been found for diverse types of implicit learning tasks (e.g., Bischoff-Grethe, Martin, Mao, & Berns, 2001; Lieberman et al., 2004; Rauch et al., 1997; Seger & Cincotta, 2005). Striatal activation has been observed to statistical structure in both the auditory domain (Karuza et al., 2013; McNealy et al., 2006) and the visual domain (Turk-Browne et al., 2009). In this latter study, by excluding data from subjects who performed above chance on the familiarity test, the investigators focused on extracting neural correlates of implicit statistical learning. Caudate activation (but no hippocampal activation) was observed even in learners who exhibited no subsequent familiarity, suggesting that this region may contribute to implicit statistical learning, dissociable from explicit knowledge. In contrast, when all learners were included in the analysis, both caudate and hippocampal activation were observed. The involvement of the MTL in statistical learning is also supported by a finding by Schapiro, Kustner, and Turk-Browne (2012). By presenting participants with a continuous stream composed of hidden pairs of objects and applying pattern similarity analysis, these authors found that the representations of objects presented in pairs became more similar to one another after exposure compared to before exposure in the hippocampus and MTL cortex. The right hippocampus may be particularly important for making predictions about upcoming stimuli, as it shows increased activation to predictive stimuli compared to non-predictive stimuli (Turk-Browne, Scholl, Johnson, & Chun, 2010).
The observed activation of the MTL system during statistical learning raises the same question that has been central to the study of implicit learning: Does the MTL play a necessary role in statistical learning, or does activity in this region merely represent the acquisition of parallel (explicit) representations that do not centrally contribute to learning? Although a few studies have examined amnesic patients’ performance on statistical learning tasks, this question cannot yet be answered conclusively. Schapiro, Gregory, Landau, McCloskey, and Turk-Browne (2014) found that patient LSJ, who suffered complete bilateral hippocampal loss and broader MTL damage, showed widespread deficits in statistical learning across a range of visual and auditory stimuli (Schapiro et al., 2014). However, given that many normal, healthy participants also do not show above-chance performance on statistical learning tasks (e.g., Siegelman & Frost, 2015), it is difficult to unambiguously attribute patient LSJ’s deficits to neurobiological impairment. Building on this work, Covington, Brown-Schmidt, and Duff (2018) studied a larger group of patients with MTL damage and reported that patients exhibit less learning overall compared to healthy controls. Nonetheless, these patients still exhibited above-chance learning, and their performance fell within the distribution of healthy participant performance. In addition, lesion size did not reliably predict performance. These results suggest that the hippocampus may contribute to statistical learning but is not strictly necessary for statistical learning to occur. Thus, the question of whether the MTL is necessary for statistical learning has not been conclusively resolved and awaits further investigation.
To summarize this small but growing literature on the neurobiological basis of statistical learning, there appear to be both modality-specific and domain-general neural mechanisms that contribute to learning. One class of findings indicates that statistical learning occurs within modality-specific sensory cortical areas, according to the stimuli presented during learning. For example, visual sequences activate higher level visual cortex, whereas word segmentation tasks activate higher level auditory areas. In addition, statistical learning also appears to be supported by domain-general memory systems that are involved regardless of stimulus modality. Both the striatum and MTL are activated to statistical structure across different modalities (audition and vision). This finding corresponds to what has been observed across many different types of implicit learning tasks—that multiple memory systems may be active under normal learning conditions in healthy learners, acquiring separate representations in parallel.
Using these findings as a point of departure, in the next section, we consider some outstanding questions and potential directions for future research on the neural bases of statistical learning and implicit learning. Many of the same methods, research questions, and conceptual approaches that have been productive in one field may be equally valuable when applied to the parallel research field (e.g., applying implicit learning concepts to statistical learning and vice versa).
2. Directions for future research
2.1. Statistical learning
2.1.1. Testing interactions between memory systems over the course of exposure
One important acknowledgment that has driven progress in the field of implicit learning is that implicit and explicit learning systems often operate in parallel in healthy learners, sometimes competing and sometimes interacting. In the domain of language learning, it has also been proposed that these two types of memories contribute and interact (e.g., Ullman, 2004, 2005). Given neuroimaging evidence demonstrating both hippocampal and striatal involvement in the same statistical learning task (e.g., Turk-Browne et al., 2009), interactions between these two different memory systems may also play a role in statistical learning generally, with different neural systems engaged at different points in time throughout the learning process. One possibility is that the hippocampus may contribute to statistical learning early during the exposure period, consistent with its role in making predictions about upcoming stimuli (Bornstein & Daw, 2012, 2013; Turk-Browne et al., 2010). As learning progresses, the striatum may become more involved, while the hippocampus becomes disengaged. Such a finding would echo results from different implicit learning tasks showing reciprocal engagement of these two memory systems over the course of learning (e.g., Albouy et al., 2008; Fletcher et al., 2005; Poldrack et al., 2001).
Future neuroimaging studies may address this idea by using more fine-grained measures to assess the time course of learning over the exposure period. To specifically target learning-related neural changes, time-course information could be combined with behavioral measures of learning collected over the exposure period, using approaches such as concurrent RT tasks (e.g., Batterink, 2017; Siegelman et al., 2017; Turk-Browne et al., 2010) or intermittent offline testing (Karuza et al., 2013). This sort of paradigm would allow for the assessment of whether reliance on one system over the other—across exposure or potentially even at different stages of exposure—predicts behavioral performance on tests of statistical learning. If positive evidence indicates that one memory system is optimally suited for extracting structure from the environment, a further step would be to test whether inhibiting the competing memory system enhances statistical learning, as has sometimes been found for different types of implicit learning tasks (e.g., Filoteo, Lauritzen, & Maddox, 2010; Foerde et al., 2006; Galea et al., 2009; Nemeth et al., 2013).
2.1.2. The role of the hippocampus in statistical learning
A related outstanding question with many parallels to the implicit learning literature is whether the hippocampus plays a necessary, causal role in statistical learning, or whether it merely acquires nonessential or redundant (explicit) knowledge in parallel. Yet a third possibility is that statistical learning may operate on declarative, MTL-dependent memory representations. Under this scenario, explicit memory traces would be initially acquired by the MTL system, with statistical learning mechanisms then operating on these representations to guide behavior (without necessarily requiring awareness).
As described previously, evidence from two prior studies suggests that patients with MTL damage on average perform more poorly on statistical learning tasks than healthy controls, including a case study failing to find above-chance performance (Covington et al., 2018; Schapiro et al., 2014). However, a potential limitation of these two studies is the use of only an offline recognition measure to assess learning. One possibility is that MTL patients show intact statistical learning when assessed through implicit measures, but they are impaired at tests involving explicit recognition, similar to dissociations that have been found for implicit learning (e.g., priming versus recognition; e.g., Keane et al., 1997; Goshen-Gottstein et al., 2000). To test this hypothesis, learning in these patients could be assessed using more implicit measures, such as RTs (Batterink & Paller, 2017; Batterink et al., 2015, 2015; Siegelman et al., 2017), statistically induced chunking recall (Isbilen, McCauley, Kidd, & Christiansen, 2017), or EEG-based neural entrainment to the underlying structure (Batterink & Paller, 2017). The use of neural measures (e.g., EEG or fMRI) in these patients could also potentially reveal evidence of learning even in the absence of above-chance recognition performance (cf. Turk-Browne et al., 2009). In addition, neural measures would allow for direct comparisons of learning in patients and healthy controls, circumventing the potential confound of impaired retrieval and recognition processes in the patient group. Clarifying the role of the hippocampus in statistical learning would also provide important evidence on hippocampal computations, potentially contributing to the growing body of work demonstrating hippocampal involvement across a wide range of diverse tasks (e.g., Duff & Brown-Schmidt, 2017; Greene, Gross, Elsinger, & Rao, 2006, 2007; Rubin, Watson, Duff, & Cohen, 2014; Westerberg, Miller, Reber, Cohen, & Paller, 2011).
2.1.3. Understanding components of statistical learning
Evidence about the neural basis of statistical learning may also help to test current theoretical models and to better understand the substantial variability seen in statistical learning performance. As has been previously noted, individual performance on postexposure statistical learning tasks varies substantially, with at least one-third of a sample often failing to perform the task at above-chance levels (Frost et al., 2015; Siegelman & Frost, 2015). While statistical learning is often conceptualized as a single process, it in fact involves multiple dissociable neurocognitive components, and this individual variability in performance may be driven by differences in any number of these components.
This idea has been discussed previously in a number of different forms. For example, Frost et al. (2015) propose the individual variance in statistical learning performance can be split into two main sources: (1) ability to encode representations of individual elements in a stream, within the presentation modality and (2) ability to compute the distributional properties of the encoded representations (e.g., the transitional probabilities between syllables). Karuza and colleagues (2014) describe four essential components to statistical learning: (1) sensory input encoding, (2) pattern extraction, (3) model building, and (4) retrieval/recognition. Similarly, we have recently proposed that statistical learning in the context of speech segmentation can be dissociated into (1) “word identification,” involving a transition from the perception and encoding of raw individual syllables to larger integrated words, and (2) subsequent memory storage and memory retrieval (Batterink & Paller, 2017). Regardless of how exactly these different processes are divided or described, logically they must all contribute to behavioral evidence of learning on postexposure tests. However, offline measures can only weigh in on whether that learning has (or has not) ultimately occurred, and they cannot dissociate these conceptually distinct components of learning.
In contrast, neural measures of learning have the potential to provide insight into the actual learning process itself, rather than merely the outcome of learning. Thus, these measures may allow us to empirically dissociate theoretically derived components of statistical learning and to identify their underlying neural mechanisms. For example, we previously used an EEG-based measure of neural entrainment to track statistical learning online, during exposure. This measure also allowed us to separate the “word identification” component of statistical learning—that is, the perceptual binding of individual syllables into component words—from subsequent memory-related retrieval processes (Batterink & Paller, 2017). In future studies, these sorts of neural measures of online statistical learning could be synergistically combined with experimental manipulations designed to tease apart these different components of learning. As an illustration, Bogaerts, Siegelman, and Frost (2016) independently manipulated exposure duration and transitional probabilities to dissociate individual item encoding from higher order computations of distributional properties. These experimental manipulations impacted performance on an offline forced-choice recognition measure and may also impact neural signatures of learning in highly specific ways. For example, manipulations designed to target item encoding may modulate learning effects in sensory cortex, whereas manipulations that influence the computation of transitional probabilities may influence the neural signatures of learning in domain-general memory systems. Thus, understanding the neural basis of statistical learning may provide a powerful platform to tease apart different theoretical components of statistical learning.
2.1.4. Neural measures as sensitive indices of learning
Neural measures of statistical learning also hold great promise as tools to track learning in populations where behavioral responses may not be easily acquired. Using fMRI, Turk-Browne et al. (2009) found neural signatures of learning in the striatum even in observers who exhibited no subsequent explicit familiarity, suggesting that neural measures may provide more sensitive indices of learning than traditional behavioral measures. Online EEG-based neural entrainment (Batterink & Paller, 2017) could also potentially be used to track statistical learning without requiring a behavioral response. These types of measures could be used to assess statistical learning in different patient populations (e.g., hippocampal amnesia and Parkinson’s disease), under different conditions of consciousness (e.g., sleep or anesthesia) and in different age groups (e.g., very young infants). By providing continuous data sampled over the exposure period, neural measures can also be used to track the time course of learning, rather than merely providing an assessment of the final outcome of learning.
2.2. Implicit learning
On the other side of the coin, insights from statistical learning research may also be applied to the field of implicit learning. Non-univariate neuroimaging methods such as representational similarity analysis (e.g., Schapiro et al., 2012, 2013) and functional connectivity analyses (Karuza et al., 2017) have recently been applied to understand statistical learning (2012), and they may be equally informative for understanding implicit learning. As previously described, using pattern similarity analyses, Schapiro et al. (2012) found that the representations of objects presented in pairs became more similar to one another in the hippocampus and MTL cortex after exposure to a structured visual stream. A similar approach may be applied to implicit learning paradigms such as the AGL or weather prediction tasks to investigate whether similar MTL-mediated mechanisms may drive learning. For example, in the AGL task, letters or strings that occur together more frequently during training may come to be represented more similar to one another in the hippocampus, supporting a chunk-based model. An alternative (though not mutually exclusive) possibility is that grammatical sequences will be represented more similar to one another than ungrammatical sequences as a consequence of learning, reflecting abstract grammatical knowledge independent of chunk strength.
Other non-univariate neuroimaging methods such as functional connectivity analyses have also been used to explore neural changes associated with statistical learning. In a spatial statistical learning task, decreases in task-based connectivity were reported over the course of exposure to structured input, with greater decreases predicting better posttest performance (Karuza et al., 2017). Similar analyses could be applied to a variety of implicit learning paradigms to understand how changes in connectivity among different brain regions may support learning. Overall, taking advantage of continued advances in neuroimaging will provide additional insights into different learning mechanisms that drive performance in the various implicit learning tasks.
3. Conclusions
The neural events underlying implicit learning and statistical learning appear to share many commonalities. At the same time, these two fields have separate and distinct histories, and understanding these different trajectories and research approaches is critical to fostering synergy and continued progress in both areas. These separate research histories argue against the idea that these two terms may be combined under a single construct of “implicit statistical learning.” Moving forward, we suggest that a reasonable approach to aligning these two strands of research is to discuss each type of learning by experimental paradigm, with an acknowledgment that learning across these different paradigms can often involve both declarative and nondeclarative memory systems in healthy learners. In this way, the term “implicit learning” may continue to be used to refer to learning “without awareness of the products of learning” (Frensch & Runger, 2003, p. 14), which may potentially emerge in any of these experimental paradigms, including the canonical triplet segmentation task typically used to study statistical learning as traditionally defined.
By taking advantage of what is currently known about the neural bases supporting learning in all of these related experimental paradigms, we may continue to make progress in these two historically separate fields of statistical learning and implicit learning. In particular, many decades of research have sought to uncover the neural mechanisms of implicit learning; however, in comparison, parallel research in the field of statistical learning is still in its infancy. To reach a better understanding of statistical learning, future research may help clarify issues such as how memory systems interact over the course of learning, and how the hippocampus may play a critical role. In addition, neural measures may yield valuable insight into theoretically dissociable components of statistical learning, provide sensitive indices of learning to supplement behavioral measures, and track the time course of learning. By the same token, research using implicit learning paradigms may also benefit from recent advances in neuroimaging methods, which have already been successfully applied to shed light on statistical learning mechanisms. By continuing to align these two strands of learning research, we will be in a better position to characterize the neural systems that support the core human ability to acquire patterns in the environment, helping us to understand the neurocognitive mechanisms that underlie a wide variety of cognitive abilities.
References
- Abla D, & Okanoya K (2008). Statistical segmentation of tone sequences activates the left inferior frontal cortex: A near-infrared spectroscopy study. Neuropsychologia, 46, 2787–2795. [DOI] [PubMed] [Google Scholar]
- Albouy G, Sterpenich V, Balteau E, Vandewalle G, Desseilles M, Dang-Vu T … Maquet P (2008). Both the hippocampus and striatum are involved in consolidation of motor sequence memory. Neuron, 58, 261–272. [DOI] [PubMed] [Google Scholar]
- Aron AR, Shohamy D, Clark J, Myers C, Gluck MA, & Poldrack RA (2004). Human midbrain sensitivity to cognitive feedback and uncertainty during classification learning. Journal of Neurophysiology, 92, 1144–1152. [DOI] [PubMed] [Google Scholar]
- Batterink L (2017). Rapid statistical learning supporting word extraction from continuous speech. Psychological Science, 28, 921–928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Batterink L, & Paller KA (2017). Online neural monitoring of statistical learning. Cortex, 90, 31–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Batterink L, Reber PJ, Neville H, & Paller KA (2015). Implicit and explicit contributions to statistical learning. Journal of Memory and Language, 83, 62–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Batterink L, Reber PJ, & Paller KA (2015). Functional differences between statistical learning with and without explicit training. Learning and Memory, 22, 544–556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bischoff-Grethe A, Martin M, Mao H, & Berns GS (2001). The context of uncertainty modulates the subcortical response to predictability. Journal of Cognitive Neuroscience, 13, 986–993. [DOI] [PubMed] [Google Scholar]
- Bogaerts L, Siegelman N, & Frost R (2016). Splitting the variance of statistical learning performance: A parametric investigation of exposure duration and transitional probabilities. Psychonomic Bulletin & Review, 23, 1250–1256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bornstein AM, & Daw ND (2012). Dissociating hippocampal and striatal contributions to sequential prediction learning. European Journal of Neuroscience, 35, 1011–1023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bornstein AM, & Daw ND (2013). Cortical and hippocampal correlates of deliberation during model-based decisions for rewards in humans. PLoS Computational Biology, 9(12), e1003387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brady T, & Oliva A (2008). Statistical learning using real-world scenes. Psychological Science, 19(7),678–685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown RM, & Robertson EM (2007a). Off-line processing: Reciprocal interactions between declarative and procedural memories. Journal of Neuroscience, 39, 10468–10475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown RM, & Robertson EM (2007b). Inducing motor skill improvements with a declarative task. Nature Neuroscience, 10, 148–149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christiansen MH (2018). Implicit statistical learning: A tale of two literatures. Topics in Cognitive Science, 10.1111/tops.12420 [DOI] [PubMed] [Google Scholar]
- Cohen NJ, Eichenbaum H, Deacedo B, & Corkin S (1985). Different memory systes underlying acquisition of procedural and declarative knowledge. Annals New York Academy of Sciences, 444, 54–71. [DOI] [PubMed] [Google Scholar]
- Cohen NJ, & Squire LR (1980). Preserved learning and retention of pattern-analyzing skill in amnesia: Dissociation of knowing how and knowing that. Science, 210(4466), 207. [DOI] [PubMed] [Google Scholar]
- Corkin S (1968). Acquisition of motor skill after bilateral medial temporal-lobe excision. Neuropsychologia, 6, 255–265. [Google Scholar]
- Covington NV, Brown-Schmidt S, & Duff MC (2018). The necessity of the hippocampus for statistical learning. Journal of Cognitive Neuroscience, 30, 680–697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cunillera T, Camara E, Toro JM, Marco-Pallares J, Sebastian-Galles N, Ortiz H … Rodríguez-Fornells A (2009). Time course and functional neuroanatomy of speech segmentation in adults. NeuroImage, 48(3), 541–553. [DOI] [PubMed] [Google Scholar]
- Curran T (1997). Higher-order associative learning in amnesia: Evidence from the serial reaction time task. Journal of Cognitive Neuroscience, 9, 522–533. [DOI] [PubMed] [Google Scholar]
- Daselaar SM, Rombouts SARB, Veltman DJ, Raaijmakers JGW, & Jonker C (2003). Similar network activated by young and old adults during the acquisition of a motor sequence. Neurobiology of Aging, 24, 1013–1019. [DOI] [PubMed] [Google Scholar]
- Destrebecqz A, & Cleeremans A (2001). Can sequence learning be implicit? New evidence with the process dissociation procedure. Psychonomic Bulletin & Review, 8, 343–350. [DOI] [PubMed] [Google Scholar]
- Doeller CF, Opitz B, Krick CM, Mecklinger A, & Reith W (2005). Prefrontal–hippocampal dynamics involved in learning regularities across episodes. Cerebral Cortex, 15, 1123–1133. [DOI] [PubMed] [Google Scholar]
- Doyon J, Owen AM, Petrides M, Sziklas V, & Evans AC (1996). Functional anatomy of visuomotor skill learning in human subjects examined with positron emission tomography. European Journal of Neuroscience, 8(4), 637–648. [DOI] [PubMed] [Google Scholar]
- Duff MC, & Brown-Schmidt S (2017). Hippocampal contributions to language use and processing In Hannula DE & Duff M (Eds.), The hippocampus from cells to systems (pp. 503–536). Cham, Switzerland: International. [Google Scholar]
- Eichenbaum H, & Cohen NJ (2001). From conditioning to conscious recollection: Multiple memory systems of the brain. New York: Oxford University Press. [Google Scholar]
- Fedorenko E, Duncan J, & Kanwisher N (2012). Language-selective and domain-general regions lie side by side within Broca’s area. Current Biology, 22(21), 2059–2062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Filoteo JV, Lauritzen S, & Maddox WT (2010). Removing the frontal lobes: The effects of engaging executive functions on perceptual category learning. Psychological Science, 21, 415–423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiser J, & Aslin RN (2002). Statistical learning of higher-order temporal structure from visual shape sequences. Journal of Experimental Psychology. Learning, Memory, and Cognition, 28, 458–467. [DOI] [PubMed] [Google Scholar]
- Fletcher P, Zafiris O, Frith C, Honey R, Corlett P, & Zilles K (2005). On the benefits of not trying: Brain activity and connectivity reflecting the interactions of explicit and implicit sequence learning. Cerebral Cortex, 15(7), 1002–1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Foerde K, Knowlton B, & Poldrack RA (2006). Modulation of competing memory systems by distraction. Proceedings of the National Academy of Sciences of the United States of America, 103(31), 11778–11783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Forkstam C, Hagoort P, Fernandez G, Ingvar M, & Petersson KM (2006). Neural correlates of artificial syntactic structure classification. NeuroImage, 32, 956–967. [DOI] [PubMed] [Google Scholar]
- Franco A, Eberlen J, Destrebecqz A, Cleeremans A, & Bertels J (2015). Rapid serial auditory presentation: A new measure of statistical leaning in speech segmentation. Experimental Psychology, 62, 346–351. [DOI] [PubMed] [Google Scholar]
- Frensch PA, & Runger D (2003). Implicit learning. Current Directions in Psychological Science, 12, 13–18. [Google Scholar]
- Frost R, Armstrong BC, Siegelman N, & Christiansen MH (2015). Domain generality versus modality specificity: The paradox of statistical learning. Trends in Cognitive Sciences, 19, 117e125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gabrieli JDE (1998). Cognitive neuroscience of human memory. Annual Review of Psychology, 49, 87–115. [DOI] [PubMed] [Google Scholar]
- Galea JM, Albert NB, Ditye T, & Miall RC, (2009). Disruption of the dorsolateral prefrontal cortex facilitates the consolidation of procedural skills. Journal of Cognitive Neuroscience, 22, 1158–1164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garraux G, Peigneux P, Carson RE, & Hallett M (2007). Task-related interaction between basal ganglia and cortical dopamine release. Journal of Neuroscience, 27(52), 14434–14441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gobel EW, Parrish TB, & Reber PJ (2011). Neural correlates of skill acquisition: decreased cortical activity during a serial interception sequence learning task. NeuroImage, 58, 1150–1157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goshen-Gottstein Y, Moscovitch M, & Melo B (2000). Intact implicit memory for newly formed verbal associations in amnesic patients following single study trials. Neuropsychology, 15, 570–578. [DOI] [PubMed] [Google Scholar]
- Graf P, & Schacter DL (1985). Implicit and explicit memory for new associations in normal and amnesic subjects. Journal of Experimental Psychology: Learning, Memory, and Cognition, 11(3), 501–518. [DOI] [PubMed] [Google Scholar]
- Grafton ST, Hazeltine E, & Ivry R (1995). Functional mapping of sequence learning in normal humans. Journal of Cognitive Neuroscience, 7, 497–510. [DOI] [PubMed] [Google Scholar]
- Greene AJ, Gross WL, Elsinger CL, & Rao SM (2006). An fMRI analysis of the human hippocampus: Inference, context and task awareness. Journal of Cognitive Neuroscience, 18, 1156–1173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greene AJ, Gross WL, Elsinger CL, & Rao SM (2007). Hippocampal differentiation without recognition: An fMRI analysis of the contextual cueing task. Learning & Memory, 14, 548–553. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hannula DE, & Greene AJ (2012). The hippocampus reevaluated in unconscious learning and memory: At a tipping point? Frontiers in Human Neuroscience, 6, 80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hazeltine E, Grafton ST, & Ivry R (1997). Attention and stimulus characteristics determine the locus of motor-sequence encoding—A PET study. Brain, 120, 123–140. [DOI] [PubMed] [Google Scholar]
- Henke K (2010). A model for memory systems based on processing modes rather than consciousness. Nature Reviews Neuroscience, 11, 523–532. [DOI] [PubMed] [Google Scholar]
- Howard DV, & Howard JH (2001). When it does hurt to try: Adult age differences in the effects of instructions on implicit pattern learning. Psychonomic Bulletin & Review, 8(4), 798–805. [DOI] [PubMed] [Google Scholar]
- Isbilen ES, McCauley SM, Kidd E, & Christiansen MH (2017). Testing statistical learning implicitly: A novel chunk-based measure of statistical learning. Paper presented at the 39th Annual Meeting of the Cognitive Science Society. [Google Scholar]
- Karuza EA, Emberson LL, & Aslin RN (2014). Combining fMRI and behavioral measures to examine the process of human learning. Neurobiology Learning and Memory, 109, 193–206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karuza EA, Emberson LL, Roser ME, Cole D, Aslin RN, & Fiser J (2017). Neural signatures of spatial statistical learning: Characterizing the extraction of structure from complex visual scenes. Journal of Cognitive Neuroscience, 29(12), 1963–1976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karuza EA, Newport EL, Aslin RN, Starling SJ, Tivarus ME, & Bavelier D (2013). Neural correlates of statistical learning in a word segmentation task: An fMRI study. Brain and Language, 127, 46–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keane M, Gabrieli J, Monti L, Fleischman D, Cantor J, & Noland J (1997). Intact and impaired conceptual memory processes in amnesia. Neuropsychology, 11, 59–69. [DOI] [PubMed] [Google Scholar]
- Kim R, Seitz A, Feenstra H, & Shams L (2009). Testing assumptions of statistical learning: Is it long-term and implicit? Neuroscience Letters, 461, 145–149. [DOI] [PubMed] [Google Scholar]
- Knowlton BJ, & Squire LR (1993). The learning of categories: Parallel brain systems for item memory and category knowledge. Science, 262(5140), 1747–1749. [DOI] [PubMed] [Google Scholar]
- Knowlton BJ, & Squire LR (1994). The information acquired during artificial grammar learning. Journal of Experimental Psychology. Learning, Memory, and Cognition, 20(1), 79. [DOI] [PubMed] [Google Scholar]
- Knowlton BJ, & Squire L (1996). Artificial grammar learning depends on implicit acquisition of both abstract and exemplar-specific information. Journal of Experimental Psychology. Learning, Memory, and Cognition, 22(1), 169. [DOI] [PubMed] [Google Scholar]
- Knowlton B, Squire L, & Gluck M (1994). Probabilistic classification learning in amnesia. Learn and Memory, 1, 106–120. [PubMed] [Google Scholar]
- Knowlton BJ, Squire LS, Paulsen JS, Swerdlow NR, Swenson M, & Butters N (1996). Dissociations within nondeclarative memory in Huntington’s disease. Neuropsychology, 4, 538–548. [Google Scholar]
- Lieberman MD, Chang GY, Chiao J, Bookheimer SY, & Knowlton BJ (2004). An event-related fMRI study of artificial grammar learning in a balanced chunk strength design. Journal of Cognitive Neuroscience, 16, 427–438. [DOI] [PubMed] [Google Scholar]
- McClelland JL, McNaughton BL, & O’Reilly RC (1995). Why there are complementary learning systems in the hippocampus and neocortex: Insights from the successes and failures of connectionist models of learning and memory. Psychological Review, 102(3), 419. [DOI] [PubMed] [Google Scholar]
- McNealy K, Mazziotta JC, & Dapretto M (2006). Cracking the language code: Neural mechanisms underlying speech parsing. Journal of Neuroscience, 26(29), 7629–7639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meulemans T, Peigneux P, & Van der Linden M (1998). Preserved artificial grammar learning in Parkinson’s disease. Brain and Cognition, 37, 109–112. [Google Scholar]
- Milner B, Corkin S, & Teuber HL (1968). Further analysis of the hippocampal amnesic syndrome: 14-year follow-up study of H.M. Neuropsychologia, 6, 215–235. [Google Scholar]
- Moody TD, Bookheimer SY, Vanek Z, & Knowlton BJ (2004). An implicit learning task activates medial temporal lobe in patients with Parkinson’s disease. Behavioral Neuroscience, 118, 438–442. [DOI] [PubMed] [Google Scholar]
- Nemeth D, Janacsek K, Polner B, & Kovacs ZA (2013). Boosting human learning by hypnosis. Cerebral Cortex, 23, 801–805. [DOI] [PubMed] [Google Scholar]
- Nissen MJ, & Bullemer PT (1987). Attentional requirements for learning: Evidence from performance measures. Cognitive Psychology, 19, 1–32. [Google Scholar]
- Peigneux P, Maquet P, Meulemans T, Destrebecqz A, Laureys S, Degueldre C … Cleeremans A (2000). Striatum forever, despite sequence learning variability: A random effect analysis of PET data. Human Brain Mapping, 10, 179–194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perruchet P, Gallego J, & Savy I (1990). A critical reappraisal of the evidence for unconscious abstraction of deterministic rules in complex experimental situations. Cognitive Psychology, 22, 493–516. [Google Scholar]
- Perruchet P, & Pacteau C (1990). Synthetic grammar learning: Implicit rule abstraction or explicit fragmentary knowledge. Journal of Experimental Psychology: General, 119, 264–275. [Google Scholar]
- Perruchet P, & Pacton P (2006). Implicit learning and statistical learning: One phenemenon, two approaches. Trends in Cognitive Science, 10(5), 233–238. [DOI] [PubMed] [Google Scholar]
- Petersson KM, Folia V, & Hagoort P (2012). What artificial grammar learning reveals about the neurobiology of syntax. Brain & Language, 120, 83–95. [DOI] [PubMed] [Google Scholar]
- Petersson KM, Forkstam C, & Ingvar M (2004). Artificial syntactic violations activate Broca’s region. Cognitive Science, 28, 383–407. [Google Scholar]
- Poldrack RA, Clark J, Pare-Blagoev EJ, Shohamy D, Creso Moyano J, Myers C, & Gluck MA (2001). Interactive memory systems in the human brain. Nature, 414(6863), 546–550. [DOI] [PubMed] [Google Scholar]
- Poldrack RA, Prabhakaran V, Seger CA, & Gabrieli JD (1999). Striatal activation during acquisition of a cognitive skill. Neuropsychology, 13, 564–574. [DOI] [PubMed] [Google Scholar]
- Rauch SL, Savage CR, Brown HD, Curran T, Alpert NM, Kendrick A, Fischman AJ, & Kosslyn SM (1995). A PET investigation of implicit and explicit sequence learning. Human Brain Mapping, 3, 271–286. [Google Scholar]
- Rauch SL, Whalen PJ, Savage CR, Curran T, Kendrick A, Brown HD, Bush G, Breiter HC, & Rosen BR (1997). Striatal recruitment during an implicit sequence learning task as measured by functional magnetic resonance imaging. Human Brain Mapping, 5, 124–132. [PubMed] [Google Scholar]
- Reber A (1967). Implicit learning of artificial grammars. Journal of Verbal Learning and Verbal Behavior, 6, 855–863. [Google Scholar]
- Reber AS (1976). Implicit learning of synthetic languages: The role of instructional set. Journal of Experimental Psychology, 2(1), 88.1249526 [Google Scholar]
- Reber AS (1989). Implicit learning and tacit knowledge. Journal of Experimental Psychology, 118, 219–235. [Google Scholar]
- Reber PJ (2013). The neural basis of implicit learning and memory: A review of neuropsychological and neuroimaging research. Neuropsychologia, 51, 2026–2042. [DOI] [PubMed] [Google Scholar]
- Reber PJ, & Squire LJ (1994). Parallel brain systems for learning with and without awareness. Learning and Memory, 1, 217–229. [PubMed] [Google Scholar]
- Reber PJ, & Squire LJ (1998). Encapsulation of implicit and explicit memory in sequence learning. Journal of Cognitive Neuroscience, 10, 248–263. [DOI] [PubMed] [Google Scholar]
- Reber PJ, & Squire LR (1999). Intact learning of artificial grammars and intact category learning by patients with Parkinson’s disease. Behavioral Neuroscience, 113, 235–242. [DOI] [PubMed] [Google Scholar]
- Reber PJ, Stark CEL, & Squire LR (1998). Contrasting cortical activity associated with category memory and recognition memory. Learning and Memory, 5(6), 420–428. [PMC free article] [PubMed] [Google Scholar]
- Rebuschat P (Ed.) (2015). Implicit and explicit learning of languages. Amsterdam: John Benjamins. [Google Scholar]
- Reiss JP, Campbell DW, Leslie WD, Paulus MP, Stroman PW, & Polimeni JO (2005). The role of the striatum in implicit learning: A functional magnetic resonance imaging study. NeuroReport, 16, 1291–1295. [DOI] [PubMed] [Google Scholar]
- Rieckmann A, Fischer H, & Büackman L (2010). Activation in striatum and medial temporal lobe during sequence learning in younger and older adults: Relations to performance. NeuroImage, 50, 1303–1312. [DOI] [PubMed] [Google Scholar]
- Rubin RD, Watson PD, Duff MC, & Cohen NJ (2014). The role of the hippocampus in flexible cognition and social behavior. Frontiers in Human Neuroscience, 8, 742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saffran JR, Aslin RN, & Newport EL (1996). Statistical learning by 8-month-old infants. Science, 274, 1926–1928. [DOI] [PubMed] [Google Scholar]
- Saffran JR, Newport EL, & Aslin R (1996). Word segmentation: The role of distributional cues. Journal of Memory and Language, 35, 606–621. [Google Scholar]
- Saffran J, Newport E, Aslin R, Tunick R, & Barrueco S (1997). Incidental language learning: Listening (and learning) out of the corner of your ear. Psychological Science, 8, 101–105. [Google Scholar]
- Sanchez DJ, Gobel EW, & Reber PJ (2010). Performing the unexplainable: Implicit task performance reveals individually reliable sequence learning without explicit knowledge. Psychonomic Bulletin & Review, 17, 790–796. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sanchez DJ, & Reber PJ (2013). Explicit pre-training instruction does not improve implicit perceptual-motor sequence learning. Cognition, 126, 341–351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schapiro AC, Gregory E, Landau B, McCloskey M, & Turk-Browne NB (2014). The necessity of the medial temporal lobe for statistical learning. Journal of Cognitive Neuroscience, 26(8), 1736–1747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schapiro AC, Kustner LV, & Turk-Browne NB (2012). Shaping of object representations in the human medial temporal lobe based on temporal regularities. Current Biology, 22, 1622–1627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schapiro A, & Turk-Browne N (2015). Statistical learning In Toga AW (Ed.), Brain mapping: An encyclopedic reference, vol. 3 (pp. 501–506). New York: Academic Press. [Google Scholar]
- Schendan HE, Searl MM, Melorse RJ, & Stern CE (2003). An fMRI study of the role of the medial temporal lobe in implicit and explicit sequence learning. Neuron, 37, 1013–1025. [DOI] [PubMed] [Google Scholar]
- Scoville WB, & Milner B (1957). Loss of recent memory after bilateral hippocampal lesions. Journal of Neurology, Neurosurgery and Psychiatry, 2, 11–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seger CA, & Cincotta CM (2005). The roles of the caudate nucleus in human classification learning. Journal of Neuroscience, 25, 2941–2951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seidler RD, Purushotham A, Kim SG, Ugurbil K, Willingham D, & Ashe J (2005). Neural correlates of encoding and expression in implicit sequence learning. Exp Brain Res, 165, 114–124. [DOI] [PubMed] [Google Scholar]
- Servan-Schreiber E, & Anderson JR (1990). Learning artificial grammars with competitive chunking. Journal of Experimental Psychology: Learning, Memory, and Cognition, 16, 592–608. [Google Scholar]
- Shanks DR, Channon S, Wilkinson L, & Curran HV (2006). Disruption of sequential priming in organic and pharmacological amnesia: A role for the medial temporal lobes in implicit contextual learning. Neuropsychopharmacology, 31, 1768–1776. [DOI] [PubMed] [Google Scholar]
- Siegelman N, Bogaerts L, & Frost R (2017). Measuring individual differences in statistical learning: Current pitfalls and possible solutions. Behavioral Research Methods, 49, 418–432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegelman N, & Frost R (2015). Statistical learning as an individual ability: Theoretical perspectives and empirical evidence. Journal of Memory and Language, 81, 105–120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skosnik PD, Gitelman DR, Parrish TB, Mesulam M-M, & Reber PJ (2002). Neural correlates of artificial grammar learning. NeuroImage, 17, 1306–1314. [DOI] [PubMed] [Google Scholar]
- Squire L (2004). Memory systems of the brain: A brief history and current perspective. Neurobiology of Learning and Memory, 82, 171–177. [DOI] [PubMed] [Google Scholar]
- Squire LR, Stark CEL, & Clark RE (2004). The medial temporal lobe. Annual Review of Neuroscience, 27, 279–306. [DOI] [PubMed] [Google Scholar]
- Squire LR, & Zola SM (1996). Structure and function of declarative and nondeclarative memory systems. Proceedings of the National Academy of Sciences of the United States of America, 93, 13515–13522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Turk-Browne N, Jungé J, & Scholl B (2005). The automaticity of visual statistical learning. Journal of Experimental Psychology: General, 134, 552–563. [DOI] [PubMed] [Google Scholar]
- Turk-Browne NB, Scholl BJ, Chun MM, & Johnson MK (2009). Neural evidence of statistical learning: Efficient detection of visual regularities without awareness. Journal of Cognitive Neuroscience, 21, 1934–1945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Turk-Browne NB, Scholl BJ, Johnson MK, & Chun MM (2010). Implicit perceptual anticipation triggered by statistical learning. Journal of Neuroscience, 30, 11177–11187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ullman MT (2004). Contributions of memory circuits to language: The declarative/procedural model. Cognition, 92(1–2), 231–270. [DOI] [PubMed] [Google Scholar]
- Ullman MT (2005). A cognitive neuroscience perspective on second language acquisition: The declarative/procedural model In Sanz C (Ed.), Mind and context in adult second language acquisition: Methods, theory, and practice (pp. 141–178). Washington, DC: Georgetown University Press. [Google Scholar]
- Virag M, Janacsek K, Horvath A, Bujdoso Z, Fabo D, Nemeth D (2015). Competition between frontal lobe functions and implicit sequence learning: Evidence from the long-term effects of alcohol. Experimental Brain Research, 233, 2081. [DOI] [PubMed] [Google Scholar]
- Westerberg CE, Miller BB, Reber PJ, Cohen NJ, & Paller KA (2011). Neural correlates of contextual cueing are modulated by an explicit learning strategy. Neuropsychologia, 49, 3439–3447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Willingham DB, Greeley T, & Bardone AM (1993). Dissociation in a serial response time task using a recognition measure: Comment on Perruchet and Amorim (1992). Journal of Experimental Psychology: Learning, Memory, & Cognition, 19, 1424–1430. [Google Scholar]
- Witt K, Nuhsman A, & Deuschl G (2002). Intact artificial grammar learning in patients with cerebellar degeneration and advanced Parkinson’s disease. Neuropsychologia, 40, 1534–1540. [DOI] [PubMed] [Google Scholar]
