Abstract
In their seminal study of chess expertise, Simon and Chase (Simon & Chase, 1973; Chase & Simon, 1973) proposed that perceptual learning was a necessary component of skill acquisition. In their view, acquisition of skill results from the strategic use of learning at multiple levels in order to adaptively overcome inherent limitations. The knowledge acquired by way of perceptual learning that supported increasingly sophisticated perceptual discrimination processes, according to Simon and Chase, was referred to as a “chunk.” The “chunk” was conceptualized as a meaningful complex set of features that abstracted the notion of a perceptual object. Simon and Chase further suggested that meaningful combinations of chunks could be combined to form configurations (Simon & Chase, 1973, p. 399). The present study addresses this idea by framing the notion of a “chunk” in terms of two formal meta-theories, one that addresses representation (Ashby & Townsend, 1986), and one that addresses processing (Townsend & Nozawa, 1995), and tests the prediction that perceptual learning produces organized perceptual objects (“chunks”). Two experiments combine behavioral and electroencephelographic (EEG) measures to show that perceptual learning produces (a) a shift from perceptual independence and separability to violations of separability, and (b) shifts from limited-capacity serial processing to super-capacity parallel processing. The evidence from both experiments is strong and consistent: perceptual learning does indeed induce “chunking”—the production of perceptual objects, and the foundation of perceptual expertise.
Keywords: perceptual learning, skill acquisition, expertise, chunking
In 1963, Eleanor Gibson (Gibson, 1963) noted that the study of perceptual learning had become a “healthily growing” (p. 50) field, having only three years earlier “acquired the status of an area worthy of separate review” (p. 29, in reference to Drever, 1960). Further, she noted that while perceptual learning must logically be a component of the acquisition of higher-order skills, “[v]ery little applied (sic) research on perceptual skills is to be found in psychological journals” (Gibson, 1963, p. 49). More than 50 years later, comprehensive reviews of the literature (Kellman & Massey, 2013; Dosher & Lu, 2017; Seitz, 2017; Watanabe & Sasaki, 2015) suggest that Gibson’s observations still hold true. The literature on perceptual learning is healthy. However, it is also true that there is relatively limited work on the explicit role of perceptual learning in the acquisition of higher-level expertise (Lu, Lin, & Dosher, 2016; Kellman & Garrigan, 2009; Kellman & Massey, 2013; Lu et al., 2016; Polat, 2016).
Perceptual learning and perceptual “chunks”
In their seminal study of chess, Simon and Chase (Simon & Chase, 1973; Chase & Simon, 1973) proposed that perceptual learning was a necessary component of skill acquisition. In their view, acquisition of skill results from the strategic use of learning at multiple levels in order to adaptively overcome inherent limitations. They proposed that “specific perceptual knowledge acquired through long experience, stored in long-term memory, and accessed by perceptual discrimination processes” (Simon & Chase, 1973, p. 394) was necessary to alleviate the constraints of a limited-capacity short-duration memory. The knowledge acquired by way of perceptual learning that supported increasingly sophisticated perceptual discrimination processes, according to Simon and Chase, was referred to as a “chunk.”
The “chunk” was conceptualized as a meaningful complex set of features that abstracted the notion of a perceptual object. Simon and Chase further suggested that meaningful combinations of chunks could be combined to form configurations (Simon & Chase, 1973, p. 399). Simon and Chase formalized this recursive structure in a computational model that possessed the critical characteristic of coding chunks by changing the weighted connections from input to output elements. The literature provides some support for the idea that perceptual learning of any stimulus involving more than one dimension produces effects that are consistent with the regularities of learning and attending to objects (as dependent conjunctions of features, e.g., W. Li, Piech, & Gilbert, 2004; R. W. Li, Levi, & Klein, 2004; Schyns, Goldstone, & Thibaut, 1998; Czerwinski, Lightfoot, & Shiffrin, 1992). Indeed, modern conceptualizations, such as Goldstone’s notion of perceptual unitization (Goldstone, 1998, 2000), along with computational (e.g., Schyns et al., 1998) and conceptual (e.g., Kellman & Garrigan, 2009; Kellman & Massey, 2013) accounts of the construct of chunking share many structural aspects with successful models of perceptual learning (e.g., Dosher, Jeter, Liu, & Lu, 2013; Petrov, Dosher, & Lu, 2005; Huang, Lu, & Dosher, 2012; Zhaoping, Herzog, & Dayan, 2003).
The hierarchical/recursive structure proposed by Simon and Chase (1973) implies a set of characteristics present in both the empirical and theoretical literatures on perceptual learning. First, their proposal implies that learning can occur at multiple levels of representation. This is most critically true if the first-order organization of features into perceptual objects is generalized to configurations (and potentially) beyond. This is a characteristic of the major theoretical conceptions of perceptual learning (e.g., Dosher et al., 2013; Watanabe & Sasaki, 2015; Ahissar & Hochstein, 2004) and is suggested by some of the data on generalization and transfer (e.g., Wang et al., 2016; Wang, Zhang, Klein, Levi, & Yu, 2012, 2014; J.-Y. Zhang, Cong, Klein, Levi, & Yu, 2014). Second, the notion that learning can occur at multiple levels of representation suggests the computational need for changing both feed-forward and feed-back connections among levels of processing. This computational requirement is one that has been acknowledged in multiple theoretical perspectives (see review in Schyns et al., 1998), is one that has been shown to be critical in maintaining stability in networks of neurons (e.g., Moldakarimov, Bazhenov, & Sejnowski, 2014), and that is supported by various sources of empirical evidence (e.g., Rauss, Schwartz, & Pourtois, 2011; Ruff & Driver, 2006). Third, implied in the original proposal by Simon and Chase, and elaborated more explicitly in later work (e.g., Feltovich, Prietula, & Ericsson, 2006; Chase & Ericsson, 1981), is the notion that ongoing perceptual organization, or the formation of new chunks, is dependent on and assisted by existing representations. Support for this idea can be found empirically in the “Eureka” phenomenon (Ahissar & Hochstein, 1997), as well as in effects due to frames of reference or context (e.g., Rauss et al., 2011; Pourtois, Grandjean, & Sander, 2004), and theoretically in concepts such as Gibson’s predifferentiation (Gibson & Walk, 1956) and the mnemonic encoding principle of skilled memory theory (Chase & Ericsson, 1981; Wenger & Payne, 1995). Fourth, according to Simon and Chase, the learning of perpetual objects with extensive practice allowed for the inherent limitations of declarative memory to be alleviated by a transition to procedural memory. They conceived of this as stimulus-response learning, idealized computationally as a production, a pairing of a condition and an action. On this issue, Simon and Chase were consistent with the earlier conception of the stages of skill acquisition described by Fitts and Posner (1967), in which the second stage was a transition to declarative retrieval. Repeated declarative retrieval provided the basis for the transition to the autonomous stage, sometimes referred to as automaticity (e.g., Shiffrin & Schneider, 1977; Schneider & Shiffrin, 1977; Logan, 1988; Wenger, 1999), a stage characterized by extremely efficient responding, absent the need for controlled attention. This conception is consistent with the empirical regularities associated with the learning and transfer of “pop-out” (e.g., Ahissar, Laiwand, Kozminsky, & Hochstein, 1998; Schoups & Orban, 1996) and with contemporary computational (e.g., Tenison & Anderson, 2016) and neurobiological (e.g., Ashby, Ennis, & Spiering, 2007) theories.
Defining and detecting perceptual objects
The approach taken here conceptualizes perceptual objects in terms of foundational aspects of stimulus encoding and processing (O’Toole, Wenger, & Townsend, 2001; Townsend & Wenger, 2015). With respect to encoded representations, the work relies on the theoretical definitions of dependency and lack-of-separability in general recognition theory (GRT, Ashby & Townsend, 1986; Townsend, Houpt, & Silbert, 2012). In that view, a perceptual object is one in which the encoded stimulus features evidence of a violation of perceptual independence (PI), a violation of perceptual separability (PS), and/or a violation of decisional separability (DS). A violation of PI would imply that encoded information about the level of one dimension (e.g., the contrast of a gabor patch) would be correlated with (not independent of) the encoded information about another dimension (e.g., the gabor patch’s orientation) within an individual stimulus. A violation of PS would imply that a change in the level of one dimension (e.g., an increase in contrast) would change the level of encoded information about another dimension (e.g., an increase in perceived tilt), with this effect being obtained across stimuli. A violation of DS would imply that a change in the level of one dimension (e.g., an increase in contrast) would change the criterion for judging the state of another dimension (e.g., becoming more predisposed to judge the patch as tilted up, independent of the actual level of orientation), with this effect also being obtained across stimuli. With respect to the processing of encoded information, we rely on the theoretical characterization of processing architecture, stopping-rules, channel independence, and capacity developed in systems factorial theory (SFT, Townsend & Nozawa, 1995; Little, Altieri, Yang, & Fific, 2017). In that view, a perceptual object is one that is processed in a parallel, dependent, exhaustive manner, and which evidences unlimited- to super-capacity processing. The advantages of using these two conceptions include rigorous mathematical definitions of their central constructs, well-developed and well-understood experimental tasks, and highly-refined statistical methodologies for linking theory and data.
Learning perceptual objects: GRT.
With respect to encoded representations, the notion of creating a perceptual object from two or more features by way of perceptual learning implies a practice-dependent shift from independence or separability to violations of either or both. In order to test for this change, it is necessary to have a task that allows the perceptual state of the observer to be determined with respect to all of the features simultaneously (Ashby & Townsend, 1986; Kadlec & Townsend, 1992). The task that allows this is known as the complete identification (CID) paradigm. In this task, all possible combinations of each of the features is presented with each requiring a unique response. For example, in the experiments below, two contrast-defined features each are either absent or present and at either a low or high level of contrast. In experiment 1, this results in four possible stimulus states with each assigned a unique response. The data from this task are summarized as an identification/confusion matrix, with each cell having a corresponding set of reaction times (RTs). The first of the critical measures used with the response frequencies is a test of marginal response invariance (MRI), which is defined in terms of marginal responses on each level of each dimension. MRI holds for a given level on a given dimension if the marginal probability of identifying that level on that dimension is the same across levels of the other dimension (Ashby & Townsend, 1986; Silbert & Hawkins, 2016). If MRI does not hold, then one infers a failure of PS, DS, or both. The second critical measure is a test of report independence (RI, originally referred to as sampling independence, see Ashby & Townsend, 1986). RI holds for a given stimulus if the probability of correctly identifying that stimulus is equal to the product of the marginal probabilities of accurately identifying the level of each component (Ashby & Townsend, 1986; Silbert & Hawkins, 2016). If RI does not hold, it suggests a violation of PI for that stimulus. More recently, Townsend and colleagues (2012) have developed RT-based tests of MRI and RI (referred to respectively as timed MRI [tMRI] and timed RI [tRI]). On the basis of the inferences suggested by these tests, multidimensional gaussian models with these characteristics are fit to the data and compared (per Thomas, 2001) to both more- and less-restrictive models to determine whether the inferences drawn from the tests of MRI and RI are consistent with the best-fitting model.
Learning perceptual objects: SFT.
With respect to foundational characteristics of information processing, the notion of creating a perceptual object from two or more features by way of perceptual learning implies a practice-dependent shift from serial or parallel independent, self-terminating, limited-capacity processing to parallel, dependent, exhaustive, unlimited- to super-capacity processing (O’Toole et al., 2001; Wenger & Townsend, 2001; Townsend & Wenger, 2015). In order to perform strong-inference tests for this change, it is necessary to use a task known as the double-factorial paradigm (DFP Townsend & Nozawa, 1995; Townsend & Wenger, 2004a, 2004b). The prototypical version of this task involves two features, each of which can be present or absent (Townsend & Nozawa, 1995; Ingvalson & Wenger, 2005). In addition, when both features are present, their relative speeds of processing are manipulated so that each can be processed slowly or quickly. The response instruction used here is to give one response when both features are present; otherwise a second response is given. This is referred to as an AND task.
Inferences about processing architecture and stopping rule are drawn using RT interaction contrasts calculated for the four cells of the design in which both features are present. The interaction contrasts are calculated at the level of the mean, and then at the level of the survivor function of the RT distribution (the complement of the cumulative distribution function). The signs of these two interaction contrasts allow for unique inferences regarding architecture and stopping rule, and statistical tests are available to check the signs of both (Houpt & Townsend, 2010). Inferences regarding capacity are drawn on the basis of a capacity coefficient which, for the AND task is defined as a ratio of the inverse cumulative hazard functions (Chechile, 2011) for single- and double-target trials (Townsend & Wenger, 2004b; Townsend & Eidels, 2011; Townsend, Wenger, & Houpt, 2018). Values of this coefficient that are equal to 1 indicate unlimited-capacity processing, values < 1 indicate limited capacity processing, and values > 1 indicate super-capacity processing, and statistical tools have been developed to assess the reliability of any deviations from 1 (Houpt & Townsend, 2010). Inferences regarding independence are guided by the inferences regarding capacity, as it has been shown that limited or super-capacity processing typically results from channel dependences (Townsend & Wenger, 2004b; Wenger & Townsend, 2006).
Converging evidence from the timing of neural events.
The logic and statistical practice relating data to theory in both GRT and SFT relies on converging sources of evidence (as in Bridgman, 1945). In an effort to strengthen our inferences, we sought to use a neurophysiological variable along with response frequencies and latencies. Following Schweickert and Mounts (1998), we sought to use a feature of the EEG signals that could be readily interpreted in terms consistent with the assumptions of the time-based measures in GRT and the distributional measures in SFT, in particular the assumption of selective influence. The most promising candidate in this regard was the lateralized readiness potential (LRP). The LRP is a negative-going waveform, measured in central electrodes contralateral to the motor response that it precedes, and is interpreted as being an indicator that sufficient processing has been completed in order to program the motor response (Coles, 1989; Hackley & Miller, 1995; Miller & Hackley, 1992; Mordkoff & Grosjean, 2001; Ray, Slobounov, Mordkoff, Johnston, & Simon, 2000). As well, the LRP should behave in accord with the assumptions regarding processing times and observable RTs in GRT and SFT. The LRP is estimated by subtracting the ipsilateral from the contralateral potential for each hand. Our focus will be on the onset time for the LRP, operationalized as the earliest time at which the LRP becomes reliably less than 0 (as in Kuefner, Jacques, Prieto, & Rossion, 2010; Von Der Heide, Wenger, Bittner, & Fitousi, 2018). We test the hypothesis that when the timing of the onset of the LRP is analyzed using the RT tests specified for GRT and SFT, the conclusions will be consistent with those drawn from the analyses of the RT data, providing an additional source of converging evidence for our inferences.
Experiment 1
Experiment 1 uses the theory and methods of GRT to test the hypothesis that perceptual learning results in a change in encoded representations such that they become perceptual objects. Specifically, we predict that prior to perceptual learning, observers will encode the features of a two-element stimulus as independent and/or separable, both perceptually and decisionally. After perceptual learning, observers will encode those features as dependent and/or non-separable, perceptually or decisionally. This will be consistent with the theoretical conception of an object from the GRT perspective. Critically, these inferences should be supported by the analyses of the response frequencies, RTs, and the onset times of the LRP.
Methods
Participants.
A total of four (all female, ages 21–24) participants were recruited from the University of Oklahoma community. All were right-handed, reported normal or corrected-to-normal vision, normal hearing, and unencumbered use of both hands. Participants were reimbursed by gift card at the completion of their participation. The protocol and procedures for this experiment were reviewed and approved by the Institutional Review Board of the University of Oklahoma (IRB approval number 3908).
Design.
Two tasks were used in experiment 1: a complete identification task before and after perceptual practice, and a three-down one-up staircase task augmented with catch trials for perceptual practice. The complete identification task was a feature-complete factorial design, involving the presence and absence of two contrast-defined features, presented just above or below fixation in the center of the stimulus. This resulted in four possible stimuli, each of which were assigned a unique response. These stimuli were presented at two levels of (Michelson) contrast (10% and 50%) before practice, and three levels of contrast (final threshold for each participant, 10%, and 50%) after practice.1 The three-down one-up staircase task involved the three stimuli in which at least one of the contrast-defined targets was present. Separate staircases were run for each stimulus type.
Materials.
The stimuli were created from a digital version of the line drawing of the Cheshire Cat from the original edition of Alice in Wonderland (Carroll, 1865). Figure 1 presents examples of each type of stimulus. The units of the dimensions noted in Figure 1 are degrees of visual angle at a constrained viewing distance of 78 cm. The target-absent stimulus was created from a randomly-selected and arranged set of 15 × 15 pixels from the forehead of the cat. Each of the two targets was created from the left eye of the cat, rotated by 90°. Pattern masks were created by scrambling (in 15 × 15 pixel tiles) the stimulus presented on each trial. Gaussian noise (μ = 0, σ = 0.10) was added to all stimuli and masks. Contrast for all stimuli and masks ranged from 1% to 60%, in steps of 0.1%. All stimuli were presented on an 55 cm (diagonal) gray-scale CRT, and all responses were made using a four-button response box with button presses timed to ± 1 ms. Both the complete identification trials and staircase practice trials were presented and timed using E-prime (Psychology Software Tools, Sharpsburg PA).
Figure 1 .
Examples (here at 40% contrast) of the four stimulus types used in Experiments 1 and 2. Dimensions are in degrees of visual angle. (a) Target-absent, (b) bottom only, (c) top only, (d) top and bottom.
Procedure.
The experiment was performed across 14 days, and consisted of one day of a baseline (BL) complete identification task, followed by 10 days of practice using the three-down one-up modified staircase procedure, followed by an endline (EL) set of three days of sessions with the complete identification task. Concurrent EEG was collected during the BL and EL complete identification sessions. All testing took place in light- and sound-shielded chambers. Testing with concurrent EEG occurred in an identical chamber that was also electromagnetically shielded (i.e., a Faraday cage). All sessions began with a five minute period of dark adaptation and all testing occurred between 0800 and 1700 local time. Participants performed all trials with their heads in a chin rest positioned 78 cm from the monitor.
Each session of the complete identification task required approximately 90 min, including time for EEG setup and clean-up. The BL session involved 300 presentations of each test stimulus, each at two levels of contrast: 10% and 50%. The EL sessions each involved 300 presentations of each test stimulus, each at three levels of contrast: 10%, 50%, and each participant’s final level of contrast from the perceptual practice sessions. Each session was divided into blocks of 400 trials, with a short (5 min) break between blocks, during which impedances were checked and adjusted as needed. Order of trials was randomized by session for each participant.
Each trial of the complete identification task included the following events. A small dot signaled the beginning of each trial, which was self-initiated by the observer by pressing the key under the index finger of their dominant hand. A fixation cross was presented centrally for a random period on each trial, determined by an exponential distribution with a mean of 700 ms, censored at 400 and 1000 ms. This was followed by three randomly selected pattern masks, at the level of contrast for the test stimulus, each presented for 25 ms. The test stimulus was then presented for 25 ms, followed by a set of three randomly selected pattern masks. The participant then was allowed up to 2 s to respond. Participants were instructed to respond as quickly and as accurately as possible, and no feedback was provided.
Each session of perceptual practice required between 45 and 60 min; no concurrent EEG was collected. Each session of practice was divided into six blocks of 250 trials per block, with each block involving only one of the three target-present stimuli. A short (eyes closed) break occurred between blocks. All participants began the first block of practice with each stimulus type at 60% contrast. Each subsequent block of each stimulus type began at 110% of the threshold from the previous block with that stimulus type. Contrast decrements and increments during each staircase were at 10% of the current contrast level. Catch trials involved presentation of a target-absent stimulus at the current level of contrast. The sequence and timing of events were the same as those in the complete ID. Participants indicated the perceived presence of a stimulus by pressing the button under the index finger of their dominant hand and indicated the perceived absence of a stimulus by pressing the button under the index finger of their non-dominant hand. No feedback was provided.
EEG data were collected using a 128-channel EGI system (EGI Philips, Eugene OR). EEG was recorded continuously with hardware filters set from 0.1 to 100 Hz, a sampling rate of 100 Hz, and an online vertex reference. Impedances were kept ≤ 50KΩ for the entire session. Continuous EEG was epoched around the cues (−800 to 1500 ms). Data were preprocessed and analyzed using EEGLab (Delorme & Mackeig, 2004). Data were first inspected visually, and bad channels were deleted. Artifacts were then rejected and any additional bad channels were deleted on the basis of probability using EEGLab. The data were then band-pass filtered (1 to 90 Hz), using a notch filter centered at 60 Hz (59–61 Hz). The data were then epoched, and independent components analysis (ICA) was performed and components corresponding to artifacts were removed. The data were then low-pass filtered at 8 Hz in order to determine the LRP. A set of six electrodes, three to the left of midline and three to the right, were used to determine the LRP, following previously used procedures (Kuefner et al., 2010; Von Der Heide et al., 2018). The start-time for the LRP was defined as the time point at which the negative-going difference wave was reliably different from 0.
Results
Practice.
Thresholds in each block for each stimulus type were determined by calculating the geometric mean of the contrast level associated with last 15 reversals in each block. False alarm rates were calculated for each block. To account for individual differences, thresholds were converted to relative thresholds for each observer by dividing the threshold in block n by that observer’s threshold in block 1. Reliability of change for both threshold and false alarm rates was assessed using linear regression on log-transformed values as a function of log-transformed block number. Plots of threshold and false alarm rates as a function of block are presented for each of the four observers in Figure 2 and the results of the regression analyses are presented in Table 1. Practice produced reliable decreases in threshold for all four observers for each stimulus type. Practice also produced reliable increases in false alarm rates for all four observers and for all stimulus types, with the exception of the single-target top stimulus for observer 2 and the single-target bottom stimulus for observer 4.
Figure 2 .
Experiment 1: Relative thresholds (THR) and false alarm rates (FAR) for the three stimulus types for each of the four observers (Obs).
Table 1.
Results of the regression analyses on relative thresholds and false alarm rates from the practice sessions for all four observers and all three target-present stimuli in Experiment 1. Note that all values were reliably different from 0 (p < .05) except for those marked with +.
| Relative Threshold | False Alarm Rate | ||||||
|---|---|---|---|---|---|---|---|
| Observer | Stimulus | Intercept | R2 | Intercept | R2 | ||
| 1 | top | 0.13 | −1.25 | 0.93 | −2.18 | 0.30 | 0.58 |
| bottom | −0.07 | −1.30 | 0.91 | −1.80 | 0.14 | 0.57 | |
| both | −0.25 | −1.17 | 0.90 | −2.32 | 0.35 | 0.48 | |
| 2 | top | −0.71 | −0.45 | 0.36 | −2.34 | 0.12+ | 0.34 |
| bottom | −0.15 | −1.21 | 0.80 | −3.05 | 0.51 | 0.66 | |
| both | −0.81 | −0.59 | 0.46 | −3.07 | 0.51 | 0.62 | |
| 3 | top | 1.07 | −1.82 | 0.91 | −2.44 | 0.33 | 0.42 |
| bottom | 0.38 | −1.53 | 0.91 | −2.03 | 0.18 | 0.17 | |
| both | 0.34 | −1.44 | 0.92 | −2.13 | 0.20 | 0.15 | |
| 4 | top | 0.91 | −1.59 | 0.87 | −2.00 | 0.25 | 0.58 |
| bottom | 0.79 | −0.93 | 0.62 | −1.70 | 0.10+ | 0.36 | |
| both | 1.22 | −1.55 | 0.77 | −2.07 | 0.26 | 0.48 | |
| 0.78 | 0.45 | ||||||
Baseline and endline identification.
Data analysis.
Since the inception of GRT (Ashby & Townsend, 1986), there have been a range of approaches to analyzing data with respect to inferences regarding PI, PS, and DS, with there being a range of strengths and weaknesses noted for each (e.g., Silbert & Thomas, 2013; Thomas, 2001, 2003; Thomas & Silbert, 2014). A critical point, however, is that the literature on the analysis of complete identification data is a vibrant and evolving one, with regular additions and evolutions (e.g., Silbert & Hawkins, 2016; Soto et al., 2015), including the extension of GRT to RTs (Townsend et al., 2012). The approach taken here is to take advantage of the strengths of a variety of approaches by combining them, with the goal of using the inferences drawn from each approach as sources of potentially-converging evidence (see, e.g., Cornes, Donnelly, Godwin, & Wenger, 2011; Von Der Heide et al., 2018, for a similar approach).
Figure 3 illustrates the manner in which our analyses proceeded. One path in this process was used for preliminary inferences regarding PS and DS, and a second was used for preliminary inferences regarding PI. The critical quantities for the first path were the following. The first is a test of marginal response invariance (MRI) for the response frequencies, and its corresponding test in RTs (timed MRI or tMRI; Townsend et al., 2012). MRI is defined in terms of marginal responses on each level of each dimension. MRI holds for a given level on a given dimension if the marginal probability of identifying that level on that dimension is the same across levels of the other dimension (Ashby & Townsend, 1986; Silbert & Hawkins, 2016). If MRI and tMRI do not hold, then this suggests a failure of PS, DS, or both. The second set of quantities are the marginal measures of sensitivity (d′) and criterion (c) as defined in signal detection theory (Green & Swets, 1966; Macmillan & Creelman, 2005). Equality of these marginal measures across the levels of the other dimension (e.g., marginal d′ for the top feature across the two levels of the bottom) is tested using 95% confidence intervals (Gourevitch & Galanter, 1967). Inequality of the marginal d′s suggests a possible failure of PS and inequality of the marginal cs suggests a possible failure of DS. The critical quantities on the second path is a test of report independence (RI, originally referred to as sampling independence; see Ashby & Townsend, 1986) and its corresponding test for RTs (timed RI or tRI, Townsend et al., 2012). RI holds for a given stimulus if the probability of correctly identifying that stimulus is equal to the product of the marginal probabilities of accurately identifying the level of each component (Ashby & Townsend, 1986; Silbert & Hawkins, 2016). If RI and tRI do not hold, it suggests a violation of PI for that stimulus. Here it should be noted that we applied the tests of tMRI and tRI to both the RT data and the start times of the LRPs.
Figure 3 .
Process for deriving inferences from the complete identification data. Boxes shaded in gray represent preliminary inferences that are tested/confirmed using hierarchical model fitting.
Prior to analysis, the RT data were censored at 200 and 2000 ms. Any LRP start times that were less than 100 ms before the observed RT were deleted prior to analysis. This eliminated less than 1% of the observations for each observer. Overall correlations between RTs and LRP start times for the four observers ranged from r = 0.78 to r = 0.89.
All of the preliminary inferences drawn to this point were then used to guide hierarchical model fitting (per Thomas, 2001). Specifically, the set of preliminary inferences regarding PI, PS, and DS suggested an hypothesized model. This hypothesized model was used to specify a hierarchy of possible models, starting with the simplest possible model (PI, PS, and DS all holding). The most complex model in any hierarchy was more complex than the hypothesized model, and was constructed by relaxing one assumption of the hypothesized model. In addition, in cases in which the hypothesized model contained either a violation of PI or a violation of DS, an alternative model was constructed using a violation of the alternative (Silbert & Thomas, 2013). In cases in which the hypothesized model contained both violations of PI and DS, alternative models in which only PI or only DS were violated were also fit. Finally, a completely unconstrained (all parameters free) was also fit to the data. The best model (based on a χ2 statistic calculated on the negative log likelihoods of each model, per Thomas, 2001) selected in this process was used for the final set of inferences.
Tests of MRI and tMRI.
Table 2 presents presents a summary of the inferences for all of the analyses of the data from experiment 1. In order to conserve space, we present the results only for the cases in which either MRI or tMRI failed. For the tests of MRI, the critical test statistic is a χ2 (Silbert & Hawkins, 2016) and for the tests of tMRI (using the RT and the LRP data), the critical test statistic is D, scaled by the number of observations in the cumulative distribution functions that are being compared (Townsend et al., 2012, p. 485). In Table 2, cells highlighted in gray indicate where the tests for MRI or tMRI fail. In all but one case, MRI and tMRI (for both RTs and LRPs) held for all observers at both contrast levels at BL (before perceptual practice). However, there were numerous failures of both MRI and tMRI at endline (after perceptual practice), occurring most frequently at the two super-threshold levels of contrast and far more frequently for the top rather than the bottom feature. This suggests that PS and/or DS may both have held prior to practice, but that either or both may have been violated after practice. In addition, the tests of tMRI on the RTs and LRPs showed a high level of consistency. Of the 40 possible inferences, the analyses of the RTs and LRPs agreed on 35 (88%).
Table 2.
Experiment 1: Tests of MRI (response frequencies) and tMRI (RTs and start times of the LRPs) for all four observers. Cells shaded in gray indicate cases in which MRI or tMRI failed. Note: BL = baseline, EL = endline.
| Tests of MRI | Tests of tMRI | |||||||
|---|---|---|---|---|---|---|---|---|
| Target | Level of other | D | ||||||
| Obs | Test | Contrast | Dimension | Dimension | x2 | Comparison | RT | LRP |
| 1 | EL | 10 | top | absent | 5.617 | Top, absent | 0.136 | 0.183 |
| present | 25.734 | Top, present | 0.136 | 0.163 | ||||
| bottom | absent | 1.539 | Bottom, absent | 0.053 | 0.245 | |||
| present | 0.375 | Bottom, present | 0.028 | 0.193 | ||||
| 50 | top | absent | 0.000 | Top, absent | 0.070 | | 0.206 | ||
| present | 33.689 | Top, present | 0.205 | 0.128 | ||||
| 2 | EL | threshold | top | absent | 2.347 | Top, absent | 0.079 | 0.072 |
| present | 21.425 | Top, present | 0.138 | 0.024 | ||||
| 10 | top | absent | 0.137 | Top, absent | 0.065 j | 0.229 | ||
| present | 45.405 | Top, present | 0.242 | 0.390 | ||||
| bottom | absent | 0.852 | Bottom, absent | 0.041^ | 0.187 | |||
| present | 0.833 | Bottom, present | 0.326 | 0.235 | ||||
| 50 | top | absent | 6.005 | Top, absent | 0.219 | 0.193 | ||
| present | 38.487 | Top, present | 0.362 | 0.259 | ||||
| bottom | absent | 0.791 | Bottom, absent | 0.091 | 0.284 | |||
| present | 0.059 | Bottom, present | 0.163 | 0.127 | ||||
| 3 | BL | 50 | top | absent | 0.084 | Top, absent | 0.120 | 0.128 |
| EL | threshold | top | absent | 3.607 | Top, absent | 0.162 | 0.165 | |
| present | 0.012 | Top, present | 0.135 | 0.201 | ||||
| 10 | top | absent | 0.118 | Top, absent | 0.260 | 0.226 | ||
| present | 3.350 | Top, present | 0.352 | 0.135 | ||||
| bottom | absent | 1.769 | Bottom, absent | 0.325 | 0.150 | |||
| present | 0.450 | Bottom, present | 0.380 | 0.143 | ||||
| 50 | top | present | 21.007 | Top, present | 0.366 | 0.270 | ||
| 4 | EL | bottom | absent | 2.286 | Bottom, absent | 0.111 | 0.204 | |
| present | 0.142 | Bottom, present | 0084^ | 0.164 | ||||
| 10 | top | absent | 1.141 | Top, absent | 0.050 | 0.029 | ||
| present | 3.334 | Top, present | 0.111 | 0.400 | ||||
| 50 | top | absent | 3.687 | Top, absent | 0.04l| | 0.177 | ||
| present | 23.525 | Top, present | 0.198 | 0.209 | ||||
Tests of marginal signal detection measures.
Figure 4 plots the differences in the marginal hit and false alarm rates, before and after practice, at each level of contrast. The values plotted are the difference in each measure for one of the features (top, bottom) across the two levels of the other feature (present, absent). The points labeled as ”Top” are the differences between the measure when the bottom feature was present minus the value of the measure when the bottom feature was absent. Figure 5 plots the differences in the marginal measures of sensitivity (d′) and criterion for the four observers at each level of contrast, before and after practice. Prior to practice, equality of both marginal measures held for all four observers. However, after practice, equality of the marginal d′s failed 10 times, with the majority of those failures occurring at the two supra-threshold levels of contrast for the top feature. In addition, after practice, equality of the marginal cs failed eight times, with at least one failure for every observer except observer 1. The majority of the failures of equality occurred for the top feature at the two supra-threshold levels of contrast. In all cases the failures of equality involved a more liberal criterion when the other feature was present relative to when it was absent. All of this suggests that perceptual practice resulted in possible failures of both PS and DS.
Figure 4 .
Experiment 1: Differences (Diff) in marginal hit rates (HR, panels a-d) and false alarm rates (FR, panels e-h) at baseline (BL) and endline (EL) at each level of contrast.
Figure 5 .
Experiment 1: Differences (Diff) in marginal sensitivity (d′, panels a-d) and criterion (c, panels e-h) at baseline (BL) and endline (EL) at each level of contrast.
Tests of RI and tRI.
Table 3 summarizes the tests of RI and tRI, listing only the cases in which RI or tRI failed. As can be seen, there were no failures of any of the tests of RI prior to practice and only a small number (five) of possible failures of RI or tRI after practice. For those five failures, four were observed at the two supra-threshold levels of contrast. In addition, in four of the five cases, the failures were observed for the stimulus in which both features were present. Finally, there was one observer (4) for whom there was no evidence of any failures of RI or tRI, at either stage of practice or at any level of contrast. The consistency among the three tests here was more limited than for the tests of MRI and tMRI. In four of the five cases, the tests on the response frequencies and RTs were in agreement with each other but in disagreement with the test on the LRP start times. The source of this inconsistency is not readily apparent. In sum, there is at best limited evidence for practice-induced violations of PI.
Table 3.
Experiment 1: Failures of tests of RI (response frequencies) or tRI (RTs and LRP start times). Cells shaded in gray indicate the test that suggested the failure.
| RI | tRI: D | ||||||
|---|---|---|---|---|---|---|---|
| Obs | Test | Contrast | Top | Bottom | χ2 | RT | LRP |
| 1 | Endline | 10 | Present | Present | 1.000 | 0.051 | 0.145 |
| 2 | Baseline | 50 | Present | Present | 2.500 | 0.016 | 0.289 |
| 3 | Baseline | 50 | Present | Present | 3.000 | 0.065 | 0.226 |
| Endline | threshold | Present | Absent | 0.300 | 0.016 | 0.240 | |
| 10 | Present | Present | 9.400 | 0.111 | 0.273 | ||
| counts | 0 | 1 | 5 | ||||
Hierarchical model fits.
Table 4 summarizes the results of the hierarchical model fitting. Here it can be seen that, prior to practice, there was a uniform preservation of PI, PS, and DS, for all observers, at both levels of contrast. Practice produced three sets of changes. First, none of the possible violations of PI suggested by the tests of RI and tRI were supported by the model fitting. Second, in nine of 12 cases, PS was violated, and in all cases the violations were for the top feature. This suggests that one of the effects of perceptual practice is to alter the representation of the top feature as a function of whether the bottom feature was present or absent. This can be interpreted as a violation of PS in nine of the 12 possible cases. Third, in six of 12 cases, the response criterion for the top feature was altered as a function of the state of the bottom feature. This can be interpreted as violations of DS in half of the 12 possible cases.
Table 4.
Experiment 1: Final inferences from the hierarchical model fitting to the data from each observer, at each time point and level of contrast.
| Presservation of | ||||||||
|---|---|---|---|---|---|---|---|---|
| Test | Obs | Contrast | PI | PS | DS | Parms | ln(L) | AIC |
| Baseline | 1 | 10 | T | T | T | 4 | −1453.76 | 2915.51 |
| 50 | T | T | T | 4 | −1210.74 | 2429.48 | ||
| 2 | 10 | T | T | T | 4 | −1508.57 | 2915.51 | |
| 50 | T | T | T | 4 | −1336.18 | 2680.35 | ||
| 3 | 10 | T | T | T | 4 | −1472.05 | 2952.11 | |
| 50 | T | T | T | 4 | −1226.02 | 2460.05 | ||
| 4 | 10 | T | T | T | 4 | −1541.55 | 3091.11 | |
| 50 | T | T | T | 4 | −1299.68 | 2607.36 | ||
| violations | 0 | 0 | 0 | |||||
| Endline | 1 | threshold | T | T | T | 4 | −1502.66 | 3013.31 |
| 10 | T | F: Top | T | 5 | −1069.95 | 2149.90 | ||
| 50 | T | F: Top | T | 5 | −965.45 | 1940.90 | ||
| 2 | threshold | T | F: Top | T | 5 | −1469.53 | 2949.05 | |
| 10 | T | F: Top | F: Top | 6 | −1289.52 | 2591.03 | ||
| 50 | T | F: Top | F: Top | 6 | −1047.88 | 2107.76 | ||
| 3 | threshold | T | T | T | 4 | −1245.90 | 2499.80 | |
| 10 | T | F: Top | F: Top | 6 | −969.70 | 1957.41 | ||
| 50 | T | F: Top | F: Top | 6 | −653.46 | 1320.92 | ||
| 4 | threshold | T | T | T | 4 | −1541.55 | 3091.11 | |
| 10 | T | F: Top | F: Top | 6 | −1205.35 | 2422.70 | ||
| 50 | T | F: Top | F: Top | 6 | −1205.33 | 2424.65 | ||
| violations | 0 | 9 | 6 | |||||
Discussion
Experiment 1 was intended to test the hypothesis that perceptual learning leads to the creation of perceptual objects, defined in terms of the GRT constructs of PI, PS, and DS. Specifically, we tested the prediction that perceptual learning for two arbitrary, contrast-defined features would produce a shift from separability and independence to violations of perceptual and/or decisional separability, and violations of perceptual independence. Performance (as measured by detection thresholds) reliably decreased for all observers, with this being accompanied by small but reliable increases in false alarm rates. Prior to practice, there was consistent evidence for PS, DS, and PI. In contrast, after practice, there was reasonably consistent evidence for violations of PS, accompanied in a number of cases by violations of DS, suggesting perceptual learning involves changes in both encoding and decision-making. There was, however, no strong evidence that perceptual learning produced violations of PI. Thus, the results can be interpreted in terms of creating perceptual representations in which the features are non-separable, rather than dependent.
Critically, these conclusions were reached on the basis of three sources of data: choice frequencies, choice RTs, and the onset time of the LRP. While frequencies and latencies have been used in conjunction in previous work with GRT (see Townsend et al., 2012), this is the first case to our knowledge that adds a neural variable in a theory-based way as a source of converging evidence. Although there were points of disagreement across these three variables, the consistency was generally high. This combination of evidence allowed us to efficiently identify a set of candidate models, and then use hierarchical model-fitting to adjudicate the final inferences.
Finally, we should note that this approach allowed us to identify individual differences as a function of learning (see also Fific, Nosofsky, & Townsend, 2008). This has long been a critical strength of the GRT approach, allowing for examination of individual differences in strategy and learning. In this case, the approach revealed that observers varied with respect to the prevalence of violations of both PS and DS, suggestive of variations in strategies across observers.
Experiment 2
Experiment 2 uses the theory and methods of SFT (Townsend & Nozawa, 1995; Townsend & Wenger, 2004a, 2004b) to test the hypothesis that perceptual learning results in a change in the processing of encoded representations such that they become perceptual objects. Specifically, we predict that prior to practice, observers will process the elements of a stimulus in ways that would not be associated with the processing of a set of features bound together as an object: in serial, exhaustively, independently, and with limited capacity. After practice, observers should process the elements quite differently, in a way that would be associated with a set of features bound together as an object: in parallel, exhaustively, non-independently, and with unlimited to super-capacity. We test this hypothesis using the same set of stimuli that were used in Experiment 1 with a new set of observers.
Method
Participants.
A total of five observers (3 females, 20–24 y) were recruited from the University of Oklahoma community and were financially compensated for their participation. All reported normal or corrected-to-normal vision and had unencumbered use of both hands. One of the participants discontinued participation midway through the experiment. Consequently, we report only the data from the four participants who contributed complete data sets. The protocol and procedures for this experiment were reviewed and approved by the Institutional Review Board of the University of Oklahoma (IRB approval number 3908).
Design.
Two tasks were used in Experiment 2. The first was the modified staircase procedure used in Experiment 1; this task was used for practice. The second task was a double-factorial paradigm redundant targets task, using an AND response rule (Townsend & Nozawa, 1995, see Figure 6). The double-factorial paradigm (DFP) consists of a set of two nested 2× 2 factorials. The outer factorial in this case was the presence or absence of either the top or bottom feature. The task was structured so that a positive (“yes”) response was given only when both features were perceived as present; the three other possibilities required a “no” response. Nested within this factorial was a second 2 × 2 factorial, relevant to the stimuli in which both features were present. Here, a manipulation designed to selectively influence the speed of processing was used (described below), such that all four possible combinations of fast (F) and slow (S) processing times were represented. Trial frequencies were set (Mordkoff, Yantis, & Egeth, 1990; Townsend & Nozawa, 1995) to equate the number of total positive and negative trials, as well as the number of single- and double-target trials.
Figure 6 .
Experiment 2: Schematic of the design for the AND version of the double-factorial paradigm. Regions marked with “yes” and “no” indicate the correct response to the designated stimulus. F = fast, S = slow.
Materials.
The test stimuli and masks used in the practice sessions of experiment 2 were identical to those used in the practice sessions of experiment 1. The test stimuli and masks used in the DFP testing in experiment 2 were presented at 30% and 60% contrast at BL and EL, with an additional set of stimuli at each observer’s final threshold at EL. The higher levels of contrast, relative to experiment 1, were selected during pilot work in order to reduce the number of errors, as the SFT analyses are done on correct RTs and LRP start times.
Procedure.
As in experiment 1, experiment 2 took place over 14 days. Day 1 was a baseline assessment of performance in the DFP at two supra-threshold levels of contrast, days 2–11 were practice using the modified staircase procedure,2 and days 12–14 were an endline assessment of performance in the DFP at three levels contrast (threshold plus the two levels used at BL). Each of the DFP sessions lasted approximately 90 min, including time for EEG setup and cleanup. Contrast levels in the DFP were blocked, and presentation of all the possible stimuli within were randomized by observer and session. The events on each trial were identical to those for the complete identification task in Experiment 1, with the exception that observers made either a positive or negative response, using the index finger of their dominant hand for the positive response and the index finger of the non-dominant hand for the negative response. Concurrent EEG was collected during the BL and EL DFP sessions, using the equipment and parameters used in Experiment 1. Finally, all details of the practice sessions in Experiment 2 were identical to those in Experiment 1.
Results
Practice.
Relative thresholds and FA rates as a function of practice are plotted in Figure 7 and the results of the analyses of the changes in relative threshold and FA rates are presented in Table 5. Reliable reductions in threshold were obtained for all stimuli for all four observers, though the reductions for observer 3 were much more modest than those of the other three observers. Reliable increases in FA rates were obtained for all stimuli for two of the observers (1 and 2). Observer 4 showed no change in FA rates and observer 5 showed no change with the top-only stimulus.
Figure 7 .
Relative thresholds and false alarm rates for each of the four observers (Obs) in Experiment 2.
Table 5.
Results of the regression analyses on relative thresholds and false alarm rates from the practice sessions for all four observers and all three target-present stimuli in Experiment 2. Note that all values were reliably different from 0 (p < .05) except for those marked with+.
| Threshold | False Alarms | ||||||
|---|---|---|---|---|---|---|---|
| Observer | Stimulus | Intercept | R2 | Intercept | R2 | ||
| 1 | top | 0.476 | −1.764 | 0.92 | −2.471 | 0.378 | 0.58 |
| bottom | 0.128 | −1.376 | 0.91 | −2.187 | 0.277 | 0.50 | |
| both | −0.467 | −1.184 | 0.86 | −2.187 | 0.280 | 0.52 | |
| 2 | top | 0.875 | −1.929 | 0.92 | −1.834 | 0.225 | 0.58 |
| bottom | 0.876 | −1.836 | 0.92 | −1.890 | 0.231 | 0.51 | |
| both | 0.143 | −1.405 | 0.97 | −3.088 | 0.825 | 0.61 | |
| 3 | top | −0.129 | −0.224 | 0.24 | −1.655 | −0.179+ | 0.12 |
| bottom | 0.003 | −0.231 | 0.28 | −1.827 | −0.081+ | 0.02 | |
| both | 0.834 | −0.269 | 0.18 | −1.764 | −0.221+ | 0.20 | |
| 4 | top | 0.247 | −1.327 | 0.58 | −1.893 | 0.042+ | 0.02 |
| bottom | 0.286 | −1.619 | 0.95 | −1.835 | 0.034+ | 0.02 | |
| both | −0.342 | −0.507 | 0.66 | −2.631 | 0.266 | 0.34 | |
| 0.70 | 0.34 | ||||||
Baseline and endline DFP performance.
Dependent measures.
The RTs and LRP start times for correct trials were analyzed with respect to inferences regarding processing architecture, stopping rule, capacity, and independence in the following way. With respect to architecture and stopping rule, the data from the four shaded cells in Figure 6 were analyzed using two interaction contrasts of the form
| (1) |
T represents either the mean RT (or LRP start time) or survivor function of the RT (or LRP start time) distribution, with the survivor function being the complement of the cumulative distribution function, S(t) = 1 − F(t). Townsend and Nozawa (1995) showed that when the interaction contrast at these two levels are used together, it is possible to uniquely identify the processing architecture and stopping rule (see Table 6). Reliability of the interaction contrasts at the level of the mean was assessed using a 2 × 2 analysis of variance, separately for each observer at each contrast level at BL and EL. Reliability of the interaction contrasts at the level of the survivor functions was analyzed using the methods described in Houpt and Townsend (2010).
Table 6.
Allowable inferences when the signs of the mean interaction contrast (MIC) and survivor function interaction contrast (SFIC) are used together.
| MIC | SFIC | Architecture | Stopping-rule |
|---|---|---|---|
| 0 | 0 | serial | self-terminating |
| 0 | <0→>0 | serial | exhaustive |
| >0 | >0 | parallel | self-terminating |
| <0 | <0 | parallel | exhaustive |
| >0 | <0→>0 | coactive |
With respect to capacity, we analyzed the RT and LRP start time data from both the single- and double-target trials using the capacity coefficient for an AND version of the DFP (Townsend & Wenger, 2004b):
| (2) |
where K = −ln[F(t)] (the reverse cumulative hazard function, Chechile, 2011), and the subscripts indicate the presence of the top (t), bottom (b) or both (tb) features. A value of 1 for this coeffcient indicates unlimited capacity processing, values < 1 indicate limited capacity processing, and values >1 indicate super-capacity processing. Reliability of any departures from 1 was assessed using the methods described in Houpt and Townsend (2012). Censoring and selection of the RT and LRP start time data were done as in Experiment 1, and the correlations between the LRP start times and the RTs ranged from from r = 0.56 to r = 0.69.
Mean RTs and LRP start times.
Results of the analyses of the mean RTs and mean LRP start times are presented in Tables 7 and 8, respectively. At both BL and EL, the main effects for each feature were reliable in all but one case, which was marginal. The orderings associated with each of these main effects were consistent with the speed of processing manipulation being effective, with this being true for both RTs and LRP start times. With respect to the mean RTs before practice, the interaction was reliable in only two cases, with both of these being being at the highest level of contrast. After practice, however, the interaction was reliable for all observers at both supra-threshold levels of contrast, and was not reliably different from 0 at threshold. With respect to the mean LRP start times before practice, only one interaction was reliably different from 0 (observer 1 at 60% contrast); however, after practice, the interaction was reliable for all observers at the two supra-threshold contrast levels and was not reliably different from 0 at threshold. In all cases, for both the RTs and the LRP start times, when the interaction was reliably different from 0, the sign of the MIC was negative. This suggests that (see Table 6), in most cases, practice produced a shift from some form of serial processing to some form of parallel processing, with these inferences being supported by both the behavioral and the EEG data.
Table 7.
Experiment 2: Analyses of the RTs at the level of the mean. Notes: all effects were df =1; MIC =mean interaction contrast (in ms). + = 0.05 ≤ p < 0.10, * = p < .05, † = p < 0.01, ‡ = p < 0.001.
| Threshold | 30% contrast | 60% contrast | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Obs | Test | Effect | F | MSE | MIC | F | MSE | MIC | F | MSE | MIC |
| 1 | BL | top (t) | 4.53* | 29751 | 24 | 16.61‡ | 20825 | −53 | |||
| bottom (b) | 4.25* | 7.71† | |||||||||
| t × b | 0.31 | 3.99* | |||||||||
| EL | t | 29.55‡ | 34284 | 7 | 80.74‡ | 4437 | −65 | 38.43‡ | 5716 | −53 | |
| b | 31.93‡ | 66.30‡ | 34.82‡ | ||||||||
| t × b | 0.03 | 21.55‡ | 10.71† | ||||||||
| 2 | BL | top (t) | 2.24* | 248981 | 10 | 10.94† | 73504 | −71 | |||
| bottom (b) | 9.01 | 3.62* | |||||||||
| t × b | 1.36 | 6.00* | |||||||||
| EL | t | 13.81‡ | 69154 | 29 | 19.71‡ | 83080 | −107 | 22.95‡ | 37725 | −51 | |
| b | 10.40† | 3.23+ | 16.85‡ | ||||||||
| t × b | 0.09 | 5.07* | 5.51* | ||||||||
| 3 | BL | top (t) | 3.64* | 535271 | −13 | 3.74* | 167296 | 23 | |||
| bottom (b) | 4.06* | 2.74+ | |||||||||
| t × b | 0.78 | 1.87 | |||||||||
| EL | t | 34.07‡ | 77527 | −19 | 52.46‡ | 15032 | −95 | 60.62‡ | 6599 | −84 | |
| b | 12.97‡ | 22.85‡ | 98.33‡ | ||||||||
| t × b | 0.10 | 13.80‡ | 23.81‡ | ||||||||
| 4 | BL | top (t) | 4.01* | 23212 | 7 | 8.31* | 11313 | 4 | |||
| bottom (b) | 6.48* | 11.64* | |||||||||
| t × b | 0.04 | 0.03 | |||||||||
| EL | t | 11.87‡ | 24014 | 24 | 63.42‡ | 9328 | −74 | 14.54‡ | 82574 | −125 | |
| b | 20.78‡ | 71.41‡ | 6.19* | ||||||||
| t × b | 1.12 | 13.14‡ | 4.21* | ||||||||
Table 8.
Experiment 2: Analyses of the LRP start times at the level of the mean. Notes: all effects were df =1; MIC = mean interaction contrast (in ms); + = 0.05 ≤ p < 0.10, * = p < .05, † = p < 0.01, ‡ = p < 0.001.
| Threshold | 30% contrast | 60% contrast | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Obs | Test | Effect | F | MSE | MIC | F | MSE | MIC | F | MSE | MIC |
| 1 | BL | top (t) | 80.45‡ | 6105 | 14 | 91.06‡ | 5118 | −58 | |||
| bottom (b) | 114.85‡ | 99.29‡ | |||||||||
| t × b | 1.37 | 14.64‡ | |||||||||
| EL | t | 36.55‡ | 5196 | 2 | 49.59‡ | 3346 | −35 | 88.46‡ | 3451 | −41 | |
| b | 27.28‡ | 25.20‡ | 88.57‡ | ||||||||
| t × b | 0.03 | 4.77* | 11.13‡ | ||||||||
| 2 | BL | top (t) | 193.88‡ | 2787 | 16 | 27.63‡ | 4521 | −37 | |||
| bottom (b) | 103.28‡ | 82.80‡ | |||||||||
| t × b | 1.84 | 2.87+ | |||||||||
| EL | t | 3.75* | 13835 | 27 | 15.18‡ | 17961 | −49 | 5.73* | 18331 | −64 | |
| b | 2.92+ | 11.50‡ | 11.76† | ||||||||
| t × b | 0.80 | 5.20* | 4.34‡ | ||||||||
| 3 | BL | top (t) | 40.40‡ | 13355 | 29 | 27.82‡ | 10444 | 26 | |||
| bottom (b) | 35.22‡ | 53.64‡ | |||||||||
| t × b | 1.63 | 1.56 | |||||||||
| EL | t | 4.22* | 7158 | −2 | 172.05‡ | 3342 | −31 | 123.63‡ | 4626 | −43 | |
| b | 5.19* | 68.63‡ | 142.45‡ | ||||||||
| t × b | 0.05 | 5.23* | 14.44‡ | ||||||||
| 4 | BL | top (t) | 11.37‡ | 9579 | 11 | 24.91‡ | 6406 | 6 | |||
| bottom (b) | 16.97‡ | 32.10‡ | |||||||||
| t × b | 0.30 | 0.28 | |||||||||
| EL | t | 33.84‡ | 4508 | 15 | 98.23‡ | 5557 | −40 | 105.56‡ | 4042 | −41 | |
| b | 54.09‡ | 40.76‡ | 101.82‡ | ||||||||
| t × b | 1.54 | 8.98† | 13.54‡ | ||||||||
Distributions of RTs and LRP start times.
We consider first the data from the double-target trials, as these will guide our inferences regarding architecture and stopping rule. A first question has to do with the orderings on the distributions, and Table 9 presents the results of the pairwise Kolmogorov-Smirnov tests on the orderings. Here it can be seen that, for all pair-wise comparisons except those involving the fast-slow and slow-fast trials, the majority of orderings were reliable, with those that did not reach criterion for reliability being marginally-reliable (0.05 ≤ p < 0.10). These outcomes reinforce the conclusion that the speed manipulation had its intended effect on both RTs and LRP start times, and allows the survivor function interaction contrasts (SFIC) results for both dependent variables to be interpretable. This is the first time, to our knowledge, that this regularity has been assessed for a neurophysiological rather than a behavioral variable, and furthers the extent to which predictions from SFT can be applied to the timing of a neural event.
Table 9.
Experiment 2: KS test statistics testing the orderings of the distributions of RTs and LRP start times in the double-target trials, as a function of practice and contrast. Note: FF = fast-fast, FS = fast-slow, SF = slow-fast, SS = slow-slow (see Figure 6); + = 0.05 ≤ p < 0.10, * = p < .05, † = p < 0.01, ‡ = p < 0.001.
| Before Practice | After Practice | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 30 | 60 | threshold | 30 | 60 | |||||||
| Obs | Comp | RT | LRP | RT | LRP | RT | LRP | RT | LRP | RT | LRP |
| 1 | FF vs FS | 2.79‡ | 4.94‡ | 1.69* | 3.96‡ | 3.32‡ | 3.02‡ | 1.71† | 2.56‡ | 1.56* | 3.08‡ |
| FF vs SF | 2.65‡ | 4.42‡ | 1.86† | 3.17‡ | 3.70‡ | 3.55‡ | 1.49* | 3.41‡ | 1.49* | 3.16‡ | |
| FF vs SS | 3.60‡ | 5.67‡ | 2.56‡ | 6.10‡ | 4.92‡ | 4.96‡ | 4.84‡ | 4.87‡ | 4.55‡ | 5.57‡ | |
| FS vs SF | 0.69 | 1.17 | 0.53 | 1.58† | 0.89 | 1.19 | 0.52 | 1.54* | 0.45 | 1.61‡ | |
| FS vs SS | 2.06‡ | 3.85‡ | 1.71† | 4.57‡ | 2.91‡ | 4.42‡ | 3.95‡ | 4.55* | 3.88‡ | 5.24‡ | |
| SF vs SS | 2.32‡ | 4.22‡ | 1.89† | 5.13‡ | 2.74‡ | 4.09‡ | 4.02‡ | 4.39‡ | 4.17‡ | 4.81‡ | |
| 2 | FF vs FS | 2.83‡ | 4.76‡ | 1.73† | 3.59‡ | 1.92† | 2.19‡ | 1.58* | 1.31+ | 2.13‡ | 1.94† |
| FF vs SF | 2.38‡ | 5.62‡ | 1.73† | 2.43‡ | 2.05‡ | 2.45‡ | 1.49* | 1.54* | 2.71‡ | 1.57* | |
| FF vs SS | 3.80‡ | 5.67‡ | 3.10‡ | 5.18‡ | 2.57‡ | 2.49‡ | 4.33‡ | 2.43‡ | 5.45‡ | 2.28‡ | |
| FS vs SF | 0.61 | 1.25+ | 0.55 | 1.73† | 0.77 | 0.52 | 0.82 | 0.20 | 0.98 | 0.59 | |
| FS vs SS | 1.87† | 3.64‡ | 2.35‡ | 3.78‡ | 1.44* | 2.18‡ | 3.40‡ | 1.94† | 4.78‡ | 1.90† | |
| SF vs SS | 1.99‡ | 3.42‡ | 2.47‡ | 4.51‡ | 1.79† | 2.19‡ | 3.36‡ | 1.94† | 4.73‡ | 1.77† | |
| 3 | FF vs FS | 2.01‡ | 2.908‡ | 2.65‡ | 3.320‡ | 3.38‡ | 2.02† | 1.89† | 3.25‡ | 1.57* | 4.12‡ |
| FF vs SF | 2.92‡ | 3.441‡ | 2.74‡ | 3.104‡ | 3.19‡ | 2.43‡ | 1.94† | 4.66‡ | 1.42* | 4.30‡ | |
| FF vs SS | 3.72‡ | 5.415‡ | 3.80‡ | 4.852* | 4.98‡ | 3.69‡ | 5.07‡ | 7.24‡ | 5.14‡ | 7.24‡ | |
| FS vs SF | 1.10 | 0.858 | 0.82 | 0.608 | 1.03 | 1.09 | 0.55 | 0.09 | 0.82 | 0.553 | |
| FS vs SS | 2.25‡ | 3.040‡ | 1.86† | 2.387‡ | 2.81‡ | 2.41‡ | 3.85‡ | 5.22‡ | 4.10‡ | 5.512‡ | |
| SF vs SS | 2.41‡ | 2.829‡ | 1.67† | 2.433‡ | 2.09‡ | 1.73* | 3.73‡ | 5.11‡ | 4.70‡ | 5.678‡ | |
| 4 | FF vs FS | 1.552* | 1.948† | 1.688* | 2.34‡ | 2.012† | 3.958‡ | 2.162‡ | 1.46+ | 2.053† | 2.45‡ |
| FF vs SF | 1.643† | 2.476‡ | 1.713* | 2.58‡ | 1.714+ | 3.423‡ | 1.938† | 2.35‡ | 1.789* | 2.42‡ | |
| FF vs SS | 2.921‡ | 3.116‡ | 2.282‡ | 4.57‡ | 3.429‡ | 4.980‡ | 5.116‡ | 5.15‡ | 5.278‡ | 6.28‡ | |
| FS vs SF | 0.548 | 1.033+ | 0.728 | 0.568 | 0.820 | 0.907 | 0.522 | 1.35+ | 0.838 | 0.84 | |
| FS vs SS | 1.643* | 1.832* | 1.523+ | 2.80‡ | 1.566* | 3.051‡ | 3.702‡ | 4.51‡ | 3.605‡ | 5.11‡ | |
| SF vs SS | 2.008† | 1.815* | 1.596+ | 2.82* | 2.087† | 2.821‡ | 3.768‡ | 4.68‡ | 4.326‡ | 5.64‡ | |
Figure 8 plots the SFICs for all four observers, at BL and EL, at each level of contrast, for both the RTs and LRP start times. Table 10 presents the test statistics for the positive (D+) and negative (D−) deflections of the SFICs for both data sources. The overall pattern across observers was a transition from serial exhaustive to parallel exhaustive processing, particularly at the two supra-threshold levels of contrast, and this pattern was present for both the RTs and the LRP start times.
Figure 8 .
Experiment 2: SFICs for the RTs (column a) and LRP start times (column b).
Table 10.
Experiment 2: Tests of the positive and negative deflections of the SFICs, as calculated from the RTs and the start times of the LRPs. Note: + = 0.05 ≤ p < 0.01,+ = p < 0.05
| Baseline | Endline | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| RT | LRP | RT | LRP | ||||||
| Obs | Contrast | D+ | D− | D+ | D− | D+ | D− | D+ | D− |
| 1 | threshold | 0.36* | −0.37* | 0.53* | −0.58* | ||||
| 30 | 0.33* | −0.32* | .52* | −0.51* | 0.01 | −0.47* | 0.25 | −0.58* | |
| 60 | 0.00 | 0.31* | 0.16 | −0.66* | 0.03 | −0.49* | 0.03 | −0.49* | |
| 2 | threshold | 0.38* | −0.39* | 0.65* | −0.50* | ||||
| 30 | 0.29+ | −0.31* | 0.47* | −0.50* | 0.04 | −0.29* | 0.37 | −0.67* | |
| 60 | 0.05 | −0.39* | 0.27 | −0.62* | 0.15 | −0.49* | 0.32 | −0.56* | |
| 3 | threshold | 0.32+ | −0.35* | 0.05 | −0.13 | ||||
| 30 | 0.33+ | −0.38* | 0.33* | −0.35+ | 0.01 | −0.43* | 0.12 | −0.53* | |
| 60 | 0.38* | −0.32+ | 0.33* | −0.38* | 0.06 | −0.60* | 0.13 | −0.57* | |
| 4 | threshold | 0.11 | −0.14 | 0.20 | −0.23+ | ||||
| 30 | 0.28+ | −0.33* | 0.20 | −0.27+ | 0.05 | −0.43* | 0.12 | −0.51* | |
| 60 | 0.13 | −0.36* | 0.12 | −0.28+ | 0.06 | −0.41* | 0.04 | −0.61* | |
Figure 9 presents the capacity coeffcients3 for each of the observers at each contrast level, at baseline and at endline, and Table 11 presents the test statistics (from Houpt & Townsend, 2012) assessing the extent to which those coefficients deviate from 1. The coefficients and tests are presented for both the RTs and the start times of the LRPs. With a few exceptions, with these being at the highest contrast, BL capacity was limited. At endline, capacity was generally limited at threshold and super-capacity at both supra-threshold levels of contrast. Finally, in all but one case (observer 1, BL 60% contrast), the inferences drawn from the RTs and the inferences drawn from the start times of the LRPs were in agreement.
Figure 9 .
Experiment 2: C(t)s for the RTs (column a) and LRP start times (column b).
Table 11.
Experiment 2: Tests of the deviations of the capacity coefficients from 1, as calculated from the RTs and the start times of the LRPs. Note: * = p < .05, † = p < 0.01, ‡ = p < 0.001.
| Baseline Contrast | Endline Contrast | |||||
|---|---|---|---|---|---|---|
| Obs | Data | 30 | 60 | threshold | 30 | 60 |
| 1 | RT | −5.95‡ | −1.29* | −2.96‡ | 5.39‡ | 2.53‡ |
| LRP start times | −1.77* | 0.62 | −2.41† | 5.05‡ | 5.56‡ | |
| 2 | RT | −22.32‡ | −22.92‡ | −2.71* | 8.09‡ | 8.57‡ |
| LRP start times | −1.98* | 3.41† | −1.66* | 6.81‡ | 4.26‡ | |
| 3 | RT | −1.65* | −1.64* | −1.86* | 3.72‡ | 3.96‡ |
| LRP start times | −12.24‡ | −10.51‡ | −13.94‡ | 2.62† | 2.04* | |
| 4 | RT | −1.80* | −1.80* | −2.87† | 11.89‡ | 13.88‡ |
| LRP start times | −2.65† | −1.98* | −1.73* | 3.69‡ | 3.96‡ | |
Discussion
Experiment 2 was designed to test the hypothesis that perceptual learning leads to the creation of perceptual objects, in terms of the basic characteristics of information processing addressed by systems factorial theory. Specifically, this experiment tested the prediction that practice should produce a transition from serial, exhaustive, limited-capacity processing to parallel, exhaustive, super-capacity processing, indicative of positively-dependent channels. As in experiment 1, perceptual practice produced reliable decreases in thresholds, with mixed results with respect to increases in false alarm rates. At BL, the data suggested in most cases serial, exhaustive, limited capacity processing, with the exceptions observed at the highest level of contrast. In comparison, at EL, at both supra-threshold levels of contrast, processing was parallel, exhaustive, and super-capacity. All of this is consistent with the proposal that perceptual practice leads to the creation of perceptual objects, as defined in SFT.
As in experiment 1, the inferences drawn from the RTs and the start times of the LRPs were consistent with one another. This, to our knowledge, is the first demonstration of the use of behavioral and neural data as converging sources of evidence to test predictions drawn from SFT. Finally, the variations across observers in terms of inferences demonstrate the ability of this approach to quantify individual differences at both behavioral and neural levels.
General Discussion
In work that both set an agenda and coalesced a set of theoretical perspectives on skill (e.g., Fitts & Posner, 1967), Simon and Chase (1973; 1973) advanced the notion that the learning of perceptual “chunks” was a necessary first step in the acquisition of expertise. They held that this learning of “chunks” allowed the capacity limitations of a short-duration memory to be avoided and allowed for the transition from the slow use of declarative memory to the fast use of procedural memory. Indeed, the notion of “chunk” has remained conceptually influential, with a number of more recent theories having a role for perceptual and conceptual “chunks” (e.g., Czerwinski et al., 1992; Goldstone, 1998, 2000; Kellman & Garrigan, 2009; Kellman & Massey, 2013; W. Li et al., 2004; R. W. Li et al., 2004; Schyns et al., 1998).
A critical problem, however, is that a “chunk” has at best been given operational definition. The two experiments reported here took a very different approach. Specifically, we used two formal meta-theories—general recognition theory and systems factorial theory—to apply rigorous definitions of the characteristics of a “chunk”—a perceptual object—in terms of aspects of representation and processing. In terms of representation, we defined a perceptual object as one that produced violations of independence and separability. In terms of processing, we defined a perceptual object as one processed exhaustively, in a parallel dependent manner, that exhibits super-capacity. Note that these are not novel uses of the constructs of GRT and SFT, as they have been used in previous theorizing to characterize facial holism (O’Toole et al., 2001) and the general notion of a gestalt (Townsend & Wenger, 2015). One contribution of the present effort is to bring them together to bear on a foundational question in perceptual learning and expertise.
In experiment 1, we documented that two arbitrary contrast-defined features were represented as independent and separable before perceptual learning. Perceptual practice produced a shift to violations of independence and separability, both perceptually and decisionally, consistent with the learning of a perceptual object. In experiment 2, we showed that these same two features were generally processed in a serial and exhaustive manner, with limited capacity, prior to perceptual learning. Perceptual practice produced a shift to parallel, dependent, super-capacity processing, also consistent with the learning of a perceptual object. Our use of GRT and SFT allowed us to illustrate that these changes were not completely uniform, with there being important differences across individuals (Smith & Little, 2018). This ability to quantify individual differences has been a consistent strength of these two meta-theories (e.g., Fific et al., 2008; Fifić, Townsend, & Eidels, 2008; Fific & Townsend, 2010).
In addition to testing our hypotheses with behavioral data—response frequencies and latencies—we were able to make use of neurophysiological data as a source of converging evidence. Specifically, we analyzed the onset times of the lateralized readiness potentials in terms of the temporal measures associated with both GRT and SFT and found that the inferences drawn on the basis of the patterns in this feature of the EEG were generally consistent with those drawn from on the basis of the patterns in the RT. The extent to which theories such as GRT and SFT can make principled predictions for measures of neural activity has been of interest at least since the work of Schweickert (Schweickert, 1989; Schweickert & Mounts, 1998). Schweickert’s results were mixed, while ours were reasonably consistent. We would suggest that the difference lies in the level of analysis: while Schweickert focused on the amplitude of a single ERP feature derived at individual electrodes, we relied on a specific property of EEG activity that signals the programming of a motor output on the basis of completion of cognitive processing. GRT and SFT are intentionally general, and describe regularities that theoretically apply to any information processing system, not just one that can be analyzed in terms of response frequencies and latencies. We believe our results suggest that questions such as those raised by Schweickert (1989; 1998), along with the regularities that can be derived as predictions from GRT and SFT, can be addressed with neural data at the appropriate level of analysis (see, e.g., Q. Zhang, Walsh, & Anderson, 2018, for another compelling example). Further, we believe that our results suggest that, at an appropriate level of analysis, neural data can provide a theoretically-motivated source of converging evidence, strengthening the connection between theory and data.
There are, to be sure, limitations to the present effort. Primary among these is the use of the 2× 2 design used in experiment 1, which has been shown to be subject to identifiability problems with respect to violations of perceptual independence and decisional separability (Silbert & Thomas, 2013; Thomas & Silbert, 2014). Future work should use an expanded design and take advantage of newer statistical developments (Soto et al., 2015). A second limitation is that the tests of representation and processing, although using identical stimuli, were performed across rather than within subjects. Future work should test the extent to which the regularities observed across the two experiments reported here hold within individual observers.
This latter point also raises an important theoretical opportunity. Specifically, the regularities predicted using GRT and SFT should be relatable within a single, coherent meta-theory. Work with linear systems theory (e.g., Wenger & Townsend, 2006; Townsend et al., 2012) suggests one avenue for exploration. Conceptually, there is a necessity for relating the notion of learning a perceptual object within a single theoretical perspective.
In summary, in two experiments we demonstrated that perceptual learning for two arbitrary contrast-defined features produces a shift from independence, separability, seriality, and limitations in capacity to violations of dependence and separability, exhaustive parallelism, and super-capacity, and that evidence for these learning-dependent shifts can be observed in both behavioral and neurophysiological data. The evidence is strong and consistent: perceptual learning does indeed induce “chunking”—the production of perceptual objects.
Footnotes
It has been noted that this basic version of the complete identification task is subject to identifiability problems (e.g., Silbert & Thomas, 2013), and that an expanded design along with novel statistical approaches (Soto, Vucovich, Musgrave, & Ashby, 2015) has the ability to solve those identifiability problems. We used the limited design because data collection had begun prior to the publication of the last of these papers. Consequently, we here rely on the logic of converging evidence to support our inferences, noting where necessary where there may limitations to the strength of our inferences.
Due to schedule constraints, observer 2 completed only 15 blocks of practice.
Note that we did not use the more-recent version of the capacity coefficient (Townsend & Altieri, 2012), one that can be conditionalized on accuracy, based on the fact that performance was near ceiling in all cases.
References
- Ahissar M, & Hochstein S (1997). Task difficulty and the specificity of perceptual learning. Nature, 387, 401–406. [DOI] [PubMed] [Google Scholar]
- Ahissar M, & Hochstein S (2004). The reverse hierarchy theory of visual perceptual learning. Trends in Cognitive Sciences, 8, 457–464. [DOI] [PubMed] [Google Scholar]
- Ahissar M, Laiwand L, Kozminsky G, & Hochstein S (1998). Learning pop-out detection: building representions for conflicting target-distractor relationships. Vision Research, 38, 3095–3107. [DOI] [PubMed] [Google Scholar]
- Ashby FG, Ennis JM, & Spiering BJ (2007). A neurobiological theory of automaticity in perceptual categorization. Psychological Review, 114 (3), 632–656. [DOI] [PubMed] [Google Scholar]
- Ashby FG, & Townsend JT (1986). Varieties of perceptual independence. Psychological Review, 93, 154–179. [PubMed] [Google Scholar]
- Bridgman PW (1945). Some general principles of operational analysis. Psychological Review, 52 (5), 246. [Google Scholar]
- Carroll L (1865). Alice in wonderland. London: Macmillan. [Google Scholar]
- Chase WG, & Ericsson KA (1981). Skilled memory In Anderson JR (Ed.), Cognitive skills and their acquisition (p. 141–190). Hillsdale, NJ: Erlbaum. [Google Scholar]
- Chase WG, & Simon HA (1973). Perception in chess. Cognitive Psychology, 4 (1), 55–81. [Google Scholar]
- Chechile RA (2011). Properties of reverse hazard functions. Journal of Mathematical Psychology, 55 (3), 203–222. [Google Scholar]
- Coles MG (1989). Modern mind-brain reading: psychophysiology, physiology, and cognition. Psychophysiology, 26 (3), 251–269. [DOI] [PubMed] [Google Scholar]
- Cornes K, Donnelly N, Godwin H, & Wenger MJ (2011). Perceptual and decisional factors affecting the detection of the Thatcher illusion. Journal of Experimental Psychology: Human Perception and Performance, 37, 645–668. [DOI] [PubMed] [Google Scholar]
- Czerwinski M, Lightfoot N, & Shiffrin RM (1992). Automatization and training in visual search. American Journal of Psychology, 105, 271–315. [PubMed] [Google Scholar]
- Delorme A, & Mackeig S (2004). eeglab: an open source toolbox for analysis of single-trial eeg dynamics including independent component analysis. Journal of Neuroscience Methods,134, 9–21. [DOI] [PubMed] [Google Scholar]
- Dosher BA, Jeter P, Liu J, & Lu Z-L (2013). An integrated reweighting theory of perceptual learning. Proceedings of the National Academy of Sciences, 110 (33), 13678–13683. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dosher BA, & Lu Z-L (2017). Visual perceptual learning and models. Annual Review of Vision Science, 3, 343–363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drever J (1960). Perceptual learning. Annual Review of Psychology, 11 (1), 131–160. [Google Scholar]
- Feltovich PJ, Prietula MJ, & Ericsson KA (2006). Studies of expertise from psychological perspectives In Ericsson KA, Charness N, Feltovich PJ, & Hoffman RR (Eds.), The cambridge handbook of expertise and expert performance (pp. 41–67). Cambridge University Press. [Google Scholar]
- Fific M, Nosofsky RM, & Townsend JT (2008). Information-processing architectures in multidimensional classification: A validation test of the systems factorial technology. Journal of Experimental Psychology: Human Perception and Performance, 34 (2), 356–375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fific M, & Townsend JT (2010). Information-processing alternatives to holistic perception:Identifying the mechanisms of secondary-level holism within a categorization paradigm. Journal of Experimental Psychology: Learning, Memory, and Cognition, 36 (5), 1290–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fifić M, Townsend JT, & Eidels A (2008). Studying visual search using systems factorial methodology with target-distractor similarity as the factor. Perception & psychophysics, 70 (4), 583–603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fitts PM, & Posner MI (1967). Human performance. Oxford, England: Brooks/Cole. [Google Scholar]
- Gibson EJ (1963). Perceptual learning. Annual Review of Psychology, 14 (1), 29–56. [DOI] [PubMed] [Google Scholar]
- Gibson EJ, & Walk RD (1956). The eėct of prolonged exposure to visual patterns on learning to discriminate similar and different patterns. Journal of Comparitive and Psysiological Psychology, 51, 584–587. [DOI] [PubMed] [Google Scholar]
- Goldstone RL (1998). Perceptual learning. Annual review of psychology, 49, 585–612. [DOI] [PubMed] [Google Scholar]
- Goldstone RL (2000). Unitization during category learning. Journal of Experimental Psychology: Human Perception and Performance, 26, 86–112. [DOI] [PubMed] [Google Scholar]
- Gourevitch V, & Galanter E (1967). A significance test for one parameter isosensitivity functions. Psychometrika, 32, 25–33. [DOI] [PubMed] [Google Scholar]
- Green DM, & Swets JA (1966). Signal detection theory and psychophysics. New York: Wiley. [Google Scholar]
- Hackley SA, & Miller J (1995). Response complexity and precue interval effects on the lateralized readiness potential. Psychophysiology, 32 (3), 230–241. [DOI] [PubMed] [Google Scholar]
- Houpt JW, & Townsend JT (2010). The statistical properties of the survivor interaction contrast. Journal of Mathematical Psychology, 54 (5), 446–453. [Google Scholar]
- Houpt JW, & Townsend JT (2012). Statistical measures for workload capacity analysis. Journal of Mathematical Psychology, 56 (5), 341–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang C-B, Lu Z-L, & Dosher BA (2012). Co-learning analysis of two perceptual learning tasks with identical input stimuli supports the reweighting hypothesis. Vision Research, 61, 25–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ingvalson EM, & Wenger MJ (2005). A strong test of the dual mode hypothesis. Perception & Psychophysics, 67, 14–35. [DOI] [PubMed] [Google Scholar]
- Kadlec H, & Townsend JT (1992). Signal detection analysis of dimensional interactions In Ashby FG (Ed.), Multidimensional models of perception and cognition (p. 181–228). Hillsdale, NJ: Erlbaum. [Google Scholar]
- Kellman PJ, & Garrigan P (2009). Perceptual learning and human expertise. Physics of Life Reviews, 6 (2), 53–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kellman PJ, & Massey CM (2013). Perceptual learning, cognition, and expertise In Psychology of learning and motivation (Vol. 58, pp. 117–165). Elsevier. [Google Scholar]
- Kuefner D, Jacques C, Prieto E, & Rossion B (2010). Electrophysiological correlates of the composite face illusion: Disentangling perceptual and decisional components of holistic face processing in the human brain. Brain and Cognition, 74, 225–238. [DOI] [PubMed] [Google Scholar]
- Li RW, Levi DM, & Klein SA (2004). Perceptual learning improves efficiency by re-tuning the decision ‘template’ for position discrimination. Nature Neuroscience, 7, 178–183. [DOI] [PubMed] [Google Scholar]
- Li W, Piech V, & Gilbert CD (2004). Perceptual learning and top-down influences in primary visual cortex. Nature Neuroscience, 7, 651–657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Little D, Altieri N, Yang C, & Fific M (2017). Systems factorial technology: A theory drivenmethodology for the identification of perceptual and cognitive mechanisms. Elsevier Science & Technology Books. Retrieved from https://books.google.com/books?id=OQICMQAACAAJ [Google Scholar]
- Logan GD (1988). Toward an instance theory of automatization. Psychological Review, 95,492–527. [Google Scholar]
- Lu Z-L, Lin Z, & Dosher BA (2016). Translating perceptual learning from the laboratory to applications. Trends in Cognitive Sciences, 20 (8), 561–563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Macmillan NA, & Creelman CD (2005). Detection theory: A user’s guide (second edition). Mahwah, NJ: Erlbaum. [Google Scholar]
- Miller J, & Hackley SA (1992). Electrophysiological evidence for temporal overlap among contingent mental processes. Journal of Experimental Psychology: General, 121 (2), 195. [DOI] [PubMed] [Google Scholar]
- Moldakarimov S, Bazhenov M, & Sejnowski TJ (2014). Top-down inputs enhance orientation selectivity in neurons of the primary visual cortex during perceptual learning. PLoS Computational Biology, 10 (8), e1003770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mordkoff JT, & Grosjean M (2001). The lateralized readiness potential and response kinetics in response-time tasks. Psychophysiology, 38 (5), 777–786. [PubMed] [Google Scholar]
- Mordkoff JT, Yantis S, & Egeth HE (1990). Detecting conjunctions of color and form in parallel. Perception & Psychophysics, 48, 157–168. [DOI] [PubMed] [Google Scholar]
- O’Toole AJ, Wenger MJ, & Townsend JT (2001). Quantitative models of perceiving and remembering faces: Precedents and possibilities In Wenger MJ & Townsend JT (Eds.), Computational, geometric, and process perspectives on facial cognition: Contexts and challenges (p. 1–38). Mahwah NJ: Erlbaum. [Google Scholar]
- Petrov A, Dosher B, & Lu Z-L (2005). The dynamics of perceptual learning: An incremental reweighting model. Psychological Review, 112, 715–743. [DOI] [PubMed] [Google Scholar]
- Polat U (2016). Advanced perceptual learning techniques induce neuroplasticity to enable improved visual functions. Current Ophthalmology Reports, 4 (1), 1–7. [Google Scholar]
- Pourtois G, Grandjean D, & Sander PD, and Vuilleumier . (2004). Electrophysiological correlates of rapid spatial orienting towards fearful faces. Cerebral Cortex, 14 (6), 619–633. [DOI] [PubMed] [Google Scholar]
- Rauss K, Schwartz S, & Pourtois G (2011). Top-down eėcts on early visual processing in humans: A Predictive coding framework. Neuroscience & Biobehavioral Reviews, 35 (5),1237–1253. [DOI] [PubMed] [Google Scholar]
- Ray W, Slobounov S, Mordkoff J., Johnston J, & Simon R (2000). Rate of force development and the lateralized readiness potential. Psychophysiology, 37 (6), 757–765. [PubMed] [Google Scholar]
- Ruff CC, & Driver J (2006). Attentional preparation for a lateralized visual distractor:Behavioral and fMRI evidence. Journal of Cognitive Neuroscience, 18 (4), 522–538. [DOI] [PubMed] [Google Scholar]
- Schneider W, & Shiffrin RM (1977). Controlled and automatic human information processing: Detection, search, and attention. Psychological Review, 84, 1–66. [Google Scholar]
- Schoups AA, & Orban GA (1996). Interocular transfer in perceptual learning of a pop-out discrimination task. Proceedings of the National Academy of Sciences, 93 (14), 7358–7362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schweickert R (1989). Separable effects of factors on activation functions in discrete and continuous models: d’ and evoked potentials. Psychological Bulletin, 106, 318–328. [DOI] [PubMed] [Google Scholar]
- Schweickert R, & Mounts J (1998). Additive effects of factors on reaction time and evoked potentials in continuous-flow models In Dowling CE & Roberts FS (Eds.), Recent progress in mathematical psychology: Psychophysics, knowledge, representation, cognition, and measurement (p. 311–327). Mahwah, NJ: Erlbaum. [Google Scholar]
- Schyns PG, Goldstone RL, & Thibaut J-P (1998). The development of features in object concepts. Behavioral and Brain Sciences, 21 (1), 1–17. [DOI] [PubMed] [Google Scholar]
- Seitz AR (2017). Perceptual learning. Current Biology, 27 (13), R631–R636. [DOI] [PubMed] [Google Scholar]
- Shiffrin RM, & Schneider W (1977). Controlled and automatic human information processing: Ii. perceptual learning, automatic attending and a general theory. Psychological review, 84 (2), 127. [Google Scholar]
- Silbert NH, & Hawkins RXD (2016). A tutorial on general recognition theory. Journal of Mathematical Psychology, 73, 94–109. [Google Scholar]
- Silbert NH, & Thomas RD (2013). Decisional separability, model identification, and statistical inference in the general recognition theory framework. Psychonomic Bulletin & Review, 1–20. [DOI] [PubMed] [Google Scholar]
- Simon HA, & Chase WG (1973). Skill in chess. American Scientist, 61 (4), 394–403. [Google Scholar]
- Smith PL, & Little DR (2018). Small is beautiful: In defense of the small-N design. Psychonomic Bulletin & Review, 1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soto FA, Vucovich L, Musgrave R, & Ashby FG (2015). General recognition theory with individual differences: a new method for examining perceptual and decisional interactions with an application to face perception. Psychonomic bulletin & review, 22 (1), 88–111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tenison C, & Anderson JR (2016). Modeling the distinct phases of skill acquisition. Journal of Experimental Psychology: Learning, Memory, and Cognition, 42 (5), 749. [DOI] [PubMed] [Google Scholar]
- Thomas RD (2001). Characterizing perceptual interactions in face identification using multidimensional signal detection theory In Wenger MJ & Townsend JT (Eds.), Computational, geometric, and process perspectives on facial cognition: Contexts and challenges (p. 193–228). Mahwah, NJ: Erlbaum. [Google Scholar]
- Thomas RD (2003). Further considerations of a general d′ in multidimensional space. Journal of Mathematical Psychology, 47 (2), 220–224. [Google Scholar]
- Thomas RD, & Silbert NH (2014). Technical clarification to silbert and thomas (2013):“decisional separability, model identification, and statistical inference in the general recognition theory frameworkâĂİ”. Psychonomic Bulletin & Review, 21 (2), 574–575. [DOI] [PubMed] [Google Scholar]
- Townsend JT, & Altieri N (2012). An accuracy-response time capacity assessment function that measures performance against standard parallel predictions. Psychological Review, 119 (3), 500–516. [DOI] [PubMed] [Google Scholar]
- Townsend JT, & Eidels A (2011). Workload capacity spaces: A unified methodology for response time measures of efficiency as workload is varied. Psychonomic Bulletin & Review, 659–681. [DOI] [PubMed] [Google Scholar]
- Townsend JT, Houpt JW, & Silbert NH (2012). General recognition theory extended to include response times: Predictions for a class of parallel systems. Journal of Mathematical Psychology, 56 (6), 476–494. [Google Scholar]
- Townsend JT, & Nozawa G (1995). On the spatio-temporal properties of elementary perception: An investigation of parallel, serial, and coactive theories. Journal of Mathematical Psychology, 39, 321–359. [Google Scholar]
- Townsend JT, & Wenger MJ (2004a). The serial-parallel dilemma: A case study in a linkage of theory and method. Psychonomic Bulletin & Review, 11, 391–418. [DOI] [PubMed] [Google Scholar]
- Townsend JT, & Wenger MJ (2004b). A theory of interactive parallel processing: Newcapacity measures and predictions for a response time inequality series. Psychological Review, 111, 1003–1035. [DOI] [PubMed] [Google Scholar]
- Townsend JT, & Wenger MJ (2015). On the dynamic perceptual characteristics of gestalten: Theory-based methods In Wagemans J (Ed.), The oxford handbook of perceptual organization (p. 948–968). Oxford. [Google Scholar]
- Townsend JT, Wenger MJ, & Houpt JW (2018). Uncovering mental architecture and related mechanisms in elementary human perception, cognition and action In Wagenmaker EJ (Ed.), The stevens handbook of experimental psychology and cognitive neuroscience. London: Oxford Univerity Press. [Google Scholar]
- Von Der Heide RJ, Wenger MJ, Bittner JL, & Fitousi D (2018). Converging operations and the role of perceptual and decisional influences on the perception of faces: Neural and behavioral evidence. Brain and Cognition, 122, 59–75. [DOI] [PubMed] [Google Scholar]
- Wang R, Wang J, Zhang J-Y, Xie X-Y, Yang Y-X, Luo S-H, et al. (2016). Perceptual learning at a conceptual level. Journal of Neuroscience, 36 (7), 2238–2246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang R, Zhang J-Y, Klein SA, Levi DM, & Yu C (2012). Task relevancy and demand modulate double-training enabled transfer of perceptual learning. Vision Research, 61, 33–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang R, Zhang J-Y, Klein SA, Levi DM, & Yu C (2014). Vernier perceptual learning transfers to completely untrained retinal locations after double training: A “piggybacking” effect. Journal of Vision, 14 (13), 12–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watanabe T, & Sasaki Y (2015). Perceptual learning: toward a comprehensive theory. Annual Review of Psychology, 66, 197–221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wenger MJ (1999). On the whats and hows of retrieval in the acquisition of a simple skill. Journal of Experimental Psychology: Learning, Memory and Cognition, 25, 1137–1160. [DOI] [PubMed] [Google Scholar]
- Wenger MJ, & Payne DG (1995). On the acquisition of mnemonic skill: Application of skilled memory theory. Journal of Experimental Psychology: Applied, 3, 194–215. [Google Scholar]
- Wenger MJ, & Townsend JT (2001). Faces as Gestalt stimuli: Process characteristics In Wenger MJ & Townsend JT (Eds.), Computational, geometric, and process perspectives on facial cognition (p. 229–284). Mahwah, NJ: Erlbaum. [Google Scholar]
- Wenger MJ, & Townsend JT (2006). On the costs and benefits of faces and words. Journal of Experimental Psychology: Human Perception and Performance, 32, 755–779. [DOI] [PubMed] [Google Scholar]
- Zhang J-Y, Cong L-J, Klein SA, Levi DM, & Yu C (2014). Perceptual learning improves adult amblyopic vision through rule-based cognitive compensation. Investigative Ophthalmology & Visual Science, 55 (4), 2020–2030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Q, Walsh MM, & Anderson JR (2018). The Impact of inserting an additional mental process. Computational Brain & Behavior, 1–14.30906917 [Google Scholar]
- Zhaoping L, Herzog MH, & Dayan P (2003). Nonlinear ideal observation and recurrent preprocessing in perceptual learning. Network: Computation in Neural Systems, 14 (2), 233–247. [PubMed] [Google Scholar]









