Abstract
Performance for many visual tasks improves with training. The magnitude of improvement following training depends on the training task, number of trials per training session and the total amount of training. Does the magnitude of improvement also depend on the frequency of training sessions? In this study, we compared the learning effect for three groups of normally sighted observers who repeatedly practiced the task of identifying crowded letters in the periphery for six sessions (1000 trials per session), according to three different training schedules — one group received one session of training everyday, the second group received a training session once a week and the third group once every two weeks. Following six sessions of training, all observers improved in their performance of identifying crowded letters in the periphery. Most importantly, the magnitudes of improvement were similar across the three training groups. The improvement was accompanied by a reduction in the spatial extent of crowding, an increase in the size of visual span and a reduction in letter-size threshold. The magnitudes of these accompanied improvements were also similar across the three training groups. Our finding that the effectiveness of visual perceptual learning is similar for daily, weekly and biweekly training has significant implication for adopting perceptual learning as an option to improve visual functions for clinical patients.
Keywords: perceptual learning, crowding, letter identification, peripheral vision
INTRODUCTION
Performance for a variety of visual tasks improves with practice (e.g. Ball & Sekuler, 1982; 1987; Beard, Levi & Reich, 1995; Fahle & Edelman, 1993; Fiorentini & Berardi, 1980, 1981; Karni & Sagi, 1991; McKee & Westheimer, 1978; Poggio, Fahle & Edelman, 1992; Saarinen & Levi, 1995). The magnitude of improvement following the process of repeated practice (training), often termed perceptual learning, depends on many aspects of the training regime, including the task chosen for training, the total amount of practice and the amount of practice w ithin each training session. Recently, perceptual learning has been proposed as a treatment to improve visual functions or to overcome some of the disabilities as a result of amblyopia (Astle, Webb & McGraw, 2011; Levi & Li, 2009; Polat, 2009), presbyopia (Polat et al, 2012) and macular disorders (Chung, 2011). A major consideration for applying perceptual learning to improving vision in clinical patients is compliance, which usually relates to the inconveniences brought about by the training regime. For instance, if the training regime calls for many training sessions, or extensive hours of training for each session, patients may find it difficult to adhere to the training schedule. Fortunately, for many visual tasks, improvements usually occur fairly rapidly for the first couple of training sessions (e.g. Fiorentini & Berardi, 1981; Karni & Sagi, 1993; Poggio, Fahle & Edelman, 1992), although it has been shown that performance for certain tasks could improve slowly after the initial rapid improvement; and may require up to 40-50 hours of practice to reach a plateau (Li, Provost & Levi, 2007; Li, Klein & Levi, 2009). Also, shorter training sessions have been shown to be more effective in inducing improvements than longer ones (Molloy et al, 2012). Therefore, the number of training sessions and the duration of each session may not be the major factors limiting patient compliance.
Numerous studies that examined perceptual learning in observers with normal vision adopted a protocol in which observers attended daily training sessions (e.g. Chung, 2007; Chung, Legge & Cheung, 2004; Chung, Levi & Tjan, 2005), or at the minimum, three to five training sessions per week (e.g. Gold, Bennett & Sekuler, 1999; Li, Provost & Levi, 2007; Li, Klein & Levi, 2009; Saarinen & Levi, 1995; Sun, Chung & Tjan, 2010). This frequency of training sessions was believed to be crucial to maximize the benefit of perceptual learning. However, is daily training really necessary to obtain the largest amount of improvement? If perceptual learning is to be adopted as a treatment for clinical populations, relaxing the frequency of training sessions is necessary as many patients may not be able to attend daily training sessions. This is especially so for visually impaired patients who are not able to drive and thus their availability to attend training sessions would depend on arrangements for transportation. Therefore, the primary goal of this study was to examine the dependence of the efficacy of perceptual learning on the frequency of training sessions. We compared the amount of improvement following perceptual learning in the normal periphery for three training schedules: daily, weekly and biweekly (every two weeks). Daily training was the popular schedule used in many previous perceptual learning studies. We chose to evaluate the effectiveness of a weekly and biweekly training schedule because many visually impaired observers are able to participate in research projects in our laboratory once a week or once every two weeks, implying that a weekly or biweekly training schedule is feasible for this group of patients. However, it remains unknown whether the effectiveness of perceptual learning would be reduced when there is a longer time interval between training sessions.
Though virtually any training task could be used to address our primary goal, in this study, we trained normally sighted observers to identify letters closely flanked by two other letters (learning to “uncrowd”) in the periphery. This training task has been proven to be effective in reducing crowding in the normal periphery (Chung, 2007; Sun et al, 2010) and in observers with amblyopia (Chung, Li & Levi, 2012). Crowding refers to the deleterious influence of nearby contours on visual discrimination (Pelli, Palomares & Majaj, 2004; Levi, 2008). Previously, we trained observers to identify a letter closely flanked by two other letters, and found that following 6000 trials of repeated testing, observers improved in their ability to identify the flanked letters. This effect was found in the normal periphery (Chung, 2007; Sun et al, 2010) as well as in the amblyopic eye of a group of amblyopic observers (Chung et al, 2011). Because our target and flanking letters were randomly chosen from the 26 lowercase letters on each trial, the observed improvements could not be attributed to observers learning a specific combination of letters, as in studies in which only a very limited set of combinations of letters was used for training (e.g. Huckauf and Nazir, 2007). Using a different paradigm in which the letter spacing between the flanking letters and the target letter varied during training, Hussain et al (2012) reported a similar effect that crowding can be reduced in the normal periphery and in amblyopic observers through perceptual learning. Interestingly, even when the training task was not specifically designed to reduce crowding, such as in video-game playing (e.g. Green & Bavelier, 2007; Li, Ngo, Nguyen & Levi, 2011), or using a task that is more closely related to lateral masking than crowding1 (Maniglia et al, 2011), a reduction in crowding was still observed following perceptual learning. The reduction in crowding was manifested as either better acuities measured in the presence of flankers in close proximity, or a reduction in the target-flanker spacing such that the performance for discriminating some attribute of the target (e.g. contrast or orientation) was not affected (for a review, see Huurneman et al, 2012).
The design of this study closely followed that of Chung (2007) with some modifications. In the Chung (2007) study, despite a substantial improvement in observers’ ability to identify crowded letters following a daily training protocol, the improvement did not lead to improved reading speed. Previously, Legge and coworkers showed that the visual span, the number of characters that can be recognized in a single glance, is a sensory bottleneck on reading (Legge, 2007; Legge et al, 2007). This supposition is based on the strong correlation (r2 > 0.8) between reading speed and the size of the visual span (expressed as mutual information transmitted in bits, see Methods) determined for different stimulus characteristics such as contrast, letter size and stimulus presentation eccentricity. Given the link between reading speed and visual span, and the finding of Chung (2007), we expected that the visual span, like reading speed, would not benefit from the same uncrowd training task. Such a result would further strengthen the supposition of the visual span as a sensory bottleneck on reading. On the contrary, if the visual span benefits from the uncrowd training task, then the close relationship between the visual span and reading speed would need to be revisited, and the results might help us understand why reducing crowding does not benefit reading speed. The secondary goal of this study was to test if the improvements following a training protocol to learn to uncrowd would lead to an enlargement in the visual span.
To preview our results, we found that observers showed an improved ability to identify crowded letters following six sessions of training. Most importantly, the magnitudes of improvement were similar for the daily, weekly and biweekly training groups. The improvement due to training was accompanied by a reduction in the spatial extent of crowding, an increase in the size of the visual span and a reduction in letter-size threshold. The magnitudes of these (transferred) improvements were also similar among the three training groups.
METHODS
Twenty-four young adults with normal vision, aged 19 – 27, participated in this study. Written informed consent was obtained from each observer after the procedures of the experiment were explained and prior to the commencement of data collection. Observers were randomly assigned to one of three training groups, with eight observers in each group.2 The three training groups differed only on the frequency of the training sessions, with one group receiving training on a daily basis (“daily”), the second group received training on a weekly basis (once a week on the same day of the week, “weekly”) and the third group received training every fortnight (once every other week on the same day of the week, “biweekly”). The average ages of the three groups were very similar (daily = 20.13 years, weekly = 20.75 years, biweekly = 20.5 years). All testings (pre-tests, training and post-tests) were performed at 10° eccentricity in the lower visual field.
The basic experimental design and training schedule are represented schematically in Fig. 1. The pre-test, lasted approximately 1.5 h, consisted of the measurements of letter - size threshold, spatial extent of crowding and a visual-span profile (in the order listed). The letter-size threshold was used to determine the letter size that was used in subsequent testings (other pre-tests and training).
Training consisted of six sessions, each lasting approximately 1 h. The training task was very similar to that used in Chung (2007), whereby observers identified a letter flanked closely by two other letters on each trial, at 10° in the inferior visual field (Fig. 2A). The only differences between this study and Chung (2007) were that we used Courier font in this study (Times font was used in Chung (2007)) and that we specified the letter separation with respect to the standard letter spacing (equivalent to 1.16× the width of the lowercase letter x) in Courier. Each session comprised 10 blocks of trials, with 100 trials per block. Observers in the daily training group completed their training over six consecutive days, while observers in the weekly and biweekly training groups completed their training over six and eleven consecutive weeks, respectively.
The post-test immediately followed the last training session on the same day. Following the last training session, observers were given a 15 – 30 min break before the post-test commenced. The post-test was identical to the pre-test except that the measurements of the visual-span profile, spatial extent of crowding and letter-size thresholds were conducted in the reverse order as that during the pre-test.
Stimuli
Trigrams, random sequences of three letters arranged horizontally, were used as stimuli during all phases of the experiment (pre-test, training and post-test). With the exception of the visual-span measurements, the middle letter of each trigram was presented at 10° directly below a fixation target. For the visual-span measurements, the trigrams were presented at 10° below the fixation target, but at various letter positions right or left of the vertical midline (see details later). Stimuli were generated on a Macintosh G4 computer with software written in MATLAB 5.2.2 (The MathWorks, MA), using the Psychophysics Toolbox extensions (Brainard, 1997; Pelli, 1997), and were displayed on a Sony color monitor (Model # GDM-17E21, refresh rate = 75 Hz). All trigrams comprised only lowercase letters rendered in Courier font, randomly chosen from the 26 letters of the Roman alphabet, and were rendered as black (0.2 cd/ m 2) letters on a white background (45 cd/ m 2). The center-to-center separation between adjacent letters of the trigrams varied, depending on the different task (see below). Each trigram was presented for 106 ms, a duration shorter than the latency of saccadic eye movements. The task of the observers was to identify only the middle flanked letter (all tasks except for the measurement of visual-span profiles) or all three letters (measurement of visual-span profiles) of each trigram while fixating the fixation target.
Pre- & post-test: Letter-size threshold measurements
Letter-size threshold was measured using letter trigrams, where adjacent letters were separated from each other by a center-to-center separation equivalent to 3× the standard letter spacing. This letter separation was large enough such that the measured size threshold approximated the unflanked threshold; while at the same time, maintained the same task demand for our observers. We used the Method of Constant Stimuli to present trigrams at five letter sizes in each block. Observers’ task was to identify the middle letter of each trigram. For each observer, a cumulative-Gaussian function was used to fit the data relating the proportion of correct responses and letter size. The letter-size threshold was defined as the letter size that yielded 52% correct (50% correct after correction for guessing) on the cumulative-Gaussian function.
Pre- & post-test: Spatial extent of crow ding measurements
Performance for identifying the middle letters of trigrams was measured as a function of letter separation during pre- and post-tests, as in Chung (2007). Five letter separations were tested, ranging from 0.8 to 2× the standard letter spacing (Fig. 2B). Letter size was 1.4× the letter-size threshold as determined previously for each observer, which yielded a proportion of correct responses averaging 0.85 (range: 0.63 to 0.93) for the largest letter separation before training. A cumulative-Gaussian function was used to fit the data relating the proportion of correct responses and letter separation. The separation that yielded 52% correct (50% correct after correction for guessing) on the cumulative-Gaussian function was used to represent the spatial extent of crowding.
Pre- & post-test: Visual-span measurements
Visual-span profiles were measured using a trigram-recognition task as in Chung et al (2004). Letter size was fixed at 1.4× the letter-size threshold. For this task, trigram s were presented at 13 positions, indexed by the position of the middle letter, from six letter slots left of the vertical midline (letter slot 0 was 10° directly below fixation) to six letter slots right of the vertical midline (Fig. 2C). Each trigram position was tested ten times in a random order within a block of trials, yielding a total of 130 trials tested in each block. Observers’ task was to identify all three letters of each trigram, from left to right. A letter was scored as being identified correctly if and only if its order within the trigram was also correct. To calculate the overall performance of letter identification at each letter slot, we combined the identification accuracies across trials where the letter slot was occupied by the left, middle or right letter of a trigram. A split-Gaussian function centered at letter slot 0 was then used to fit each set of data relating identification accuracy and letter position, representing the visual-span profile (Legge et al., 2001; Chung et al., 2004). Curve-fitting was restricted to data within five letter slots left and right of fixation because the sixth letter slot left and right of fixation did not contain trials where the letter slot was occupied by the inner letter (the letter of a trigram closest to fixation). To quantify the size of the visual span, following Legge et al. (2001) and Chung et al. (2004), we converted the identification accuracy at each letter slot to bits of mutual information transmitted by the visual span. According to Information Theory (Shannon, 1948), mutual information measures the amount of information that can be obtained about one random variable by observing another. In other words, it quantifies the dependence between the joint probabilities (the entropy or the uncertainty) of two events. With respect to our task, the two events could be: what is the probability of an observer’s response being a ‘b’, given a stimulus letter ‘h’? Because there were 26 letters, the mutual information transmitted at a given letter slot ranged from zero bit for chance accuracy of 0.0384 to approximately 4.7 bits for perfect identification. To convert letter identification accuracy at a given letter slot to bits of mutual information transmitted, we used the following equation which was derived based on confusion matrices for single letter identification determined empirically by Beckmann (1998):
bits of information = −0.037 + 4.676 × proportion correct of letter identification Then we summed up the total bits of information transmitted across all letter slots of the visual-span profile. This method of quantifying the visual span is akin to calculating the area under the curve.
Training task
The training task was similar to the one used in Chung (2007), whereby observers repeatedly identified the middle letters of trigrams rendered at a letter separation of 0.8× the standard spacing, which caused substantial crowding (letter identification accuracy was much lower than that for a larger letter separation). Letter size used for training was 1.4× the letter-size threshold. Regardless of the training group assignment, all observers completed six sessions of training. Each session consisted of 10 blocks of trials, with 100 trials per block.
Data analyses and reporting
Curve-fitting was accomplished using Igor Pro, which minimized the χ-square between the observed and predicted values. Unless otherwise stated, the reported error bars associated with the group-average values represent the 95% confidence intervals.
RESULTS
Consistent with the results of a previous study (Chung, 2007), performance for identifying the middle letters of trigrams rendered at a letter separation of 0.8× the standard spacing at 10° in the normal periphery improved following six days of training for the daily training group. More importantly, using the same training task, observers in the weekly and biweekly training groups also showed substantial improvement following six sessions of training. Fig. 3 presents the data during training for two observers of each training group, one with the most (top panels) and the other with the least (bottom panels) amount of improvement, to show the range of performance during training for each training group. To quantify the improvement, for each observer, we fit a linear regression function relating his/ her identification accuracy as a function of training block (Chung, 2007; Chung, Li & Levi, 2012). A t-test was performed to determine if the slope of each regression function differed from a slope of 0, an indication that there was no improvement due to training. The p -values of this analysis are given in Table 1. After Bonferroni correction for multiple comparisons (p -value ≤ 0.0021 to be considered as significant), the changes in performance during training for one observer of each of the three training groups are found to be not significantly different from 0, implying these three observers did not show any training effect. The proportion of observers who did not show any improvement following training is lower than those reported in the literature (Fahle & Henke-Fahle, 1996; Chung, Levi & Tjan, 2005).
Table 1.
Observer | Training data: p- value of regression fit |
Training data: (60th block accuracy – 1st block accuracy)/ 1st block accuracy |
Post/pre ratio of spatial extent of crowding |
Post – pre difference in size of visual span (bits) |
Post/pre ratio of letter-size threshold |
---|---|---|---|---|---|
DAILY TRAINING GROUP | |||||
D1 | 0.0001 | 0.209 | 0.825 | 8.470 | 0.787 |
D2 | 0.7356 | −0.015 | 0.948 | −0.320 | 0.776 |
D3 | <0.0001 | 0.635 | 0.592 | 5.469 | 0.970 |
D4 | 0.0004 | 0.308 | 0.700 | 7.008 | 0.990 |
D5 | <0.0001 | 0.443 | 0.783 | 8.670 | 0.492 |
D6 | <0.0001 | 0.639 | 0.982 | 7.491 | 0.679 |
D7 | <0.0001 | 0.608 | 0.804 | 9.344 | 0.729 |
D8 | 0.0094 | 0.202 | 0.888 | 9.276 | 0.786 |
AVERAGE | 0.379 ± 0.168 | 0.815 ± 0.089 | 6.926 ± 2.219 | 0.776 ± 0.110 | |
WEEKLY TRAINING GROUP | |||||
W1 | <0.0001 | 0.247 | 1.014 | 2.802 | 0.771 |
W2 | <0.0001 | 0.908 | 0.789 | 4.920 | 0.832 |
W3 | <0.0001 | 0.312 | 0.875 | 10.680 | 1.089 |
W4 | <0.0001 | 0.571 | 0.774 | 7.697 | 0.688 |
W5 | <0.0001 | 1.096 | 0.615 | 10.600 | 0.815 |
W6 | 0.3463 | 0.051 | 0.800 | 5.382 | 0.787 |
W7 | <0.0001 | 0.277 | 0.869 | 8.965 | 0.685 |
W8 | <0.0001 | 0.488 | 0.713 | 7.875 | 0.813 |
AVERAGE | 0.494 ± 0.245 | 0.806 ± 0.082 | 7.365 ± 1.944 | 0.810 ± 0.087 | |
BIWEEKLY TRAINING GROUP | |||||
B1 | 0.0020 | 0.208 | 0.832 | 5.902 | 0.706 |
B2 | 0.0013 | 0.261 | 1.022 | 3.055 | 0.817 |
B3 | 0.0017 | 0.244 | 0.929 | 12.010 | 0.741 |
B4 | 0.0019 | 0.212 | 0.890 | 7.326 | 0.942 |
B5 | <0.0001 | 0.327 | 0.853 | 6.022 | 0.682 |
B6 | 0.0008 | 0.273 | 0.925 | 6.380 | 0.716 |
B7 | 0.1078 | 0.112 | 1.075 | 4.526 | 0.831 |
B8 | <0.0001 | 0.449 | 0.819 | 8.117 | 0.727 |
AVERAGE | 0.261 ± 0.068 | 0.918 ± 0.063 | 6.668 ± 1.849 | 0.770 ± 0.060 |
With the linear regression function, another analysis we performed to quantify the magnitude of improvement due to training was to compare the performance accuracy between the first and the last (60th) block based on the calculated values from the fitted function (see Table 1). Averaged across observers of each group, the magnitude of improvement ((accuracy for the last block – accuracy for the first block)/ accuracy for the first block) was 37.9 ± 16.8%, 49.4 ± 24.5% and 26.1 ± 6.8% for the daily, weekly and biweekly training groups, respectively. These values are not different from each other (ANOVA: Fdf=2 = 2.09, p = 0.15; see Fig. 4A). Note that the initial performance was also similar across the three training groups (results not shown, ANOVA: Fdf=2 = 0.13, p = 0.88).
To examine whether the improvement in identifying crowded letters generalize to improved ability to identify letters flanked at larger letter separations, we compared the pre- and post-test measurements of the spatial extent of crowding. Fig. 5 shows the data from two observers of each training group, one with the largest change in the spatial extent of crowding (top panels) and the other with the smallest change in the spatial extent of crowding (bottom panels). Dotted lines in the panel for observer D3 represent how we defined the spatial extent of crowding (the letter separation that yielded a proportion-correct of identifying the middle flanked letters at 52% correct). The change in the spatial extent of crowding, expressed as the post-pre ratio (PPR: Levi & Li, 2009), averaged across observers of each group, is p lotted for the three training groups in Fig. 4B (see also Table 1). A PPR smaller than 1 implies that the spatial extent is smaller after training than before. Averaged across observers of each group, the PPR averaged 0.815 ± 0.089, 0.806 ± 0.082 and 0.918 ± 0.063 for the daily, weekly and biweekly training groups, respectively. Because the 95% confidence ranges do not include the value of 1 (no improvement), these PPRs represent significant changes in the spatial extent of crowding following training for all three training groups. However, these changes are not different from one another (ANOVA: Fdf=2 = 2.59, p = 0.10).
We also examined if the improvement in identifying crowded letters generalize to an increased in the size of the visual span. Fig. 6 presents the data from two observers of each training group, one with the largest increase in the size of the visual span (top panels) and the other with the least increase in the size of the visual span (bottom panels). For each observer, we calculated the difference in the size of the visual span before and after training. Averaged across observers of each group, the visual span increased by 6.93 ± 2.22 bits, 7.37 ± 1.94 bits and 6.67 ± 1.85 bits for the daily, weekly and biweekly training groups, respectively (Fig. 4C and Table 1). In other words, all three groups exhibited significant improvement in the size of the visual span following training on identifying crowded letters (the 95% confidence limits of all the distributions did not contain the value of 0), but these improvements are not different among the three groups (ANOVA: Fdf=2 = 0.12, p = 0.89).
The third comparison we made for measurements before and after training was the size threshold for letter identification. The PPR for the three training groups averaged 0.78 ± 0.11, 0.81 ± 0.09 and 0.77 ± 0.06 for the daily, weekly and biweekly training groups, respectively (Fig. 4D and Table 1). ANOVA shows that these values are not statistically different from one another (Fdf=2 = 0.23, p = 0.80).
DISCUSSION
Following six sessions (6000 trials) of repeated practice on the task of identifying crowded letters at 10° inferior visual field, observers’ performance for identifying such letters improved substantially. This improvement following training transferred to other untrained letter separations such that the spatial extent of crowding, defined as the letter separation at which the target letter was identified at 50% accuracy (after correction for guessing), decreased with training. Further, the improvement also led to an enlargement of the visual-span profile, and improved letter-size threshold. Most importantly, the amount of improvements for the trained task or other untrained tasks did not depend on whether training was conducted on a daily, weekly or biweekly schedule. These findings bear significant practical implication as they imply that there is no need for observers to attend daily training sessions in the laboratory t o maximize their benefits from perceptual learning. This is especially important if perceptual learning is going to be adopted as a treatment for clinical populations.
It is well known that the magnitude of improvement following perceptual learning depends on many aspects of the training regime, including the training task (Fine & Jacobs, 2002), the total number of practice trials (e.g. Tsodyks & Gilbert, 2004), the amount of practice within each training session (Aberg et al, 2009; Hussain, Sekuler & Bennett, 2009; Kumar & Glaser, 1993) and the distribution of practice across time (Aberg et al, 2009; Molloy et al, 2012; Taub & Goldberg, 1973; Xue et al, 2011). Task specificity is a notable characteristic of perceptual learning. Even for tasks that are seemingly related, for example, letter identification and reading, the transfer of learning from a trained to an untrained task is usually not complete. Maximal improvement is always obtained using a training task that targets directly at the intended visual function (Yu et al, 2010). The choice of a training task should therefore, be based on the specific visual function for which an improvement is desired. Even with the most relevant task, how much training is necessary? For sensory visual tasks, a recent report suggests that as few as one trial per condition on the first day of training is sufficient to produce an improvement greater than that for the control group (Hussain, Sekuler & Bennett, 2009), although maximal learning requires more practice trials, from a total of several hundreds (Fiorentini & Berardi, 1981; Karni & Sagi, 1993; Poggio, Fahle & Edelman, 1992; Wright & Sabin, 2007) to even thousands (Li, Levi & Klein, 2004), depending on the specific training task. Note that however, performance for some tasks may continue to improve with additional practice up to 40 – 50h (Li, Provost & Levi, 2007; Li, Klein & Levi, 2009). But how much practice within each training session is needed? Aberg et al (2009) reports that for a Chevron discrimination task, a minimum of 400 trials per training session is required to produce improvements. However, numerous reports show that the distribution of the practice trials, instead of the total number of practice trials, is a more important factor governing the amount of learning. The distribution of practice trials has often been discussed in the context of massed vs. spaced practice. Massed practice refers to the condition when successive trials are delivered one after another in a continuous manner whereas spaced training refers to the condition when there is a short time interval between successive trials, even if the interval is in the order of a few seconds (Ramsay, Utrecht & Alkema, 1967; Taub & Goldberg, 1973; for reviews, see Donovan & Radosevich, 1999; or Cepeda et al, 2006). Most studies report that spaced practice is often accompanied by a greater magnitude of improvement and a higher retention of learning than massed practice (for reviews, see Donovan & Radosevich, 1999; or Cepeda et al, 2006), a finding that has been attributed to the need for consolidation of learned information, memory enhancement and/ or the reduction of neural repetition suppression (e.g. Karni & Sagi, 1993; Xue et al, 2011). However, at least in the visual perceptual learning literature, one aspect of the training regime that receives little, if any, attention is the frequency of training sessions. The frequency of training sessions differs from the massed vs. spaced training in that the former one refers to the time interval between successive training sessions while the latter one refers to the time interval between successive trials. If perceptual learning is to be adopted as a treatment, or a mean to improve visual functions for the clinical populations, it is important to know what is the maximum time interval between training sessions such that the maximal learning effect could still be observed. Using an uncrowded letter identification training task, we found that the improvement in observers’ performance for identifying crowded letters, and the improvements that were transferred to the untrained tasks (spatial extent of crowding, size of visual span and letter-size threshold) did not depend on whether observers were trained on a daily, weekly or biweekly schedule. In relation to the current theories of how perceptual learning relates to the need for the consolidation of learned information and memory, our results imply that the consolidation of the learned information is the same, and that the decay of the memory of the learned information does not change with the time interval between training sessions (at least up to intervals of two weeks). This seems to contradict our everyday experience that our memory of an event decays with time. However, our results are consistent with one or more of the following explanations: (1) the decay of memory of an event (can be considered as a “one-trial” practice) is different from the decay of memory of learned information through extensive practice (in our experiment, observers practiced 1000 trials per session), possibly due to a building-up or reinforcing of the representation of the learned information in memory through repeated practice in a single session; (2) the maximum amount of decay of learning occurs within 24 hours after the end of a training session; and/ or (3) the topping-up of the learning effect from each subsequent training session is the same regardless of the time interval since the last training session. Whether or not these speculations are correct is outside the scope of the current study but warrants further investigations.
Perceptual learning is defined as “… any relatively permanent and consistent change in the perception of a stimulus array, after practice or experience with this array” (Gibson, 1963). In order for perceptual learning to be a useful treatment to ameliorate the functional vision of clinical patients, in addition to the effectiveness of the training paradigm, another important consideration is the retention of the learning effect. Hence, an interesting question is whether the different training schedules affect how well observers retain their learning after training ceases, despite similar learning effects. Unfortunately, we did not examine retention in this study. However, in real life, if the trained task is an important daily task for the clinical patients (e.g. reading), then the patients are likely to be exposed to that task even after training ceases in the laboratory. This additional exposure could act as top -up training, and the question of how long the learning effect can be retained may not be too crucial an issue.
Comparison with Chung (2007)
The secondary goal of this study was to investigate if the improvements following a training protocol to learn to identify crowded letters would lead to an increase in the size in the visual span. The rationale for the investigation was to bridge our knowledge of the link between the visual span and reading speed (Legge, 2007; Legge, Mansfield & Chung, 2001; Legge et al., 2007), and the lack of an accompanied increase in reading speed following a very similar training paradigm (Chung, 2007). Using essentially the same training protocol (same number of observers, same trained retinal location and same number of daily training sessions and number of trials), the magnitude of improvement during training ((performance accuracy for the last block – performance accuracy for the first block)/ performance accuracy for the first block) was highly similar between the study of Chung (2007: 0.381 ± 0.086 [95% CI]) and the daily-training group of the present study (0.379 ± 0.168 [95% CI]).3 Thus, we can assume that had we trained the same group of observers and measured their visual spans and reading speeds before and after training, they would likely show an enlargement of the visual span, as we found in the present study, but not an increase in the maximum reading speed, as was reported in Chung (2007). Previously, Legge and co-workers reported that the size of the visual span and the maximum reading speed exhibit the same qualitative dependence on character size, contrast (Legge et al., 2007) and testing eccentricity (Legge, Mansfield & Chung, 2001). Further, following 20 repeated measurements of the visual-span profile, the size of visual span increases and is accompanied by an increase in the maximum reading speed (Chung, Legge & Cheung, 2004). Based on the relationships between the size of the visual span and reading speed for different stimulus or testing conditions, Legge et al (2007) deduced that an increase in the size of the visual span by 4.7 bits corresponds to approximately 39% increase in the maximum reading speed. However, despite the 7-bit increase in the size of the visual span exhibited by the daily-training group of the present study, maximum reading speed increased by a mere 7.2% in Chung (2007). What could have accounted for the lack of an increase in reading speed despite a sizeable increase in the size of the visual span?
The words we used for our RSVP reading task were presented left-justified on the computer display, and contained different number of letters. As such, the letters of the words extended to different letter positions right of fixation. We speculate that reading speed would benefit from an increase in the bits of information transmitted by the visual span if the increase in bits was distributed evenly across the different letter slots of the entire visual-span profile. However, our training task might have selectively improved the letter identification ability of our observers at only one letter position, the one corresponding to the target middle letter of trigrams, the position that was directly below fixation. Potentially our training task might have caused a location -specific improvement in letter identification, which might not have been of much benefit to reading. To determine if our training paradigm indeed led to an improvement that was specific to the location of the middle letters of trigrams (at 10° directly below fixation), we calculated the difference in bits of information transmitted by the visual span before and after training, only for letter position 0 which corresponded to the position of the middle letters of trigrams during training, and normalized this value to the overall difference in the size of the visual-span profile before and after training (from five letter slots left of the vertical midline to five letter slots right of the vertical midline). For comparison, this ratio was also calculated for the six observers who were trained at 10° in the inferior visual field using a visual-span letter-recognition training task, which presumably should have caused a more uniform improvement in letter-recognition across all letter positions (Chung, Legge & Cheung, 2004). Averaged across observers of the respective group, these ratios are 0.11 ± 0.03 [95% CI] and 0.08 ± 0.02 for the present study and that of Chung, Legge and Cheung (2004), respectively. These similar values suggest that although we only trained observers at one retinal location in the present study, the improvements transferred to other untrained letter positions thus leading to a general improvement in letter-identification across different letter positions. More importantly, these results suggest that our training trigrams presen ted directly below fixation did not cause a location-specific improvement in letter recognition, and thus could not explain why there was a lack of an increase in reading speed despite an increase in the size of the visual span.
Another possibility is that although both the visual span and reading speed are limited by low-level sensory factors such as letter contrast, letter size and retinal eccentricity, reading is likely to be subjected to additional higher-level limiting factors. These factors include contextual cues and the global shape information of combinations of letters (combinations of chunks of letters within a word or the overall word -shape). With our uncrowd training task, observers were trained to ignore the two flanking letters so as to improve their performance to identify the flanked letters. Thus, this training task would not have helped observers to improve their ability to recognize the global shape of letter-combinations, which could be important in reading (Pelli & Tillman, 2007). This is different from the visual-span letter-recognition training task in which observers had to identify all three letters of a trigram on each trial. One way to bridge the gap between the two types of training tasks is to train observers on trigrams that are always presented at one location, but require observers to report the identities for all three letters. An alternative way is to train observers using trigrams that are presented at different letter positions left and right of the vertical midline, but only require observers to report the middle letter of each trigram.
Summary
Consistent with the findings of Chung (2007), practicing identification of letters closely flanked by two other letters in peripheral vision improves observers’ performance for identifying such letters. This improvement transfers to other larger letter separations such that the spatial extent of crowding is reduced following training, and is accompanied by an increase in the size of the visual span and a smaller letter -size threshold. Most importantly, the magnitudes of these improvements do not depend on whether observers were trained on a daily, weekly or biweekly schedule. Our results imply that clinical patients may benefit from perceptual learning even if they receive training once a week or every other week. Indeed, Chung (2011) trained a group of observers with central vision loss due to macular disorders once a week using a reading task, and found a sizeable (an average of 53%) increase in reading speed following six sessions (weeks) of training. This result provides support that clinical patients with reduced vision can benefit from perceptual learning on a weekly (perhaps even less frequent) training schedule.
Perceptual learning compared for 3 training schedules: daily, weekly and biweekly
All observers received the same amount of training: 6 sessions of 1000 trials each
Magnitudes of improvement following training were similar for the 3 training groups
Transferred improvements to other tasks were also similar for the 3 training groups
Perceptual learning on a weekly or biweekly schedule was as effective as a daily one
Acknowledgments
This research was supported by NIH research grant R01-EY012810 and the UC Berkeley Undergraduate Research Apprentice Program (URAP). We thank Amit Arunkumar and Deyue Yu for their assistance in data collection, and Jean -Baptiste Bernard and Girish Kumar for their helpful comments.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
The distinction between lateral masking and crowding has been addressed in previous studies (Chung, Legge & Levi, 2001; Pelli, Palomares & Majaj, 2004).
A power analysis for ANOVA designs revealed that our sample size of eight observers per group yielded a power of 0.999 to detect any effect at p = 0.05, for our training task of identifying crowded letters, as well as for the untrained tasks of spatial extent of crowding measurements and visual-span measurements. For the trained task of identifying crowded letters, we assumed an improvement in proportion-correct of 0.181, with a standard deviation of 0.048 (values based on finding of Chung (2007)). This yielded an effect size of 3.77. For the untrained tasks, the assumed effect sizes were 6.783 (average post-pre ratio = 0.624, standard deviation = 0.092, based on Chung (2007)) for the spatial extent of crowding measurements; and 3.414 (average improvements in bits = 6.1, standard deviation = 1.787, based on Chung et al (2004)) for visual-span measurements.
Note that highly similar amount of learning was obtained despite the different fonts, and how letter separations were specified in the current study vs. Chung (2007).
REFERENCES
- Aberg KC, Tartaglia EM, Herzog MH. Perceptual learning with chevrons requires a minimal number of trials, transfers to untrained directions, but does not require sleep. Vision Research. 2009;49:2087–2094. doi: 10.1016/j.visres.2009.05.020. [DOI] [PubMed] [Google Scholar]
- Astle AT, Webb BS, McGraw PV. Can perceptual learning be used to treat amblyopia beyond the critical period of visual development? Ophthalmic & Physiological Optics. 2011;31:564–573. doi: 10.1111/j.1475-1313.2011.00873.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ball KK, Sekuler R. A specific and enduring improvement in visual motion discrimination. Science. 1982;218:697–698. doi: 10.1126/science.7134968. [DOI] [PubMed] [Google Scholar]
- Ball KK, Sekuler R. Direction-specific improvement in motion discrimination. Vision Research. 1987;27:953–965. doi: 10.1016/0042-6989(87)90011-3. [DOI] [PubMed] [Google Scholar]
- Beard BL, Levi DM, Reich LN. Perceptual learning in parafoveal vision. Vision Research. 1995;35:1679–1690. doi: 10.1016/0042-6989(94)00267-p. [DOI] [PubMed] [Google Scholar]
- Beckmann PJ. Unpublished doctoral thesis. University of Minnesota; 1998. Preneural factors limiting letter identification in central and peripheral vision. [Google Scholar]
- Brainard DH. The psychophysics toolbox. Spatial Vision. 1997;10:433–436. [PubMed] [Google Scholar]
- Cepeda NJ, Pashier H, Vul E, Wixted JT, Rohrer D. Distributed practice in verbal recall tasks: A review and quantitative synthesis. Psychological Bulletin. 2006;132:354–380. doi: 10.1037/0033-2909.132.3.354. [DOI] [PubMed] [Google Scholar]
- Chung STL. Learning to identify crow ded letters: does it improve reading speed? Vision Research. 2007;47:3150–3159. doi: 10.1016/j.visres.2007.08.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chung STL. Improving reading speed for people with central vision loss through perceptual learning. Investigative Ophthalmology & Visual Science. 2011;52:1164–1170. doi: 10.1167/iovs.10-6034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chung STL, Legge GE, Cheung S-H. Letter-recognition and reading speed in peripheral vision benefit from perceptual learning. Vision Research. 2004;44:695–709. doi: 10.1016/j.visres.2003.09.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chung STL, Levi DM, Legge GE. Spatial-frequency and contrast properties of crowding. Vision Research. 2001;41:1833–1850. doi: 10.1016/s0042-6989(01)00071-2. [DOI] [PubMed] [Google Scholar]
- Chung STL, Levi DM, Tjan BS. Learning letter identification in peripheral vision. Vision Research. 2005;45:1399–1412. doi: 10.1016/j.visres.2004.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chung STL, Li RW, Levi DM. Learning to id entify near-acuity letters, either with or without flankers, results in improved letter size and spacing limits in adults with amblyopia. PLoS ONE. 2012;7(4):e35829. doi: 10.1371/journal.pone.0035829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Donovan JJ, Radosevich DJ. A meta-analytic review of the distribution of practice effect: Now you see it, now you don’t. Journal of Applied Psychology. 1999;84:795–805. [Google Scholar]
- Fahle M, Edelman S. Long-term learning in Vernier acuity: Effects of stimulus orientation, range and of feedback. Vision Research. 1993;33:397–412. doi: 10.1016/0042-6989(93)90094-d. [DOI] [PubMed] [Google Scholar]
- Fahle M, Henke-Fahle S. Interobserver variance in perceptual performance and learning. Investigative Ophthalmology & Visual Science. 1996;37:869–877. [PubMed] [Google Scholar]
- Fine I, Jacobs RA. Comparing perceptual learning across tasks: A review. Journal of Vision. 2002;2:190–203. doi: 10.1167/2.2.5. [DOI] [PubMed] [Google Scholar]
- Fiorentini A, Berardi N. Perceptual learning specific for orientation and spatial frequency. Nature. 1980;287:43–44. doi: 10.1038/287043a0. [DOI] [PubMed] [Google Scholar]
- Fiorentini A, Berardi N. Learning in grating waveform discrimination: Specificity for orientation and spatial frequency. Vision Research. 1981;21:1149–1158. doi: 10.1016/0042-6989(81)90017-1. [DOI] [PubMed] [Google Scholar]
- Gibson EJ. Perceptual learning. Annual Review of Psychology. 1963;14:2956. doi: 10.1146/annurev.ps.14.020163.000333. [DOI] [PubMed] [Google Scholar]
- Gold J, Bennett PJ, Sekuler AB. Signal but not noise changes with perceptual learning. Nature. 1999;402:176–178. doi: 10.1038/46027. [DOI] [PubMed] [Google Scholar]
- Green CS, Bavelier D. Action-video-game experience alters the spatial resolution of vision. Psychological Science. 2007;18:88–94. doi: 10.1111/j.1467-9280.2007.01853.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huckauf A, Nazir TA. How odgcrnwi becomes crowding: stimulus-specific learning reduces crowding. Journal of Vision. 2007;7(2):18.1–18.12. doi: 10.1167/7.2.18. [DOI] [PubMed] [Google Scholar]
- Hussain Z, Sekuler AB, Bennett PJ. How much practice is needed to produce perceptual learning? Vision Research. 2009;49:2624–2634. doi: 10.1016/j.visres.2009.08.022. [DOI] [PubMed] [Google Scholar]
- Hussain Z, Webb BS, Astle AT, McGraw PV. Perceptual learning reduces crowding in amblyopia and in the normal periphery. Journal of Neuroscience. 2012;32:474–480. doi: 10.1523/JNEUROSCI.3845-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huurneman B, Boonstra FN, Cox RFA, Cillessen AHN, van Rens G. A systematic review on ‘foveal crowding’ in visually impaired children and perceptual learning as a method to reduce crowding. BMC Ophthalmology. 2012;12:27. doi: 10.1186/1471-2415-12-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karni A, Sagi D. Where practice makes perfect in texture discrimination: Evidence for primary visual cortex plasticity. Proceedings of the National Academy of Sciences of the United States of America. 1991;88:4966–4970. doi: 10.1073/pnas.88.11.4966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karni A, Sagi D. The time course of learning a visual skill. Nature. 1993;365:250–252. doi: 10.1038/365250a0. [DOI] [PubMed] [Google Scholar]
- Kumar T, Glaser DA. Initial performance, learning and observer variability for Hyperacuity tasks. Vision Research. 1993;33:2287–2300. doi: 10.1016/0042-6989(93)90106-7. [DOI] [PubMed] [Google Scholar]
- Legge GE. Psychophysics of reading in normal and low vision. Mahwah, NJ and London; Lawrence Erlbaum Associates; 2007. [Google Scholar]
- Legge GE, Mansfield JS, Chung STL. Psychophysics of reading. XX. Linking letter recognition to reading speed in central and peripheral vision. Vision Research. 2001;41:725–734. doi: 10.1016/s0042-6989(00)00295-9. [DOI] [PubMed] [Google Scholar]
- Legge GE, Cheung S-H, Yu D, Chung STL, Lee H-W, Owens DP. The case for the visual span as a sensory bottleneck in reading. Journal of Vision. 2007;7(2):9.1–9.15. doi: 10.1167/7.2.9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Levi DM. Crowding — An essential bottleneck for object recognition: A mini-review. Vision Research. 2008;48:635–654. doi: 10.1016/j.visres.2007.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Levi DM, Li RW. Perceptual learning as a potential treatment for amblyopia: a mini-review. Vision Research. 2009;49:2535–2549. doi: 10.1016/j.visres.2009.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li RW, Klein SA, Levi DM. Prolonged perceptual learning of positional acuity in adult amblyopia: Perceptual template retuning dynamics. Journal of Neuroscience. 2009;28:14223–14229. doi: 10.1523/JNEUROSCI.4271-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li RW, Levi DM, Klein SA. Perceptual learning improves efficiency by re-tuning the decision ‘template’ for position discrimination. Nature Neuroscience. 2004;7:178–183. doi: 10.1038/nn1183. [DOI] [PubMed] [Google Scholar]
- Li RW, Ngo C, Nguyen J, Levi DM. Video-game play induces plasticity in the visual system of adults with amblyopia. PLoS Biology. 2011;9(8):e1001135. doi: 10.1371/journal.pbio.1001135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li RW, Provost A, Levi DM. Perceptual learning results in substantial recovery of positional acuity and visual acuity in juvenile amblyopia. Investigative Ophthalmology & Visual Science. 2007;48:5046–5051. doi: 10.1167/iovs.07-0324. [DOI] [PubMed] [Google Scholar]
- Maniglia M, Pavan A, Cuturi LF, Campana G, Sato G, Casco C. Reducing crowding by weakening inhibitory lateral interactions in the periphery with perceptual learning. PLoS ONE. 2011;6(10):e25568. doi: 10.1371/journal.pone.0025568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McKee SP, Westheimer G. Improvement in Vernier acuity with practice. Perception and Psychophysics. 1978;24:258–262. doi: 10.3758/bf03206097. [DOI] [PubMed] [Google Scholar]
- Molloy K, Moore DR, Sohoglu E, Amitay S. Less is more: Latent learning is maximized by shorter training sessions in auditory perceptual learning. PLoS ONE. 2012;7(5):e36929. doi: 10.1371/journal.pone.0036929. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pelli DG. The videotoolbox software for visual psychophysics: transforming numbers into movies. Spatial Vision. 1997;10:437–442. [PubMed] [Google Scholar]
- Pelli DG, Palomares M, Majaj NJ. Crowding is unlike ordinary masking: Distinguishing feature integration from detection. Journal of Vision. 2004;4:1136–1169. doi: 10.1167/4.12.12. [DOI] [PubMed] [Google Scholar]
- Pelli DG, Tillman KA. Parts, wholes, and context in reading: a triple dissociation. PLoS ONE. 2007;2(8):e680. doi: 10.1371/journal.pone.0000680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poggio T, Fahle M, Edelman S. Fast perceptual learning in visual hyperacuity. Science. 1992;256:1018–1021. doi: 10.1126/science.1589770. [DOI] [PubMed] [Google Scholar]
- Polat U. Making perceptual learning practical to improve visual functions. Vision Research. 2009;49:2566–2573. doi: 10.1016/j.visres.2009.06.005. [DOI] [PubMed] [Google Scholar]
- Polat U, Schor C, Tong JL, Zomet A, Lev M, Yehezkel O, Sterkin A, Levi DM. Training the brain to overcome the effect of aging on the human eye. Scientific Reports. 2012;2:278. doi: 10.1038/srep00278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramsay RW, Utrecht LC, Alkema D. The effect of massed versus spaced practice on auditory threshold. Acta Psychologica. 1967;26:227–232. doi: 10.1016/0001-6918(67)90020-0. [DOI] [PubMed] [Google Scholar]
- Saarinen J, Levi DM. Perceptual learning in Vernier acuity: What is learned? Vision Research. 1995;35:519–527. doi: 10.1016/0042-6989(94)00141-8. [DOI] [PubMed] [Google Scholar]
- Shannon CE. A mathematical theory of communication. Bell System Technical Journal. 1948;27:379–423. 623-656. [Google Scholar]
- Sun GJ, Chung STL, Tjan BS. Ideal observer analysis of crowding and the reduction of crowding through learning. Journal of Vision. 2010;10(5):16. doi: 10.1167/10.5.16. 1-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taub E, Goldberg IA. Prism adaptation: Control of intermanual transfer by distribution of practice. Science. 1973;180:755–757. doi: 10.1126/science.180.4087.755. [DOI] [PubMed] [Google Scholar]
- Tsodyks M, Gilbert C. Neural networks and perceptual learning. Nature. 2004;431:775–781. doi: 10.1038/nature03013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wright BA, Sabin AT. Perceptual learning: how much daily training is enough? Experimental Brain Research. 2007;180:727–736. doi: 10.1007/s00221-007-0898-z. [DOI] [PubMed] [Google Scholar]
- Xue G, Mei L, Chen C, Lu Z-L, Poldrack R, Dong Q. Spaced learning enhances subsequent recognition memory by reducing neural repetition suppression. Journal of Cognitive Neuroscience. 2011;23:1624–1633. doi: 10.1162/jocn.2010.21532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu D, Legge GE, Park H, Gage E, Chung STL. Development of a training protocol to improve reading performance in peripheral vision. Vision Research. 2010;50:36–45. doi: 10.1016/j.visres.2009.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]