Abstract
Poor discrimination of non-linguistic sounds has been implicated in language-learning problems in children, but research evidence has been inconsistent. This study included 32 participants with specific language impairment (SLI) and 32 typically-developing controls aged 7 to 16 years. Frequency discrimination thresholds were estimated in a task where participants had to distinguish a higher-frequency tone from a 1000 Hz tone. Neurophysiological responses were assessed in an oddball paradigm. Stimuli were either (a) 1030- or 1200-Hz pure tones (deviants) presented in a series of standard 1000-Hz tones, or (b) syllables: deviant [da] or [bi] in a series of standard [ba]. On the behavioural task, children (7 to 11-year-olds) had high thresholds, regardless of language status, but teenagers (12-16 yr) with SLI had higher thresholds than their controls. Conventional analysis of electrophysiological responses showed no difference between groups for the mismatch negativity (MMN), but the late discriminative negativity (LDN) was reduced in amplitude for smaller deviants in participants with SLI. Time-frequency analysis revealed that whereas the MMN reflected enhanced inter-trial coherence in the theta frequency band, the LDN corresponded to a period of event-related desynchronization extending across a wide low-frequency band including delta, theta and alpha. This manifested as a drop in power in those frequencies, which was marked in the controls but reduced or absent in children with SLI across all stimulus types. This provides compelling evidence for a low-level auditory perceptual impairment in SLI that affects a processing stage after initial detection of a sound change.
Keywords: Specific language impairment, auditory, speech perception, event-related potential, mismatch negativity, late discriminative negativity, time-frequency analysis, maturation, auditory discrimination, event-related desynchronization
Introduction
Event-related potentials (ERPs) can inform a long-standing debate in the field of developmental disorders, namely, whether language difficulties result from low-level auditory perceptual problems. Difficulties with phonological processing, i.e., the discrimination and/or categorisation of speech sounds, are reliably found in both developmental dyslexia and specific language impairment (SLI) (Bishop and Snowling, 2004). There is, however, little agreement as to whether these phonological difficulties indicate domain-specific deficits in the language system, or whether they are downstream consequences of auditory impairments (Bailey and Snowling, 2003). Behavioral evidence is hard to interpret because young children, who can distinguish stimuli adequately, may nevertheless obtain high auditory discrimination thresholds because of poor attention, memory or motivation.
The mismatch negativity (MMN) has been proposed as an alternative method for assessing auditory discrimination in SLI (Cheour, Leppänen, and Kraus, 2000). The MMN is assessed in a paradigm where rare deviant stimuli are presented in a train of repeated standards. Responses to deviants and standards are averaged separately and then subtracted, to reveal enhanced negativity to deviants, typically occurring around 100-250 ms post stimulus onset. Importantly, the MMN can be measured passively, without requiring the child to perform a task or attend to the stimuli. However, a review of MMN studies on dyslexia and SLI (Bishop, 2007) revealed wide disparities in findings. Bishop suggested that this might reflect the specific stimuli used: studies that showed significant differences between controls and language/literacy-impaired children tended to use standard-deviant pairings with small differences between stimuli. Studies also varied in the interval used to define the MMN, with many using time windows later than the traditional 100-250 ms post-onset period. This is important, because there is a later negativity in the mismatch wave, known as the late-discriminative negativity (LDN), which has different characteristics from the classic MMN (Cheour, Korpilahti, Martynova, and Lang, 2001). Bishop, Hardiman and Barry (in press) found that the LDN, in contrast to the MMN, was larger in children than in adults. Also, whereas the MMN becomes larger as the difference between standard and deviant increases, the opposite pattern was shown for the LDN, which was most pronounced for small deviants.
In the current study we used time-frequency analysis as well as more conventional averaging to assess mismatch responses in early (MMN) and late (LDN) time windows, comparing a group of individuals with SLI with a control group matched on age and nonverbal ability. Both groups were subdivided into children (7-11 years) and teenagers (12-16 years), because both behavioral and electrophysiological measures of auditory processing develop over this age range. We assessed frequency discrimination thresholds and event-related potentials (ERPs) to both tone and syllable stimuli, using large and small deviants presented in a train of standards. The hypotheses were that the group with SLI would show reductions of mismatch responses to small deviants only, and that this difference would be seen for both speech and non-speech stimuli.
Materials and Methods
Participants
Individuals with SLI (N = 32) were recruited from special schools and classrooms for children with language or literacy problems. All obtained a scaled score of 1.3 SD below average or worse on at least two of eleven language measures, had a nonverbal IQ of 80 or above, and passed a hearing screen covering speech frequencies (500 to 4000 Hz). Two reading tests were included in the language battery, but no child was included solely on the basis of poor literacy. No child had a diagnosis of autism or was taking any psychotropic medication. Children with signs of autistic features were screened using the Social Communication Questionnaire (Rutter, Bailey, and Lord, 2003) and excluded if they scored above a cutoff score of 11 (Eaves, Wingert, Ho, and Mickelson, 2006).
Typically-developing controls (TD group) were a subset of those participating in a study of normal development by Bishop et al. (in press). They were selected to be matched on age and nonverbal IQ to the children with SLI. All had nonverbal IQs of 80 or above and passed the hearing screen. The test battery is shown in Table 1, together with the mean scores and gender distribution for the SLI and TD children.
Table 1.
Mean (SD) age and scores on psychometric battery* in relation to age and language status
| Group N male and female1 |
TD-child 6 f 10 m |
TD-teen 6 f 10 m |
SLI-child 5 f 11 m |
SLI-teen 5 f 11 m |
|---|---|---|---|---|
| Age (yr) | 9.75 | 13.80 | 9.82 | 14.06 |
| (1.29) | (1.06) | (1.27) | (1.20) | |
| WASI PIQ | 102.31 | 101.38 | 98.38 | 100.75 |
| (9.86) | (12.15) | (9.22) | (9.35) | |
| N language tests low | 0.31 | 0.19 | 5.00 | 3.63 |
| (0.48) | (0.40) | (2.22) | (1.89) | |
| 1) CCC-2: GCC2 | 75.40 | 71.93 | 36.27 | 37.25 |
| (30.00) | (22.39) | (17.71) | (16.74) | |
| 2) Test for Reception of Grammar3 | 99.07 | 104.43 | 79.81 | 94.38 |
| (11.44) | (6.30) | (11.59) | (11.10) | |
| 3) ERRNI story telling3 | 98.53 | 99.21 | 90.00 | 100.36 |
| (11.03) | (10.28) | (12.50) | (13.64) | |
| 4) ERRNI story recall3 | 101.60 | 100.43 | 86.13 | 101.00 |
| (13.49) | (10.08) | (18.54) | (15.54) | |
| 5) ERRNI forgetting3 | 103.53 | 101.07 | 91.44 | 96.71 |
| (12.97) | (7.55) | (17.91) | (16.55) | |
| 6) ERRNI comprehension3 | 105.13 | 108.57 | 86.31 | 102.57 |
| (9.13) | (14.93) | (21.97) | (16.18) | |
| 7) ERRNI MLU3 | 100.00 | 95.29 | 91.07 | 94.71 |
| (9.18) | (10.58) | (10.92) | (24.36) | |
| 8) NEPSY nonword repetition4,5 | 9.67 | 10.86 | 5.44 | 6.31 |
| (3.60) | (3.70) | (3.54) | (3.57) | |
| 9) NEPSY sentence repetition4 | 9.93 | 12.21 | 4.50 | 5.94 |
| (3.37) | (2.08) | (1.86) | (3.70) | |
| 10) TOWRE sight words3 | 101.47 | 98.75 | 87.38 | 79.38 |
| (11.69) | (13.23) | (14.36) | (13.41) | |
| 11) TOWRE phonetic decoding3 | 106.27 | 108.13 | 83.56 | 77.06 |
| (10.50) | (9.65) | (14.07) | (12.91) |
Test battery: Wechsler, D. (1999). Wechsler Abbreviated Scale of Intelligence (WASI), San Antonio: Psychological Corporation; Bishop, D. V. M. (2003). The Children's Communication Checklist, version 2 (CCC-2), London: Pearson; Bishop, D. V. M. (2003). The Test for Reception of Grammar, version 2 (TROG-2), London: Pearson. Bishop, D. V. M. (2004), Expression, Reception and Recall of Narrative Instrument (ERRNI). London: Psychological Corporation; Korkman, M., Kirk, U., & Kemp, S. I. (1998). NEPSY: A developmental neuropsychological assessment, San Antonio: Psychological Corporation; Torgesen, J. K., Wagner, R., & Rashotte, C. (1999). Test of Word Reading Efficiency (TOWRE), New York: Psychological Corporation.
Missing data:seven cases on CCC-2, three on TROG-2, five on ERRNI, three on nonword repetition, three on sentence repetition, and one on TOWRE
General Communication Composite: normative mean = 80
Scaled with mean 100, SD 15
Scaled with mean 10, SD 3
Relative to own norms
The study was approved by the Oxford Psychiatric Research Ethics Committee; parents of all participants gave written informed consent, and the children themselves gave assent after the study was explained in age-appropriate language.
Psychoacoustic assessment of frequency discrimination
The psychoacoustic procedure used a game-like computerised task with AXB three-alternative two-interval format, with the child hearing three tones on each trial, and being asked to select the tone (first or third) that differed from the middle one. The standard stimulus was a 1000 Hz tone of 100 ms duration. The first or third tone was either another standard stimulus, or a deviant stimulus of higher frequency. Frequency was adjusted adaptively depending on the child's responses to converge on a threshold corresponding to 75% correct. In general, two runs were given, unless a very low threshold (10 Hz or less) was achieved on the first run. Thresholds (in Hz difference from the standard) were converted to natural logs, and the better of the two estimates was taken for analysis.
Electrophysiological assessment of mismatch negativity
Stimuli
Stimuli are described in detail by Bishop et al. (in press). Tone stimuli consisted of a standard 1000 Hz tone and deviants of 1200 Hz (large deviant) and 1030 Hz (small deviant). Speech stimuli consisted of the syllables [ba] (standard), [bi] (large deviant) and [da] (small deviant). Natural speech recordings were used, with intonation contours equated using Praat (Boersma and Weenink, 2005) and RMS equalization carried out with GoldWave (Craig, 2008). Stimuli had durations of 175 ms, windowed at 15 ms, and were presented monaurally to the right ear at 86.5 dB SPL through sound-attenuating Sennheiser HD25-1 headphones.
Procedure
Standards were presented on 70% of trials, with each deviant occurring on 15% of trials in a quasi-random sequence, avoiding occurrence of two deviants in succession. Stimulus onset asynchrony was 1 s. There were two blocks each of 333 trials, making a total of 466 standards, and 100 of each deviant type. Participants were seated in a comfortable upright chair in a sound-attenuated electrically-shielded booth. To help them ignore the stimuli, participants played Gameboy, watched a silent DVD or video on a small TV screen located at a distance of 1.3 m away.
EEG recording and data analysis
The EEG was recorded on a SynAmps or NuAmps NeuroScan Inc. system using Ag/AgCl sintered electrodes and a water-soluble conductive gel. Early pilot studies indicated no difference in the results obtained from the two recording systems, and the proportions of children from each group tested on each system did not differ significantly. An electrode cap was fitted to record from 28 sites: FC1, F7, FP1, FZ, FP2, F8, FC2, FT9, FC5, F3, FCZ, F4, FC6, FT10, T7, C3, CZ, C4, T8, CP5, P7, P3, PZ, P4, P8, CP6, M1, and M2. M1 or M2 was selected as reference electrode and ground was placed at AFZ. Electro-oculograms (EOG) were recorded from supra- and infra-orbital electrodes on the left eye and also from electrodes placed lateral to the left and right eyes. Impedances for all electrodes were kept below 8 kΩ. The EEG was recorded continuously on-line and stored for off-line processing. EEG data were digitised at 500 Hz and band-pass filtered (0.01 – 70 Hz for SynAmps; 0.1 – 70 Hz for NuAmps) and a 50 Hz notch filter was employed.
Offline analysis
Data processing is described in detail by Bishop et al. (in press). This was done for each individual participant separately for tone and speech stimuli and included:
Epoching into trials of 1000 ms duration, including 200 ms baseline. Re-referencing to average mastoids. Rejection of trials with extreme amplitudes (+/− 350 μv) was done to remove noisy sections of record, while retaining blinks;
Artefact reduction, using independent component analysis (ICA) to identify unwanted components such as blinks, which were mathematically subtracted from the data. Further artefact rejection was then applied with cutoff +/− 150 μv;
Spatial principal components analysis, treating channel amplitudes at each time point in the averaged waveform (all trials, standard and deviant) as a new set of observations. Weights from the first component were then used to create a new channel consisting of the weighted average of all channels, with polarity set to be consistent with electrode FZ;
Creation of difference waveforms for single-trial analysis. For each stimulus type (tones or speech) this involved categorising four types of trial; large deviants, small deviants, dummy deviants (i.e., standards preceding deviants), and all other standards. This latter set was averaged to give a mean standard response which was then subtracted from all the other trials. We thus had three types of difference waveforms: one for each deviant, and one ‘dummy’ set formed by subtracting each standard-before-deviant from the average of other standards. These waveforms represent the specific activity associated with mismatch, after removing the ERP common to standards and deviants;
Conventional analysis of mean amplitudes for each type of difference waveform, using time windows of 100-250 ms for the MMN and 350-550 ms for the LDN;
Time-frequency analysis of single trials for each type of difference waveform using EEGlab software (DeLorme and Makeig, 2004) to obtain two spectrally-based measures. The method for frequency extraction was FFT with Hanning window tapering, and pad ratio set to 2. This gave 200 estimates with subwindow centres ranging from −135.9 to 735.9 ms, and window size of 128 ms. Inter-trial coherence (ITC) ranges from 0 to 1 and corresponds to the extent to which individual trials have oscillations in phase at a given frequency at a given time point, i.e. the degree of temporal alignment of auditory responses. Event-related spectral perturbation (ERSP) is a complementary measure, which estimates power at a given frequency in the signal.
Results
Frequency discrimination thresholds
Mean frequency discrimination limens (in natural log units) and the results from a two-way ANOVA (age band × SLI status) are shown in Table 2, indicating a main effect of age band (teenagers have lower thresholds than children) and an interaction between age band and SLI status. Contrasts conducted using t-tests indicated that the SLI-child and TD-child groups did not differ significantly (t= 0.87, d.f. = 30, p = .389), whereas theSLI-teen group did worse than the TD-teen group (t = 2.9, d.f. = 28, p = .007). The Hz equivalents of the thresholds are also shown in Table 2, and indicate that the mean thresholds for the TD-child group was just above 30 Hz, the frequency difference used for the smaller deviant in the MMN study.
Table 2.
Mean (SD) frequency discrimination (FD) limen (log units) in relation to age and language status1.
| TD-child | SLI-child | TD-teen | SLI-teen | |
|---|---|---|---|---|
|
| ||||
| N | 16 | 16 | 152 | 152 |
| FD limen (Ln Hz) | 3.78 | 3.32 | 2.01 | 3.15 |
| (1.34) | (1.63) | (0.88) | (1.24) | |
| Mean Hz equivalent | 44.0 | 27.7 | 7.47 | 23.4 |
Main effect of SLI status, F (1, 58) = 1.04, p = .311, partial η2 = .018 Main effect of age band, F (1, 58) = 8.52, p = .005, partial η2 = .128 Interaction, SLI status x age band, F (1, 58) = 5.81, p = .019, partial η2 = .091.
Data missing for one TD-teen and one SLI-teen because of equipment failure or scheduling problems
Electrophysiological responses to standards and deviants
The number of deviant trials included per condition did not differ by SLI status, but teenagers had slightly more included trials: means were 83.6 (SD = 9.90) for TD-teen, 89.2 (SD = 7.78) for TD-teen, 86.9 (SD = 9.46) for SLI-child and 92.1 (SD = 7.27) for SLI-teen. All participants had at least 65 deviant trials per condition. See Supplemental material, Table S1, for ANOVA.
To simplify data analysis, we followed the procedure of Bishop et al (in press) and used the first spatial principal component to represent the auditory ERP. This avoids the need for analysing multiple electrodes, while ensuring that all sites contributing to the auditory ERP are represented. The mean weights from each channel contributing to the principal component for each stimulus type and group are shown in head-plots in Supplemental Material, Figure S1, and indicate a similar fronto-central distribution in all four groups. Note that amplitudes of responses measured this way are substantially larger than those seen when a single electrode is considered.
Figure 1 shows the responses to standards and deviants based on the spatial principal component. Consistent with previous literature, there are striking differences between the waveforms for the children and teenagers. The N1 response is missing in the TD-child and SLI-child groups, as is usually found for children when stimuli are presented with SOA of less than 1.4 s (Ceponiené, Cheour, and Näätänen, 1998). The teenagers show a more adult-like pattern, with smaller amplitudes overall, and a developing N1-P2 complex. Although there are some differences in the amplitude of waveforms for TD vs SLI groups, the overall shape is similar at each age band.
Figure 1.
Average amplitude at principal component by age-group and stimulus for TD (black line) and SLI (gray line). The N1 response is missing in the TD-child and SLI-child groups, whereas teenagers show a developing N1-P2 complex.
Figure 2 shows the difference waveforms for each group. Dummy waveforms, created by subtracting the average for standards from each standard-before-deviant, are shown in addition to mismatch waveforms for deviants; these give an indication of how much amplitude variation is to be expected by chance. To identify regions where the averaged difference waveform for all participants fell significantly below zero, one-sample t-tests were conducted at each time-point. T-values less than −1.96 are shown below the waveforms in Figure 2. As noted by Guthrie and Buchwald (1991), significance levels of t-values are misleading with this approach, because time points are not independent of one another, and a difference is only reliable when a sequence of t-values is significant, with the length of sequence depending on the autocorrelation of the waveform. In our study, the likelihood of obtaining spurious differences could be directly estimated by considering the dummy difference waveforms, where the same stimulus acts standard and deviant.
Figure 2.
Average mismatch responses from principal component by stimulus type and group, for dummy difference files (gray), large deviants (red) and small deviants (blue). Regions where t-test value is < −1.96 are shown as bars of same color below the plot. In the MMN interval (100-250 ms post-onset) the mismatch to speech stimuli is significantly greater than to tones, and mismatch to large deviants is greater than to small deviants, but group and age have no effect. For the LDN interval (350-550 ms) mismatch is larger for the smaller deviant, this effect being significantly greater for TD than SLI participants.
Following Bishop et al (in press), we estimated the size of mismatch response by taking mean amplitude of the difference waveform in two time windows, 100-250 ms for the MMN and 350-550 ms for the LDN. For each time window, repeated measures ANOVA was used with two within-subject factors: stimulus type (tones or speech), and deviant size (large or small), and two between-subject factors: SLI status (2 levels) and age band (2 levels). For the MMN, there were no main effects of the between subjects factors of age and SLI status, and no significant interactions with these factors. Main effects of stimulus type and deviant were significant, with MMN to speech stimuli being substantially larger than to tones, F (1, 60) = 7.75, p = .007, η2 = .114, and MMN to large deviants being greater than to small deviants, F (1, 60) = 5.34, p = .024, η2 = .082. Full ANOVA output is shown in Supplemental materials, Table S2. ANOVA on peak latency showed no main effects or interactions, other than a main effect of stimulus type, and is shown in Supplemental materials, Table S3.
A parallel analysis was conducted for the LDN interval, again showing significant main effects of stimulus type and deviant size, but this time the latter effect reflected a larger LDN to the smaller deviant, an effect previously reported by Bishop et al (in press), who included a subset of the current TD sample in their analysis. Neither age band nor SLI status gave a significant main effect (see Supplemental material, Table S4 for full output), but there was a significant interaction between SLI status and deviant size, F (1, 60) = 4.48, p = .038, η2 = .070. This was followed up using t-tests. The mean LDN to the large deviant was −1.63 (SD = 5.50) for TD participants and −1.43 (SD = 5.17) for SLI participants, a non-significant difference, t (62) = 0.15, p = .878. For the small deviants, the mean LDN for TD participants was −5.69 (SD = 4.36), whereas that for SLI participants was −2.37 (SD = 5.44), a significant difference, t (62) = 2.69, p = .009. Thus the analysis confirmed the impression seen in Figure 2 that there is a greater LDN for TD than for SLI participants, for the small deviant only.
Time-frequency analysis
Figure 3 shows the mean levels of ITC for mismatch waveforms in frequencies ranging from 1 to 20 Hz for the four groups. This indicates the extent to which individual mismatch waveforms are synchronised in phase at different frequencies. One can see in all groups the phenomenon described by Bishop et al (in press) whereby the MMN region is characterised by a burst of increased ITC that is particularly marked in frequencies below 10 Hz. Visual inspection does not, however, suggest any substantial differences between SLI and TD participants, and this was confirmed by ANOVA on the mean ITC value in the theta frequency band, which found no main effect or interaction for SLI status (see Supplemental Material, Table S5).
Figure 3.
ITC for difference waveforms in relation to stimulus and group, with colors indicating range from zero, i.e. no synchronization (green) to 0.2 (deep red). The MMN region is characterised by a burst of increased ITC below 10 Hz, which is similar for TD and SLI groups.
Figure 4 shows corresponding data for the ERSP, which measures the amount of power in the difference waveform at different timepoints, relative to power in the baseline. A striking and hitherto unremarked phenomenon can be seen in the TD plots, namely a decrease in power across a broad frequency band in the later half of the trial (blue denoting power below baseline levels). This was quantified by taking mean ERSP in the theta band (4-7 Hz) for the second half of the trial, from 300-600 ms. ANOVA revealed that this index gave a significant main effect of SLI status, F (1, 60) = 9.67, p = .003, η2 = .139, whereas the effect of age was close to zero and nonsignificant (see Supplemental material Table S6 for full ANOVA). The only other significant effects were a two-way interaction between SLI status and deviant size, and a three-way interaction between SLI status, age band, and deviant size. This latter interaction reflects the phenomenon apparent in Figure 4, whereby a small deviant is associated with a large decrease in power in TD participants, but an increase in power in SLI participants, with the effect especially prominent in older children. An ANOVA was run to check whether this might arise from a difference in baseline power (see Supplemental Material,Table S6); this revealed a large effect of age, but no main effect of language status, and no interaction of language status with age, stimulus type, or deviant. Supplemental material Figure S2 shows the mean log power in different frequency bands without baseline correction.
Figure 4.
ERSP for difference waveforms in relation to stimulus and group, with colors indicating range from −.75 (deep blue = power decrease) through zero (green) to .75 (deep red = power increase). The TD plots show a decrease in power (event-related desynchronisation) across a broad frequency band in the later half of the trial.
Relationship between indices of mismatch responses and behavioral measures
We inspected correlations between mismatch indices and both frequency discrimination and nonword repetition, with the expectation that mismatch responses to tones might be correlated with frequency discrimination, and those to speech would correlate with nonword repetition. To reduce the number of correlations considered, focus was restricted to the small deviants, giving 16 correlations, and Bonferroni-adjusted p-value of .003. As can be seen in Table 3, we did not replicate the earlier finding by Bishop et al (in press) of a correlation between frequency discrimination and early ITC, and none of the other mismatch indices for tones related to FD threshold. However, there was a significant negative correlation between the ERSP to syllables in the later time interval and nonword repetition score (r = −.4, N = 61, p = .001). This indicates that participants who are poor at nonword repetition fail to show a drop in power corresponding to event-related desynchronization. This correlation was recomputed separately for TD and SLI groups to see whether it was driven solely by the TD/SLI group differences. Within the TD group, it was non-significant (r = −.056, N = 29, ns), but it remained significant within the SLI group (r = −.395, N = 32, p = .032): see Figure S3, Supplemental material, for the scatterplot.
Table 3.
Correlations1 between mismatch indices and behavioral measures
| Small deviant mismatch indices | Frequency discrimination skill2 N = 62 |
Nonword repetition N = 61 |
|---|---|---|
| Tones MMN mean amplitude | .024 | .046 |
| Tones LDN mean amplitude | −.044 | −.018 |
| Tones ITC 0-300 ms, 4-7 Hz | .114 | 0 |
| Tones ERSP 300-600 ms, 4-7 Hz | .099 | −.217 |
| Syllables MMN mean amplitude | −.218 | .007 |
| Syllables LDN mean amplitude | .243 | −.094 |
| Syllables ITC 0-300 ms, 4-7 Hz | .010 | .113 |
| Syllables ERSP 300-600 ms, 4-7 Hz | −.138 | −.400 |
Correlation of .37 or more is significant at Bonferroni-corrected level of .05/16
Thresholds inverted so that a positive correlation means good skill corresponds to positive mismatch index
Discussion
Overview of findings on SLI
Data from this study help resolve some of the inconsistent findings in this field. Previous studies have disagreed as to whether children with SLI have reduced mismatch responses to auditory signals, and if so, whether they are confined to speech sounds (Bishop, 2007). We found abnormal responses in SLI, but only for the LDN time range, not the earlier MMN. Some researchers have blurred the distinction between these responses, labelling them as ‘early’ and ‘late’ MMN. However, they appear to reflect sequential stages in processing an auditory signal, and show different developmental courses, and opposite effects of size of difference between standard and deviant. One might wonder whether the lack of group differences on the MMN reflects the fact that this component is much smaller in children than adults, when quantified as mean amplitude in the time window 100-250 ms. However, no group difference is seen when the MMN is assessed using ITC, a measure of phase coherence across trials. This gives a more sensitive measure of MMN in children than averaged mean amplitude, because it is unaffected by the polarity of a response, and so will reflect positive as well as negative mismatch responses. As can be seen in Figure 2, a positive mismatch response is evident after an initial small negativity, and the ITC is significant during this period. This effect, however, is at least as strong in those with SLI as in the TD group. We therefore conclude that the initial brain response indicating detection of mismatch between memory of the standard and an incoming auditory signal is intact in SLI for both speech and nonspeech signals. This suggests the ‘perceptual anchor’ theory does not apply to SLI: this theory, which was proposed for dyslexia, maintains there is difficulty in forming a long-term representation of a repeating standard in memory (Ahissar et al., 2006). A difference between SLI and TD groups is, however, apparent in the later interval corresponding to the LDN. We discuss the significance of this finding further below.
The lack of significant correlations between indices of mismatch responses to small frequency deviants and FD thresholds in this study is puzzling, in view of a significant association between early-interval ITC and FD thresholds found by Bishop et al (in press) in a larger normative sample that covered the whole age range from children to adults, and included a subset of the current TD sample. There are two possibilities. First, the finding of raised FD thresholds in the SLI-teen group may have different causes from the abnormal LDN to the same stimuli seen in SLI individuals of all ages. For instance, the behavioral task has attentional and memory demands which may be important. A more prosaic explanation is in terms of low reliability of electrophysiological indices of mismatch responses in individuals. Bishop and Hardiman (2010) demonstrated good reliability of mismatch indices in adults when 300 deviant responses were available, but this declined when a smaller number of trials was used. We had around 85-90 trials per condition on average in this study, and it is likely that this may be sufficient to demonstrate clear effects at the group level, but too few for reliable estimates in individuals. The current study did, however, reveal one significant correlation between behavior and electrophysiological response, namely the negative correlation between ERSP to small speech deviants in the later part of the epoch and performance on a nonword repetition task. This observation merits further investigation, given that impaired nonword repetition is one of the most reliable markers of heritable forms of SLI (Bishop, North, and Donlan, 1996).
Implications for our understanding of the nature of mismatch responses in the auditory ERP
Time-frequency analysis in this study provided new insights into the mechanisms underlying generation of the MMN and LDN, emphasising the difference between these mismatch responses. MMN appears to arise from an enhancement of phase synchronisation in the theta frequency band (Bishop and Hardiman, 2010; Bishop et al, in press). In typically-developing children, we found that LDN is accompanied by a reduction of power in a relatively wide frequency band encompassing delta, theta and low alpha. Pfurtscheller and Lopes da Silva (1999) termed a decrease in power in a specific frequency band as ‘event-related desynchronization’ (ERD), and proposed it reflected a reduction in synchrony of activity an underlying neuronal population. Most studies of ERD have involved memory or motor tasks, and have focused on alpha or beta frequency bands. An exception is a study by Edwards et al. (2009), who measured electrophysiological responses from the cortical surface to auditory speech stimuli in nine adult patients who were being evaluated prior to neurosurgery. They found a reduction of power at 7-14 Hz relative to baseline (ERD) in the range 200-400 ms post-onset, which was correlated with a prior enhancement of power in the gamma band at 50-200 ms post-onset. Although they used an oddball paradigm, their analysis focused on responses to standard and deviant stimuli, rather than difference waveforms, and so is not directly comparable to ours, but it suggests that the lower-frequency ERD that we observe in the later portion of the difference waveforms is an exaggeration of the normal brain response to auditory stimuli.
The physiological basis of ERD is still poorly understood, but Pfurtscheller and Lopes da Silva (1999) noted that it is influenced by interconnectivity in neuronal circuits. One aspect of neural maturation that develops throughout childhood is greater specificity in synaptic connectivity as non-functional connections are pruned. This underlies the development of more focal representations of functions in the brain. High spectral power is seen when there is a large number of coherently activated neurons; thus the ERSP profile of typical children could indicate a transition from an early stage of detection of a deviant stimulus by a large brain region (indexed by an assembly of neurons responding in phase to give high ITC) to a later stage of stimulus identification requiring more focal brain activity, indexed by a decrease in ERSP below baseline levels. According to this model, the fact that ERD is reduced or absent in children with SLI could indicate that they have less specialised brain maps for differentiation of auditory stimuli. Identification of a change from standard to deviant would lead to altered oscillatory activity in a smaller region of cortex in TD children than in those with SLI, and hence correspond to more event-related desynchronisation as activity becomes more focal. This highly speculative account has features in common with Tallal's (2004) proposals about coarse neural coding of auditory stimuli in SLI, but it is more general, in that it is not restricted to brief or rapid stimuli. Tallal's theory has been extended to account for developmental dyslexia as well as SLI, and it will be of considerable interest to see whether time-frequency analysis can reveal similar patterns of response in children who have milder difficulties that predominantly affect written language.
Implications for a maturational lag account of SLI
Hautus et al (2003), McArthur and Bishop (2004), and Wright and Zecker (2004) proposed that that auditory perceptual difficulties seen in some children with developmental speech-language disorders may correspond to a maturational lag rather than a static deficit. For instance, if there were little change in ability on a task from age 5 to 10 yrs followed by an improvement in adolescence, then a 10-year-old with a 4-year lag in performance might appear unimpaired in the pre-teen years, but a deficit would become apparent in adolescence. The pattern of results on the behavioral FD task obtained here fits that picture, and if we had only behavioral data to rely on, the maturational lag account would seem the best explanation. The electrophysiological data, however, tell a different story. There are clear developmental changes in mismatch responses (Bishop et al, in press), but children with SLI do not appear to be immature. Rather, they show atypical mismatch responses, regardless of age, the most striking feature of which is a lack of the usual power reduction at lower frequencies in the second half of the trial.
In conclusion, the current study illustrates the value of analysing oscillatory aspects of ERP data alongside more traditional averaging approaches. Key oscillatory changes are often invisible in the averaged signal, yet can be uncovered by single trial analysis. There is growing interest in using this approach to study neurophysiological maturation (Shahin et al, 2010). Such an approach is feasible for analysing mismatch responses, where single trial difference waveforms can be created by subtraction of the averaged standard response from each trial, and, as we have shown, has promise for resolving some long-standing debates about the underlying causes of neurodevelopmental disorders, and throwing light on the biological bases of language impairments.
Supplementary Material
Acknowledgments
The project was funded by Wellcome Trust Programme Grant 053335. We thank Kathy White, Liz Line, Ifat Yasin, and Margriet Groen for assistance with data collection. We gratefully acknowledge the help of the many schools and families who participated in the research. Without their support and enthusiasm this work would not have been possible.
References
- Ahissar M, Lubin Y, Putter-Katz H, Banai K. Dyslexia and the failure to form a perceptual anchor. Nat Neurosci. 2006;9:1558–1564. doi: 10.1038/nn1800. [DOI] [PubMed] [Google Scholar]
- Bailey PJ, Snowling MJ. Auditory processing and the development of language and literacy. Brit Med Bull. 2003;63:135–146. doi: 10.1093/bmb/63.1.135. [DOI] [PubMed] [Google Scholar]
- Bishop DVM. Using mismatch negativity to study central auditory processing in developmental language and literacy impairments: where are we, and where should we be going? Psychol Bull. 2007;133:651–672. doi: 10.1037/0033-2909.133.4.651. [DOI] [PubMed] [Google Scholar]
- Bishop DVM, Hardiman MJ. Measurement of mismatch negativity in individuals: a study using single-trial analysis. Psychophysiol. 2010;47:697–705. doi: 10.1111/j.1469-8986.2009.00970.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bishop DVM, Hardiman MJ, Barry JG. Is auditory discrimination mature by middle childhood? A study using time-frequency analysis of mismatch responses from 7 years to adulthood. Dev Sci. doi: 10.1111/j.1467-7687.2010.00990.x. in press. doi/10.1111/j.1467-7687.2010.00990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bishop DVM, North T, Donlan C. Nonword repetition as a behavioural marker for inherited language impairment: evidence from a twin study. J Child Psychol Psychiatry. 1996;37:391–403. doi: 10.1111/j.1469-7610.1996.tb01420.x. [DOI] [PubMed] [Google Scholar]
- Bishop DVM, Snowling MJ. Developmental dyslexia and Specific Language Impairment: Same or different? Psychol Bull. 2004;130:858–886. doi: 10.1037/0033-2909.130.6.858. [DOI] [PubMed] [Google Scholar]
- Boersma P, Weenink D. Praat: A system for doing phonetics by computer. Glot Internat. 2005;5(9/10):341–345. [Google Scholar]
- Ceponiené R, Cheour M, Näätänen R. Interstimulus interval and auditory event-related potentials in children: Evidence for multiple generators. Electroen Clin Neuro. 1998;108:345–354. doi: 10.1016/s0168-5597(97)00081-6. [DOI] [PubMed] [Google Scholar]
- Cheour M, Leppänen PH, Kraus N. Mismatch negativity (MMN) as a tool for investigating auditory discrimination and sensory memory in infants and children. Clin Neurophysiol. 2000;111:4–16. doi: 10.1016/s1388-2457(99)00191-1. [DOI] [PubMed] [Google Scholar]
- Cheour M, Korpilahti P, Martynova O, Lang AH. Mismatch negativity and late discriminative negativity in investigating speech perception and learning in children and infants. Audiol Neuro-Otol. 2001;6:2–11. doi: 10.1159/000046804. [DOI] [PubMed] [Google Scholar]
- Craig C. GoldWave (Version 5.23) [Sound Editing Program] GoldWave Inc.; St Johns: 2008. [Google Scholar]
- Delorme A, Makeig S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis ( sccn.ucsd.edu/eeglab/) J Neurosci Meth. 2004;134:9–21. doi: 10.1016/j.jneumeth.2003.10.009. [DOI] [PubMed] [Google Scholar]
- Eaves L, Wingert H, Ho H, Mickelson E. Screening for autism spectrum disorders with the social communication questionnaire. J Dev Behav Pediatr. 2006;(2 Suppl):S95–S103. doi: 10.1097/00004703-200604002-00007. [DOI] [PubMed] [Google Scholar]
- Edwards E, Soltani M, Kim W, Dalal SS, Nagarajan SS, Berger MS, Knight RT. Comparison of time-frequency responses and the event-related potential to auditory speech stimuli in human cortex. J Neurophysiol. 2009;102:377–386. doi: 10.1152/jn.90954.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guthrie D, Buchwald J. Significance testing of difference potentials. Psychophysiol. 1991;28:240–244. doi: 10.1111/j.1469-8986.1991.tb00417.x. [DOI] [PubMed] [Google Scholar]
- Hautus MJ, Setchell GJ, Waldie KE, Kirk IJ. Age-related improvements in auditory temporal resolution in reading-impaired children. 2003;9:37–45. doi: 10.1002/dys.234. [DOI] [PubMed] [Google Scholar]
- McArthur GM, Bishop DVM. Which people with specific language impairment have auditory processing deficits? Cog Neuropsychol. 2004;21:79–94. doi: 10.1080/02643290342000087. [DOI] [PubMed] [Google Scholar]
- Pfurtscheller G, Lopes da Silva FH. Event-related EEG/MEG synchronization and desynchronization: basic principles. Clin Neurophysiol. 1999;110:1842–1857. doi: 10.1016/s1388-2457(99)00141-8. [DOI] [PubMed] [Google Scholar]
- Rutter M, Bailey A, Lord C. Social Communication Questionnaire (SCQ) Western Psychological Services; Los Angeles, CA: 2003. [Google Scholar]
- Shahin AJ, Trainor LJ, Roberts LE, Backer KC, Miller LM. Development of auditory phase-locked activity for music sounds. J Neurophysiol. 2010;103:218–229. doi: 10.1152/jn.00402.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tallal P. Improving language and literacy is a matter of time. Nat Rev Neurosci. 2004;5:721–728. doi: 10.1038/nrn1499. [DOI] [PubMed] [Google Scholar]
- Wright BA, Zecker SG. Learning problems, delayed development, and puberty. P Natl Acad Sci. 2004;101:9942–9946. doi: 10.1073/pnas.0401825101. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




