Abstract
There is mounting evidence for predictive coding theory from computational, neuroimaging, and psychological research. However, there remains a lack of research exploring how predictive brain function develops across childhood. To address this gap, we used pediatric magnetoencephalography to record the evoked magnetic fields of 18 younger children (M = 4.1 years) and 19 older children (M = 6.2 years) as they listened to a 12-min auditory oddball paradigm. For each child, we computed a mismatch field “MMF”: an electrophysiological component that is widely interpreted as a neural signature of predictive coding. At the sensor level, the older children showed significantly larger MMF amplitudes relative to the younger children. At the source level, the older children showed a significantly larger MMF amplitude in the right inferior frontal gyrus relative to the younger children, P < 0.05. No differences were found in 2 other key regions (right primary auditory cortex and right superior temporal gyrus) thought to be involved in mismatch generation. These findings support the idea that predictive brain function develops during childhood, with increasing involvement of the frontal cortex in response to prediction errors. These findings contribute to a deeper understanding of the brain function underpinning child cognitive development.
Keywords: Magnetoencephalography, Auditory, Mismatch field, Child
Introduction
Under predictive coding theory, the brain houses an internal, probabilistic, generative model which represents the statistical structure of the external world (Rao and Ballard 1999; Friston and Kiebel 2009; Hohwy 2013; Clark 2015). The brain uses this model to generate predictions about the most likely causes of incoming sensory signals, and tests its model by comparing its top-down prior predictions with bottom-up sensory signals from the world (Hohwy 2012). Predictions and sensory signals can be represented as probability distributions whereby the difference between the two represents the model’s “prediction error.” Prediction error is a useful learning signal as it indicates which sensory information the model failed to predict and hence, which information requires further processing at higher levels of the neural hierarchy (Dołęga and Dewhurst 2021). To reduce processing requirements, the brain strives to minimize prediction errors over time (Clark 2013).
An account of how children come to know about the world falls naturally from predictive coding principles (Scholl 2004; Gopnik 2012; Badcock et al. 2019). According to this view, infants are thought to begin life with a set of innately specified priors, rooted in the genes (Kersten and Yuille 2003). Following environmental stimulation, “innate priors” are updated through “perceptual inference” (Hohwy 2012) and subsequently called “empirical priors” (Kersten and Yuille 2003; Badcock et al. 2019). Thus, predictive coding takes advantage of both nature and nurture perspectives, allowing them to interact within a unifying framework. Iterative model updating should enable children to meet incoming sensory signals with increasingly accurate priors which should, in turn, result in fewer prediction errors, and hence greater confidence in the model’s predictions. Children’s priors may become increasingly precise as they gain experience in the world (Kayhan et al. 2019; Köster et al. 2020). As such, the posterior distribution (i.e. the perceptual experience; Hohwy 2012) would be gradually biased toward the increasingly narrow prior distribution (representing the brain’s stored knowledge), and away from the sensory signal distribution (Lucas et al. 2014).
Rapid model maturation across childhood may be underpinned by concurrent and significant neurophysiological changes. Indeed, the brain roughly quadruples in weight before age 6, by which time it has reached approximately 90% of its adult volume (Brown and Jernigan 2012). Furthermore, the efficiency of neuronal communication increases due to a prolonged period of synaptic pruning and myelination during childhood and adolescence (Blakemore and Choudhury 2006; Fischer and Bidell 2007; Santos and Noggle 2011). In particular, the protracted maturation of the prefrontal cortex (Huttenlocher and Dabholkar 1997)—a region thought to play a key role in extracting complex statistical regularities from incoming sensory signals (Basirat et al. 2014; Dürschmid et al. 2016; Jaffe-Dax et al. 2020)—may support increasingly sophisticated predictive brain function across development (Krogh et al. 2013). While a predictive coding account of neurocognitive development is supported by an extensive body of behavioral evidence (Gopnik and Wellman 2012; Köster et al. 2020), available neural evidence (Emberson et al. 2015, 2017, 2019a; Jaffe-Dax et al. 2020; Zhang and Emberson 2020) is in short supply —partly due to practical challenges associated with conducting functional brain recordings with young children (Barkovich et al. 2019). We sought to address this gap in the literature.
A popular method for testing predictive coding is to use electroencephalography (EEG) and/or magnetoencephalography (MEG) to record participants’ brain responses as they listen to an auditory oddball paradigm (Friston 2005; Heilbron and Chait 2018). These paradigms are comprised of high-probability “standard” and lower probability “deviant” stimuli (e.g. pure tones). Averaged evoked responses to the “standards” and “deviants” typically diverge between 0.1 and 0.25 s following stimulus onset, with the “deviant” showing a larger amplitude relative to the “standard” waveform (Näätänen et al. 2019b). This divergence—conventionally presented as a difference waveform—is the “mismatch negativity” (MMN; EEG literature) or the “mismatch field” (MMF; MEG literature; hereafter, “MMN/F” will be used when referring to both the MMN and MMF).
The MMN/F has been interpreted as a neural index of prediction error (Friston 2005), whereby the larger the evoked response amplitude, the larger the corresponding prediction error signal. With significant neurophysiological maturation during childhood (Huttenlocher and Dabholkar 1997; Blakemore and Choudhury 2006; Brown and Jernigan 2012), children’s brains may become increasingly proficient at extracting statistical regularities from the auditory input (Emberson et al. 2019b; Raviv and Arnon 2018). Consequently, older children may be able to form relatively precise priors of the upcoming stimuli, whereby the strongest prediction would be for the presentation of a high-probability “standard.” Thus, the presentation of a lower probability “deviant” may evoke a larger prediction error relative to that of a “standard.” This, in turn, would give rise to a relatively large MMF amplitude. By contrast, if younger participants are less proficient at extracting these statistical regularities, then they may form comparatively less-precise predictions for the upcoming stimuli. Thus, if younger children are less-precisely predicting the presentation of a standard, then the presentation of both high-probability “standards” and lower probability “deviants” may evoke similar degrees of prediction error, resulting in a relatively attenuated MMF.
Overall, one might expect a larger MMF amplitude in older relative to younger children, reflecting maturation of predictive brain function across development. However, previous EEG findings have been mixed, with studies reporting an increase (Oades et al. 1997; Bishop et al. 2011a; Chobert et al. 2014; Putkinen et al. 2014a; Putkinen et al. 2014b; Linnavalli et al. 2018), decrease (Csépe et al. 1992; Kraus et al. 1993a; Kraus et al. 1993b), or no difference (Kraus et al. 1999; Gomot et al. 2000; Shafer et al. 2000, 2010) in the MMN amplitude with age.
Furthermore, it appears that all previous studies employed EEG to investigate MMN maturation. Although MEG and EEG are similar in their millisecond temporal resolution, MEG offers advantages in terms of source-space reconstruction, as MEG magnetic fields are less susceptible to signal smearing and distortion compared with EEG electrical potentials (Baillet 2017). The scarcity of maturational MMF studies is likely due to the lack of availability of MEG systems—and pediatric MEG systems in particular—around the world (Azhari et al. 2020). Given the lack of child MEG-MMF studies, we currently know little about the early maturational changes in activity within the canonical MMN/F network. We sought to address this shortcoming in this study.
Here we aimed to test a predictive coding account of early neurocognitive development. To this end, we used pediatric MEG (Johnson et al. 2010; Rapaport et al. 2019) and a multi-feature auditory oddball paradigm, and measured MMF responses in younger (Mage = 4.1 years, range: 3.1–5.3) and older (Mage = 6.2 years, range: 5.6–6.9) children. We hypothesized that the older-relative-to-younger children would show a larger MMF amplitude, reflecting maturation of predictive brain function across development. Furthermore, we hypothesized that this maturation would be underpinned by increasing involvement of the frontal cortex in responding to prediction errors, as reflected by a significantly larger frontal-MMF in the older children.
Materials and methods
Participants
We recruited 60 children (aged 3 to 6 years) via the Macquarie University “Neuronauts” child research participation database. Of those, 3 children were unable to complete the MEG recording session due to anxiety (n = 2, aged 3.1 and 5.5 years) or falling asleep during the recording (n = 1, aged 3.1 years). Of the 57 collected datasets, 20 (35%) were excluded due to: (a) having a poor head position in relation to the MEG sensor array (n = 6; determined based on visual inspection); (b) excessive in-scanner head motion (based on real-time head motion tracking data) that exceeded an average of 10 mm over the recording (n = 4; Mage = 4.7 years, range: 3.2–5.8); (c) excessive in-scanner head motion (based on pre- and post-recording marker coil measurements as the real-time head motion tracking system was malfunctioning) that exceeded 5 mm (n = 5; Mage = 5.0, range: 3.2–7.0); (d) having more than 2 standard deviations above the mean for the number of interpolated channels (n = 4); and (e) missing triggers (n = 1). Thus, the final sample included 37 children (Mage = 5.2 years, SD = 1.2, range = 3.1–6.9 years; 20 females; 5 ambidextrous, 5 left-handed, based on parent report).
To investigate the (cross-sectional) developmental trajectory of the MMF, we performed a median age split—separating the data groups of younger children (n = 18; Mage = 4.1 years, SD = 0.9, range = 3.1–5.3 years; 8 females; 3 ambidextrous, 3 left-handed) and older children (n = 19; Mage = 6.2 years, SD = 0.4, range = 5.6–6.9 years; 12 females; 2 ambidextrous, 2 left-handed determined based on parent report). Normal hearing thresholds between 500 and 1,500 Hz were confirmed with pure-tone audiometric testing using an Otovation Amplitude T3 series audiometer (Otovation LLC, PA, United States). All children had parent-reported normal or corrected-to-normal vision and had no history of developmental disorders, epilepsy, brain injury, or language or speech impairment, as reported by parents. Children and their parents provided verbal and written informed consent, respectively, before the experiment. All procedures were approved by the Macquarie University Human Research Ethics Committee (reference: 5201600188). Families were paid 40 AUD for their participation and the children received a gift bag and certificate.
Auditory oddball paradigm
Electrophysiological responses were measured as participants listened to a passive multi-feature auditory oddball paradigm (adapted from Näätänen et al. 2004). We chose to use the multi-feature over the traditional oddball paradigm as the former version offers advantages in terms of time efficiency—a quality which is of critical importance when testing young children who are generally less compliant than adults (Lovio et al. 2009). The multi-feature recording time is significantly reduced by presenting standards and deviants in an alternating pattern, as opposed to the traditional paradigm whereby the ratio of deviants to standards is between 1:7 and 1:10 (Petermann et al. 2009). No differences have been found between MMNs elicited by multi-feature compared with traditional oddball paradigms (Näätänen et al. 2004; Pakarinen et al. 2007). Furthermore, the multi-feature paradigm has been shown to reliably evoke mismatch responses in children (Petermann et al. 2009; Putkinen et al. 2012; Putkinen et al. 2014b).
The paradigm consisted of 3 sequential stimulus blocks, each of which ran for approximately 4 min and consisted of 495 stimuli (0.5 s stimulus-onset-asynchrony). Block order was counterbalanced across participants. Each block began with the presentation of 15 “standard” stimuli followed by alternating “standard” and “deviant” stimuli (see Fig. 1 for a schematic diagram). The “standard” stimuli were sinusoidal harmonic tones which were 75 ms in duration (including 5 ms rise and fall times) and presented at a volume of 80 SPL and at a frequency of 550, 1,000, and 1,500 Hz (for each block, respectively). Each “standard” had a presentation probability of P = 0.5 (excluding the first 15 standards in each block).
Fig. 1.

A schematic illustration of the multi-feature auditory oddball paradigm. The first 15 tones are “standards” (S) followed by alternating “standard” and 1 of 4 types of “deviant” (D) tones.
The “deviant” stimuli differed from the “standards” in 1 of 4 acoustic features: (1) frequency (half were 10% higher [partials: 605, 1,100, 1,650 Hz] and half were 10% lower [partials: 495, 900, 1,350 Hz] than the standards), (2) intensity (half were 10% louder and half were 10% softer than the standards), (3) duration (the stimulus was presented for 25 ms with a silent 25 ms gap before the next stimulus), or (4) by having a silent gap in the middle of the stimulus (i.e. 7 ms removed from the middle of the stimulus tone, including 1 ms fall and rise times). Each of the 4 deviant subtypes had a presentation probability of P = 0.125. Deviants were presented in a pseudorandom order such that the same type of deviant could not be presented in succession. The paradigm was programmed and presented in MATLAB (Mathworks, Natick, MA, USA) using Psychtoolbox (Brainard 1997).
MEG Acquisition
Electrophysiological data were acquired using a whole-head, supine, pediatric MEG system (Model PQ1064R-N2m, KIT, Kanazawa, Japan) housed in a magnetically shielded room (MSR; Fujihara Co. Ltd, Tokyo, Japan). The MEG sensor array contained 125 first-order axial gradiometers, each of which had a coil diameter of 15.5 mm and a baseline of 50 mm (see He et al. 2019 for further details). The dewar was designed to fit a maximum head circumference of 53.4 cm, accommodating the heads of more than 90% of 5-year-old Caucasian children (Johnson et al. 2010). All data were acquired at a sampling rate of 1,000 Hz and with an online bandpass filter of 0.03–200 Hz. To maximize the likelihood of obtaining high-quality data, we followed a child-friendly MEG testing protocol (see Rapaport et al. 2019).
Before the MEG recording, participants were fitted with a polyester cap containing 5 head position indicator (HPI) “marker” coils. A digitizer pen (Polhemus Fastrak, Colchester, USA) was used to record the locations of the HPI coils, as well as 3 fiducial points (the nasion and bilateral pre-auricular points) and 300–500 points from the scalp and face. During the MEG recording, participants listened to the multi-feature paradigm while watching a silent video of their choice. The auditory stimuli were presented via a 60 cm2 speaker (Panphonics SSH sound shower, Panphonics) positioned centrally at the foot of the MEG bed. The video was projected onto the MSR ceiling above the dewar. Participants’ head position in relation to the MEG sensor array was continuously monitored using a real-time marker coil tracking system (Oyama et al. 2012). Participants were accompanied by a researcher (and often a caregiver) during the MEG recording. The MEG procedures, including the setup and the recording, took approximately 45 min to complete.
Data pre-processing
Following an inspection of the data, we removed 3 MEG channels that were consistently flat and noisy across participants for more than 10% of the recording (Gross et al. 2013). The following steps were performed using MEG160 (Yokogawa Electric Corporation and Eagle Technology Corporation, Tokyo, Japan). Environmental noise—estimated based on recordings from 3 reference magnetometers—was suppressed using a Time-Shift Principal Component Analysis (TSPCA) algorithm (de Cheveigné and Simon 2007; block width: 10,000 ms, 3 shifts). Data acquired with the real-time marker coil tracking system were used to correct for head motion artifacts (Knösche 2002; realignment conditions: sphere mesh = 321, prune ratio = 0.05). Head motion artifact correction could not be performed for 7 datasets that had missing or erroneous real-time coil tracking data for more than 10% of the recording. These 7 datasets were still included as head movement between the pre- and post-recording marker coil measurements did not exceed 5 mm.
Further pre-processing steps were performed in Matlab 2020a (MathWorths, Inc., Natick, MA, USA) using the Fieldtrip Toolbox (v20200213; Oostenveld et al. 2011). For each participant, the entire recording was high- and low-pass filtered at 0.1 and 40 Hz, respectively (using a onepass-zerophase firws filter with a Blackman window), and band-stop filtered to remove residual 50 Hz power-line contamination and its harmonics. Following visual inspection, segments of the recording containing artifacts (e.g. SQUID jumps and jaw clenches) were removed. Channels that contained a large number of these visually identified artifacts and/or were flat for more than 10% of the recording were interpolated to ensure that all participants had the same number of channels (Medvedovsky et al. 2007; NB. these removals were in addition to the 3 channels already rejected across the entire sample). An independent component analysis (ICA) was used to suppress eye blink artifacts: the raw recordings were high-pass filtered at 1 Hz to improve the ICA performance (Winkler et al. 2015) and then components with scalp distributions that corresponded to eye blinks were removed from the 0.1 Hz, pre-processed data. Up to one independent component was removed per participant.
The continuous data were epoched into segments of 0.5 s (0.1 s pre-, and 0.4 s post-stimulus onset). The first 15 epochs of each stimulus block were excluded from further analysis. Standard and deviant epochs were averaged across, respectively, to compute standard and deviant event-related fields (ERFs). It should be noted that we averaged across all 4 deviant conditions, as: (1) we sought to maximize the statistical power of our comparisons, and (2) we had no theoretical reason to analyze the 4 types of deviants separately. While the deviants differed in terms of their acoustic features, the constant feature that unites them is their pseudorandom occurrence (in contrast to the relatively predictable occurrence of the standards—see Fig. 1). The standard ERF was subtracted from the deviant ERF to produce an MMF.
Sensor-level analysis
The sensor-level analysis involved 2 key steps performed in MNE-Python (Gramfort et al. 2013): (1) identifying a time-window of interest (TOI), and (2) constraining the age-related analysis to this time-window. We used a data-driven approach to identify a TOI. First, we reduced the multi-variate (channel x time) data from all 37 participants into a single time-course—achieved using an “Effect-Matched Spatial (EMS) filtering” approach (Schurger et al. 2013). EMS filtering involves estimating a spatial filter from the data itself, and then projecting the data through the spatial filter. EMS is superior to other data reduction methods (e.g. averaging a contiguous cluster of channels) as it accounts for the spatiotemporal evolution of the experimental effect across time and space, and weights the channels at each timepoint accordingly. EMS filtering was used to reduce the dimensionality of our data, and was orthogonal to our (between groups) hypothesis test.
To avoid circularity in the EMS filtering procedure, we employed 5-fold stratified cross-validation procedure (Huang et al. 2020; Engemann and King 2021). This analysis involved performing the following steps:
The data were normalized using z-scores.
A spatial filter was derived from 80% of the dataset, leaving out a different 20% for each repetition. The spatial filter was computed by subtracting the “standard” from the “deviant” epochs at each timepoint and channel. This produced a set of weightings (i.e. a spatial filter) that reflected the magnitude of the experimental effect (i.e. the MMF) over temporal and spatial dimensions.
The spatial filter was applied to the remaining 20% of the dataset by taking the product of the filter and the data at each time point and for each channel, and summing the results to render a single, spatially filtered time-course for the standard and deviant conditions, respectively.
The time-courses were then averaged across the 37 participants to form a single time-course for the “standard” and “deviant” conditions, respectively. Subsequently, we performed a nonparametric cluster-based permutation analysis (Maris and Oostenveld 2007) on the whole-group EMS filtered data to determine the time-course of the MMF effect. This permutation approach has been shown to adequately control the Type-I error rate for electrophysiological data. The analysis involved conducting one-sample t-tests at each time point to determine whether the MMF amplitude was significantly different from zero. We clustered samples whose t-values fell below a threshold corresponding to an alpha level of 0.05 (based on temporal proximity) and calculated cluster-level test statistics by taking the average of the t-values within each cluster. The data were then permuted 1,000 times, each time randomly shuffling the “standard” and “deviant” condition labels and recomputing the t-values. We constructed a permutation distribution from these random partition t-values. Finally, the significance of each cluster was determined using a threshold Monte-Carlo p-value.
To maximize statistical power, the time window of the significant cluster (identified in the analysis above) was used to constrain the age-related analysis. First, for each participant, we computed a mean MMF amplitude by taking the average of their EMS-filtered data within the TOI. Here the MMF was liberally defined as any significantly larger deviant-relative-to-standard amplitude within the TOI. Our definition stands in contrast to the definition of the classic adult MMN/F, which emerges soon after the second “N1” or “M2” component (Hari and Puce 2017, p. 265; Näätänen et al. 2019a, p. 53) and is visible between 0.1 and 0.25 s following stimulus onset (Näätänen et al. 2019b). We relied on this more liberal definition as the latency of the peak mismatch effect changes across childhood (Näätänen et al. 2019a) and it would therefore have been inappropriate to constrain our analysis to the classic adult MMN time window. It should be noted that prior pediatric auditory oddball studies have likewise used liberally defined time windows to extract MMN effects (e.g. 0.2–0.33 s in 6- to 7-year-olds, Lovio et al. 2009; 0.3–0.5 s in 4- to 12-year-olds, Partanen et al. 2013; 0.15–0.4 s in 5- to 7-year-olds, Petermann et al. 2009; 0.1–0.3 s in 9- to 13-year-olds, Putkinen et al. 2014a; 0.1–0.32 s in 4- to 10-year-olds, Shafer et al. 2000).
We compared mean MMF amplitudes between the younger and older children using a 2-sided permutation independent-samples t-test with 5,000 bootstrap samples. For each p-value, we performed 5,000 reshuffles of the age group labels using the DABEST (Data Analysis with Bootstrap-coupled ESTimation; Ho et al. 2019) open-source libraries for Python. Additionally, we used the Pearson correlation coefficient to investigate the relationship between mean MMF amplitude and age.
Source-level analysis
To investigate how the neural network underpinning the MMF changes across age, we conducted source analysis on 6 predefined regions of interest (ROIs): bilateral primary auditory cortices (A1), bilateral superior temporal gyri (STG), and bilateral inferior frontal gyri (IFG; Garrido et al. 2007, 2008, 2009; Phillips et al. 2015). These ROIs were drawn from the adult literature as this is, to our knowledge, the first pediatric MEG study which has examined the neural sources underpinning MMN/F generation in typically developing children.
As the participants did not have individual structural MRI scans, we used the MRI Estimation for MEG Sourcespace toolbox (MEMES; Seymour 2018; https://github.com/Macquarie-MEG-Research/MEMES) to create surrogate structural MRIs. This procedure uses an Iterative Closest Point algorithm to match the participant’s digitized head shape information to an age-appropriate average MRI template (Neurodevelopmental MRI Database; Richards et al. 2016).
For each participant, the best-fitting template with the lowest objective registration (Gohel et al. 2017) was selected and used to create a cortical mesh and source grid and co-registered with the MEG sensor locations. A forward model (i.e. leadfield) was then computed using the cortical mesh as the volume conductor model. Source reconstruction was performed using a linearly constrained minimum variance (LCMV) beamformer (Van Veen et al. 1997), as implemented in Fieldtrip Toolbox (v20200213; Oostenveld et al. 2011). A spatial filter for each vertex in the source grid was computed using a free dipole orientation (Garrido et al. 2007). This step was performed separately for the “standard” and “deviant” conditions based on the covariance matrix calculated from the data combined across conditions. The spatial filters for all vertices within each ROI were combined into a single spatial filter by weighting voxels within the ROI according to their proximity to a centroid (Brookes et al. 2016). This centroid was defined as the voxel within the ROI that was nearest to all other voxels in the ROI (Douw et al. 2018). The sensor-level data were then right-multiplied by the spatial filters for each condition.
The final 3 analysis steps involved using the nonparametric cluster-based permutation approach (Maris and Oostenveld 2007). First, we used paired-samples t-tests to compare the “standard” and “deviant” amplitudes at each of the ROIs for all 37 children, permuting the condition labels 2,000 times. Second, we used independent-samples t-tests to compare the source-level (i) MMF (ii) standard, and (iii) deviant amplitudes between the groups of younger and older children, permuting the group labels (“younger” and “older”) 2,000 times. The experimental paradigm, data pre-processing and data analysis scripts are freely available on an Open Science Framework repository: https://osf.io/35q8n/.
Results
Sensor-level results
Across all 37 children, we found a significantly larger deviant-relative-to-standard (i.e. MMF) amplitude between 0.19 and 0.49 s following stimulus onset (see Fig. 2). Results from a 2-sided permutation independent-samples t-test (constrained to 0.19–0.49 s) indicated that the mean MMF amplitudes were significantly larger in the older children (n = 19; M = 0.71, SD = 0.45) compared with the younger children (n = 18; M = 0.40, SD = 0.34), P = 0.027, Cohen’s d = 0.77 (95% CI for Cohen’s d: 0.03–1.4; see Fig. 3). A moderate, positive correlation of borderline significance was found between mean MMF amplitude and age (Pearson’s r = 0.33, P = 0.049; see Fig. 4).
Fig. 2.

Top panel: Whole-group EMS filtered MEG data (a.u. = arbitrary units) for the standard (blue) and deviant (red) waveforms (n = 37). Bottom panel: Results from the cluster-based permutation test. The cluster of significant t-statistics is highlighted in red.
Fig. 3.

A Gardner–Altman estimation plot. Mean MMF amplitudes for all participants are plotted by age group on the 2 left-most axes. The effect size is presented as a bootstrap 95% confidence interval on the separate but aligned right-hand-side axis, and is displayed to the right of the data. The mean value of the older group is aligned with the effect size.
Fig. 4.

Scatterplot of the mean MMF responses and age for all participants (n = 37). The shaded area represents the 95% confidence interval.
Source-level results
Across the entire sample, we found significantly larger MMF amplitudes, P < 0.05, in the 3 right-hemisphere ROIs: A1 (0.25–0.36 s), STG (0.25–0.36 s), and IFG (0.23–0.34 s; see Fig. 5). Hence, we constrained the subsequent age-related analysis to the right-hemisphere ROIs, and to the time window of 0.23 to 0.36 s. Consistent with our hypothesis, the older children showed a significantly larger MMF amplitude, P < 0.05, in the right-IFG (0.31 to 0.33 s) relative to the younger children (see Fig. 6). We found no significant differences between the younger and older children’s standard or deviant amplitudes in the same time window, P > 0.05.
Fig. 5.

Whole-group source-level results. The 2 columns show the standard (blue) and deviant (red) ERF amplitudes (ampere per meter square) by time (s) for the left and right hemisphere ROIs, respectively. The shaded regions around each waveform represent 95% confidence intervals. The horizontal black line indicates clusters of significant differences (P < 0.05) between the standard and deviant ERF. Note the different y-axis scales.
Fig. 6.
Source-level results for the younger and older children. The first 2 columns show the standard (blue) and deviant (red) ERF amplitudes (ampere per meter square) by time (s) for the younger and older children, respectively. The third column shows the MMF waveforms for the younger (pink) and older (green) children. The shaded regions around each waveform represent 95% confidence intervals. The horizontal black line indicates clusters of significant differences (P < 0.05) between the younger and older children’s MMF amplitudes. Note the different y-axis scales.
Discussion
This pediatric MEG study investigated the maturation of MMF between the ages of 3 and 6. Consistent with our hypotheses, we found that—at the sensor level—the older children (M = 6.2 years, range: 5.6–6.9) showed significantly larger MMF amplitudes relative to the younger children (M = 4.1 years, range: 3.1–5.3). Furthermore—at the source level—the older children showed a significantly larger MMF amplitude in the right IFG relative to the younger children. The larger MMF in older (relative to younger) children did not appear to be driven by either smaller or larger deviants alone, by rather a change in both directions, thereby giving rise to a significantly larger difference waveform across age. Our study is a first step toward measuring the early maturation of predictive brain function, with a focus on frontal regions like the IFG.
The maturation of the MMF
The right-lateralized source-space findings are consistent with prior evidence that the classic, pure-tone-elicited MMN is most pronounced in the right hemisphere (Paavilainen et al. 1991). Furthermore, our finding of larger MMF amplitudes in older-relative-to-younger children is in line with the findings of previous EEG studies—both longitudinal (Chobert et al. 2014; Putkinen et al. 2014a; Putkinen et al. 2014b; Linnavalli et al. 2018) and cross-sectional (Oades et al. 1997; Bishop et al. 2011a)—which have similarly reported an increase in the MMN amplitude across childhood. Yet our results conflict with the findings of several other cross-sectional EEG studies, which have reported a decrease (Csépe et al. 1992; Kraus et al. 1993a; Kraus et al. 1993b) or no difference (Kraus et al. 1999; Gomot et al. 2000; Shafer et al. 2000, 2010) in the MMN amplitude with age.
One possible reason for the mixed findings regarding the maturation of the MMN/F may be related to variability in the methods used to calculate the MMN/F amplitude. Mean measures (calculated by taking the average amplitude across a time window) are generally considered to be superior to peak measures (calculated by locating the maximum amplitude within a given time window) as they are less susceptible to high-frequency noise distortions (Bishop 2007; Bishop et al. 2011b; Luck 2014a). Previous MMN studies that computed “mean” measures largely reported an amplitude increase with age (Bishop et al. 2011b; Chobert et al. 2014; Putkinen et al. 2014a; Putkinen et al. 2014b; Linnavalli et al. 2018), whereas those using “peak” measures reported either a decrease (Csépe et al. 1992; Kraus et al. 1993a; Kraus et al. 1993b) or no difference (Shafer et al. 2000, 2010) in the MMN amplitude with age. Given the problems associated with peak measures, the mean amplitude findings seem to be the most reliable and are consistent with the findings from this study.
The key finding of this study is that older children (Mage = 6.2 years) showed significantly larger MMF amplitudes in the right IFG relative to the younger children (Mage = 4.1 years). To our knowledge, this is the first pediatric MEG study which has examined the neural sources underpinning MMN/F generation in typically developing children. We cautiously interpret the current finding of larger MMF amplitudes in older children as reflecting the maturation of predictive brain function across childhood. Specifically, we suggest that with significant physiological maturation of the brain during childhood—and, the prolonged maturation of the frontal lobes (Huttenlocher and Dabholkar 1997)—children’s brains may become increasingly proficient at extracting statistical regularities from the stream of auditory input. Equipped with this statistical knowledge, children are then able to form increasingly precise priors of the upcoming stimuli, whereby the strongest prediction would be for the presentation of a high probability “standard.” The larger MMF difference waveform amplitude seen in the older children may reflect a larger prediction error response to the lower probability deviants (which may lie outside of the precise prior distribution), as well as a smaller error response to the higher probability standards (which may be precisely predicted). By contrast, the relatively attenuated MMF amplitude seen in the younger children may reflect—compared with the older children—a smaller prediction error to the deviants (which may lie within the broad prior distribution), as well as a larger response to the standards (which may be less precisely predicted). Furthermore, the results support the proposal that increasingly mature predictive brain function across development is underpinned by greater involvement of the frontal cortex in responding to prediction errors.
The notion that priors become increasingly precise with age is consistent with what we intuitively understand about the differences between adult and child perception (Lucas et al. 2014). Increasingly precise prior distributions across development would progressively bias the posterior distribution (i.e. the perceptual experience) toward the narrowing prior (i.e. the brain’s stored knowledge about the world) and away from incoming sensory signals. Furthermore, increasingly precise predictions about the world have an adaptive function, resulting in fewer prediction error signals and thus, reduced demands on neural bandwidth in the long run.
Conversely, having relatively broad priors in the early years of life would give rise to a perceptual experience that is biased toward incoming sensory signals from the world and less biased by prior knowledge. Furthermore, having imprecise, broad priors should yield more frequent prediction errors—driving more frequent model updating and thus boosting the learning rate. Indeed, it might be precisely because young children know less about the world that makes them more open to learning new information (Lucas et al. 2014).
Alternative mechanistic accounts
Predictive coding theory provides a plausible account of the current findings. While alternative mechanistic accounts of the MMN/F have been put forward, these accounts can only account for MMN/F responses elicited by auditory oddball paradigms in which the “standards” are both: (a) acoustically identical to one another, and (b) repetitive in their occurrence. The “traditional” auditory oddball paradigm fits these criteria, involving the presentation of a stream of repetitive standard (S) stimuli that are infrequently interrupted by the presentation of a single deviant (D) stimulus (i.e. D-SSSD-SSSSS-D…).
Under the “sensory memory” account, a traditional MMN/F is generated when there is a detectable acoustic difference between the current input (e.g. a “deviant” stimulus) and a brief memory trace of the preceding input (e.g. a “standard” stimulus; Näätänen et al. 1978). Alternatively, under the “neuronal adaptation” account, neurons tuned to the “standards” become suppressed through repeated stimulation, whereas the rarer “deviant” stimuli activate non-adapted neurons (i.e. “fresh afferents”), resulting in an enhanced deviant-relative-to-standard evoked response (May and Tiitinen 2010). Finally, under predictive coding, the traditional MMN/F could reflect a relatively larger prediction error signal elicited in response to the lower probability “deviants” (P = 0.5/4) relative to the higher probability “standards” (P = 0.5). Thus, all 3 mechanistic accounts provide equally satisfying explanations of the traditional MMN/F.
Yet these sensory memory and neuronal adaptation accounts fail to explain MMN/Fs elicited by paradigms in which the “standard” stimuli are non-repetitive in their occurrence—such as the multi-feature paradigm implemented in this study (i.e. S-D4-S-D2-S-D1-S-D4-S-D3…). Under the “sensory memory” account, a change-detection mechanism would signal acoustic change on every trial, resulting in equally large evoked response amplitudes to all the stimuli and therefore, no mismatch effect. Likewise, under the neuronal adaptation account, the constant acoustic changes would not allow for habitation to any of the stimuli, again resulting in no mismatch effect.
Study implications
Overall, the current neural findings offer preliminary support to the notion that predictive brain function improves across childhood. Our findings are consistent with a substantial body of behavioral evidence suggesting that young children learn the statistical regularities in their environment and use that statistical knowledge to generate predictions about future sensory events (Gopnik and Wellman 2012; Köster et al. 2020). The predictive coding account of childhood neurocognitive development represents a relatively new and radical way of conceptualizing early learning (Gopnik 2012).
This account may also have important implications for understanding the neurocognitive underpinnings of neurodevelopmental conditions that manifest in the early childhood years. For example, predictive coding accounts of autism have proposed that both the sensory and social characteristics of autistic people may be explained by divergent precision weighting of prediction errors (Brock 2012; Pellicano and Burr 2012; Friston et al. 2013; Pellicano 2013; van Boxtel and Lu 2013; Van de Cruys et al. 2013, 2014; Lawson et al. 2014, 2017). The current findings will serve as an important baseline for testing these predictive coding accounts of autism, as well as other neurodevelopmental conditions.
Study limitations
It should be noted that the current findings are constrained by several limitations. The between-group analyses were conducted with relatively small samples (younger children: n = 18; older children: n = 19). While the effect sizes were large, studies with larger samples are needed to strengthen the present findings. With respect to the source analysis, our approach was constrained to 6 ROIs taken from Garrido et al. (2007, 2008, 2009): the bilateral primary auditory cortices (A1), STG, and IFG. We chose to constrain the analysis in this way in order to maximize the statistical power of the permutation approach (Luck 2014b). However, there is evidence of a more extensive MMN/F neural network beyond the frontotemporal regions, including generators in the parietal cortex (Celsis et al. 1999; Opitz et al. 1999; Schall et al. 2003; Molholm et al. 2005; Näätänen et al. 2019b). As such, important ROIs may have been left out of the current source analysis. Future research should attempt to investigate the neural generators more precisely beyond the regions investigated here with a considerably larger sample of participants (e.g. N = ≥ 100) using functional magnetic resonance imaging (fMRI) and/or MEG whole-brain source analyses.
Conclusions
In conclusion, this study is the first to use MEG to investigate the maturation of the MMF during the childhood years. In line with our sensor-level hypotheses, we found larger MMF amplitudes in older (relative to younger) children. Furthermore, at the source-level, older children showed significantly larger MMF amplitudes in the right IFG relative to the younger children. We interpret these findings as preliminary support for the notion that predictive brain function matures across early development, and that this maturation is underpinned by increasing involvement of the frontal cortex in responding to prediction errors. These findings contribute to a deeper understanding of the brain function underpinning child cognitive development.
Acknowledgements
We are extremely grateful to all the children and families who so generously gave up their time to participate in this research.
Contributor Information
Hannah Rapaport, School of Psychological Sciences, 16 University Ave, Macquarie University, Sydney, 2109, Australia; Macquarie School of Education, 26 Wally’s Walk, Macquarie University, Sydney, 2109, Australia.
Robert A Seymour, Wellcome Centre for Human Neuroimaging, 12 Queen Square, London WC1N 3AR, United Kingdom.
Nicholas Benikos, School of Psychological Sciences, 16 University Ave, Macquarie University, Sydney, 2109, Australia.
Wei He, School of Psychological Sciences, 16 University Ave, Macquarie University, Sydney, 2109, Australia.
Elizabeth Pellicano, Macquarie School of Education, 26 Wally’s Walk, Macquarie University, Sydney, 2109, Australia; Department of Clinical, Educational and Health Psychology, 26 Bedford Way, University College London, London WC1E 6BT, United Kingdom.
Jon Brock, School of Psychological Sciences, 16 University Ave, Macquarie University, Sydney, 2109, Australia.
Paul F Sowman, School of Psychological Sciences, 16 University Ave, Macquarie University, Sydney, 2109, Australia.
CRediT authors statement
Hannah Rapaport: Methodology, Software, Formal analysis, Investigation, Data Curation, Writing – Original Draft, Writing – ReviewEditing, Visualization, Robert A. Seymour: Methodology, Software, Writing – ReviewEditing, Visualization, Nick Benikos: Investigation, Resources, Writing – ReviewEditing, Wei He: Methodology, Investigation, Supervision, Elizabeth Pellicano: Methodology, Writing – ReviewEditing, Supervision, Jon Brock: Conceptualisation, Writing – ReviewEditing, Paul F. Sowman: Conceptualization, Methodology, Software, Formal analysis, Writing – ReviewEditing, Visualization, Supervision, Funding acquisition.
Funding
This work was supported by the Australian Research Council Discovery Projects (grant number DP170103148).
Conflict of interest statement: None declared.
Data availability
The experimental paradigm, data pre-processing and data analysis scripts are freely available on an Open Science Framework repository: https://osf.io/35q8n/.
References
- Azhari A, Truzzi A, Neoh MJY, Balagtas JPM, Tan HAH, Goh PLP, Ang XHA, Setoh P, Rigo P, Bornstein MH, et al. A decade of infant neuroimaging research: what have we learned and where are we going? Infant Behav Dev. 2020:58:101389. [DOI] [PubMed] [Google Scholar]
- Badcock PB, Friston KJ, Ramstead MJD, Ploeger A, Hohwy J. The hierarchically mechanistic mind: an evolutionary systems theory of the human brain, cognition, and behavior. Cogn Affect Behav Neurosci. 2019:19(6):1319–1351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baillet S. Magnetoencephalography for brain electrophysiology and imaging. Nat Neurosci. 2017:20(3):327–339. [DOI] [PubMed] [Google Scholar]
- Barkovich MJ, Li Y, Desikan RS, Barkovich AJ, Xu D. Challenges in pediatric neuroimaging. NeuroImage. 2019:185:793–801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Basirat A, Dehaene S, Dehaene-Lambertz G. A hierarchy of cortical responses to sequence violations in three-month-old infants. Cognition. 2014:132(2):137–150. [DOI] [PubMed] [Google Scholar]
- Bishop DVM. Using mismatch negativity to study central auditory processing in developmental language and literacy impairments: where are we, and where should we be going? Psychol Bull. 2007:133(4):651–672. [DOI] [PubMed] [Google Scholar]
- Bishop DVM, Anderson M, Reid C, Fox AM. Auditory development between 7 and 11 years: an event-related potential (ERP) study. PLoS One. 2011a:6(5):1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bishop DVM, Hardiman MJ, Barry JG. Is auditory discrimination mature by middle childhood? A study using time-frequency analysis of mismatch responses from 7 years to adulthood. Dev Sci. 2011b:14(2):402–416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blakemore SJ, Choudhury S. Development of the adolescent brain: implications for executive function and social cognition. J Child Psychol Psychiatry. 2006:47(3–4):296–312. [DOI] [PubMed] [Google Scholar]
- Brainard DH. The psychophysics toolbox. Spat Vis. 1997:10:433–436. [PubMed] [Google Scholar]
- Brock J. Alternative Bayesian accounts of autistic perception: comment on Pellicano and Burr. Trends Cogn Sci. 2012:16(12):573–574. [DOI] [PubMed] [Google Scholar]
- Brookes MJ, Tewarie PK, Hunt BAE, Robson SE, Gascoyne LE, Liddle EB, Liddle PF, Morris PG. A multi-layer network approach to MEG connectivity analysis. NeuroImage. 2016:132:425–438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown TT, Jernigan TL. Brain development during the preschool years. Neuropsychol Rev. 2012:22(4):313–333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Celsis P, Boulanouar K, Doyon B, Ranjeva JP, Berry I, Nespoulous JL, Chollet F. Differential fMRI responses in the left posterior superior temporal gyrus and left supramarginal gyrus to habituation and change detection in syllables and tones. NeuroImage. 1999:9(1):135–144. [DOI] [PubMed] [Google Scholar]
- Chobert J, François C, Velay JL, Besson M. Twelve months of active musical training in 8- to 10-year-old children enhances the preattentive processing of syllabic duration and voice onset time. Cereb Cortex. 2014:24(4):956–967. [DOI] [PubMed] [Google Scholar]
- Clark A. Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav Brain Sci. 2013:36(3):181–204. [DOI] [PubMed] [Google Scholar]
- Clark A. Surfing uncertainty: prediction, action, and the embodied mind. New York City: Oxford University Press; 2015. [Google Scholar]
- Csépe V, Dieckmann B, Hoke M, Ross B. Mismatch negativity to pitch change of acoustic stimuli in preschool-and school-age children. Proceedings of EPIC. 1992:10(32):91–104. [Google Scholar]
- de Cheveigné A, Simon JZ. Denoising based on time-shift PCA. J Neurosci Methods. 2007:165(2):297–305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dołęga K, Dewhurst JE. Fame in the predictive brain: a deflationary approach to explaining consciousness in the prediction error minimization framework. Synthese. 2021:198(8):7781–7806. [Google Scholar]
- Douw L, Nieboer D, Stam CJ, Tewarie P, Hillebrand A. Consistency of magnetoencephalographic functional connectivity and network reconstruction using a template versus native MRI for co-registration. Hum Brain Mapp. 2018:39(1):104–119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dürschmid S, Edwards E, Reichert C, Dewar C, Hinrichs H, Heinze HJ, Kirsch HE, Dalal SS, Deouell LY, Knight RT. Hierarchy of prediction errors for auditory events in human temporal and frontal cortex. Proc Natl Acad Sci U S A. 2016:113(24):6755–6760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Emberson LL, Richards JE, Aslin RN. Top-down modulation in the infant brain: learning-induced expectations rapidly affect the sensory cortex at 6 months. Proc Natl Acad Sci U S A. 2015:112(31):9585–9590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Emberson LL, Boldin AM, Riccio JE, Guillet R, Aslin RN. Deficits in top-down sensory prediction in infants at risk due to premature birth. Curr Biol. 2017:27(3):431–436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Emberson LL, Boldin AM, Robertson CE, Cannon G, Aslin RN. Expectation affects neural repetition suppression in infancy. Dev Cogn Neurosci. 2019a:37(100597):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Emberson LL, Misyak JB, Schwade JA, Christiansen MH, Goldstein MH. Comparing statistical learning across perceptual modalities in infancy: an investigation of underlying learning mechanism(s). Dev Sci. 2019b:22(6):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Engemann D, King JR. Compute effect-matched-spatial filtering (EMS). MNE Documentation. 2021. [accessed 2021 May 13]: https://mne.tools/stable/auto_examples/decoding/plot_ems_filtering.html. [Google Scholar]
- Fischer K, Bidell TR. Dynamic development of action and thought. In: Lerner RM, editor. Handbook of child psychology: theoretical models of human development. Vol. 1. 6th ed. New Jersey: John Wiley & Sons, Inc.; 2007. pp. 313–399. [Google Scholar]
- Friston K. A theory of cortical responses. Philosophical Transactions of the Royal Society B: Biological Sciences. 2005:360(1456):815–836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston K, Kiebel S. Predictive coding under the free-energy principle. Philosophical Transactions of the Royal Society B: Biological Sciences. 2009:364(1521):1211–1221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston K, Lawson R, Frith CD. On hyperpriors and hypopriors: comment on Pellicano and Burr. Trends Cogn Sci. 2013:17(1):1. [DOI] [PubMed] [Google Scholar]
- Garrido MI, Kilner JM, Kiebel SJ, Stephan KE, Friston KJ. Dynamic causal modelling of evoked potentials: a reproducibility study. NeuroImage. 2007:36(3):571–580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garrido MI, Friston KJ, Kiebel SJ, Stephan KE, Baldeweg T, Kilner JM. The functional anatomy of the MMN: a DCM study of the roving paradigm. NeuroImage. 2008:42(2):936–944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garrido MI, Kilner JM, Stephan KE, Friston KJ. The mismatch negativity: a review of underlying mechanisms. Clin Neurophysiol. 2009:120(3):453–463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gohel B, Lim S, Kim M-Y, Kwon H, Kim K. Approximate subject specific pseudo MRI from an available MRI dataset for MEG source imaging. Front Neuroinform. 2017:11:50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gomot M, Giard MH, Roux S, Barthelemy C, Bruneau N. Maturation of frontal and temporal components of mismatch negativity (MMN) in children. Neuroreport. 2000:11(14):3109–3112. [DOI] [PubMed] [Google Scholar]
- Gopnik A. Scientific thinking in young children: theoretical advances, empirical research, and policy implications. Science (1979). 2012:337(6102):1623–1627. [DOI] [PubMed] [Google Scholar]
- Gopnik A, Wellman HM. Reconstructing constructivism: causal models, Bayesian learning mechanisms and the theory theory. Psychol Bull. 2012:138(6):1085–1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gramfort A, Luessi M, Larson E, Engemann DA, Strohmeier D, Brodbeck C, Goj R, Jas M, Brooks T, Parkkonen L, et al. MEG and EEG data analysis with MNE-python. Front Neurosci. 2013:7:267. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gross J, Baillet S, Barnes GR, Henson RN, Hillebrand A, Jensen O, Jerbi K, Litvak V, Maess B, Oostenveld R, et al. Good practice for conducting and reporting MEG research. NeuroImage. 2013:65:349–363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hari R, Puce A. Brain signals related to change detection. MEG-EEG primer. Oxford: Oxford University Press; 2017. pp. 262–276. [Google Scholar]
- He W, Sowman PF, Brock J, Etchell AC, Stam CJ, Hillebrand A. Increased segregation of functional networks in developing brains. NeuroImage. 2019:200:607–620. [DOI] [PubMed] [Google Scholar]
- Heilbron M, Chait M. Great expectations: is there evidence for predictive coding in auditory cortex? Neuroscience. 2018:389:54–73. [DOI] [PubMed] [Google Scholar]
- Ho J, Tumkaya T, Aryal S, Choi H, Claridge-Chang A. Moving beyond P values: data analysis with estimation graphics. Nat Methods. 2019:16:565–566. [DOI] [PubMed] [Google Scholar]
- Hohwy J. Attention and conscious perception in the hypothesis testing brain. Front Psychol. 2012:3:96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hohwy J. The predictive mind. Oxford: Oxford University Press; 2013. [Google Scholar]
- Huang HW, Nascimben M, Wang YY, Fong DY, Tzeng OJL, Huang CM. Which digit is larger? Brain responses to number and size interactions in a numerical Stroop task. Psychophysiology. 2020:58(3):e13744. [DOI] [PubMed] [Google Scholar]
- Huttenlocher PR, Dabholkar AS. Regional differences in synaptogenesis in human cerebral cortex. J Comp Neurol. 1997:387(2):167–178. [DOI] [PubMed] [Google Scholar]
- Jaffe-Dax S, Boldin AM, Daw ND, Emberson LL. A computational role for top–down modulation from frontal cortex in infancy. J Cogn Neurosci. 2020:32(3):508–514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson BW, Crain S, Thornton R, Tesan G, Reid M. Measurement of brain function in pre-school children using a custom sized whole-head MEG sensor array. Clin Neurophysiol. 2010:121(3):340–349. [DOI] [PubMed] [Google Scholar]
- Kayhan E, Heil L, Kwisthout J, van Rooij I, Hunnius S, Bekkering H. Young children integrate current observations, priors and agent information to predict others’ actions Korn CW, editor. PLoS One. 2019:14(5):e0200976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kersten D, Yuille A. Bayesian models of object perception. Curr Opin Neurobiol. 2003:13(2):150–158. [DOI] [PubMed] [Google Scholar]
- Knösche TR. Transformation of whole-head MEG recordings between different sensor positions. Biomed Tech (Berl). 2002:47(3):59–62. [DOI] [PubMed] [Google Scholar]
- Köster M, Kayhan E, Langeloh M, Hoehl S. Making sense of the world: infant learning from a predictive processing perspective. Perspect Psychol Sci. 2020:15(3):562–571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kraus N, Therese MG, Micco A, Sharma A, Carrell T, Nicol T. Mismatch negativity in school-age children to speech stimuli that are just perceptibly different. Electroencephalography and Clinical Neurophysiology/ Evoked Potential. 1993b:88(2):123–130. [DOI] [PubMed] [Google Scholar]
- Kraus N, McGee T, Carrell T, Sharma A, Micco A, Nicol T. Speech-evoked cortical potentials in children. J Am Acad Audiol. 1993a:4(4):238–248. [PubMed] [Google Scholar]
- Kraus N, Koch DB, McGee TJ, Nicol TG, Cunningham J. Speech-sound discrimination in school-age children: psychophysical and neurophysiologic measures. Journal of Speech, Language, and Hearing Research. 1999:42(5):1042. [DOI] [PubMed] [Google Scholar]
- Krogh L, Vlach HA, Johnson SP, Männel C. Statistical learning across development: flexible yet constrained. Front Psychol. 2013:3:598. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lawson RP, Rees G, Friston KJ. An aberrant precision account of autism. Front Hum Neurosci. 2014:8:1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lawson RP, Mathys C, Rees G. Adults with autism overestimate the volatility of the sensory environment. Nat Neurosci. 2017:20(9):1293–1299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Linnavalli T, Putkinen V, Huotilainen M, Tervaniemi M. Maturation of speech-sound ERPs in 5–6-year-old children: a longitudinal study. Front Neurosci. 2018:12:814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lovio R, Pakarinen S, Huotilainen M, Alku P, Silvennoinen S, Naatanen R, Kujala T. Auditory discrimination profiles of speech sound changes in 6-year-old children as determined with the multi-feature MMN paradigm. Clin Neurophysiol. 2009:120:916–921. [DOI] [PubMed] [Google Scholar]
- Lucas CG, Bridgers S, Griffiths TL, Gopnik A. When children are better (or at least more open-minded) learners than adults: developmental differences in learning the forms of causal relationships. Cognition. 2014:131(2):284–299. [DOI] [PubMed] [Google Scholar]
- Luck SJ. Quantifying ERP amplitudes and latencies. In: Luck S, editor. An introduction to the event-related potential technique. Cambridge (MA): MIT Press; 2014a. pp. 283–307. [Google Scholar]
- Luck SJ. A mass univariate approach and permutation statistics. In: Luck S, editor. An introduction to the event-related potential technique. Cambridge (MA): MIT Press; 2014b. [Google Scholar]
- Maris E, Oostenveld R. Nonparametric statistical testing of EEG- and MEG-data. J Neurosci Methods. 2007:164(1):177–190. [DOI] [PubMed] [Google Scholar]
- May PJC, Tiitinen H. Mismatch negativity (MMN), the deviance-elicited auditory deflection, explained. Psychophysiology. 2010:47(1):66–122. [DOI] [PubMed] [Google Scholar]
- Medvedovsky M, Taulu S, Bikmullina R, Paetau R. Artifact issue during head position correction in MEG. Epilepsia. 2007:48:598. [Google Scholar]
- Molholm S, Martinez A, Ritter W, Javitt DC, Foxe JJ. The neural circuitry of pre-attentive auditory change-detection: an fMRI study of pitch and duration mismatch negativity generators. Cereb Cortex. 2005:15(5):545–551. [DOI] [PubMed] [Google Scholar]
- Näätänen R, Gaillard AWK, Mäntysalo S. Early selective-attention effect on evoked potential reinterpreted. Acta Psychol. 1978:42(4):313–329. [DOI] [PubMed] [Google Scholar]
- Näätänen R, Pakarinen S, Rinne T, Takegata R. The mismatch negativity (MMN): towards the optimal paradigm. Clin Neurophysiol. 2004:115(1):140–144. [DOI] [PubMed] [Google Scholar]
- Näätänen R, Kujala T, Light G. The development of the MMN. In: Näätänen R, Kujala T, Light G, editors. Mismatch negativity: a window to the brain. Oxford: Oxford University Press; 2019a. pp. 52–66. [Google Scholar]
- Näätänen R, Kujala T, Light G. The mismatch negativity (MMN): An introduction. In: Näätänen R, Kujala T, Light G, editors. Mismatch negativity: a window to the brain. Oxford: Oxford University Press; 2019b. pp. 1–40. [Google Scholar]
- Oades RD, Dmittmann-Balcar A, Zerbin D. Development and topography of auditory event-related potentials (ERPs): mismatch and processing negativity in individuals 8-22 years of age. Psychophysiology. 1997:34(6):677–693. [DOI] [PubMed] [Google Scholar]
- Oostenveld R, Fries P, Maris E, Schoffelen JM. FieldTrip: open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput Intell Neurosci. 2011:2011:1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Opitz B, Mecklinger A, von Cramon DY, Kruggel F. Combining electrophysiological and hemodynamic measures of the auditory oddball. Psychophysiology. 1999:36(1):142–147. [DOI] [PubMed] [Google Scholar]
- Oyama D, Adachi Y, Higuchi M, Kawai J, Kobayashi K, Uehara G. Real-time coil position monitoring system for biomagnetic measurements. Phys Procedia. 2012:36:280–285. [Google Scholar]
- Paavilainen P, Alho K, Reinikainen K, Sams M, Näätänen R. Right hemisphere dominance of different mismatch negativities. Electroencephalogr Clin Neurophysiol. 1991:78:466–479. [DOI] [PubMed] [Google Scholar]
- Pakarinen S, Takegata R, Rinne T, Huotilainen M, Näätänen R. Measurement of extensive auditory discrimination profiles using the mismatch negativity (MMN) of the auditory event-related potential (ERP). Clin Neurophysiol. 2007:118(1):177–185. [DOI] [PubMed] [Google Scholar]
- Partanen E, Torppa R, Pykäläinen J, Kujala T, Huotilainen M. Children’s brain responses to sound changes in pseudo words in a multifeature paradigm. Clin Neurophysiol. 2013:124:1132–1138. [DOI] [PubMed] [Google Scholar]
- Pellicano E. Sensory symptoms in autism: a blooming, buzzing confusion? Child Dev Perspect. 2013:7(3):143–148. [Google Scholar]
- Pellicano E, Burr D. When the world becomes ‘too real’: a Bayesian explanation of autistic perception. Trends Cogn Sci. 2012:16(10):504–510. [DOI] [PubMed] [Google Scholar]
- Petermann M, Kummer P, Burger M, Lohscheller J, Eysholdt U, Döllinger M. Statistical detection and analysis of mismatch negativity derived by a multi-deviant design from normal hearing children. Hear Res. 2009:247(2):128–136. [DOI] [PubMed] [Google Scholar]
- Phillips HN, Blenkmann A, Hughes LE, Bekinschtein TA, Rowe JB. Hierarchical organization of frontotemporal networks for the prediction of stimuli across multiple dimensions. J Neurosci. 2015:35(25):9255–9264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Putkinen V, Niinikuru R, Lipsanen J, Tervaniemi M, Huotilainen M. Fast measurement of auditory event-related potential profiles in 2–3-year-olds. Dev Neuropsychol. 2012:37(1):51–75. [DOI] [PubMed] [Google Scholar]
- Putkinen V, Tervaniemi M, Saarikivi K, Ojala P, Minna H. Enhanced development of auditory change detection in musically trained school-aged children: a longitudinal event-related potential study. Dev Sci. 2014a:17(2):282–297. [DOI] [PubMed] [Google Scholar]
- Putkinen V, Tervaniemi M, Saarikivi K, de Vent N, Huotilainen M. Investigating the effects of musical training on functional brain development with a novel melodic MMN paradigm. Neurobiol Learn Mem. 2014b:110:8–15. [DOI] [PubMed] [Google Scholar]
- Rao RPN, Ballard DH. Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci. 1999:2(1):79–87. [DOI] [PubMed] [Google Scholar]
- Rapaport H, Seymour RA, Sowman PF, Benikos N, Stylianou E, Johnson BW, Crain S, He W. Studying brain function in children using magnetoencephalography. J Vis Exp. 2019:146:e58909. [DOI] [PubMed] [Google Scholar]
- Raviv L, Arnon I. The developmental trajectory of children’s auditory and visual statistical learning abilities: modality-based differences in the effect of age. Dev Sci. 2018:21(4):1–13. [DOI] [PubMed] [Google Scholar]
- Richards JE, Sanchez C, Phillips-Meek M, Xie W. A database of age-appropriate average MRI templates. NeuroImage. 2016:124:1254–1259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Santos E, Noggle CA. Synaptic pruning. In: Goldstein S, Naglieri JA, editors. Encyclopedia of child behavior and development. Boston, MA: Springer; 2011. pp. 1464–1465. [Google Scholar]
- Schall U, Johnston P, Todd J, Ward PB, Michie PT. Functional neuroanatomy of auditory mismatch processing: an event-related fMRI study of duration-deviant oddballs. NeuroImage. 2003:20(2):729–736. [DOI] [PubMed] [Google Scholar]
- Scholl BJ. Innateness and (Bayesian) visual perception: Reconciling nativism and development. In:Carruthers P, Laurence S, Stich S, editors. The innate mind: structure and contents. Cambridge: Cambridge University Press; 2004. [Google Scholar]
- Schurger A, Marti S, Dehaene S. Reducing multi-sensor data to a single time course that reveals experimental effects. BMC Neurosci. 2013:14(1):1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seymour RA. Macquarie-MEG-research/MEMES: for Zenodo (version v0.31). Zenodo. 2018. 10.5281/zenodo.1451031. [DOI] [Google Scholar]
- Shafer VL, Morr ML, Kreuzer JA, Kurtzberg D. Maturation of mismatch negativity in school-age children. Ear Hear. 2000:21(3):242–251. [DOI] [PubMed] [Google Scholar]
- Shafer VL, Yu YH, Datta H. Maturation of speech discrimination in 4- to 7-yr-old children as indexed by event-related potential mismatch responses. Ear Hear. 2010:31(6):735–745. [DOI] [PubMed] [Google Scholar]
- van Boxtel JJA, Lu H. A predictive coding perspective on autism spectrum disorders. Front Psychol. 2013:4:19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van de Cruys S, Evers K, Boets B, Wagemans J, De-Wit L, Evers K, Boets B, Wagemans J. Weak priors versus overfitting of predictions in autism: reply to Pellicano and Burr. i-Perception. 2013:4(2):95–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van de Cruys S, Evers K, van der Hallen R, van Eylen L, Boets B, de-Wit L, Wagemans J. Precise minds in uncertain worlds: predictive coding in autism. Psychol Rev. 2014:121(4):649–675. [DOI] [PubMed] [Google Scholar]
- van Veen BD, van Drongelen W, Yuchtman M, Suzuki A. Localization of brain electrical activity via linearly constrained minimum variance spatial filtering. IEEE Trans Biomed Eng. 1997:44(9):867–880. [DOI] [PubMed] [Google Scholar]
- Winkler I, Debener S, Muller K-R, Tangermann M. 2015. On the influence of high-pass filtering on ICA-based artifact reduction in EEG-ERP. In: Cerutti S, Bonato P, editors. 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Milan, Italy: IEEE. p. 4101–4105. [DOI] [PubMed] [Google Scholar]
- Zhang F, Emberson LL. Using pupillometry to investigate predictive processes in infancy. Infancy. 2020:25(6):1–23. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The experimental paradigm, data pre-processing and data analysis scripts are freely available on an Open Science Framework repository: https://osf.io/35q8n/.

