Abstract
Flash evoked responses can be recorded from the fetus in utero. However a standard analysis approach based on orthogonal projection (OP) to attenuate maternal and fetal cardiac signals leads to spatial redistribution of the signal. This effect prevents correlation of source location with known fetal head location in some cases and the signal to noise ratio (SNR) is sometimes limited such that response latency is difficult to determine. We used a modified beamformer model search analysis to avoid the redistribution shortcoming and to improve the SNR. We included a statistical test for residual interference in the average and quantified the significance of the evoked response with a bootstrap method. Selected source locations compared favorably to fetal head locations estimated from ultrasound exams. The evoked response time course was found to have a significant post-trigger peak with latency between about 180 and 770 ms in more than 90% of the subject measurements. These results confirm that the combined application of a beamformer model search and bootstrap significance test provide a validation of the flash evoked response observed in OP processed fetal MEG channels.
Keywords: fetal magnetoencephalography, MEG, evoked response, beamformer, bootstrap
1. Introduction
The investigation of fetal brain development in utero in humans can be performed by fetal magnetoencephalography (fMEG). There is growing evidence that fMEG is able to detect auditory evoked responses, flash evoked responses, and spontaneous brain activity. The investigation of flash evoked responses is especially interesting, because the stimulation is not perceived by the mother; avoiding potential event-related artifact. Validation of the fetal averaged evoked response is complicated by poor SNR due to the presence of fetal spontaneous brain activity, fetal eye and mouth movements, and unidentified interferers within the maternal abdomen which may not be sufficiently attenuated by the limited number of averaged events. The problem is exacerbated at this point in the development of fMEG due to the absence of co-registration between the MEG sensor array and a 3D ultrasound image of the fetus. The low SNR regime suggests that validation of these results should include (a) statistical significance of the time course, (b) proximity of significant MEG sensor channels (or a modeled source) to the estimated fetal head location, and (c) observation of physiologically plausible time course morphology.
In our initial report of flash evoked fMEG (Eswaran et al 2002) we observed an evoked field in only 63% of the cases. However, we have recently reported improved response using orthogonal projection (OP) processing (McCubbin et al 2007a). The improvement over prior efforts, observed as an increase in the number of successful recordings from 63% to 89%, was attributed to an upgraded visual stimulator, better measurement protocol, and processing advancements.
A significance measure was applied in the recent work, using the pre-stimulus interval for estimation of the background noise in the average. However, e.g. in cases with large spontaneous burst brain activity often seen in the late gestation fetus (Rose and Eswaran 2004), the pre-stimulus interval available for estimation of the statistical distribution for the background interference may not be sufficient to conservatively satisfy the significance validation (a). The redistribution of fMEG signal among MEG channels due to OP processing (Vrba et al 2004) made it difficult to carry out the proximity validation (b) however the morphology criterion (c) was qualitatively successful.
An alternate approach would be to fit a dipole model to the OP processed and averaged time course and then project the modeled dipole activity back to the MEG sensor space. This could eliminate the proximity validation difficulties, but does not provide any benefit in SNR for the other validation criteria. In the current work we investigated a beamformer model search (Vrba et al 2007) to improve SNR over the OP processing method and to allow correlation of the selected fetal head model to the estimated fetal orientation. In addition the beamformer approach suppresses environmental interference and other interfering sources which cannot be identified by template matching for OP. The beamformer applies a spatial filter which attenuates all signal components that do not support a modeled source at the target location and provides a transformation of the MEG signal space to a single source activation. Beamformer rejection of MCG is superior to that obtained with OP and it additionally attenuates other interference arising from sources external to the fetal head. The beamformer is well suited to situations where the source model is not well known. It may be configured to search a grid of model origins over a plausible volume to provide candidate model source time courses. Selection of a source with peak SNR then provides a time course for the morphology validation and a model sphere location for the proximity validation.
The measure applied for the significance validation relies on a conservative estimate of the background interference in the average. Since the beamformer attenuates all external interference, the dominant interference for the investigated evoked response source location should be leakage from sources of nearby spontaneous brain activity. It has been observed that fetal spontaneous brain signals may be up to 10 times larger than averaged evoked responses (Rose and Eswaran 2004), and the spatial filter is not sharp enough to provide sufficient attenuation. While a lower limit for the spontaneous burst characteristic frequency may be around 1 Hz, the period of the burst envelope is on the order of 5 – 20 sec and is not stationary (Stockard-Pope et al 1992) so that it is unlikely to reliably capture this interference in the pre-stimulus window. In this work we have replaced the pre-stimulus background estimate with one based on a set of randomly triggered average values. This should conservatively represent the average of all activity not time-locked to the stimulus trigger (McCubbin et al 2007b).
Details of the data collection and processing method are presented in the next section. Validation of the method is then provided by processing data collected from a newborn baby on the fMEG array. In this example with good SNR and well known head position, OP processing, standard beamformer source reconstruction, and a beamformer model search all produce similar results. Results from the fetal study are then discussed and representative data are presented. We conclude that the flash evoked response has been validated.
2. Materials and methods
The data consisted of 27 records from a flash evoked fMEG study using a 151 channel MEG array conforming to the pregnant abdomen (CTF Systems, Vancouver). Subjects with normal pregnancies were recruited with informed consent and the protocol was approved by the institutional review board. Gestational age was between 30 and 36 weeks. Visual stimulation was provided by a 3.4 m fiber optic cable terminated in a 3 × 5 cm woven panel (StockerYale, Salem). The input end of the cable was illuminated by 4 high-power LED arrays resulting in a light output of 35 mW at 630 nm (red) at the panel.
Three small (approximately 1 cm diameter by 3mm thick) localization coils were positioned on the maternal hips and spine by means of an elastic belt for registration. The position and orientation of the fetus relative to the maternal abdomen was established by an ultrasound exam just before the MEG measurement. There was no co-registration of the ultrasound image with the MEG coordinate system, however the fetal eye position was roughly marked on the abdominal surface. This was used as a guide for the placement of the stimulator light panel. A fourth localization coil was attached to the light panel for registration.
The inter-stimulus interval (ISI) for this paradigm was 3.5 +/− 0.5 s and stimulus duration was 500 ms. Approximately 150 trials were collected at a sample rate of 312.5 Hz over a measurement session of 8 minutes. The data was filtered to a bandwidth of 0.5 to 10 Hz and trials were averaged over a time window of (−0.5, 1.5) s relative to the stimulus trigger after exclusion of trials from the average based on fetal movement, inferred from changes in fetal QRS p-p variations (actogram) (Zhao and Wakai 2002).
The fourth coil location was not reliable enough to establish the beamformer model search limits in the vicinity of the fetal head. As an alternative, the beamformer model origin search was conducted in two steps; once over approximately the entire maternal abdominal volume to estimate the fetal heart (fMCG) location and again over a smaller volume constrained by knowledge of the fetal heart location, fetal head-heart orientation, and measurement geometry. The model used for both fMCG and fMEG searches was a single current dipole in a conducting sphere. The beamformer computation requires the forward solution based on the model, covariance matrix of the data, and coordinates of both the origin and source search grids.
Covariance for the fMCG search was computed in a bandwidth of 0 to 70 Hz using all data in the record. A rectangular grid with limits of {(−17, 17), (−20, 15), (−30, −15)} cm in rectangular coordinates of the sensor array with an increment of 2 cm was established for the model origin search to cover most of the maternal abdominal volume. A spherical shell grid with radial limits of (2.5, 6.5) cm and increment of 1 cm was used to construct beamformer source weights for each model origin. The source grid limits were established by visually optimizing fMCG source time course resolution on a few example datasets. An average window of (−0.1, 0.4) s relative to fMCG QRS markers (determined by template matching), with bandwidth of 0 to 70 Hz, was used to form an averaged fMCG time course over MEG channels. The MEG channel data was then transformed by beamformer weights to candidate source time courses for each origin/source grid location. The source location with the largest SNR was taken as a rough estimate of the fetal heart position (Robinson and Vrba 2004a).
For the fMEG search, covariance was computed in a bandwidth of 0.5 to 10 Hz and again in 0 to 70 Hz using all data remaining in the record after elimination of segments with fetal movement. The narrow band covariance was used to determine the beamformer model source orientation and the broad band covariance was used to compute the beamformer weights. The narrow band covariance was necessary to obtain a solution specifically for the expected frequency range of the evoked response while the broad band covariance provided weights which were valid over the full bandwidth of the recorded data. The usual rectangular grid model search over the entire maternal abdominal volume results in an ambiguity region of possible model origins due to the low SNR (Vrba et al 2007) and any beamformer model within this region can explain the data adequately. We have attempted to address this situation by limiting the search grid to geometrically plausible head model origins based on our estimate of the fetal heart location and fetal orientation from the ultrasound exam (either vertex – head down or breech – head up). A grid increment of 2 cm was used. An example origin search grid is shown in figure 1. A minimum heart radius of 1.5 cm and a fetal head radius of 4.5 cm was used to limit the closest possible head – heart origin distance to 6 cm. A maximum head – heart origin distance was estimated as the minimum plus one head diameter, or 15 cm. A maximum origin depth was set as one head radius below the fetal heart. Then for vertex (breech) orientation identified by US exam just before the MEG session, the head must be below (above) the heart so we have taken only grid points below (above) the heart center, longitudinally. Origins which produced interference between a fetal head model radius of 4.5 cm and the maternal inner abdominal surface (with estimated abdominal thickness of 1 cm) were excluded as well as origins outside the plan projection of the sensor array. At each origin a spherical shell with radii of 2 cm and 4.5 cm was used to limit the source search grid, with an increment of 1 cm.
The data was then averaged over a discrete time window of (τ−, τ+) = (−156, 469) samples or (−0.5, 1.5) s relative to the set of stimulus triggers, {tS(i)}, i = 1,…,N. The data was averaged again relative to J = 30 sets of randomized triggers, tR, (tR = tS +/− tr ,tr = 625 samples or 2 s with an exclusion of +/− te = 188 samples or 0.6 s around each true stimulus trigger).
(1) |
where Rj[.] represents the operation of making a single random selection from the set of sample numbers indicated in the brackets. The averaged time course signal space vectors for the randomized triggers are
(2) |
and the averaged time course signal space vector for the true triggers is
(3) |
where X(t) is the signal space vector at time sample t. Baseline was subtracted based on the pre-trigger interval before projection of all J+1 time courses through beamformer weights corresponding to every origin/source from the grid combination. Transforming from MEG signal space to a beamformer source space, the time course corresponding to model origin index, k, and source index, l, is given as
(4) |
where Wkl is the beamformer weight vector for (model origin index, source index) = (k, l).
We have taken the set of randomized averages for a specific source as a sample of background noise in the averaged time course. Since the randomized triggers have no meaningful temporal relationship to the averaged time course points, the time points from all randomized averages were pooled and taken together as a random sample from the statistical distribution of the background noise. Compute the variance of the pooled randomized beamformer source values as
(5) |
where
(6) |
and the summation is over all combinations of j and τ. We have then tested the null hypothesis that a value equal to the peak of the true averaged evoked response time course may be drawn from the background distribution with a p-value threshold of p = 0.001. Only sources for which this null hypothesis can be rejected were considered as candidate evoked response sources, i.e. we select all (k, l) such that
(7) |
where P[a, B] is the probability of drawing the value ‘a’ from distribution ‘B’ and G is the Gaussian distribution characterized by zero mean and variance of the pooled random sample.
An averaged time course signal to interference ratio was computed as
(8) |
where Δb and Δr are the sets of time samples for background and response time windows relative to the stimulus trigger point and n_ is the number of samples in Δ_, so that Qkl is the rms ratio of event related signal to residual interference in the average. The background and response windows were taken as (−0.5, 0.1) s and (0.2, 0.8) s respectively, assuming that a primary ER peak would occur no sooner than 100 ms after the trigger and no later than 800 ms after the trigger and further that the pretrigger interval including 100 ms after the trigger should represent averaged residual interference. It was expected that the ISI was long enough that there should be no response overlap from the preceding trial. Note that the pre and post intervals were the same length to avoid bias.
Origin/source locations with p < 0.001 were then sorted by Q with the constraints that response rms should be greater than 5 nA.m and less than 20 nA.m to identify a few most plausible evoked response sources. Source locations with values of Q less than 2 were excluded because they generally did not satisfy the morphology criterion above. The amplitude constraints were imposed to avoid candidate sources that were unreasonably small or excessively large, possibly as a result of model mismatch, and the values were chosen experimentally. Finally the significance of event related peaks was established by a 95% bootstrap confidence interval and related significance measure (McCubbin et al 2007b). Confidence intervals were plotted with the averaged response time coarse for a quick visual indication of response variability however the statistical significance of a post-trigger point in the average was quantified with an equal means test. P-value of the observed post-trigger response was computed relative to the pooled random average background variance. Statistical power was determined from the overlap of the bootstrap distribution estimate with the background distribution estimate. We assigned statistical significance to a post-stimulus mean time point for p < 0.05 and power > 0.8.
An example recording from a newborn using the same apparatus and paradigm was used to validate the measurement protocol and the data processing method (except that the actogram trial rejection was not used). The newborn subject was positioned with the head near the center of the fMEG array and facing outward with head localization coils positioned above the nose and each ear. The flash stimulator light panel was mounted above the eyes at a distance of about 60 cm. A cradle-like attachment was used to support the newborn body.
The newborn data was first processed with a standard OP processing for comparison with an established event-related beamformer analysis (Robinson 2004, Cheyne 2007) using a single current dipole in a conducting sphere model. The origin of the model sphere was taken as the midpoint between localization coils positioned above the left and right ear. The background or control covariance was computed over the pre-stimulus interval and the signal or active covariance was computed over the post-stimulus interval (as defined above) in a 0 to 70 Hz bandwidth. Output of the processing was the ratio of beamformer power in the active window to that in the control window for each point in a source grid within a spherical shell with inner radius of 3.5 cm and outer radius of 5.3 cm. The newborn data was then processed using the fetal MEG model search described above to validate the method used for the fetal data.
3. Results and discussion
The flash evoked response of the example newborn subject is presented in figure 2 and 3. MEG channel time courses displayed in figure 2 (a) were selected based on best Q after random average screening (Q range over all channels was 8.6 – 1.9). A flat baseline with a broad peak between about 300 and 700 ms was observed with a maximum amplitude of about 55 fT. The data was processed with OP before averaging to attenuate interference from the newborn heart. Signal redistribution due to OP explains the scatter in channel locations relative to the head position seen in figure 2 (c) (Vrba et al 2004). The event-related beamformer processing resulted in a tight cluster of sources with similar time course (figure 2 (b)) as selected MEG channels and maximum amplitude near 30 nA.m. The beamformer baseline was not as flat as the selected MEG channels, probably because the beamformer sources were selected based on maximum beamformer power ratio rather than Q, according to Robinson (2004). Nonetheless both methods produce convincing evidence of an evoked response to the light flash stimulation paradigm, similar to reported neonate responses (Huotilainen 2006).
Time course of the best few sources from the model search processing based on Q plotted in figure 3 (a) have a very similar character to figure 2 (a) but with visibly improved SNR (Q range over 5 best was 21.9 – 19.6). The traces have been normalized and polarity matched by peak value for ease of response shape comparison. Peak source amplitude varied between 3.4 and 7.5 nA.m; much smaller than those of figure 2 (b). This discrepancy may be due to the beamformer response to model inaccuracies. Best model sphere origins were spread by about one head radius around the true head model. The associated source locations had a similar scatter; consistent with the model search ambiguity behavior described in Vrba et al (2007). These results provide a clear validation of the model search method. The one best source (based on Q) is plotted in figure 3 (b) together with 95% confidence interval and statistically significant time segments. The SNR for this data is good and the significance analysis is not really necessary for validation of the newborn response however it is provided for comparison with fetal results to follow. The significance at the end of the average window may be evidence of a very late response component or it could be a reminder that a 95% confidence interval is not a very strict condition.
A significant post-stimulus peak between about 180 and 770 ms latency was identified on 25 of 27 fetal records. Q varied from the minimum cutoff value of 2 to about 8, distributed as shown in the histogram of figure 4. The histogram indicates the limited global robustness of the response with a Q below 3 for one third of the records. Peak source amplitude was between about 4 and 40 nA.m. Response latency was measured as the first significant peak or shoulder. There was a hint of latency reduction with increasing GA, but the variance was too large to make any conclusion, especially at 33 and 36 weeks. This approach has an inherent limitation, because latencies of different component complexes describing the morphology of a visual evoked response can be intermixed. In a recent paper (Eswaran et al 2004) we used a simple separation scheme of the evoked components to investigate early (< 300 ms) and late responses (>300 ms). The latency of the early response showed a significant decrease with gestation. In the current study this approach was not applicable, because only 6 recordings showed a significant response with a latency below 300ms.
The 3 fetal records with the largest Q are presented in figure 5 (a – c). SNR was low even for these exemplary records, as seen in the significance plots of the one best source time course for each example, however a clear response peak was readily identified. The selected head models were consistent with ultrasound exam notes, however the proximity to the 4th coil was large in some cases. The method of attaching the localization coil to the light panel was not secure and the coil tended to slip from its intended position as the subject's abdomen was engaged with the sensor array surface and might explain the observation. On the other hand, the distance between the 4th coil and head model origin could be expected to be up to about 18 cm given the model origin ambiguity of about 1 head diameter, the head radius, and an additional 3 to 5 cm to account for the fetal head surface to abdominal wall distance. The maximum 4th coil to origin distance observed was 18.4 cm, with 15 of 25 records less than 11 cm, and 23 of 25 less than 18 cm. As such, even though there was no clear discrepancy, the 4th coil position was not used as a component of the validation due to the limited resolution.
The 3 fetal records with the smallest Q are presented in figure 5 (d – f) for contrast. In these cases, it was difficult to see an evoked response above the background signal by eye, however a significant peak was still detected for the one best source in each case. The time course of figure 5 (e) had a poor baseline and that of figure 5 (f) was a borderline case where only 2 sources passed the random average screening and the significant peak was rather late, at 770 ms (possibly due to offset of the stimulus at 500 ms).
4. Conclusions
We have demonstrated 3 criteria for validation of the averaged flash evoked response of the fetus (statistical significance of the time course, proximity of significant modeled source to the estimated fetal head location, and observation of physiologically plausible time course morphology) for 25 of a sample of 27 subjects with gestational age between 30 and 36 weeks. The environmental and biomagnetic interference was suppressed by using a beamformer model search over a plausible abdominal volume based on geometrical constraints including proximity to the fetal heart. A statistical screening was employed to limit the false detection of an evoked response by the chance average of residual interference; estimating a background distribution from a set of randomized averages and testing the null hypothesis that a candidate response could be drawn from that background distribution. The screened source with the largest SNR time course was then accepted as the most plausible model and response latency was estimated from the earliest statistically significant post-stimulus peak.
The response time course and associated latency was quite variable and it was not possible to demonstrate the expected decrease in latency with increasing gestational age. Perhaps SNR is still insufficient and additional processing refinements may be required. Alternatively, the large variability in the EEG evoked response latency of premature newborns (Umezaki and Morrell 1970) suggests that we may be able to demonstrate the correlation with a larger sample size. Further studies are needed to develop a refined classification scheme for visual evoked fields in the fetus. We also intend to apply this processing method to auditory paradigms and to test other neurophysiological processes such as habituation.
The search grids were determined with rough estimates for physiological dimensions. An improved procedure would be to use ultrasound/MEG co-registration or at least individual anthropomorphic data from the ultrasound exam for beamformer search grid constraints and head model size. It is also recognized that the heart model is poor but both these limitations were deemed adequate and within the required precision given the ambiguity. Future work will include development of a method for co-registration of 3D ultrasound fetal head images with MEG. We also intend to improve the reliability of the ultrasound-guided placement of the 4th localization coil so that a 3D ultrasound procedure may not be routinely necessary.
Acknowledgements
The work was supported by the U.S. National Institutes of Health under grants R01NS36277-04A1 and R33EB000978-01A1.
References
- Cheyne D, Bostan AC, Gaetz W, Pang EW. Event-related beamforming: A robust method for presurgical functional mapping using MEG. Clin Neurophysiol. 2007 doi: 10.1016/j.clinph.2007.05.064. doi:10.1016/j.clinph.2007.05.064. [DOI] [PubMed] [Google Scholar]
- Eswaran H, Wilson JD, Preissl H, Robinson SE, Vrba J, Murphy P, Rose DF, Lowery CL. Magnetoencephalographic recordings of visual evoked brain activity in the human fetus. The Lancet. 2002;360:779–80. doi: 10.1016/s0140-6736(02)09905-1. [DOI] [PubMed] [Google Scholar]
- Eswaran H, Lowery CL, Wilson JD, Murphy P, Preissl H. Functional development of the visual system in human fetus using magnetoencephalography. Experimental Neurology. 2004;190:S52–S58. doi: 10.1016/j.expneurol.2004.04.007. [DOI] [PubMed] [Google Scholar]
- Huotilainen M. Magnetoencephalography of the newborn brain. Semin Fetal Neonatal Med. 2006;11(6):437–43. doi: 10.1016/j.siny.2006.07.003. [DOI] [PubMed] [Google Scholar]
- McCubbin J, Robinson SE, Cropp R, Moiseev A, Vrba J, Murphy P, Priessl H, Eswaran H. Optimal reduction of MCG in fetal MEG recordings. IEEE Trans. Biomed. Eng. 2006;53:1720–4. doi: 10.1109/TBME.2006.876619. [DOI] [PubMed] [Google Scholar]
- McCubbin J, Murphy P, Wilson JD, Eswaran H, Preissl H, Robinson SE, Yee T, Vrba J, Lowery CL. Improved flash evoked response from fetal MEG. Int. Congr. Ser. 2007a doi: 10.1088/0031-9155/52/19/005. doi:10.1016/j.ics.2006.12.054 in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCubbin J, Yee T, Vrba J, Robinson SE, Murphy P, Eswaran H, Preissl H. Bootstrap significance of low SNR evoked response. J Neuroscience Methods. 2007b doi: 10.1016/j.jneumeth.2007.10.003. submitted JNEUMETHD-07-00276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson SE. Localization of Event-Related Activity by SAM(erf) Neurol Clin Neurophysiol. 2004;2004:109. [PubMed] [Google Scholar]
- Rose DF, Eswaran H. Spontaneous neuronal activity in fetuses and newborns. Exp. Neurol. 2004;190:S28–S36. doi: 10.1016/j.expneurol.2004.06.026. [DOI] [PubMed] [Google Scholar]
- Stockard-Pope JE, Werner SS, Bickford RG. Atlas of Neonatal Electroencephalography. Raven Press; New York: 1992. pp. 105–75. [Google Scholar]
- Umezaki H, Morrell F. Developmental study of photic evoked responses in premature infants. Electroenceph. Clin. Neurophysiol. 1970;28:55–63. doi: 10.1016/0013-4694(70)90008-8. [DOI] [PubMed] [Google Scholar]
- Vrba J, Robinson SE, McCubbin J, Lowery CL, Eswaran H, Wilson JD, Murphy P, Preissl H. Fetal MEG redistribution by projection operators. IEEE Trans. Biomed. Eng. 2004;51:1207–18. doi: 10.1109/TBME.2004.827265. [DOI] [PubMed] [Google Scholar]
- Vrba J, Robinson SE, McCubbin J, Lowery L, Eswaran H, Murphy P, Preissl H. Searching for the best model: ambiguity of inverse solutions and application to fetal magnetoencephalography. Phys. Med. Biol. 2007;52:757–76. doi: 10.1088/0031-9155/52/3/016. [DOI] [PubMed] [Google Scholar]
- Zhao H, Wakai RT. Simultaneity of foetal heart rate acceleration and foetal trunk movement determined by foetal magnetocardiogram actocardiography. Phys. Med. Biol. 2002;5:839–46. doi: 10.1088/0031-9155/47/5/310. [DOI] [PubMed] [Google Scholar]