Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Jan 1.
Published in final edited form as: IEEE Trans Biomed Eng. 2020 Dec 21;68(1):78–89. doi: 10.1109/TBME.2020.3004548

Ballistocardiogram artifact reduction in simultaneous EEG-fMRI using deep learning

James R McIntosh 1,#, Jiaang Yao 2,#, Linbi Hong 3, Josef Faller 4, Paul Sajda 5
PMCID: PMC7808341  NIHMSID: NIHMS1656700  PMID: 32746037

Abstract

Objective:

The concurrent recording of electroencephalography (EEG) and functional magnetic resonance imaging (fMRI) is a technique that has received much attention due to its potential for combined high temporal and spatial resolution. However, the ballistocardiogram (BCG), a large-amplitude artifact caused by cardiac induced movement contaminates the EEG during EEG-fMRI recordings. Removal of BCG in software has generally made use of linear decompositions of the corrupted EEG. This is not ideal as the BCG signal propagates in a manner which is non-linearly dependent on the electrocardiogram (ECG). In this paper, we present a novel method for BCG artifact suppression using recurrent neural networks (RNNs).

Methods:

EEG signals were recovered by training RNNs on the nonlinear mappings between ECG and the BCG corrupted EEG. We evaluated our model’s performance against the commonly used Optimal Basis Set (OBS) method at the level of individual subjects, and investigated generalization across subjects.

Results:

We show that our algorithm can generate larger average power reduction of the BCG at critical frequencies, while simultaneously improving task relevant EEG based classification.

Conclusion:

The presented deep learning architecture can be used to reduce BCG related artifacts in EEG-fMRI recordings.

Significance:

We present a deep learning approach that can be used to suppress the BCG artifact in ECG-fMRI without the use of additional hardware. This method may have scope to be combined with current hardware methods, operate in real-time and be used for direct modeling of the BCG.

Keywords: Electroencephalography (EEG), functional magnetic resonance imaging (fMRI), artifact removal, ballistocardiogram, deep learning, gated recurrent unit (GRU)

I. Introduction

SIMULTANEOUS recording of electroencephalography (EEG) and functional magnetic resonance imaging (fMRI) presents a powerful approach for acquiring cortical data [1]. These simultaneous EEG-fMRI recordings hope to combine the high temporal resolution of EEG with the broad spatial sampling and superior spatial resolution of fMRI [2]. While challenges exist in fusing information from the two modalities (for example, see [36]), the technique has been widely used, for example in studies of basic human neuroscience studies of attentional orienting [79], perceptual decision making [1012], value based decision making and reward processing [1315] and analysis of resting state [16] as well as clinical applications such as epileptic event localization [17, 18].

Despite the advantages that EEG-fMRI has to offer, one of the major difficulties of using the technique is the presence of various artifacts in the EEG data during concurrent fMRI-based image acquisition. One major artifact is known as the gradient artifact (GA), which arises due to the rapid switching of magnetic fields inside MRI scanners during fMRI recordings. Despite the large GA amplitude, it is possible to deal with these artifacts successfully through average artifact subtraction (AAS) [19], or optimal basis set (OBS) extensions since the GA repeats in a stereotypical way [20]. Another prominent artifact, which is our primary concern in this paper is known as the ballistocardiogram (BCG) [21, 22]. The BCG artifact arises due to the motion of EEG electrodes in the static magnetic field of the fMRI scanner. This motion is likely predominantly due to head movements during cardiac cycles [23] and local pulsatile movement of the scalp [24], both of which occur due to varying blood flow through scalp vessels during cardiac rhythms. The BCG artifact is characterized by complex spatiotemporal dynamics (Fig. 1), where its waveforms can vary greatly from channel to channel, as well as its intricate temporal relation with cardiac rhythms captured by the electrocardiogram (ECG).

Fig. 1.

Fig. 1.

Example ECG and accompanying BCG corrupted EEG. Top: Example ECG trace for three cardiac cycles. Middle: Corresponding corrupted EEG topography at marked times in the cardiac cycle. Bottom: Example BCG corrupted EEG trace over the corresponding time for channel T8 (top) and Pz (bottom), showing the diverse temporally shifted non-linear transformation from ECG to BCG. Data was processed as described in the methods section ‘EEG Pre-processing’.

Several previous studies have focused on BCG artifact removal, with one of the earliest proposed methods being AAS [19]. However, unlike the GA, the BCG does not have a clearly time locked indicator. To deal with this problem, the ECG is pre-processed with a QRS complex detection method to produce a basis for the time locking, however, robust QRS complex detection is not trivial, and becomes increasingly difficult in high static field environments [24]. Additionally, the QRS complex event time is itself not indicative of the corresponding peak in the BCG, so that to subtract around this point the BCG is often shifted by a fixed delay of 210 ms [21] (but see [25]). Ultimately, the weakness of AAS is that the BCG does not form clean repeats around the shifted QRS complex peak. To address this, another method developed both for GA and BCG is OBS [20]. In this algorithm, an AAS step is used in combination with principal component analysis (PCA) to retain signal components strongly related to the BCG, these are then subtracted in adaptive fashion from the corrupted signal. This algorithm may be an improvement over AAS, however, the method still relies on QRS complex detection and is dependent on the number of components marked as artifacts, which can potentially interfere with the interpretation of the underlying neural data. This is in part because BCG artifacts may not always be linearly separable from neural activity [25]. Other software based methods, for example based on independent component analysis (ICA) [2628], or harmonic regression [29] have also been suggested, but it is not yet clear whether they provide systematic advantages over OBS.

Deep learning [30] has gained popularity in recent years, and its use in neuroscience and related fields has become pervasive. For example, a small selection of applications include generic task relevant classification based on EEG activity [31, 32], markerless motion tracking [33], modeling and characterizing neurons in visual cortex [34], modeling visual processing [35], modeling and analysis in psychiatric and neurological disorders [36], and neuron tracing and segmentation [37, 38].

In this paper, we propose a novel pipeline for BCG suppression using recurrent neural networks (RNN) [39, 40], specifically, gated recurrent units (GRU) [41] configured in a deep architecture. After the successful completion of a standard EEG-fMRI recording, by assuming that cardiac rhythms do not contain significant cortical information, we train our network (termed BCGNet) on ECG and BCG corrupted EEG data, in order to uncover the nonlinear state dependent mapping between ECG and BCG signals. This mapping can then be subsequently used for subtracting the BCG waveforms from the EEG data. Our method generates robust and reliable prediction of BCG waveforms and cleaned EEG data, and importantly does not remove the signal reflecting the underlying neural activity, as it improves single-trial analysis of the EEG.

II. Methods

A. Subjects and Behavioral Paradigm

A total of 25 subjects were scanned and six were excluded from further analysis. Subjects were rejected due to: missing neuroimaging data (two), low quality ECG recording (one), excessive movement (one), drowsiness and inability to complete the task as per instruction (one), EPI image quality affected by electrode adjustment (one). For the final 19 subjects included in this study, 6 were male and 13 were female. Age ranged from 18 to 32, with a mean of 25.9 and a standard deviation of 3.6 years. All participants had normal or corrected-to-normal vision and no history of psychiatric illness or head injury. The Columbia University Institutional Review Board (IRB) approved all experiments and informed consent was obtained before the start of each experiment.

An auditory oddball paradigm (see [42]), was performed by subjects while fMRI and EEG were simultaneously recorded. The paradigm included 80% standard and 20% oddball (target) stimuli, where standard stimuli were pure tones with a frequency of 350 Hz, while the oddball stimuli were broadband (laser gun) sounds. Stimuli lasted for 200 ms with an inter-trial interval (ITI) sampled from a uniform distribution between 2 s and 3 s. Stimuli were presented through MR compatible earphones, and subjects were instructed to respond to oddball sounds as quickly and as accurately as possible, by pressing a button on an MR-compatible button box (PYKA, Current Designs, PA, USA). Subjects were told to ignore standard tones.

Every subject was scheduled to complete five runs in total (105 trials per run), with an average of 4.7 runs per subject (range of three to five, standard deviation of 0.7). At the processing stage, we further rejected a single run from three subjects as we could not perform either successful GA removal or QRS detection on them so that the final average number of runs per subject was 4.6 (range of two to five, standard deviation of 0.98).

B. Simultaneous EEG and fMRI Data Acquisition

EEG and fMRI were recorded inside a 3T Siemens Prisma scanner using a 64 channel head coil. During the task, functional echo planar imaging (EPI) data were collected with 3 mm in-plane resolution and 3 mm slice thickness. 42 slices of 64 x 64 voxels were acquired using a 2100 ms repetition time (TR) and 25 ms echo time (TE). EEG was recorded with a 64 channel BrainAmp MR Plus system (Brain Products, Germany), at a sampling rate of 5 kHz. The 64 channels include 63 cap electrodes and 1 ECG electrode in an extended 10–20 configuration with ground electrode at AFz and reference electrode at FCz. Inside the scanner, we used sandbags to stabilize the amplifiers against any potential vibrations caused by the scanner. To avoid amplifier saturation and ensure high SNR, we used the following settings in the Brain Products Recorder environment: 1) Voltage resolution at 0.5 μV, 2) high-pass cut-off frequency at 0.1 Hz, 3) low-pass cut-off frequency at 250 Hz and 4) all electrode impedances controlled to be under 20 kΩ

C. EEG Pre-processing

Raw data was initially imported into MATLAB using the EEGLAB toolbox [43]. We subsequently low-pass filtered with a non-causal finite impulse response (FIR) filter, cutoff at 70 Hz and then performed GA removal on the dataset using FASTR [44], part of the FMRIB plugin for EEGLAB, provided by the University of Oxford Centre for Functional MRI of the Brain (FMRIB) [20, 45].

The data was resampled to 500 Hz, and non-causally high-pass filtered at 0.25 Hz to reduce electrode drift, while not interfering with cardiac related information. We will refer to the output of this processing step as “BCG corrupted EEG” (BCE, see Fig. 2a).

Fig. 2.

Fig. 2.

Processing pipeline and network architecture. (a) Major processing steps performed on the raw data to generate OBS and BCGNet cleaned datasets. After GA removal and resampling, data is processed independently by the OBS method and by BCGNet. Equivalent test epochs that were not used for BCGNet training are then compared between the two methods. Note that the comparison step refers only to our power spectral density measures (Fig. 4), as the evoked response and classification based analyses use all the data, not just the matched test set epochs. (b) Standard BCGNet architecture with layer sizes in parentheses. Additional L2 regularization parameters were also used (recurrent: N1 = 0.096, N2 = 0.090, N5 = 0.024, N6 = 2.5 × 10−7; activation: N1 = 0.030, N2 = 0.013, N5 = 0.067, N6 = 0.055).

D. BCGNet based BCG Suppression

In our proposed method, we made use of the MNE package [46, 47] to load our BCG corrupted EEG into Python (v3.7). Data was further resampled to 100 Hz due to computational constraints, and each channel was normalized to zero-mean and unit variance within each run. Data was then epoched into consecutive 3 s chunks. We were concerned that epochs with very large artifacts may be detrimental for our training procedure, and we consequently took the magnitude of the epochs, averaged over channels and time and used these values to calculate the median absolute deviation. Any epochs corresponding to a value greater than five times the median absolute deviation was then rejected (this resulted in 2.7 % of epochs being rejected). Epochs were then randomly assigned to a training set (70 %), validation set (15 %) or test set (15 %). After BCG signal prediction was carried out, the downsampling step was reversed by using a Piecewise Cubic Hermite Interpolating Polynomial (PCHIP) [48] to upsample our predicted BCG signal so that it could be subtracted from our BCG corrupted EEG.

E. Training

While several architectures were tested, our general approach was to stack regularized GRU layers, mapping the ECG to the BCG corrupted EEG, through a final linear dense layer. Specifically, our network takes in 1-channel ECG data and outputs 63-channel BCG predictions. Our loss function was designed to minimize the mean squared error (MSE) with the Adam optimizer [49]. Training was done on an NVIDIA GeForce RTX 2080 and Tesla P100 GPUs with CUDA 10.0 and cuDNN v7.1, in Tensorflow v1.14 [50], using the Keras API [51], and CuDNNGRU layers were used to speed up training. Early stopping based on the validation set using a liberal patience of 25 epochs was used, allowing selection of model weights that generate the lowest MSE on the validation set.

F. Architecture Selection

In order to explore the large space of neural network models that may be suitable for BCG prediction from ECG, we began by applying a simple architecture on run one of all subjects. The architecture we used consisted of three hidden bidirectional layers, each with 16 GRU cells, followed by a dense layer with a linear activation function. We then sub-selected three subjects that span the range of test set MSE (one subject from each of the lowest, middle and highest quintile). We subsequently used Hyperas [52], a Python package that combines Hyperopt [53] and Keras, for hyperparameter tuning on these three subjects. In particular, we were interested in knowing the number of hidden and non-hidden layers, the number of cells in each layer, as well as values for regularizing parameters. This approach enabled hyperparameter tuning and exploration of the space of architectures in a reasonable time. With the results of the Hyperas runs, we examined the patterns of the best performing architectures and used common properties to construct a more general architecture which is shown in Fig. 2b, which we refer to as BCGNet throughout this manuscript.

G. Optimal Basis Set BCG Reduction

In order to generate a comparison for our proposed method, we used the FMRIB EEGLAB plugin to perform BCG artifact removal. The BCG corrupted EEG was high-pass filtered at 2.5 Hz, and QRS complex detection [54, 55] was performed. A separate copy of the BCG corrupted EEG (with the 2.5 Hz high-pass filtering step omitted) was then fed to the FMRIB plugin’s BCG suppression function combined with the generated QRS complex event times. The BCG suppression function was run in OBS mode, with the number of bases set to four (the default).

H. Validation

1). Power Spectral Density Measures

We estimated the power spectral density (PSD, Welch’s method with a Hanning window) for all EEG channels in each test epoch for BCGNet, OBS and the BCG corrupted EEG. In order to compare the performance of the two methods under investigation, we then calculated the ratio of the PSD for BCGNet and OBS against the BCG corrupted EEG signal from its PSD. This provides us with a measure of power reduction of the two BCG suppression techniques for all frequencies, runs and channels. The median was then calculated across the frequency band of interest (either delta 0.5 to 4 Hz, theta 4 to 8 Hz, or alpha 8 to 13 Hz), test epochs and channels of interest (including the average of all channels), and this quantity was then averaged with an arithmetic mean across runs. A Wilcoxon signed-rank test across subjects, between methods was then carried out on the channels and bands of interest and Bonferroni-Holm corrected across frequency bands for T8, Pz and the channel average independently.

2). QRS related analysis

We were curious as to whether power reductions of our method over OBS are solely due to a non-reliance on QRS complex detection. We calculated the standard deviation of the difference between consecutive QRS markers (ΔQRS) as an estimate of the quality of QRS detection. We then fitted a linear model relating the alpha band amplitude reduction of BCGNet vs OBS averaged over all channels (see previous section) against the quality of the QRS detection. We tested this linear model against a constant model by using an F-test.

One potential failure mode of OBS is due to small misalignments in detection of the QRS complex, or changes in the QRS complex over time. While BCGNet does not rely on QRS detection, it could predict BCG that is the wrong shape or misaligned to the real BCG. We considered the case where the occipital alpha band is of interest, and hypothesized that these failure modes would lead to a distortion of the QRS-locked alpha band amplitude average. To examine this systematically, we calculated the QRS locked alpha band amplitude at Oz after current source density estimation (CSD) [56]. We non causally filtered the data with an FIR filter from 8 to 13Hz. The Hilbert transform was then applied to calculate the analytic signal, and the magnitude of this was taken in order to produce an instantaneous estimate of the alpha band amplitude for the OBS and BCGNet methods. In order to test for a statistically significant difference in QRS locked alpha band amplitude we averaged within subject, and then performed a Wilcoxon signed-rank test across subjects for every time point (500 Hz) in the range −0.5 s to 0.5 s relative to QRS. We FDR corrected the result using the Benjamini–Hochberg procedure [57].

3). Evoked Response Analysis

While the PSD is generally informative about whether the BCG signal is being removed, it does not discriminate between suppression of BCG (or other source of artifact), and suppression of signal reflecting the underlying neural activity. In an attempt to constrain our analysis to the outcome of BCG suppression on this signal, we evaluated our within subject models on the entire ECG time series for each run. Note that for the power spectral density measures we proceeded by extracting the test epochs from the cleaned EEG time series and only performed analysis on these. This is because, for practical applications of the method as we are demonstrating for the recovery of evoked responses it is more convenient to evaluate a model on the entire dataset (for a single subject) in order to avoid discontinuities. This is acceptable since for this use case the application is to remove the BCG as well as possible, and not to evaluate its performance on unseen data. We note that the same is true for OBS where the basis functions are directly extracted and fitted to the data. We compare the OBS method with our BCGNet method after current source density estimation (CSD) [56], applying a further 0.5 Hz high-pass and 50 Hz low-pass [58]. Data was epoched into sections from 0.5 s to +1.5 s relative to standard and target stimuli.

In BCG corrupted EEG, the across trial standard deviation of evoked responses is likely to be large and dominated by different time portions of the BCG. It follows that one way to assess the quality of the BCG suppression is to investigate how the standard deviation across trials is reduced for the methods under test. To do this, we calculate the time-locked and time-resolved standard deviation across standard trials, and separately target trials for each subject and each channel of interest. In order to compare this measure across methods we averaged it from 0.25 s to 0.75 s post stimulus and performed a Wilcoxon signed-rank test across subjects. Bonferroni-Holm correction was applied across comparisons for the displayed channels, T8 and Pz, as well as the channel average independently.

4). Task Relevant Single-Trial Classification

Another method to compare BCG suppression techniques which gets at the heart of whether we are disrupting task-relevant EEG signal is to build a classifier that discriminates between targets and standard stimuli. In order to do this, we initially filtered and epoched the data as in our evoked response analysis. We then used logistic regression with L2 regularization [59] applied over features composed from all samples from all channels within 60 ms sliding windows moving in 20 ms steps (see [42, 58] for more details on this approach) to estimate the area under the receiver operator characteristic curve (AUC) as a measure of performance. AUC was averaged from the results of six-fold cross-validation, with a further internal six-fold cross validation to determine the regularization parameter. The regularization parameter was selected by following the "one-standard error" rule, where the largest value for the parameter is used such that the deviance is within one standard error of the minimum [60]. In order to compare classification performance for different BCG suppression methods AUC was averaged from 0.25 s to 0.75 s post stimulus, and a Wilcoxon signed-rank test across subjects was then carried out and Bonferroni-Holm corrected.

5). Across Subject BCGNet Training

After evaluating our model, and comparing its performance to the OBS method, we proceeded to train it on all training data from all subjects while holding one subject out entirely (this was done for each subject). Evaluation was then done on the held-out subject, followed by an evaluation on the test data set of the held-out subject after further training on their data set. This procedure enabled us to examine how well our model generalizes to new subjects, and how quickly our model can be trained on a new subject’s data if it has been pre-trained.

I. Open-access dataset analysis

We additionally analyzed the open access “EEG, fMRI and NODDI dataset” [61, 62] which is a resting state dataset. We performed power ratio analysis and examined the QRS-locked alpha as previously described. Additional analyses were also constructed to examine the recovery of alpha band amplitude over the course of the experiment for BCGNet suppressed BCE compared to the OBS method. See supplemental materials for further information. See supplemental materials for further information.

III. Results

A. BCGNet Models BCG Signal when Trained Within-subject

We initially compare the performance of OBS and BCGNet at suppressing BCG signal power by examining the PSD in test epochs. We show a single subject example for channels T8 and Pz in Fig. 3, demonstrating that BCGNet can predict the progression of the BCG time series despite its inhomogeneous behavior across channels.

Fig. 3.

Fig. 3.

Comparison of BCGNet and OBS methods for BCG suppression in a single subject. (a) ECG over a selected time period. (b) Top: BCGNet predicted BCG for channel T8 in a test epoch (blue), overlaid on gradient artifact removed BCG corrupted EEG (BCE) signal (gray). Bottom: Corrupted EEG with subtracted BCGNet predicted BCG (light green) as well as OBS cleaned EEG for comparison (dark green). (c) Power spectral density of corrupted EEG, BCGNet cleaned EEG, and OBS cleaned EEG for channel T8. OBS shows a reduction in power over the corrupted EEG signal as expected, but BCGNet shows substantial further reduction. Shaded regions show 2 × SEM. Note that the large variance at 20 Hz is due to residual GA. (d) Same as (b) but for channel Pz. (e) Same as (c) but for channel Pz.

Fig. 4 shows the power ratio divided by EEG bands for all subjects calculated from the test epochs. Interestingly when considering an average over channels, it appears that for low frequencies (delta) OBS is approximately as successful as BCGNet at power reduction with a median power ratio difference (MD) of −0.03 (p = 0.334, BCGNet subtracted from OBS), with larger differences in specific channels (see supplemental Fig. S3). On the other hand, BCGNet is clearly more successful in theta (MD = 0.06, p = 0.003) and alpha bands (MD = 0.10, p = 4 × 10−4). We were concerned that part of the reason why BCGNet does not outperform OBS in the delta band (and in fact displays higher power) is because OBS is too aggressive and removes EEG signal. This is a lesser concern for BCGNet as the network is not able to directly adapt to the BCG corrupted EEG test epochs, and we investigate this in the following sections.

Fig. 4.

Fig. 4.

Comparison of BCGNet and OBS methods for BCG suppression for all subjects. (a) Boxplot of average power ratio in relevant EEG bands for OBS and BCGNet methods compared to BCG corrupted EEG (BCE) signal for channel T8. Linked grey dots represent individual subjects, median differences (MD) are presented as the median of the OBS – BCGNet difference. Hinges represent first and third quartile and whiskers span the range of the data not considered outliers (approximately ±2.7σ). Both methods strongly reduce signal power, with BCGNet appearing more effective in the theta and alpha band. (delta MD = −0.03, p = 0.171; theta MD = 0.06, p = 0.006; alpha MD = 0.09, p = 4 × 10−4). (b) Same as (a), but for the channel Pz (delta MD = −0.01, p = 0.717; theta MD = 0.03, p = 0.002; alpha MD = 0.11, p = 6 × 10−4). BCGNet reduces power more effectively in the theta and alpha bands. (c) Same as (a), but for the across channel average. BCGNet power reduction is significantly larger than for OBS in the theta and alpha band (delta MD = −0.03, p = 0.334; theta MD = 0.06, p = 0.003; alpha MD = 0.10, p = 4 × 10−4). OBS more effectively reduces power in the delta band than BCGNet, although this difference does not appear to be statistically significant. See supplemental Fig. S3 for scalp topographies.

B. Relation to QRS Complex Detection

One reason for BCGNet performance generally exceeding that of OBS, may be due to the QRS complex detection, since OBS relies on detection of all QRS complexes. In order to investigate this, we calculated the power ratio of the alpha band power (as for Fig. 4), where the relative power reduction of BCGNet is most striking. We then modeled the change of this ratio between OBS and BCGNet as linearly dependent on the ΔQRS standard deviation. The ΔQRS standard deviation was used as a measure of QRS detection quality. We found (see Fig. 5) a relation between the fraction of outlier ΔQRS and the change in power ratio (F-test vs constant model, p = 0.015, n = 19), suggesting that indeed BCGNet is relatively robust to cases where OBS fails due to poor QRS detection. It should be noted however, that this effect appears to be driven by a small subset of subjects, and that BCGNet generally reduces alpha band power more than OBS for most subjects.

Fig. 5.

Fig. 5.

BCGNet reduces BCG signal power in the alpha band more effectively than OBS regardless of QRS complex detection quality, however the improvement may be more pronounced when QRS detection quality is worse. (a) Ratio of cleaned EEG to corrupted EEG in the alpha band (8 to 13Hz) averaged over all channels for the BCGNet (light green) and OBS (dark green) method. (b) Difference in power ratio between BCGNet and OBS from (a) plotted against ΔQRS standard deviation. Solid red line shows fit (F-test vs constant model, p = 0.015, n = 19) indicating a relation between BCGNet improvement over OBS and QRS detection quality. Dashed lines correspond to 95 % confidence intervals. (c) Representative single subject example of alpha band amplitude extracted from Oz locked to QRS detection. BCGNet recovered QRS locked band amplitude is lower than for OBS, demonstrating lower systematic impact of the BCG signal. Shaded region represents 2 x SE across QRS detections. (d) Effect shown in (c) for a single subject is present across subjects. Natural logarithm of p-value derived from Wilcoxon signed-rank test between BCGNet and OBS recovered QRS locked alpha band amplitude. Gray horizontal lines indicate significant differences (p < 0.05) FDR corrected using Benjamini–Hochberg procedure) and all correspond to median alpha band amplitude being lower for BCGNet recovery than for OBS recovery.

We hypothesized that the alpha band amplitude locked to QRS detections should not show systematic modulation around QRS if BCG is successfully removed. Fig. 5c shows an example of this for OBS and BCGNet for a single representative subject, while both methods clearly show some alpha band amplitude modulation BCGNet shows an overall greater reduction. To investigate this systematically across subjects, we performed a Wilcoxon signed-rank test between BCGNet and OBS at each time point around QRS detection. As shown in Fig. 5d, we found that BCGNet reduced the QRS complex locked alpha amplitude at multiple time points.

C. BCGNet Improves Task Relevant Metrics

Because power reductions in the BCG corrupted EEG signal can have multiple sources, we further compare OBS and BCGNet using task relevant metrics. We expect that if BCGNet is able to reduce BCG more effectively than OBS (rather than simply signal power), that we should see a reduction in standard deviation of the evoked responses. Indeed, as shown in Fig. 6 this is what we find. In particular, when examining the average standard deviation across channels for targets, we find that OBS reduces evoked standard deviation over the gradient artifact removed BCG corrupted EEG (BCE) signal (p = 8 × 10−4, Wilcoxon signed-rank test), and in turn BCGNet reduces the evoked standard deviation over OBS (p = 9 × 10−4) and the BCE signal (p = 5 × 10−4). This is broadly replicated for specific channels and for both standards and targets as shown in Fig. 6.

Fig. 6.

Fig. 6.

BCGNet does not alter the evoked response when compared to OBS, however standard deviation of evoked response is reduced. (a) Mean across subject evoked response to stimulus (standard or target presented at 0 s) for gradient artifact removed BCG corrupted EEG (BCE) signal for T8 and Pz channels. Shaded region represents 2 x SE across subjects. BCGNet and OBS methods not shown due to the similarity to the BCE mean evoked potential. (b) Boxplot of standard deviation of evoked responses in the task relevant regions from 0.25 s to 0.75 s post stimulus for individual subjects. Linked grey dots represent individual subjects, for T8 we found no significant difference between BCGNet and OBS for standards (st) or targets (ta), although the difference was present between either method and the BCE signal (BCE/OBS st: p = 8 × 10−4, ta: p = 5 × 10−4; BCE/BCGNet st: p = 8 × 10−4, ta: p = 7 × 10−4; OBS/BCGNet st: p = 0.053, ta: p = 0.016). Hinges represent first and third quartile and whiskers span the range of the data not considered outliers (approximately ±2.7 σ). (c) Boxplots as in (b) but for channel Pz. Here, both methods reduce task relevant standard deviation relative to BCE, although BCGNet further reduces task relevant standard deviation of the evoked responses relative to OBS (BCE/OBS st: p = 9 × 10−4, ta: p = 0.002; BCE/BCGNet st: p = 8 × 10−4, ta: p = 8 × 10−4; OBS/BCGNet st: p = 9 × 10−4, ta: p = 7 × 10−4). (d) Boxplots as in (b) for the standard deviation averaged across channels. As in (c), standard deviation is reduced by OBS from the BCE signal, and further reduced by BCGNet (BCE/OBS st: p = 8 × 10−4, ta: p = 8 × 10−4; BCE/BCGNet st: p = 7 × 10−4, ta: p = 5 × 10−4; OBS/BCGNet st: p = 9 × 10−4, ta: p = 9 × 10−4) See supplemental Fig. S4 and Fig. S5 for scalp topographies.

A yet more direct method to examine how OBS and BCGNet impact the EEG signal in a task relevant manner is to attempt to use the cleaned signal for classification. As shown in Fig. 7, and consistent with our hypothesis, we find that BCGNet increases classification accuracy over the BCE signal (p = 4 × 10−4). We found a similar pattern of results for band amplitude-based classification (see Fig. S1 in the supplemental material). Interestingly, we also see a low p-value (p = 0.053) corresponding to OBS decreasing classification accuracy over BCE, indicating that it may reduce task relevant SNR.

Fig. 7.

Fig. 7.

Overall classification quality is modestly but consistently improved by application of BCGNet when compared to OBS. (a) Average across subject receiver operator characteristic area under the curve (AUC) corresponding to standard vs target sliding window classification (shaded region corresponds to 2 x SE across subjects) for BCG corrupted EEG (BCE) signal. (b) Boxplots of AUC averaged in the task relevant region from 0.25 s to 0.75 s post stimulus for the BCE and tested BCG suppression method. Hinges represent first and third quartile and whiskers span the range of the data (MDBCE-OBS = 0.004, p = 0.053; MDBCE-BCGNet = −0.011, p = 4 × 10−4; MDOBS-BCGNet = −0.015, p = 4 × 10−4).

D. Across Subject Generalization

We performed a ‘leave-one-subject-out’ experiment and show that the network struggles to generalize in its current form. This is no surprise, as there is substantial variability across subjects in the properties of the BCG. Upon retraining our model on the training set of the held-out subject, our power suppression and classification performance exhibit the same general pattern when compared to OBS (Fig. 8ab). When compared to direct within subject training however, the retrained network power ratio is significantly higher (see Fig. S2a). The main advantage of using a pre-trained network is that it reduces further computational cost, in this case cutting the median number of training epochs from 59 to 10 (see Fig. S2b).

Fig. 8.

Fig. 8.

Transfer in network trained across subjects. (a) Power ratio for BCGNet trained on all subjects and evaluated on held-out subjects for average channel (gray, pre-trained, PT) demonstrates poor performance which is improved upon substantially by within subject retraining (light blue, retrained, RT). OBS method (dark green) as in Fig. 4c plotted for comparison. Bonferroni-Holm correction was carried out across bands and networks, although we limited our tests to PT compared to RT (delta MDPT-RT = 0.27, p = 8 × 10−4; theta MDPT-RT = 0.35, p = 8 × 10−4; alpha MDPT-RT = 0.27, p = 7 × 10−4) and RT compared to OBS(delta MDRT-OBS = 0.06, p = 0.018; theta MDRT-OBS = −0.04, p = 0.018; alpha MDRT-OBS = −0.05, p = 0.016). (b) Boxplots of AUC calculated from retrained BCGNet and OBS method averaged in the task relevant region from 0.25 s to 0.75 s post stimulus. Hinges represent first and third quartile and whiskers span the range of the data (MDRT-OBS = 0.018, p = 1 × 10−4).

E. Open-access dataset analysis

We replicated the power ratio analysis as well as the QRS-locked alpha band analysis in an open-access resting state dataset. Power ratio analysis after averaging channels shows that OBS is not significantly different from BCGNet in the delta band, with a median power ratio difference MD = 0.00 (p = 1.00, BCGNet subtracted from OBS). However, BCGNet is more successful in theta (MD = 0.07, p = 0.001) and alpha bands (MD = 0.10, p = 0.002) when looking at the average across channels. These results, as well as their topographies (Fig. S6, equivalent of Fig. S3) appear highly consistent with our results in the go/no-go task. Fig. S7 shows the QRS locked alpha band (equivalent of Fig. 5cd), demonstrating that as for our main dataset, BCGNet reduces QRS locked alpha band amplitude.

We additionally examined the progression of alpha band amplitude during the experiment and demonstrate an expected increase over the course of the experiment which is however coupled with a lower overall power when the BCE is processed with BCGNet (Fig. S8).

IV. Discussion

We have shown that on average and across channels our method reduces power in several standard EEG frequency bands, while simultaneously improving task relevant metrics over the conventional OBS method. In combination, we interpret these results as strong evidence that BCGNet suppresses BCG artifact more effectively than OBS. Furthermore, we have shown that a pre-trained network can substantially aid training when applied in a new subject, and acts to enable fast learning of the BCG structure.

A subtlety appears to be that in the delta band, suppression of power for BCGNet is not significantly higher than for OBS, and a general trend in the opposite direction is apparent in some channels (see supplemental Fig. S3). However, results of classification which are likely dominated by low frequency content, based on OBS and BCGNet cleaned signals, indicate better signal quality after processing with BCGNet. It could be argued that this occurs because BCGNet is relatively more effective at higher frequencies (theta and alpha bands) than OBS, however because classification accuracy based on the OBS cleaned signal shows a trend for being lower than accuracy on the gradient artifact removed BCG corrupted EEG signal, this may be in fact due to OBS suppressing signal reflecting underlying neural activity at low frequencies. Taken together, these results point to BCGNet being more successful at BCG artifact reduction independent of band frequency.

The effect size associated with the statistically significant improvement in classification accuracy that we observed may be characterized as modest. This is to be expected since the act of classification is itself finding a low dimensional feature space that is orthogonal to the BCG signal. In other words the classification procedure itself should already be largely immune to corruption by the BCG signal, and only suffers in so much as the BCG signal is inseparable from the task related signal in a linear subspace. Detectable improvement in classification accuracy is therefore a non-trivial example of the strength of the presented method.

A. Limitations

In general, most BCG signal suppressing methods have more constrained models than what we propose in this work. However, the potential removal of EEG reflecting the underlying neural activity is not unique to our method, but also present when any attempt is made to construct BCG artifact with a subset of components extracted by relying on dimensionality reduction of the BCG corrupted EEG, such as ICA or PCA. However, in principle our method is particularly susceptible to specifically removing ECG related EEG signal in cases where it is present. One of our foundational assumptions is that the EEG is independent of the ECG. This is clearly not valid under all circumstances. For example, [63] discuss how the QRS complex timing is related to response times during a decision task, so it seems clear that the exact timing of the cardiac cycle is likely to have some influence on specific brain function. In fact AAS, and by extension OBS and many proposed methods also make the assumption that ECG and EEG are uncorrelated [22]. However, because deep neural networks are particularly powerful, our method may remove EEG signal that is dependent on ECG that standard methods may be unable to accurately model. This is likely to be a relevant criticism, however with current experimental paradigms it seems safe to assume that in general, any ECG related activity in EEG should be treated as a confound (as its source is ultimately unverifiable), so that its removal is not detrimental. Plainly, EEG-fMRI may not currently be a good experimental setting for studying heart-brain relations. In our task, because classification improves and inter-trial evoked response standard deviation decreases it seems that BCGNet did not have a particular disadvantage because of its likely ability to remove cardiac related EEG activity. In contrast to this, OBS based classification performance was reduced when compared to BCG corrupted EEG based performance which suggests that the method may have detrimentally removed task relevant signal.

We hypothesize that the main limitation of our method implemented in its current form is that we could not feed our input stage with sufficient information regarding sources of artifact. In principle, it is possible that our method can address some movement of the head when it is coupled with a change of the ECG, as the change in cardiac rhythm state progression may be detected and used as an indicator of a change in BCG state progression. Essentially, while our model weights are currently fixed at evaluation time, the model itself is in principle adaptable to changing circumstances (such as changes in heart rate). However, changes in the ECG-BCG mapping during an experiment (such as head motion) that are unrelated to the ECG are undetectable to our method and will likely result in estimation errors of the BCG.. Alternative approaches for BCG suppression that depend on hardware (rather than already discussed software methods) extensions that go beyond EEG and ECG appear increasingly promising. Most of these methods work by adding reference sensors to the head (or specific sources of noise), and then removing their contribution to the EEG signals in software with varying degrees of sophistication [6469]. While the reference layer often may not provide a perfect mirror for the BCG signal, and in general hardware solutions involve a certain overhead, methods that exploit reference signals are likely to provide an edge in the artifact removal process. We conjecture that our proposed method could be used to enhance the effectiveness of current reference layer methods, and address what we perceive to be the main limitation of our method - a lack of information regarding head movement and other sources of noise.

We have tested BCGNet in a resting state dataset and found results that are consistent with those found in our go/no-go paradigm, as well as analyzing changes in alpha band amplitude over the course of an experiment [70]. While the observed improvement in EEG-based classification performance in the go/no-go paradigm, along with the decrease in the standard deviation of task evoked EEG activity and suppression of QRS-locked artifactual activity in the alpha band make us confident that BCGNet will be a valuable tools for other experimental paradigms (such as in studies examining time-frequency relations [29]), this will need to be verified in future studies.

Another limitation of our current study is that we have not compared our results to data recorded outside of the range of the static magnetic field. Indeed, a specifically designed paradigm involving in and out of scanner experiments under different noise conditions would be ideal, and data from such recordings may enable generative adversarial extensions of our method that could prove to be yet more powerful.

B. Advantages

We believe that in finding the direct ECG to BCG mapping, we have identified a novel approach to BCG suppression, which may be further refined with a larger body of data.

The fact that our approach does not rely on summarizing the cardiac cycle in terms of discretely detectable QRS complex events appears to be one of the advantages of our method. It may be argued that the comparison between BCGNet and OBS is not entirely fair because OBS relies on QRS complex detection, so that if the QRS complex detection is poor OBS is at a disadvantage. While this is in part the point, as we do not get to choose whether or not any individual subjects has good or bad quality QRS complex detection, we have also shown that avoiding QRS complex detection is not the only advantage conferred by BCGNet. It has also been observed [24], that QRS complex detection generally becomes more difficult at high field strengths, making a lack of reliance on the method increasingly relevant.

A more subtle advantage of our method is that it remains applicable in unusual experimental settings. For example, in EEG-fMRI-TMS [71, 72] a TMS pulse might cause a small QRS complex like artifact in the ECG. Since the timing of these occurrences is known, a straight-forward extension of our method can allow us to continue running our model based on its own predictions of the upcoming ECG. At the same time, the EEG itself is corrupted much more strongly, making standard BCG suppression difficult, however our forward model can continue being evaluated enabling recovery of the EEG signal that only depends on how quickly it recovers from the TMS pulse.

One general difficulty in assessing BCG suppression quality is that the ground truth BCG signal is unknown. Unlike other current methods, the core of our method is designed to directly generate BCG from ECG. It is therefore possible to see how BCGNet might be used to simulate ground truth BCG signal, which can be augmented with simulated 1/f noise and brain derived sources in order to study the relative efficacy of other BCG suppression techniques. Furthermore, via extensions of BCGNet that either model the propagation of the ECG signal itself, or by directly injecting signal into the small central dense layer of the network, it may be possible to gain fine grained control of the BCG construction under test, for example, to study the different methods under changing heart-rate conditions.

One of the main advantages of our method is that because BCGNet is a software-based solution, it can be used directly with existing commercial hardware, and does not require specially designed equipment. We hypothesize that our proposed method has scope for improvement as a wider body of EEG-fMRI data becomes openly available. As previously mentioned, we also consider that with the appropriate sensing equipment BCGNet is extensible to removing a broader array of artifacts, for example electrooculography (EOG) or respiration artifacts. With the appropriate reference signal, it may also be useful in compensating for the helium pump artifact so that the Helium pump can remain on during EEG-fMRI scans [66, 73].

C. Real-Time Applications

Hardware reference based methods [66], as well as some newer software based methods [29] do not rely on QRS complex detection. Methods that rely on QRS complex detection are likely to be difficult to extend to real-time BCG suppression because, as previously mentioned, QRS detection at high field strengths is difficult, and removal of the BCG around an upcoming QRS complex is likely to require a prediction of that QRS complex time. Small errors in that prediction are likely to result in poor BCG suppression. On the other hand, similarly to hardware-based methods an RNN based approach allows us to implicitly estimate the instantaneous state of cardiac activity and make use of it directly. This avoids the explicit prediction of the upcoming QRS complex, and likely makes us robust to deviations in heart rate which may occur over the course of an experimental session. A disadvantage that our method has in real-time applications is that it needs to learn the ECG-BCG relationship for each subject, so a training step would need to be performed before the main experiment in this specific scenario. It is important to note that generalization of a trained BCGNet model to unseen data sets is not a key requirement to make the presented approach useful in offline (non-real-time) applications (although we highlight: this does not mean that the method cannot be used on new data, but simply that a new model with the same architecture must be retrained). In contrast to this in real-time applications this is an important requirement, where the model may be required to adapt to some unseen changes in ECG-BCG mapping over the course of an experimental session. The nature of this change and its impact on real-time applications will need to be assessed in future work.

V. Conclusion

We have developed a deep learning based method for direct modeling of the BCG from the ECG, which can be used to reduce BCG related artifact in EEG-fMRI recordings, beyond the commonly used standard without the use of additional hardware. This method has the potential to be extended for use in validation of other methods, and in real-time applications. We believe these are encouraging results that are likely to be improved upon with more data, and the merging with developing hardware methods to make use of simple reference signals to address artifacts that we could not cater for in the current iteration of BCGNet.

Supplementary Material

supp1-3004548

Acknowledgment

We thank Anna Jasper for comments. We acknowledge computing resources from Columbia University’s Shared Research Computing Facility project, which is supported by NIH Research Facility Improvement Grant 1G20RR03089301, and associated funds from the New York State Empire State Development, Division of Science Technology and Innovation (NYSTAR) Contract C090171, both awarded April 15, 2010.

This work was supported in part by the National Institute of Mental Health under grant R33MH106775; the United States Army Research Laboratory under Cooperative Agreement W911NF-10-2-0022 and W911NF-16-2-0008; and a seed grant for MR Studies Program of the Zuckerman Mind Brain Behavior Institute at Columbia University (CU-ZI-MR-S-0006). The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the United States government.

Contributor Information

James R. McIntosh, Department of Biomedical Engineering, Columbia University, New York, NY 10027, USA.

Jiaang Yao, Department of Biomedical Engineering, Columbia University, New York, NY 10027, USA.

Linbi Hong, Department of Biomedical Engineering, Columbia University.

Josef Faller, Department of Biomedical Engineering, Columbia University, and the Human Research and Engineering Directorate, U.S. Army Research Laboratory, Aberdeen Proving Ground, MD 21005, USA.

Paul Sajda, Department of Biomedical Engineering, and the Data Science Institute, Columbia University.

References

  • [1].Huster RJ et al. , “Methods for simultaneous EEG-fMRI: an introductory review,” J Neurosci, vol. 32, no. 18, pp. 6053–60, May 2012, doi: 10.1523/JNEUROSCI.0447-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Mulert C and Lemieux L, EEG - fMRI: Physiological Basis, Technique, and Applications. Springer Science & Business Media, 2010. [Google Scholar]
  • [3].Liu X and Sajda P, “A Convolutional Neural Network for Transcoding Simultaneously Acquired EEG-fMRI Data,” presented at the 9th International IEEE/EMBS Conference on Neural Engineering (NER), 2019/3, 2019, doi: 10.1109/NER.2019.8716994. [DOI] [Google Scholar]
  • [4].Deneux T and Faugeras O, “EEG-fMRI fusion of paradigm-free activity using Kalman filtering,” Neural Comput, vol. 22, no. 4, pp. 906–48, Apr 2010, doi: 10.1162/neco.2009.05-08-793. [DOI] [PubMed] [Google Scholar]
  • [5].Lemieux L et al. , “Event-related fMRI with simultaneous and continuous EEG: description of the method and initial case report,” Neuroimage, vol. 14, no. 3, pp. 780–7, Sep 2001, doi: 10.1006/nimg.2001.0853. [DOI] [PubMed] [Google Scholar]
  • [6].Oberlin T et al. , “Symmetrical EEG-FMRI imaging by sparse regularization,” presented at the 23rd European Signal Processing (EUSIPCO), Nice, 2015, doi: 10.1109/EUSIPCO.2015.7362708. [DOI] [Google Scholar]
  • [7].Goldman RI et al. , “Single-trial discrimination for integrating simultaneous EEG and fMRI: identifying cortical areas contributing to trial-to-trial variability in the auditory oddball task,” Neuroimage, vol. 47, no. 1, pp. 136–47, Aug 2009, doi: 10.1016/j.neuroimage.2009.03.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Walz JM et al. , “Simultaneous EEG-fMRI reveals a temporal cascade of task-related and default-mode activations during a simple target detection task,” Neuroimage, vol. 102, pp. 229–39, Nov 2014, doi: 10.1016/j.neuroimage.2013.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Walz JM et al. , “Simultaneous EEG-fMRI reveals temporal evolution of coupling between supramodal cortical attention networks and the brainstem,” J Neurosci, vol. 33, no. 49, pp. 19212–22, Dec 2013, doi: 10.1523/JNEUROSCI.2649-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Gherman S and Philiastides MG, “Human VMPFC encodes early signatures of confidence in perceptual decisions,” eLife, vol. 7, Sep 2018, doi: 10.7554/eLife.38293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Tu T et al. , “Network Configurations in the Human Brain Reflect Choice Bias during Rapid Face Processing,” J Neurosci, vol. 37, no. 50, pp. 12226–12237, Dec 13 2017, doi: 10.1523/JNEUROSCI.1677-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Muraskin J et al. , “A multimodal encoding model applied to imaging decision-related neural cascades in the human brain,” Neuroimage, vol. 180, pp. 211–222, Oct 2018, doi: 10.1016/j.neuroimage.2017.06.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Fouragnan E et al. , “Two spatiotemporally distinct value systems shape reward-based learning in the human brain,” Nat Commun, vol. 6, p. 8107, Sep 2015, doi: 10.1038/ncomms9107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Plichta MM et al. , “Simultaneous EEG and fMRI reveals a causally connected subcortical-cortical network during reward anticipation,” J Neurosci, vol. 33, no. 36, pp. 14526–33, Sep 2013, doi: 10.1523/JNEUROSCI.0631-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Andreou C et al. , “Theta and high-beta networks for feedback processing: a simultaneous EEG-fMRI study in healthy male subjects,” Transl Psychiatry, vol. 7, no. 1, p. e1016, Jan 2017, doi: 10.1038/tp.2016.287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Muraskin J et al. , “Brain dynamics of post-task resting state are influenced by expertise: Insights from baseball players,” Hum Brain Mapp, vol. 37, no. 12, pp. 4454–4471, Dec 2016, doi: 10.1002/hbm.23321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Abreu R et al. , “EEG synchronization measures predict epilepsy-related BOLD-fMRI fluctuations better than commonly used univariate metrics,” Clin Neurophysiol, vol. 129, no. 3, pp. 618–635, Mar 2018, doi: 10.1016/j.clinph.2017.12.038. [DOI] [PubMed] [Google Scholar]
  • [18].Thornton RC et al. , “Imaging haemodynamic changes related to seizures: comparison of EEG-based general linear model, independent component analysis of fMRI and intracranial EEG,” Neuroimage, vol. 53, no. 1, pp. 196–205, Oct 2010, doi: 10.1016/j.neuroimage.2010.05.064. [DOI] [PubMed] [Google Scholar]
  • [19].Allen PJ et al. , “A method for removing imaging artifact from continuous EEG recorded during functional MRI,” Neuroimage, vol. 12, no. 2, pp. 230–9, Aug 2000, doi: 10.1006/nimg.2000.0599. [DOI] [PubMed] [Google Scholar]
  • [20].Niazy RK et al. , “Removal of FMRI environment artifacts from EEG data using optimal basis sets,” Neuroimage, vol. 28, no. 3, pp. 720–37, Nov 2005, doi: 10.1016/j.neuroimage.2005.06.067. [DOI] [PubMed] [Google Scholar]
  • [21].Allen PJ et al. , “Identification of EEG events in the MR scanner: the problem of pulse artifact and a method for its subtraction,” Neuroimage, vol. 8, no. 3, pp. 229–39, Oct 1998, doi: 10.1006/nimg.1998.0361. [DOI] [PubMed] [Google Scholar]
  • [22].Debener S et al. , “EEG Quality: Origin and Reduction of the EEG Cardiac-Related Artefact,” in EEG - fMRI: Physiological Basis, Technique, and Applications, Mulert C and Lemieux L Eds. Berlin, Germany: Springer Science & Business Media, 2010, pp. 135–151. [Google Scholar]
  • [23].Nakamura W et al. , “Removal of ballistocardiogram artifacts from simultaneously recorded EEG and fMRI data using independent component analysis,” IEEE Trans Biomed Eng, vol. 53, no. 7, pp. 1294–308, Jul 2006, doi: 10.1109/TBME.2006.875718. [DOI] [PubMed] [Google Scholar]
  • [24].Debener S et al. , “Properties of the ballistocardiogram artefact as revealed by EEG recordings at 1.5, 3 and 7 T static magnetic field strength,” Int J Psychophysiol, vol. 67, no. 3, pp. 189–199, Mar 2008, doi: 10.1016/j.ijpsycho.2007.05.015. [DOI] [PubMed] [Google Scholar]
  • [25].Marino M et al. , “Adaptive optimal basis set for BCG artifact removal in simultaneous EEG-fMRI,” Sci Rep, vol. 8, no. 1, p. 8902, Jun 2018, doi: 10.1038/s41598-018-27187-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Mantini D et al. , “Complete artifact removal for EEG recorded during continuous fMRI using independent component analysis,” Neuroimage, vol. 34, no. 2, pp. 598–607, Jan 2007, doi: 10.1016/j.neuroimage.2006.09.037. [DOI] [PubMed] [Google Scholar]
  • [27].Srivastava G et al. , “ICA-based procedures for removing ballistocardiogram artifacts from EEG data acquired in the MRI scanner,” Neuroimage, vol. 24, no. 1, pp. 50–60, Jan 2005, doi: 10.1016/j.neuroimage.2004.09.041. [DOI] [PubMed] [Google Scholar]
  • [28].Vanderperren K et al. , “Removal of BCG artifacts from EEG recordings inside the MR scanner: a comparison of methodological and validation-related aspects,” Neuroimage, vol. 50, no. 3, pp. 920–34, Apr 2010, doi: 10.1016/j.neuroimage.2010.01.010. [DOI] [PubMed] [Google Scholar]
  • [29].Krishnaswamy P et al. , “Reference-free removal of EEG-fMRI ballistocardiogram artifacts with harmonic regression,” Neuroimage, vol. 128, pp. 398–412, Mar 2016, doi: 10.1016/j.neuroimage.2015.06.088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].LeCun Y et al. , “Deep learning,” Nature, vol. 521, no. 7553, pp. 436–44, May 2015, doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
  • [31].Tabar YR and Halici U, “A novel deep learning approach for classification of EEG motor imagery signals,” J Neural Eng, vol. 14, no. 1, p. 016003, Feb 2017, doi: 10.1088/1741-2560/14/1/016003. [DOI] [PubMed] [Google Scholar]
  • [32].Lawhern VJ et al. , “EEGNet: a compact convolutional neural network for EEG-based brain-computer interfaces,” J Neural Eng, vol. 15, no. 5, p. 056013, Oct 2018, doi: 10.1088/1741-2552/aace8c. [DOI] [PubMed] [Google Scholar]
  • [33].Mathis A et al. , “DeepLabCut: markerless pose estimation of user-defined body parts with deep learning,” Nat Neurosci, vol. 21, no. 9, pp. 1281–1289, Sep 2018, doi: 10.1038/s41593-018-0209-y. [DOI] [PubMed] [Google Scholar]
  • [34].Abbasi-Asl R et al. , “The DeepTune framework for modeling and characterizing neurons in visual cortex area V4,” bioRxiv, Nov 2018, doi: 10.1101/465534. [DOI] [Google Scholar]
  • [35].Kriegeskorte N, “Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing,” Annu Rev Vis Sci, vol. 1, pp. 417–446, Nov 2015, doi: 10.1146/annurev-vision-082114-035447. [DOI] [PubMed] [Google Scholar]
  • [36].Vieira S et al. , “Using deep learning to investigate the neuroimaging correlates of psychiatric and neurological disorders: Methods and applications,” Neurosci Biobehav Rev, vol. 74, no. Pt A, pp. 58–75, Mar 2017, doi: 10.1016/j.neubiorev.2017.01.002. [DOI] [PubMed] [Google Scholar]
  • [37].Zhou Z et al. , “DeepNeuron: an open deep learning toolbox for neuron tracing,” Brain Inform, vol. 5, no. 2, p. 3, Jun 2018, doi: 10.1186/s40708-018-0081-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Soltanian-Zadeh S et al. , “Fast and robust active neuron segmentation in two-photon calcium imaging using spatiotemporal deep learning,” PNAS, vol. 116, no. 17, pp. 8554–8563, 2019, doi: 10.1073/pnas.1812995116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [39].Goodfellow I et al. , Deep Learning. MIT Press, 2016. [Google Scholar]
  • [40].Rumelhart DE et al. , “Learning representations by back-propagating errors,” Nature, vol. 323, no. 6088, pp. 533–536, Oct 1986, doi: 10.1038/323533a0. [DOI] [Google Scholar]
  • [41].Chung J et al. , “Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling,” arXiv [cs.NE], Dec 2014. [Online]. Available: http://arxiv.org/abs/1412.3555. [Google Scholar]
  • [42].Hong L et al. , “Your eyes give you away: prestimulus changes in pupil diameter correlate with poststimulus task-related EEG dynamics,” PLoS One, vol. 9, no. 3, p. e91321, Mar 2014, doi: 10.1371/journal.pone.0091321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [43].Delorme A and Makeig S, “EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis,” J Neurosci Methods, vol. 134, no. 1, pp. 9–21, Mar 2004, doi: 10.1016/j.jneumeth.2003.10.009. [DOI] [PubMed] [Google Scholar]
  • [44].Niazy RK et al. , “Improved FMRI artifact reduction from simultaneously acquired EEG data using slice dependant template matching,” presented at the 12th ISMRM, May, 2004.
  • [45].Iannetti GD et al. , “Simultaneous recording of laser-evoked brain potentials and continuous, high-field functional magnetic resonance imaging in humans,” Neuroimage, vol. 28, no. 3, pp. 708–19, Nov 2005, doi: 10.1016/j.neuroimage.2005.06.060. [DOI] [PubMed] [Google Scholar]
  • [46].Gramfort A et al. , “MEG and EEG data analysis with MNE-Python,” Front Neurosci, vol. 7, p. 267, Dec 2013, doi: 10.3389/fnins.2013.00267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Gramfort A et al. , “MNE software for processing MEG and EEG data,” Neuroimage, vol. 86, pp. 446–60, Feb 2014, doi: 10.1016/j.neuroimage.2013.10.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [48].Fritsch FN and Carlson RE, “Monotone Piecewise Cubic Interpolation,” Siam J Numer Anal, vol. 17, no. 2, pp. 238–246, Apr 1980, doi: 10.1137/0717021. [DOI] [Google Scholar]
  • [49].Kingma DP and Ba J, “Adam: A Method for Stochastic Optimization,” arXiv [cs.LG], Dec 2014. [Online]. Available: http://arxiv.org/abs/1412.6980. [Google Scholar]
  • [50].Abadi M et al. , “TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems,” arXiv [cs.DC], Mar 2016. [Online]. Available: http://arxiv.org/abs/1603.04467. [Google Scholar]
  • [51].Chollet F and Others. “Keras.” https://keras.io (accessed May 26, 2019).
  • [52].“Hyperas.” GitHub. https://github.com/maxpumperla/hyperas (accessed Jun 16, 2019).
  • [53].“Hyperopt.” GitHub. https://github.com/hyperopt/hyperopt (accessed Jun 16, 2019).
  • [54].Christov II, “Real time electrocardiogram QRS detection using combined adaptive threshold,” Biomed Eng Online, vol. 3, no. 1, p. 28, Aug 2004, doi: 10.1186/1475-925X-3-28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [55].Kim KH et al. , “Improved ballistocardiac artifact removal from the electroencephalogram recorded in fMRI,” J Neurosci Methods, vol. 135, no. 1–2, pp. 193–203, May 2004, doi: 10.1016/j.jneumeth.2003.12.016. [DOI] [PubMed] [Google Scholar]
  • [56].Kayser J and Tenke CE, “Principal components analysis of Laplacian waveforms as a generic method for identifying ERP generator patterns: I. Evaluation with auditory oddball tasks,” Clin Neurophysiol, vol. 117, no. 2, pp. 348–68, Feb 2006, doi: 10.1016/j.clinph.2005.08.034. [DOI] [PubMed] [Google Scholar]
  • [57].Benjamini Y and Hochberg Y, “Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing,” J R Statist Soc, B, vol. 57, no. 1, pp. 289–300, 1995. [Google Scholar]
  • [58].Philiastides MG and Sajda P, “EEG-informed fMRI reveals spatiotemporal characteristics of perceptual decision making,” J Neurosci, vol. 27, no. 48, pp. 13082–91, Nov 2007, doi: 10.1523/JNEUROSCI.3540-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [59].Qian J et al. “Glmnet for Matlab.” http://www.stanford.edu/~hastie/glmnetmatlab/ (accessed March 15, 2019).
  • [60].Hastie T et al. , The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed. New York, NY, USA: Springer Science & Business Media, 2009. [Google Scholar]
  • [61].Deligianni F et al. , “NODDI and Tensor-Based Microstructural Indices as Predictors of Functional Connectivity,” PLoS One, vol. 11, no. 4, p. e0153404, 2016, doi: 10.1371/joumal.pone.0153404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].Deligianni F et al. , “Relating resting-state fMRI and EEG whole-brain connectomes across frequency bands,” Front Neurosci, vol. 8, p. 258, 2014, doi: 10.3389/fnins.2014.00258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [63].van der Molen MW et al. , “Phasic heart rate responses and cardiac cycle time in auditory choice reaction time,” Biol Psychol, vol. 16, no. 3–4, pp. 255–71, May 1983, doi: 10.1016/0301-(83)90028-5. [DOI] [PubMed] [Google Scholar]
  • [64].Bonmassar G et al. , “Motion and ballistocardiogram artifact removal for interleaved recording of EEG and EPs during MRI,” Neuroimage, vol. 16, no. 4, pp. 1127–41, Aug 2002, doi: 10.1006/nimg.2002.1125. [DOI] [PubMed] [Google Scholar]
  • [65].Chowdhury ME et al. , “Reference layer artefact subtraction (RLAS): a novel method of minimizing EEG artefacts during simultaneous fMRI,” Neuroimage, vol. 84, pp. 307–19, Jan 2014, doi: 10.1016/j.neuroimage.2013.08.039. [DOI] [PubMed] [Google Scholar]
  • [66].van der Meer JN et al. , “Carbon-wire loop based artifact correction outperforms post-processing EEG/fMRI corrections--A validation of a real-time simultaneous EEG/fMRI correction method,” Neuroimage, vol. 125, pp. 880–894, Jan 2016, doi: 10.1016/j.neuroimage.2015.10.064. [DOI] [PubMed] [Google Scholar]
  • [67].Steyrl D et al. , “Reference layer adaptive filtering (RLAF) for EEG artifact reduction in simultaneous EEG-fMRI,” J Neural Eng, vol. 14, no. 2, p. 026003, Apr 2017, doi: 10.1088/1741-2552/14/2/026003. [DOI] [PubMed] [Google Scholar]
  • [68].Luo Q et al. , “Ballistocardiogram artifact removal with a reference layer and standard EEG cap,” J Neurosci Methods, vol. 233, pp. 137–49, Aug 2014, doi: 10.1016/j.jneumeth.2014.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [69].Dyrholm M et al. , “Removal of BCG artifacts using a non-Kirchhoffian overcomplete representation,” IEEE Trans Biomed Eng, vol. 56, no. 2, pp. 200–4, Feb 2009, doi: 10.1109/TBME.2008.2005952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [70].Benwell CSY et al. , “Frequency and power of human alpha oscillations drift systematically with time-on-task,” Neuroimage, vol. 192, pp. 101–114., May 2019, doi: 10.1016/j.neuroimage.2019.02.067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [71].Faller J et al. , “An EEG-fMRI-TMS instrument to investigate BOLD response to EEG guided stimulation,” presented at the 9th International IEEE/EMBS Conference on Neural Engineering (NER), March, 2019, doi: 10.1109/NER.2019.8716889. [DOI]
  • [72].George M et al. , “Combined TMS-EEG-fMRI. The level of TMS-evoked activation in anterior cingulate cortex depends on timing of TMS delivery relative to frontal alpha phase,” Brain Stimulation, vol. 12, no. 2, p. 580, Mar 2019, doi: 10.1016/j.brs.2018.12.924. [DOI] [Google Scholar]
  • [73].Nierhaus T et al. , “Internal ventilation system of MR scanners induces specific EEG artifact during simultaneous EEG-fMRI,” Neuroimage, vol. 74, pp. 70–6, Jul 2013, doi: 10.1016/j.neuroimage.2013.02.016. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supp1-3004548

RESOURCES