Skip to main content
Human Brain Mapping logoLink to Human Brain Mapping
. 2017 Jul 26;38(11):5331–5342. doi: 10.1002/hbm.23737

The impact of T1 versus EPI spatial normalization templates for fMRI data analyses

Vince D Calhoun 1,2,, Tor D Wager 4, Anjali Krishnan 4, Keri S Rosch 5,6, Karen E Seymour 5,6, Mary Beth Nebel 5,7, Stewart H Mostofsky 5,6,7, Prashanth Nyalakanai 1, Kent Kiehl 1,3
PMCID: PMC5565844  NIHMSID: NIHMS894036  PMID: 28745021

Abstract

Spatial normalization of brains to a standardized space is a widely used approach for group studies in functional magnetic resonance imaging (fMRI) data. Commonly used template‐based approaches are complicated by signal dropout and distortions in echo planar imaging (EPI) data. The most widely used software packages implement two common template‐based strategies: (1) affine transformation of the EPI data to an EPI template followed by nonlinear registration to an EPI template (EPInorm) and (2) affine transformation of the EPI data to the anatomic image for a given subject, followed by nonlinear registration of the anatomic data to an anatomic template, which produces a transformation that is applied to the EPI data (T1norm). EPI distortion correction can be used to adjust for geometric distortion of EPI relative to the T1 images. However, in practice, this EPI distortion correction step is often skipped. We compare these template‐based strategies empirically in four large datasets. We find that the EPInorm approach consistently shows reduced variability across subjects, especially in the case when distortion correction is not applied. EPInorm also shows lower estimates for coregistration distances among subjects (i.e., within‐dataset similarity is higher). Finally, the EPInorm approach shows higher T values in a task‐based dataset. Thus, the EPInorm approach appears to amplify the power of the sample compared to the T1norm approach when not using distortion correction (i.e., the EPInorm boosts the effective sample size by 12–25%). In sum, these results argue for the use of EPInorm over the T1norm when no distortion correction is used. Hum Brain Mapp 38:5331–5342, 2017. © 2017 The Authors Human Brain Mapping Published by Wiley Periodicals, Inc.

Keywords: spatial normalization, echo planar image, fMRI, coregistration

INTRODUCTION

Spatial normalization of fMRI data has been broadly implemented in a variety of widely used software packages including AFNI (http://afni.nimh.nih.gov/afni), SPM [Friston, 2005], and FSL (http://www.fmrib.ox.ac.uk). One of the challenges of working with the echo planar imaging (EPI) scans is that they suffer from geometric distortion and signal dropout. Two main template‐based approaches have been utilized to spatially normalize EPI data into standard space (e.g., Montreal Neuroimaging Institute (MNI) standard space) [Collins et al., 1994; Mazziotta et al., 2001, 1995; Talairach and Tournoux, 1988; Tzourio‐Mazoyer et al., 2002].

The first approach, called EPInorm, involves an affine transform followed by a nonlinear registration of the EPI image to an EPI template in standard space (we use MNI from this point forward though the approach can apply to any template) [Fox, 1995; Friston et al., 1995a, 1995b, 1999; Gaser et al., 1999; Huntenberg et al., 2014; Klein et al., 2009; Woods et al., 1998]. An advantage of this approach is that it directly addresses the nonlinearities the EPI image exhibits, but a potential drawback is that it can suffer from over correction (e.g., pulling of unrelated brain regions to fill regions of signal dropout).

The second approach, called T1norm, includes the estimation of an affine transform mapping between the EPI image to the T1 image for that individual followed by a nonlinear warp between the T1 and a T1 MNI template. These warp parameters are then applied to the coregistered EPI image resulting in MNI normalized EPI data [Fox, 1995; Friston et al., 1995; Klein et al., 2009; Tzourio‐Mazoyer et al., 2002]. An advantage of this approach is that it typically relies on an image with higher spatial resolution to estimate the nonlinear warp to MNI space. However, a potential drawback of this approach is that it does not account for the geometric distortions (which can be substantial) that impact the EPI data, but not the T1 data, as it assumes the affine transform can correct for any differences between the EPI and T1 data from the same subject (Fig. 1).

Figure 1.

Figure 1

Side‐by‐side comparison of EPInorm and T1norm approaches for a single subject transparently overlaid on the T1 image from that subject. The T1norm process is unable to compensate for distortions throughout the brain, which are not present in the T1 scan (blue circles). [Color figure can be viewed at http://wileyonlinelibrary.com]

Both the above approaches can be applied to raw EPI data or on distortion‐corrected EPI data. Geometric distortion can be corrected to a degree using field maps [Jezzard and Balaban, 1995] or interpolation between images calculated using multiple phase encode directions [Holland et al., 2010]. In this case, the T1norm approach makes more sense as the nonlinear distortion correction provides a physics‐based correction of the nonlinearities between the T1 and the EPI data. Despite this, presumably because collection of and/or use of the field map adds additional complexity to the data collection and post processing, the vast majority of fMRI studies published use the T1norm approach and do not report using geometric distortion correction. When distortion correction is not performed, the spatial resolution advantages of the T1norm approach are likely outweighed by the disadvantages caused by distortion discrepancies between individual subjects' EPI and T1 data; poor coregistration between the EPI and T1 data may result in EPI data that are not well aligned across subjects. While several studies have compared nonlinear warping procedures commonly used within the T1norm framework [Ardekani et al., 2004, 2005; Crinion et al., 2007; Ghosh et al., 2010; Tahmasebi et al., 2009], to our knowledge, no study has systematically compared the EPInorm and T1norm approaches.

The goal of this article is to evaluate EPI spatial normalization schemes in group studies using either the EPInorm or the T1norm approaches. To do this, we evaluate four different datasets using several metrics. One outcome metric is the variability of the coregistration across subjects. This measure estimates the degree of similarity between the spatially normalized images from different individuals. A second metric is the coregistration between spatially normalized images across individuals. Finally, using a go/no‐go task, we evaluate the impact of these various approaches on the resulting T values, as increasing the anatomic accuracy of spatial normalization has been shown to increase sensitivity for detecting task‐related activation as well as the replicability of activation maps [Ardekani et al., 2005]. We focus primarily on the template‐based nonlinear spatial normalization within the SPM12 pipeline, but results are expected to apply to any T1norm approach. For comparison, we also include preprocessed data from the Autism Brain Imaging Data Exchange (ABIDE [Craddock, 2013; Martino et al., 2013]) which differed in the way that EPI and T1 data were aligned and in the way that T1 data were transformed to standard space (http://preprocessed-connectomes-project.org/abide/). The nonlinear boundary based registration (BBR) approach in FSL 5.0 was used to align EPI and T1 data, and linear and nonlinear approaches in the advanced normalization tools (ANTs) software package [Avants et al., 2008] were used to transform T1 data to standard space. BBR includes field inhomogeneity correction and aligns the EPI to the T1 by maximizing the intensity gradient across tissue boundaries [Greve and Fischl, 2009]. ANTs has been demonstrably more accurate in published studies comparing T1 normalization strategies than standard voxelwise normalization [Avants and Gee, 2004; Avants et al., 2011].

METHODS

We focus on a comparison of EPInorm and T1norm for four different datasets, processed a variety of different ways. The data are briefly described in Table 1 and include an 81‐subject multiband dataset collected during a go/no‐go task at the Mind Research Network and analyzed without (Experiment 1a) and with (Experiment 1b) distortion correction, a 30‐subject pain dataset previously published using a T1norm approach [Krishnan et al., 2016] (Experiment 2), a pediatric study including 112 typically developing 8–12‐year‐old children (Experiment 3), and the ABIDE [Craddock, 2013; Martino et al., 2013] data, which is an 1100‐subject multisite autism dataset (age range 5–64) processed using an FSL T1norm pipeline (Experiment 4).

Table 1.

Summary of datasets

# Participants, data type Distortion corrected EPInorm T1norm Output
Ex 1a: Multiband 81 go‐nogo task Yes SPMa SPMa Subjectwise variance map, T‐map
Ex 1b: Multiband 81 go‐nogo task No SPMa SPMa Subjectwise variance map, T‐map
Ex 2: CU Boulder 30, pain task No SPMa SPMa Subjectwise variance map
Ex 3: KKI 112, rest fMRI No SPMa SPMa Coregistration subject‐to‐template similarity
Ex 4: ABIDE multisite 1100, rest fMRI No SPMa FSLb Subjectwise variance map
a

SPM12, 4 × 5 × 4 basis set, interpolated to 3 × 3 × 3 mm voxels.

b

FSL used the boundary‐based registration approach described in the ABIDE preprocessing initiative [Friston et al., 1999].

The data are slice‐time corrected, motion corrected, and then spatially normalized in SPM using two approaches. For the EPInorm approach, we spatially normalize using the EPI MNI template as the target using a 4 × 5 × 4 basis set to mitigate overfitting. This approach involves an initial affine transform followed by a nonlinear warp, resulting in a nonlinear matching of the EPI image to the template. The T1norm approach involves an affine transform from the EPI image to the T1 image (or vice versa) from the same subject. In the case of the ABIDE dataset, the EPI data are registered to the T1 image with a linear transformation, followed by a white‐matter boundary based transformation using FMRIB's Linear Image Registration Tool (FLIRT) [Jenkinson and Smith, 2001] and then the prior white‐matter tissue segmentation from FMRIB's Automated Segmentation Tool (FAST) [Zhang et al., 2001]. Next, the T1 image is warped to the template using the SPM T1 MNI template via the unified segmentation approach [Ashburner and Friston, 2005], or in the case of the ABIDE dataset, linear and nonlinear transformations to MNI are performed using ANTs. The resulting warp parameters are applied to the EPI image, producing functional images in standard MNI space. See Figure 2 for a schematic demonstrating the two approaches. Additionally, for the multiband dataset, we collected a distortion correction set, which consists of two images acquired with different phase encode directions. These are used to distortion correct the EPI data using the FSL topup program [Andersson et al., 2003] and compared with the images that were not distortion corrected.

Figure 2.

Figure 2

Schematic of the EPInorm and T1norm approaches. [Color figure can be viewed at http://wileyonlinelibrary.com]

Metrics

We use several straightforward metrics for comparison in this study. The first metric used (Experiments 1, 2, and 4) was based on an evaluation of the voxelwise variability of the coregistered images across subjects. Within each dataset/experiment, we extracted the first timepoint of each subject's fMRI data, divided by the in‐brain mean for that subject and multiplied by 100. Next, we compute the voxelwise standard deviation among subjects. This provides a measure of the variability of the image; if a given voxel is always inside the brain at a voxel with a value of 100, the standard deviation will be zero. If a given voxel is on the edge and varying constantly between being “in” and “out” of the brain, this voxel will tend to have a high standard deviation. In an ideal case, all the datasets would be perfect matches to the MNI atlas and the standard deviation would be very low. In reality, we expect more variability at the edges of the brain, which largely reflects mismatch in the alignment of brains across individuals.

A second metric we use (Experiment 3) was the mean subject‐to‐subject displacement. That is, using the SPM image alignment algorithm, we compute the average displacement among each subject relative to a random subject in the dataset. Specifically, the first smoothed volume of the reference subject was concatenated with the first smoothed functional volume of every other subject along the fourth dimension to create a single image file with 112 frames. The rigid body realignment parameters were then estimated on this 4D file created for each normalization method. Next, we calculated the sum of the absolute value of the six realignment estimates for each frame (subject) relative to the first frame (reference subject) after converting the three rotational displacements from degrees to millimeters by assuming a 50‐mm radius from the cortex to the center of the head.

A third metric used was the task‐activation effect size. For Experiments 1 and 2, we computed a GLM model fit using SPM and compared the resulting T values for the main effect of interest (false alarms versus hits) in a go/no‐go task. We then tested for differences in the within subject mean of the T value above a given threshold (e.g., T = 4). In addition, under certain assumptions, we can calculate the number of subjects needed to match the T‐values. That is, if T1=μ1/(σ1N11) and T2=μ2/(σ2N21), then given a calculated value for T1and T2 and assuming the mean and standard deviations are the same (i.e., μ1=μ2 and σ1=σ2, we can calculate the number of subjects needed to make the T values equal, i.e., T1T2=N11/N21 or Neff=T1T22Nref1+1. Or more specifically, if T1>T2 this will enable us to calculate how many more subjects we would have needed to collect ( N_eff) to find equivalent T‐values to the reference case ( Nref).

RESULTS

Voxel Variability as a Measure of Intersubject Alignment

The voxelwise standard deviation metric showed that intersubject alignment was worse (indicated by significantly higher voxelwise standard deviations) using the T1norm approach than using the EPInorm approach. This was true for all datasets regardless of whether (a) distortion correction was performed; (b) functional data were collected using a standard EPI pulse sequence or a multiband pulse sequence; or (c) whether we implemented the T1norm approach ourselves or relied on publicly available preprocessed data using this approach. For Experiment 1a (go/no‐go fMRI data processed without distortion correction), the mean + standard deviation was lower for the EPInorm approach (EPInorm: 5.9 ± 8.0, T1norm: 7.4 ± 9.7). Figure 3a shows the standard deviation images for T1norm and EPInorm without distortion correction and the difference. Figure 3b shows the violin plots of the mean across the image and indicates that the EPInorm mean is lower than the T1norm mean. In both cases, it is clear that the T1norm approach shows more voxels with larger subject standard deviation values.

Figure 3.

Figure 3

T1norm versus EPInorm in Experiment 1a: (a, left) T1 norm voxelwise subject standard deviation, (a, middle) EPInorm voxelwise subject standard deviation, (a, right) difference (T1norm–EPInorm). (b) Violin plot of the voxels showing a subject standard deviation of over 25, T1norm shows a higher whole brain mean. [Color figure can be viewed at http://wileyonlinelibrary.com]

Voxelwise standard deviations for Experiment 1b (go/no‐go fMRI data processed with distortion correction) are shown in Figure 4a. As was the case for Experiment 1a, more voxels had larger standard deviation values across subjects for the T1norm approach than for the EPInorm approach. Comparing voxelwise standard deviation values across Experiments 1a and 1b, we observed that the T1norm approach with distortion correction showed more subjectwise variability than without distortion correction. This may be due to the fact that the EPI image is collected with a single phase encode direction rather than with both; as such, the distortions (and signal dropout) are somewhat biased to a specific scenario (and not well corrected by the T1norm approach). We thus rank the four processing scenarios for the go/no‐go fMRI data in terms of the whole brain mean and standard deviation for voxel variability in the following order from worst to best: T1norm (with distortion correction): 11.0 ± 13.4, T1norm (without distortion correction): 8.5 ± 11.1, EPInorm (without distortion correction): 8.0 ± 10.5, and EPInorm (with distortion correction): 7.5 ± 10.3.

Figure 4.

Figure 4

T1norm versus EPInorm in Experiment 1b: (a, left) T1 norm voxelwise subject standard deviation, (a, middle) EPInorm voxelwise subject standard deviation, (a, right) difference (T1norm–EPInorm). (b) Violin plot of the voxels showing a subject standard deviation of over 25, T1norm shows a higher whole brain mean. [Color figure can be viewed at http://wileyonlinelibrary.com]

To ensure that our results for Experiments 1a and 1b were not simply due to our greater familiarity with the EPInorm approach, we additionally evaluated spatial variability across subjects following normalization using 30 subjects from a published dataset that was carefully normalized using the T1norm approach using a manually curated SPM pipeline from Krishnan et al. [2016] (Experiment 2). We reanalyzed these data using the EPInorm approach and computed the voxelwise subject standard deviation for comparison purposes. As before, the T1norm resulted in a higher mean and standard deviation (10.2 ± 8.8) than the EPInorm approach (7.0 ± 5.0). Results are shown in Figure 5. As before, the higher voxelwise subject standard deviation is visible for the individual images as well as for the difference image. In addition, we also reprocessed the data using the same pipeline as Experiment 1 and results were consistent with increased variability for the T1norm approach (results not shown).

Figure 5.

Figure 5

T1norm versus EPInorm in UC Boulder dataset: (left) T1 norm voxelwise subject standard deviation, (middle) EPInorm voxelwise subject standard deviation, (right) difference (T1norm–EPInorm). [Color figure can be viewed at http://wileyonlinelibrary.com]

Intersubject Realignment Estimates were Lower for EPInorm Than for T1norm

Experiment 3, a dataset from the Kennedy Krieger Institute, consisted of 112 resting state scans all from typically developing 8–12‐year‐old children. We processed these data using both the T1norm and EPInorm approaches and then assessed between‐subject alignment for each method using the first image from each participant as described in the methods and a random subject from the set as a reference. We compared this measure of intersubject displacement across normalization approaches. Results showing the intersubject image displacement relative to the random subject used as a reference are presented in Figure 6. We observed significantly smaller displacements (indicative of better alignment) between subjects using the EPInorm approach relative to the T1norm approach (Wilcoxon signed‐rank test, V = 631, P =  1.59×1013), which is consistent with our voxelwise standard deviation findings.

Figure 6.

Figure 6

Alignment of first image for each participant relative to a random KKI subject. The EPInorm approach showed significantly more similarity (P < 0.05) among subjects in alignment relative to the T1norm approach (Wilcoxon signed‐rank test, V = 631, P = 1.59×1013). [Color figure can be viewed at http://wileyonlinelibrary.com]

Experiment 4, the final dataset, ABIDE, consisted of 1100 participants combined via a grass roots multisite consortium. The data were collected in separate studies, rather than harmonized in a coordinated manner. As such, this study has considerably more variability across subjects in parameters, scanner types, and other measures. As part of the ABIDE effort, a preprocessed dataset was released publicly which used FSL's boundary‐based registration via a T1norm approach [Craddock, 2013]. For comparison, we computed the SPM‐based EPInorm approach, and as before, we computed the voxelwise subject standard deviation. Figure 7 again shows the voxelwise subject standard deviation across the brain. As before, the T1norm resulted in higher standard deviations throughout the brain, but especially on the edges of the brain where we expect the highest values. The mean and standard deviation of voxelwise variability for the T1norm and EPInorm approaches were 13.6 ± 9.8 and 9.1 ± 5.7, respectively. A violin plot of the voxel values as well as a scatter plot of T1norm versus EPInorm in Figure 7 shows an even greater difference in variability for both T1norm and EPInorm than for the other datasets evaluated (likely due to the fact that ABIDE is a multisite dataset). We also see the same pattern in which the EPInorm exhibits less variability than the T1norm data in terms of voxelwise subject standard deviation. We also reprocessed the ABIDE data using the same T1norm pipeline as in Experiment 1. Results were consistent in showing greater subjectwise variability using the T1norm compared to the EPInorm approach regardless of whether FSL's boundary‐based registration was used to align each subject's EPI and T1 data followed by nonlinear warping of the T1 data to MNI space or whether SPM's coregistration algorithm was used to align each subject's EPI and T1 data followed by nonlinear warping of the T1 data to MNI space using SPM's unified segmentation normalization procedure (results not shown).

Figure 7.

Figure 7

T1norm versus EPInorm in ABIDE dataset: (a, left) T1 norm voxelwise subject standard deviation, (a, middle) EPInorm voxelwise subject standard deviation, (a, right) difference (T1norm–EPInorm). (b) Violin plot of the voxels showing a subject standard deviation of over 25, T1norm is clearly much higher. [Color figure can be viewed at http://wileyonlinelibrary.com]

Assessing the Impact of Intersubject Alignment Strategy on Group‐Level Inferences

To assess the practical impact of variability in intersubject alignment on our ability to draw inferences at the group level, we also analyzed the go/no‐go task data (Experiments 1a and 1b) in SPM. To compare methods, we looked at distributions of T values for the false‐alarm‐versus‐hit contrast and calculated effective sample sizes. Figure 8 shows maps of the false‐alarm‐versus‐hit contrast generated from both EPInorm and T1norm in Experiment 1a (without distortion correction; thresholded at T > 4.0). It is clear from the figure that the EPInorm approach results in higher T values associated with task activity. The maximum (minimum) T value for EPInorm is 9.68 (−7.26) and for T1norm is 8.31 (−5.77). If we calculate the average T value above 4.0 (to ensure we are comparing effect sizes only in voxels that were strongly task‐related) and test the difference between the two approaches, we find they are significantly different at P < 0.05 (P = 4.3 × 10−20). In addition, as described in the methods, we can calculate the number of subjects needed to obtain a T value for the T1norm approach that is equal to that for the EPInorm approach using this mean T value. We do this for the single slice in the top right corner of the figure that shows expected activation patterns for the go/no‐go task, and also for the more general whole brain case. This suggests, for the no distortion correction case, an effective sample size of Neff=TEPInorm=5.7313TT1norm=5.1105280+1=102, using the EPInorm approach relative to the original Nref=81 for the single slice case. Put another way, and subject to the assumptions mentioned earlier, if we take the T1norm mean T value as a reference, the EPInorm has amplified the effective N by almost 25%. In the whole brain case, we find Neff=TEPInorm=5.4891TT1norm=5.1756280+1=91, that is, a 12% “boost” to the effective sample size relative to the T1norm approach. Note that these results are not particularly dependent on the specific T value used. We evaluated the change in effective sample size for T values between 2 and 5, and in all cases, we observed a higher effective N for the EPInorm approach relative to the T1norm approach without distortion correction. The change in the distribution of voxel values between the two approaches can be observed by comparing the T1norm and EPInorm voxel values show in Figure 8. The averaging approach was used to ensure the effect was consistent across multiple task‐related voxel values rather than selecting only the maximum voxel value (which, if used, showed an even larger benefit of the EPInorm approach compared to the T1norm approach).

Figure 8.

Figure 8

T values corresponding to false alarms versus hits for the go/no‐go task without distortion correction for (a) T1norm and (b) EPInorm. [Color figure can be viewed at http://wileyonlinelibrary.com]

Figure 9 shows the maps of the false‐alarm‐versus‐hit contrast generated from both EPInorm and T1norm in Experiment 1b (with distortion correction; thresholded at T > 4.0), and qualitatively, the maps for EPInorm and T1norm are much more similar when distortion correction is performed compared to the case without distortion correction in Figure 8. The maximum (minimum) T value for EPInorm is 9.73 (−8.18) and for T1norm is 10.45 (−6.30). As before, we calculated the average T value above 4.0 and tested the difference between the two approaches with distortion correction but found that this difference was not significant P > 0.15. To compare effective sample sizes with distortion correction, we used the T1norm data as the reference because the T values were slightly, although not significantly, higher than those from EPInorm with distortion correction. When distortion correction is performed, we calculated the effective number of subjects needed for the EPInorm approach relative to the T1norm approach to be Neff=TT1norm=5.4807TEPInorm=5.4238280+1=83, relative to the original Nref=81 for the single slice case. This represents a 3% boost in the effective sample size. In the whole brain case, we find Neff=TT1norm=5.1177TEPInorm=5.1384280+1=83, again a 3% increase to the effective sample size relative to the EPInorm approach. Interestingly, the average T values for the EPInorm approach without distortion correction were higher than the T values for EPInorm with distortion correction (though the maximum T value was higher for the EPInorm approach with distortion correction).

Figure 9.

Figure 9

T values corresponding to false alarms versus hits for the go/no‐go task with distortion correction for (a) T1norm and (b) EPInorm. [Color figure can be viewed at http://wileyonlinelibrary.com]

EPInorm Produced Larger T Values Than T1Norm Without Distortion Correction

Figure 10 shows the T values extracted from single subjects. To do this, a mask was created including all voxels for which the group‐level T values were larger than 4 for all of the four cases (T1norm, EPInorm, T1norm with distortion correction, and EPInorm with distortion correction). We then calculated the mean of the T values within the group mask for each subject. Paired t tests showed significant differences among almost all the cases, though, the most significant difference was the T1norm (without distortion correction) compared to the other three cases. The EPInorm approach without distortion correction had a significantly higher mean than both with distortion correction cases for this analysis, though the overall means are relatively close. Reported P values (Figure 10) are uncorrected; if we Bonferroni correct for the 6 comparisons, all the results are still significant at P < 0.05 with the exception of the comparison between the EPInorm without distortion correction and EPInorm with distortion correction.

Figure 10.

Figure 10

Single‐subject T values for the four cases (T1norm, EPInorm, T1norm with distortion correction, and EPInorm with distortion correction): EPInorm shows the highest mean T‐values and T1norm without distortion correction is significantly lower than the other three approaches. [Color figure can be viewed at http://wileyonlinelibrary.com]

DISCUSSION

Given the large number of studies using either the T1norm or the EPInorm approaches, we compared these two approaches using several straightforward metrics on multiple datasets. Results indicated that the T1norm approach consistently shows higher variability across subjects than does the EPInorm approach. In addition, the inter‐subject realignment estimates were lower for data processed using the EPInorm approach, suggesting more similarity among subjects in alignment relative to the T1norm approach. Finally, the group T values generated using go/no‐go data processed using the EPInorm approach were higher than those generated using the T1norm approach. Comparing T‐scores resulting from the two normalization methods suggests that the EPInorm approach effectively amplifies the sample size by between 12% and 25%. The results also suggest that distortion correction substantially improves the T1norm approach, but has less of an impact on the EPInorm approach (which is already doing a type of distortion correction).

Our results also suggest EPInorm without distortion correction provides results as good as or in some cases better than EPInorm or T1norm with distortion correction. This is consistent with previous work which has directly used high resolution EPI data to develop templates and found increased activation as a result [Grabner et al., 2014]. In our case, we show similar enhancement in activation even for standard EPI acquisitions with little T1 contrast. Our results are also consistent with Huang et al. who showed that using a study specific EPI template resulted in greater t values and activated voxels within a predefined region of interest [Huang et al., 2010].

A reliance on distortion correction pulse sequences, while attractive as it offers a physics‐based approach for reducing distortions [Jezzard and Balaban, 1995; Holland et al., 2010], suffers from some limitations as well. Studies often collect these data, but in our experience, many do not use them, likely because it adds additional steps to the processing pipeline and also can be further complicated by motion within the run. And, though many studies currently collect distortion correction sequences and share data that has been corrected [Van Essen et al., 2012], there are a huge number of legacy studies which are being shared but did not collect such sequences [Eickhoff et al., 2016]. In addition, despite the relatively simple and short acquisition, many prospective studies still do not collect distortion sequences.

Distortion correction strategies also typically assume there is no motion between the distortion correction sequence and the fMRI acquisition. This can be particularly problematic when participants move their head and no additional distortion correction is collected in this new position. Furthermore, because distortions are different depending on which phase encode direction is used, and most fMRI scans collect only a single phase encode direction, there are limits to what can be corrected. Signal dropout is of course another issue that disproportionally affects EPI data relative to T1 data [Deichmann et al., 2003; Wastling and Barker, 2015]. It would be interesting to design a study to isolate the impact of the geometric distortion and signal dropout properties that the EPI data experience, and to study how these image properties propagate error through the T1norm and EPInorm pipelines. In our opinion, the signal dropout is likely having the largest impact on the results, introducing error in the coregistration between the T1 and EPI data that is then propagated when the transformation to standard space estimated using the T1 image is then applied to the EPI data, but this should be studied in future work.

It should also be noted that EPInorm can be applied (optimally) to each functional MRI run. This minimizes concerns about subject motion (and geometric distortion differences) between runs creating increased variability in spatial normalization. Of course, motion within a run would still cause potential issues, and to address this problem, emerging approaches collect distortion correction information more regularly throughout the scan [In et al., 2015; Oh et al., 2012]. Alternatively, one could collect reverse phase encoded images at every other timepoint, however at the cost of cutting the temporal resolution in half. And finally, there are also approaches which jointly estimate multiple factors such as distortion correction and movement interactions or EPI Nyquist ghost effects [Andersson et al., 2017; Xie et al., 2017].

There are several limitations to our approach. We mainly focused on data without the use of distortion correction, primarily because this is the most widespread use of the T1norm approach. In addition, the distortions and signal dropout in the EPI scan can be complex. In some cases, the EPInorm approach may be detrimentally stretching out signal to cover dropout regions that have been lost; in other cases, the EPInorm approach may also be doing a better job of aligning the voxels within the brain. In this empirical study, the “right answer” is relative to the conditions and nature of the data tested. Here, we include several datasets of different types and tasks to increase the generalizability of our conclusions, but some datasets from some scanners may behave differently. We did not exhaustively compare all T1‐based registration algorithms [Klein et al., 2009], including those demonstrably more accurate in published studies such as surface‐based registration [Dale et al., 1999; Fischl et al., 1999; Fischl and Dale, 2000]. However, previous research comparing the impact of T1‐based registration algorithms on prediction and reproducibility metrics derived from group‐level statistical parametric images found that higher order polynomial warps compared to affine alignment had only a minor impact [Strother et al., 2004]. While surface‐based algorithms have been shown to further improve the T1‐to‐T1 warping compared to now standard higher‐order polynomial volume‐based normalization algorithms, they still require proper EPI‐to‐T1 alignment, which is suspect to the warping and signal dropout aspects of the EPI scan, and thus we predict that they would show similar results as the T1norm approach used in this article. However, future studies should more extensively compare the impact of other T1‐based approaches on the analysis of fMRI data. Some newer studies are collecting very high‐resolution EPI images, which may benefit even more from the EPInorm approach; however, if distortions are minimized in such acquisitions then the T1norm approach may provide some benefits. Note, one other advantage of the EPInorm approach is it allowed spatial normalization to proceed without the requirement of the extra T1 scan (which in some cases may not be available).

CONCLUSIONS

We show results suggesting that the widely used T1norm approach (without distortion correction) does not spatially normalize the EPI data and the EPInorm approach. The differences are striking, consistent across multiple datasets (differing in how the data were collected and processed), and should give pause to those who plan to use the T1norm approach without distortion correction. Interesting, distortion correction substantially improves the results for the T1norm approach, but has much less of an effect on the EPInorm approach, presumably because it is already doing a distortion correction of sorts through the nonlinear transformation.

REFERENCES

  1. Andersson JL, Skare S, Ashburner J (2003): How to correct susceptibility distortions in spin‐echo echo‐planar images: Application to diffusion tensor imaging. NeuroImage 20:870–888. [DOI] [PubMed] [Google Scholar]
  2. Andersson JLR, Graham MS, Drobnjak I, Zhang H, Filippini N, Bastiani M (2017): Towards a comprehensive framework for movement and distortion correction of diffusion MR images: Within volume movement. NeuroImage 152:450–466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ardekani BA, Bachman A, Strother S, Fujibayashi Y, Yonekura Y (2004): Impact of inter‐subject image registration on group analysis of fMRI data in international congress series. Elsevier; 1265:49–59. [Google Scholar]
  4. Ardekani BA, Guckemus S, Bachman A, Hoptman MJ, Wojtaszek M, Nierenberg J (2005): Quantitative comparison of algorithms for inter‐subject registration of 3D volumetric brain MRI scans. J Neurosci Methods 142:67–76. [DOI] [PubMed] [Google Scholar]
  5. Ashburner J, Friston KJ (2005): Unified segmentation. NeuroImage 26:839–851. [DOI] [PubMed] [Google Scholar]
  6. Avants B, Gee JC (2004): Geodesic estimation for large deformation anatomical shape averaging and interpolation. NeuroImage 23:S139–S150. [DOI] [PubMed] [Google Scholar]
  7. Avants BB, Epstein CL, Grossman M, Gee JC (2008): Symmetric diffeomorphic image registration with cross‐correlation: Evaluating automated labeling of elderly and neurodegenerative brain. Med Image Anal 12:26–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Avants BB, Tustison NJ, Song G, Cook PA, Klein A, Gee JC (2011): A reproducible evaluation of ANTs similarity metric performance in brain image registration. NeuroImage 54:2033–2044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Collins DL, Neelin P, Peters TM, Evans AC (1994): Automatic 3D intersubject registration of MR volumetric data in standardized Talairach space. J Comput Assist Tomogr 18:192–205. [PubMed] [Google Scholar]
  10. Craddock C (2013): ABIDE Preprocessed Initiative, http://preprocessed-connectomes-project.github.io/abide/quality_assessment.html.
  11. Crinion J, Ashburner J, Leff A, Brett M, Price C, Friston K (2007): Spatial normalization of lesioned brains: Performance evaluation and impact on fMRI analyses. NeuroImage 37:866–875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Dale AM, Fischl B, Sereno MI (1999): Cortical surface‐based analysis. I. Segmentation and surface reconstruction. NeuroImage 9:179–194. [DOI] [PubMed] [Google Scholar]
  13. Deichmann R, Gottfried JA, Hutton C, Turner R (2003): Optimized EPI for fMRI studies of the orbitofrontal cortex. NeuroImage 19:430–441. [DOI] [PubMed] [Google Scholar]
  14. Eickhoff S, Nichols TE, Van Horn JD, Turner JA (2016): Sharing the wealth: Neuroimaging data repositories. NeuroImage 124:1065–1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fischl B, Dale AM (2000): Measuring the thickness of the human cerebral cortex from magnetic resonance images. Proc Natl Acad Sci USA 97:11050–11055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fischl B, Sereno MI, Dale AM (1999): Cortical surface‐based analysis. II: Inflation, flattening, and a surface‐based coordinate system. NeuroImage 9:195–207. [DOI] [PubMed] [Google Scholar]
  17. Fox PT (1995): Spatial normalization origins: Objectives, applications, and alternatives. Hum Brain Mapp 3:161–164. [Google Scholar]
  18. Friston K (2005): SPM, http://www.fil.ion.ucl.ac.uk/spm.
  19. Friston KJ, Ashburner J, Frith CD, Poline JP, Heather JD, Frackowiak RS (1995a): Spatial registration and normalization of images. Hum Brain Mapp 2:165–189. [Google Scholar]
  20. Friston KJ, Holmes A, Worsley KJ, Poline JP, Frith CD, Frackowiak RS (1995b): Statistical parametric maps in functional imaging: A general linear approach. Hum Brain Mapp 2:189–210. [Google Scholar]
  21. Friston KJ, Holmes AP, Worsley KJ (1999): How many subjects constitute a study?. NeuroImage 10:1–5. [DOI] [PubMed] [Google Scholar]
  22. Gaser C, Volz HP, Kiebel S, Riehemann S, Sauer H (1999): Detecting structural changes in whole brain based on nonlinear deformations‐application to schizophrenia research. NeuroImage 10:107–113. [DOI] [PubMed] [Google Scholar]
  23. Ghosh SS, Kakunoori S, Augustinack J, Nieto‐Castanon A, Kovelman I, Gaab N, Christodoulou JA, Triantafyllou C, Gabrieli JD, Fischl B (2010): Evaluating the validity of volume‐based and surface‐based brain image registration for developmental cognitive neuroscience studies in children 4 to 11 years of age. NeuroImage 53:85–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Grabner G, Poser BA, Fujimoto K, Polimeni JR, Wald LL, Trattnig S, Toni I, Barth M (2014): A study‐specific fMRI normalization approach that operates directly on high resolution functional EPI data at 7 Tesla. NeuroImage 100:710–714. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Greve DN, Fischl B (2009): Accurate and robust brain image alignment using boundary‐based registration. NeuroImage 48:63–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Holland D, Kuperman JM, Dale AM (2010): Efficient correction of inhomogeneous static magnetic field‐induced distortion in Echo Planar Imaging. NeuroImage 50:175–183. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Huang CM, Lee SH, Hsiao IT, Kuan WC, Wai YY, Ko HJ, Wan YL, Hsu YY, Liu HL (2010): Study‐specific EPI template improves group analysis in functional MRI of young and older adults. J Neurosci Methods 189:257–266. [DOI] [PubMed] [Google Scholar]
  28. Huntenberg JM, Gorgolewski K, Anwander A (2014): Evaluating Nonlinear Coregistration of BOLD EPI and T1 Images. Hamburg, Germany: OHBM. [Google Scholar]
  29. In M, Posnansky HO, Beall EB, Lowe MJ, Speck O (2015): Distortion correction in EPI using an extended PSF method with a reversed phase gradient approach. PLoS One 10:e0116320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Jenkinson M, Smith S (2001): A global optimisation method for robust affine registration of brain images. Med Image Anal 5:143–156. [DOI] [PubMed] [Google Scholar]
  31. Jezzard P, Balaban RS (1995): Correction for geometric distortion in echo planar images from B0 field variations. Magn Reson Med 34:65–73. [DOI] [PubMed] [Google Scholar]
  32. Klein A, Andersson J, Ardekani BA, Ashburner J, Avants B, Chiang MC, Christensen GE, Collins DL, Gee J, Hellier P, Song JH, Jenkinson M, Lepage C, Rueckert D, Thompson P, Vercauteren T, Woods RP, Mann JJ, Parsey RV (2009): Evaluation of 14 nonlinear deformation algorithms applied to human brain MRI registration. NeuroImage 46:786–802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Krishnan A, Woo CW, Chang LJ, Ruzic L, Gu X, Lopez‐Sola M, Jackson PL, Pujol J, Fan J, Wager TD (2016): Somatic and vicarious pain are represented by dissociable multivariate brain patterns. Elife 5:e15166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Martino AD, Yan C, Li G, Denio QE, Castellanos F, Alaerts XK, Anderson J, Assaf SM, Bookheimer S, Dapretto Y, Deen M, Delmonte B, Dinstein S, Ertl‐Wagner IB, Fair D, Gallagher AL, Kennedy DP, Keown C, Keysers LC, Lainhart J, Lord E, Luna C, Menon BV, Minshew NJ, Monk C, Mueller SS, Muller RA, Nebel MB, Nigg J, O'Hearn TK, Pelphrey KA, Peltier SJ, Rudie J, Sunaert D, Thioux SM, Tyszka JM, Uddin LQ, Verhoeven J, Wenderoth SN, Wiggins JL, Mostofsky SH, Milham MP (2013): The autism brain imaging data exchange: Towards a large‐scale evaluation of the intrinsic brain architecture in autism. Mol Psychiatry [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Mazziotta J, Toga A, Evans A, Fox P, Lancaster J, Zilles K, Woods R, Paus T, Simpson G, Pike B, Holmes C, Collins L, Thompson P, MacDonald D, Iacoboni M, Schormann T, Amunts K, Palomero‐Gallagher N, Geyer S, Parsons L, Narr K, Kabani N, Le Goualher G, Boomsma D, Cannon T, Kawashima R, Mazoyer B (2001): A probabilistic atlas and reference system for the human brain: International Consortium for Brain Mapping (ICBM). Philos Trans R Soc Lond B Biol Sci 356:1293–1322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Mazziotta JC, Toga AW, Evans A, Fox P, Lancaster J (1995): A probabilistic atlas of the human brain: Theory and rationale for its development. The International Consortium for Brain Mapping (ICBM). NeuroImage 2:89–101. [DOI] [PubMed] [Google Scholar]
  37. Oh SH, Chung JY, In MH, Zaitsev M, Kim YB, Speck O, Cho ZH (2012): Distortion correction in EPI at ultra‐high‐field MRI using PSF mapping with optimal combination of shift detection dimension. Magn Reson Med 68:1239–1246. [DOI] [PubMed] [Google Scholar]
  38. Strother S, La Conte S, Kai Hansen L, Anderson J, Zhang J, Pulapura S, Rottenberg D (2004): Optimizing the fMRI data‐processing pipeline using prediction and reproducibility performance metrics: I. A preliminary group analysis. NeuroImage 23:S196–S207. [DOI] [PubMed] [Google Scholar]
  39. Tahmasebi AM, Abolmaesumi P, Zheng ZZ, Munhall KG, Johnsrude IS (2009): Reducing inter‐subject anatomical variation: Effect of normalization method on sensitivity of functional magnetic resonance imaging data analysis in auditory cortex and the superior temporal region. NeuroImage 47:1522–1531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Talairach J, Tournoux P (1988): A Co‐Planar Sterotaxic Atlas of a Human Brain. Thieme, Stuttgart: Georg Thieme Verlag. [Google Scholar]
  41. Tzourio‐Mazoyer N, Landeau B, Papathanassiou D, Crivello F, Etard O, Delcroix N, Mazoyer B, Joliot M (2002): Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single‐subject brain. NeuroImage 15:273–289. [DOI] [PubMed] [Google Scholar]
  42. Van Essen DC, Ugurbil K, Auerbach E, Barch D, Behrens TE, Bucholz R, Chang A, Chen L, Corbetta M, Curtiss SW, Della Penna S, Feinberg D, Glasser MF, Harel N, Heath AC, Larson‐Prior L, Marcus D, Michalareas G, Moeller S, Oostenveld R, Petersen SE, Prior F, Schlaggar BL, Smith SM, Snyder AZ, Xu J, Yacoub E, Consortium WU‐MH (2012): The Human Connectome Project: A data acquisition perspective. NeuroImage 62:2222–2231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Wastling SJ, Barker GJ (2015): Designing hyperbolic secant excitation pulses to reduce signal dropout in gradient‐echo echo‐planar imaging. Magn Reson Med 74:661–672. [DOI] [PubMed] [Google Scholar]
  44. Woods RP, Grafton ST, Watson JD, Sicotte NL, Mazziotta JC (1998): Automated image registration: II. Intersubject validation of linear and nonlinear models. J Comput Assist Tomogr 22:153–165. [DOI] [PubMed] [Google Scholar]
  45. Xie VB, Lyu M, Wu EX (2017): EPI Nyquist ghost and geometric distortion correction by two‐frame phase labeling. Magn Reson Med 77:1749–1761. [DOI] [PubMed] [Google Scholar]
  46. Zhang Y, Brady M, Smith S (2001): Segmentation of brain MR images through a hidden Markov random field model and the expectation‐maximization algorithm. IEEE Trans Med Imag 20:45–57. [DOI] [PubMed] [Google Scholar]

Articles from Human Brain Mapping are provided here courtesy of Wiley

RESOURCES