Abstract
Pre-processing MRI scans prior to performing volumetric analyses is common practice in MRI studies. As pre-processing steps adjust the voxel intensities, the space in which the scan exists, and the amount of data in the scan, it is possible that the steps have an effect on the volumetric output. To date, studies have compared between and not within pipelines, and so the impact of each step is unknown. This study aims to quantify the effects of pre-processing steps on volumetric measures in T1-weighted scans within a single pipeline. It was our hypothesis that pre-processing steps would significantly impact ROI volume estimations. One hundred fifteen participants from the OASIS dataset were used, where each participant contributed three scans. All scans were then pre-processed using a step-wise pipeline. Bilateral hippocampus, putamen, and middle temporal gyrus volume estimations were assessed following each successive step, and all data were processed by the same pipeline five times. Repeated-measures analyses tested for main effects of pipeline step, scan-rescan (for MRI scanner consistency), and repeated pipeline runs (for algorithmic consistency). A main effect of pipeline step was detected, as was an interaction between pipeline step and ROI. No effect of either scan-rescan or repeated pipeline run was detected. We then supply a correction for noise in the data resulting from pre-processing.
1 Introduction
Magnetic Resonance Imaging (MRI) has become a central tool in both research and medicine due to its ability to capture in vivo anatomic and functional data. Differences in structure and/or metabolism between groups can be strongly correlated with behavioral performance [1–4], helping to explain the functions of the various regions of the cerebrum and cerebellum. Longitudinal studies give insight into neurodegenerative diseases and psychiatric disorders [5–12].
Central to these studies and diagnoses are various methods of interacting with the MRI data. Several studies [8, 9, 13, 14–19] have already investigated the effects of multiple software suites on a single dataset, finding that one often outperforms another in a certain regard. For instance, the FreeSurfer suite [20] is widely used for cortical and subcortical parcellation, yet recent studies have shown that Advanced Normalization Tools (ANTs) [21], a more recently developed suite of software, outperforms FreeSurfer in certain aspects [17, 18, 22].
Pre-processing steps like rigid-body transformations, field-inhomogeneity corrections, and skull-stripping are quite common [7, 18, 23–26]. Their effects on the data, such as adjusting the position of the data within the field-of-view or the distribution of the histogram, are well known and needed for the sake of both algorithmic robustness and multi-modal approaches. The impact of such effects, however, is not well investigated: few studies have investigated different pre-processing steps and quantified their impact on the data [27, 28], and those that have did so while also varying software and pipelines. Further, few studies have quantified the variance of multiple intra-subject scans resulting merely from pre-processing rather than some other independent variable [29–32]. Consequently, no consensus on how each pre-processing step impacts the downstream output has been established.
As it is unknown how each step within a single pipeline influences the overall output of the pipeline, different researchers with similar questions could potentially achieve significantly different results that are not a product of the independent variable under investigation, but rather the result of different pre-processing steps (let alone software). Differing pipeline steps may introduce an unknown amount of noise into the data even if their pre-processing algorithms are based in the same suite of software. This may be especially true if the various pre-processing steps add different amounts of noise or variance, as a difference of only 1 to 2% between groups may be considered significant [33].
It is therefore the aim of this paper to assess the impact of common pre-processing steps on scan-rescan variance and anatomic volume. We analyzed the estimated volumes of bilateral putamen, hippocampus, and middle temporal gyrus in 115 participants, and varied pre-processing steps to include images left in original space, rotated and cropped images, N4-bias correction, and skull-stripping of the images. We hypothesize that each different pre-processing step will have a differential impact on the region of interest (ROI) estimated volume.
2 Methods
2.1 MRI acquisition
Hosted cross-sectional data from the Open Access Series of Imaging Studies (OASIS) project [34] were used in this study. 115 participants whose age ranged between 20 and 29 (female = 68, mean age = 22.8 ± 2.48) were selected from the entire database, as we intended to only include healthy young adults. All participants were scanned with a T1-weighted MPRAGE sequence using the following parameters: slice thickness = 1 mm, TR = 1900 ms, TE = 2.26 ms, field of view = 218 × 250, voxel size = 1 × 0.977 × 0.977 mm, acquisition matrix = 256 × 215, flip angle = 9°. As each participant contributed three or four scans, the first three scans of each participant were used in this study.
As all images were hosted in a 16-bit big-endian Analyze 7.5 format, images were converted into NIfTI and MGZ formats via mri_convert for processing in ANTs- or FreeSurfer-based pipelines, respectively, detailed below.
2.2 FreeSurfer
FreeSurfer 6.0 (https://surfer.nmr.mgh.harvard.edu) [20, 35] was used to render hippocampal, putamen, middle temporal gyrus, and total brain volumes in all subjects. All scans were segmented using the FreeSurfer (FS) software with the command recon-all -subjid subjDir -all -sd /path/to/workDir -notal-check -cw256. Volumes were then extracted from the aseg.stats, lh.aparc.DKTatlas.stats, and rh.aparc.DKTatlas.stats output files. FreeSurfer output was also hosted by OASIS, and the same data were extracted from the hosted FS data; we expect our data to differ from the OASIS FS data, as we are using an updated version of the software. While it is not the aim of this paper to compare between diverse pipelines, but rather individual steps within a single pipeline, FS was included due to its ubiquity in order to have a reference point for our findings. Further, referencing the DKT atlas will produce labels complementary to those of the protocol below.
2.3 Template construction
Twenty representative participants were selected in a pseudo-random fashion from the 115 participants (female = 10, mean age = 22.8 ±2.5) in order to construct a study-specific template. First, scans were warped into a standardized coordinate space (MNI) [36, 37, 38] using a nonlinear diffeomorphic normalization algorithm supplied by ANTs [14] with the following command:
ANTS 3 -o <prefix> -i 100x100x100x20 -t SyN[0.1] -r Gauss[3,0.5] -m CC[<template.nii.gz>,<input.nii.gz>,4,4]; WarpImageMultiTransform 3 <input.nii.gz> <output.nii.gz> Warp.nii.gz Affine.txt -R <template.nii.gz>
Second, warped scans were then skull-stripped, as we intended to render both a head (brain with meninges, skull, etc.) and a brain (all non-brain information removed) template, for use with images containing head, or just brain, information, respectively. Skull-stripping was done utilizing ANTs and referencing the OASIS-30 template hosted at http://www.mindboggle.info/data.html via antsBrainExtraction.sh -d 3 -a lab_template.nii.gz -e ref_template.nii.gz -m probability_mask.nii.gz -f registration_mask.nii.gz -o <prefix>. Third, structural scans in MNI space were then used to construct lab-specific head and brain templates, following previous research [17, 21], via buildtemplateparallel.sh -d 3 -m 30x90x30 -t GR -s CC -c 2 -j 8 -o <prefix> -z mni_icbm152_template.nii <structural_mni_scans.nii.gz>. This resulted in templates derived from the statistical average of all input scans. Fourth, the lab-specific head template was segmented according to the Desikan-Killiany-Tourville protocol using the Joint Label Fusion toolkit and referencing the OASIS-TRT-20 dataset [22, 39–41]:
antsJointLabelFusion.sh -d 3 -t <template.nii.gz> -o <prefix> -p prior%04d.nii.gz -c 5 -j 4 -g atlas_1/<atlas.nii.gz> -l labels_1/<labels.nii.gz> … -g atlas_20/<atlas.nii.gz> -l labels_20/<labels.nii.gz>
This produced ROI-specific probabilistic priors in template space, for use in anatomic segmentation. The ROI priors were produced only in head template space, as both the head and brain templates existed in the same space, and in order to reduce template-mask confounds during segmentation. Finally, priors for skull-stripping participant scans were constructed in template head space. This was first accomplished by skull-stripping the head template by referencing the OASIS-30 priors via
antsCorticalThickness.sh -d 3 -a <moving.nii> -e <fixed.nii> -t <brain_mask.nii> -m <probability_mask.nii> -f <extraction_mask.nii> -p priors%d.nii.gz -o <prefix>
Next, a probability mask was constructed with SmoothImage 3 <binary_mask.nii> 1 <probability_mask.nii>, an extraction mask was constructed with c3d <binary_mask.nii> -dilate 28x28x28vox -o <extraction_mask.nii> [42], and a registration mask was constructed via c3d <binary_mask.nii> -dilate 1 18x18x18vox -o <registration_mask.nii>.
2.4 Pre-processing participant scans
As we intended to investigate whether or not pre-processing MRI data has an impact on volumetric output, scans from each of the participants were then pre-processed in a stepwise fashion such that the output of a previous step became the input for the subsequent step. Common pre-processing steps [23–25, 29, 43, 44] were selected from the literature; all pre-processing and segmentation was conducted on all three scans contributed by each participant. First, images converted from the Analyze format to NIfTI format were left in native space (ORIG). Second, a rigid-body transformation utilizing six degrees of freedom rotated (RROT) the structural scans into approximate template space via antsRegistrationSyNQuick -d 3 -f <template.nii> -m <input.nii> -t r -o <prefix>. RROT differs from ORIG in the location of information within the domain, which may impact matrix-based calculations performed in a diffeomorphic registration. Third, RROT scans were then used to produce N4-bias corrected scans (N4BC) [45] via the ANTs-based toolkit N4BiasFieldCorrection -d 3 -i <input.nii.gz> -s 4 -c [50x50x50x50,0.0000001] -b [200] -o <output.nii.gz>, where care was taken to use the same parameters and arguments found in the antsBrainExtraction.sh script. This step corrects non-uniform image intensities of the same tissue class which result from field inhomogeneities. For example, gray matter in one region may have a very similar voxel intensity to white matter of another region due to local signal variation resulting from field inhomogeneities. Correcting for such variance produces tissue classes that have similar intensity signatures. Finally, scans were skull-stripped (N4SS), a process which involves the removal of non-brain tissues (e.g., the durae), skull, and scalp, via antsBrainExtraction.sh -d 3 -a <input.nii.gz> -e <template.nii.gz> -m <probability_mask.nii.gz> -o <prefix>.
Additionally, as the skull-stripping script performs the N4-bias correction prior to segmentation, RROT scans were used as the input files in this step in order to avoid repeated bias corrections. Thus, each participant would be measured by the ORIG, RROT, N4BC, and N4SS scans produced by the previous steps, in addition to the FreeSurfer (FS) measures (Fig 1).
2.5 Segmentation
Segmentation of the participant scans (ORIG, RROT, N4BC, and N4SS) was then conducted in two steps. First, ANTs was used to register each scan to the template, resulting in a calculation of the bilateral diffeomorphic deformation of both template and participant scan to a common midpoint. This was done with the command ANTS 3 -o Reg -i 100x100x100x20 -t SyN[0.1] -r Gauss[3,0.5] -m CC[template.nii.gz,input.nii.gz,4,4]. Care was taken to register non-skull-stripped scans (head) with the head template, and skull-stripped (brain) scans with the brain template. Upon completion, the calculations were then used to warp 6 probabilistic segmentation labels from template to participant space using nearest-neighbor interpolation. The labels included two subcortical structures (bilateral hippocampus and putamen) and one cortical structure (bilateral middle temporal gyrus). The hippocampus was selected due to its difficulty in segmentation, whereas the putamen was considered easier to segment, being clearly separated from nearby structures by white matter. The middle temporal gyrus was selected in order to include a cortical measure. Segmentation of these various regions of interest (ROIs) occurred via WarpImageMultiTransform 3 <ROI_label.nii.gz> <output.nii.gz> -i Affine.txt InverseWarp.nii.gz -R <input.nii.gz> --use-NN. Upon review, and in accordance with the literature [46], these ROI masks in participant space were then thresholded and binarized via c3d <input.nii.gz> -thresh 0.5 1 1 0 -o <output.nii.gz>. In total, six measures (bilateral hippocampus, putamen, middle temporal gyrus) were produced for each of the pre-processing steps (ORIG, RROT, N4BC, N4SS, and FS) for each participant (n = 115), yielding 6 × 5 × 115 = 3,450 observations. Further, repeated scans for each participant contributed additional measurements, which will be used to assess scan-rescan variance.
ROI volumes were extracted via c3d <input.nii.gz> -dup -lstat, while Dice similarity coefficients [47] were extracted via c3d <input1.nii.gz> <input2.nii.gz> -overlap <label>.
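The thresholding and binarization step can be illustrated outside of c3d. The following is a minimal NumPy sketch, assuming the warped label is already loaded as an array of values between 0 and 1; the function name `threshold_and_binarize` and the toy array are hypothetical and are not part of the study's scripts. It mirrors the arguments `0.5 1 1 0`: values inside the closed range [0.5, 1] become 1 and all others become 0.

```python
import numpy as np


def threshold_and_binarize(prob, lower=0.5, upper=1.0):
    """Return 1 where lower <= prob <= upper and 0 elsewhere.

    Hypothetical helper mirroring `-thresh 0.5 1 1 0`; not the authors' code.
    """
    prob = np.asarray(prob, dtype=float)
    inside = (prob >= lower) & (prob <= upper)
    return inside.astype(np.uint8)


# Toy stand-in for a warped probabilistic ROI label (assumed values).
warped_label = np.array([0.0, 0.2, 0.5, 0.8, 1.0])
roi_mask = threshold_and_binarize(warped_label)
print(roi_mask)
```

Note that the lower bound is inclusive, so a voxel with a value of exactly 0.5 is retained in the mask.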
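For readers who want to see what these two quantities amount to, the sketch below computes an ROI volume and a Dice similarity coefficient from binary masks with NumPy. It is an illustration under stated assumptions rather than a reimplementation of c3d: the function names and the toy masks are hypothetical, and the voxel dimensions are taken from the acquisition parameters in Section 2.1.

```python
import numpy as np


def roi_volume_mm3(mask, voxel_dims_mm):
    """Volume of a binary mask: voxel count times the volume of one voxel."""
    n_voxels = np.count_nonzero(mask)
    return float(n_voxels) * float(np.prod(voxel_dims_mm))


def dice_coefficient(mask_a, mask_b):
    """Dice similarity: 2 * |A and B| / (|A| + |B|) for two binary masks."""
    a = np.asarray(mask_a, dtype=bool)
    b = np.asarray(mask_b, dtype=bool)
    total = a.sum() + b.sum()
    if total == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2.0 * np.logical_and(a, b).sum() / total


# Toy masks standing in for the same ROI from two pipeline runs (assumed data).
run1 = np.zeros((4, 4, 4), dtype=bool)
run1[1:3, 1:3, 1:3] = True   # 8 voxels
run2 = np.zeros((4, 4, 4), dtype=bool)
run2[1:3, 1:3, 1:4] = True   # 12 voxels, containing all 8 voxels of run1

voxel_dims = (1.0, 0.977, 0.977)  # mm, from Section 2.1
volume_run1 = roi_volume_mm3(run1, voxel_dims)
dice = dice_coefficient(run1, run2)
print(volume_run1, dice)
```

In this toy case the overlap is 8 voxels and the masks hold 8 and 12 voxels, so the coefficient is 2 × 8 / 20 = 0.8.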
2.6 Total brain volumes
In order to perform a ratio correction, total brain volumes were calculated for all scans for each participant, using both FreeSurfer- and ANTs-based tools. Calculations of total brain volumes of the mgz files were pulled from the aseg.stats file, and total brain volumes of the NIfTI files were derived from the binary BrainExtractionMask produced by antsBrainExtraction.sh, thereby giving six total brain volumes (TBV) for each participant, for each scan. Each ROI volume was then converted into a ratio of TBV in order to account for hidden confounds such as scanner warmth.
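The ratio correction itself is a single division, sketched below in Python. The function name `tbv_ratio` and the ROI volume of 3330 mm³ are hypothetical values chosen for illustration; the total brain volume is the N4BC average reported in Section 3.4.

```python
def tbv_ratio(roi_volume_mm3, total_brain_volume_mm3):
    """Express an ROI volume as a fraction of total brain volume (TBV)."""
    if total_brain_volume_mm3 <= 0:
        raise ValueError("total brain volume must be positive")
    return roi_volume_mm3 / total_brain_volume_mm3


# Hypothetical ROI volume; TBV is the N4BC average reported in Section 3.4.
ratio = tbv_ratio(3330.0, 1_513_592.0)
ratio_percent = 100.0 * ratio
print(f"{ratio_percent:.2f}% of TBV")
```

The numerator and denominator should come from the same pipeline, as the text describes: FS volumes are divided by the FS-derived TBV and ANTs volumes by the ANTs-derived TBV.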
2.7 Repeated pipeline runs
In order to assess the consistency of the various software used in Sections 2.4–2.6, the data from all participants were processed via the pipeline described above a total of five times. This will help determine if the various software interacts consistently or idiosyncratically with the MRI data, as it is assumed that the algorithms perform consistently. Thus it can be determined whether the significant variance, if any, is the product of the pre-processing software, scanner inconsistencies, or the pre-processing steps within the pipeline. All scripts used in this study are available at https://github.com/nmuncy/Preproc_Effects.
3 Results
3.1 Within-subject repeated measures
All analyses were performed on the volumetric output of six brain regions (left and right putamen, hippocampus, and middle temporal gyrus). To control for potential confounds (e.g. the size and sex of the participant, daily hydration levels, scanner warmth, etc.), each volumetric output was corrected by the total brain volume prior to analyses. The total brain volumes were calculated separately for the FS- and ANTs-based pipelines, and these values were used as denominators for their respective volume measurements, producing a ratio.
To determine the effect of pre-processing steps, we performed a multivariate repeated-measures analysis on the six corrected ROI volumes and the five different pre-processing steps for each of the 115 participants at an α = 0.05 level. That is, the regions of interest and the different methods are the two within-subject factors. The Hotelling’s T²(4,113) value (a generalized F statistic of a repeated measure that determines the discrepancy of sample volume mean and hypothetical mean) of 7349.77 exceeded the critical value (CV) of 10.08 (generalized partial η² = 0.25), indicating that regional ratios of at least one of the pre-processing pipelines differed significantly from the others across the tested regions. In other words, there are significant differences in regional ratios that are dependent on how the scans are pre-processed. Additionally, a main effect of ROI was detected (T²(5,113) = 10648.60, CV = 11.91, generalized partial η² = 0.33). Most interestingly, there is an interaction effect of pre-processing step and ROI (T²(20,113) = 3637.80, CV = 40.47, generalized partial η² = 0.13). This means that pre-processing steps differentially affect the volumetric measurement of ROI for at least one of the steps. Finally, there was no sex effect (F = 0.10, p = .75).
As the main effects and interaction of multivariate analysis were significant, we then performed t-tests for each ROI to find which pre-processing steps significantly differ from each other. Though it is unnecessary to perform any multiple comparison corrections because the multivariate analysis was significant [48, 49], we performed the very conservative Bonferroni correction (Table 1). In this follow-up analysis, we see the differential effects of pre-processing on the regions. For the left and right putamen, three step comparisons produced significantly different volumes after a Bonferroni correction (Table 1). ORIG differed significantly from both RROT and N4BC in both hemispheres, and the right hemisphere differed between the N4SS and FS. For the left and right hippocampus, ORIG produced significantly different volumes from RROT and N4BC, but not from N4SS, and N4SS only differed from FS. The RROT and N4BC steps also produced significantly different volumes in the hippocampi. For the left and right MTG, all step comparisons significantly differed except RROT vs N4BC. Please note that these results only apply when each ROI is corrected by total brain volume produced by each step. Without this correction, there are more ROI by step interactions (see supplementary).
Table 1. Ratio pairwise comparisons of pipelines.
Comparison | LPut t | LPut p | RPut t | RPut p | LHip t | LHip p | RHip t | RHip p | LMTG t | LMTG p | RMTG t | RMTG p |
---|---|---|---|---|---|---|---|---|---|---|---|---|
ORIG vs RROT | 10.99 | <.0001 | 11.20 | <.0001 | 12.61 | <.0001 | 12.33 | <.0001 | 11.16 | <.0001 | 13.48 | <.0001 |
ORIG vs N4BC | 13.2 | <.0001 | 13.40 | <.0001 | 15.50 | <.0001 | 14.57 | <.0001 | 8.65 | <.0001 | 13.50 | <.0001 |
ORIG vs N4SS | -0.81 | .42 | -0.84 | .40 | -0.79 | .43 | -0.87 | .39 | -3.56 | .0005 | -2.73 | .007 |
RROT vs N4BC | 2.30 | .02 | 2.57 | .01 | 3.94 | .0001 | 4.85 | <.0001 | -1.08 | .28 | 1.70 | .09 |
RROT vs N4SS | -1.50 | .14 | -1.52 | .13 | -1.52 | .13 | -1.56 | .12 | -4.28 | <.0001 | -3.64 | .0004 |
N4BC vs N4SS | -1.62 | .11 | -1.65 | .1 | -1.69 | .09 | -1.77 | .08 | -4.26 | <.0001 | -3.74 | .0003 |
N4SS vs FS | -1.67 | .10 | -5.78 | <.0001 | -34.68 | <.0001 | -30.52 | <.0001 | -18.83 | <.0001 | -14.76 | <.0001 |
Significant t-values show which methods of pre-processing produce regional ratios that differ significantly from one another, i.e. which two pre-processing steps produce dissimilar ratios for each region of interest. FS is only compared to N4SS as they have a similar pipeline. Also, these findings indicate that as data move along the overall pipeline that is commonly performed (ORIG → RROT → N4BC → N4SS), volumes significantly differ depending on the brain region of interest. Comparisons are significant at the Bonferroni-corrected threshold of 0.05/7 (p < .0071). L = left, R = right, Put = putamen, Hip = hippocampus, MTG = middle temporal gyrus.
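The Bonferroni threshold used in Table 1 can be applied mechanically, as in the short Python sketch below for the left putamen column. The p-values are copied from Table 1; entries reported as "<.0001" are entered as their upper bound of 0.0001, which is an assumption made only so that they can be compared numerically.

```python
ALPHA = 0.05
N_COMPARISONS = 7  # seven step comparisons per ROI in Table 1
threshold = ALPHA / N_COMPARISONS  # Bonferroni-corrected threshold

# Left putamen p-values from Table 1; "<.0001" entered as 0.0001 (upper bound).
lput_p_values = {
    "ORIG vs RROT": 0.0001,
    "ORIG vs N4BC": 0.0001,
    "ORIG vs N4SS": 0.42,
    "RROT vs N4BC": 0.02,
    "RROT vs N4SS": 0.14,
    "N4BC vs N4SS": 0.11,
    "N4SS vs FS": 0.10,
}

significant = [name for name, p in lput_p_values.items() if p < threshold]
print(f"threshold = {threshold:.4f}")
print(significant)
```

RROT vs N4BC (p = .02) would pass an uncorrected 0.05 threshold but not the corrected one, which is why it is not reported as significant for the putamen.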
3.2 Repeated pipeline runs
The first scan from each participant was processed using the various pre-processing pipelines four more times. These repeated pipeline runs allowed us to investigate whether the significant mean ROI by step differences detected in Section 3.1 are the result of the different pre-processing steps, or whether they were the result of the pre-processing algorithms creating variance that did not previously exist in the actual data. In other words, repeated iterations of the pipelines can tell us either (a) that the difference is the result of pre-processing steps interacting consistently with varying noise inherent in the data, or the result of each step producing a different but consistent amount of noise, or (b) that the steps produce differing and idiosyncratic amounts of noise in the data. We assumed that if the repeated iterations of the pipelines produced statistically identical volumes, then the volumetric differences calculated were the result of one or both parts of (a). Unfortunately, this study cannot tease apart the two parts of (a).
To analyze the five pipeline runs, we performed another multivariate repeated-measures analysis after repeatedly processing all data with FS and each of the ANTs-based pipelines. Both methods produced statistically similar volumes for all five runs (T²(4,114) = 4.08, CV = 10.08, generalized partial η² = 0.0002). This implies that the algorithms are consistently interacting with the data.
As an additional summary measure for the multiple runs, we provide Dice similarity coefficients (DSC) [50, 51] between the runs for each participant and each brain region, as shown in Table 2. DSC is a measure of spatial overlap: twice the volume shared by two ROI labels divided by the sum of their volumes. Values for the DSCs range from 0 to 1, where scores ≥ 0.7 are considered “good” and a score of 1 indicates perfect agreement [52, 53]. All steps had very high similarities.
Table 2. Dice similarity coefficients.
Step | LPut | RPut | LHip | RHip | LMTG | RMTG |
---|---|---|---|---|---|---|
ORIG | 0.999 (0.0001) | 0.999 (0.0002) | 0.999 (0.0002) | 0.999 (0.0002) | 0.999 (0.0008) | 0.999 (0.0007) |
RROT | 0.999 (0.0003) | 0.999 (0.0006) | 0.999 (0.0007) | 0.999 (0.0008) | 0.999 (0.0033) | 0.999 (0.0048) |
N4BC | 0.999 (0.0006) | 0.999 (0.0008) | 0.999 (0.0011) | 0.999 (0.0011) | 0.999 (0.0068) | 0.999 (0.0062) |
N4SS | 0.998 (0.0022) | 0.998 (0.0022) | 0.997 (0.0036) | 0.997 (0.0034) | 0.985 (0.0210) | 0.987 (0.0170) |
FS | 1.000 (0.0000) | 1.000 (0.0000) | 1.000 (0.0000) | 1.000 (0.0000) | 1.000 (0.0000) | 1.000 (0.0000) |
Mean Dice coefficients across runs and participants, mean (SD). All pre-processing steps have extremely high similarities.
3.3 Scan-rescan analysis
To test for scanner consistency, we processed the three repeated scans taken from the OASIS dataset on the 115 participants through the same FS- and ANTs-based pipelines. Once again, we performed a multivariate repeated-measures analysis, now with three within-subject factors (day, method, and ROI). The T²(2,114) value of 0.46 (critical value of 6.21 and generalized partial η² = 0.00004) indicates that the repeated scans of the same participant produced consistent volumes for all the regions tested.
Thus, the significant ROI volumetric differences detected in Section 3.1 are in fact an effect of pre-processing the data; they are not due to algorithmic inconsistencies (Section 3.2), nor are they an artifact of a scan-by-pipeline-step interaction (Section 3.3).
3.4 Percent variability error correction
As the different steps in pre-processing give different mean corrected volumes, this variability must be taken into account before comparing between groups’ regions of interest. One method to account for these differences is to find the percent variability error in scan-rescan for each pipeline step in order to find the error bounds. To calculate the percent variability error, we followed the method described by Tustison et al. [22] and averaged the absolute difference for scan-rescan for each participant in each method. That is, if T_ijk is the kth scan in step j for participant i (i = 1,…,N), then the percent variability error is
(1) |
The percent variability error for each step is given in Table 3. Corrections for all six brain regions are included. Each value indicates the percentage of the specific region value to add to or subtract from each method’s corrected volume when comparing group averages, longitudinal measurements, or the different pre-processing methods. In other words, if researchers are using the N4BC ratio pipeline, their left hippocampal corrected volumes should be considered as X ± 0.0151X (PVE = 1.51) or as
$$X\left(1 \pm \frac{\mathrm{PVE}}{100}\right) \qquad (2)$$
That is, if the hippocampal volume was 0.22% of the total brain volume for the N4BC, then this value should be treated as 0.22% ± 0.003%, thereby giving the range of left hippocampal volume percentage as 0.217%– 0.223%.
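A hedged Python sketch of this calculation follows. It assumes the two-scan relative difference of Tustison et al., |T1 − T2| / (0.5 × (T1 + T2)), averaged over every pair of a participant's scans and then over participants; how the three scans were actually paired in this study is an assumption here, and the function names and toy volumes are hypothetical.

```python
from itertools import combinations

import numpy as np


def percent_variability_error(volumes):
    """Mean relative scan-rescan difference, in percent.

    volumes: array of shape (n_participants, n_scans) for one step and ROI.
    Assumption: every pair of scans is compared and all pairs are averaged.
    """
    v = np.asarray(volumes, dtype=float)
    pair_errors = [
        np.abs(v[:, a] - v[:, b]) / (0.5 * (v[:, a] + v[:, b]))
        for a, b in combinations(range(v.shape[1]), 2)
    ]
    return 100.0 * float(np.mean(pair_errors))


def pve_bounds(value, pve):
    """Lower and upper bounds: value -/+ (pve / 100) * value."""
    half_width = value * pve / 100.0
    return value - half_width, value + half_width


# Toy corrected volumes for two participants with three scans each (assumed).
toy = [[100.0, 102.0, 98.0], [200.0, 200.0, 200.0]]
toy_pve = percent_variability_error(toy)

# Worked example from the text: 0.22% of TBV with PVE = 1.51 (N4BC, LHip).
lower, upper = pve_bounds(0.22, 1.51)
print(round(toy_pve, 4), round(lower, 3), round(upper, 3))
```

The second call reproduces the bounds quoted in the text: 0.22 × 0.0151 ≈ 0.003, giving roughly 0.217% to 0.223%.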
Table 3. The mean percent variability error for each method.
Step | LPut | RPut | LHip | RHip | LMTG | RMTG |
---|---|---|---|---|---|---|
ORIG | 1.00 | 1.14 | 1.36 | 1.17 | 1.40 | 1.18 |
RROT | 1.40 | 1.31 | 1.51 | 1.41 | 1.56 | 1.43 |
N4BC | 1.48 | 1.43 | 1.51 | 1.46 | 1.60 | 1.45 |
N4SS | 3.41 | 3.26 | 3.37 | 3.55 | 3.64 | 3.71 |
FS | 2.42 | 2.41 | 2.35 | 1.86 | 3.09 | 2.58 |
Because the pre-processing methods are significantly different from each other, each step in each region needs a correction to be able to compare different methods. Though the percent variability error may appear small, the values in this table are for the regional volume divided by the total brain volume.
While these percentages may not seem significant, consider the percentages in terms of actual volume. In this study, the average total brain volume produced by N4BC was 1,513,592 mm³. The left hippocampal volume range of 0.217%–0.223% becomes 3284.5–3375.3 mm³, translating to a range of about 90.8 mm³. This is relevant in the comparison of groups using the same pipeline, or of various studies using differing pipelines, where the upper bounds of one group or pre-processing pipeline may actually fall within the lower bounds of another. Similar to a 95% confidence interval when estimating parameters, this range given by the percent variability error takes the variance of the pre-processing method into account. This is so a comparison between groups may be representative of independent variables and not simply the differences of pre-processing methods or other confounds.
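The conversion from percentage bounds to absolute volume can be checked with a few lines of Python. All input numbers are taken from the text above; only the variable names are introduced here.

```python
# Average total brain volume produced by N4BC, in cubic millimetres.
total_brain_volume_mm3 = 1_513_592

# Left hippocampal bounds as a percentage of total brain volume.
lower_percent, upper_percent = 0.217, 0.223

lower_mm3 = total_brain_volume_mm3 * lower_percent / 100.0
upper_mm3 = total_brain_volume_mm3 * upper_percent / 100.0
range_mm3 = upper_mm3 - lower_mm3

print(f"{lower_mm3:.1f} to {upper_mm3:.1f} mm^3, range {range_mm3:.1f} mm^3")
```

With a voxel volume of 1 × 0.977 × 0.977 ≈ 0.95 mm³ (Section 2.1), a range of about 90.8 mm³ corresponds to roughly 95 voxels.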
4 Discussion
4.1 Summary
This study investigated the effects of certain pre-processing steps in registration-based volumetric ROI segmentation. Analyses showed a main effect of pre-processing step, indicating that the various steps investigated significantly altered the ROI-corrected volume. Further, and importantly, an interaction of pre-processing step and ROI was discovered, indicating that the alterations in ROI-corrected volumes were not consistent across different brain regions. The significantly different corrected volumes found between the pre-processing steps may be the result of the algorithm of each pre-processing step interacting with the data in a consistent but unique fashion, although it is possible that the algorithms were in fact interacting with each scan idiosyncratically. We tested this assumption by running all data through the same pipeline four additional times; no effect of run was detected. Additionally, it was possible that the volumetric differences resulted from inconsistent scanner output, thereby producing an interactive effect between the pre-processing step and MRI noise. To test this, three scans of each participant were processed through the same pipeline steps; an effect of repeated scans was not detected, indicating consistent MRI output. As such, we concluded that it was the pre-processing steps that significantly changed the ROI-corrected volumes, step to step, and not the algorithms performing inconsistently or variance from the scanner itself. Finally, we supplied a correction for each step and for each ROI to account for the noise in the data resulting from pre-processing.
4.2 Limitations
First, only a few specific pre-processing steps were investigated in this study, and as such the generalizability of our findings is quite restricted. In this preliminary study, an effort was made to investigate whether or not unaccounted-for and significant variance existed in registration-based volumetric data, and whether the noise was a result of pipeline algorithms, scanner output, or pre-processing protocol. As such, we used pre-processing steps based within the same software suites in order to reduce potential confounds. While significantly different volumes, resulting from pipeline steps, were detected, such findings are constrained to the parameters of our study. It is reasonable, however, that significant variance resulting from pre-processing will be detected in other pipelines, in accordance with our findings, as pre-processing algorithms alter the data directly. Further, and more importantly, our attempt to quantify variance that is unaccounted for was done in order to address a deficit in the literature: while numerous studies have investigated differences between differing pipelines [17, 18, 22, 27, 45, 54–61], these do not investigate the effects of pre-processing on the data within a single pipeline. If, as was detected in this study, pre-processing steps significantly change the data, then it may not be meaningful to compare studies which differ in as little as a single pre-processing step, and may be even less meaningful to compare studies using different pre-processing software.
Second, while we investigated discrete steps within a single pipeline, we did not investigate the impact of various parameters within a single step. This was beyond the scope of this study; we attempted to optimize the steps that were used according to extant literature in order to investigate the impact of each step in the pipeline. Different parameters would assuredly change the impact of each step, but comprehensively investigating all permutations of parameters within a pipeline would overwhelm the statistical models. Further, certain combinations of parameters would be unjustifiable from the literature and inappropriate to use. It would be relevant in a subsequent study to investigate various arguments in order to see if the impact of each step in a pipeline could be optimized.
Third, only three bilateral ROIs were included in the analysis. A small number of ROIs were specifically used in order to avoid overwhelming the statistical models. The hippocampus was selected as it is known to be sensitive to pre-processing in registration-based pipelines due to a similar intensity signature with the amygdala and the thin alveus which delineates anterior hippocampus from posterior/ventral amygdala [35, 61]. The putamen was selected as we considered it to have good white-matter boundaries with other subcortical gray-matter structures. The middle temporal gyrus was selected both in order to have a cortical structure as well as to assess the impact of skull-stripping in this pipeline, as it has been established [27, 60, 62] that skull-stripping impacts cortical volume.
Fourth, a single scanning sequence was used in investigating the effect of pre-processing step on ROI volumes. Using the established OASIS dataset was done in order to decrease potential sources of noise in the data. It is likely that differing scanning sequences would produce different amounts of noise, which may produce an interaction with the pre-processing pipeline. This, however, was also beyond the scope of this study, and warrants further investigation.
Fifth, the pipelines in this study used only registration-based segmentation. This was done because it was the intention of this preliminary study to look at the impact on the data of various steps within a single pipeline. Different pipelines (e.g. non-registration-based) would undoubtedly have different variances associated with them and would be worth investigating.
4.3 Recommendations
As each step in the pre-processing pipeline produced differences in either volume or variance, we recommend calculating the percent variance error (PVE) correction, to account for the differential effects that these pre-processing steps have on the data, before comparing analyses. Volumetric and morphometric studies reporting significance with a small difference in voxel numbers are readily available [63, 64–74], and may be false positives as the data consists, in part, of unaccounted noise, some of which stems from pre-processing. A difference of a few tens of voxels in sensitive longitudinal or multi-group studies may well be accounted for by the upper and lower bounds provided by the PVE correction, particularly if there are differences in pre-processing steps. Additional noise is probable if there are further differences in scanning sequences, MRI scanners, software suites, pipeline steps, and arguments used.
We also recommend, echoing Tustison et al. [75], that a detailed description of the pre-processing used be provided in order to clarify any impact that pre-processing may have had on the overall outcome of the analysis. Additionally, reporting the pre-processing steps will make replication easier [76, 77].
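One possible way to provide such a description is a machine-readable record of each step, sketched below in Python. The `PipelineStep` fields, the `describe_pipeline` helper, and the step, tool, and argument names are all hypothetical; they are not the format used by the scripts accompanying this study.

```python
import json
from dataclasses import dataclass, asdict, field


@dataclass
class PipelineStep:
    """One pre-processing step (hypothetical record layout)."""
    order: int
    name: str
    tool: str
    version: str
    arguments: dict = field(default_factory=dict)


def describe_pipeline(steps):
    """Serialize the steps to JSON, sorted by execution order."""
    ordered = sorted(steps, key=lambda step: step.order)
    return json.dumps([asdict(step) for step in ordered], indent=2)


if __name__ == "__main__":
    # Placeholder entries; replace with the steps actually run.
    steps = [
        PipelineStep(2, "registration", "example_tool_b", "1.0"),
        PipelineStep(1, "skull_stripping", "example_tool_a", "2.1",
                     {"example_argument": 4}),
    ]
    print(describe_pipeline(steps))
```

A record of this kind could be published alongside the analysis scripts so that the order, software versions, and arguments of each step are unambiguous.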
Acknowledgments
This study was conducted using data from the Open Access Series of Imaging Studies (OASIS) project.
Data Availability
All scripts used for pre-processing and statistical analyses are available at https://github.com/nmuncy/Preproc_Effects, and the MRI data are hosted at http://www.oasis-brains.org/app/template/Index.vm.
Funding Statement
This work was supported by National Institutes of Health (US) R01 AG021910, P50 MH071616, U24 RR021382, R01 MH56584, P50 AG05681, P01 AG03991.
References
- 1. Stewart JC, Tran X, Cramer SC. Age-related variability in performance of a motor action selection task is related to differences in brain function and structure among older adults. NeuroImage. 2014;86: 326–34. 10.1016/j.neuroimage.2013.10.016
- 2. Pangelinan MM, Zhang G, VanMeter JW, Clark JE, Hatfield BD, Haufler AJ. Beyond age and gender: Relationships between cortical and subcortical brain volume and cognitive-motor abilities in school-age children. NeuroImage. 2011;54(4): 3093–100. 10.1016/j.neuroimage.2010.11.021
- 3. Sandman CA, Head K, Muftuler LT, Su L, Buss C, Davis EP. Shape of the basal ganglia in preadolescent children is associated with cognitive performance. NeuroImage. 2014;99: 93–102. 10.1016/j.neuroimage.2014.05.020
- 4. Sinanaj I, Cojan Y, Vuilleumier P. Inter-individual variability in metacognitive ability for visuomotor performance and underlying brain structures. Consciousness and Cognition. 2015;36: 327–37. 10.1016/j.concog.2015.07.012
- 5. Csernansky JG, Wang L, Jones D, Rastogi-Cruz D, Posener JA, Heydebrand G, et al. Hippocampal Deformities in Schizophrenia Characterized by High Dimensional Brain Mapping. American Journal of Psychiatry. 2002;159(12): 2000–6. 10.1176/appi.ajp.159.12.2000
- 6. Csernansky JG, Wang L, Joshi SC, Tilak Ratnanather J, Miller MI. Computational anatomy and neuropsychiatric disease: probabilistic assessment of variation and statistical inference of group difference, hemispheric asymmetry, and time-dependent change. Mathematics in Brain Imaging. 2004;23, Supplement 1(0): S56–S68.
- 7. Mak E, Su L, Williams GB, Watson R, Firbank M, Blamire AM, et al. Longitudinal assessment of global and regional atrophy rates in Alzheimer's disease and dementia with Lewy bodies. NeuroImage: Clinical. 2015;7: 456–62. 10.1016/j.nicl.2015.01.017
- 8. Nakamura K, Fox R, Fisher E. CLADA: Cortical longitudinal atrophy detection algorithm. NeuroImage. 2011;54(1): 278–89. 10.1016/j.neuroimage.2010.07.052
- 9. Smith SM, Rao A, De Stefano N, Jenkinson M, Schott JM, Matthews PM, et al. Longitudinal and cross-sectional analysis of atrophy in Alzheimer's disease: Cross-validation of BSI, SIENA and SIENAX. NeuroImage. 2007;36(4): 1200–6. 10.1016/j.neuroimage.2007.04.035
- 10. van Erp TGM, Greve DN, Rasmussen J, Turner J, Calhoun VD, Young S, et al. A multi-scanner study of subcortical brain volume abnormalities in schizophrenia. Psychiatry Research: Neuroimaging. 2014;222(1–2): 10–6. 10.1016/j.pscychresns.2014.02.011
- 11. Wang L, Miller JP, Gado MH, McKeel DW, Rothermich M, Miller MI, et al. Abnormalities of hippocampal surface structure in very mild dementia of the Alzheimer type. NeuroImage. 2006;30(1): 52–60. 10.1016/j.neuroimage.2005.09.017
- 12. Csernansky JG, Wang L, Swank J, Miller JP, Gado M, McKeel D, et al. Preclinical detection of Alzheimer's disease: hippocampal shape and volume predict dementia onset in the elderly. NeuroImage. 2005;25(3): 783–92. 10.1016/j.neuroimage.2004.12.036
- 13. Aarnink SH, Vos SB, Leemans A, Jernigan TL, Madsen KS, Baaré WFC. Automated longitudinal intra-subject analysis (ALISA) for diffusion MRI tractography. NeuroImage. 2014;86: 404–16. 10.1016/j.neuroimage.2013.10.026
- 14. Avants BB, Epstein CL, Grossman M, Gee JC. Symmetric diffeomorphic image registration with cross-correlation: Evaluating automated labeling of elderly and neurodegenerative brain. Special Issue on The Third International Workshop on Biomedical Image Registration—WBIR 2006. 2008;12(1): 26–41.
- 15. Crum WR, Scahill RI, Fox NC. Automated Hippocampal Segmentation by Regional Fluid Registration of Serial MRI: Validation and Application in Alzheimer's Disease. NeuroImage. 2001;13(5): 847–55. 10.1006/nimg.2001.0744
- 16. de Flores R, La Joie R, Chételat G. Structural imaging of hippocampal subfields in healthy aging and Alzheimer’s disease. Neuroscience. 2015;309: 29–50. 10.1016/j.neuroscience.2015.08.033
- 17. Klein A, Andersson J, Ardekani BA, Ashburner J, Avants B, Chiang M, et al. Evaluation of 14 nonlinear deformation algorithms applied to human brain MRI registration. NeuroImage. 2009;46(3): 786–802. 10.1016/j.neuroimage.2008.12.037
- 18. Klein A, Ghosh SS, Avants B, Yeo BTT, Fischl B, Ardekani BA, et al. Evaluation of volume-based and surface-based brain image registration methods. NeuroImage. 2010;51(1): 214–20. 10.1016/j.neuroimage.2010.01.091
- 19. Mulder ER, de Jong RA, Knol DL, van Schijndel RA, Cover KS, Visser PJ, et al. Hippocampal volume change measurement: Quantitative assessment of the reproducibility of expert manual outlining and the automated methods FreeSurfer and FIRST. NeuroImage. 2014;92: 169–81. 10.1016/j.neuroimage.2014.01.058
- 20. Fischl B. FreeSurfer. NeuroImage. 2012;62(2): 774–81. 10.1016/j.neuroimage.2012.01.021
- 21. Avants BB, Tustison NJ, Song G, Cook PA, Klein A, Gee JC. A reproducible evaluation of ANTs similarity metric performance in brain image registration. NeuroImage. 2011;54(3): 2033–44. 10.1016/j.neuroimage.2010.09.025
- 22. Tustison NJ, Cook PA, Klein A, Song G, Das SR, Duda JT, et al. Large-scale evaluation of ANTs and FreeSurfer cortical thickness measurements. NeuroImage. 2014;99(0): 166–79. 10.1016/j.neuroimage.2014.05.044
- 23. Bazin P-L, Weiss M, Dinse J, Schäfer A, Trampel R, Turner R. A computational framework for ultra-high resolution cortical segmentation at 7 Tesla. NeuroImage. 2014;93, Part 2: 201–9. 10.1016/j.neuroimage.2013.03.077
- 24. Focke NK, Helms G, Kaspar S, Diederich C, Tóth V, Dechent P, et al. Multi-site voxel-based morphometry—Not quite there yet. NeuroImage. 2011;56(3): 1164–70. 10.1016/j.neuroimage.2011.02.029
- 25. Jovicich J, Czanner S, Han X, Salat D, van der Kouwe A, Quinn B, et al. MRI-derived measurements of human subcortical, ventricular and intracranial brain volumes: Reliability effects of scan sessions, acquisition sequences, data analyses, scanner upgrade, scanner vendors and field strengths. NeuroImage. 2009;46(1): 177–92. 10.1016/j.neuroimage.2009.02.010
- 26. Keihaninejad S, Heckemann RA, Fagiolo G, Symms MR, Hajnal JV, Hammers A. A robust method to estimate the intracranial volume across MRI field strengths (1.5T and 3T). NeuroImage. 2010;50(4): 1427–37. 10.1016/j.neuroimage.2010.01.064
- 27. Acosta-Cabronero J, Williams GB, Pereira JM, Pengas G, Nestor PJ. The impact of skull-stripping and radio-frequency bias correction on grey-matter segmentation for voxel-based morphometry. NeuroImage. 2008;39(4): 1654–65. 10.1016/j.neuroimage.2007.10.051
- 28. Reuter M, Schmansky NJ, Rosas HD, Fischl B. Within-subject template estimation for unbiased longitudinal image analysis. NeuroImage. 2012;61(4): 1402–18. 10.1016/j.neuroimage.2012.02.084
- 29. Aubert-Broche B, Fonov VS, García-Lorenzo D, Mouiha A, Guizard N, Coupé P, et al. A new method for structural volume analysis of longitudinal brain MRI data and its application in studying the growth trajectories of anatomical brain structures in childhood. NeuroImage. 2013;82: 393–402. 10.1016/j.neuroimage.2013.05.065
- 30. de Boer R, Vrooman HA, Ikram MA, Vernooij MW, Breteler MMB, van der Lugt A, et al. Accuracy and reproducibility study of automatic MRI brain tissue segmentation methods. NeuroImage. 2010;51(3): 1047–56. 10.1016/j.neuroimage.2010.03.012
- 31. Droby A, Lukas C, Schänzer A, Spiwoks-Becker I, Giorgio A, Gold R, et al. A human post-mortem brain model for the standardization of multi-centre MRI studies. NeuroImage. 2015;110: 11–21. 10.1016/j.neuroimage.2015.01.028
- 32. Morey RA, Selgrade ES, Wagner HR 2nd, Huettel SA, Wang L, McCarthy G. Scan-rescan reliability of subcortical brain volumes derived from automated segmentation. Human brain mapping. 2010;31(11): 1751–62. Epub 2010/02/18. 10.1002/hbm.20973
- 33. Maltbie E, Bhatt K, Paniagua B, Smith RG, Graves MM, Mosconi MW, et al. Asymmetric bias in user guided segmentations of brain structures. NeuroImage. 2012;59(2): 1315–23. 10.1016/j.neuroimage.2011.08.025
- 34. Marcus DS, Wang TH, Parker J, Csernansky JG, Morris JC, Buckner RL. Open Access Series of Imaging Studies (OASIS): cross-sectional MRI data in young, middle aged, nondemented, and demented older adults. Journal of cognitive neuroscience. 2007;19(9): 1498–507. 10.1162/jocn.2007.19.9.1498
- 35. Fischl B, Salat DH, Busa E, Albert M, Dieterich M, Haselgrove C, et al. Whole Brain Segmentation: Automated Labeling of Neuroanatomical Structures in the Human Brain. Neuron. 2002;33(3): 341–55.
- 36. Collins DL, Zijdenbos AP, Baaré WF, Evans AC, editors. ANIMAL+ INSECT: improved cortical structure segmentation. Biennial International Conference on Information Processing in Medical Imaging; 1999: Springer.
- 37. Fonov V, Evans AC, Botteron K, Almli CR, McKinstry RC, Collins DL. Unbiased average age-appropriate atlases for pediatric studies. NeuroImage. 2011;54(1): 313–27. 10.1016/j.neuroimage.2010.07.033
- 38. Fonov VS, Evans AC, McKinstry RC, Almli C, Collins D. Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. NeuroImage. 2009;47: S102.
- 39. Klein A, Tourville J. 101 Labeled Brain Images and a Consistent Human Cortical Labeling Protocol. Frontiers in Neuroscience. 2012;6(171). 10.3389/fnins.2012.00171
- 40. Wang H, Suh JW, Das SR, Pluta J, Craige C, Yushkevich PA. Multi-Atlas Segmentation with Joint Label Fusion. IEEE transactions on pattern analysis and machine intelligence. 2013;35(3): 611–23. 10.1109/TPAMI.2012.143
- 41. Klein A, Ghosh SS, Bao FS, Giard J, Häme Y, Stavsky E, et al. Mindboggling morphometry of human brains. PLoS computational biology. 2017;13(2): e1005350 10.1371/journal.pcbi.1005350
- 42. Yushkevich PA, Piven J, Hazlett HC, Smith RG, Ho S, Gee JC, et al. User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. NeuroImage. 2006;31(3): 1116–28. 10.1016/j.neuroimage.2006.01.015
- 43. Coupé P, Manjón JV, Fonov V, Pruessner J, Robles M, Collins DL. Patch-based segmentation using expert priors: Application to hippocampus and ventricle segmentation. NeuroImage. 2011;54(2): 940–54. 10.1016/j.neuroimage.2010.09.018
- 44. Eskildsen SF, Coupé P, Fonov V, Manjón JV, Leung KK, Guizard N, et al. BEaST: brain extraction based on nonlocal segmentation technique. NeuroImage. 2012;59(3): 2362–73. 10.1016/j.neuroimage.2011.09.012
- 45. Tustison NJ, Avants BB, Cook PA, Zheng Y, Egan A, Yushkevich PA, et al. N4ITK: improved N3 bias correction. IEEE transactions on medical imaging. 2010;29(6): 1310–20. 10.1109/TMI.2010.2046908
- 46. Hodgetts CJ, Voets NL, Thomas AG, Clare S, Lawrence AD, Graham KS. Ultra-high-field fMRI reveals a role for the subiculum in scene perceptual discrimination. Journal of Neuroscience. 2017;37(12): 3150–9. 10.1523/JNEUROSCI.3225-16.2017
- 47. Dice LR. Measures of the Amount of Ecologic Association Between Species. Ecology. 1945;26(3): 297–302.
- 48. Hummel TJ, Sligo J. Empirical comparison of univariate and multivariate analysis of variance procedures. Psychological Bulletin. 1971;76(1): 49–57. http://psycnet.apa.org/doi/10.1037/h0031323
- 49. Rencher AC, Christensen WF. Methods of Multivariate Analysis. 3rd ed. Hoboken, New Jersey: John Wiley & Sons, Inc.; 2012.
- 50. Crum WR, Camara O, Hill DL. Generalized overlap measures for evaluation and validation in medical image analysis. IEEE transactions on medical imaging. 2006;25(11): 1451–61. 10.1109/TMI.2006.880587
- 51. Cha S-H. Comprehensive survey on distance/similarity measures between probability density functions. City. 2007;1(2): 1.
- 52. Bartko JJ. Measurement and reliability: Statistical thinking considerations. Schizophrenia Bulletin. 1991;17(3): 483–9.
- 53. Zijdenbos AP, Dawant BM, Margolin RA, Palmer AC. Morphometric analysis of white matter lesions in MR images: method and validation. IEEE Transactions on Medical Imaging. 1994;13(4): 716–24. 10.1109/42.363096
- 54. Sone D, Sato N, Maikusa N, Ota M, Sumida K, Yokoyama K, et al. Automated subfield volumetric analysis of hippocampus in temporal lobe epilepsy using high-resolution T2-weighed MR imaging. NeuroImage: Clinical. 2016;12: 57–64.
- 55. Schwarz CG, Gunter JL, Wiste HJ, Przybelski SA, Weigand SD, Ward CP, et al. A large-scale comparison of cortical thickness and volume methods for measuring Alzheimer's disease severity. NeuroImage: Clinical. 2016;11: 802–12.
- 56. Yousefi S, Kehtarnavaz N, Gholipour A. Improved labeling of subcortical brain structures in atlas-based segmentation of magnetic resonance images. IEEE Transactions on Biomedical Engineering. 2012;59(7): 1808–17. 10.1109/TBME.2011.2122306
- 57. Ghosh SS, Kakunoori S, Augustinack J, Nieto-Castanon A, Kovelman I, Gaab N, et al. Evaluating the validity of volume-based and surface-based brain image registration for developmental cognitive neuroscience studies in children 4 to 11 years of age. NeuroImage. 2010;53(1): 85–93. 10.1016/j.neuroimage.2010.05.075
- 58. Khan AR, Wang L, Beg MF. FreeSurfer-initiated fully-automated subcortical brain segmentation in MRI using Large Deformation Diffeomorphic Metric Mapping. NeuroImage. 2008;41(3): 735–46. 10.1016/j.neuroimage.2008.03.024
- 59. Crum WR, Rueckert D, Jenkinson M, Kennedy D, Smith SM, editors. A framework for detailed objective comparison of non-rigid registration algorithms in neuroimaging. International Conference on Medical Image Computing and Computer-Assisted Intervention; 2004: Springer.
- 60. Lee J-M, Yoon U, Nam SH, Kim J-H, Kim I-Y, Kim SI. Evaluation of automated and semi-automated skull-stripping algorithms using similarity index and segmentation error. Computers in biology and medicine. 2003;33(6): 495–507.
- 61. Chupin M, Mukuna-Bantumbakulu AR, Hasboun D, Bardinet E, Baillet S, Kinkingnéhun S, et al. Anatomically constrained region deformation for the automated segmentation of the hippocampus and the amygdala: Method and validation on controls and patients with Alzheimer’s disease. NeuroImage. 2007;34(3): 996–1019. 10.1016/j.neuroimage.2006.10.035
- 62. Chiverton J, Wells K, Lewis E, Chen C, Podda B, Johnson D. Statistical morphological skull stripping of adult and infant MRI data. Computers in biology and medicine. 2007;37(3): 342–57. 10.1016/j.compbiomed.2006.04.001
- 63. Baaré WF, Vinberg M, Knudsen GM, Paulson OB, Langkilde AR, Jernigan TL, et al. Hippocampal volume changes in healthy subjects at risk of unipolar depression. Journal of Psychiatric Research. 2010;44(10): 655–62. 10.1016/j.jpsychires.2009.12.009
- 64. Cheng Y-q, Xu J, Chai P, Li H-j, Luo C-r, Yang T, et al. Brain volume alteration and the correlations with the clinical characteristics in drug-naïve first-episode MDD patients: A voxel-based morphometry study. Neuroscience Letters. 2010;480(1): 30–4. 10.1016/j.neulet.2010.05.075
- 65. Ebdrup BH, Langkilde AR, Paulson OB. Hippocampal and caudate volume reductions in antipsychotic-naive first-episode schizophrenia. Journal of psychiatry & neuroscience: JPN. 2010;35(2): 95.
- 66. Frodl T, Skokauskas N. Meta‐analysis of structural MRI studies in children and adults with attention deficit hyperactivity disorder indicates treatment effects. Acta Psychiatrica Scandinavica. 2012;125(2): 114–26. 10.1111/j.1600-0447.2011.01786.x
- 67. Machado-de-Sousa JP, de Lima Osório F, Jackowski AP, Bressan RA, Chagas MH, Torro-Alves N, et al. Increased amygdalar and hippocampal volumes in young adults with social anxiety. PloS one. 2014;9(2): e88523 10.1371/journal.pone.0088523
- 68. Malykhin NV, Seres P, Coupland NJ. Structural changes in the hippocampus in major depressive disorder: contributions of disease and treatment. Journal of psychiatry & neuroscience: JPN. 2010;35(5): 337.
- 69. Meisenzahl E, Koutsouleris N, Reiser M, Möller H-J, Frodl T. Structural MRI correlates for vulnerability and resilience to major depressive disorder. Journal of psychiatry & neuroscience: JPN. 2011;36(1): 15.
- 70. Meisenzahl EM, Seifert D, Bottlender R, Teipel S, Zetzsche T, Jäger M, et al. Differences in hippocampal volume between major depression and schizophrenia: a comparative neuroimaging study. European Archives of Psychiatry and Clinical Neuroscience. 2010;260(2): 127–37. 10.1007/s00406-009-0023-3
- 71. Moreno-Alcázar A, Ramos-Quiroga JA, Radua J, Salavert J, Palomar G, Bosch R, et al. Brain abnormalities in adults with Attention Deficit Hyperactivity Disorder revealed by voxel-based morphometry. Psychiatry Research: Neuroimaging. 2016;254: 41–7. 10.1016/j.pscychresns.2016.06.002
- 72. Nakao T, Radua J, Rubia K, Mataix-Cols D. Gray matter volume abnormalities in ADHD: voxel-based meta-analysis exploring the effects of age and stimulant medication. American Journal of Psychiatry. 2011;168(11): 1154–63. 10.1176/appi.ajp.2011.11020281
- 73. O'Dwyer L, Lamberton F, Matura S, Tanner C, Scheibe M, Miller J, et al. Reduced hippocampal volume in healthy young ApoE4 carriers: an MRI study. PloS one. 2012;7(11): e48895 10.1371/journal.pone.0048895
- 74. Palacios EM, Sala-Llonch R, Junque C, Fernandez-Espejo D, Roig T, Tormos JM, et al. Long-term declarative memory deficits in diffuse TBI: correlations with cortical thickness, white matter integrity and hippocampal volume. Cortex. 2013;49(3): 646–57. 10.1016/j.cortex.2012.02.011
- 75. Tustison NJ, Johnson HJ, Rohlfing T, Klein A, Ghosh SS, Ibanez L, et al. Instrumentation bias in the use and evaluation of scientific software: recommendations for reproducible practices in the computational sciences. Front Neurosci. 2013;7: 162 Epub 2013/09/24. 10.3389/fnins.2013.00162
- 76. Andronache A, Rosazza C, Sattin D, Leonardi M, D'Incerti L, Minati L. Impact of functional MRI data preprocessing pipeline on default-mode network detectability in patients with disorders of consciousness. Frontiers in Neuroinformatics. 2013;7(16). 10.3389/fninf.2013.00016
- 77. Glasser MF, Sotiropoulos SN, Wilson JA, Coalson TS, Fischl B, Andersson JL, et al. The minimal preprocessing pipelines for the Human Connectome Project. NeuroImage. 2013;80: 105–24. 10.1016/j.neuroimage.2013.04.127