Skip to main content
Human Brain Mapping logoLink to Human Brain Mapping
. 2022 Feb 3;43(6):1973–1983. doi: 10.1002/hbm.25767

Restoring statistical validity in group analyses of motion‐corrupted MRI data

Antoine Lutti 1,, Nadège Corbin 2,3, John Ashburner 3, Gabriel Ziegler 4, Bogdan Draganski 1,5, Christophe Phillips 6, Ferath Kherif 1, Martina F Callaghan 3, Giulia Di Domenicantonio 1
PMCID: PMC8933245  PMID: 35112434

Abstract

Motion during the acquisition of magnetic resonance imaging (MRI) data degrades image quality, hindering our capacity to characterise disease in patient populations. Quality control procedures allow the exclusion of the most affected images from analysis. However, the criterion for exclusion is difficult to determine objectively and exclusion can lead to a suboptimal compromise between image quality and sample size. We provide an alternative, data‐driven solution that assigns weights to each image, computed from an index of image quality using restricted maximum likelihood. We illustrate this method through the analysis of quantitative MRI data. The proposed method restores the validity of statistical tests, and performs near optimally in all brain regions, despite local effects of head motion. This method is amenable to the analysis of a broad type of MRI data and can accommodate any measure of image quality.

Keywords: heteroscedasticity, motion artefact, quality control, quantitative MRI, statistical image analysis


Motion during the acquisition of magnetic resonance imaging (MRI) data degrades image quality. We provide a data‐driven solution that assigns weights to each image of an analysis, computed from an index of image quality using restricted maximum likelihood. The proposed method restores the validity of statistical tests, is amenable to the analysis of a broad type of MRI data and can accommodate any measure of image quality.

graphic file with name HBM-43-1973-g004.jpg

1. INTRODUCTION

Movement notoriously degrades magnetic resonance imaging (MRI) data, leading to prolonged examinations and increased costs in clinical applications (Andre et al., 2015; Makowski, Lepage, & Evans, 2019). Head movement also impacts the estimates of brain features extracted from MRI data (Epstein et al., 2007; Power, Barnes, Snyder, Schlaggar, & Petersen, 2012; Satterthwaite et al., 2012; Tisdall et al., 2016; Van Dijk, Sabuncu, & Buckner, 2012), and can lead to spurious detection or suppression of anatomy differences in neuroscience studies (Makowski et al., 2019). This issue is particularly acute for the study of non‐compliant patient populations, where the effects of brain disease and head motion cannot be separated (Callicott et al., 1998; Havsteen et al., 2017). Quality control procedures exist that help mitigate the effect of image degradation on analysis results. These procedures require an assessment of data quality, provided by a motion degradation index (MDI) (Castella et al., 2018; Esteban et al., 2017; Mortamet et al., 2009; Reuter et al., 2015; Rosen et al., 2018; Savalia et al., 2017). MDIs computed using dedicated image analysis routines require little labour investment and have become instrumental in the oversight of the large data cohorts that have emerged in recent years (Miller et al., 2016; German National Cohort (GNC) Consortium, 2014; Breteler, Stöcker, Pracht, Brenner, & Stirnberg, 2014; Satterthwaite et al., 2014; Glasser et al., 2016). Recent findings suggest that they may provide higher sensitivity for detecting motion artefacts than visual assessment (Alexander‐Bloch et al., 2016; Reuter et al., 2015) and, combined with supervised learning methods (Alfaro‐Almagro et al., 2018; Esteban et al., 2017; Pizarro et al., 2016), might allow the automated identification of images usable in subsequent analyses.

Quality control is typically followed by dichotomising the data into images that are either suitable (‘accept’) or unsuitable (‘exclude’) for analysis. Here, the threshold value of the MDI between the ‘exclude’ and ‘accept’ categories can be difficult to determine. Also, the effects of motion on MRI data are continuous (Alexander‐Bloch et al., 2016; Reuter et al., 2015) and it is likely that no hard categorisation of the data might achieve optimal compromise between image quality and sample size. We propose an alternative method that assigns a weight to each image within a cohort, computed from its MDI value using the restricted maximum likelihood (REML) algorithm (Diedrichsen & Shadmehr, 2005; Friston et al., 2002). The weights are specific to each image and down‐weight low quality images in subsequent analyses. We illustrate this method through the analysis of a large cohort (1,432 participants) of quantitative MRI (qMRI) data. qMRI data are in vivo biomarkers of brain microstructure (Fukunaga et al., 2010; Langkammer et al., 2010; Lutti, Dick, Sereno, & Weiskopf, 2014) that have great potential for clinical neuroscience (Barbosa et al., 2015; Khalil et al., 2011; Ropele et al., 2011). For the estimation of the weights by REML, we choose the MDI introduced by Castella et al. (2018), because this index was validated empirically against the history of head motion during data acquisition.

We show that in conventional analyses, degradation of image quality due to motion invalidates any assumption of identical variance for all samples (‘homoscedasticity’). While heteroscedasticity has little impact on the model coefficient estimates in a general linear model, the SE of the coefficients can be poorly estimated, leading to invalid statistical inference (Hayes & Cai, 2007). The proposed method, called QUIQI for ‘analysis of QUantitative Imaging data using a Quality Index’, restores homoscedasticity, ensuring the validity of statistical tests. With QUIQI, the improvements in homoscedasticity are superior to those obtained by inserting the MDI in the design matrix of the analysis. This global approach provides near optimal results in whole‐brain analysis of neuroimaging data, despite local effects of motion. The framework has been implemented in the popular, open‐source neuroimaging analysis software Statistical Parametric Mapping (www.fil.ion.ucl.ac.uk/spm, Wellcome Centre for Human Neuroimaging). The framework is flexible and amenable to other MDIs and to the analysis of other types of MRI data.

2. METHODS

2.1. Participant cohort

MRI data were acquired on 1,432 healthy research participants (743 females), as part of ‘BrainLaus’ (https://www.colaus-psycolaus.ch/professionals/brainlaus/) (Lorio et al., 2016; Melie‐Garcia et al., 2018; Trofimova et al., 2021), a nested project of the PsyCoLaus/CoLaus study (Firmann et al., 2008; Preisig et al., 2009). The BrainLaus study received approval from the local Ethics Committee and all participants gave their written informed consent prior to participation. The distribution of the motion degradation index values and participants' ages are shown in Figure S1. The dependence of the index on the participants' age is also shown there.

2.2. MRI acquisition

The MRI protocol consisted of three multi‐echo 3D fast low angle shot (FLASH) acquisitions with magnetization transfer (MTw), proton density (PDw) and T1 (T1w)‐weighted contrasts, respectively. The repetition time and nominal flip angle were 24.5 ms/6°, 24.5 ms/6° and 24.5 ms/21°, respectively. The MTw contrast was achieved with a Gaussian‐shaped RF pulse prior to the excitation (4 ms duration, 220° nominal flip angle, 2 kHz frequency offset from water resonance). 6/8/8 echo images were acquired for the MTw/PDw/T1w contrasts with a minimal echo time of 2.34 ms and an inter‐echo spacing of 2.34 ms. The image resolution was 1 mm3 isotropic, the field of view was 256 × 240 × 176. Parallel imaging was used along the phase‐encoding direction (acceleration factor 2; GRAPPA reconstruction (Griswold et al., 2002)). Partial Fourier was used in the partition direction with acceleration factor 6/8. The acquisition time was 7 min per contrast.

2.3. qMRI map computation and pre‐processing

Quantitative MRI maps were computed from the raw MRI data using the hMRI toolbox (Tabelow et al., 2019) and bespoke analysis scripts written in MATLAB (The MathWorks Inc., Natick, MA, USA). Maps of the MRI parameter R2* were computed separately from the raw echo images with MTw, PDw and T1w contrast, from the regression of the log signal with the corresponding echo times (Weiskopf, Callaghan, Josephs, Lutti, & Mohammadi, 2014). The value of the MDI described in Castella et al. (2018) was computed for each R2* map, as provided by the hMRI toolbox (Tabelow et al., 2019). Maps of Magnetization Transfer estimates (MRI parameter MTsat), were computed as described in (Helms, Dathe, & Dechent, 2008; Helms, Dathe, Kallenberg, & Dechent, 2008; Lutti et al., 2012; Lutti, Hutton, Finsterbusch, Helms, & Weiskopf, 2010; Weiskopf et al., 2013), following averaging of the echo images for each contrast to increase the signal‐to‐noise ratio (Helms & Dechent, 2009). The MTsat maps were only used for spatial normalisation of the data into a common group space (see below) and were not used in subsequent analyses.

Data were analysed using Statistical Parametric Mapping (SPM12, Wellcome Centre for Human Neuroimaging, London). The MT maps were segmented into maps of grey and white matter probabilities using Unified Segmentation (Ashburner & Friston, 2005). The nonlinear diffeomorphic algorithm Dartel (Ashburner, 2007) was used for inter‐subject registration of the tissue classes. The tissue probability maps were normalised to the stereotactic space of the Montreal Neurological Institute (MNI) template using the resulting Dartel template and the deformation fields. As described in Draganski et al.(2011), the quantitative maps were normalised using the same deformation fields but without modulation by the Jacobian determinants. Instead, a combined probability weighting and Gaussian smoothing procedure was used with a 6 mm FWHM isotropic smoothing kernel to produce tissue‐specific parameter maps while preserving the quantitative estimates.

2.4. Computation of image‐specific weights for data analysis

QUIQI is integrated in the analysis of the MRI data. The first step in the estimation of the weights is specific to each application, and involves the data to be analysed and the MDI used. Here, we illustrate QUIQI on the group‐level analysis of quantitative maps of the MRI parameter R2*, computed by fitting a mono‐exponential decay model to the raw multi‐echo data (Figure 1a). For the current application, we use the MDI described in Castella et al. (2018), computed from these maps as the standard deviation of the R2* values in white matter (Figure 1b). Our choice is motivated by the empirical validation of this index, in the original study, against measures of head motion obtained from an external tracking device (Castella et al., 2018).

FIGURE 1.

FIGURE 1

QUIQI integrates correction of motion degradation into the analysis of MRI data. For the current application, analysis data are quantitative maps of the MRI parameter R2* (a). QUIQI requires a value of the MDI for each data set of the analysis. Here, we show the distribution of the MDI values across the images used for analysis (N = 1,432) (b). With QUIQI, basis functions are computed from powers of the MDI and inserted into REML (Friston et al., 2002) for the computation of the noise covariance matrix V (c). The set of powers of the MDI, α, is pre‐defined by the user. From V, weights are computed that are used in the general linear model for data analysis (d)

With QUIQI, the values of the MDI—specific to each participant—are inserted into the REML algorithm using basis functions, that is, matrices computed from the MDI. For effective use of REML, these basis functions must capture the increased image noise due to head motion and their computation from the MDI requires a suitable mathematical form. We infer this mathematical form from the dependence of image noise on the MDI values in a standard ordinary least squares (OLS) analysis. From the polynomial dependence shown in Figure 2b, the basis functions are specified as comprising of multiple powers of the MDI, placed along the diagonal.

FIGURE 2.

FIGURE 2

QUIQI restores homoscedasticity of the residual noise distribution in an analysis. Following model fitting, maps of the residuals are computed for each individual image (example shown in a). An estimate of image noise is computed from the variance across these residual maps and plotted against the MDI for OLS (b) and WLS analyses (c). Enforcing positivity for the hyper‐parameters (λ i > 0) leads to a dominant cubic power in the modelling of image noise by REML (d)

The following step in the calculation of the weights is independent of the application. REML estimates a set of hyperparameters λ that act as weights in the linear combination of the basis functions for the computation of the noise covariance matrix V. The matrix V captures the noise level in each image and is diagonal because noise is uncorrelated between participants (Figure 1c). REML estimates the λ hyperparameters that encode the matrix V by maximising the evidence lower bound (ELBO) objective function, or negative variational free energy. The idea underlying REML is that it estimates the variance–covariance of the residuals from fitting a linear model in a way that accounts for the uncertainty on the estimated model parameters (Harville, 1977). Model fitting involves jointly estimating the hyperparameters (λ) and voxel‐specific parameters (β n). For a design matrix X and image data Y (with N voxels included in the analysis) the ELBO is computed as (ignoring a constant term):

Eλ=N2lnVλN2lnXTVλX12n=1NynXβ^nTVλ1ynXβ^n.

This effectively leads to a weighted least‐square analysis (WLS) with weights W=V12 (Figure 1d). To compare noise models computed by REML from different sets of basis functions, we used the ELBO as a model selection criterion. The ELBO favours reducing residual errors while also penalising model complexity. The ELBO estimates were obtained from the implementation of REML within the SPM software (Friston et al., 2002).

2.5. Image analysis

Because the focus of this study was on the methodology involved in incorporating an MDI index into the analysis of MRI data, we restricted our analysis to R2* maps—and MDI values—computed from a single set of raw echo images per individual. We primarily focused on the analysis of the changes in R2* associated with healthy ageing, driven by changes in iron and myelin concentration in grey and white matter, respectively (Callaghan et al., 2014; Draganski et al., 2011; Yeatman, Wandell, & Mezer, 2014). Statistical analyses were carried out after estimating the parameters of a general linear model with SPM12. We included four regressors in the model, including age and the squared values of age (age 2 ), as well as gender and brain volume as variables of no interest. Analyses were conducted using the common approach of assuming identical noise levels in all quantitative maps (Ordinary Least Squares, OLS) as well as assuming different noise levels for each map, computed from the MDI values (Weighted Least Squares, WLS).

We computed measures of noise heteroscedasticity at the global level of a tissue type as well as at the individual voxel level. At the global level, our measure of heteroscedasticity was the coefficient of determination R2, that is, the fraction of the variance of image noise that follows a polynomial dependence on the MDI. To test for residual heteroscedasticity at the voxel level, we conducted Engle’s ARCH tests of the serial dependence of the residuals in each voxel of the MRI data, with a maximum lag of 40 data points (Engle, 1982). The fraction of voxels with significant heteroscedasticity was calculated after FDR‐correction using the Benjamini–Hochberg procedure (Glickman et al., 2014).

2.5.1. Age‐associated differences in R2*

To asses the effect of QUIQI on the detection of brain‐related differences in neuroscience, we conducted statistical F tests of the significance of age‐related differences in R2*. We conducted these analyses, both at the global level of a whole tissue type (grey and white matter) and at the local level of a grey matter region, to assess the performance of WLS analyses in correcting local effects of head motion. The regional analyses were conducted using explicit masks defined from the grey matter maximum probability tissue labels derived from the ‘MICCAI 2012 Grand Challenge and Workshop on Multi‐Atlas Labeling’ (https://masi.vuse.vanderbilt.edu/workshop2012/index.php/Challenge_Details), computed from MRI scans originating from the OASIS project (http://www.oasis-brains.org/) and labelled data provided by Neuromorphometrics, Inc. (http://neuromorphometrics.com/) under academic subscription. The regional masks included voxels from both hemispheres.

2.5.2. Specificity

We assessed the specificity of the OLS and WLS methods by monitoring the rate of false positives in two types of image analysis frequently conducted in neuroscience studies. I. In a subset of data (N = 123; up to 10 images per age bin of 5 years when available), the participants' age was randomly scrambled between the images before conducting the analysis of age‐associated differences in R2* described above. Any positive result would therefore be a false positive. II. In the subset of data with a narrow age range (56–58 y.o.; N = 129), we conducted two‐sample T tests for the comparison of two subgroups. Similarly, any positive results would therefore be a false positive. Analyses were repeated with N 1 = 2, 5, 10, 20 30 and 60 images in the first group. We repeated both types of specificity analyses 1,000 times, monitoring the rate of false positives across repetitions at the voxel and cluster levels (p < 0.05, FWE‐corrected). For cluster‐level inference, the cluster forming threshold was p < 0.001 uncorrected.

2.5.3. Modelling motion‐related variance in the analysis design

As a potential alternative to QUIQI, we modelled motion‐related variance in the data by inserting dedicated regressors in the design matrix (Carroll, 1982). On the model of the REML basis functions (see Section 3), these regressors contained the first to fourth powers of the MDI values. For a subset of the data within a narrow age range (56–58 y.o.; N = 129), we conducted statistical F tests of the variance of the R2* maps associated with these regressors.

3. RESULTS

3.1. The motion degradation index is a predictor of residual noise

The use of REML relies on the empirical observation that in a conventional analysis (Ordinary Least Squares, OLS), noise in MRI data can be accurately modelled from the MDI (Diedrichsen & Shadmehr, 2005). We computed an estimate of image noise as the spatial variance of the residual maps in each sample image (var[ε]). Figure 2a shows an example of such residual maps for grey and white matter. In an OLS analysis, the polynomial dependence of residual noise on the MDI (R 2 ~ 0.6–0.8) highlights heteroscedasticity in the data, which leads to misestimation of the precision of parameter estimates and undermines the validity of statistical tests (Figure 2b). Motivated by this dependence, QUIQI uses powers of the MDI as basis functions for estimating the noise covariance matrix. The resulting residual noise is independent of the MDI (R 2 ~ 0–0.2), restoring the homoscedasticity of the data (Figure 2c). The spread in residual noise level, estimated at the global level of a tissue type, is higher for grey matter than white matter due to partial volume effects at the tissue interface and inhomogeneities of the magnetic field. The REML hyper‐parameter estimates (λi), obtained with QUIQI using a positivity constraint, shows that for the current application, the basis function that includes the cubic power of the MDI (i.e., MDI3) conveys the dominant contribution to the noise covariance matrix, consistently for grey and white matter (Figure 2d). This result suggests that this basis function alone might be sufficient to model accurately the effect of head motion on image noise. We will further investigate this in a subsequent step by examination of the REML ELBO (Figure 3).

FIGURE 3.

FIGURE 3

Global and local analysis of the REML ELBO. At the global level of a whole tissue type (grey or white matter), the gain in ELBO compared to OLS analyses is maximal with a single basis function in the REML estimation that contains the cubic power of the MDI values (α = 3, a). In grey matter regional analyses, the global optimal model (MDI3) was also the local optimal within most regions, primarily located in frontal areas (b). In posterior regions, the local optimal model involved the square power of the MDI. With the global optimal model, the local gain in ELBO showed a gradient in the anteroposterior direction, with the highest gain in frontal areas (c). This is consistent with typical motion of study participants during MRI examinations

We conducted Engle’s ARCH tests of residual heteroscedasticity in each voxel of the MRI data. In grey matter, the null hypothesis of no ARCH effects could be rejected in 88% and 3% of voxels for OLS and WLS, respectively (p < 0.05, FDR‐corrected using the Benjamini–Hochberg procedure (Glickman et al., 2014); see Figure S2). For the WLS analysis, these voxels were mainly located in sub‐cortical regions, regions affected by magnetic field inhomogeneities, or at the interface of brain tissue with its surrounding. In white matter, the null hypothesis could be rejected in 92% and 1% of voxels for OLS and WLS, respectively (p < 0.05, FDR‐corrected using the Benjamini–Hochberg procedure). Incidentally, WLS also reduced the number of voxels where the hypothesis of standard normally distributed noise could be rejected (Kolmogorov–Smirnov tests).

3.2. Optimal modelling of noise from the MDI

The evidence lower bound (ELBO), or negative variational free energy, is a model selection criterion that favours reducing residual errors while also penalising model complexity. The optimal noise model computed from the MDI maximised the ELBO provided by REML. At the global level of a given tissue type, that is, grey or white matter, a gain in ELBO of up to two orders of magnitude was obtained with WLS compared to OLS analyses (Figure 3a). Consistent with Figure 2d, the maximum gain was obtained using the MDI cubed (i.e., MDI3) as a basis function in the REML estimation (global optimal model). Including additional powers of the MDI did not increase the ELBO.

QUIQI is primarily intended for the analysis of entire images. However, for the purpose of assessing QUIQI’s ability to correct for local degradation of image quality due to head motion, we repeated the analysis separately for each region of a grey matter atlas. The global optimal model (i.e., MDI3) led to the highest gain in ELBO in 68% of regions. In the remaining regions, the ELBO from the global optimal model was smaller than its local counterpart by an average of 5.2. Although substantial in terms of model evidence, these differences are small compared to the gain over OLS analyses. Regions where the global and local optimal models were identical were located primarily in frontal areas, while posterior areas tended to exhibit locally optimal models with a lower power of the MDI (Figure 3b). This anterior–posterior gradient is also apparent in the increase in ELBO compared to OLS analyses: the highest gains are observed in frontal regions (Figure 3c).

The global optimal model for the current application, with only MDI3 as a basis function in the REML estimation, was used in all subsequent analyses conducted with QUIQI.

3.3. QUIQI increases analysis sensitivity to brain differences

Figure 4a shows a map of statistical F‐scores of the dependence of the MRI data on age, obtained using QUIQI with the noise model optimal for the current application. As previously reported (Callaghan et al., 2014; Draganski et al., 2011), the most prominent age‐related differences in R2* were located in sub‐cortical grey matter due to a local increase in iron concentration with age and in frontal white matter due to a peak in axonal myelination around midlife (Slater et al., 2019).

FIGURE 4.

FIGURE 4

QUIQI increases the sensitivity of MRI data analysis. The effect of QUIQI on the analysis of brain‐related differences was assessed using statistical F tests of the dependence of the MRI data with age (a). Compared with OLS analyses, QUIQI leads to region‐specific decreases or increases in F‐scores due to the restored noise homoscedasticity (b). Exclusion of the 30% most degraded images was required to restore noise homoscedasticity in OLS analyses. WLS analyses yield higher age‐sensitivity over the whole brain (c)

Compared with OLS analyses, QUIQI leads to region‐specific decreases or increases in F‐scores (Figure 4b) but because QUIQI restores noise homoscedasticity, WLS analyses are more sensitive to true age effects. We compared the age sensitivity of WLS analyses with that of OLS analyses after exclusion of the fraction of the images with the highest MDI values (i.e., the most degraded images, see Figure S3a). From the multiple fractions considered (3, 7, 13, 20 and 30% (Esteban et al., 2017; Mortamet et al., 2009; Pizarro et al., 2016; Rosen et al., 2018)), we selected the one that led to similar noise homoscedasticity to WLS analyses. In grey matter, this was achieved after removing 30% of the images (R 2 = 0.11, see Figure S3b). With this fraction of excluded data, heteroscedasticity was still present in white matter (R 2 = 0.43) but higher values were deemed too prohibitive to be considered. WLS analyses led to higher age‐sensitivity than exclusion OLS analyses in both tissue types (Figure 4c).

3.4. QUIQI preserves the specificity of statistical analyses

Line 1 of Table 1 shows the rate of false positives, obtained from statistical F tests of age‐related differences after scrambling of the age regressor (analysis I). The rate of false positives was within the expected range at the voxel and cluster levels, for both OLS and WLS analyses (p < 0.05, FWE‐corrected). Line 2 of Table 1 shows the rate of false positives in group comparisons (analysis II). The false positive rate increased similarly for OLS and WLS analyses in unbalanced group comparisons (Salmond et al., 2002). For white matter, a false positive rate within 0.05 was found for cluster‐level inference with N 1 ≈ 5 or more images in the first group. More false positives were observed for grey matter. The false positives were primarily located in cortical regions affected by magnetic field inhomogeneities (e.g., orbitofrontal cortex and temporal lobes, see Figure S4).

TABLE 1.

Specificity of MRI analyses: false positive rates in one‐sample and two‐sample analyses

GM WM
One sample t test OLS 0.02/0.022 0.029/0.023
WLS 0.027/0.022 0.026/0.013
Two sample t tests N 1 = 2 N 1 = 5 N 1 = 10 N 1 = 30 N 1 = 60 N 1 = 2 N 1 = 5 N 1 = 10 N 1 = 30 N 1 = 60
OLS 0.860/0.245 0.509/0.104 0.203/0.055 0.074/0.049 0.051/0.042 0.618/0.182 0.254/0.066 0.132/0.043 0.076/0.021 0.072/0.032
WLS 0.895/0.268 0.601/0.112 0.332/0.061 0.102/0.047 0.075/0.046 0.592/0.167 0.256/0.05 0.117/0.03 0.068/0.024 0.071/0.033

Note: The false positive rates are reported at the voxel/cluster‐levels. For one‐sample t tests, the false positive rate was below 0.05, the p value used for statistical significance, for both OLS and WLS analyses. For two‐sample t tests the false positive rate, reported for different numbers of subjects N 1 in the first group, increased similarly for OLS and WLS analyses in unbalanced group comparisons (Salmond et al., 2002). The higher number of false positives in grey matter arises from cortical regions affected by magnetic field inhomogeneities leading to bias on the R2* estimates (see Figure S4).

3.5. Modelling motion‐related variance from covariates in the analysis design does not restore homoscedasticity

Figure 5 shows the results of inserting a set of regressors computed from the MDI in the image analysis. For OLS analyses, statistically significant results were found in 3.2% and 5.5% of voxels in grey and white matter, respectively (p < 0.05, FWE‐corrected). For grey matter, these voxels were primarily located in frontal regions (see Figure 5a). For WLS analyses, statistically significant results were only found in 0.1% of voxels in grey and white matter (p < 0.05, FWE‐corrected). Residual analysis showed that noise heteroscedasticity remains in the data for both tissue types with OLS analyses, despite the motion regressors. Noise heteroscedasticity is not present in WLS analyses (Figure 5b).

FIGURE 5.

FIGURE 5

Using motion regressors as design covariates does not restore heteroscedasticity. In OLS analyses, statistical F tests of the variance of the R2* maps associated with powers of the MDI show statistically significant results in 3.2% and 5.5% of voxels in grey and white matter, respectively (p < 0.05, FWE‐corrected) (a). In WLS analyses, significant results are found in 0.1% of voxels only (p < 0.05, FWE‐corrected). Noise heteroscedasticity remains present for both tissue types in OLS analyses, despite the motion regressors, but is not present in WLS analyses (b)

4. DISCUSSION

In this study, we introduce a method that accounts for the degradation of image quality due to motion in the analysis of MRI data. This method addresses an important limitation of existing approaches for quality control, which only enable the removal of the most degraded images from analysis. From an index of image quality, the proposed method computes weights that capture the noise level in each image, leading to increased sensitivity to brain change in an analysis. This method is based on restricted maximum likelihood, available in most image analysis software suites. The implementation used here, within the SPM software (Friston, Stephan, Lund, Morcom, & Kiebel, 2005), is commonly used to account for differential noise levels in, for example, group‐comparison studies (‘non‐sphericity’). Here, we extend this methodology to the statistical analysis of structural MRI data, using an index of image quality to estimate the noise level in each individual image. We validated this method using a large cohort of 1,432 subjects, which allowed the design of multiple analyses to test different aspects of the method. The analysis code used in this study is publicly available (Lutti, 2021).

In conventional analysis methods, the increase of the noise level due to motion leads to a violation of the homoscedasticity assumption of statistical tests. By estimating the noise level in each image from its MDI value, QUIQI restores the validity of this assumption, both at the global level of a tissue type (i.e., grey or white matter) and in individual voxels of the MRI data. Voxels where significant heteroscedasticity remain with QUIQI are primarily located in regions where other non‐motion mechanisms may be at play: sub‐cortical areas (delineation errors), interface of brain tissue (partial volume effects) and regions of inhomogeneous magnetic field (bias of the R2* estimates). We tested an alternative method to QUIQI, based on the insertion of the MDI as a covariate in the design matrix of the analyses. However, we show that this alternative method does not successfully correct for noise heteroscedasticity.

To illustrate the effect of QUIQI on the analysis of MRI data, we conducted analyses of age‐associated brain differences. OLS analyses led to spurious statistical results due to noise heteroscedasticity in the data. Restoring noise homoscedasticity in OLS analyses required the removal of the 30% of the images most affected by motion in grey matter (‘exclusion analyses’). This ratio was even higher for white matter. The sensitivity of WLS analyses to age‐related differences was superior to that of exclusion analyses over the whole brain. The higher sensitivity of WLS analyses did not inflate the rate of false positives (Table 1).

QUIQI corrects local effects of head motion optimally—or near optimally—in whole‐brain analyses, the archetypal use of MR images for neuroscience. The increase in ELBO with QUIQI was largest in frontal brain regions and the local optimal noise model involved a higher power of the MDI there. This is consistent with the supine position of the study participants during MRI examination, with the back of the head resting on the scanner table, and provides further evidence of the ability of QUIQI to correct the effects of motion on MRI data quality.

We implemented QUIQI by enforcing positivity of the hyper‐parameter values estimated by REML (spm_reml_sc). This is by no means a requirement because allowing negative hyper‐parameter values works equally well (Figure S5). Our choice was primarily guided by the fact that enforcing positivity allowed us to identify a single basis function (MDI3) as sufficient to effectively model noise in the data (Figure 2d). With a single basis function, local analysis results (Figure 3b and c) reflect the ability of the method to correct for local effects of motion in a global analysis of a whole tissue type (e.g., grey or white matter). This would not be the case with several basis functions because the hyper‐parameter values, which combine the basis functions in the estimation of the noise covariance matrix, would differ in a local and global analysis. Here, we provide a version of spm_est_non_sphericity that calls spm_reml_sc to enforce positive hyperparameter estimates (Lutti, 2021).

The proposed method is amenable to most types of MRI data and motion degradation indices. However, we expect that the optimal noise model might differ depending on the type of MRI data to be analysed. This study outlines a process for the identification of the optimal noise model based on the estimation of noise heteroscedasticity (Figure 2) and on the maximisation of the REML ELBO (Figure 3). To facilitate uptake by interested users, we provide a customised version of the hMRI toolbox (Tabelow et al., 2019) with a dedicated QUIQI module available from the GUI (Lutti, Di Domenicantonio, Corbin, Phillips, & Callaghan, 2021). The powers of the MDI used to define the REML basis functions can be set freely from the user interface. QUIQI allows the plotting and polynomial fitting of image noise versus the MDI (‘QUIQI check’). On the model of this study, this allows users to assess heteroscedasticity levels in their analysis data, with or without QUIQI, and helps identify the optimal set of basis functions for the REML estimation. We emphasise that this feature runs independently from the analysis and that the polynomial fits are not used subsequently. Instead, the use of REML for the computation of the weights provides estimates of ELBO as measures of noise model efficiency (the analysis code attached to this manuscript includes a customised version of the ‘spm_est_non_sphericity’ SPM function that provides the estimate of the REML ELBO computed by SPM). Similarly, we do not recommend the computation of the weights directly from the global noise estimates as factors independent of motion degradation such as analysis model or data type (e.g., field inhomogeneities here) may then have an impact on the weight estimates.

Here, we emphasised the principles of the method and provided a detailed assessment of its performance. For illustration, we therefore chose the analysis of brain phenotype data that can be computed from one raw image type, that is, quantitative maps of the MRI parameter R2* (Lutti, Di Domenicantonio, Kherif, & Draganski, 2021). The extension of this method to the analysis of quantitative maps computed across multiple types of raw images, each with their own degree of motion degradation, is currently ongoing. Another important field of application is the analysis of differences in brain morphology (e.g., grey matter volume or cortical thickness), the most widespread phenotypical measures extracted from MR images. Such applications will highlight the effect of image processing (segmentation) on the sensitivity of analysis to motion. While QUIQI can be readily used with different MDIs, we highlight the importance of the specificity and sensitivity of the index to motion degradation (Castella et al., 2018), which drive the efficacy of the method. Potential sources of confound on image‐based MDIs (e.g., brain disease) should be closely investigated (Supporting Information, Appendix S1).

5. CONCLUSION

We introduce a method that accounts for the degradation of image quality due to motion in the analysis of MRI data. From an index of image quality, this method computes weights specific to each individual structural image of a statistical analysis. We show that in conventional analysis methods, the increased noise level due to motion leads to a violation of the homoscedasticity assumption of statistical tests. By estimating the noise level in each image from its MDI value, the proposed method restores the validity of this assumption. This method was compared with existing approaches for quality control based on the removal of the most degraded images from analysis. We show that the proposed method improves the compromise between image quality and sample size, leading to increased sensitivity to brain change. The improvements in homoscedasticity are also superior to those obtained by modelling motion degradation as a confounding effect in the design matrix of the analysis.

The proposed method is based on restricted maximum likelihood, available in most image analysis software suites, and can be readily used with different indices of motion degradation. The specificity and sensitivity of the index to the degradation of MRI data to motion is paramount to ensure optimal performance of the method.

CONFLICT OF INTERESTS

The authors declare no competing interests.

AUTHOR CONTRIBUTIONS

A.L. designed the experiment. A.L., G.D.D, F.K. and B.D. collected the data. A.L., N.C. and G.D.D. developed techniques and analysed the data. A.L. wrote the paper. A.L., N.C., J.A., G.Z., B.D., C.P., F.K. and M.F.C. designed the analysis and interpreted the results. A.L., N.C., J.A., G.Z., B.D., C.P., F.K., M.F.C. and G.D.D discussed and edited the manuscript.

ETHICS APPROVAL STATEMENT

This study received approval from the local Ethics Committee and all participants gave their written informed consent prior to participation.

Supporting information

APPENDIX S1. Supporting Information

Figure S1 Distribution of the MDI values (a) and participants' age (b) across the datasets used for analysis (N = 1,432), and dependence of the MDI on participant’s age (c)

Figure S2 QUIQI reduces residual heteroscedasticity in each voxel of the MRI data. In grey matter, significant heteroscedasticity was found in 88% and 3% of voxels for OLS and WLS respectively (p < 0.05, FDR‐corrected using the Benjamini–Hochberg procedure). In white matter, significant heteroscedasticity was found in 92% and 1% of voxels for OLS and WLS respectively (p < 0.05, FDR‐corrected using the Benjamini–Hochberg procedure)

Figure S3 With OLS analyses, restoring homoscedasticity requires removal of 30% of the subjects or more. We assessed noise homoscedasticity after exclusion of up to 30% of the most degraded images from OLS analyses (a). We estimated noise homoscedasticity by fitting the dependence of residual noise on the MDI with a polynomial function of order 3. In grey matter, restoring noise homoscedasticity requires the removal of 30% of the images (goodness of fit: R 2 = 0.11; (b)). In white matter, noise homoscedasticity comparable to that of WLS analyses was not achieved

Figure S4 Regions affected by magnetic field inhomogeneities drive the occurrence of false positives in unbalanced group comparisons. Spatial distribution of the voxels showing significant differences at least twice out of 1,000 repetitions in the analysis of specificity in group comparisons. The number of subjects in the first group was N 1 = 5. Similarly for OLS (a) and WLS (b) analyses, these voxels are primarily located in regions affected by magnetic field inhomogeneities (e.g., temporal lobes, orbitofrontal cortex)

Figure S5 Allowing for negative REML hyper‐parameter values leads to a similar performance of QUIQI. Allowing for negative hyper‐parameter values (λ > 0 or λ < 0), noise homoscedasticity is restored as when positive hyper‐parameters is enforced (λ > 0) (a). Note that consistently with the REML estimation, the polynomial fitting of the dependence of image noise on the MDI in a) also allowed for negative coefficients. The gain in ELBO compared to OLS analyses is also in a similar range (b)

ACKNOWLEDGMENTS

This work was supported by the Swiss National Science Foundation (grant no 320030_184784 (AL)) and the Fondation ROGER DE SPOELBERCH. The Wellcome Centre for Human Neuroimaging is supported by core funding from the Wellcome [203147/Z/16/Z]. CP is Senior Research Associate at the FRS‐F.N.R.S., Belgium. BD is supported by the Swiss National Science Foundation (NCCR Synapsy, project grant numbers 32003B_135679, 32003B_159780, 324730_192755 and CRSK‐3_190185) and the Fondation Leenaards. The MRI data were acquired on the MRI platform of the Clinical Neuroscience Department, Lausanne University Hospital.

Lutti, A. , Corbin, N. , Ashburner, J. , Ziegler, G. , Draganski, B. , Phillips, C. , Kherif, F. , Callaghan, M. F. , & Di Domenicantonio, G. (2022). Restoring statistical validity in group analyses of motion‐corrupted MRI data. Human Brain Mapping, 43(6), 1973–1983. 10.1002/hbm.25767

Funding information Fondation ROGER DE SPOELBERCH; Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung, Grant/Award Numbers: 320030_184784, 32003B_135679; 32003B_159780; 324730_192755; CRSK‐; Wellcome Trust, Grant/Award Number: 203147/Z/16/Z

DATA AVAILABILITY STATEMENT

Material related to this paper is available here:

https://zenodo.org/record/4889895#.YdgV6CDMJPY

This material also includes the results of the analyses presented in this article, conducted on the available data.

REFERENCES

  1. Alexander‐Bloch, A. , Clasen, L. , Stockman, M. , Ronan, L. , Lalonde, F. , Giedd, J. , & Raznahan, A. (2016). Subtle in‐scanner motion biases automated measurement of brain anatomy from in vivo MRI. Human Brain Mapping, 37, 2385–2397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alfaro‐Almagro, F. , Jenkinson, M. , Bangerter, N. K. , Andersson, J. L. R. , Griffanti, L. , Douaud, G. , … Smith, S. M. (2018). Image processing and quality control for the first 10,000 brain imaging datasets from UKbiobank. NeuroImage, 166, 400–424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Andre, J. B. , Bresnahan, B. W. , Mossa‐Basha, M. , Hoff, M. N. , Smith, C. P. , Anzai, Y. , & Cohen, W. A. (2015). Toward quantifying the prevalence, severity, and cost associated with patient motion during clinical MR examinations. Journal of the American College of Radiology, 12, 689–695. [DOI] [PubMed] [Google Scholar]
  4. Ashburner, J. (2007). A fast diffeomorphic image registration algorithm. NeuroImage, 38, 95–113. [DOI] [PubMed] [Google Scholar]
  5. Ashburner, J. , & Friston, K. J. (2005). Unified segmentation. NeuroImage, 26, 839–851. [DOI] [PubMed] [Google Scholar]
  6. Barbosa, J. H. O. , Santos, A. C. , Tumas, V. , Liu, M. , Zheng, W. , Haacke, E. M. , & Salmon, C. E. G. (2015). Quantifying brain iron deposition in patients with Parkinson's disease using quantitative susceptibility mapping, R2 and R2*. Magnetic Resonance Imaging, 33(5), 559–565. 10.1016/j.mri.2015.02.021 [DOI] [PubMed] [Google Scholar]
  7. Breteler, M. M. B. , Stöcker, T. , Pracht, E. , Brenner, D. , & Stirnberg, R. (2014). MRI in the RHINELAND study: A novel protocol for population neuroimaging. Alzheimer’s Dement, 10, P92. [Google Scholar]
  8. Callaghan, M. F. , Freund, P. , Draganski, B. , Anderson, E. , Cappelletti, M. , Chowdhury, R. , … Weiskopf, N. (2014). Widespread age‐related differences in the human brain microstructure revealed by quantitative magnetic resonance imaging. Neurobiology of Aging, 35, 1862–1872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Callicott, J. , Ramsey, N. F. , Tallent, K. , Bertolino, A. , Knable, M. B. , Coppola, R. , … Weinberger, D. R. (1998). Functional magnetic resonance imaging brain mapping in psychiatry: Methodological issues illustrated in a study of working memory in schizophrenia. Neuropsychopharmacology, 18, 186–196. [DOI] [PubMed] [Google Scholar]
  10. Carroll, R. J. (1982). Adapting for Heteroscedasticity in Linear Models. The Annals of Statistics, 10(4), 1224–1233. 10.1214/aos/1176345987 [DOI] [Google Scholar]
  11. Castella, R. , Arn, L. , Dupuis, E. , Callaghan, M. F. , Draganski, B. , & Lutti, A. (2018). Controlling motion artefact levels in MR images by suspending data acquisition during periods of head motion. Magnetic Resonance in Medicine, 80, 2415–2426. 10.1002/mrm.27214 [DOI] [PubMed] [Google Scholar]
  12. Diedrichsen, J. , & Shadmehr, R. (2005). Detecting and adjusting for artifacts in fMRI time series data. NeuroImage, 27, 624–634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Draganski, B. , Ashburner, J. , Hutton, C. , Kherif, F. , Frackowiak, R. S. J. , Helms, G. , & Weiskopf, N. (2011). Regional specificity of MRI contrast parameter changes in normal ageing revealed by voxel‐based quantification (VBQ). NeuroImage, 55, 1423–1434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica, 50, 987. [Google Scholar]
  15. Epstein, J. N. , Casey, B. J. , Tonev, S. T. , Davidson, M. , Reiss, A. L. , Garrett, A. , … Spicer, J. (2007). Assessment and prevention of head motion during imaging of patients with attention deficit hyperactivity disorder. Psychiatry Research: Neuroimaging, 155, 75–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Esteban, O. , Birman, D. , Schaer, M. , Koyejo, O. O. , Poldrack, R. A. , & Gorgolewski, K. J. (2017). MRIQC: Advancing the automatic prediction of image quality in MRI from unseen sites. PLoS One, 12, e0184661. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Firmann, M. , Mayor, V. , Vidal, P. M. , Bochud, M. , Pécoud, A. , Hayoz, D. , … Vollenweider, P. (2008). The CoLaus study: A population‐based study to investigate the epidemiology and genetic determinants of cardiovascular risk factors and metabolic syndrome. BMC Cardiovascular Disorders, 8, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Friston, K. J. , Penny, W. , Phillips, C. , Kiebel, S. , Hinton, G. , & Ashburner, J. (2002). Classical and Bayesian inference in neuroimaging: Theory. NeuroImage, 16, 465–483. [DOI] [PubMed] [Google Scholar]
  19. Friston, K. J. , Stephan, K. E. , Lund, T. E. , Morcom, A. , & Kiebel, S. (2005). Mixed‐effects and fMRI studies. NeuroImage, 24(1), 244–252. 10.1016/j.neuroimage.2004.08.055 [DOI] [PubMed] [Google Scholar]
  20. Fukunaga, M. , Li, T. Q. , van Gelderen, P. , de Zwart, J. A. , Shmueli, K. , Yao, B. , … Duyn, J. H. (2010). Layer‐specific variation of iron content in cerebral cortex as a source of MRI contrast. Proceedings of the National Academy of Sciences of the United States of America, 107, 3834–3839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. German National Cohort (GNC) Consortium . (2014). The German National Cohort: Aims, study design and organization. European Journal of Epidemiology, 29, 371–382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Glasser, M. F. , Smith, S. M. , Marcus, D. S. , Andersson, J. L. R. , Auerbach, E. J. , Behrens, T. E. J. , … van Essen, D. C. (2016). The human connectome project’s neuroimaging approach. Nature Neuroscience, 19, 1175–1187. 10.1038/nn.4361 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Glickman, M. E. , Rao, S. R. , & Schultz, M. R. (2014). False discovery rate control is a recommended alternative to Bonferroni‐type adjustments in health studies. Journal of Clinical Epidemiology, 67(8), 850–857. 10.1016/j.jclinepi.2014.03.012 [DOI] [PubMed] [Google Scholar]
  24. Griswold, M. A. , Jakob, P. M. , Heidemann, R. M. , Nittka, M. , Jellus, V. , Wang, J. , … Haase, A. (2002). Generalized autocalibrating partially parallel acquisitions (GRAPPA). Magnetic Resonance in Medicine, 47, 1202–1210. [DOI] [PubMed] [Google Scholar]
  25. Harville, D. A. (1977). Maximum likelihood approaches to variance component estimation and to related problems. Journal of the American Statistical Association, 72, 320–338. 10.2307/2286796 [DOI] [Google Scholar]
  26. Havsteen, I. , Ohlhues, A. , Madsen, K. H. , Nybing, J. D. , Christensen, H. , & Christensen, A. (2017). Are movement artifacts in magnetic resonance imaging a real problem?—A narrative review. Frontiers in Neurology, 8, 232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hayes, A. F. , & Cai, L. I. (2007). Using heteroskedasticity‐consistent standard error estimators in OLS regression: An introduction and software implementation. Behavior Research Methods, 39, 709–722. [DOI] [PubMed] [Google Scholar]
  28. Helms, G. , Dathe, H. , & Dechent, P. (2008). Quantitative FLASH MRI at 3T using a rational approximation of the Ernst equation. Magnetic Resonance in Medicine, 59, 667–672. [DOI] [PubMed] [Google Scholar]
  29. Helms, G. , Dathe, H. , Kallenberg, K. , & Dechent, P. (2008). High‐resolution maps of magnetization transfer with inherent correction for RF inhomogeneity and T1 relaxation obtained from 3D FLASH MRI. Magnetic Resonance in Medicine, 60, 1396–1407. [DOI] [PubMed] [Google Scholar]
  30. Helms, G. , & Dechent, P. (2009). Increased SNR and reduced distortions by averaging multiple gradient echo signals in 3D FLASH imaging of the human brain at 3T. Journal of Magnetic Resonance Imaging, 29, 198–204. [DOI] [PubMed] [Google Scholar]
  31. Khalil, M. , Langkammer, C. , Ropele, S. , Petrovic, K. , Wallner‐Blazek, M. , Loitfelder, M. , … Fazekas, F. (2011). Determinants of brain iron in multiple sclerosis: A quantitative 3T MRI study. Neurology, 77, 1691–1697. [DOI] [PubMed] [Google Scholar]
  32. Langkammer, C. , Krebs, N. , Goessler, W. , Scheurer, E. , Ebner, F. , Yen, K. , … Ropele, S. (2010). Quantitative MR imaging of brain iron: A postmortem validation study. Radiology, 257, 455–462. [DOI] [PubMed] [Google Scholar]
  33. Lorio, S. , Kherif, F. , Ruef, A. , Melie‐Garcia, L. , Frackowiak, R. , Ashburner, J. , … Draganski, B. (2016). Neurobiological origin of spurious brain morphological changes: A quantitative MRI study. Human Brain Mapping, 37, 1801–1815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lutti, A. , Di Domenicantonio, G. , Corbin, N. , Phillips, C. & Callaghan, M hMRI‐toolbox with original QUIQI implementation (2021). 10.5281/zenodo.4737489 [DOI]
  35. Lutti, A. , Di Domenicantonio, G. , Kherif, F. & Draganski, B. QUIQI public dataset: Inserting an index of motion degradation into the analysis of MRI data (2021). 10.5281/zenodo.4647080 [DOI]
  36. Lutti, A. , Dick, F. , Sereno, M. I. , & Weiskopf, N. (2014). Using high‐resolution quantitative mapping of R1 as an index of cortical myelination. NeuroImage, 93, 176–188. [DOI] [PubMed] [Google Scholar]
  37. Lutti, A. , Hutton, C. , Finsterbusch, J. , Helms, G. , & Weiskopf, N. (2010). Optimization and validation of methods for mapping of the radiofrequency transmit field at 3T. Magnetic Resonance in Medicine, 64, 229–238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lutti, A. , Stadler, J. , Josephs, O. , Windischberger, C. , Speck, O. , Bernarding, J. , … Weiskopf, N. (2012). Robust and fast whole brain mapping of the RF transmit field B1 at 7T. PLoS One, 7, e32379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Lutti, A. QUIQI analysis script: Inserting an index of motion degradation into the analysis of MRI data (2021). 10.5281/ZENODO.4890217 [DOI]
  40. Makowski, C. , Lepage, M. , & Evans, A. C. (2019). Head motion: The dirty little secret of neuroimaging in psychiatry. Journal of Psychiatry and Neuroscience, 44, 62–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Melie‐Garcia, L. , Slater, D. , Ruef, A. , Sanabria‐Diaz, G. , Preisig, M. , Kherif, F. , … Lutti, A. (2018). Networks of myelin covariance. Human Brain Mapping, 39, 1532–1554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Miller, K. L. , Alfaro‐Almagro, F. , Bangerter, N. K. , Thomas, D. L. , Yacoub, E. , Xu, J. , … Smith, S. M. (2016). Multimodal population brain imaging in the UKBiobank prospective epidemiological study. Nature Neuroscience, 19, 1523–1536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Mortamet, B. , Bernstein, M. A. , Jack Jr C. R., Gunter, J. L. , Ward, C. , Britson, P. J. , … Alzheimer's Disease Neuroimaging Initiative . (2009). Automatic quality assessment in structural brain magnetic resonance imaging. Magnetic Resonance in Medicine, 62, 365–372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Pizarro R. A., Cheng X., Barnett A., Lemaitre H., Verchinski B. A., Goldman A. L., & Mattay, V. S. (2016). Automated Quality Assessment of Structural Magnetic Resonance Brain Images Based on a Supervised Machine Learning Algorithm. Frontiers in Neuroinformatics, 10, 52. 10.3389/fninf.2016.00052 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Power, J. D. , Barnes, K. A. , Snyder, A. Z. , Schlaggar, B. L. , & Petersen, S. E. (2012). Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. NeuroImage, 59, 2142–2154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Preisig, M. , Waeber, G. , Vollenweider, P. , Bovet, P. , Rothen, S. , Vandeleur, C. , … Muglia, P. (2009). The PsyCoLaus study: Methodology and characteristics of the sample of a population‐based survey on psychiatric disorders and their association with genetic and cardiovascular risk factors. BMC Psychiatry, 9, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Reuter, M. , Tisdall, M. D. , Qureshi, A. , Buckner, R. L. , van der Kouwe, A. J. W. , & Fischl, B. (2015). Head motion during MRI acquisition reduces gray matter volume and thickness estimates. NeuroImage, 107, 107–115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Ropele, S. , de Graaf, W. , Khalil, M. , Wattjes, M. P. , Langkammer, C. , Rocca, M. A. , … Fazekas, F. (2011). MRI assessment of iron deposition in multiple sclerosis. Journal of Magnetic Resonance Imaging, 34, 13–21. [DOI] [PubMed] [Google Scholar]
  49. Rosen, A. F. G. , Roalf, D. R. , Ruparel, K. , Blake, J. , Seelaus, K. , Villa, L. P. , … Satterthwaite, T. D. (2018). Quantitative assessment of structural image quality. NeuroImage, 169, 407–418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Salmond, C. H. , Ashburner, J. , Vargha‐Khadem, F. , Connelly, A. , Gadian, D. G. , & Friston, K. J. (2002). Distributional Assumptions in Voxel‐Based Morphometry. NeuroImage, 17(2), 1027–1030. 10.1006/nimg.2002.1153 [DOI] [PubMed] [Google Scholar]
  51. Satterthwaite, T. D. , Elliott, M. A. , Ruparel, K. , Loughead, J. , Prabhakaran, K. , Calkins, M. E. , … Gur, R. E. (2014). Neuroimaging of the Philadelphia neurodevelopmental cohort. NeuroImage, 86, 544–553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Satterthwaite, T. D. , Wolf, D. H. , Loughead, J. , Ruparel, K. , Elliott, M. A. , Hakonarson, H. , … Gur, R. E. (2012). Impact of in‐scanner head motion on multiple measures of functional connectivity: Relevance for studies of neurodevelopment in youth. NeuroImage, 60, 623–632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Savalia, N. K. , Agres, P. F. , Chan, M. Y. , Feczko, E. J. , Kennedy, K. M. , & Wig, G. S. (2017). Motion‐related artifacts in structural brain images revealed with independent estimates of in‐scanner head motion. Human Brain Mapping, 38, 472–492. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Slater, D. A. , Melie‐Garcia, L. , Preisig, M. , Kherif, F. , Lutti, A. , & Draganski, B. (2019). Evolution of white matter tract microstructure across the life span. Human Brain Mapping, 40, 2252–2268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Tabelow, K. , Balteau, E. , Ashburner, J. , Callaghan, M. F. , Draganski, B. , Helms, G. , … Mohammadi, S. (2019). hMRI—A toolbox for quantitative MRI in neuroscience and clinical research. NeuroImage, 194, 191–210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Tisdall, M. D. , Reuter, M. , Qureshi, A. , Buckner, R. L. , Fischl, B. , & van der Kouwe, A. J. W. (2016). Prospective motion correction with volumetric navigators (vNavs) reduces the bias and variance in brain morphometry induced by subject motion. NeuroImage, 127, 11–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Trofimova, O. , Loued‐Khenissi, L. , DiDomenicantonio, G. , Lutti, A. , Kliegel, M. , Stringhini, S. , … Draganski, B. (2021). Brain tissue properties link cardio‐vascular risk factors, mood and cognitive performance in the CoLaus|PsyCoLaus epidemiological cohort. Neurobiology of Aging, 102, 50–63. [DOI] [PubMed] [Google Scholar]
  58. Van Dijk, K. R. A. , Sabuncu, M. R. , & Buckner, R. L. (2012). The influence of head motion on intrinsic functional connectivity MRI. NeuroImage, 59, 431–438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Weiskopf, N. , Callaghan, M. F. , Josephs, O. , Lutti, A. , & Mohammadi, S. (2014). Estimating the apparent transverse relaxation time (R2*) from images with different contrasts (ESTATICS) reduces motion artifacts. Frontiers in Neuroscience, 8(278), 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Weiskopf, N. , Suckling, J. , Williams, G. , Correia, M. M. , Inkster, B. , Tait, R. , … Lutti, A. (2013). Quantitative multi‐parameter mapping of R1, PD*, MT and R2* at 3T: A multi‐center validation. Front Neurosci Press. Available from http//www.frontiersin.org/Brain_Imaging_Methods/10.3389/fnins.2013.00095/abstract, 7(95), 1–11. 10.3389/fnins.2013.00095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Yeatman, J. D. , Wandell, B. A. , & Mezer, A. A. (2014). Lifespan maturation and degeneration of human brain white matter. Nature Communications, 5, 4932. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

APPENDIX S1. Supporting Information

Figure S1 Distribution of the MDI values (a) and participants' age (b) across the datasets used for analysis (N = 1,432), and dependence of the MDI on participant’s age (c)

Figure S2 QUIQI reduces residual heteroscedasticity in each voxel of the MRI data. In grey matter, significant heteroscedasticity was found in 88% and 3% of voxels for OLS and WLS respectively (p < 0.05, FDR‐corrected using the Benjamini–Hochberg procedure). In white matter, significant heteroscedasticity was found in 92% and 1% of voxels for OLS and WLS respectively (p < 0.05, FDR‐corrected using the Benjamini–Hochberg procedure)

Figure S3 With OLS analyses, restoring homoscedasticity requires removal of 30% of the subjects or more. We assessed noise homoscedasticity after exclusion of up to 30% of the most degraded images from OLS analyses (a). We estimated noise homoscedasticity by fitting the dependence of residual noise on the MDI with a polynomial function of order 3. In grey matter, restoring noise homoscedasticity requires the removal of 30% of the images (goodness of fit: R 2 = 0.11; (b)). In white matter, noise homoscedasticity comparable to that of WLS analyses was not achieved

Figure S4 Regions affected by magnetic field inhomogeneities drive the occurrence of false positives in unbalanced group comparisons. Spatial distribution of the voxels showing significant differences at least twice out of 1,000 repetitions in the analysis of specificity in group comparisons. The number of subjects in the first group was N 1 = 5. Similarly for OLS (a) and WLS (b) analyses, these voxels are primarily located in regions affected by magnetic field inhomogeneities (e.g., temporal lobes, orbitofrontal cortex)

Figure S5 Allowing for negative REML hyper‐parameter values leads to a similar performance of QUIQI. Allowing for negative hyper‐parameter values (λ > 0 or λ < 0), noise homoscedasticity is restored as when positive hyper‐parameters is enforced (λ > 0) (a). Note that consistently with the REML estimation, the polynomial fitting of the dependence of image noise on the MDI in a) also allowed for negative coefficients. The gain in ELBO compared to OLS analyses is also in a similar range (b)

Data Availability Statement

Material related to this paper is available here:

https://zenodo.org/record/4889895#.YdgV6CDMJPY

This material also includes the results of the analyses presented in this article, conducted on the available data.


Articles from Human Brain Mapping are provided here courtesy of Wiley

RESOURCES