Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Sep 1.
Published in final edited form as: Ann Appl Stat. 2018 Sep 11;12(3):1451–1478. doi: 10.1214/17-AOAS1117

COMPLEX-VALUED TIME SERIES MODELING FOR IMPROVED ACTIVATION DETECTION IN FMRI STUDIES

Daniel W Adrian , Ranjan Maitra §,*, Daniel B Rowe ¶,
PMCID: PMC6168091  NIHMSID: NIHMS933088  PMID: 30294404

Abstract

A complex-valued data-based model with pth order autoregressive errors and general real/imaginary error covariance structure is proposed as an alternative to the commonly-used magnitude-only data-based autoregressive model for fMRI time series. Likelihood-ratio-test-based activation statistics are derived for both models and compared for experimental and simulated data. For a dataset from a right-hand finger-tapping experiment, the activation map obtained using complex-valued modeling more clearly identifies the primary activation region (left functional central sulcus) than the magnitude-only model. Such improved accuracy in mapping the left functional central sulcus has important implications in neurosurgical planning for tumor and epilepsy patients. Additionally, we develop magnitude and phase detrending procedures for complex-valued time series and examine the effect of spatial smoothing. These methods improve the power of complex-valued data-based activation statistics. Our results advocate for the use of the complex-valued data and the modeling of its dependence structures as a more efficient and reliable tool in fMRI experiments over the current practice of using only magnitude-valued datasets.

AMS 2000 subject classifications: Primary 60K35, 60K35, secondary 60K35

Keywords and phrases: area under the ROC curve, contrast-to-noise ratio, finger-tapping motor experiment, hemodynamic response function, Kronecker product, neurosurgical planning guide, phase information, signal-to-noise ratio, structured covariance matrix

1. Introduction

Functional magnetic resonance imaging (fMRI) is a popular non-invasive method for studying brain function. Since its inception 25 years ago, fMRI has led to important developments in neurosurgical planning, cognitive neuroscience, and the perception of mental illness, among other fields (Rosen and Savoy, 2012). The imaging modality is based on the fact that when neurons fire in response to a stimulus or a task, the blood oxygen level in neighboring vessels changes, affecting the magnetic resonance (MR) signal on the order of 0.5–3% due to the differing magnetic susceptibilities of oxygenated and deoxygenated hemoglobin (Lazar, 2008). This difference is behind the so-called Blood Oxygen Level Dependent (BOLD) contrast (Bandettini et al., 1993; Belliveau et al., 1991; Kwong et al., 1992; Ogawa et al., 1990) which is utilized as a surrogate for neural activity and is used to acquire time-course sequences of images. The stimulus time course commonly consists of a block design, in which stimulus and resting periods are alternated regularly, or is event-related, in which case the stimulus is applied instantaneously at several time-points with the subject at rest at other times.

Each MR image is obtained in a series of steps from the k-space data (Brown, Kincaid and Ugurbil, 1982; Ljunggren, 1983; Tweig, 1983) which encodes different frequency contributions to each voxel. The varied frequencies result from different magnetic field gradients (Jezzard and Clare, 2001) and need to be inverted to localize measurements at each voxel. This step is implemented by applying the inverse Fourier transform (Jain, 1989) on the k-space data and results in a complex-valued observation at each voxel and each time-point. Thus, acquired fMRI data at each voxel and time-point can, in reality, be written in terms of its real and imaginary (or equivalently, magnitude and phase) components.

Acquired MR datasets have typically used only the magnitude measurements at each voxel for display and analysis. This practice of using only the magnitude data while discarding the phase at each voxel has carried over to fMRI practice so much that most statistical analyses routinely ignore the phase data and base their inferences on only the magnitude time series at each voxel (Rowe and Logan, 2004). Thus, in a literal sense, it could be said that fMRI analyses have almost exclusively thrown away half of its collected data, and though the magnitude values unarguably contain more information than do the phase measurements about the BOLD response of interest, we argue and demonstrate in this paper that the phase contains information also important to the analysis.

The general strategy in so-called “magnitude-only” statistical analysis of fMRI data is to fit, at each voxel, a model — commonly a general linear model (Friston et al., 1995) – to the time series observations against a transformation of the input stimulus. This transformation is the expected BOLD response and is effectively modeled in terms of a convolution of the stimulus time course with the hemodynamic response function (HRF), which measures the delay and dispersion of the BOLD response to an instantaneous neuronal activation (Friston, Jezzard and Turner, 1994; Glover, 1999). This provides the setting for the application of the Statistical Parametric Mapping (SPM) technique of Friston et al. (1990), in which the time series at each voxel is reduced to a test statistic that summarizes the association between each voxel time course and the expected BOLD response (Bandettini et al., 1993). The resulting map is then thresholded to identify voxels that are significantly activated (Genovese, Lazar and Nichols, 2002; Logan and Rowe, 2004; Worsley et al., 1996). Commonly, the images at each time point are spatially smoothed before each voxel time series is modeled and the test statistics are computed. Smoothing can improve the contrast-to-noise ratio (CNR) by averaging away the noise in neighboring voxels, as long as the activation is not smoothed away as well (Lazar, 2008; Smith, 2001).

A different approach using both magnitude and phase data (that is, the complete complex-valued data) in the analysis has several advantages. For instance, the phase time series can be useful in the so-called “brain or vein” problem, that is, in determining whether a voxel showing a task-related magnitude change represents (desired) grey matter or (an undesired) draining vein. This is because it has been demonstrated that voxels containing large, coherently oriented vessels (i.e., draining veins) can exhibit a task-related phase change (Hoogenrad et al., 1998; Menon, 2002); in contrast, according to theory, voxels containing small, randomly-oriented vessels exhibit little phase change. Recent findings also indicate that phase changes can occur outside of voxels containing draining veins as well, and thus (while complicating the “brain or vein” problem) phase data can provide additional physiological information (Feng et al., 2009; Zhao et al., 2007). In addition, preliminary evidence indicates that phase information may be important in neuronal current MRI (ncMRI), which aims to directly detect neuron firings (instead of using the BOLD surrogate) but its development is still in its early stages (Bandettini, Petridou and Bodurka, 2005). Lastly, simulations (Rowe and Logan, 2004) show that magnitude-only data-based activation tests have a substantial drop of power at low signal-to-noise ratios (SNRs) while corresponding complex-valued data-based tests have constant (and higher) power over all SNRs. This is a potentially important result for future fMRI studies incorporating greater spatial resolution, as voxel volume is inversely proportional to SNR.

1.1. Statistical models for complex-valued voxel time series

We now introduce notation for complex-valued fMRI time series. Focusing on a single voxel (and suppressing voxel-related subscripts), the complex-valued observation at time t can be denoted by yRt + iyIt in terms of real/imaginary components or by rtet = rt(cos ϕt + i sin ϕt) in terms of magnitude/phase components. Thus, it holds that yRt = rt cos ϕt, yIt = rt sin ϕt, rt=(yRt2+yIt2)12, and ϕt = arctan4(yIt, yRt), the 4-quadrant arctangent (see Glisson, 2011, Page 348) corresponding to arctan(yIt/yRt). For statistical analysis, we consider the complex number yRt + iyIt to be the bivariate random vector (yRt, yIt)′ and also denote the (real-valued) real, imaginary, magnitude, and phase time series vectors, respectively, by yR = (yR1,…, yRn)′, yI = (yI1,…, yIn)′, r = (r1,…, rn)′, and ϕ = (ϕ1,…, ϕn)′, with n denoting the number of scans. Most statistical models for complex-valued voxel time series are based on the fact that yRt and yIt are independent normal random variables with the same variance (Wang and Wei, 1994). These models may be categorized as whether the magnitude and phase are uncoupled (Lai and Glover, 1997; Lee et al., 2007) or (correctly) coupled, as in Nan and Nowak (1999), Rowe and Logan (2004), and Rowe (2005). Here. we specifically present the Rowe and Logan (2004) model:

(yRyI)=(XβcosθXβsinθ)+(ηRηI) (1.1)

because of its simplicity and because our investigation (see Section 4.2) indicates its adequacy for our dataset. The model couples the expected response of the magnitude time series with the phase location parameter θ that is assumed constant throughout the time series. Here the columns of the design matrix X represent features of the fMRI signal including the baseline signal and the expected BOLD response. The errors (ηR,ηI)~N(0,Φ), where Σ and Φ are matrices of order 2 and n representing the real/imaginary and temporal covariance, respectively, and ⊗ represents the direct (Kronecker) product. This model generalizes the design matrix and covariance structure of Nan and Nowak (1999), and was originally proposed by Rowe and Logan (2004) who however proceeded by assuming independence (i.e. by assuming that Σ = σ2I2 and Φ = In, where Ik is the identity matrix of order k) when deriving maximum likelihood estimates (MLEs) and a likelihood ratio test (LRT) for activation. Rowe (2005) further generalized model (1.1) by introducing time-dependent phase location parameters according to a linear model. Under this updated model, Rowe (2005) derived LRTs for task-related phase changes such as those caused by draining veins, as discussed previously.

1.2. Overview of contributions of this paper

This work offers three major contributions to complex-valued voxel-wise modeling of fMRI time series. Firstly, we develop methodology to correct for instability in the magnitude and phase signals via a “complex-valued running line,” which fits a running-line smoother to the magnitude and phase time series simultaneously via a complex-valued model. Correcting for nuisance signals in magnitude time series which, whether caused by scanner drift or aliased physiological effects (Lazar, 2008), are a well-known processing step in the analysis pipeline. Less known is that instability of phase time series can cause complex-valued data-based detection methods to become less powerful than their magnitude-only counterparts. Consequently, methods have been developed to correct the phase for inhomogeneities in the static magnetic field (Hahn, Nencka and Rowe, 2009), physiological noise (Petridou et al., 2009), and head motion (Hahn, Nencka and Rowe, 2012; Hahn and Rowe, 2012). In our case, preprocessing via the complex-valued running line controls for nonlinear nuisance magnitude signals, allows for the assumption of constant phase location parameter in model (1.1), and provides a better model fit.

Secondly, we apply a more general covariance structure to model (1.1) than previously considered in Rowe and Logan (2004), which assumed that the real and imaginary errors were independent with the same variance (Σ = σ2I2) and also temporally independent (Φ = In). Based on work in magnitude fMRI time series, it is widely realized for several reasons that the temporal independence assumption is not supported in reality. For one, the hemodynamic response to a single neural activation takes six seconds to peak and 15–20 seconds to return to baseline and so the measured response occurs over several scans (Friston, Jezzard and Turner, 1994). The subject’s cardiac and respiratory cycles provide another source of autocorrelation (Friston et al., 2000). Following Bullmore et al. (1996) who extended the linear model for magnitude-only time series by adding autoregressive (AR) errors, we apply AR(p) temporal dependence in the real and imaginary errors of (1.1). We also generalize the real/imaginary covariance Σ, allowing separate variances for the real and imaginary errors as well as real/imaginary correlation.

Thirdly, we study the effects of spatial smoothing on complex-valued fMRI data, including the correlations induced and their effect on activation detection. Nencka, Hahn and Rowe (2009) developed a theoretical framework for understanding the statistical effects of spatial smoothing (and other preprocessing operations), which was expanded upon in Bruce, Karaman and Rowe (2011) to examine the correlations induced by acquisition methods utilizing multiple receiver coils. However, to our knowledge, the effects of spatial smoothing upon complex-valued data-based activation detection have not previously been investigated.

The remainder of this paper is organized as follows. Section 2 introduces our showcase application, a complex-valued fMRI dataset acquired from a finger-tapping experiment, the type of which is utilized as a noninvasive neurosurgical planning tool. In this context, we introduce the complex-valued running line methodology and derive an LRT statistic for activation based on the extension of model (1.1) including temporal and real/imaginary correlation. (Section 3). We compare this test statistic to a magnitude-only AR(p) model-based LRT statistic, analyzing the finger-tapping experiment dataset under different amounts of smoothing in Section 4 and running simulation studies in Section 5 to explore the nature and validity of the results. We conclude with some discussion in Section 6. This paper also has an online supplement (Adrian, Maitra and Rowe, 2018) providing further details on methodology, performance evaluations, data analysis, and simulation studies. Sections, figures, and tables in the supplement referred to in this paper are labeled with the prefix “S−”.

2. Motivating example: aiding neurosurgical planning via accurate activation detection in a finger-tapping experiment

Our showcase application for this paper comes from a commonly-performed unilateral sequential finger-tapping experiment. In this case, the MR images were acquired with TR = 1s while the (normal healthy male) volunteer was instructed through visual cues to either lie at rest or to rapidly tap fingers of his right hand (hence unilateral) sequentially in the order of index, middle, ring, and little fingers. The experiment followed a block design with 16 epochs of 15s of tapping alternating with 15s of rest concluding with an additional 10s of rest at the end of the experiment. Hence, the dataset contains a temporal sequence of n = 490 images for use in our analysis. Each image was composed of seven 2.5 mm thick 96 × 96 axial slices with a 24.0 cm FOV. (The data are the last 490 time points analyzed in Karaman, Bruce and Rowe (2015), where it was inadvertently referred to as a bilateral finger-tapping experiment.) For this dataset, the phase components of the time series images were not discarded but stored along with the magnitude images used in traditional fMRI analysis. The data processing flow included Nyquist ghost removal and correction for global zero-order off-resonance using three navigator echos (Jesmanowicz, Wong and Hyde, 1993; Nencka, Hahn and Rowe, 2008), image reconstruction from k-space by inverse Fourier transform (Kumar, Welti and Ernst, 1975; Rowe, 2016), and estimation and correction of the dynamic field using temporal off-resonance alignment of single-echo timeseries (TOAST) (Hahn, Nencka and Rowe, 2009, 2012). A binary mask of voxels above 15% of the maximum voxel signal magnitude was generated from the first image of the Karaman, Bruce and Rowe (2015) dataset, representing voxels within the brain.

Figure 1 shows images of the real, imaginary, magnitude and phase components of the middle slice at time points 1, 8, 15, and 22. (See Figure S-1 for displays of the other slices.) These images appear largely unchanged over time because the BOLD stimulus response is very small compared to the overall MR signal. Figure 2 displays the real, imaginary, magnitude, and phase time series for one of the voxels showing the most activation. It is evident that the real, imaginary, and magnitude time series at this voxel oscillate with a period similar to the stimulus time course; in contrast, the phase time series has relatively constant location (though the fit of the constant phase assumption has some room for improvement, as we will see). Figure 2 also provides some pointers as to why the fMRI community has found it difficult to recognize that discarding half of the data (the phase) is not necessarily a prudent practice. This is because the magnitude data have the best definition, while the phase data have high variability and do not seemingly provide much visual content. However, the phase data have additional information on variability in the MR signal, and we contend and demonstrate that harnessing this additional information can improve specificity in fMRI analysis.

Fig 1.

Fig 1

Images of the (a) real, (b) imaginary, (c) magnitude, and (d) phase components of the middle slice for time points 1, 8, 15, and 22 (moving left to right).

Fig 2.

Fig 2

Time series of (from top) the real, imaginary, magnitude, and phase observations for one of the voxels showing the most activation. Lighter lines in each display represents the raw time series, while the darker lines show the result after applying a simple, central moving average filter with 5 nearest neighbors. The bottom display is of the HRF superimposed on the block design of the stimulus.

Paradigms involving hand stimulation, such as in the finger-tapping experiment introduced here, have deeper clinical significance, especially in the context of utilizing fMRI as a noninvasive neurosurgical planning tool for tumor and epilepsy patients. Such experiments have shown that the central sulcus is the location of hand function in the sensorimotor cortex for normal healthy adults (Rumeau et al., 1994). At risk of confusion, it is important to distinguish between the functional central sulcus and the anatomical central sulcus, as the two do not in general agree. The functional central sulcus is the brain activation region of hand function determined by the fMRI experiment (Lee et al., 1999). On the other hand, the anatomical central sulcus refers to brain anatomy instead of function, in this case the sulcus, or fold, in the cerebral cortex which separates the frontal lobe from the parietal lobe. As a neurosurgical planning tool, such fMRI experiments seek to identify the functional central sulcus in order to better assess the risk/benefit of surgery through knowledge of the relative locations of the proposed surgical target and of specific functional areas on the cortical surface (Lee et al., 1999). Traditionally, determining such knowledge required an invasive procedure, but the usage of fMRI allows it to be determined noninvasively and preoperatively. For a normal healthy subject, the functional central sulcus should align with its anatomical twin, but for many studies, “functional activation may extend over two or three sulci”, which results in “ambiguous identification of the [functional] central sulcus” (see Lee, Jack and Riederer, 1998, who also assert that “any level of ambiguity is unacceptable for surgical mapping”). Thus, methodology that reduces this ambiguity as far as possible is a desirable clinical goal. Figure 3 shows the location of the left anatomical central sulcus (Harnsberger et al., 2007) on the T2-weighted anatomical image of our subject’s brain used to create the mask. (We focus on the left central sulcus because its functional version will be activated by right-hand finger tapping due to the decussation of nerve fibers in the upper slices of the brain.) Therefore, the experiment here provides a test case for evaluating not only our methodology but also for the potential applicability of fMRI as a noninvasive clinical tool for the planning of neurosurgeries.

Fig 3.

Fig 3

(a) Anatomical image of the fifth slice and (b) corresponding contour plot identifying the left central sulcus.

3. Methodological Development

3.1. Trend removal via a complex-valued running line

Our methodology for the complex-valued running line was motivated by approaches to remove trend in magnitude-data fMRI analysis, of which there are several. In our experience, global polynomial fitting methods are not sufficiently flexible to model the gamut of nonlinear trends and frequency filtering methods can infer spurious trends at the ends of the time series. A good alternative is a running line as introduced by Marchini and Ripley (2000), which is similar to a moving average except that a least squares linear fit is applied to the k nearest neighbors of every time point. For periodic designs, Marchini and Ripley (2000) suggest letting k equal twice the period, and we adopt this practice here. Using this value of k removes nuisance trends while preserving task-related magnitude and phase changes.

To extend this methodology to complex-valued time series, we fit the Rowe (2005) model in each neighborhood. Specifically we find the value of the complex-valued running line at t* by fitting the model

(yRtyIt)=(ρtcosθtρtsinθt)+(ηRtηIt) (3.1)

for t in the neighborhood N(t*) = {t : max(t*k, 1) ≤ t ≤ min(t*+ k, n)}. In (3.1), the magnitude ρt and the phase θt are linear functions of time: specifically, ρt = β0 + stβ1, where st is the tth entry of an arithmetic sequence, and θt = γ0 + g(stγ1), where g(·) = 2arctan(·). The adaptation of the link function g(·) follows from Fisher and Lee (1992)’s methodology for regression with angular responses that was applied to phase-only fMRI time series in Rowe, Meller and Hoffman (2007) and allows unique ML estimation of γ1. (It is recommended that the arithmetic sequence be centered around zero for faster convergence of the MLE of γ1.) Details on the ML estimation of the parameters are given in Section S-2.1. Using these MLEs, the magnitude and phase of the complex-valued running line at time t* are given by ρ̂t* = β̂0 + st* β̂1 and θ̂t* = γ̂0 + g(st* γ̂1). (To save computer time, we can perform ML estimation for every j time points and use linear interpolation to obtain the values in between these time points.) The detrended complex-valued time series is then given by teiϕ̃t, t = 1,…, n, where t = rtρ̂t + ρ̄, with ρ¯=(1/n)t=1nρ^t, and ϕ̃t = ϕtθ̂t + θ0, where θ0 is an analyst-specified central phase that is the same at each voxel. Centering each phase time series around θ0 establishes phase coherence which is needed because the phase does not generally match between slices – see Figure S-1(d) and Figure 6 of Jesmanowicz, Nencka and Hyde (2014). Phase coherence of the voxel time series is desirable for the application of spatial smoothing, which we discuss next.

3.2. Spatial smoothing of complex-valued fMRI data

Following detrending with the complex-valued running line, we smooth the real and imaginary images (separately) at each time point with a 3-dimensional discrete isotropic Gaussian filter (Bruce, Karaman and Rowe, 2011; Nencka, Hahn and Rowe, 2009). We vary the amounts of smoothing by applying filters with different bandwidths expressed in terms of the full-width-at-half-maximum (FWHM). As a result, we produce a total of four datasets: three that are smoothed with FWHMs = 2, 4, and 6 voxels and one in which no smoothing is performed. We place special focus on the results of the smoothed dataset with FWHM = 4 voxels because this represents a moderate amount of smoothing and is the default value used by the Analysis of Functional Neuroimaging (AFNI) software (Cox, 1996, 2012; Cox and Hyde, 1997).

3.3. Overview of activation detection

We compare the activation detected in these datasets by LRT statistics under complex-valued and magnitude-only data-based models, both of which assume AR(p) errors; the statistics are denoted by ΛC,p and ΛM,p, respectively. For now, the AR order p is assumed to be known; in practice, p will be detected via the methodology to be described in Section 3.5. As mentioned in Section 1, the complex-valued model is an extension of the Rowe and Logan (2004) model,

(yRyI)=(XβcosθXβsinθ)+(ηRηI), (3.2)

where X contains columns for the baseline and expected BOLD response and we assume a constant phase location parameter θ. (Due to the complex-valued running line, it is not necessary for X to contain additional columns, such as polynomials, to model the magnitude trend; further, the constant phase assumption is strengthened.) The errors (ηR,ηI)~N(0,Φ), where

=Cov(yRt,yIt)=(σR2ρσRσIρσRσIσI2) (3.3)

and Φ = Rn is such that Cov(ηR)=σR2Rn and Cov(ηI)=σI2Rn; thus the model assumes that the autocorrelation of the real and imaginary errors are determined by the same AR coefficients α = (α1,…, αp)′. The magnitude-only model is given by r = + ε, where X is the same as in (3.2) and ε follows a pth order autoregressive structure. (Note that the corresponding parameters in the magnitude-only and complex-valued models are not necessarily equivalent, but we do not distinguish them notationally for the sake of simplicity.)

Under either model, the activation test may be posed generally as H0 : = 0 vs. Ha : 0, and the LRT statistics have χm2 asymptotic null distributions, where m = rank(C). We derive the complex-valued model-based LRT statistic (LRTS) in Section 3.4; we relegate magnitude-only LRTS derivations to the supplement (Section S-2.2). At each voxel, a p-value is computed from the obtained LRTS. We use cluster thresholding (Forman et al., 1995; Logan, Geliazkova and Rowe, 2008) on these p-values to determine activated voxels by tagging clusters of a minimum size where voxels have p-values below a specified significance level.

3.4. Modeling for AR(p) complex-valued fMRI time series

Denoting the model parameters by τ=(α,β,θ,σR2,σI2,ρ) and utilizing the properties of Kronecker products, model (3.2) has log-likelihood function given by

logL(τyR,yI)=-n/2log[σR2σI2(1-ρ2)]-logRn-h/[2(1-ρ2)],

where

h=(ηRηI)(1σR2Rn-1-ρσRσIRn-1-ρσRσIRn-11σI2Rn-1)(ηRηI). (3.4)

The maximum likelihood estimate (MLE) of β is

β^=β^R[cosθ^σ^R2-ρ^sinθ^σ^Rσ^I]+β^I[sinθ^σ^I2-ρ^cosθ^σ^Rσ^I]cos2θ^σ^R2+sin2θ^σ^I2-2ρ^sinθ^cosθ^σ^Rσ^I, (3.5)

where β^R=(XR^n-1X)-1XR^n-1yR,β^I=(XR^n-1X)-1XR^n-1yI, and R^n-1 is a function of α̂, the MLE of α, according to the (2p + 1)-diagonal matrix given in Pourahmadi (2001, Section 4.4). (The sparseness of the matrix R^n-1 is a computational advantage of the AR(p) model.) Further, the MLE of θ is

θ^=[arcsin(c/a2+b2)-arctan4(b,a)]/2, (3.6)

where a, b, and c (defined in terms of d, e, and f to be defined a little later) are given by a=d/σ^I2-e/σ^R2,b=-f(1/σ^R2+1/σ^I2)-ρ^(d+e)/(σ^Rσ^I), and c=ρ^(d-e)/(σ^Rσ^I)+f(1/σ^R2-1/σ^I2). The terms d, e, and f are given by d=BRR/σ^R4+ρ^2BII/(σ^R2σ^I2)-2ρ^BRI/(σ^R3σ^I),e=BII/σ^I4+ρ^2BRR/(σ^R2σ^I2)-2BRIρ^/(σ^Rσ^I3), and f=(1+ρ^2)BRI/(σ^R2σ^I2)-ρ^/(σ^Rσ^I)(BRR/σ^R2+BII/σ^I2), where we have BRR=β^R(XR^n-1X)β^R,BII=β^I(XR^n-1X)β^I, and BRI=β^R(XR^n-1X)β^I. The MLEs for σR2,σI2, and ρ are given by

σ^R2=(yR-Xβ^cosθ^)R^n-1(yR-Xβ^cosθ^)/n (3.7)
σ^I2=(yI-Xβ^sinθ^)R^n-1(yI-Xβ^sinθ^)/n (3.8)
ρ^=(yR-Xβ^cosθ^)R^n-1(yI-Xβ^sinθ)/(nσ^Rσ^I). (3.9)

Lastly, we obtain α̂ by solving the system of equations (Miller, 1995)

d^θk=j=1p[d^ij+(j/n)d^0,j-k]α^j, (3.10)

for k = 1,…, p, with d^ij=d^ij(RR)/σ^R2+d^ij(II)/σ^I2-ρ^/(σ^Rσ^I)[dij(RI)+d^ij(IR)], 0 ≤ i, jp. In the preceding development, we have d^ij(ιζ)=t=1n-i-jη^ι,t+iηζ,t+j with ι, ζR, I. wherein η^Rt=yRt-xtβ^cosθ^,η^It=yIt-xtβ^sinθ^, and xt is the tth row of X, t = 1,…, n. In practice, starting values for β̂ and θ̂ can be computed under the assumption of independence (see equation (2.10) in Rowe and Logan, 2004). ML estimation then consists of iteratively updating (3.5)–(3.10) until convergence. The LRT statistic is

ΛC,p=nlog(σR2σI2(1-ρ2)σ^R2σ^I2(1-ρ^2))-2log(|Rp-1|/|R^p-1|), (3.11)

where σR2,σI2, and ρ̃ are restricted MLEs under H0 : = 0 (see Section S-2.3), Rp is the p×p correlation matrix such that Cov(ηR1,,ηRp)=σR2Rp and Cov(ηI1,,ηIp)=σI2Rp and R^p-1 and Rp-1 are functions of α̂ and α̃, respectively, as in Pourahmadi (2001, Section 4.4). (Equation (3.11) relies on the equality |Rn| = |Rp|, which is another computational advantage of the AR model.)

3.5. Choosing the order of the AR model

We suggest a sequential testing approach to decide on the AR order p. Starting with k = 1, and for increasing k, we posit H0 : p = k − 1 vs. Ha : pk (or, in terms of the AR coefficients, H0 : ∀jk,αj = 0 vs. Ha : ∃jk : αj ≠ 0). The estimated AR order is then = k′−1, where k′ is the first k in the sequence of tests for which H0 can not be rejected. Note that, in the procedure, the significance level controls the probability of over-detection of the order (i.e. the order detected is greater than the true order) in the sense described in Section S-2.4. For both the magnitude-only and the complex-valued models, we employ a LRT-based test statistic given by 2(ℓ̂k − ℓ̂k−1), where ℓ̂k is the optimized log-likelihood for the AR(k) model. From standard results, this test statistic is asymptotically χ12-distributed under H0 : p = k − 1. We denote the orders detected by the magnitude-only and complex-valued model test statistics by M and C, respectively.

4. Application to fMRI study

We discuss the application of the methodology to the finger-tapping dataset described in Section 2.

4.1. Analysis pipeline and specifications

We analyzed the complex-valued (CV) and magnitude-only (MO) datasets separately, yet in tandem. First, the CV running line was applied to each CV voxel time series, and a regular running line was applied to each magnitude voxel time series. The central phase of each detrended CV time series was set to be θ0 = π/4 (for reasons discussed in Section S-3.3). Then, three datasets were created from both the CV and MO datasets by smoothing the real, imaginary, and magnitude images at each time point with 3-dimensional Gaussian filters with FWHMs = 2, 4, and 6 voxels; thus, including the spatially unsmoothed datasets, there were a total of 4 CV and MO datasets. The CV and MO models were then applied to each voxel time series. For both models, the design matrix X had n = 490 rows and q = 2 columns: one column was for an intercept modeling the baseline MR signal and the other was the waveform modeling the expected BOLD response (Glover, 1999). Consequently, with β = (β0, β1)′, the activation tests posited H0 : β1 = 0 vs. Ha : β1 ≠ 0 with the LRT statistics being χ12-distributed under H0. For both the CV and MO models, we computed two test statistics, one using the detected AR order at each voxel and, for comparison, also with p ≡ 0. Thus, we compared the performance of four LRT statistics, denoted by ΛC,p̂C, ΛM,p̂M, ΛC,0, and ΛM,0. For each LRT statistic (LRTS) and smoothing level, we computed activation maps using cluster thresholding with third-order neighborhood structure, a size threshold of 11 voxels (Forman et al., 1995), and a significance level threshold of 0.001 (as recommended by Woo, Krishnan and Wager, 2014).

4.2. Addressing model fit and model assumptions

We first examined the performance of the CV running line. Figure 4 shows a voxel’s magnitude and phase time series before and after correction by the CV running line and suggests that nuisance trends were well-removed in both time series. For a global assessment, we fit model (3.1) to the entirety of each voxel time series (not just the nearest neighbors) and computed an LRTS for H0 : γ1 = 0 vs. Ha : γ1 ≠ 0. Using a Bonferroni-corrected significance level of 0.05, 63.3% of the voxels rejected H0. This suggests that even when with the application of advances such as TOAST (Hahn, Nencka and Rowe, 2009, 2012) (done here, see Section 2)) to stabilize the phase data, a phase detrending preprocessing step such as the CV running line may be needed to allow for the constant phase location assumption of model (3.2). This preprocessing step is important because phase instability can degrade the power of complex-valued data-based activation tests (Hahn, Nencka and Rowe, 2009), making them less powerful than their magnitude-only data-based counterparts.

Fig 4.

Fig 4

A voxel’s (a,c) magnitude and (b,d) phase time series (a,b) before and (c,d) after correction by the complex-valued running line. Figures (a,b) also show the CV running-line fit.

We refer to Section S-3.1 for additional assessments of complex-valued model fit and assumptions. These include the assumptions of phase coupling and the equality of the AR coefficients for the real and imaginary errors, an assessment of the autoregressive fit to the autocorrelation, and a comparison of the computational times for the CV and MO data-based analyses.

4.3. Activation maps

Figure 5 displays activation maps under the four LRTSs for the fifth slice of the dataset smoothed three-dimensionally with FWHM = 4 voxels. We display the fifth slice in order to compare the regions detected to be functionally active with the anatomical left central sulcus in Figure 3(b). Although all four statistics show functional activation in the anatomical left central sulcus (as it should for our normal subject), the activation map using ΛC,p̂C identifies it with the least ambiguity. For instance, the ΛM,p̂M map shows less significant p-values in the left central sulcus, suggesting that it is less powerful than ΛC,p̂C. Further, the independent error model-based maps detect brain regions outside the left central sulcus, and detection inside the region is more diffuse than the ΛC,p̂C map. Thus, the superior ability of the complex-valued AR(p) model to more accurately identifying the correct activation region points to its applicability in clinically utilizing fMRI finger stimulation experiments for neurosurgical planning in tumor and epilepsy patients, where the goal is to locate the functional central sulcus accurately (vide our discussion in Section 2).

Fig 5.

Fig 5

Activation maps for the finger-tapping experiment after cluster thresholding. The p-value of the observed test statistic for each voxel identified as activated under each model is overlaid on the anatomic slice under investigation.

For a broader view, Section S-3.2 presents activation images for three slices and using smoothing levels of FWHMs = 2, 4, and 6 voxels and no smoothing. The better performance of complex-valued AR(p) model-based statistic held over all slices and smoothing kernels. However, with spatially unsmoothed data, there was much less difference between the maps of the four activation statistics, and it was difficult to determine a clear winner. We explain this result in terms of the effect of smoothing (or none) on model parameter estimates in the next section. Section S-3.2 also displays maps obtained by false discovery rate (FDR) thresholding and assuming p = 1. Results (see Figure S-7) were similar to those in Figure 5.

4.4. Effect of spatial smoothing on parameter estimates

We examined the effect of smoothing on the complex-valued AR(p) model parameter estimates and the activation maps. (Recall from the previous section that the ΛC,p̂C maps were favored when spatial smoothing was performed at any level, but there was not much difference between the maps for the unsmoothed data.) For easier interpretation of the extent of temporal correlation, we assumed p = 1 for each voxel. (Spatial and frequency distributions of the detected AR orders are given in Figures S-9, S-10, and S-11.) Figure 6 contains images of the real/imaginary correlation ρ̂ and the temporal AR parameter α̂1 for the fifth slice (corresponding images for the other parameters are given in Figure S-12). Two general observations are as follows: first, for the unsmoothed data, ρ̂ and α̂1 are close to zero, indicating that the independence assumptions applied in Rowe and Logan (2004) may be adequate. This explains why the independent model-based statistics had similar activation maps to their counterparts that assumed an AR model for the unsmoothed data. Further, simulations in Rowe and Logan (2004) indicated that at the SNRs of the finger-tapping dataset, there were very small differences between the activation detection power between the complex-valued and magnitude-only model-based statistics under the assumptions of temporal and real/imaginary independence. However, we found that increased smoothing was associated with greater values of |ρ̂| and α̂1. This result may be unexpected: while it would be expected that the spatial smoothing would induce spatial correlation (Nencka, Hahn and Rowe, 2009), it may be puzzling that it would also induce temporal correlation – in this case, both within and between the real and imaginary time series. Figure 7 displays the real and imaginary time series for a representative voxel (see Figure 8(a)) for the unsmoothed and smoothed (FWHM = 4 voxels) data and provides another view of our remarks above. For the unsmoothed data, the real and imaginary time series resemble white noise and the real/imaginary components are not correlated; however, with spatial smoothing, the path of the time series resembles an AR process with correlated real and imaginary components. This may be explained by the application of smoothing on spatio-temporal data (Karaman et al., 2014). In summary, the assumption of temporal and real/imaginary independence is increasingly untenable with increasing smoothing (also see Figure S-13). This may be a reason behind the more spurious and diffuse activation displayed in the independent model-based activation maps. It may also explain the apparent reduced power of the magnitude-only AR(p) model-based map, as the real/imaginary correlation cannot be estimated using magnitude-only data. We report our tests of these assertions with simulation studies in Section 5.

Fig 6.

Fig 6

Voxel-wise parameter estimates (a) ρ̂ and (b) α̂1 for (moving left to right within each subfigure) spatial smoothing with FWHM = 0, 2, 4, and 6 voxels. (FWHM= 0 means no spatial smoothing.)

Fig 7.

Fig 7

Time series plots of the real and imaginary values – showing only the first 200 s for ease of viewing – for the representative voxel in Figure 8(a) for the (top) unsmoothed and (bottom) smoothed data with FWHM = 4 voxels.

Fig 8.

Fig 8

(a) Location of voxel in the fifth slice from which our representative time series was chosen. Correlation between the real component of the time series at the voxel and each (b) real or (c) imaginary time series in a 37×37 spatial neighborhood centered at the voxel in (a). Corresponding correlations between the imaginary time series at the voxel in (a) and the (d) real and (e) imaginary time series. (f) Correlation expected in (b) and (e) due to spatial smoothing assuming independence. (Such correlations in (c) and (d) are expected to be zero because the real and imaginary component images are smoothed separately.)

In way of explaining the effect of spatial smoothing on the parameter estimates, we offer the following spatiotemporal model for complex-valued fMRI data following detrending via the CV running line. Letting yRst and yIst be the real and imaginary measurements at voxel s and time t, respectively, the model states that

(yRstyIst)=(xtβs+vst)(cosθsinθ)+(wRstwIst), (4.1)

where xtβs is the mean response and θ is the central phase as before (and is equal for all s following the application of the CV running line), and vst and (wRst, wIst)′ represent two sources of variation. Thus, (4.1) suggests that the real and imaginary errors in model (3.2) can be further decomposed into ηRst = vst cos θ + wRst and ηIst = vst sin θ + wIst. This decomposition is supported by the fMRI literature, where it is well-established that the (magnitude) noise contains multiple components, including “both white [e.g., thermal (Johnston) noise] and colored components [e.g., pulsatile motion of the brain caused by cardiac cycles or local modulation of the static magnetic field (B0) by respiratory movement]” (Friston et al., 2000). (See also Purdon and Weisskoff, 1998, who model fMRI noise time series as an AR(1) time series plus added white noise.) In our model, vst represents the colored (i.e. autocorrelated) and (wRst, wIst) represent the white components. In addition to being autocorrelated, the model assumes that vst are spatially correlated (Zarahn, Aguirre and D’Esposito, 1997); for example, their covariance could be modeled as a Kronecker product of spatial and temporal AR(1) covariance matrices, as in Kruggel and von Cramon (1999). This fits the framework of Karaman et al. (2014) for describing the effects of processing such as spatial smoothing on spatiotemporally uncorrelated, temporally correlated, spatially correlated, and spatiotemporally correlated data. Figure 8 demonstrates that the spatial correlations in the finger-tapping dataset are much more extensive than would be expected by spatial smoothing alone. In contrast, wRst, wIst are assumed to be mutually, temporally, and spatially independent; the independence of the real and imaginary thermal noise components is well-established (Gudbjartsson and Patz, 1995).

We now discuss how model (4.1) supports the observed dependence of the parameter estimates on spatial smoothing. First, the fact that the temporal and real/imaginary cross-correlations are near-zero for the unsmoothed data suggests that the variances of wRst and wIst larger than vst. However, under spatial smoothing, the variance due to (wRst, wIst) decreases – more rapidly with increasing FWHM – due to the averaging of the independent random variables. On the other hand, because the vsts are spatially correlated, their fluctuations are less affected by such averaging. The net effect is that the temporal correlation increases with spatial smoothing. Also, our choice of θ0 = π/4 for all voxels in applying the CV running line means that the real and imaginary time series share a common vst in model (4.1). Thus, the fact that smoothing reduces the variation of (wRst, wIst) more than vst means that it increases the real/imaginary cross-correlation as well. To clarify the above discussion, we present some derivations under simplifying assumptions in Section S-3.3.1.

To conclude, it is interesting to note that the parameter estimates also depend on the choice of the central phase θ0.We explore this fact in Section S-3.3.2, including how it is expected under model (4.1) and that the choice of θ0 = π/4 supports the assumption that the real and imaginary errors share the same AR coefficients.

5. Simulations

Analysis of the finger-tapping dataset in Section 4 yielded the observation that if the dataset was smoothed, the complex-valued AR(p)-model-based activation map identified the appropriate region with the least ambiguity. With unsmoothed data however, the maps provided by the four competing statistics ΛC,p̂C, ΛM,p̂M, ΛC,0, and ΛM,0 were very similar. We attributed this to the fact that the estimated temporal and real/imaginary correlations were negligible without smoothing but sizable with smoothing. To understand and clarify these results in settings of known ground truth, we performed simulation studies to examine (1) the effect of the real/imaginary covariance, including the real/imaginary correlation and the relative sizes of the real and imaginary variances (Section 5.2), (2) effect of temporal correlation (Section 5.3), and (3) the effect of spatial smoothing through simulating a region of interest (ROI) (Section 5.4).

5.1. Simulation Setup

We simulated complex-valued voxel time series according to (3.2) with X specified as in Section 4.1. From properties of the conditional multivariate normal distribution, these simulated real and imaginary time series, denoted by yR and yI, respectively, can be generated as

yR~N(Xβcosθ,σR2Rn)yIyR~N(Xβsinθ+ρσI/σR(yR-Xβcosθ),σR2(1-ρ2)Rn). (5.1)

Because we generated magnitude and phase data without nuisance trends, we did not apply the CV running line. Further, we did not apply spatial smoothing to the simulated data, but instead quantified its effect through the simulation parameters. Otherwise, our simulation studies mimicked the finger-tapping data analysis pipeline in using the same X, order detection procedure, H0 and Ha, and specifications for cluster thresholding. The corresponding magnitude time series were computed from these simulated complex-valued time series, resembling fMRI data collection in practice. In practice fMRI data are unitless, so we parameterized the simulations through the SNR ≡ β0/σ and the CNR ≡ β1/σ, where σ2=(σR2+σI2)/2, making the results applicable to any fMRI dataset. Unless otherwise specified, the simulation parameters were SNR = 190, CNR = 0.05, θ = π/4, and σ = 0.011, which are typical estimates from the dataset smoothed with FWHM=4voxels. Other parameters varied according to the study performed. At each set of parameters, we replicated 100,000 time series (Sections 5.2 and 5.3) and 10,000 ROIs (Section 5.4).

We evaluated the activation statistics using measures that differed according to the simulation study. In Sections 5.2 and 5.3, we generated voxel time series without a spatial context and used standard hypothesis testing methodology. In these studies, we calculated the true and false positive rates as the proportions of truly activated and non-activated time series detected as activated, using the χ12 null distribution. For a test statistic to be of any practical use, its false positive rate must (at least approximately) be equal to the significance level. In addition, we calcuated the area under the receiver operating characteristic (ROC) curve (AUC) as per calculations detailed in Section S-4.1. For evaluating performance in obtaining activation maps in Section 5.4, we used the Jaccard (1901) Index (JI) to measure the percent overlap between the estimated and true activation maps for different statistics. The JI, or modified overlap, introduced by Maitra (2010) in fMRI as a preferable alternative to the more-commonly used overlap or Dice coefficient (Dice, 1945; Sørensen, 1948), is defined between the ith and jth activation maps as mωij = Vij/(Vi+VjVij), where Vi, Vj, and Vij are the number of voxels declared activated in map i, map j, and both maps i and j, respectively. Thus mωij can be interpreted as the proportion of voxels that are activated in both maps i and j from among those activated in either of them.

5.2. Effect of covariance of real and imaginary errors

To examine the effect of the covariance Σ, we varied the ratio of standard deviations σR/σI and the correlation coefficient ρ, with the former varied such that σ is constant while the SNR and CNR remain unaffected. To isolate this effect, we assumed temporal independence when simulating time series and calculating activation statistics. Figure 9(a) shows that the false positive rates of both the CV- and MO-model-based statistics are approximately equal to the significance of 0.05 at all covariance parameters, so both statistics are potentially useful. However, the powers and AUCs of ΛC,0 is greater than ΛM,0 whenever the real and imaginary variances are unequal, and increasingly so as σR/σI gets farther from 1. In view of this result, we note that the distributions of σ̂R/σ̂I for the finger-tapping dataset at various smoothing levels are given in Figure S-12(f). We see that σ̂R/σ̂I tends to be close to 1 for the unsmoothed data but is farther from 1 for the smoothed data. This is a potential reason that the ΛC,p̂C map showed more power than the ΛM,p̂M map for the smoothed data, but the two maps were similar for the unsmoothed data.

Fig 9.

Fig 9

Plots showing the (a) false positive rate, (b) power and (c) AUC for the complex-valued and magnitude-only activation statistics in terms of σR/σI and ρ. Color and line types indicate the same settings across all displays.

Interestingly also, the power and AUC functions in Figure 9 are greater for negative ρ than for positive ρ. In fact, this is part of a larger issue where the specific form of the power/AUC functions in terms of σR/σI and ρ depends on the value of θ. This is discussed in detail in Section S-4.2. No matter the values of θ and Σ, however, the power and AUC of the CV model-based activation statistic are always greater than or equal to its MO model-based counterpart.

5.3. Varying the AR(1) coefficient

To examine the effect of the temporal dependence upon the activation statistics, we simulated time series with AR(1) errors with coefficients varying from 0 to 0.8. To isolate this effect, we let σR = σI and ρ = 0. (A separate simulation with ρ = 0.5, not shown here, gave nearly identical results.) We first studied the ability of each model to detect the correct AR order. Figure 10(a) demonstrates that the complex-valued model detects the true order (1) for a greater proportion of simulated time series than the magnitude-only model. This greater power to reject H0 : α1 = 0 may be attributed to the fact that the complex-valued dataset contains twice as many autocorrelated quantities as the magnitude-only data. Note also that most of the errors are of under-detection of the order since the significance level of 0.05 controls over-detection errors.

Fig 10.

Fig 10

Effect, for the complex-valued and magnitude-only fits, of increasing the AR(1) coefficient on (a) order detection and (b) the false positive rate.

Zarahn, Aguirre and D’Esposito (1997) found that the false positive rates of (magnitude-only data-based) activation statistics under the assumption of independence are inflated. To see whether this result carries over to the context of the complex-valued data, we computed the activation statistics ΛC,p̂C and ΛM,p̂M (based on the detected orders) and also ΛC,p and ΛM,p for p = 0, 1, 2. As shown in Figure 10(b), compared to the significance level of 0.05, the false positive rates of the LRTSs under independence are indeed inflated, and increasingly so as the AR(1) coefficient increases. (Remarkably, the false positive rate balloons to 0.38 for α1 = 0.8.) Thus, it is evident that independent-error-based activation statistics are unreliable even with small amounts of autocorrelation. With the correct order however, the LRTSs properly control the false positive rates. This control also holds when the order is over specified; note that the lines for p = 2 are not shown because they coincide with those for p = 1.) Our results here are thus similar to those in Section 4 where the spurious activation regions detected under the independence-assumed maps for the smoothed data could be considered false positives.

The false positive rates of ΛM,p̂M and ΛC,p̂C are in between those of the correct order-based statistics and the independence-based statistics because of order detection errors. However, the false positive rate of ΛC,p̂C is closer to 0.05 than ΛM,p̂M, which points to another advantage of the complex-valued model. To explain why, let Pf (·) be the false positive rate, which can be expressed as the weighted average

Pf(ΛD,p^D)=kP(p^D=k)Pf(ΛD,k), (5.2)

where D = C or M for the complex-valued or magnitude-only data models, respectively. In this case, we can approximate the average over all orders k to include only k = 0and 1 for two reasons: first, the proportion of voxels with p̂ > 1 is small, and second, PfD,k) ≈ PfD,1) for k > 1. Thus, we have the approximation

Pf(ΛD,p^D)P(p^D=0)Pf(ΛD,0)+P(p^D1)Pf(ΛD,1). (5.3)

The fact that PfM,p̂M) > PfC,p̂C) then follows from that PfD,0) is greater than PfD,1), for D = C,M, and that P(M = 0) > P(C = 0).

Section S-4.3 reports results of simulations to study the effect of autoregressive dependence for lags more than one. To conclude, we have shown that order under-detection inflates the false activation detection rate but over-detection does not, so we recommend using a larger significance level for sequentially detecting the order.

5.4. Examining a region of interest (ROI)

To perform a simulation study incorporating the spatial aspect of fMRI analysis, we simulated from the small region (ROI) of the finger-tapping dataset shown in Figure 11(a). The voxel time series within the ROI were simulated using model (3.2) with p = 1 and the four sets of parameter estimates in Table 1, each of which is representative of these estimates under one of the four levels of spatial smoothing. For each parameter set, we used the same parameters for each voxel in the replicate ROIs except for β1, which was zero or positive depending on whether the voxel was part of the anatomical left central sulcus in Figure 3(b). We computed four activation maps for each replicate ROI based on the four activation statistics, using cluster thresholding on each map. Figures 11(b, c, e, f) show the true and false activation rates over the replicated ROIs for the parameter set corresponding to the smoothed data with FWHM = 4 voxels. (See Figure S-18 for similar plots for the other parameter sets.) The false positive rates in Figure 11(e, f) for the independent error-based activation statistics are greater than their counterparts which have autocorrelated errors. As in Section 5.3, autocorrelation in the simulated time series makes the independent-model-based activation statistics fail to follow the χ12 null distribution under H0, as illustrated in Figure S-19. In fact, the false positive rates increase with amount of smoothing. However, they are similar for all four activation statistics with the unsmoothed data, which can be attributed to its near absence of autocorrelation.

Fig 11.

Fig 11

(a) Location of simulated ROI (blue box) with voxels (dark orange) chosen to be activated according to the anatomical left central sulcus in Figure 3(b). (b,c,e,f) Images of true activation detection rates (legend in d) and false positive rates (legend in g) for the simulated ROIs for the parameter values corresponding to spatial smoothing with FWHM = 4 voxels.

Table 1.

Parameter values used in simulating the ROIs in Section 5.4.

FWHM β0 β1 θ σR σI ρ α1
0 1.89 0.0090 π/4 0.0887 0.0882 0.011 0.036
2 1.89 0.0027 π/4 0.0225 0.0225 0.133 0.177
4 1.89 0.0019 π/4 0.0113 0.0118 0.343 0.373
6 1.89 0.0018 π/4 0.0086 0.0094 0.394 0.473

The above findings once again suggest that independence model-based activation statistics are inapplicable with spatial smoothing because of their inflated false positive rates. Therefore, we remove them from consideration when computing Jaccard indices to measure the agreement between the simulation-based activation maps and the known truth. Table 2 gives the mean Jaccard indices over ROI replications; see Figure S-20 for kernel density estimates of the distributions. On the whole, the Jaccard indices are similar under the parameter set representing the unsmoothed data, but with smoothing, the complex-valued model-based indices are greater than their the magnitude-only model-based counterparts, indicating that activation maps obtained via complex-valued AR(p) modeling have higher agreement with the truly activated voxels. This conforms with our observations for the finger-tapping dataset. (It should be noted that the Jaccard indices in Table 2 are not meant to be compared between parameter sets, but rather only between complex-valued and magnitude-only models within the same parameter set.)

Table 2.

Mean Jaccard similarity coefficients over replications of simulated ROIs for each LRT statistic, measuring the percent overlap between the statistic-based activation maps and the truth, when different amounts of spatial smoothing is applied (determined by FWHM, with 0 denoting no spatial smoothing).

FWHM = 0 FWHM = 2 FWHM = 4 FWHM = 6
Magnitude-only model 0.844 0.730 0.678 0.717
Complex-valued model 0.843 0.741 0.700 0.744

6. Discussion

In this paper, we have expanded and further developed complex-valued time series analysis of fMRI data. As explained in the introduction, such datasets are truly complex-valued when collected, but most analysis methods routinely discard the phase information, utilizing only the magnitude images in the data analysis. In this paper therefore, we have proposed a general model for complex-valued time series, extending the independence assumptions of Rowe and Logan (2004), and have derived a LRTS for detecting activated brain voxels under this model framework. We compared its performance to a statistic similarly derived under a Gaussian-assumed magnitude-only linear model. We applied this methodology to a unilateral finger-tapping dataset of the type used for neurosurgical planning, in which the purpose is clear identification of the left functional central sulcus. Our analysis showed that the complex-valued AR(p) model-based activation map more clearly and accurately identifies this brain region than the magnitude-only model and the independence-based complex-valued model in cases where the images were spatially smoothed, as commonly done in practice. For the unsmoothed data, the activation maps performed similarly. To explain this result, we found that temporal and real/imaginary correlation is induced by smoothing and proposed a model to interpret these findings. For further clarification, we performed several simulation studies. In one, we found that the complex-valued model has greater detection power than the magnitude-only model because it estimates the variances and correlation between of the real and imaginary errors, which are not estimable under the magnitude-only model. In another, we found that independence-based activation statistics can have greatly inflated false positive rates when even small amounts of autocorrelation are present in the time series. Further, we found that the complex-valued model is better able to discriminate between activated and nonactivated voxels, as measured by both the AUC and the Jaccard index. We also emphasize that these results occurred at SNR values typical of fMRI data (on the order of 50 for the unsmoothed data and higher for the smoothed data), which represents an improvement over findings in previous complex-valued fMRI studies (Nan and Nowak, 1999; Rowe and Logan, 2004), that only showed substantial improvements in complex-valued over magnitude-only model-based activation detection at SNRs below 5.

There are many areas that could benefit from further attention. In this paper, we have shown improved activation detection with complex-valued AR modeling of fMRI time series and shown its potential in the context of neurosurgical planning by accurately mapping the (left) functional central sulcus. Note, however, that the objective of neurosurgical planning is preoperative brain mapping that allows for the preservation of cerebral function during brain tumor resection. Such planning also may need to map tactile, visual, language or other functional areas which will involve other experimental paradigms, for which complex-valued fMRI time series analysis of the kind we have recommended in this paper can potentially provide greater accuracy. Further, while we hope that the development of this paper will encourage statisticians to recommend that this data not be routinely discarded but analyzed more thoroughly for more accurate activation results, we note that vast numbers of legacy magnitude-valued fMRI datasets exist (and continue to be collected). Especially at lower SNRs, the observations in these datasets are Rice-distributed, but they have all been analyzed using Gaussian AR models. Therefore, it would be of interest to develop methodology for Rice-distributed time series.

Supplementary Material

1

Acknowledgments

The authors are very grateful to the Editor, an Associate Editor and three reviewers whose very detailed and insightful comments on earlier versions of this manuscript greatly improved its content and presentation.

Footnotes

SUPPLEMENTARY MATERIAL

Contents: Supplement to “Complex-valued Time Series Modeling for Improved Activation Detection in fMRI Studies (http://www.e-publications.org/ims/support/dowload/imsart-ims.zip). Section S-1 displays the real, imaginary, magnitude and phase components of the data for all slices of the finger-tapping dataset used in this paper. Section S-2 provides additional details and derivations on our methodology. Further details on the analysis of the finger-tapping dataset are provided in Section S-3 while more simulation-based analyses are in Section S-4.

References

  1. Adrian DW, Maitra R, Rowe DB. Supplement to “Complex-valued Time Series Modeling for Improved Activation Detection in fMRI Studies”. 2018 doi: 10.1214/17-AOAS1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bandettini PA, Petridou N, Bodurka J. Direct detection of neuronal activity with MRI: Fantasy, possibility, or reality? Applied Magnetic Resonance. 2005;29:65–88. [Google Scholar]
  3. Bandettini PA, Jesmanowicz A, Wong EC, Hyde JS. Processing strategies for time-course data sets in functional MRI of the human brain. Magnetic Resonance in Medicine. 1993;30:161–173. doi: 10.1002/mrm.1910300204. [DOI] [PubMed] [Google Scholar]
  4. Belliveau JW, Kennedy DN, McKinstry RC, Buchbinder BR, Weisskoff RM, Cohen MS, Vevea JM, Brady TJ, Rosen BR. Functional mapping of the human visual cortex by magnetic resonance imaging. Science. 1991;254:716–719. doi: 10.1126/science.1948051. [DOI] [PubMed] [Google Scholar]
  5. Brown TR, Kincaid BM, Ugurbil K. NMR chemical shift imaging in three dimensions. Proceedings of the National Academy of Sciences, USA. 1982;79:3523–3526. doi: 10.1073/pnas.79.11.3523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bruce IP, Karaman MM, Rowe DB. A statistical examination of SENSE image reconstruction via an isomorphism representation. Magnetic Resonace Imaging. 2011;29:1267–1287. doi: 10.1016/j.mri.2011.07.016. [DOI] [PubMed] [Google Scholar]
  7. Bullmore E, Brammer M, Williams SCR, Rabe-Hesketh S, Janot N, David A, Mellers J, Howard R, Sham P. Statistical methods of estimation and inference for function MR image analysis. Magnetic Resonance in Medicine. 1996;35:261–277. doi: 10.1002/mrm.1910350219. [DOI] [PubMed] [Google Scholar]
  8. Cox RW. AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. Computers and biomedical research, an international journal. 1996;29:162–173. doi: 10.1006/cbmr.1996.0014. [DOI] [PubMed] [Google Scholar]
  9. Cox RW. AFNI: What a long strange trip it has been. NeuroImage. 2012;62:743–747. doi: 10.1016/j.neuroimage.2011.08.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cox RW, Hyde JS. Software tools for analysis and visualization of fMRI data. NMR in Biomedicine. 1997;10:171–178. doi: 10.1002/(sici)1099-1492(199706/08)10:4/5<171::aid-nbm453>3.0.co;2-l. [DOI] [PubMed] [Google Scholar]
  11. Dice LR. Measures of the amount of ecologic association between species. Ecology. 1945;26:297–302. [Google Scholar]
  12. Feng Z, Caprihan A, Blagoev KB, Calhoun VD. Biophysical modeling of phase changes in BOLD fMRI. NeuroImage. 2009;47:540–548. doi: 10.1016/j.neuroimage.2009.04.076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Fisher NI, Lee AJ. Regression Models for an Angular Response. Biometrics. 1992;48:665–677. [Google Scholar]
  14. Forman SD, Cohen JD, Fitzgerald M, Eddy WF, Mintun MA, Noll DC. Improved Assessment of Significant Activation in Functional Magnetic Resonance Imaging (fMRI): Use of a Cluster-Size Threshold. Magnetic Resonance in Medicine. 1995;33:636–647. doi: 10.1002/mrm.1910330508. [DOI] [PubMed] [Google Scholar]
  15. Friston KJ, Jezzard P, Turner R. Analysis of functional MRI time-series. Human Brain Mapping. 1994;1:153–171. [Google Scholar]
  16. Friston KJ, Frith CD, Liddle PF, Dolan RJ, Lammertsma AA, Frackowiak RSJ. The relationship between global and local changes in PET scans. Journal of Cerebral Blood Flow and Metabolism. 1990;10:458–466. doi: 10.1038/jcbfm.1990.88. [DOI] [PubMed] [Google Scholar]
  17. Friston KJ, Holmes AP, Worsley KJ, Poline JB, Frith CD, Frackowiak RSJ. Statistical parametric maps in functional imaging: A general linear approach. Human Brain Mapping. 1995;2:189–210. [Google Scholar]
  18. Friston KJ, Josephs O, Zarahn E, Holmes AP, Rouqette S, Poline JB. To smooth or not to smooth? Bias and efficiency in fMRI time-series analysis. Neuroimage. 2000;12:196–208. doi: 10.1006/nimg.2000.0609. [DOI] [PubMed] [Google Scholar]
  19. Genovese CR, Lazar NA, Nichols TE. Thresholding of statistical maps in functional neuroimaging using the false discovery rate. Neuroimage. 2002;15:870–878. doi: 10.1006/nimg.2001.1037. [DOI] [PubMed] [Google Scholar]
  20. Glisson TH. Introduction to Circuit Analysis and Design. Springer; The Netherlands: 2011. [Google Scholar]
  21. Glover GH. Deconvolution of impulse response in event-related BOLD fMRI. Neuroimage. 1999;9:416–429. doi: 10.1006/nimg.1998.0419. [DOI] [PubMed] [Google Scholar]
  22. Gudbjartsson H, Patz S. The Rician distribution of noisy data. Magnetic Resonance in Medicine. 1995;34:910–914. doi: 10.1002/mrm.1910340618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hahn AD, Nencka AS, Rowe DB. Improving robustness and reliability of phase-sensitive fMRI analysis using temporal off-resonance alignment of single-echo timeseries (TOAST) Neuroimage. 2009;44:742–752. doi: 10.1016/j.neuroimage.2008.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hahn AD, Nencka AS, Rowe DB. Enhancing the Utility of Complex-Valued Functional Magnetic Resonance Imaging Detection of Neurobiological Processes Through Postacquisition Estimation and Correction of Dynamic B0 Errors and Motion. Human Brain Mapping. 2012;33:288–306. doi: 10.1002/hbm.21217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hahn AD, Rowe DB. Physiologic noise regression, motion regression, and TOAST dynamic field correction in complex-valued fMRI time series. NeuroImage. 2012;59:2231–2240. doi: 10.1016/j.neuroimage.2011.09.082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Harnsberger HR, Osborn AG, Ross JS, Moore KR, Salzman KL, Carrasco CR, Halmiton BE, Davidson HC, Wiggins RH. Diagnostic and Surgical Imaging Anatomy: Brain, Head and Neck, Spine. 3. Amirsys; Salt Lake City, Utah: 2007. [Google Scholar]
  27. Hoogenrad FG, Reichenbach JR, Haacke EM, Lai S, Kuppusamy K, Sprenger M. In vivo measurement of changes in venous blood-oxygenation with high resolution functional MRI at .95 Tesla by measuring changes in susceptibilty and velocity. Magnetic Resonace in Medicine. 1998;39:97–107. doi: 10.1002/mrm.1910390116. [DOI] [PubMed] [Google Scholar]
  28. Jaccard P. Ètude comparative de la distribution florale dans une portion des Alpes et des Jura. Bulletin del la Sociètè Vaudoise des Sciences Naturelles. 1901;37:547579. [Google Scholar]
  29. Jain AK. Fundaments of Digital Image Processing. Prentice Hall; 1989. [Google Scholar]
  30. Jesmanowicz A, Nencka A, Hyde JS. Direct Radiofrequency Phase Control in MRI by Digital Waveform Playback at the Larmor Frequency. Magnetic Resonace Imaging. 2014;71:846–852. doi: 10.1002/mrm.24713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Jesmanowicz A, Wong EC, Hyde JS. Phase Correction for EPI Using Internal Reference Lines. Proceedings from the International Society of Magnetic Resonance in Medicine. 1993;12:1239. [Google Scholar]
  32. Jezzard P, Clare S. Principles of nuclear magnetic resonance and MRI. In: Jezzard P, Matthews PM, Smith SM, editors. Functional MRI: An Introduction to Methods. Vol. 3. Oxford University Press; 2001. pp. 67–92. [Google Scholar]
  33. Karaman M, Bruce IP, Rowe DB. Incorporating relaxivities to more accurately reconstruct MR images. Magnetic Resonace Imaging. 2015;33:374–384. doi: 10.1016/j.mri.2015.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Karaman M, Nencka AS, Bruce IP, Rowe DB. Quantification of the Statistical Effect of Spatiatemporal Processing of Nontask fMRI Data. Brain Connectivity. 2014;4:649–661. doi: 10.1089/brain.2014.0278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kruggel F, von Cramon DY. Modeling the hemodynamic response in single-trial functional MRI experiments. Magnetic Resonance in Medicine. 1999;42:787–797. doi: 10.1002/(sici)1522-2594(199910)42:4<787::aid-mrm22>3.0.co;2-v. [DOI] [PubMed] [Google Scholar]
  36. Kumar A, Welti D, Ernst RR. NMR Fourier Zeugmatography. Journal of Magnetic Resonance. 1975;18:69–83. doi: 10.1016/j.jmr.2011.09.019. [DOI] [PubMed] [Google Scholar]
  37. Kwong KK, Belliveau JW, Chesler DA, Goldberg IE, Weisskoff RM, Poncelet BP, Kennedy DN, Hoppel BE, Cohen MS, Turner R, Cheng HM, Brady TJ, Rosen BR. Dynamic Magnetic Resonance Imaging of Human Brain Activity During Primary Sensory Stimulation. Proceedings from the National Academy of Sciences, USA. 1992;89:5675–5679. doi: 10.1073/pnas.89.12.5675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lai S, Glover G. Detection of BOLD fMRI signals using complex data. Proceedings from the International Society of Magnetic Resonance in Medicine. 1997;5:1671. [Google Scholar]
  39. Lazar NA. The Statistical Analysis of Functional MRI Data. Springer; 2008. [Google Scholar]
  40. Lee CC, Jack CR, Riederer SJ. Mapping of the Central Sulcus with Functional MR: Active versus Passive Activation Tasks. American Journal of Neuroradiology. 1998;19:847–852. [PMC free article] [PubMed] [Google Scholar]
  41. Lee CC, Ward HA, Sharbrough FW, Meyer FB, Marsh WR, Raffel C, So EL, Cascino GD, Shin C, Xu Y, Riederer SJ, Jack CR. Assessment of Functional MR Imaging in Neurosurgical Planning. American Journal of Neuroradiology. 1999;20:1511–1519. [PMC free article] [PubMed] [Google Scholar]
  42. Lee J, Shahram M, Schwartzman A, Pauly JM. Complex Data Analysis in High-Resolution SSFP fMRI. Magnetic Resonance in Medicine. 2007;57:905–917. doi: 10.1002/mrm.21195. [DOI] [PubMed] [Google Scholar]
  43. Ljunggren S. A Simple Graphical Representation of Fourier-Based Imaging Methods. Journal of Magnetic Resonance. 1983;54:338–343. [Google Scholar]
  44. Logan BR, Geliazkova MP, Rowe DB. An evaluation of spatial thresholding techniques in fMRI analysis. Human Brain Mapping. 2008;29:1379–1389. doi: 10.1002/hbm.20471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Logan BR, Rowe DB. An evaluation of thresholding techniques in fMRI analysis. Neuroimage. 2004;22:95–108. doi: 10.1016/j.neuroimage.2003.12.047. [DOI] [PubMed] [Google Scholar]
  46. Maitra R. A re-defined and generalized percent-overlap-of-activation measure for studies of fMRI reproducibility and its use in identifying outlier activation maps. Neuroimage. 2010;50:124135. doi: 10.1016/j.neuroimage.2009.11.070. [DOI] [PubMed] [Google Scholar]
  47. Marchini JL, Ripley BD. A new statistical approach to detecting significant activation in functional MRI. Neuroimage. 2000;12:366–380. doi: 10.1006/nimg.2000.0628. [DOI] [PubMed] [Google Scholar]
  48. Menon RS. Postacquisition suppression of large-vessel BOLD signals in high-resolution fMRI. Magnetic Resonace in Medicine. 2002;47:1–9. doi: 10.1002/mrm.10041. [DOI] [PubMed] [Google Scholar]
  49. Miller JW. Exact Maximum Likelihood Estimation in Autoregressive Processes. Journal of Time Series Analysis. 1995;16:607–615. [Google Scholar]
  50. Nan FY, Nowak RD. Generalized Likelihood Ratio Detection for fMRI Using Complex Data. IEEE Transactions on Medical Imaging. 1999;18:320–329. doi: 10.1109/42.768841. [DOI] [PubMed] [Google Scholar]
  51. Nencka AS, Hahn AD, Rowe DB. The Use of Three Navigator Echoes in Cartesian EPI Reconstruction Reduces Nyquist Ghosting. Proceedings from the International Society of Magnetic Resonance in Medicine. 2008;16:3032. [Google Scholar]
  52. Nencka AS, Hahn AD, Rowe DB. A Mathematical Model for Understanding the STatistical effects of k-space (AMMUST-k) preprocessing on observed voxel measurements in fcMRI and fMRI. Journal of Neuroscience Methods. 2009;181:268–282. doi: 10.1016/j.jneumeth.2009.05.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Ogawa S, Lee TM, Nayak AS, Glynn P. Oxygenation-sensitive contrast in magnetic resonance image of rodent brain at high magnetic fields. Magnetic Resonance in Medicine. 1990;14:68–78. doi: 10.1002/mrm.1910140108. [DOI] [PubMed] [Google Scholar]
  54. Petridou N, Schafer A, Gowland P, Bowtell R. Phase vs. magnitude information in functional magnetic resonace imaging time series: toward understanding the noise. Magnetic Resonace Imaging. 2009;27:1046–1057. doi: 10.1016/j.mri.2009.02.006. [DOI] [PubMed] [Google Scholar]
  55. Pourahmadi M. Foundations of Time Series Analysis and Prediction Theory. Wiley; 2001. [Google Scholar]
  56. Purdon PL, Weisskoff RM. Effect of temporal autocorrelation due to physiological noise and stimulus paradigm on voxel-level false-positive rates in fMRI. Human Brain Mapping. 1998;6:239–249. doi: 10.1002/(SICI)1097-0193(1998)6:4&#x0003c;239::AID-HBM4&#x0003e;3.0.CO;2-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Rosen BR, Savoy RL. fMRI at 20: Has it changed the world? NeuroImage. 2012;62:1316–1324. doi: 10.1016/j.neuroimage.2012.03.004. [DOI] [PubMed] [Google Scholar]
  58. Rowe DB. Modeling both the magnitude and phase of complex-valued fMRI data. Neuroimage. 2005;25:1310–1324. doi: 10.1016/j.neuroimage.2005.01.034. [DOI] [PubMed] [Google Scholar]
  59. Rowe DB. Image Reconstruction in Functional MRI. Chapman & Hall/CRC; 2016. Handbook of Neuroimaging Data Analysis; pp. 205–232. [Google Scholar]
  60. Rowe DB, Logan BR. A complex way to compute fMRI activation. Neuroimage. 2004;23:1078–1092. doi: 10.1016/j.neuroimage.2004.06.042. [DOI] [PubMed] [Google Scholar]
  61. Rowe DB, Meller CP, Hoffman RG. Characterizing phase-only fMRI data with an angular regression model. Journal of Neuroscience Methods. 2007;161:331–341. doi: 10.1016/j.jneumeth.2006.10.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Rumeau C, Tzourio N, Murayama N, Peretti-Viton P, Levrier O, Joliot M, Mazoyer B, Salamon G. Location of Hand Function in the Sensorimotor Cortex: MR and Functional Correlation. American Journal of Neuroradiology. 1994;15:567–572. [PMC free article] [PubMed] [Google Scholar]
  63. Smith SM. Preparing fMRI data for statistical analysis. Oxford University Press; 2001. Functional MRI: An Introduction to Methods. [Google Scholar]
  64. Sørensen T. A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on Danish commons. Biologiske Skrifter/Kongelige Danske Videnskabernes Selskab. 1948;5:134. [Google Scholar]
  65. Tweig DB. The k-trajectory formulation of the NMR imaging process with applications in analysis and synthesis of imaging methods. Medical Physics. 1983;10:610–21. doi: 10.1118/1.595331. [DOI] [PubMed] [Google Scholar]
  66. Wang T, Wei T. Statistical analysis of MR imaging and its applications in image modeling. Proceedings of the IEEE International Conference on Image Proccessing and Neural Networks. 1994;1:866–870. [Google Scholar]
  67. Woo CW, Krishnan A, Wager TD. Cluster-extent based thresholding in fMRI analyses. NeuroImage. 2014;91:412–419. doi: 10.1016/j.neuroimage.2013.12.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Worsley KJ, Marrett S, Neelin P, Vandal AC, Friston KJ, Evans AC. A unified statistical approach for determining significant voxels in images of cerebral activation. Human Brain Mapping. 1996;4:58–73. doi: 10.1002/(SICI)1097-0193(1996)4:1<58::AID-HBM4>3.0.CO;2-O. [DOI] [PubMed] [Google Scholar]
  69. Zarahn E, Aguirre GK, D’Esposito M. Empirical Analyses of BOLD fMRI Statistics. I. Spatially Unsmoothed Data Collected under Null-Hypothesis Conditions. NeuroImage. 1997;5:179–197. doi: 10.1006/nimg.1997.0263. [DOI] [PubMed] [Google Scholar]
  70. Zhao F, Jin T, Wang P, Hu X, Kim SG. Sources of Phase Changes in BOLD and CBV-Weighted fMRI. Magnetic Resonance in Medicine. 2007;57:520–527. doi: 10.1002/mrm.21159. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES