Skip to main content
Human Brain Mapping logoLink to Human Brain Mapping
. 2002 Jul 19;17(1):13–16. doi: 10.1002/hbm.10036

Using larger dimensional signal subspaces to increase sensitivity in fMRI time series analyses

Eric Zarahn 1,
PMCID: PMC6872014  PMID: 12203684

Abstract

It has been explained previously how using large dimensional signal‐subspaces can reduce/eliminate bias in the estimated fMRI response (Burock and Dale [2000]: Hum Brain Mapp 11:249–260). It has also been explained how one can project this less biased estimate onto a one‐dimensional subspace of interest (Burock and Dale [2000]: Hum Brain Mapp 11:249–260). In cases where there are multiple, correlated characterized response components per event type, separately projecting the full hemodynamic response onto one‐dimensional subspaces of interest can lead to bias. We present an approach for both estimating the full hemodynamic response and obtaining from it unbiased estimates of effects of theoretical interest (in the context of ordinary least‐squares estimation). The latter estimates are identical to those obtained by projecting the original data into the space defined by the (possibly multi‐dimensional) effects of theoretical interest, but the ensuing statistical inference can be more sensitive. Hum. Brain Mapping 17:13–16, 2002. © 2002 Wiley‐Liss, Inc.

Keywords: fMRI, regression, restriction, basis

INTRODUCTION

Blood oxygenation level dependent functional magnetic resonance imaging (BOLD fMRI) yields time series reflecting the oxygenation state of hemoglobin in venous blood [Kwong et al., 1992; Ogawa et al., 1993]. Changes in neural activity lead to changes in fMRI signal under normal physiological conditions [Ogawa et al., 1998; Logothetis et al., 2001]. Testing for such temporal changes in fMRI signal using the general linear model (GLM) [Friston et al., 1995; Worsley and Friston, 1995] can be conceptualized as comprising two sequential steps. In the first, the entire data vector is projected into the column space of a design matrix (i.e., a set of predictor variables is simultaneously fit to the data). Of particular relevance here is that the residuals of this initial fitting stage are used to generate an estimate of the error variance, which plays a crucial role in subsequent statistical tests (t‐ or F‐tests). To avoid bias in the estimated error variance, the design matrix should ideally be able to represent any possible response [Johnston, 1972]. In the second step, fit components of theoretical interest are tested for statistical significance. These fit components are specified by contrast vectors, which constrain (or restrict) the aspect of the fit being tested. This note concerns the use of restriction matrices [Johnston, 1972; Burock and Dale, 2000] to test for particular fMRI response components at the contrast stage when using a design matrix that initially allows for more lenient response fitting.

It will be useful to introduce the concept of characterized and uncharacterized response components. Characterized components of the response are defined as those that have scientific meaning or are plausible based on the current state of knowledge about the system under study. Most likely, such meaning will have been acquired by a response component through its consistent previous observation. By this definition, characterized waveforms (but not their weightings) are known before the performance of the experiment. Note that characterized components can contain both effects of interest and effects of no‐interest [Friston et al., 1995] relative to the hypothesis being tested. Uncharacterized components are defined as being not explicitly known a priori, and so are simply taken to be the orthogonal complement of the characterized components with respect to some larger space (see Theory). Admitting the possibility of uncharacterized components allows (albeit, in a limited sense) for unexpected yet systematic behavior of the fMRI response.

THEORY

Restriction matrices

Although the concept of restriction matrices is basic to the GLM [Johnston, 1972], Burock and Dale [2000] explicitly introduced the concept of restriction matrices to the fMRI literature.1 Their specific approach will be subsequently referred to as “B&D” in the current note.

A k′ by k restriction matrix R allows one to test for the presence of signal in a given k′ (1 ≤ k′k) dimensional subspace of the columns of a rank k design matrix (G). Each row of R is simply a contrast vector. R may thus be considered a k′‐dimensional contrast. The null hypothesis, Ho, is expressed as

equation image (1)

where q is a k′ by 1 vector whose ith row represents the amplitude under Ho of the ith contrast (q = 0 in most fMRI applications), and β is the parameter vector of the design matrix [Johnston, 1972; Worsley and Friston, 1995]. Ho can be assessed with an F‐test for general k′, though only the special case of k′ = 1 will be explicitly considered here. Note that β is estimated at the initial fitting step using G, whereas R comes into play only at the contrast specification step.

As mentioned in the Introduction, systematic components in the data that are not in the span of the design matrix will positively bias the estimate σˆ2 of the error variance. The utility of the restriction matrix approach is that it allows one to estimate characterized effects while initially modeling the signal in a larger subspace. By doing so, one attempts to model more systematic components in the data, and hence reduce the bias on σˆ2 (Johnston, 1972). The goal of this note concerns the specification of R such that Rβˆ is an unbiased estimator of a desired linear combination (represented by the contrast vector c) of characterized waveform amplitudes regardless of whether the characterized components are explicitly represented in the large design matrix. B&D dealt with this problem for a special case; this special case and its limitations are discussed below.

In B&D, the temporal response to each type of event is modeled in the design matrix with a standard basis (i.e., the response amplitude at each peri‐stimulus time point is separately represented). As a consequence, β in B&D directly represents the temporal response to the modeled events; R is simply taken to comprise the appropriately weighted, hypothesized waveforms themselves. B&D implicitly assumes that there is only one characterized response associated with each event type. This assumption was valid for their simulations. A problem arises, however, if there are multiple, correlated characterized waveforms per event type, as can arise in certain designs [Zarahn, 2000]. In such cases, estimation of the desired linear combination of characterized waveform amplitudes with B&D is biased. A second issue is that it is not immediately obvious what form R should have when event‐related responses are modeled in the design matrix other than with a standard basis. Below, the choice of R is generalized to deal with these issues.

General restriction matrix approach

The specification of R is a change‐of‐bases problem. Details of the derivation are not presented. The expectation of the fMRI response2 y is assumed to be representable in two k dimensional temporal bases, which are the column spaces of G A and G B

equation image (2)

Moreover, G A and G B are assumed to share the same column space. The heuristic distinction made between the two bases is that G A is constructed based on some minimal constraint (e.g., finite duration response), whereas G B is constructed in part based on stronger prior knowledge. In particular, G B is partitioned to reflect the possibility that the fMRI response can possess both characterized3 and uncharacterized components

equation image (3)

where the columns of G c define a characterized basis and the columns of G u define an uncharacterized basis. Let βA be the parameter vector in the GLM corresponding to G A, and let βc be the partition of the parameter vector of G B corresponding to G c. In practice, βA will be estimated, even though the hypotheses of interest are most transparently expressed as linear combinations of βc. The restriction matrix problem is to find an R such that E[RβˆA] = cTβc.

By definition, an explicit form for G u will not be known. It was explained previously how G u is taken to be the orthogonal complement of G c with respect to G A (such that G cT G u = 0). If this assumption is adopted, G u does not even need to be given an explicit form, and subsequent computations are made very simple.

Given these hypotheses, a unique solution to the problem of finding an R such that E[RβˆA] = cTβc is to let

equation image (4)

Although RβˆA possesses the same variance as c Tβˆc (βˆc being the estimate of βc computed directly from the data using G c as the design matrix), their estimated variances can be systematically different. This is because if G c were used as the design matrix, signal components in the span of the column space of G u would contribute systematically to σˆ2.

RESULTS

Bias in σˆ2 due to possibly incomplete modeling of fMRI responses with solely characterized components was examined in an event‐related design involving the subject being presented letters to be remembered across a delay.4 Define %Δσˆ2 as

equation image

where σˆc2 and σˆA2 are the estimated error variances obtained from using G c and G A as the design matrices, respectively. %Δσˆ2 averaged across all voxels in the brain was 12.8%. When restricted to the suprathreshold5 local maxima for the t‐test (using σˆc2 as σˆ2) assessing the amplitude of neural responses to the letter presentation, %Δσˆ2 was 25.7%. This suggests that in this dataset, the stronger the expression of characterized response components, the stronger the expression of uncharacterized response components.

DISCUSSION

It is well known that bias in error variance estimation might be mitigated by using a design matrix that can represent all plausible response components, as opposed to simply those of clear theoretical interest [Johnston, 1972]. A method has been presented for specification of restriction matrices to allow for the estimation of desired linear combinations of characterized fMRI waveform amplitudes when using such ‘larger’ design matrices. The approach as described is valid for ordinary least‐squares estimation of GLM parameters (e.g., as employed in the SPM99 package; Wellcome Department of Neurology), though it could be easily extended to generalized least‐squares estimation [Burock and Dale, 2000]. Unlike B&D, this approach is not limited to standard bases and yields unbiased estimates even when there are multiple, correlated characterized components per event type [Zarahn, 2000].

Assumed orthogonality of characterized and uncharacterized components

Andrade et al. [1999] discussed how orthogonalizing predictor variables with respect to others changes inference on the coefficients of the latter. For example, say non‐orthogonal predictors e and f comprise a design matrix [e f]. Imagine that e is orthogonalized with respect to f (yielding e*), and that a new design matrix [e* f] is created. The coefficient of f obtained using this design matrix will reflect the expression of both e and f in the data. That is, the coefficient of f in the context of e* will be biased with respect to the coefficient of f that would have been obtained from the original design matrix, [e f].

As characterized and uncharacterized components are assumed to be orthogonal in the current method, one might contemplate the question “What if the characterized and uncharacterized components are not actually orthogonal? Would not inference on the characterized components then be biased by the orthogonalization?” The answer is subtle: recall that we do not start off with a set of known uncharacterized components that are then orthogonalized with respect to the characterized components. Rather, we simply define the uncharacterized components as the orthogonal complement of the characterized components with respect to some larger space. This does not obviate the point of Andrade et al. [1999], but simply reflects the fact that the definition of the characterized components (and so the uncharacterized components) depends on the current state of knowledge about the fMRI response. Furthermore, as explained below, it is not the case that the current method involves orthogonalization of known effects of no‐interest relative to effects of interest.

Contrasting characterized/uncharacterized and interest/no‐interest

The dichotomy of characterized and uncharacterized effects used here is not identical to that of effects of interest and effects of no‐interest [Friston et al., 1995]. The relationship between the two dichotomies is that although every effect of interest is a characterized effect (and every uncharacterized effect is an effect of no‐interest), there can be characterized effects that are effects of no‐interest. In fact, every effect that one can represent explicitly before the performance of the experiment is a characterized effect, and characterized effects need not be mutually orthogonal. Therefore, one need not be concerned that the orthogonality assumption will induce confounding of effects of interest by characterized effects of no‐interest [Andrade et al., 1999].

The characterized/uncharacterized dichotomy is not practically relevant in most (non‐fMRI) GLM applications. This is because in almost all regression circumstances one deals solely with characterized response components, whether they are of interest or not. For example, consider a psychological study in which the effect of age on reaction time is under examination, with gender and IQ included as effects of no‐interest in the GLM. In this example, not only the effect of interest (age) but also the effects of no‐interest (gender and IQ) would fall into the class of characterized effects.

Acknowledgements

The MATLAB 5.3 (Mathworks, South Natick, MA) code used in conjunction with the SPM99 program (Wellcome Department of Neurology) implementing the theory outlined in this work is available from the author.

Footnotes

1

In that paper, feasible generalized least‐squares was used instead of ordinary least‐squares estimation. Though this does not fundamentally change the approach to be described, the assumptions of orthogonality are slightly different under the two types of estimation. Therefore, the current method only properly pertains to ordinary least‐squares estimation.

2

To clarify, y is the fMRI response for an entire experiment, and is not necessarily a single “event‐related” response. Thus, y can contain overlapping responses from multiple occurrences of multiple event types.

3

For clarity, G c itself could be partitioned into effects of interest and those of no‐interest [Friston et al., 1995]. This has no effect on estimation.

4

Sternberg Item Recognition Paradigm [Sternberg, 1966]; GE EPIBOLD 5.7, TR/TE = 3,000/50 msec, flip angle = 90°, FOV = 24 cm, matrix = 64 × 64, slice thickness = 7 mm, 15 axial slices; data processing and analysis performed with SPM99 (Wellcome Department of Neurology); the columns of G A comprised a Fourier series (order = 4, fundamental period = 36 sec); data were spatially smoothed with an isotropic Gaussian kernel (FWHM = 8 mm); the columns of G c comprised convolutions of a hemodynamic response function (using the default parameters in SPM99) with rect functions corresponding to event components [Zarahn, 2000]; patient was a 19‐year‐old female diagnosed with schizophrenia.

5

α = 0.05, corrected for multiple comparisons using Gaussian random field theory as implemented in SPM99 [Worsley et al., 1996].

REFERENCES

  1. Andrade A, Paradis A, Rouquette S, Poline J (1999): Ambiguous results in functional neuroimaging data analysis due to covariate correlation. NeuroImage 10: 483–486. [DOI] [PubMed] [Google Scholar]
  2. Burock M, Dale A (2000): Estimation and detection of event‐related fMRI signals with temporally correlated noise: a statistically efficient and unbiased approach. Hum Brain Mapp 11: 249–260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Friston KJ, Holmes AP, Worsley KJ, Poline JP, Frith CD, Frackowiak RSJ (1995): Statistical parametric maps in functional imaging: a general linear approach. Hum Brain Mapp 2: 189–210. [Google Scholar]
  4. Johnston J (1972): Econometrics methods. New York: McGraw‐Hill. [Google Scholar]
  5. Kwong KK, Beliveau JW, Chesler DA, Goldberg IE, Weisskoff RM, Poncelet BP, Kennedy DN, Hoppel BE, Cohen MS, Turner R, et al. (1992): Dynamic magnetic resonance imaging of human brain activity during primary sensory stimulation. Proc Natl Acad Sci USA 89: 5675–5679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Logothetis N, Pauls J, Augath M, Trinath T, Oeltermann A (2001): Neurophysiological investigation of the basis of the fMRI signal. Nature 412: 150–157. [DOI] [PubMed] [Google Scholar]
  7. Ogawa S, Menon R, Kim S‐G, Ugurbil K (1998): On the characteristics of functional magnetic resonance imaging of the brain. Annu Rev Biophys Biomolec Struct 27: 447–474. [DOI] [PubMed] [Google Scholar]
  8. Ogawa S, Menon RS, Tank DW, Kim SG, Merkle H, Ellermann JM, Ugurbil K (1993): Functional brain mapping by blood oxygenation level‐dependent contrast magnetic resonance imaging. A comparison of signal characteristics with a biophysical model. Biophys J 64: 803–812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Sternberg S (1966): High‐speed scanning in human memory. Science 153: 652. [DOI] [PubMed] [Google Scholar]
  10. Worsley KJ, Friston KJ (1995): Analysis of fMRI time‐series revisited—again. NeuroImage 2: 173–182. [DOI] [PubMed] [Google Scholar]
  11. Worsley KJ, Marrett S, Neelin P, Vandal AC, Friston KJ, Evans AC (1996): A unified statistical approach for determining significant signals in images of cerebral activation. Hum Brain Mapp 4: 58–73. [DOI] [PubMed] [Google Scholar]
  12. Zarahn E (2000): Testing for neural responses during temporal components of trials with BOLD fMRI. NeuroImage 11: 783–796. [DOI] [PubMed] [Google Scholar]

Articles from Human Brain Mapping are provided here courtesy of Wiley

RESOURCES