Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Jun 24.
Published in final edited form as: Neuroimage. 2011 Dec 8;59(4):4160–4167. doi: 10.1016/j.neuroimage.2011.11.088

SimTB, a simulation toolbox for fMRI data under a model of spatiotemporal separability

Erik B Erhardt b,a,*, Elena A Allen a, Yonghua Wei b, Tom Eichele c,e, Vince D Calhoun a,d
PMCID: PMC3690331  NIHMSID: NIHMS340872  PMID: 22178299

Abstract

We introduce SimTB, a MATLAB toolbox designed to simulate functional magnetic resonance imaging (fMRI) datasets under a model of spatiotemporal separability. The toolbox meets the increasing need of the fMRI community to more comprehensively understand the effects of complex processing strategies by providing a ground truth that estimation methods may be compared against. SimTB captures the fundamental structure of real data, but data generation is fully parameterized and fully controlled by the user, allowing for accurate and precise comparisons. The toolbox offers a wealth of options regarding the number and configuration of spatial sources, implementation of experimental paradigms, inclusion of tissue-specific properties, addition of noise and head movement, and much more. A straightforward data generation method and short computation time (3–10 seconds for each dataset) allow a practitioner to simulate and analyze many datasets to potentially understand a problem from many angles. Beginning MATLAB users can use the SimTB graphical user interface (GUI) to design and execute simulations while experienced users can write batch scripts to automate and customize this process. The toolbox is freely available at http://mialab.mrn.org/software together with sample scripts and tutorials.

Keywords: simulation, fMRI, group analysis

1. Introduction

Simulating functional magnetic resonance imaging (fMRI) data is an important strategy for evaluating processing and analysis methods. These methods include fMRI data preprocessing (Allen et al., 2010; Della-Maggiore et al., 2002), algorithm comparisons and activation detection techniques (Erhardt et al., 2011; Correa et al., 2007; Li et al., 2007; Ma et al., 2007; Lu et al., 2003; Dimitriadou et al., 2004; Worsley et al., 2005), hypothesis testing strategies (Logan and Rowe, 2004; Marchini and Presanis, 2004), and motion-correction algorithms (Ardekani et al., 2001; Freire and Mangin, 2001; Pickens et al., 2005; Morgan et al., 2007).

For dozens of studies, a specific one-time simulation has been developed to understand a model or analysis method (e.g., LeVan and Gotman, 2009; Esposito and Goebel, 2011; Moosmann et al., 2008; Schmithorst and Holland, 2004; Li et al., 2007; Yourganov et al., 2011). Comparing across these one-time simulations, many are based on different models having different assumptions. Some models have as few as three sources including baseline, source of interest, and noise (Della-Maggiore et al., 2002; Logan and Rowe, 2004; Marchini and Presanis, 2004; Baumgartner et al., 2000; Lu et al., 2003; Dimitriadou et al., 2004; Baumgartner et al., 1998; Beckmann and Smith, 2004; Bellec et al., 2006; Bullmore et al., 2001; Jahanian et al., 2004). Other fMRI simulation software use first principles of the MR physics (Drobnjak et al., 2006; Xu et al., 2007; Kim et al., 2008), bootstrap from real data (Bellec et al., 2009), or interpolate to introduce motion (Ardekani et al., 2001; Freire and Mangin, 2001; Pickens et al., 2005). Some of these are computationally demanding, requiring hours or days for a single simulation, others rely on existing fMRI subject data, and others are too simple or specialized to be of general use (see Sec. 4.1). An unintended consequence of a one-time simulation is the possibility that the simulation design may be biased to present a particular method favorably.

Here we introduce a simulation toolbox, SimTB, running under the crossplatform MATLAB environment (The Mathworks, Inc.). The toolbox allows for exible generation of fMRI datasets under a model of spatiotemporal separability1 and is designed to facilitate the testing of a variety of analytic methods. The spatiotemporal model is well-supported because similar spatiotemporal networks are inferred in real fMRI data whether using spatial or temporal independent component analysis (ICA) (Calhoun et al., 2009), general linear model (GLM), seed-based (Whitfield-Gabrieli et al., 2009), or clustering approaches (Van Den Heuvel et al., 2008). Users have full control over data generation including the creation and manipulation of spatial sources, implementation of block- and event-related experimental designs, inclusion of tissue-specific baselines, addition of head movement, and more. Importantly, all simulation features are controlled by a relatively small number of parameters, providing a level of abstraction that makes it possible to answer questions that would be impossible to address in real data. Because of its exibility, the toolbox has broad application for the testing of preprocessing strategies (e.g., motion correction, spatial/intensity normalization), univariate analyses (e.g., GLM), multivariate anlayses (e.g., ICA), and connectivity analyses (e.g., seed-based correlation). We have used SimTB to extensively explore subject variability within the group ICA framework (Allen et al., 2011a). Simulation parameters can be selected through a graphical user interface (GUI) or defined in a batch script called from the command line.

The rest of the manuscript continues as follows. In Section 2 we provide the exposition of the model generating the data. In Section 3 we demonstrate use of the toolbox with a simple simulation and some output from the toolbox. In Section 4 we discuss other functions of the toolbox and compare SimTB with other fMRI simulation software.

2. Materials and methods

2.1. Model

In SimTB, we adopt a data generation model consistent with spatiotemporal separability, that is, data can be expressed as the product of time courses (TCs) and SMs. A high-level owchart of the data generation is in Figure 1. Specifically, for each subject, i = 1, …, M, we assume there are up to C sources or components, each consisting of a spatial map (SM), activation TC, and an amplitude. The no-noise (nn) data is a linear combination of amplitude-scaled and baseline-shifted TC and SM sources,

YinnT·V=[RiT·Cdiag(gi)C·CSiC·V+JTVT·V]bi1·1JT1T·1u1·V, (1)

where Yinn is the time-by-voxel (T-by-V) no-noise data for subject i, Ri is a matrix of C column vectors of TCs, Si is a matrix of C row vectors of SMs, gi is a vector of C source amplitudes defined as a percent signal change of the baseline, bi is a baseline intensity scalar, u is a vector of voxel tissue type baseline modifiers, JTV is a T-by-V matrix of ones, and ⊙ denotes the Hadamard (element-wise) matrix product. These and other variables are further defined in the following sections, as well as the method for producing the final subject data.

Figure 1. Flowchart of data generation.

Figure 1

Values in small caps and circled indicate parameters to be specified. (A) Simulation dimension is determined by the number of subjects, time points (and seconds per time point), and voxels (representing a number of selected sources). (B) Time courses are the sum of task block, task event, and unique event time series modeled into a blood oxygen level-dependent (BOLD) TC and normalized. (C) Spatial maps are selected, translated, rotated, resized, and normalized. (D) The no-noise data combines the TCs and SMs scaled by component amplitudes, and scaled to a tissue type weighted baseline. (E) The final dataset includes motion and noise.

2.1.1. Spatial sources

A template of the 30 default SMs is shown in Figure 2A on a square image of V=V×V voxels, where side length V is specified by the user. SimTB has no requirements regarding the shape of the SMs (or TCs), and users can specify SMs using any 2-D function defined on x, y ∈ [−1, 1]. Default SMs are modeled after sources commonly seen in axial slices of real fMRI data and most are created by combinations of simple Gaussian distributions. For example, a source with a single region of activation can be defined as

Sic=exp([wxic{(xx0ic) cos(θic)(yy0ic) sin(θic)}]2)× exp([wyic{(xx0ic) sin(θic)+(yy0ic) cos(θic)}]2)

where location (x0ic, y0ic), orientation θic, and width (wxic, wyic) parameterize the activation. Users can vary the location (x0ic, y0ic) and orientation θic of activation blobs across subjects, i = 1, …, M, and sources, c = 1, …, C. The spatial extent of the whole source can also be varied with the “spread” parameter, ρic. SMs are normalized to have a maximum intensity of 1 and are transformed as Sic=Sic1/ρic, where ρic describes the expansion (ρic > 1) or contraction (ρic < 1) of the source and Sic is the modified SM for subject i. Finally, a little Gaussian noise distributed as 𝒩(0, 2.5 × 10−5) is added so that each subject SM is unique.

Figure 2.

Figure 2

(A) Configuration of default sources and (B) default tissue baseline. Sources are labeled and 27 of 30 have been selected for a simulation. Spatial maps are designed to represent sources observed in axial slices of real fMRI data.

We recommend specifying base parameters related to the dimensions and quality of data to fall within the range of values typically reported for whole-brain fMRI studies at conventional voxel sizes and field strengths. For the example simulation (AOD experiment) presented herein we use fewer voxels than are typically present in real data (~17,000 versus ~50,000 intracranial voxels) in order to keep dataset size and computational time manageable. Accordingly, we set the true number of sources to be proportional to the number of components that can be consistently and stably estimated in real data, which appears to be in the range of 70 to 100 (Kiviniemi et al., 2009; Ystad et al., 2010; Abou-Elseoud et al., 2010; Allen et al., 2011b). We note that the SimTB framework allows users to specify and define the shapes of any number of sources, making it exible and potentially suitable for a number of testing applications.

2.1.2. Time courses

Each source TC is T time points in length, where the user specifies the repetition time (TR) in seconds per sample. TCs are constructed under the assumption that source activations result from underlying neural events as well as noise. Neural events can follow block or event-related experimental designs, or can represent unexplained, random deviations from baseline. We refer to an underlying event time series as TS to distinguish it from the subsequent TC that is created with a hemodynamic model.

Experimental paradigms are designed with task blocks and task events which can be assigned to several sources and can be identical across subjects, while unique events refer to unexplained deviations that are unique to each source and subject. Each task block is described by a block length and an inter-stimulus interval. If multiple task blocks are defined, their order is pseudo-randomized so that each task block occurs roughly an equal number of times. Task events and unique events are defined by a probability of occurrence at each TR. For a given source, the TS is created by adding together amplitude-scaled task blocks, task events, and unique events. Amplitudes for task inputs can be negative or positive (indicating suppression or activation with the task), or can be zero (indicating that source activation does not follow the task).

Generating the fMRI blood oxygen level-dependent (BOLD)-like TCs from the event TS may be done in several ways, including linear convolution with a canonical hemodynamic response function (HRF) (difference of two gamma functions) (Friston et al., 1995) and the Windkessel balloon model (Buxton and Frank, 1997; Buxton et al., 1998; Mandeville et al., 1999; Friston et al., 2000). Users may vary hemodynamic parameters between sources and subjects, and define their own TC source models. After creation of the TCs, each source TC is scaled to have a peak-to-peak range of one. As with the SMs, Gaussian noise distributed as 𝒩(0, 2.5 × 10−5) is added to ensure non-zero TCs.

2.1.3. Baseline intensity

A baseline intensity, bi, is specified for each subject. An optional tissue-type modifier ui scales the baseline for each voxel. Tissue types with corresponding intensity levels, ω are assigned to each source. Tissue types are assigned to each source and voxel intensity levels are then determined by

ui=JTV+c=1C(ωc1)|Sic|.

For example, Figure 2B displays the default baseline intensity map where four tissue types are defined: sinus signal dropout (ω6 = 0.3), cerebrospinal uid (CSF) (ω14,15 = 1.5), white matter (ω16,17 = 0.7) and gray matter (ωc = 1 for all other sources). By default, bi = 800, thus the intensity map ranges from 0.3 × 800 = 240 in areas with signal dropout to 1.5 × 800 = 1200 in CSF.

2.1.4. Dataset, putting it all together

SMs and TCs are scaled according to the source amplitudes, gi, which are specified in terms of peak-to-peak percent signal change relative to the local baseline. Source features are linearly combined to form the no-noise dataset as in equation 1. Motion may then be added by specifying the maximum x- and y-translation and rotation for each subject. Rotation and translation timeseries are generated via an autoregressive process of model order 1, AR(1), with parameter 0.95, which is a relatively smooth, but effectively bounded, random walk. This model assumes that the head moves randomly between time points but tends toward a central position more than extremes. The dataset is transformed accordingly by linear interpolation at each time point. Datasets are padded with an enlarged bounding box to accommodate translation. Incorporating more complex motion patterns based on the estimated motion parameters from real data will be in upcoming software versions.

To construct the noisy subject data matrix, Yi, we add Rician noise to the data relative to a specified contrast-to-noise ratio (CNR) (Gudbjartsson and Patz, 1995). We define CNR as σ̂s/σ̂n, where σ̂s is the temporal standard deviation of the true signal and σ̂n is the temporal standard deviation of the noise. We calculate σ̂s as the 30% trimmed mean of the standard deviations of the no-noise voxel timeseries (i.e., columns of Yinn). The CNR level for each subject is chosen by the user, and the noise standard deviation is then calculated as σ̂n = σ̂s/CNR. Each element of Yi is then Yitv=(Yitvnn+ν1tv)2+ν2tv2, t = 1, …, T and v = 1, …, V, where ν1tv and ν2tv are distributed as 𝒩(0,σ^n2).

The final dataset, Yi, is saved to disk in .mat or .nii format along with the simulation parameters, the true SMs and TCs, and the motion parameters. When saved in a standard imaging format (.nii), simulated datasets can be used with pre-processing or analysis streams developed in any of the standard functional imaging analysis packages (e.g., SPM2, AFNI3, and FSL4).

3. Results

3.1. Auditory Oddball paradigm

To demonstrate use of SimTB, we simulate data consistent with an auditory oddball (AOD) experiment, which consists of detecting an infrequent target sound within a series of standard sounds and different novel sounds (Kiehl et al., 2001). In this event-related paradigm, three stimuli are presented to each participant in random order. The standard stimulus is a baseline tone that occurs frequently, the target stimulus is a distinct tone that subjects should press a button upon hearing, and the novel stimulus is a random digital noise. We model distinct effects of standard, target, and novel tones on the BOLD signals of different sources. In addition, we adjust the baseline intensity and temporal properties of sources to match the statistical moments found in real data.

3.2. Simulation parameters

Below, we describe the simulation parameters and their implementation. Simulation parameters can be selected through the GUI. Alternatively, parameters can be defined in a batch script that is called at the command line. The batch script for the AOD experiment described here is distributed with SimTB. With either method, the final step in creating the parameter structure involves a validation procedure where parameters are checked for consistency with each other and for appropriate values overall. This step helps users to avoid mistakes and conflicts in simulation design.

We simulate M = 5 subjects, each with up to C = 27 sources in a dataset with V = 148 × 148 voxels and T = 150 time points collected at TR = 2 seconds. Selected sources are those filled in Figure 2A. Some sources are not “of interest” and are present with probability 0.9 (S2, S3, S9, S11, S12, S19, S20, S21, S25, and S26), that is, some sources may be absent for each subject. A built-in display function can be used to visualize parameter values. For example, Figure 3A shows the component presence for all subjects and sources. Black squares indicated the few sources that are absent from the datasets of individual subjects.

Figure 3. Selected parameters for all subjects and output for Subject 1.

Figure 3

(A) Source presence/absence for each subject and source. For components not “of interest”, presence was random with 0.9 probability of being present. (B) Amplitudes for task events and (C) unique events for each subject and source. The amplitude for unique events are reduced for task event-related sources. (D) For Subject 1, Source 27, the task event and unique event TCs are combined, modeled into a BOLD signal, and scaled to have standard deviation one, then a little Gaussian noise is added. (E) Summary of the final data for subject 1 including TCs and SMs, with correlations. (F) TC correlation between AUD Sources 27 and 28 is high with other sources having task events.

To mimic between-subject spatial variability, the sources for each subject will be given a small amount of translation, rotation, and spread via normal deviates. Translation in the horizontal and vertical directions of each source have a standard deviation of 0.1 voxels, rotation has a standard deviation of 1 degree, and spread has a center of 1 and standard deviation of 0.03.

To define the TCs, we use task events and unique events. We define four task event types which occur in a random order for each subject. In addition to the three task event types mentioned earlier (1=standards, 2=targets, and 3=novels), we include a spike event (4) that occurs rarely (probability of occurrence at each TR is 0.05) and is assigned only to CSF sources to match the TC skewness of CSF seen in real data. Task-related stimuli occur at each TR with probability 0.6, 0.075, and 0.075, respectively (8:1:1 ratio), thus no auditory stimulus occurs on a quarter of the TRs (see experiment events in Figure 3D).

Sources are separately modulated by each event type (Fig. 3B). As can be seen in Figure 3B), auditory sources (AUD) are modulated strongly by novel, target and standard tones, whereas motor sources (MOT) are mostly modulated by targets (to which subjects should respond), the precuneus (P) is only modulated by targets, and the hippocampus (HIP) is only modulated by novels. Note that the default mode network (DMN) component is negatively modulated (decreases activity) to all stimuli.

All sources have unique events that occur with a probability of 0.2 at each TR (see unique events in Figure 3D). Unique event amplitudes are displayed in Figure 3C. For sources not of interest (no task modulation), the unique event amplitude is 1. For task-modulated sources, unique events are added with small amplitudes (0.2 to 0.5) so that sources responding to the same events have similar but not identical activation. CSF sources have smaller unique events (amplitude of 0.05).

From the event time series, TCs are generated using convolution with a canonical HRF. In general, each source may be generated with a different set of hemodynamic parameters. Here, we use the same set of parameters for all but four sources. The two frontal sources (FR: S4 and S5) have a 1 second onset delay relative to other sources (Fig 4). The two CSF sources (CSF: S14 and S15) use a spike model which has much faster dynamics than the canonical HRF (e.g., peak at about 3 seconds rather than about 6 seconds). Figure 3D shows the construction of the TC for auditory Source 27 for Subject 1, as generated with a display function provided in SimTB. Here the relative frequencies and amplitudes of Standard, Novel, and Target events is evident. Unique events for auditory S27 are common but of low amplitude. Note the lack of CSF spike events since they only appear for the CSF sources.

Figure 4. Cross-correlation of selected components for Subject 1.

Figure 4

S27 and S28 are highly correlated. S4 has a one-second HRF delay relative to other components, so peak cross-correlation is between 0 and 1 TR lag. S8 is negatively correlated with the task.

A summary of all the TCs and SMs including their pairwise correlations for Subject 1 is shown in Figure 3E. Here, we see the relationships between simulation parameters and resulting SMs and TCs. For instance, the absence of S12 in Subject 1 in Figure 3A is reected in the at TC and blank SM in Figure 3E. Note that sources not modulated by the task, such as S1 (see Figure 3B) still have activation TCs as specified by convolution with unique events. We can also see the distinct “spike-like” behavior of CSF TCs (S12 and S13) which is created by a hemodynamic model with faster dynamics, rare positive events, and low noise (unique events). Correlation plots at the bottom of Figure 3E reflect the similarity between the TCs due to shared task events and SMs due to spatial overlap. For example, Figure 3F highlights the two TC correlation rows for AUD sources (S27 and S28). High correlation values with other sources (e.g., S4, S5, S18, etc.) reflect the shared task events in Figure 3B.

The baseline intensity for all subjects is bi = 800, though in general this value can be different for each subject. For all “gray matter” components, percent signal change is centered at gic = 3 with a standard deviation of 0.25. To approximate the statistical moments of real data, the tissue-type modifier u is increased for CSF (S14 and S15 are 1.2 times larger), decreased for white matter (WM S16 and S17 are 0.8 times as large), and increased for the frontal component (S6 is 1.15 times larger). The resulting dataset baseline is displayed in Figure 5A. With component activations as specified above, we are able to match the statistical moments in real data quite well (Figure 5B).

Figure 5.

Figure 5

(A) Tissue type map used for simulation. Note, compared to the default (Fig. 2B), WM is 0.8 (up from 0.7), CSF is 1.2 (down from 1.5), and frontal is 1.15 (up from 0.3). (B) First four statistical moments in real data averaged over 603 healthy normal controls over T = 148 time points (Allen et al., 2011b), as well as simulated data for 5 subjects with T = 150 time points. Except for real-data edge effects from warping and movement, the first four central statistical moments (mean, variance, skewness, and kurtosis) can be closely matched with simulated data.

Rician noise is added to the data of each subject to reach the desired CNR level. Here, CNR is uniformly distributed over subjects from 0.65 to 2, reflecting the range of typical CNR values for BOLD contrast at conventional voxel sizes and field strengths. In addition, we simulate head motion with a maximum translation of 0.02 of the image length, and a maximum rotation of 5 degrees. Note that the extent of motion for each subject can also be controlled. As an example, we designate Subject 1 to move up to 0.5 of the maximum. Resulting motion parameters for Subjects 1 and 2 are shown in Figure 6, where greater rotational and translational motion can be seen for Subject 2.

Figure 6. Motion for Subjects 1 and 2.

Figure 6

Parameters for x-translation, y-translation, and rotation are shown, where Subject 1 has half the motion of Subject 2.

4. Discussion

In the above example, we showed that by specifying relatively few parameters, one can generate datasets that model distributed and differential activation to numerous stimuli, approximate statistics found in real fMRI data, and incorporate inter-subject variability. To our knowledge, there is no other available simulation framework that allows for such flexible and easy data generation and we envision that SimTB will be useful in a variety of applications. For example, in a recent study (Allen et al., 2011a) we use SimTB extensively to consider performance of group ICA methods under various forms of inter-subject variability. Other applications might include comparisons of methods to estimate effective connectivity, which can be simulated by setting the onset time of the HRF (or other BOLD model) to delay between selected sources (see Fig 4). A whole-brain spatial source is included for those interested in the effect of global mean regression (Murphy et al., 2009). SimTB can also be useful for assessing preprocessing techniques, such as estimating motion. Data normalization methods in a group setting is possible, such as voxelwise or datawise z-scoring or intensity normalization because baseline intensity and contrast-to-noise ratios are subject specific (Allen et al., 2010). Smoothing methods may also be compared, for example, by defining small spatial sources aligned differently between subjects and imposing different smoothing strategies. Coregistration may be compared by first modifying each subject’s structural map by varying the location spatial sources with different tissue types, then further by post-hoc applying a warping/alignment to the simulated data. SimTB is useful for almost any analysis where testing on a simulation first is important with strict control of input parameters.

There are a number of ways to introduce further realism into the analyses. By capitalizing on the flexibility of the toolbox and the large number of customizable parameters, users may incorporate increasingly realistic sources of noise/artifacts using their own models in order to create datasets that are most appropriate for testing particular methods. For example, physiological noise may be simulated by adding a spatially diffuse source (such as the whole-brain source S1) and time course consistent with the spectral content of physiological noise. Susceptibility artifacts might be simulated by defining regions with altered signal intensity (this is controlled via the tissue type parameter) as well as specifying altered percent signal change in these regions. Furthermore, 2D maps may be approximated from 3D networks in real data, though many decomposition methods make no distinction5. For example, if the shape of the spatial map is in consideration, then a simple strategy would be to model a 2D map from a slice of the 3D map (essentially this has already been done for many of the default sources). If, instead of shape, the distributional properties (e.g., kurtosis) are of consideration, then one might calculate the moments of a 3D map, then design a 2D source to approximate those statistics.

The toolbox includes a number of other functions to help with tasks related to simulation and analysis. There are multiple ways to visualize the parameters (Fig. 3), tissue maps (Fig. 2B), task-modulated TCs, and many of these are available “on-the-fly” as the simulation is being created so it can be stopped and tweaked if needed. After simulation, a movie of the dataset is available to assess differences in activation and motion over time. Contour maps of SMs across subjects can be viewed. For analysis, a function is included for automatic source matching between the simulated and estimated SMs derived from an analysis.

4.1. Comparing SimTB with other fMRI simulation tools

Because SimTB captures the fundamental structure of the data in a simple way and controls every aspect of the simulation (not derived from real data), while ignoring the additional realism that is often not essential to understand an estimation method, precise comparisons are possible. SimTB simulates datasets quickly (3–10 seconds for each subject) and is much faster compared to physics-oriented simulated scanner for understanding MRI (POSSUM), part of FSL (FMRIB’s software library), allowing the practitioner to simulate and analyze many datasets to potentially understand a problem from many angles. However, POSSUM uses concepts of MR physics to help understand effects of specified scanning parameters, including tools for pulse sequence generation, slice profile, B0 field inhomogeneities, signal generation, noise addition, motion, and image reconstruction (Drobnjak et al., 2006). SimTB simulates data entirely from a model and defined parameters, whereas some methods require existing fMRI data to simulate fMR images, such as POSSUM, bootstrap resampling of real data (Bellec et al., 2009), and the METU-fMRISim toolbox which requires a structural MRI anatomic brain volume (Türkay, 2009). As an aside, efMRI software (Rorden, 2009) can be used in conjunction with SimTB to optimize statistical efficiency of an experimental design (Friston et al., 1999).

5. Conclusion

SimTB is well-suited for a variety of applications. Though the model makes assumptions (spatiotemporal separability), the simulated data captures the fundamental structure of fMRI data. Importantly, the model does so without relying on real datasets, which have unknown features that cannot be completely modeled.

Acknowledgements

This research was supported by NIH 1R01-EB006841, NIH 1R01-EB005846, NIH 2R01-EB000840, NIH 1 P20 RR021938-01, and DOE DE-FG02-08ER64581 (PI: Calhoun). We thank Eswar Damaraju, Martin Havlicek, and Arvind Caprihan for providing comments that improved the toolbox and manuscript.

Footnotes

1

A process has the property of spatiotemporal separability if it can be factored into the product of spatial processes and temporal processes, p(s, t) = p(s)p(t); there is no constraint on the individual spatial processes or temporal processes. Some methods having a spatiotemporal separability structure include independent component analysis (ICA), the general linear model (GLM), canonical component analysis (CCA), seed-based analysis (a form of GLM), and independent vector analysis (IVA).

2

Statistical Parametric Mapping, http://www.fil.ion.ucl.ac.uk/spm

3

Analysis of Functional NeuroImages, http://afni.nimh.nih.gov/afni

4

FMRIB Software Library, http://www.fmrib.ox.ac.uk/fsl

5

Decomposition techniques that do not necessarily use the data dimension, and therefore do not distinguish between 2D and 3D data, include ICA, principal components analysis (PCA), independent vector analysis (IVA), clustering, GLM, partial least squares (PLS), likelihood-based methods, and the latent variable analysis (LVA) umbrella including tensor and sparse decomposition methods.

Conflicts of interest. The authors declare no competing financial interests in relation to the work presented.

References

  1. Abou-Elseoud A, Starck T, Remes J, Nikkinen J, Tervonen O, Kiviniemi V. The effect of model order selection in group PICA. Human Brain Mapping. 2010;31(8):1207–1216. doi: 10.1002/hbm.20929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Allen E, Erhardt E, Eichele T, Mayer AR, Calhoun VD. Comparison of pre-normalization methods on the accuracy of group ICA results. Organization for Human Brain Mapping. 2010 Jun [Google Scholar]
  3. Allen E, Erhardt E, Wei Y, Eichele T, Calhoun V. Capturing inter-subject variability with group independent component analysis of fMRI data: a simulation study. NeuroImage. 2011a doi: 10.1016/j.neuroimage.2011.10.010. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Allen EA, Erhardt EB, Damaraju E, Gruner W, Segall JM, Silva RF, Havlicek M, Rachakonda S, Fries J, Kalyanam R, Michael AM, Caprihan A, Turner JA, Eichele T, Adelsheim S, Bryan AD, Bustillo J, Clark VP, Ewing SWF, Filbey F, Ford CC, Hutchison K, Jung RE, Kiehl KA, Kodituwakku P, Komesu YM, Mayer AR, Pearlson GD, Phillips JP, Sadek JR, Stevens M, Teuscher U, Thoma RJ, Calhoun VD. A baseline for the multivariate comparison of resting state networks. Frontiers in Systems Neuroscience. 2011b;5:12. doi: 10.3389/fnsys.2011.00002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ardekani B, Bachman A, Helpern J. A quantitative comparison of motion detection algorithms in fMRI. Magnetic Resonance Imaging. 2001;19(7):959–963. doi: 10.1016/s0730-725x(01)00418-0. [DOI] [PubMed] [Google Scholar]
  6. Baumgartner R, Ryner L, Richter W, Summers R, Jarmasz M, Somorjai R. Comparison of two exploratory data analysis methods for fMRI: fuzzy clustering vs. principal component analysis. Magnetic Resonance Imaging. 2000;18(1):89–94. doi: 10.1016/s0730-725x(99)00102-2. [DOI] [PubMed] [Google Scholar]
  7. Baumgartner R, Windischberger C, Moser E. Quantification in functional magnetic resonance imaging: fuzzy clustering vs. correlation analysis. Magnetic Resonance Imaging. 1998;16(2):115–125. doi: 10.1016/s0730-725x(97)00277-4. [DOI] [PubMed] [Google Scholar]
  8. Beckmann C, Smith S. Probabilistic independent component analysis for functional magnetic resonance imaging. IEEE Transactions on Medical Imaging. 2004;23(2):137–152. doi: 10.1109/TMI.2003.822821. [DOI] [PubMed] [Google Scholar]
  9. Bellec P, Perlbarg V, Evans AC. Bootstrap generation and evaluation of an fmri simulation database. Magnetic Resonance Imaging. 2009;27(10):1382–1396. doi: 10.1016/j.mri.2009.05.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bellec P, Perlbarg V, Jbabdi S, Pélégrini-Issac M, Anton J, Doyon J, Benali H. Identification of large-scale networks in the brain using fMRI. NeuroImage. 2006;29(4):1231–1243. doi: 10.1016/j.neuroimage.2005.08.044. [DOI] [PubMed] [Google Scholar]
  11. Bullmore E, Long C, Suckling J, Fadili J, Calvert G, Zelaya F, Carpenter T, Brammer M. Colored noise and computational inference in neurophysiological (fMRI) time series analysis: resampling methods in time and wavelet domains. Human Brain Mapping. 2001;12(2):61–78. doi: 10.1002/1097-0193(200102)12:2&#x0003c;61::AID-HBM1004&#x0003e;3.0.CO;2-W. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Buxton R, Frank L. A model for the coupling between cerebral blood flow and oxygen metabolism during neural stimulation. Journal of Cerebral Blood Flow & Metabolism. 1997;17(1):64–72. doi: 10.1097/00004647-199701000-00009. [DOI] [PubMed] [Google Scholar]
  13. Buxton R, Wong E, Frank L. Dynamics of blood flow and oxygenation changes during brain activation: the balloon model. Magnetic Resonance in Medicine. 1998;39(6):855–864. doi: 10.1002/mrm.1910390602. [DOI] [PubMed] [Google Scholar]
  14. Calhoun VD, Liu J, AdalI T. A review of group ICA for fMRI data and ICA for joint inference of imaging, genetic, and ERP data. NeuroImage. 2009;45(1):S163–S172. doi: 10.1016/j.neuroimage.2008.10.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Correa N, Adali T, Calhoun V. Performance of blind source separation algorithms for fMRI analysis using a group ICA method. Magnetic Resonance Imaging. 2007;25(5):684–694. doi: 10.1016/j.mri.2006.10.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Della-Maggiore V, Chau W, Peres-Neto P, McIntosh A. An empirical comparison of SPM preprocessing parameters to the analysis of fMRI data. NeuroImage. 2002;17(1):19–28. doi: 10.1006/nimg.2002.1113. [DOI] [PubMed] [Google Scholar]
  17. Dimitriadou E, Barth M, Windischberger C, Hornik K, Moser E. A quantitative comparison of functional MRI cluster analysis. Artificial Intelligence in Medicine. 2004;31(1):57–71. doi: 10.1016/j.artmed.2004.01.010. [DOI] [PubMed] [Google Scholar]
  18. Drobnjak I, Gavaghan D, Süli E, Pitt-Francis J, Jenkinson M. Development of a functional magnetic resonance imaging simulator for modeling realistic rigid-body motion artifacts. Magnetic Resonance in Medicine. 2006;56(2):364–380. doi: 10.1002/mrm.20939. [DOI] [PubMed] [Google Scholar]
  19. Erhardt EB, Rachakonda S, Bedrick E, Allen EA, Adali T, Calhoun VD. Comparison of multi-subject ICA methods for analysis of fMRI data. Human Brain Mapping. 2011 Dec;32(12):2075–2095. doi: 10.1002/hbm.21170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Esposito F, Goebel R. Extracting functional networks with spatial independent component analysis: the role of dimensionality, reliability and aggregation scheme. Current Opinion in Neurology. 2011;24(4):378. doi: 10.1097/WCO.0b013e32834897a5. [DOI] [PubMed] [Google Scholar]
  21. Freire L, Mangin J. Motion correction algorithms may create spurious brain activations in the absence of subject motion. NeuroImage. 2001;14(3):709–722. doi: 10.1006/nimg.2001.0869. [DOI] [PubMed] [Google Scholar]
  22. Friston K, Mechelli A, Turner R, Price C. Nonlinear responses in fMRI: the Balloon model, Volterra kernels, and other hemodynamics. NeuroImage. 2000;12(4):466–477. doi: 10.1006/nimg.2000.0630. [DOI] [PubMed] [Google Scholar]
  23. Friston K, Zarahn E, Josephs O, Henson R, Dale A. Stochastic designs in event-related fMRI. NeuroImage. 1999;10(5):607–619. doi: 10.1006/nimg.1999.0498. [DOI] [PubMed] [Google Scholar]
  24. Friston KJ, Holmes AP, Worsley KJ, Poline JB, Frith CD, Frackowiak RSJ, et al. Statistical parametric maps in functional imaging: a general linear approach. Human Brain Mapping. 1995;2(4):189–210. [Google Scholar]
  25. Gudbjartsson H, Patz S. The Rician distribution of noisy MRI data. Magnetic Resonance in Medicine. 1995;34(6):910–914. doi: 10.1002/mrm.1910340618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Jahanian H, Hossein-Zadeh G, Soltanian-Zadeh H, Ardekani B. Controlling the false positive rate in fuzzy clustering using randomization: Application to fMRI activation detection. Magnetic Resonance Imaging. 2004;22(5):631–638. doi: 10.1016/j.mri.2004.01.035. [DOI] [PubMed] [Google Scholar]
  27. Kiehl K, Laurens K, Duty T, Forster B, Liddle P. An event-related fMRI study of visual and auditory oddball tasks. Journal of Psychophysiology. 2001;15(4):221–240. [PubMed] [Google Scholar]
  28. Kim B, Yeo D, Bhagalia R. Comprehensive mathematical simulation of functional magnetic resonance imaging time series including motion-related image distortion and spin saturation effect. Magnetic Resonance Imaging. 2008;26(2):147. doi: 10.1016/j.mri.2007.05.007. [DOI] [PubMed] [Google Scholar]
  29. Kiviniemi V, Starck T, Remes J, Long X, Nikkinen J, Haapea M, Veijola J, Moilanen I, Isohanni M, Zang YF, Tervonen O. Functional segmentation of the brain cortex using high model order group PICA. Human Brain Mapping. 2009;30(12):3865–3886. doi: 10.1002/hbm.20813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. LeVan P, Gotman J. Independent component analysis as a model-free approach for the detection of bold changes related to epileptic spikes: A simulation study. Human brain mapping. 2009;30(7):2021–2031. doi: 10.1002/hbm.20647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Li Y, Adalı T, Calhoun V. Estimating the number of independent components for functional magnetic resonance imaging data. Human Brain Mapping. 2007;28(11):1251–1266. doi: 10.1002/hbm.20359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Logan B, Rowe D. An evaluation of thresholding techniques in fMRI analysis. NeuroImage. 2004;22(1):95–108. doi: 10.1016/j.neuroimage.2003.12.047. [DOI] [PubMed] [Google Scholar]
  33. Lu Y, Jiang T, Zang Y. Region growing method for the analysis of functional MRI data. NeuroImage. 2003;20(1):455–465. doi: 10.1016/s1053-8119(03)00352-5. [DOI] [PubMed] [Google Scholar]
  34. Ma L, Wang B, Chen X, Xiong J. Detecting functional connectivity in the resting brain: a comparison between ICA and CCA. Magnetic Resonance Imaging. 2007;25(1):47–56. doi: 10.1016/j.mri.2006.09.032. [DOI] [PubMed] [Google Scholar]
  35. Mandeville J, Marota J, Ayata C, Zaharchuk G, Moskowitz M, Rosen B, Weisskoff R. Evidence of a cerebrovascular postarteriole Windkessel with delayed compliance. Journal of Cerebral Blood Flow & Metabolism. 1999;19(6):679–689. doi: 10.1097/00004647-199906000-00012. [DOI] [PubMed] [Google Scholar]
  36. Marchini J, Presanis A. Comparing methods of analyzing fMRI statistical parametric maps. NeuroImage. 2004;22(3):1203–1213. doi: 10.1016/j.neuroimage.2004.03.030. [DOI] [PubMed] [Google Scholar]
  37. Moosmann M, Eichele T, Nordby H, Hugdahl K, Calhoun V. Joint independent component analysis for simultaneous EEG-fMRI: principle and simulation. International Journal of Psychophysiology. 2008;67(3):212–221. doi: 10.1016/j.ijpsycho.2007.05.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Morgan V, Dawant B, Li Y, Pickens D. Comparison of fMRI statistical software packages and strategies for analysis of images containing random and stimulus-correlated motion. Computerized medical imaging and graphics: the official journal of the Computerized Medical Imaging Society. 2007;31(6):436. doi: 10.1016/j.compmedimag.2007.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Murphy K, Birn R, Handwerker D, Jones T, Bandettini P. The impact of global signal regression on resting state correlations: Are anti-correlated networks introduced? NeuroImage. 2009;44(3):893–905. doi: 10.1016/j.neuroimage.2008.09.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Pickens D, Li Y, Morgan V, Dawant B. Development of computer-generated phantoms for fMRI software evaluation. Magnetic Resonance Imaging. 2005;23(5):653–663. doi: 10.1016/j.mri.2005.04.007. [DOI] [PubMed] [Google Scholar]
  41. Rorden C. efMRI version 9 sept 2009. 2009 http://www.cabiatl.com/CABI/resources/fmrisim/
  42. Schmithorst V, Holland S. Comparison of three methods for generating group statistical inferences from independent component analysis of functional magnetic resonance imaging data. Journal of Magnetic Resonance Imaging. 2004;19(3):365–368. doi: 10.1002/jmri.20009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Türkay KD. Ph.D. thesis. Middle East Technical University; 2009. Dec, Simulated fMRI toolbox. URL http://etd.lib.metu.edu.tr/upload/12611465/index.pdf. [Google Scholar]
  44. Van Den Heuvel M, Mandl R, Hulshoff Pol H. Normalized cut group clustering of resting-state fMRI data. PLoS One. 2008;3(4):e2001. doi: 10.1371/journal.pone.0002001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Whitfield-Gabrieli S, Thermenos H, Milanovic S, Tsuang M, Faraone S, McCarley R, Shenton M, Green A, Nieto-Castanon A, LaViolette P, Wojcik J, E GJD, Seidman LJ. Hyperactivity and hyperconnectivity of the default network in schizophrenia and in first-degree relatives of persons with schizophrenia. Proceedings of the National Academy of Sciences. 2009;106(4):1279–1284. doi: 10.1073/pnas.0809141106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Worsley K, Chen J, Lerch J, Evans A. Comparing functional connectivity via thresholding correlations and singular value decomposition. Philosophical Transactions of the Royal Society B: Biological Sciences. 2005;360(1457):913. doi: 10.1098/rstb.2005.1637. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Xu Y, Wu G, Rowe D, Ma Y, Zhang R, Xu G, Li S. COmplex-Model-Based Estimation of thermal noise for fMRI data in the presence of artifacts. Magnetic Resonance Imaging. 2007;25(7):1079–1088. doi: 10.1016/j.mri.2006.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Yourganov G, Chen X, Lukic A, Grady C, Small S, Wernick M, Strother S. Dimensionality estimation for optimal detection of functional networks in BOLD fMRI data. NeuroImage. 2011;56(2):531–543. doi: 10.1016/j.neuroimage.2010.09.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Ystad M, Eichele T, Lundervold AJ, Lundervold A. Subcortical functional connectivity and verbal episodic memory in healthy elderly-a resting state fMRI study. NeuroImage. 2010;52(1):379–388. doi: 10.1016/j.neuroimage.2010.03.062. [DOI] [PubMed] [Google Scholar]

RESOURCES