Skip to main content
PLOS One logoLink to PLOS One
. 2011 Dec 12;6(12):e27594. doi: 10.1371/journal.pone.0027594

A Simple and Objective Method for Reproducible Resting State Network (RSN) Detection in fMRI

Gautam V Pendse 1,*, David Borsook 1,2, Lino Becerra 1,2
Editor: Wang Zhan3
PMCID: PMC3236249  PMID: 22174743

Abstract

Spatial Independent Component Analysis (ICA) decomposes the time by space functional MRI (fMRI) matrix into a set of 1-D basis time courses and their associated 3-D spatial maps that are optimized for mutual independence. When applied to resting state fMRI (rsfMRI), ICA produces several spatial independent components (ICs) that seem to have biological relevance - the so-called resting state networks (RSNs). The ICA problem is well posed when the true data generating process follows a linear mixture of ICs model in terms of the identifiability of the mixing matrix. However, the contrast function used for promoting mutual independence in ICA is dependent on the finite amount of observed data and is potentially non-convex with multiple local minima. Hence, each run of ICA could produce potentially different IC estimates even for the same data. One technique to deal with this run-to-run variability of ICA was proposed by [1] in their algorithm RAICAR which allows for the selection of only those ICs that have a high run-to-run reproducibility. We propose an enhancement to the original RAICAR algorithm that enables us to assign reproducibility Inline graphic-values to each IC and allows for an objective assessment of both within subject and across subjects reproducibility. We call the resulting algorithm RAICAR-N (N stands for null hypothesis test), and we have applied it to publicly available human rsfMRI data (http://www.nitrc.org). Our reproducibility analyses indicated that many of the published RSNs in rsfMRI literature are highly reproducible. However, we found several other RSNs that are highly reproducible but not frequently listed in the literature.

Introduction

Independent component analysis (ICA) [2][5] models the observed data as a linear combination of a set of statistically independent and unobservable sources [6]. first proposed the application of ICA to the analysis of functional magnetic resonance imaging (fMRI) data. Subsequently, ICA has been applied to fMRI both as an exploratory tool for the purpose of identifying task related components [6] as well as a signal clean up tool for the purpose of removing artifacts from the fMRI data [7]. Recently, it has been shown that ICA applied to resting state fMRI (rsfMRI) in healthy subjects reveals a set of biologically meaningful spatial maps of independent components (ICs) that are consistent across subjects - the so called resting state networks (RSNs) [8]. Hence, there is a considerable interest in applying ICA to rsfMRI data in order to define the set of RSNs that characterize a particular group of human subjects, a disease, or a pharmacological effect.

Several variants of the linear ICA model have been applied to fMRI data including square ICA (with equal number of sources and sensors) [9], non-square ICA (with more sensors than sources) [6], and non-square ICA with additive Gaussian noise (noisy ICA) [10]. All of these models are well known in the ICA literature [2], [3], [5], [11]. Since the other ICA models are specializations of the noisy ICA model, we will assume a noisy ICA model henceforth.

Remarkably, the ICA estimation problem is well posed in terms of the identifiability of the mixing matrix given several non-Gaussian and at most 1 Gaussian source in the overall linear mixture [3], [12][14]. In the presence of more than 1 Gaussian source, such as in noisy ICA, the mixing matrix corresponding to the non-Gaussian part of the linear mixture is identifiable (upto permutation and scaling). In addition, the source distributions are uniquely identifiable (upto permutation and scaling) given a noisy ICA model with a particular Gaussian co-variance structure, for example, the isotropic diagonal co-variance. For details, see section 2.1.2.

While these uniqueness results are reassuring, a number of practical difficulties prevent the reliable estimation of ICs on real data. These difficulties include (1) true data not describable by an ICA model, (2) ICA contrast function approximations, (3) multiple local minima in the ICA contrast function, (4) confounding Gaussian noise and (5) model order overestimation. See section 2.1.3 for more details. A consequence of these difficulties is that multiple ICA runs on the same data or different subsets of the data produce different estimates of the IC realizations.

One technique to account for this run-to-run variability in ICA was proposed by [15] in their algorithm ICASSO. Using repeated runs of ICA with bootstrapped data using various initial conditions, ICASSO clusters ICs across ICA runs using agglomerative hierarchical clustering and also helps in visualizing the estimated ICs. The logic is that reliable ICs will show up in almost all ICA runs and thus will form a tight cluster well separated from the rest. [16] proposed a technique similar to ICASSO called self-organizing group ICA (sogICA) which allows for clustering of ICs via hierarchical clustering in across subject ICA runs. When applied to multiple ICA runs across subjects, ICASSO does not restrict the IC clusters to contain only 1 IC from each subject per ICA run. In contrast, sogICA allows the user to select the minimum number of subjects for a “group representative” IC cluster containing distinct subjects. By labelling each ICA run as a different “subject” sogICA can also be applied to analyze multiple ICA runs across subjects.

Similar in spirit to ICASSO and sogICA [1], proposed an intuitive approach called RAICAR (Ranking and Averaging Independent Component Analysis by Reproducibility) for reproducibility analysis of estimated ICs. The basic idea in RAICAR is to select only those ICs as “interesting” or “stable” which show a high run-to-run “reproducibility”. RAICAR uses simple and automated spatial cross-correlation matrix based IC alignment, which has been shown to be more accurate compared to ICASSO [1]. RAICAR is applicable to both within subject as well as across subjects reproducibility analysis.

A few limitations of ICASSO, sogICA and RAICAR are worth noting:

  • ICASSO requires the user to select the number of IC clusters and is inapplicable without modification for across subjects analysis of ICA runs since the IC clusters are not restricted to contain only 1 IC per ICA run.

  • sogICA requires the user to select the minimum number of subjects for a “group representative” cluster and also a cutoff on within cluster distances.

  • RAICAR uses an arbitrary threshold on the reproducibility indices selected “by eye” or set at an arbitrary value, such as Inline graphic of the maximum reproducibility value.

We propose a simple extension to RAICAR that avoids subjective user decisions and allows for an automatic reproducibility cutoff. The reproducibility indices calculated in RAICAR differ in magnitude significantly depending on whether the input to RAICAR:

  • (a) is generated using multiple ICA runs on the same data

  • (b) comes from multiple ICA runs on varying data sets (e.g. between and across subject runs)

See Figure 1 for an illustration of this effect. Obviously, the reproducibility indices are much lower in case (b) since we account for both within subject and between subjects variability in estimating ICs. Case (b) is also of great interest from a practical point of view since we are often interested in making statements about a group of subjects. Hence, it is clear that a cutoff on RAICAR reproducibility values for the purposes of selecting the “highly reproducible” components should be data dependent. In this work,

Figure 1. Figure illustrates the variation in normalized reproducibility from RAICAR depending on whether the input to RAICAR is (a) Multiple ICA runs on single subject data or (b) Multiple ICA runs across subjects.

Figure 1

Notice that the normalized reproducibility is much lower for across subjects analysis compared to within subject analysis.

  1. We propose a modification of the original RAICAR algorithm by introducing an explicit “null” model of no reproducibility.

  2. We use this “null” model to automatically generate Inline graphic-values for each IC via simulation. This allows for an objective cutoff specification for extracting reproducible ICs (e.g. reproducible at Inline graphic) within and across subjects. We call the resulting algorithm RAICAR-N (N stands for “null” hypothesis test).

  3. We validate RAICAR-N by applying it to publicly available human rsfMRI data.

1.1 Notation

  • The set of real numbers will be denoted by Inline graphic. Scalars variables and functions will be denoted in a non-bold font (e.g., Inline graphic or Inline graphic). Vectors will be denoted in a bold font (except Greek letters) using lower case letters (e.g., Inline graphic). Matrices will be denoted in bold font using upper case letters (e.g., Inline graphic). The transpose of a matrix Inline graphic will be denoted by Inline graphic and its inverse will be denoted by Inline graphic. Inline graphic will denote the Inline graphic identity matrix and Inline graphic will denote a vector or matrix of all zeros whose size should be clear from context. Inline graphic is the number of ways of choosing Inline graphic objects from Inline graphic objects when order does not matter.

  • The Inline graphicth component of vector Inline graphic will be denoted by Inline graphic whereas the Inline graphicth component of vector Inline graphic will be denoted by Inline graphic. The element Inline graphic of matrix Inline graphic will be denoted by Inline graphic or Inline graphic. Estimates of variables will be denoted by putting a hat on top of the variable symbol. For example, an estimate of Inline graphic will be denoted by Inline graphic.

  • If Inline graphic is a random vector with a multivariate Normal distribution with mean Inline graphic and covariance Inline graphic then we will denote this distribution by Inline graphic. The joint density of vector Inline graphic will be denoted by Inline graphic whereas the marginal density of Inline graphic will be denoted as Inline graphic. Inline graphic denotes the expectation of Inline graphic with respect to both random variables Inline graphic and Inline graphic.

Methods

The organization of this article revolves around the following sequence of questions, which ultimately lead to the development of RAICAR-N:

  1. Why is a reproducibility assessment necessary in ICA analysis? In order to answer this question, we cover the fundamentals of ICA including identifiability issues in sections 2.1 and 2.2.

  2. How does the original RAICAR algorithm assess reproducibility? The answer to this question in section 2.3 will set up the stage for RAICAR-N.

  3. How does RAICAR-N permit calculation of reproducibility Inline graphic-values? In section 2.4, we describe the RAICAR-N “null” model and a simulation based approach for assigning Inline graphic-values to ICs.

  4. How to promote diversity in group ICA runs given a limited number of subjects when using RAICAR-N and how to display the non-Gaussian spatial structure in estimated ICs? These issues are covered in section 2.5 and 2.6.

  5. How can RAICAR-N be extended for between group comparison of ICs and how does it compare to other approaches in the literature? This question is addressed in section 4.4.

2.1 ICA background

In this section, we provide a brief introduction to ICA along with a discussion of associated issues related to model order selection, identifiability and run-to-run variability. The noisy ICA model assumes that observed data Inline graphic is generated as a linear combination of unobservable independent sources confounded with Gaussian noise:

graphic file with name pone.0027594.e046.jpg (2.1)

In this model,

graphic file with name pone.0027594.e047.jpg (2.2)

If the marginal density of the Inline graphicth source Inline graphic is Inline graphic then the joint source density Inline graphic factorizes as Inline graphic because of the independence assumption but is otherwise assumed to be unknown. Also, since the elements of Inline graphic are independent their co-variance matrix Inline graphic is diagonal. The set of variables Inline graphic represents the unknown parameters in the noisy ICA model. Before discussing the identifiability of model 2.1, we briefly discuss the choice of model order or the assumed number of ICs Inline graphic.

2.1.1 Estimating the model order Inline graphic

Rigorous estimation of the model order Inline graphic in noisy ICA is difficult as the IC densities Inline graphic are unknown. This means that Inline graphic, the marginal density of the observed data given the model order and the ICA parameters cannot be derived in closed form (by integrating out the ICs) without making additional assumptions on the form of IC densities. Consequently, standard model selection criteria such as Bayes information criterion (BIC) [17] cannot be easily applied to the noisy ICA model to estimate Inline graphic. One solution is to use a factorial mixture of Gaussians (MOG) joint source density model as in [5], and use the analytical expression for Inline graphic in conjunction with BIC. This solution is quite general in terms of allowing for an arbitrary Gaussian noise co-variance Inline graphic, but maximizing Inline graphic with respect to Inline graphic becomes computationally intractable using an expectation maximization (EM) algorithm for Inline graphic ICs [5]. Another rigorous non-parametric approach for estimating Inline graphic that is applicable to the noisy ICA model with isotropic diagonal Gaussian noise co-variance i.e., with Inline graphic is the random matrix theory based sequential hypothesis testing approach of [18]. To the best of our knowledge, these are the only 2 rigorous approaches for estimating Inline graphic in the noisy ICA model.

Approximate approaches for estimating Inline graphic commonly used in fMRI literature (e.g., [10]) consist of first relaxing the isotropic diagonal noisy ICA model (with Inline graphic) into a probabilistic PCA (PPCA) model of [19] where the source densities are assumed to be Gaussian i.e., where Inline graphic. When using the PPCA model, it becomes possible to integrate out the Gaussian sources to get an expression for Inline graphic that can be analytically maximized [19]. Subsequently, methods such as BIC can be applied to estimate Inline graphic. Alternative approaches for estimating Inline graphic in the PPCA model consist of the Bayesian model selection of [20], or in data-rich situations such as fMRI, even the standard technique of cross-validation [21].

From a biological point of view, it has been argued [22] that the number of extracted ICs simply reflect the various equally valid views of the human functional neurobiology - smaller number of ICs represent a coarse view while a larger number of ICs represent a more fine grained view. However, it is worth noting that from a statistical point of view, over-specification of Inline graphic will lead to over-fitting of the ICA model, which might render the estimated ICs less generalizable across subjects. On the other hand, under-specification of Inline graphic will result in incomplete IC separation. Both of these scenarios are undesirable.

2.1.2 Identifiability of the noisy ICA model

To what extent is the noisy linear ICA model identifiable? Consider a potentially different decomposition of the noisy ICA model 2.1:

graphic file with name pone.0027594.e084.jpg (2.3)

where

graphic file with name pone.0027594.e085.jpg (2.4)

What can be said about the equivalence between the parameterizations in 2.1 and 2.3?

Identifiability of Inline graphic: Equating the expectations of the right hand size of 2.3 and 2.1 and noting that Inline graphic have mean Inline graphic we get:

graphic file with name pone.0027594.e095.jpg (2.5)

Thus the mean vector Inline graphic is exactly identifiable.

Identifiability of Inline graphic : A fundamental decomposition result states that the noisy ICA problem is well-posed in terms of the identifiability of the mixing matrix Inline graphic upto permutation and scaling provided that the components of Inline graphic are independent and non-Gaussian [3], [12][14]. If Inline graphic is a diagonal scaling matrix and Inline graphic is a permutation matrix then the identifiability result can be stated as:

graphic file with name pone.0027594.e102.jpg (2.6)

where 2.3 is another decomposition of Inline graphic with Inline graphic containing independent and non-Gaussian components. In other words, the mixing matrix Inline graphic is identifiable upto permutation and scaling.

Identifiability of Inline graphic and Inline graphic : Equating the second moments of the right hand side of 2.3 and 2.1 and noting the equality of means 2.5 and the independence of Inline graphic and Inline graphic we get:

graphic file with name pone.0027594.e110.jpg (2.7)

Let Inline graphic be a Inline graphic matrix and Inline graphic be a Inline graphic orthogonal matrix such that:

graphic file with name pone.0027594.e115.jpg (2.8)

From 2.8 and 2.7 we get:

graphic file with name pone.0027594.e118.jpg (2.9)
graphic file with name pone.0027594.e119.jpg

Case 1: Inline graphic and Inline graphic. The second equation in 2.9 along with the orthogonality of Inline graphic gives Inline graphic and thus Inline graphic. If we fix the scaling of Inline graphic by selecting Inline graphic then from the first equation in 2.9 we get:

graphic file with name pone.0027594.e127.jpg (2.10)

In other words, the noise co-variance Inline graphic is uniquely determined and for a fixed scaling Inline graphic, the source variances Inline graphic are also uniquely determined upto permutation.

Case 2: Inline graphic and Inline graphic arbitrary positive definite matrices. Suppose Inline graphic is a square matrix and let Inline graphic be the diagonal matrix obtained by setting the non-diagonal elements of Inline graphic to Inline graphic and similarly let Inline graphic be the matrix obtained by setting the diagonal elements of Inline graphic to 0. The noise-covariance is partially identifiable by the following conditions:

graphic file with name pone.0027594.e139.jpg (2.11)

For a fixed scaling Inline graphic, the sources variances Inline graphic are constrained by:

graphic file with name pone.0027594.e143.jpg (2.12)

In general, the source variances Inline graphic cannot be uniquely determined as noted in [14].

Identifiability of the distribution of Inline graphic : Is the distribution of the non-Gaussian components of Inline graphic identifiable? From 2.1 and 2.3:

graphic file with name pone.0027594.e147.jpg (2.13)

Substituting 2.5 and 2.6 in 2.13 we get:

graphic file with name pone.0027594.e148.jpg (2.14)

Left multiplying both sides by Inline graphic from 2.8 we get:

graphic file with name pone.0027594.e150.jpg (2.15)

Let Inline graphic be the characteristic functions of Inline graphic and Inline graphic respectively. Then

graphic file with name pone.0027594.e154.jpg (2.16)

where Inline graphic and Inline graphic is a vector of real numbers of length equal to that of the corresponding random vectors in 2.16. Using 2.15, we can write:

graphic file with name pone.0027594.e158.jpg (2.17)

Noting the independence of Inline graphic and Inline graphic:

graphic file with name pone.0027594.e161.jpg (2.18)

Now Inline graphic and Inline graphic are multivariate Gaussian random vectors both with mean Inline graphic and co-variance matrix Inline graphic and Inline graphic respectively. Hence, their characteristic functions are given by [23], [24]:

graphic file with name pone.0027594.e167.jpg (2.19)
graphic file with name pone.0027594.e168.jpg

Claim 2.1 A sufficient condition for identifiability upto permutation and scaling of the non-Gaussian distributions in Inline graphic given two different parameterizations in 2.1 and 2.3 is:

graphic file with name pone.0027594.e170.jpg (2.20)

Proof• From 2.20 and 2.11, we get:

graphic file with name pone.0027594.e171.jpg (2.21)

Thus from 2.19,

graphic file with name pone.0027594.e172.jpg (2.22)

From 2.19, Inline graphic and Inline graphic are not equal to 0 for any finite Inline graphic, therefore, from 2.22 and 2.18 we get:

graphic file with name pone.0027594.e176.jpg (2.23)

Note that Inline graphic is a diagonal scaling matrix with entries Inline graphic on the diagonal and Inline graphic is a permutation matrix. Thus,

graphic file with name pone.0027594.e180.jpg (2.24)

where Inline graphic is some permutation of integers Inline graphic. Suppose Inline graphic is the characteristic function of the Inline graphicth component of Inline graphic and Inline graphic is the characteristic function of the Inline graphicth component of Inline graphic. Since the components of Inline graphic and Inline graphic are independent by assumption, the joint characteristic functions Inline graphic and Inline graphic factorize:

graphic file with name pone.0027594.e193.jpg (2.25)
graphic file with name pone.0027594.e194.jpg

From 2.25 and 2.23

graphic file with name pone.0027594.e195.jpg (2.26)

All characteristic functions satisfy [23], [24]:

graphic file with name pone.0027594.e196.jpg (2.27)

Since Inline graphic is simply a permutation of integers Inline graphic, there exists a Inline graphic such that Inline graphic. Then set Inline graphic in 2.26. Then 2.27 and 2.26 imply:

graphic file with name pone.0027594.e203.jpg (2.28)

Select the scaling matrix as Inline graphic and thus Inline graphic is a diagonal matrix with elements Inline graphic on the diagonal. Thus Inline graphic and 2.28 can be re-written as:

graphic file with name pone.0027594.e208.jpg (2.29)

Therefore,

graphic file with name pone.0027594.e209.jpg (2.30)
graphic file with name pone.0027594.e210.jpg
graphic file with name pone.0027594.e211.jpg

Hence the characteristic function of the Inline graphicst component of Inline graphic is identical to the characteristic function of the (possibly sign-flipped) Inline graphicth component of Inline graphic. Since characteristic functions uniquely characterize a probability distribution [23], the distribution of Inline graphic and Inline graphic is identical. Next, by setting Inline graphic, we can find a distribution from Inline graphic that matches the Inline graphicnd component Inline graphic of Inline graphic. Proceeding in a similar fashion, it is clear that the distribution of each component of Inline graphic is uniquely identifiable upto sign flips for the choice Inline graphic. For a general Inline graphic, the source distributions are uniquely identifiable upto permutation and (possibly negative) scaling, as claimed.

While the source distributions might not be uniquely identifiable for arbitrary co-variance matrices Inline graphic, they are indeed uniquely identifiable upto permutation and scaling for the noisy ICA model with isotropic Gaussian noise co-variance. For more general conditions that guarantee uniqueness of source distributions, please see [25], [26].

Corollary 2.2 If Inline graphic and Inline graphic, then the source distributions are uniquely identifiable upto sign flips for Inline graphic.

Proof Suppose Inline graphic and Inline graphic. Then from 2.9 Inline graphic and thus Inline graphic. The corollary then follows from Claim 2.1.

Corollary 2.3 If Inline graphic, then the source distributions are uniquely identifiable up to sign flips for Inline graphic.

Proof If Inline graphic, then noting that Inline graphic, we get Inline graphic. Hence from 2.12, we get Inline graphic. The corollary then follows from Claim 2.1.

2.1.3 Why is there a run-to-run variability in estimated ICs?

From the discussion in section 2.1.2, it is clear that for a noisy ICA model with isotropic diagonal additive Gaussian noise co-variance:

  1. The noisy ICA parameters Inline graphic are uniquely identifiable up to permutation and scaling.

  2. The source distributions in Inline graphic are uniquely identifiable upto permutation and scaling.

While the above theoretical properties of ICA are reassuring, there are a number of practical difficulties that prevent the reliable estimation of ICs on real data:

  1. Validity of the ICA model: The assumption that the observed real data is generated by an ICA model is only that - an “assumption”. If this assumption is not valid, then the uniqueness results do not hold anymore.

  2. Mutual information approximations: From an information theoretic point of view, the ICA problem is solved by minimizing a contrast function which is an approximation to the mutual information [27] between the ICs that depends on the finite amount of observed data. Such an approximation is necessary, since we do not have access to the marginal source densities Inline graphic. Different approximations to mutual information will lead to different objective functions and hence different solutions. This is one of the reasons why different ICA algorithms often produce different IC estimates even for the same data.

  3. Non-convexity of ICA objective functions: The ICA contrast function is potentially non-convex and hence has multiple local minima. Since global minimization is a challenging problem by itself, most ICA algorithms will only converge to local minima of the ICA contrast function. The run-to-run variability of IC estimates will also depend on the number of local minima in a particular ICA contrast function.

  4. IC estimate corruption by Gaussian noise: For noisy ICA, the IC realizations cannot be recovered exactly even if the true mixing matrix Inline graphic and mean vector Inline graphic are known in 2.1. Commonly used estimators for recovering realization of ICs include the least squares [10] as well as the minimum mean square error (MMSE) [14]. Consider the least squares estimate Inline graphic of a realization of Inline graphic based on Inline graphic:
    graphic file with name pone.0027594.e248.jpg (2.31)
    This means that even for known parameters, IC realization estimates Inline graphic will be corrupted by correlated Gaussian noise. Hence using different subsets of the data under the true model will also lead to variability in estimated ICs.
  5. Over-fitting of the ICA model: Over specification of the model order leads to the problem of over-fitting in ICA. As we describe below, this can lead to (1) the phenomenon of IC “splitting” and (2) an increase in the variance of the IC estimates.

1. IC “splitting”

Suppose that the true model order or the number of non-Gaussian sources in an ICA decomposition of Inline graphic such as 2.1 is Inline graphic. Then a fundamental result in [12, Theorem 1] states that for any other ICA decomposition of Inline graphic, the number of non-Gaussian sources remains the same while the number of Gaussian sources can change. In other words, Inline graphic cannot have two different ICA decompositions containing different number of non-Gaussian sources.

In view of this fact, how can a model order Inline graphic ICA decomposition containing Inline graphic non-Gaussian sources be “split” into a Inline graphic ICA decomposition containing Inline graphic non-Gaussian sources when performing ICA estimation using an assumed model order of Inline graphic? As we describe below, the order Inline graphic ICA decomposition is only an approximation to the order Inline graphic ICA decomposition.

Let Inline graphic be the Inline graphicth column of Inline graphic in 2.1. In the presence of noise, it might be possible to approximate:

graphic file with name pone.0027594.e264.jpg (2.32)

Here:

  • Inline graphic is the contribution of the Inline graphicth non-Gaussian source Inline graphic to the ICA model 2.1.

  • Inline graphic and Inline graphic are independent non-Gaussian random variables that are also independent with respect to all non-Gaussian sources Inline graphic in 2.1.

  • Inline graphic and Inline graphic are the basis time courses corresponding to Inline graphic and Inline graphic respectively.

  • The time courses Inline graphic and Inline graphic look similar to each other.

Note that if Inline graphic, then 2.32 can be made into an equality by choosing Inline graphic. By replacing Inline graphic in 2.1 using 2.32, we arrive at an approximate model order Inline graphic decomposition of Inline graphic. In this decomposition, the component Inline graphic from a model order Inline graphic decomposition appears to be “split” into two sub-components: Inline graphic and Inline graphic.

2. Inflated variance of IC estimates

Overestimation of model order will lead to over-fitting of the mixing matrix Inline graphic. In other words, Inline graphic could have several columns that are highly correlated with each other. This could happen as a result of IC “splitting” as discussed above. Now, for a given realization Inline graphic, the variance of Inline graphic is given by Inline graphic (for isotropic Gaussian co-variance). An increase in number of columns of Inline graphic and the fact that many of them are highly correlated implies that the variability of IC estimates Inline graphic is inflated.

In other words, running ICA multiple times on the same data or variations thereof with random initialization could produce different ICs.

2.2 ICA algorithms, single subject ICA and group ICA

In this section, we give a brief summary of how the ICA parameters are estimated in practice and also summarize the two most common modes of ICA application to fMRI data - single subject ICA (section 2.2.1) and temporal concatenation based group ICA (section 2.2.2).

Given several independent observations Inline graphic as per the noisy ICA model 2.1, most ICA algorithms estimate the ICA parameters Inline graphic and the realizations of Inline graphic in 2 steps. We only consider the case with Inline graphic, since as shown in section 2.1.2, the mixing matrix Inline graphic and source distributions of Inline graphic are identifiable upto permutation and scaling for this case.

  1. First, the diagonal source co-variance is arbitrarily set as Inline graphic. The mean vector Inline graphic is estimated as Inline graphic. Then, using PCA or PPCA [19], the mixing matrix Inline graphic is estimated, upto an orthogonal rotation matrix Inline graphic, to be in a signal subspace which is spanned by the principal eigenvectors corresponding to the largest eigenvalues of the data co-variance matrix Inline graphic. The noise variance Inline graphic is estimated in this step as well.

  2. Next, an estimator Inline graphic for the source realizations is defined using techniques such as least squares or MMSE. The only unknown involved in these estimates is the orthogonal rotation matrix Inline graphic.

  3. Finally, the non-Gaussianity of the empirical density of components of Inline graphic is optimized with respect to Inline graphic using algorithms such as fixed point ICA [27], [28].

For more details on noisy ICA estimation, please see [10] and for more details on ICA algorithms, please see [29].

2.2.1 Single subject ICA

How is ICA applied to single subject fMRI data? Suppose we are given a single subject fMRI scan which we rearrange as a Inline graphic 2D matrix Inline graphic in which column Inline graphic is the Inline graphic observed time-course Inline graphic in the brain at voxel Inline graphic. Observed time-courses Inline graphic are considered to be Inline graphic independent realizations of Inline graphic as per the linear ICA model 2.1. Suppose Inline graphic is the Inline graphic matrix containing the estimated source realizations at the Inline graphic voxels. The Inline graphic th row of Inline graphic is the Inline graphicth IC. In other words, we decompose the time by space fMRI 2D matrix into a set of basis time-courses and a set of Inline graphic 3D IC maps using ICA.

2.2.2 Group ICA

How is ICA applied to data from a group of subjects in fMRI? Suppose we collect fMRI images from Inline graphic subjects. First, we register all subjects to a common space using a registration algorithm (e.g., affine registration). Next, we rearrange each of the fMRI scans into Inline graphic 2D matrices Inline graphic, each of size Inline graphic. Column Inline graphic in Inline graphic is the demeaned time-course observed at voxel location Inline graphic for subject Inline graphic. The matrices Inline graphic are temporally concatenated to get a Inline graphic matrix Inline graphic as follows:

graphic file with name pone.0027594.e337.jpg (2.33)

Column Inline graphic of Inline graphic is the Inline graphic vector Inline graphic which is assumed to follow a linear ICA model 2.1. Inline graphic are considered to be independent realizations of the model 2.1. Suppose Inline graphic is a Inline graphic matrix containing the estimated source realizations at the Inline graphic voxels. The Inline graphicth row of Inline graphic is the Inline graphic th group IC. In group ICA, the joined time-series across subjects is modeled using noisy linear ICA. In practice, Inline graphic is the PCA reduced data set for subject Inline graphic. The PCA reduction is either done separately for each subject using subject specific data co-variance [9] or an average data co-variance across subjects [8]. The average co-variance approach requires each subject to have the same number of time points in fMRI scans.

2.3 The original RAICAR algorithm

In this section, we give a brief introduction to the RAICAR algorithm of [1]. Suppose we are given a data set which we decompose into Inline graphic ICs using ICA (e.g., single subject or group ICA). Our goal is to assess which ICs consistently show up in multiple ICA runs i.e., the reproducibility of each of these Inline graphic ICs. To that extent, we run the ICA algorithm Inline graphic times. Suppose Inline graphic is the Inline graphic vector (e.g. spatial ICA map re-arranged into a vector) of the Inline graphic th IC from Inline graphic th ICA run. Suppose Inline graphic is a Inline graphic absolute spatial cross-correlation coefficient matrix between the ICs from runs Inline graphic and Inline graphic:

graphic file with name pone.0027594.e362.jpg (2.34)

where Inline graphic denotes absolute value. Inline graphic is the absolute spatial cross-correlation coefficient between IC Inline graphic from run Inline graphic and IC Inline graphic from run Inline graphic. The matrices Inline graphic are then arranged as elements of a Inline graphic block-matrix Inline graphic such that the Inline graphic th row and Inline graphic th column of Inline graphic is Inline graphic (Figure 2). This block matrix Inline graphic is the starting point for a RAICAR across-run component matching process.

Figure 2. Pictorial depiction of the original RAICAR algorithm [1].

Figure 2

The ICA algorithm is run Inline graphic times with each run producing Inline graphic ICs. Inline graphic is a Inline graphic block matrix with elements Inline graphic where Inline graphic is the Inline graphic absolute spatial cross-correlation matrix between ICs from runs Inline graphic and Inline graphic. The numbered green circles indicate the sequence of steps in applying RAICAR to a given data set. Our definition of normalized reproducibility in box 7 averages un-thresholded correlation coefficients thereby avoiding the selection of a correlation coefficient threshold prior to averaging.

Since ICs within a particular run cannot be matched to each other, the Inline graphic matrices Inline graphic along the block-diagonal of Inline graphic are set to Inline graphic as shown in Figure 2 with a gray color. The following steps are involved in a RAICAR analysis:

  1. Find the maximal element of Inline graphic. Suppose this maximum occurs in matrix Inline graphic at position Inline graphic. Hence component Inline graphic from run Inline graphic matches component Inline graphic from run Inline graphic. Let us label this matched component by Inline graphic (the first matched component).

  2. Next, we attempt to find from each run Inline graphic (Inline graphic and Inline graphic) a component that matches with component Inline graphic. Suppose element Inline graphic is the maximal element in the Inline graphic th column of Inline graphic. Then component Inline graphic is the best matching component from run Inline graphic with the Inline graphic th component from run Inline graphic.

Similarly, suppose element Inline graphic is the maximal element in the Inline graphic th row of Inline graphic. Then component Inline graphic is the best matching component from run Inline graphic with component Inline graphic from run Inline graphic. As noted in [1], in most cases Inline graphic. However, it is possible that Inline graphic. Hence the component number Inline graphic matching Inline graphic from run Inline graphic is defined as follows:

graphic file with name pone.0027594.e421.jpg (2.35)

We would also like to remove component Inline graphic of run Inline graphic from further consideration during the matching process. To that extent, we zero out the Inline graphic th row from Inline graphic and the Inline graphic th column from Inline graphic.

  1. Once a matching component Inline graphic has been found for all runs Inline graphic, we also zero out the Inline graphicth row from Inline graphic and the ith column from Inline graphic. Similarly, we zero out the Inline graphic th column from Inline graphic and the Inline graphic th row from Inline graphic. This eliminates component Inline graphic from run Inline graphic and component Inline graphic from run Inline graphic from further consideration during the matching process.

  2. Steps 1–3 complete the matching process for one IC component across runs. These steps are repeated until Inline graphic components are matched across the Inline graphic runs. We label the matched component Inline graphic as Inline graphic which contains a set of Inline graphic matching ICs one from each of the Inline graphic ICA runs.

Suppose matched component Inline graphic, Inline graphic consists of the matched ICs Inline graphic. Form the Inline graphic cross-correlation matrix Inline graphic between the matched components in Inline graphic. The Inline graphic th element of this matrix is simply:

graphic file with name pone.0027594.e454.jpg (2.36)

The normalized reproducibility of Inline graphic is then defined as:

graphic file with name pone.0027594.e456.jpg (2.37)

The double sum in 2.37 is simply the sum of the upper triangular part of Inline graphic excluding the diagonal. The normalizing factor Inline graphic is simply the maximum possible value of this sum. Hence the normalized reproducibility satisfies: Inline graphic.

Note that our definition of normalized reproducibility is slightly different from that in [1]. Whereas [1] averages the thresholded absolute correlation coefficients, we simply average the un-thresholded absolute correlation coefficients to compute reproducibility thereby avoiding the selection of a threshold on the absolute correlation coefficients.

2.4 The RAICAR-N enhancement

In this section, we describe how to compute reproducibility Inline graphic-values for each matched component in RAICAR. Note that the RAICAR “component matching” process can be used to assess the reproducibility of any spatial component maps - not necessarily ICA maps. For instance, RAICAR can be used to assess the reproducibility of a set of PCA maps across subjects.

In order to generate reproducibility Inline graphic-values for the matched component maps:

  1. We need to determine the distribution of normalized reproducibility that we get from the RAICAR “component matching” process when the input to RAICAR represents a set of “non-reproducible component maps” across the Inline graphic runs.

  2. In addition, we would also like to preserve the overall structure seen in the observed sets of spatial component maps across the Inline graphic runs when generating sets of “non-reproducible component maps” across the Inline graphic runs.

Hence for IC reproducibility assessment, we propose to use the original set of ICs across the Inline graphic runs to generate the “non-reproducible component maps” across the Inline graphic runs.

Suppose Inline graphic ICA runs are submitted to RAICAR which gives us a Inline graphic vector of observed normalized reproducibility values Inline graphic - one for each IC. We propose to attach Inline graphic-values for measuring the reproducibility of each IC in a data-driven fashion as follows:

  1. First, we label the Inline graphic ICs across the Inline graphic runs using unique integers. In run 1, the ICs are labelled using integers Inline graphic. In run 2, the ICs are labelled using integers Inline graphic and so on. In run Inline graphic, the ICs are labelled using integers Inline graphic.

  2. Our “null” hypothesis is:
    graphic file with name pone.0027594.e477.jpg (2.38)
    Inline graphic

To do this, we randomly permute the integers Inline graphic to get the permuted integers Inline graphic. Obviously Inline graphic.

  1. The Inline graphic sets “non-reproducible component runs under Inline graphic” are constructed by assigning components with labels:

    • Inline graphic to run 1 under Inline graphic.

    • Inline graphic to run 2 under Inline graphic

    • Inline graphic to run Inline graphic under Inline graphic

  2. After Inline graphic runs have been generated under Inline graphic, we subject these to a RAICAR analysis. This gives us Inline graphic values of normalized reproducibility, one for each matched component under Inline graphic.

  3. Steps 1–4 are repeated Inline graphic times to build up a pooled Inline graphic vector of normalized reproducibility Inline graphic under Inline graphic.

  4. Finally, we assign a Inline graphic-value for reproducibility to each matched IC across the Inline graphic runs. The observed reproducibility for Inline graphic th matched IC is Inline graphic and its Inline graphic-value is:
    graphic file with name pone.0027594.e504.jpg (2.39)
  5. Only those components with Inline graphic are considered to be significantly reproducible. We can use a fixed and objective value for Inline graphic such as Inline graphic. Note that this fixed cutoff is independent of the amount of variability in the input to RAICAR-N. Please see Figure 3 for a pictorial depiction of this process.

Figure 3. Pictorial depiction of the process for generating a “null” distribution in RAICAR-N.

Figure 3

Our “null” hypothesis is: “Inline graphic: None of the ICs are reproducible. Hence, we can randomly label IC Inline graphic from run Inline graphic as IC Inline graphic from run Inline graphic”. Therefore we randomly split the Inline graphic ICs across Inline graphic runs into Inline graphic parts and run the RAICAR algorithm on each set of randomly split ICs. This gives us a set of “null” reproducibility values which can be used to compute Inline graphic-values for the observed reproducibility of ICs in the original RAICAR run. The green circles indicate the sequence of steps for generating the “null” distribution after the steps in Figure 2.

Figure 4 shows a schematic of the RAICAR-N analysis process.

Figure 4. Flowchart for a group ICA based RAICAR-N analysis.

Figure 4

The Inline graphic single subject data sets are first pre-processed and subsequently bootstrapped to create Inline graphic groups, each group containing Inline graphic distinct subjects. Each group of Inline graphic subjects is submitted to a temporal concatenation group ICA analysis. The resulting IC maps (either raw ICs or ICs scaled by noise standard deviation) are subjected to a RAICAR analysis. The cross-realization cross correlation matrix (CRCM) is randomly permuted multiple times: Inline graphic where Inline graphic is a random permutation of integers from Inline graphic. The permuted CRCMs are subjected to a RAICAR analysis to generate a realization of reproducibility values under the “null” hypothesis. The computed “null” distribution of reproducibility values is used to assign Inline graphic values to the observed reproducibility of the original RAICAR run. Finally, reproducible ICs are averaged using a random effects analysis and the resulting Inline graphic-statistic images are subjected to GammaInline graphic, Student Inline graphic and GammaInline graphic mixture modeling.

2.5 How many subjects should be used per group ICA run in RAICAR-N?

The input to RAICAR-N can either be single subject ICA runs or group ICA runs across a set of subjects. Note that the individual subject ICA runs are spatially unconstrained whereas a group ICA spatially constrains the group ICs across a set of subjects. Hence the number of ICs that can be declared as significantly reproducible at the group level are usually more than those that can be declared significantly reproducible at the single subject level. Hence the following question is relevant:

Suppose we have a group of Inline graphic subjects. We randomly select Inline graphic subjects and form a single group of subjects. We repeat this process Inline graphic times to get Inline graphic groups of Inline graphic subjects each of which is subjected to a group ICA analysis. Given the number of subjects Inline graphic, how should we choose Inline graphic and Inline graphic?

First, we discuss the choice of Inline graphic. If Inline graphic then each of the Inline graphic groups will contain the same Inline graphic subjects and hence there will be no diversity in the Inline graphic groups. We would like to control the amount of diversity in the Inline graphic groups of Inline graphic subjects. Consider any 2 subjects Inline graphic and Inline graphic. The probability Inline graphic that both Inline graphic and Inline graphic appear in a set of Inline graphic randomly chosen subjects from Inline graphic subjects is given by:

graphic file with name pone.0027594.e551.jpg (2.40)

The expected number of times that Inline graphic and Inline graphic appear together in sets of Inline graphic subjects out of Inline graphic independently drawn sets is:

graphic file with name pone.0027594.e556.jpg (2.41)

Ideally, we would like Inline graphic to be only a small fraction of Inline graphic. Hence we impose the restriction:

graphic file with name pone.0027594.e559.jpg (2.42)

where Inline graphic is a user defined constant such as Inline graphic. This implies that the chosen value of Inline graphic must satisfy:

graphic file with name pone.0027594.e563.jpg (2.43)

In practice, we choose the largest value of Inline graphic that satisfies this inequality. As shown in Figure 5, if Inline graphic and Inline graphic then the largest value of Inline graphic that satisfies 2.43 is Inline graphic. The number of group ICA runs Inline graphic should be as large as possible. From our experiments on real fMRI data we can roughly say that values of Inline graphic give equivalent results.

Figure 5. Figure shows a plot ofInline graphic vs Inline graphic for Inline graphic in blue.

Figure 5

The red line shows the Inline graphic cutoff. The largest value of Inline graphic for which Inline graphic is Inline graphic.

2.6 How to display the estimated non-Gaussian spatial structure in ICA maps?

The ICs have been optimized for non-Gaussianity. However, there can be many types of non-Gaussian distributions. It has been empirically found that the non-Gaussian distributions of ICs found in fMRI data have the following structure:

  1. A central Gaussian looking part and

  2. A tail that extends out on either end of the Gaussian

It has been suggested in [10] that a Gaussian/Gamma mixture model can be fitted to this distribution and the Gamma components can be thought of as representatives of the non-Gaussian structure. We follow a similar approach:

  1. The output of a RAICAR-N analysis is a set of spatial ICA maps (either Inline graphic-transformed maps or raw maps) concatenated into a 4-D volume.

  2. We do a voxelwise transformation to Normality using the voxelwise empirical cumulative distribution function as described in [30].

  3. Next, we submit the resulting 4-D volume to a voxelwise group analysis using ordinary least squares. The design matrix for group analysis depends on the question being considered. In our case, the design matrix was simply a single group average design.

  4. The resulting Inline graphic-statistic maps are subjected to Student Inline graphic, GammaInline graphic and GammaInline graphic mixture modeling. The logic is that if the original ICA maps are pure Gaussian (i.e., have no interesting non-Gaussian structure) then the result of a group average analysis will be a pure Student Inline graphic map which will be captured by a single Student Inline graphic (i.e., the GammaInline graphic and GammaInline graphic will be driven to Inline graphic class fractions). Hence the “null” hypothesis will be correctly accounted for.

  5. If the Gamma distributions have Inline graphic posterior probability at some voxels then those voxels are displayed in color to indicate the presence of significant non-Gaussian structure over and above the background Student Inline graphic distribution.

Examples of Student Inline graphic, GammaInline graphic and GammaInline graphic mixture model fits are shown in Figure 6.

Figure 6. Examples of displaying non-Gaussian spatial structure using a StudentInline graphic, GammaInline graphic and GammaInline graphic mixture model.

Figure 6

Notice how the GammaInline graphic density is driven to near Inline graphic class fraction in the absence of significant negative non-Gaussian structure.

Results

3.1 Human rsfMRI data

rsfMRI data titled: Baltimore (Pekar, J.J./Mostofsky, S.H.; n = 23 [8M/15F]; ages: 20–40; TR = 2.5; # slices = 47; # timepoints = 123), a part of the 1000 functional connectomes project, was downloaded from the Neuroimaging Informatics Tools and Resources Clearinghouse (NITRC): http://www.nitrc.org/projects/fcon_1000/.

3.2 Preprocessing

Data was analyzed using tools from the FMRIB software library (FSL: http://www.fmrib.ox.ac.uk/fsl/). Preprocessing steps included motion correction, brain extraction, spatial smoothing with an isotropic Gaussian kernel of 5 mm FWHM and 100 s high-pass temporal filtering. Spatial ICA was performed using a noisy ICA model as implemented in FSL MELODIC [10] in either single subject or multi-subject temporal concatenation mode also called group ICA. Please see section 2.2 for a brief summary of single subject ICA and group ICA. In each case, we fixed the model order of ICA at Inline graphic to be consistent with the model order range typically extracted in rsfMRI and fMRI [16], [31]. For temporal concatenation based group ICA, single subject data was first affinely registered to the MNI 152 brain and subsequently resampled to 4×4×4 resolution (MNI 4×4×4) to decrease computational load. Please see Figure 4 for a schematic of the RAICAR-N analysis process. In this work, we report across subject RAICAR-N analyses, but as shown in Figure 7, within subject ICA runs can also be entered into RAICAR-N.

Figure 7. Inline graphic-value cutoffs for within and across single subject analysis using RAICAR-N.

Figure 7

This figure illustrates the intuitive fact that within subject ICA runs are much more reproducible compared to across subject ICA runs.

3.3 RAICAR-N analysis with 1 ICA run per subject

Spatial ICA was run once for each of the Inline graphic subjects in their native space. The resulting set of ICA components across subjects were transformed to MNI 4×4×4 space and were submitted to a RAICAR-N analysis. In all RAICAR-N analyses reported in this article, we used the Inline graphic-transformed IC maps - which are basically the raw IC maps divided by a voxelwise estimate of noise standard deviation (named as melodic_IC.nii.gz in MELODIC). It is also possible to use the raw IC maps as inputs to RAICAR-N. ICA components were sorted according to their reproducibility and Inline graphic-values were computed for each ICA component. Please see Figure 8.

Figure 8. Single subject rsfMRI ICA runs across 23 subjects were combined using a RAICAR-N analysis.

Figure 8

Figure (a) shows the observed values of normalized reproducibility (bottom) as well as the “null” distribution of normalized reproducibility across Inline graphic simulations (top). Figure (b) shows the Inline graphic-values for each IC along with the Inline graphic and Inline graphic cutoff lines.

We compared the reproducible RSNs from the single subject RAICAR-N analysis to the group RSN maps reported in literature [8]. Please see Figure 9.

Figure 9. The top 8 “reproducible” ICs from a RAICAR-N analysis on single subject ICA runs compared with standard RSN maps reported in literature [8].

Figure 9

We are able to declare 4 “standard” RSNs as significantly reproducible at a Inline graphic-value Inline graphic. There are 2 other “standard” RSNs that achieve a reproducibility Inline graphic-value between Inline graphic and Inline graphic as well as 2 “non-standard” RSNs that achieve Inline graphic-values of Inline graphic and Inline graphic respectively. We also could not find 2 of the published RSNs in [3] as reproducible in single subject ICA runs.

To summarize, when single subject ICA runs are combined across subjects:

  • We are able to declare 4 “standard” RSNs as significantly reproducible at a Inline graphic-value Inline graphic.

  • There are 2 other “standard” RSNs that achieve a reproducibility Inline graphic-value between 0.05 and 0.06.

  • There are 2 other “non-standard” RSNs that are of interest: one achieves a Inline graphic-value of 0.0125 and the other achieves a Inline graphic-value of 0.05699.

3.4 RAICAR-N on random sets of 5 subjects - 50 group ICA runs

To promote diversity across the group ICA runs, as discussed in section 2.5, Inline graphic subjects were drawn at random from the group of Inline graphic subjects and submitted to a temporal concatenation based group ICA. This process was repeated Inline graphic times and the resulting set of 50 group ICA maps were submitted to a RAICAR-N analysis. ICA components were sorted according to their reproducibility and Inline graphic-values were computed for each ICA component. Please see Figure 10.

Figure 10. Inline graphic subjects were randomly drawn from the set of Inline graphic subjects and submitted to a temporal concatenation based group ICA.

Figure 10

This process was repeated Inline graphic times and the resulting ICA maps were submitted to a RAICAR-N analysis. Figure (a) shows the observed values of normalized reproducibility (bottom) as well as the “null” distribution of normalized reproducibility across Inline graphic simulations (top). Figure (b) shows the Inline graphic-values for each IC along with the Inline graphic and Inline graphic cutoff lines.

We compared the reproducible RSNs from the single subject RAICAR-N analysis to the RSN maps reported in literature [8]. Please see Figure 11.

Figure 11. The top 15 “reproducible” ICs fromInline graphic runs of Inline graphic subject group ICA RAICAR-N analysis compared with standard RSN maps reported in literature [8].

Figure 11

We are able to declare 8 “standard” RSNs as significantly reproducible at a Inline graphic-value of Inline graphic. There are 6 other “non-standard” RSNs that can be declared as significantly reproducible at a Inline graphic-value of Inline graphic and 1 other “non-standard” RSN that achieves a Inline graphic-value of Inline graphic.

In summary, when 50 random 5 subject group ICA runs (from a population of 23 subjects) are combined using RAICAR-N:

  • We are able to declare 8 “standard” RSNs as significantly reproducible at a Inline graphic-value Inline graphic.

  • There are 6 other “non-standard” RSNs that can be declared as significantly reproducible at a Inline graphic-value Inline graphic.

  • There is 1 other “non-standard” RSN that achieves a Inline graphic-value of 0.05299.

3.5 RAICAR-N on random sets of 5 subjects - 100 group ICA runs

To promote diversity across the group ICA runs, as discussed in section 2.5, Inline graphic subjects were drawn at random from the group of Inline graphic subjects and submitted to a temporal concatenation based group ICA. This process was repeated Inline graphic times and the resulting set of 100 group ICA maps were submitted to a RAICAR-N analysis. ICA components were sorted according to their reproducibility and Inline graphic-values were computed for each ICA component. Please see Figure 12.

Figure 12. Inline graphic subjects were randomly drawn from the set of Inline graphic subjects and submitted to a temporal concatenation based group ICA.

Figure 12

This process was repeated Inline graphic times and the resulting ICA maps were submitted to a RAICAR-N analysis. Figure (a) shows the observed values of normalized reproducibility (bottom) as well as the “null” distribution of normalized reproducibility across Inline graphic simulations (top). Figure (b) shows the Inline graphic-values for each IC along with the Inline graphic and Inline graphic cutoff lines.

We compared the reproducible RSNs from the single subject RAICAR-N analysis to the RSN maps reported in literature [8]. Please see Figure 13.

Figure 13. The top 15 “reproducible” ICs fromInline graphic runs of Inline graphic subject group ICA RAICAR-N analysis compared with standard RSN maps reported in literature [8].

Figure 13

We are able to declare 8 “standard” RSNs as significantly reproducible at a Inline graphic-value of Inline graphic. There are 6 other “non-standard” RSNs that can be declared as significantly reproducible at a Inline graphic-value of Inline graphic and 1 other “non-standard” RSN that achieves a Inline graphic-value of Inline graphic.

In summary, when 100 random 5 subject group ICA runs (from a population of 23 subjects) are combined using RAICAR-N:

  • We are able to declare 8 “standard” RSNs as significantly reproducible at a Inline graphic-value Inline graphic.

  • There are 6 other “non-standard” RSNs that can be declared as significantly reproducible at a Inline graphic-value Inline graphic.

  • There is 1 other “non-standard” RSN that achieves a Inline graphic-value of 0.05824.

Discussion

As discussed in section 2.1.2, in the noisy linear ICA model with isotropic diagonal Gaussian noise co-variance, for a given true model order, the mixing matrix and the source distributions are identifiable upto permutation and scaling. However, as pointed out in section 2.1.3, various factors prevent the convergence of ICA algorithms to unique IC estimates. These factors include ICA model not being the true data generating model, approximations to mutual information used in ICA algorithms, multiple local minima in ICA contrast functions, confounding Gaussian noise as well as variability due to model order over-estimation. A practical implication of these factors is that ICA algorithms converge to different IC estimates depending on how they are initialized and on the specific data used as input to ICA. Hence, there is a need for a rigorous assessment of reproducibility or generalizability of IC estimates. A set of reproducible ICs can then be used as ICA based characteristics of a particular group of subjects.

We proposed an extension to the original RAICAR algorithm for reproducibility assessment of ICs within or across subjects (Figure 7). The modified algorithm called RAICAR-N builds up a “null” distribution of normalized reproducibility values under a random assignment of observed ICs across the Inline graphic runs. This “null” distribution is used to compute reproducibility Inline graphic-values for each observed matched component from RAICAR. An objective cutoff such as Inline graphic can be used to detect “significantly reproducible” components. This avoids subjective user decisions such as selection of the number of clusters in ICASSO or the reproducibility cutoff in RAICAR or a cutoff on intra cluster distance in sogICA.

4.1 Results for publicly available rsfMRI data

We applied RAICAR-N to publicly available Inline graphic subject rsfMRI data from http://www.nitrc.org/. We analyzed the data in 2 different ways:

  1. Inline graphic ICs were extracted for each of the Inline graphic subjects. The Inline graphic single subject ICA runs were subjected to a RAICAR-N analysis (after registration to standard space).

  • In single subject ICA based RAICAR-N analysis (Figures 8, 9), we are able to declare 6 out of the 8 ICs reported in [8] (which used group ICA) as “reproducible” (4 ICs have Inline graphic-values Inline graphic and 2 ICs have Inline graphic-values Inline graphic). This is consistent with the 5 reproducible RSNs reported in [32] using single subject ICA analysis.

  1. Inline graphic subjects were randomly drawn from Inline graphic subjects to create one group of subjects which was subjected to a group ICA analysis in which Inline graphic components were extracted. This process was repeated Inline graphic or Inline graphic times and the resulting group ICA runs were subjected to a RAICAR-N analysis.

  • In group ICA based RAICAR-N analysis (Figures 10, 11, 12, and 13), we are able to declare all 8 components reported in [8] as “reproducible” (at Inline graphic). Some of the ICs detected as “reproducible” in the group ICA based RAICAR-N on human rsfMRI data are not shown in [8] but do appear in the more recent paper [31]. RAICAR-N results for Inline graphic are almost identical to those for Inline graphic suggesting that Inline graphic runs of group ICA are sufficient for a RAICAR-N reproducibility analysis.

4.2 Single subject ICA vs Group ICA

Based on our results, it appears that single subject ICA maps are less reproducible compared to group ICA maps as illustrated in Figures 8 and 10. A single subject ICA based analysis is more resistant to subject specific artifacts. On the other hand, a group ICA based analysis makes the strong assumption that ICs are spatially identical across subjects. If this assumption is true, group ICA takes advantage of temporal concatenation to constrain the ICs spatially across subjects thereby reducing their variance. Hence, when there are no gross artifacts in individual rsfMRI data sets, group ICA is expected to be more sensitive for reproducible IC detection. As seen in Figures 9 and 11, our results agree with this proposition. All ICs declared as “reproducible” in the single subject based RAICAR-N analysis continue to remain “reproducible” in the group ICA based RAICAR-N analysis.

4.3 How should subjects be grouped for group ICA?

This raises the question of how the subjects should be grouped together for individual group ICA runs in preparation for RAICAR-N. If all Inline graphic subjects are used in all group ICA analyses then there is no diversity in the individual group ICA runs. In this case, a RAICAR-N analysis will capture algorithmic variability due to non-convexity of ICA objective function but not dataset variability. Hence, our conclusions might not be generalizable to a different set of Inline graphic subjects.

Another option is to randomly select Inline graphic subjects out of Inline graphic for each group ICA run and submit the resulting Inline graphic group ICA runs to RAICAR-N. In this case, we will account for both algorithmic and data set variability via a RAICAR-N analysis. In other words, we will be able to determine those ICs that are “reproducible” across different sets of Inline graphic subjects and across multiple ICA runs. A key question is: How should we choose Inline graphic and Inline graphic? In section 2.5, we proposed a simple method to determine the number of subjects Inline graphic to be used in a single group ICA run out of the Inline graphic subjects - the key idea is to form groups with enough “diversity”. Multiple such group ICA runs can then be submitted to a RAICAR-N analysis for reproducibility assessment. Clearly, the larger the value of Inline graphic, the larger the value of Inline graphic. Hence, increasing the number of subjects Inline graphic in a study will allow us to make conclusions that are generalizable to a larger set of Inline graphic subjects. Also, conclusions generalizable to Inline graphic subjects are expected to hold for Inline graphic subjects but not vice versa.

4.4 RAICAR-N for group comparisons of reproducible ICs

In the present work, our focus was on enabling the selection of reproducible ICs for a given single group of subjects. However, RAICAR-N can be extended for between group analysis of reproducible components as well. Before we describe how to do so, it is useful to discuss other approaches for group analysis of RSNs described in Appendix S1. Suppose we have two groups of subjects Inline graphic and Inline graphic.

4.4.1 Discussion of single group ICA based approaches

1. Subject specific maps corresponding to group ICA maps derived using ICA back projection or dual regression are not true ICs, i.e., they are not solutions to an ICA problem.

2. These approaches do not account for either the algorithmic or the data set variability of an ICA decomposition. The single group ICA decomposition will contain both reproducible and non-reproducible ICs, but there is no systematic way to differentiate between the two.

3. Both dual regression and ICA back projection using data derived IC templates are circular analyses. First, group ICA using all data is used to derive template IC maps or template time courses. Next least-squares based ICA back projection or dual regression using a subset of the same data is used to derive subject specific maps and time courses corresponding to each IC. Thus model Inline graphic (group ICA) on data Inline graphic is used to learn an assumption Inline graphic (template IC maps or template time courses) that is then used to fit model Inline graphic (dual regression or ICA back projection) on a subset of the same data Inline graphic. This is circular analysis [33], .

It is easy to avoid circular analysis in a dual regression approach via cross-validation. For example, one can split the groups Inline graphic and Inline graphic into two random parts, a “training” set and a “test” set. First, the “training” set can be used to derive template IC maps using group ICA. Next, the “training” set based template IC maps can be used as spatial regressors for dual regression on the “test” set. Alternatively, the template ICs for dual regression can also come from a separate ICA decomposition on a independent data set unrelated to groups Inline graphic and Inline graphic such as human rsfMRI data. This train/test approach cleanly avoids the circular analysis problem. It is not clear how to use cross-validation for an ICA back projection approach since template time courses cannot be assumed to remain the same across ICA decompositions.

4. Subject specific structured noise is quite variable in terms of its spatial structure. Hence, a group ICA analysis cannot easily model or account for subject specific structured noise via group level ICs. Consequently, subject specific spatial maps in ICA back projection or dual regression will have a noise component that is purely driven by the amount of structured noise in individual subjects. On the other hand, a single subject ICA based analysis can accurately model subject specific structured noise via single subject ICs.

4.4.2 Discussion of multiple ICA run approaches

1. [35] report that using different sets of template ICs in template based methods using spatial correlation such as [36] can result in the selection of different ICs in individual ICA runs. This is not surprising since IC correspondence derived from template based methods does depend on the particular template used. This is similar to a seed based correlation analysis being dependent on the particular seed ROI used. It is worth noting that template free approaches such as sogICA and RAICAR do not rely on any template.

2. [22] state that individual runs across subjects (or groups of subjects) can be quite variable in terms of the spatial structure of the estimated ICs. For example [22], point out that an IC might be apparently split into two sub-components in some subjects but not others. The real problem is that the same model order could lead to over-fitting in some subjects (or groups of subjects) but not in others. Hence, the observed differences in a group comparison might be biased by the unknown difference in the amount of over-fitting across groups Inline graphic and Inline graphic.

As described in 2.1.3, over-fitting can lead to the phenomenon of component “splitting” in ICA. This is not limited to single subject ICA but can also occur in group ICA. For instance [35], report the “default mode” network as split into three sub networks using group ICA and note that component “splitting” can also reflect functional segregation or hierarchy within a particular IC and is not necessarily a consequence of model order overestimation in every case.

Over-fitting can be correctly accounted for by a reproducibility analysis. This is because we expect the real and stable non-Gaussian sources to be reproducible across multiple ICA runs (algorithmic variability) and across different subjects or groups of subjects (data set variability).

If we want the results of a between group ICA analysis to be generalizable to an independent group of subjects then we must account for both the algorithmic and data variability of ICA. We propose to modify RAICAR-N for enabling between group comparisons of “reproducible” ICs as follows:

  1. Enter multiple within and across subject (or within and across sets of subjects) ICA runs for groups Inline graphic and Inline graphic into a RAICAR analysis. Perform the RAICAR component matching process across groups Inline graphic and Inline graphic.

  2. Use RAICAR-N to compute reproducibility Inline graphic-values separately for group Inline graphic and Inline graphic for each matched component across groups Inline graphic and Inline graphic.

  3. Only ICs that are separately reproducible in both groups Inline graphic and Inline graphic and that are maximally similar to each other are used for between group comparisons.

4.5 Limitations of a RAICAR-N analysis

In this work, we focussed on developing an objective method for reproducible IC detection across multiple ICA runs. However, our approach has some limitations that are worth mentioning:

  1. RAICAR-N gives no indication to the extent of biological relevance of a reproducible IC. For example, an artifactual component consistently appearing across fMRI runs could be highly reproducible and yet non-interesting.

  2. The Inline graphic-values in RAICAR-N can be conservative. Certain borderline reproducible ICs may not achieve statistical significance in RAICAR-N due to the differences in the amount of structured noise in individual ICA decompositions. One approach to increase the sensitivity of RAICAR-N is to denoise the individual fMRI runs via ICA and remove gross artifacts prior to a RAICAR-N analysis.

  3. RAICAR-N does not allow us to relate the reproducible ICs to each other. These relationships might be important for the identification of functionally related brain networks. A recent paper [37] proposes to use the mutual information between spatial ICs as a similarity measure for agglomerative heirarchical clustering of ICs. The number of IC clusters is decided using the well-known ANOVA based approach developed in [38]. Each IC cluster can then be thought of as a functionally related brain network.

RAICAR-N is useful for objective and non-parametric reproducibility assessment but does not attempt to relate the ICs to each other. On the other hand, the work by [37] uses a subjective definition of “qualified clusters” and a hard cutoff on the “quality index” to identify the single most stable run of ICA for IC clustering. RAICAR-N and the work by [37] have different primary objectives (IC reproducibility vs. IC clustering) and it appears that the benefits of both can be realized by feeding the reproducible ICs from RAICAR-N into the algorithm of [37].

To summarize, a RAICAR-N analysis:

  • can be applied for “reproducible” component detection either within or across subjects in any component based analysis - not necessarily ICA. For instance, a set of PCA maps across subjects can be submitted to a RAICAR-N analysis.

  • is simple to implement and accounts for both algorithmic and data set variability of an ICA decomposition.

  • avoids any user decisions except the final Inline graphic-value cutoff which can be objectively pre-set at standard values such as Inline graphic.

  • can be extended to enable comparisons of reproducible ICs between groups Inline graphic and Inline graphic.

Multiple group ICA runs using groups of subjects with enough “diversity” can be used to account for the run-to-run variability in ICA algorithms both due to the non-convex ICA objective function as well as across subjects data variability. These group ICA runs can be subjected to a RAICAR-N “reproducibility” analysis. RAICAR-N enables the objective detection of “reproducible components” in any component based analysis of fMRI data such as ICA and can also be used for a between group comparison of “reproducible” ICs.

Supporting Information

Appendix S1

Background on group comparison of ICA results.

(TEX)

Acknowledgments

We would also like to thank Dr. Christian Beckmann for making the IC image files from his 2005 paper [8] available to us.

Footnotes

Competing Interests: The authors have declared that no competing interests exist.

Funding: This work was supported by the National Institutes of Health (NIH) under the grants K24NS064050 (to DB) and R01NS065051 (to DB). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Yang Z, LaConte S, Weng X, Hu X. Ranking and Averaging Independent Component Anal-ysis by Reproducibility (RAICAR). Human Brain Mapping. 2008;29:711–725. doi: 10.1002/hbm.20432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Jutten C, Herault J. Blind separation of sources, Part I: An adaptive algorithm based on neuromimetic architecture. Signal Processing. 1991;24:1–10. [Google Scholar]
  • 3.Comon P. Independent Component Analysis, a new concept? Signal Processing. 1994;36:287–314. [Google Scholar]
  • 4.Bell AJ, Sejnowski TJ. An information-maximization approach to blind separation and blind deconvolution. Neural Computation. 1995;7:1129–1159. doi: 10.1162/neco.1995.7.6.1129. [DOI] [PubMed] [Google Scholar]
  • 5.Attias H. Independent factor analysis. Neural Computation. 1999;11:803–851. doi: 10.1162/089976699300016458. [DOI] [PubMed] [Google Scholar]
  • 6.McKeown MJ, Makeig S, Brown GG, Jung TP, Kindermann SS, et al. Analysis of fMRI data by blind separation into independent spatial components. Human Brain Mapping. 1998;6:160–188. doi: 10.1002/(SICI)1097-0193(1998)6:3<160::AID-HBM5>3.0.CO;2-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Tohka J, Foerde K, Aron AR, Tom SM, Toga AW, et al. Automatic independent component labeling for artifact removal in fMRI. NeuroImage. 2008;39:1227–1245. doi: 10.1016/j.neuroimage.2007.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Beckmann CF, DeLuca M, Devlin JT, Smith SM. Investigations into resting-state connectiv-ity using independent component analysis. Philos Trans R Soc Lond B (Biol Sci) 2005;360:1001–1013. doi: 10.1098/rstb.2005.1634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Calhoun VD, Adali T, Pearlson GD, Pekar JJ. A method for making group inferences from functional MRI data using independent component analysis. Human Brain Mapping. 2001;14:140–151. doi: 10.1002/hbm.1048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Beckmann CF, Smith SM. Probabilistic independent component analysis for functional mag-netic resonance imaging. IEEE Transactions on Medical Imaging. 2004;23:137–152. doi: 10.1109/TMI.2003.822821. [DOI] [PubMed] [Google Scholar]
  • 11.Cardoso J. Blind signal separation: statistical principles. Proceedings of the IEEE. 1998;9:2009–2025. [Google Scholar]
  • 12.Rao CR. A Decomposition Theorem for Vector Variables with a Linear Structure. The Annals of Mathematical Statistics. 1969;40:1845–1849. [Google Scholar]
  • 13.Theis FJ. A New Concept for Separability Problems in Blind Source Separation. Neural Computation. 2004;16:1827–1850. doi: 10.1162/0899766041336404. [DOI] [PubMed] [Google Scholar]
  • 14.Davies M. Identifiability Issues in Noisy ICA. IEEE Signal Processing Letters. 2004;11:470–473. [Google Scholar]
  • 15.Himberg A, Hyvarinen A, Esposito F. Validating the independent components of neuroimaging time-series via clustering and visualization. NeuroImage. 2004;22:1214–1222. doi: 10.1016/j.neuroimage.2004.03.027. [DOI] [PubMed] [Google Scholar]
  • 16.Esposito F, Scarabino T, Hyvarinen A, Himberg J, Formisano E, et al. Independent com-ponent analysis of fMRI group studies by self-organizing clustering. NeuroImage. 2005;25:193–205. doi: 10.1016/j.neuroimage.2004.10.042. [DOI] [PubMed] [Google Scholar]
  • 17.Kass RE, Raftery AE. Bayes factors and model uncertainty. 1993. Technical Report Report 254, March 1993. Department of Statistics, University of Washington, USA.
  • 18.Kritchman S, Nadler B. Non-Parametric Detection of the Number of Signals: Hypothesis Testing and Random Matrix Theory. IEEE Transactions on Signal Processing. 2009;57:3930–3941. [Google Scholar]
  • 19.Tipping ME. Probabilistic Principal Component Analysis. Journal of the Royal Statistical Society Series B (Statistical Methodology) 1999;61:611–622. [Google Scholar]
  • 20.Minka TP. Automatic choice of dimensionality for PCA. 2000. Technical Report 514. MIT Media Laboratory Vision and Modeling Group, MIT.
  • 21.Hastie T, Tibshirani R, Friedman JH. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York: Springer; 2009. [Google Scholar]
  • 22.Cole DM, Smith SM, Beckmann CF. Advances and Pitfalls in the Analysis and Interpretation of Resting-State FMRI Data. Frontiers in Systems Neuroscience. 2010;4 doi: 10.3389/fnsys.2010.00008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Feller W. An Introduction to Probability Theory, Vol. II. New York: Wiley; 1966. [Google Scholar]
  • 24.Wlodzimierz B. The normal distribution: Characterizations with applications. New York: Springer-Verlag; 1995. [Google Scholar]
  • 25.Eriksson J, Koivunen V. Identifiability, separability, and uniqueness of linear ICA models. Signal Processing Letters, IEEE. 2004;11:601–604. [Google Scholar]
  • 26.Eriksson J, Koivunen V. Complex random vectors and ICA models: identifiability, uniqueness, and separability. IEEE Transactions on Information Theory. 2006;52:1017–1029. [Google Scholar]
  • 27.Hyvarinen A. New approximations of differential entropy for independent component analysis and projection pursuit. Advances in Neural Information Processing Systems. 1998;10:273–279. [Google Scholar]
  • 28.Hyvarinen A. Fast and Robust Fixed-Point Algorithms for Independent Component Analysis. IEEE Transactions on Neural Networks. 1999;10(3):626–634. doi: 10.1109/72.761722. [DOI] [PubMed] [Google Scholar]
  • 29.Hyvarinen A, Karhunen J, Oja E. Independent Component Analysis. John Wiley and Sons; 2001. [Google Scholar]
  • 30.van Albada SJ, Robinson PA. Transformation of arbitrary distributions to the normal dis-tribution with application to EEG test-retest reliability. Journal of Neuroscience Methods. 2007;161:205–211. doi: 10.1016/j.jneumeth.2006.11.004. [DOI] [PubMed] [Google Scholar]
  • 31.Smith SM, Fox PT, Miller KL, Glahn DC, Fox PM, et al. Correspondence of the brains functional architecture during activation and rest. Proc Natl Acad Sci USA. 2009;106:13040–13045. doi: 10.1073/pnas.0905267106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.DeLuca M, Beckmann CF, DeStefano N, Matthews PM, Smith SM. fMRI resting state networks define distinct modes of long-distance interactions in the human brain. Neuroimage. 2005;29:1359–1367. doi: 10.1016/j.neuroimage.2005.08.035. [DOI] [PubMed] [Google Scholar]
  • 33.Kriegeskorte N, Simmons WK, Bellgowan PSF, Baker CI. Circular analysis in systems neu-roscience: the dangers of double dipping. Nature Neuroscience. 2009;12:535–540. doi: 10.1038/nn.2303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Vul E, Kanwisher N, Hanson S, Bunzl M. Foundational issues in human brain mapping. MIT Press; 2010. pp. 71–92. chapter 6. [Google Scholar]
  • 35.Zuo X, Kelly C, Adelstein JS, Klein DF, Castellanos FX, et al. Reliable Intrinsic Connec-tivity Networks: Test-Retest Evaluation Using ICA and Dual Regression Approach. NeuroImage. 2010;49:2163–2177. doi: 10.1016/j.neuroimage.2009.10.080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Harrison BJ, Pujol J, Lopez-Sola M, Hernandez-Ribas R, Deus J, et al. Consistency and functional specialization in the default mode brain network. Proc Natl Acad Sci U S A. 2008;105:9781–9786. doi: 10.1073/pnas.0711791105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ma S, Correa N, Li X, Eichele T, Calhoun V, et al. Automatic Identification of Functional Clusters in fMRI Data using Spatial Dependence. 2011. IEEE Transactions on Biomedical Engineering 2011 Sep 6 [Epub ahead of print] [DOI] [PMC free article] [PubMed]
  • 38.Ratliff D, Pieper RD. Deciding final clusters: An approach using intra and intercluster distances. Plant Ecology. 1981;48:83–86. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix S1

Background on group comparison of ICA results.

(TEX)


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES