Abstract
The extraction of information from multiple sets of data is a problem inherent to many disciplines. This is possible by either analyzing the datasets jointly as in data fusion, or separately and then combining, as in data integration. However, selecting the optimal method to combine and analyze multiset data is an ever-present challenge. The primary reason for this is the difficulty in determining the optimal contribution of each dataset to an analysis as well as the amount of potentially exploitable complementary information among datasets. In this paper, we propose a novel classification rate based technique to unambiguously quantify the contribution of each dataset to a fusion result as well as facilitate direct comparisons of fusion methods on real data and apply a new method, independent vector analysis (IVA), to multiset fusion. This classification rate based technique is used on functional magnetic resonance imaging data collected from 121 patients with schizophrenia and 150 healthy controls during the performance of three tasks. Through this application, we find that though optimal performance is achieved by exploiting all tasks, each task does not contribute equally to the result and this framework enables effective quantification of the value added by each task. Our results also demonstrate that data fusion methods are more powerful than data integration methods, with the former achieving a classification rate of 73.5 percent and the latter achieving one of 70.9 percent, a difference which we show is significant, when all three tasks are analyzed together. Finally, we show that IVA, due to its flexibility, has equivalent or superior performance compared with the popular data fusion method joint independent component analysis.
Index Terms: Functional Magnetic Resonance Imaging (fMRI), Data Fusion, ICA, IVA, Schizophrenia
I. Introduction
The collection of data from multiple sensors has become common in many fields, since each sensor is expected to provide a complementary view of the system under study [1], [2]. This raises the issue of how best to utilize each of these datasets in order to maximize the use of available information for the given task. When applied to medical imaging, the methods used to analyze data from multiple sensors fall into two broad categories: data integration and data fusion [3], [4]. In data integration, the datasets are analyzed separately and the results are combined [5]–[7]. Data fusion methods, on the other hand, analyze the datasets together, enabling exploitation of the information shared across datasets, see e.g., [4], [8]–[10]. However, there are several issues that prevent straightforward selection of the ideal method when analyzing multiple real world datasets, these include: quantification of the additive value of each dataset, determining the combination of datasets that achieves the best performance, and quantification of the joint information between datasets within an analysis.
Due to its non-invasive nature and high spatial resolution, functional magnetic resonance imaging (fMRI) data has become increasingly utilized to study both human and animal neurological activity [11]. To facilitate understanding of such a complicated system, fMRI data has been combined with multimodal data, i.e., data of different types, such as electroencephalography [12]–[17] or diffusion tensor imaging [18], [19], or with multiset data, i.e., fMRI data from different stimuli or tasks [20]–[23]. Since little is known about the relationships between datasets, it is important to minimize the underlying assumptions placed on the data while allowing the datasets to fully interact. This has lead to the popularity of data-driven fusion methods for jointly analyzing multiset fMRI data, such as the popular joint independent component analysis (jICA) [24], an extension of independent component analysis (ICA) to multiple datasets, that has been used in many other medical applications, see e.g., [25]–[27]. Despite its widespread use, jICA is a fairly constrained model, since it assumes that all datasets have identical mixing matrices, and thus may perform poorly when the modeling assumptions are not met [28]. This assumption also means that jICA inherently requires each task to contribute similarly to the result, though this may not always be the case in practice [29], [30].
In this paper, we propose a novel classification rate based technique to assess the performance of different multiset analysis methods for various combinations of datasets. We should note that perfect classification is not the goal of this work, especially since the patients with schizophrenia were receiving antipsychotic and/or mood stabilizing medication during the scanning sessions, thus making it difficult to attribute observed differences to the disease or to the medication [31]–[33]. Instead, our goal is to enable unambiguous quantification of the additive value of each dataset, determination of the combination of datasets that achieves the greatest performance, and quantification of the interaction among datasets within an analysis. Note that the proposed technique differs fundamentally from the nonparametric prediction, activation, influence, and reproducibility resampling framework [34], whose goal is to assess how well a model generalizes to new data rather than how useful a dataset is to a fusion analysis. We apply this technique to fMRI data from three tasks: an auditory oddball (AOD) task, a Sternberg item recognition paradigm (SIRP) task, and a sensorimotor (SM) task, drawn from 121 patients with schizophrenia and 150 healthy controls. In addition, we also propose the use of independent vector analysis (IVA) [35], a more recent and flexible multiset extension of ICA that exploits similarities across different datasets to achieve a successful decomposition, for fusing multitask fMRI datasets.
The remainder of the paper is organized as follows. In Section II, we introduce our classification procedure, the fMRI tasks, as well as the the features used in this study. Section III contains the results of the classification procedure on the multitask fMRI data and associated discussions. We present our conclusions in Section IV.
II. Materials and Methods
A. Feature Extraction
Since the timing of each task is different, it is difficult to directly fuse multitask fMRI data. Instead, for each task and subject, the data from each voxel is analyzed using a simple linear regression using the statistical parametric mapping toolbox (SPM) [36], where the regressors are created by convolving the hemodynamic response function (HRF) in SPM with the desired predictors. Note that for these regressions, the noise was assumed to be Gaussian and uncorrelated. The resulting regression coefficient maps or “features” are then used for each subject and task. This reduction using a lower-dimensional yet still multivariate representation of the data facilitates exploration of the associations across these feature sets, see e.g., [4], [20], [37]. Such an investigation provides a natural way to discover links across tasks as well as simplifying the identification of biomarkers of disease. See Section II-D for greater detail about the features used in this study.
B. jICA/IVA
Given an fMRI feature dataset from M subjects, X ∈ ℝM×V, where the mth row of X is formed by flattening the feature of V voxels from the mth subject, the noiseless ICA model can be written as
| (1) |
where the C latent sources or neural activation patterns, s ∈ ℝC, are linearly mixed by the mixing matrix, A ∈ ℝM×C. By assuming spatial independence on the part of the latent sources, ICA seeks to estimate a demixing matrix, W, such that the estimated components are given by ŝ(v) = Wx(v). For simplicity, we will drop the sample indices in the rest of the discussion. Due to the fact that we seek to maximize independence between the components, ŝ1, …, ŝC, estimation of the demixing matrix can be accomplished through the minimization of their mutual information, written as
| (2) |
where H(·) is the entropy.
Since it is usually assumed that the number of subjects is larger than the number of latent sources, ICA is generally performed following preprocessing using principal component analysis (PCA). Determining the appropriate order for the PCA preprocessing step for fMRI data is an active area of research, see e.g, [38]–[43]. Despite its widespread study, the majority of order selection techniques have been optimized for single subject or single task analyses, potentially limiting their applicability to multiset fusion. In this work, we used the order estimated using a modification to the procedure in [44] because, to the best of our knowledge, it is the only method that has shown desirable performance for the task of order estimation for data fusion of medical imaging data. The technique, named PCA and canonical correlation analysis (PCA-CCA), estimates a common order for two datasets through a series of binary hypothesis tests. The process is begun by assuming that the common order is 0 and the null hypothesis is that the current common order is appropriate. If the null hypothesis is rejected, the common order is increased by 1 and the binary hypothesis test is performed again. This process is repeated until the null hypothesis cannot be rejected or the common dimension is equal to M/2. In this application, we estimated the order for each pairwise combination of datasets and then selected the highest estimated order to enable the retention of the most signal.
Though there are many ICA algorithms, in this work, we used ICA-EBM, due to the fact that it has shown superior performance in both simulated and real neurological data when compared with the popular Infomax algorithm [45], see e.g., [30], [46], [47]. This performance is due to the fact ICA-EBM does not assume any distribution for the latent sources and instead attempts to upper bound their entropy through the use of several measuring functions [46]. These measuring functions describe a wide variety of distributions, including those that are: unimodal, bimodal, symmetric, and skewed [46], thus generally leading to accurate estimation of all sources within the mixture.
Note that the columns of the estimated mixing matrix, Â, provide the loadings of each component across subjects. Thus, the pth column of the estimated mixing matrix, âp, represents the relative weights of the pth source estimate, ŝp, for each corresponding subject. Since the dataset has been reduced to a feature for each subject, to look for differences in the expression of components between two groups, a two-sample t-test can be performed on the subject covariations, where one group is represented by the subject covariations from the patients with schizophrenia and the other by the subject covariations from the healthy controls [4].
The model in (1) can be extended in a straightforward manner to K datasets, or tasks, as
| (3) |
Due to the inherent scaling and permutation ambiguities of ICA, running an ICA individually for each task and aligning the results is impractical and suboptimal. For this reason and to exploit interactions across different tasks, one may assume that each task is mixed with the same mixing matrix, A, which enables the solution of the problem posed in (3) through the performance of a single ICA in which the sources from the disparate datasets form underlying “joint sources” [13], [20]. Joint estimation of the sources, and hence, the determination of the individual components, can then be achieved through the performance of a single ICA on the horizontally concatenated X[k] defined as
| (4) |
This method, referred to as jICA [24], is one of the most popular multitask fMRI data fusion methods, see e.g., [20], [29]. However, jICA is reliant on the assumption that each dataset has the same mixing matrix, thus it may perform poorly when this is not the case [28].
An alternative to jICA for two datasets is canonical correlation analysis (CCA) [48] or its extension to multiple datasets, multiset CCA (MCCA) [49], which maximizes the correlation of sources across datasets. Neither CCA nor MCCA restrict their solution space by assuming that each mixing matrix is identical, but rather estimate a separate mixing matrix, Â[k], for each dataset simultaneously. Because of their less restrictive forms, CCA, MCCA, and related methods have successfully been applied to multisubject analyses, see e.g., [50]–[54], as well as the fusion of multimodal neurological data, see e.g., [10], [18], [55]. Despite their widespread use, CCA and MCCA have two important limitations that can hurt their applicability to real data: first, both CCA and MCCA only exploit second-order statistics and second, both CCA and MCCA limit the solution space by assuming that the demixing matrices are orthogonal. A way to overcome the shortcomings of CCA and MCCA is through the use of IVA, a recent multiset extension of ICA, which exploits similarities across datasets to achieve a successful decomposition. Unlike CCA or MCCA, IVA does not limit the demixing matrices to be orthogonal and can take advantage of second as well as higher order statistics [56]. This flexibility has lead to the use of IVA in multisubject fMRI analyses, see e.g., [57]–[59], however, to our knowledge, it has not yet been applied to the problem of fusing multitask feature data.
The IVA formulation requires the definition of the cth source component vector (SCV) as , i.e., by concatenating the cth component from each of the datasets. Given the definition of an SCV we can write the mutual information cost function for IVA in a similar manner to that of ICA,
| (5) |
The difference between (2) and (5) is that we are now minimizing the mutual information between SCVs and not components. Rewriting (5), we have
| (6) |
The new term, , in (6) is the mutual information within the SCV and is being maximized through the minimization of (6). Thus, we see how IVA exploits the shared information between across datasets, when it exists, since if the term is not present, (6) reduces to individual ICAs on each dataset separately [56].
C. Classification Procedure
Since there is no ground truth for the components or subject covariations, we propose to use classification rate to determine the relative performance of different techniques on real multitask fMRI datasets. We use this as our metric for two reasons: first, classification rate has an unambiguous meaning that can be defined for different combinations of datasets and analysis techniques, and second, classification rate effectively encapsulates the total discriminative power, between patients and controls. A diagram of the classification procedure is shown in Figure 1 and the process is described below.
Fig. 1.

Classification process for a single feature dataset. For the case where multiple datasets are analyzed, the ICA step is replaced with either jICA or IVA, performed on the concatenated feature datasets or the collection of feature datasets, respectively. The procedure is as follows: (a) the data is split into a training set, XTrain, and a test set, XTest. (b) the training dataset is dimension reduced using PCA, ICA is run, and the discriminatory components, , and corresponding subject covariations, ÃTrain, are selected. (c) in the final stage, ÃTrain is used to train the classifier, is regressed onto XTest producing ÃTest, and ÃTest is used to test the classifier. This process is repeated N times and the mean classification rate is evaluated. Note that for jICA there is a single joint set of subject covariations, and . for IVA and well as the data integration technique, there are dataset specific subject covariations and ; however, for IVA the subject covariations are derived by fusing information across all datasets whereas for the data integration technique the subject covariations are extracted from each dataset individually.
We will describe the classification process for a single dataset using ICA, since the extension to multiple datasets and different analysis techniques is, as we will show, straightforward. The first step is randomly subsampling 190 subjects from the original fMRI feature dataset, X, in order to produce a dataset to train the classifier, XTrain. The feature data from the remaining 81 subjects is formed into XTest, which will be used to test the trained classifier. In order to reduce any bias introduced in the random selection of subjects, the proportion of patients and controls is kept the same in X, XTrain, and XTest. In the next step, PCA, using the order specified using [44], then ICA, using the entropy bound minimization (ICA-EBM) algorithm [46], are performed on XTrain. Following ICA-EBM, a two-sample t-test is performed on each column of the estimated subject covariations, ÂTrain, and those that are declared significant, i.e., p < 0.05, as well as the corresponding spatial maps are formed into ÃTrain and , respectively.
The final stage of the classification process begins with the training of a classifier, such as a radial basis function kernel support vector machine (KSVM) using the rows of ÃTrain. Next, ÃTest is found by regressing onto XTest and is used to test the classifier produced using ÃTrain. The value of the kernel parameter, if any, is selected by computing the average classification rate for 800 independent Monte-Carlo subsamplings of the data with different kernel parameters and finding the value with the highest average classification rate. Once the optimal value of the kernel parameter is selected, the classification procedure is run on N independent Monte-Carlo samplings of the data and the mean classification rate is computed. In our results, we use 200 as the value of N.
For jICA, the process is identical to the one described above, except X is composed of the concatenated feature datasets. With IVA, the process is similar to the one described above, except the two-sample t-test is run on every column of all estimated mixing matrices, , and the columns showing a significant difference between the patients and controls are all concatenated together when training the classifier. In order to compare these two data fusion methods with a technique that does not analyze the data sets jointly, i.e., data integration, we took the same setup from IVA, but replaced the IVA step with a separate ICA analysis for each feature dataset individually. In this work, we use the IVA using a multivariate Gaussian and Laplacian (IVA-GL) algorithm [60] since it has been shown to be an effective IVA algorithm for analysis of fMRI data, see e.g., [60], [61]. Note that to facilitate comparisons between methods and remove possible confounding from the use of different orders for different methods, the same order, 24, was used in the PCA step for all methods. Possible effects of the order on classification performance were studied by repeating the ICA analyses for different orders and no statistically significant differences, after a Bonferroni correction, were observed between the classification rates at different orders.
Note that if the classifier requires the determination of a parameter, such as the value of kernel parameter for KSVM, the selection of the optimal value is done for each method individually. One final note regarding the computational complexity, since T ≫ C, K, by far the most time consuming step in the proposed framework is the jICA/IVA step. The computational complexity is approximately per iteration and approximately per iteration for jICA using EBM and IVA-GL, respectively.
D. FMRI Tasks and Features
The datasets used in this study are from the Mind Research Network Clinical Imaging Consortium Collection (publicly available at http://coins.mrn.org). These datasets were obtained from 271 subjects, 150 healthy controls and 121 patients with schizophrenia. Next, we briefly introduce the tasks used in this study as well as the multivariate features extracted from each task.
1) Auditory Oddball Task (AOD)
This auditory task involved subjects listening to three different types of auditory stimuli: standard (1 kHz tones occurring with probability 0.82), novel (computer generated, complex sounds occurring with probability 0.09), and target (1.2 kHz tones with probability 0.09, to which a right thumb button press was required), in a pseudo-random order [62], [63]. Each run consisting of 90 stimuli each with a 200 ms duration and a randomly changing interstimulus interval of 550–2,050 ms. The order of novel and target stimuli was changed between runs to ensure that the responses did not depend on the stimulus order [63]. In addition, the sequences of stimuli were designed such that they would produce orthogonal blood-oxygen-level dependent responses for each of the three stimuli [64], [65]. For this task, the regressor was created by modeling the target and standard stimuli as delta functions convolved with the default SPM HRF in addition to their temporal derivatives [22]. Subject averaged contrast images between the target versus the standard tones were used as the feature for this task.
2) Sternberg Item Recognition Paradigm Task (SIRP)
In this visual task, the subjects had to remember a set of 1, 3, or 5 randomly chosen integers between 0 and 9. The task paradigm consisted of: a 1.5 second learn condition, a blank screen for 0.5 seconds, a 6 second encode condition, where the sequence of digits was presented together, and a 38 second probe condition, where the subject was shown a series of integers and had to indicate, with a button press with the right thumb, whether it was a member of the memorized set or not [63]. Each probe digit was presented for up to 1.1 seconds in a pseudo-randomly jittered fashion within a 2.7 second interval [63]. A total of 84 probes, 42 targets and 42 foils was obtained per scan and the prompt-encode-probe conditions were run twice for each set size in a pseudo-random order [63]. For this task, the regressor was created by convolving the probe response block for the three digit set with the default SPM HRF [22]. This was done for both runs of the probe response and the average map was used as the feature for this task.
3) Sensory Motor Task (SM)
In this auditory task, the subjects were presented with a sequence of auditory stimuli consisting of 16 different tones each lasting 200 ms and ranging in frequency from 236 Hz to 1,318 Hz with a 500 ms inter-stimulus interval [63]. The first tone presented was set at the lowest pitch and each subsequent tone was higher than the previous one until the highest tone was reached and which point the order of the tones was reversed [63]. Each tonal change required a button press with the right thumb. A total of 15 increase-and-decrease blocks were alternated with 15 fixation blocks, with each block lasting 16 seconds in duration [63]. For this task, the regressor was created by convolving the entire increase-and-decrease block with the default SPM HRF [22]. The responses for each subject were averaged over the separate runs and the average map was used as the feature for this task.
III. Results and Discussion
The classification results for the individual multitask datasets as well as their combinations, using KSVM, are shown in Figure 2. We note that though we only show the results of KSVM in this paper, the trends that we observe are similar to those observed when either k-nearest neighbors or Fisher discriminant analysis are used as the classifier, though the actual classification rates are different. It also should be noted that perfect classification is not primary goal of this work, but rather to determine the comparative advantages of different combinations of fMRI tasks and multiset analysis techniques. Several noteworthy trends can be observed in Figure 2. First, we can see that equivalent or improved classification rate is achieved when multiple datasets are jointly analyzed than when only a single dataset is analyzed. This is due to the fact that there is more total discriminative power available in a combination of multiple datasets than there is in any of the datasets individually. Additionally, the AOD dataset has the highest classification rate of all of the individual dataset analyses, while the SM dataset has the lowest. This means that, when analyzed separately, the AOD dataset is best able to differentiate between patients and controls, while the SM dataset has more difficulty. We also note that there is a clear advantage to be found through the use of data fusion, i.e., jICA and IVA-GL, over the use of data integration, i.e., combined ICAs, in all cases, except where only the AOD and SIRP datasets are used, where equivalent performance is achieved. This shows that the results for jICA and IVA-GL are more than the “sum of their parts,” hence, emphasizing importance of allowing datasets to fully interact with each other in an analysis [2], [66]. Additionally, in all cases IVA-GL performs better than or is statistically equivalent to jICA, showing the advantages of using a less constrained model when the signal-to-noise ratio is relatively high [28]. The strong performance of IVA-GL when compared with the ICA-based techniques is noteworthy since the algorithm used for the ICA-based techniques was EBM, which is able to fit a much larger range of latent source distributions than IVA-GL, thus showing the power of methods that exploit complementary information across datasets. Since IVA-GL is best able to summarize the total discriminative power of a combination of datasets, we can use the difference in classification rate between a two dataset analysis and the three dataset analysis to assess the contribution of the third dataset on the fusion result. For example, the additional contribution of the SM dataset is the difference between the classification rate of IVA-GL using the AOD and SIRP datasets and the classification rate of IVA-GL using all three datasets. Based on this, the additional contribution of AOD is 4.5 percent, the additional contribution of SIRP is 1.7 percent, and the additional contribution of SM is 5.5 percent. Thus, the SM and AOD datasets contribute much more exploitable information than the SIRP dataset. In order to provide a qualitative assessment of the differences observed in Figure 2, 2-sample t-tests were performed on the classification rates and the results are shown in Table I.
Fig. 2.

Average classification results using KSVM for individual datasets and combinations of datasets using either data fusion, with jICA and IVA-GL, or data integration using combined ICAs. The first three points from the left refer to the case where only one dataset is analyzed. The fourth, fifth, and sixth points from the left refer to combinations of two datasets. The rightmost point shows the classification performance when all three task datasets are jointly analyzed. Note that error bars are omitted for clarity, since the largest value of the standard error is 0.0035.
TABLE I.
Significance of the difference in classification rate using KSVM of IVA-GL compared to the data integration technique, combined ICAs, and the popular data fusion method, jICA. Statistical significance is assessed through a two-sample t-test performed on the classification rates obtained for 200 independent subsamplings of the fMRI feature datasets. If the difference corresponds to p ≥ 0.05, the result is marked “not significant” (NS). Note that there is no correction for multiple comparisons, but all significant differences would remain so after the conservative Bonferroni correction.
| Datasets | Significance of Difference (p-values)
|
|
|---|---|---|
| IVA-GL to Combined ICAs | IVA-GL to JICA | |
| AOD and SIRP | NS | NS |
| AOD and SM | 1.0×10−10 | NS |
| SIRP and SM | 1.4 × 10−16 | 9.5 ×10−5 |
| All Three Datasets | 2.3 × 10−9 | 8.5 × 10−4 |
As can be seen from Table I, all of the major differences observed visually in Figure 2 are statistically significant. No correction for multiple comparisons is performed, however we note that all of the observed differences remain statistically significant even after the conservative Bonferroni correction. An interesting result that can be seen in both Figure 2 and Table I is that for the combination of AOD and SIRP feature datasets, there is no statistical difference in the classification rates of IVA-GL compared with that of the combined ICAs. This means that there is no measurable difference in performing data fusion or data integration for this combination of datasets. These results suggest that there is little joint information that is exploited between the AOD and SIRP datasets. On the other hand, there is significant improvement in performance when fusing the AOD and SM datasets, implying that there is a significant amount of joint information between the AOD and SM datasets.
In order to probe this difference, we ran both IVA-GL and the data integration technique on all of the data 10 times and selected a run from each method for comparison using the minimum spanning tree (MST)-based method described in [67]. The MST-based method works by aligning components across different runs of the algorithm, then selecting the run to be used for comparison by finding the run that has the highest correlation with the T-map constructed from a one-sample t-test run on the aligned components. The reason that the components from this run are used is two-fold. First, each of the results of a method on the subsampled data may be seen as rough approximations of the results of the same method on the whole dataset, since we are greatly reducing the dimension of the datasets using PCA prior to performing the analysis. Second, since both the ICA and IVA algorithms are iterative in nature, there is a natural need to determine a result that best represents the average decomposition of a given set of data. For this reason, we use the MST-based method described in [67], since it has shown superior performance to the popular method of ICASSO [68]. Using the run selected by the MST-based method, we find the p-values by performing a two-sample t-test on each column of the estimated mixing matrices of that run, where one group is the subject covariations of the controls and the other is the subject covariations of the patients. Those p-values that pass a significance threshold of p < 0.05 are declared statistically significant and the corresponding z-scored components are aligned between the data fusion and data integration techniques. Two components are considered aligned if they have a sample Pearson correlation coefficient above 0.5, which we find matches with the results produced through visual alignment. The results are shown in Figures 3 and 4.1
Fig. 3.

Statistically significant components for the combination of the AOD and SIRP datasets. (a) the significant components obtained through the use of data fusion, using IVA-GL, are shown in the first two rows. The third and fourth rows contain those significant components obtained through the use of data integration, using ICA-EBM, that have a correlation above 0.5 with the components obtained using IVA-GL. The aligned components are in the same column. Those components obtained using ICA-EBM that do not have a correlation above 0.5 with any of the components obtained using IVA-GL are shown in the (b) and (c) for the AOD and SIRP datasets, respectively. The maps have been flipped such that the activation (red and orange) represents an increase in controls over patients and deactivation (blue) corresponds to a decrease in controls versus patients. The p-values for each component are located above the corresponding spatial map and those that remain significant after a Bonferroni correction are displayed in green. All spatial maps are Z-maps thresholded at Z=2.7.
Fig. 4.

Statistically significant components for the combination of the AOD and SM datasets. (a) the significant components obtained through the use of data fusion, using IVA-GL, are shown in the first two rows. The third and fourth rows contain those significant components obtained through the use of data integration, using ICA-EBM, that have a correlation above 0.5 with the components obtained using IVA-GL. The aligned components are in the same column. Those components obtained using ICA-EBM that do not have a correlation above 0.5 with any of the components obtained using IVA-GL are shown in the (b) and (c) for the AOD and SM datasets, respectively. The maps have been flipped such that the activation (red and orange) represents an increase in controls over patients and deactivation (blue) corresponds to a decrease in controls versus patients. The p-values for each component are located above the corresponding spatial map and those that remain significant after a Bonferroni correction are displayed in green. All spatial maps are Z-maps thresholded at Z=2.7.
There are two trends which become clear from the comparison of Figures 3 and 4. The first is that the proportion of components that are declared significant in the IVA-GL results which are not seen in the ICA results is much higher for the combination of AOD and SM than it is for AOD and SIRP. This would imply that the IVA-GL decomposition shows greater difference for the combination of AOD and SM, i.e., there is greater joint information that is exploited. This helps explain the significant improvement in the classification rate of the data fusion technique over the data integration technique for the combination of AOD and SM as well as the reason for its absence in the combination of AOD and SIRP. We should note that the regions found to be significantly different in patients and controls that do not correspond to the ventricles and are found for both IVA and the individual ICAs in Figure 3 correspond to: medial visual cortex, fronto-insular cortex, thalamus, cerebellum, left fronto-parietal cortex, motor cortex, visual cortex, and sensorimotor cortex. These regions have previously been shown to differentiate patients with schizophrenia from controls, see e.g., [69]–[74]. Similarly, the regions found activating significantly differently in patients and controls in Figure 4 correspond to the: medial visual cortex, default mode network (DMN), motor cortex, thalamus, left fronto-parietal cortex, cerebellum, visual cortex, and sensorimotor cortex. These regions have also been previously noted as being impacted by schizophrenia, see e.g., [69]–[75], consistent the idea of cortico-cerebellar-thalamic-cortical circuit disruption or cognitive dysmetria in schizophrenia [76], thus increasing our overall confidence in the results of this analysis. In addition to these, we also have differentiating components with spatial activations corresponding to the posterior parietal lobe in Figure 3 and the temporal lobe and right fronto-parietal lobe in Figure 4 that are seen in the IVA results and not in the ICA results. These regions have also been shown previously to differentiate patients from controls, see e.g., [69], [74], and they, coupled with the higher, on average, significance values for the components show the advantages of a fusion analysis over individual analyses.
The second trend that can be seen from both Figure 3 and Figure 4 is that there are multiple significant components that are found in the single dataset analyses and not found in the fusion results. This is due to the fact that the fusion results report joint components, thus they may mask weaker individual components [77]. The regions found in the ICA results and not in the IVA results correspond to the cerebellum, DMN, fronto-parietal cortex, and medial-visual cortex in Figure 3 and to the cerebellum, DMN, temporal cortex, and frontal lobe in Figure 4. These regions have been shown to be affected by schizophrenia, see e.g., [69], [70], [74], [75] and their absence from the IVA results demonstrates the importance of performing a comparison between the fusion results with the ICA results in order determine the joint as well as dataset-specific information.
These results also motivate the use of the proposed framework for automated feature selection. Determining optimal features from different modalities\tasks is a continuing challenge in many data fusion studies, see e.g., [4], [20], [23], [24], [62], [78]. If the goal of a fusion study is the determination of differences between a set of groups, the technique developed in this paper can be used for automated selection of optimal features based on the combination of features that achieves the highest classification rate.
IV. Conclusions
It has become increasingly common for fMRI data from multiple stimuli or tasks to be collected in medical studies, since each condition is expected to provide a complementary view of the problem under study. This raises the issue of how to most effectively combine datasets from different subjects and tasks. In this paper, we have developed a data-driven classification procedure to determine the performance of different multitask analysis techniques as well as the discriminatory information that a new dataset introduces. We have applied this new method to real multitask fMRI data drawn form patients with schizophrenia as well as healthy controls. Through this application, we find that though we achieve the best performance by exploiting all tasks, each task does not contribute equally to the result and we quantify the value added by each task. Our results show that data fusion methods, through their exploitation of shared information across datasets, are more powerful than data integration methods, which do not take advantage of such information. We also show that the more flexible IVA-GL has equivalent or superior performance compared with the popular data fusion method joint independent component analysis (jICA). Finally, it is important to note that the proposed framework can be applied, not only to fMRI data from multiple tasks, but also to explore the interactions among data from different sensors, i.e., multimodal data, thus enabling quantification of the additive value of each modality in the analysis. However, for such data, rather than modeling the spatial maps directly, the subject covariations should instead be modeled using the tIVA framework [28].
Supplementary Material
Acknowledgments
This work was supported by the following grants: NIH-NIBIB R01 EB 005846 and NSF-CCF 1618551.
Footnotes
An exploration of how the IVA and combined ICAs relate to the jICA results can be found in the supplementary materials.
References
- 1.James AP, Dasarathy BV. Medical image fusion: A survey of the state of the art. Information Fusion. 2014;19:4–19. [Google Scholar]
- 2.Lahat D, Adalı T, Jutten C. Multimodal data fusion: An overview of methods, challenges, and prospects. Proceedings of the IEEE. 2015 Sep;103(9):1449–1477. [Google Scholar]
- 3.Savopol F, Armenakis C. Merging of heterogeneous data for emergency mapping: Data integration or data fusion? International Archives of Photogrammetry Remote Sensing and Spatial Information Sciences. 2002;34(4):668–674. [Google Scholar]
- 4.Calhoun VD, Adalı T. Feature-based fusion of medical imaging data. IEEE Trans Inf Technol Biomed. 2009 Sep;13(5):711–720. doi: 10.1109/TITB.2008.923773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhang D, Wang Y, Zhou L, Yuan H, Shen D, Alzheimer’s Disease Neuroimaging Initiative Multimodal classification of Alzheimer’s disease and mild cognitive impairment. NeuroImage. 2011;55(3):856–867. doi: 10.1016/j.neuroimage.2011.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Westman E, Muehlboeck JS, Simmons A. Combining MRI and CSF measures for classification of Alzheimer’s disease and prediction of mild cognitive impairment conversion. NeuroImage. 2012;62(1):229–238. doi: 10.1016/j.neuroimage.2012.04.056. [DOI] [PubMed] [Google Scholar]
- 7.Wee CY, Yap PT, Zhang D, Denny K, Browndyke JN, Potter GG, Welsh-Bohmer KA, Wang L, Shen D. Identification of MCI individuals using structural and functional connectivity networks. NeuroImage. 2012;59(3):2045–2056. doi: 10.1016/j.neuroimage.2011.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Arndt C, Loffeld O. Information gained by data fusion. Lasers, Optics, and Vision for Productivity in Manufacturing I International Society for Optics and Photonics. 1996:32–40. [Google Scholar]
- 9.Choi K, Craddock RC, Holtheiemer PE, Yang Z, Hu X, Mayberg H. A combined functional-structural connectivity analysis of major depression using joint independent components analysis. Intl Soc Mag Reson Med. 2008 May;:3555. [Google Scholar]
- 10.Correa NM, Adalı T, Li YO, Calhoun VD. Canonical correlation analysis for data fusion and group inferences. IEEE Signal Processing Magazine. 2010;27(4):39–50. doi: 10.1109/MSP.2010.936725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Logothetis NK. What we can do and what we cannot do with fMRI. Nature. 2008;453(7197):869–878. doi: 10.1038/nature06976. [DOI] [PubMed] [Google Scholar]
- 12.Martinez-Montes E, Valdes-Sosa PA, Miwakeichi F, Goldman RI, Cohen MS. Concurrent EEG/fMRI analysis by multiway partial least squares. NeuroImage. 2004;22:1023–1034. doi: 10.1016/j.neuroimage.2004.03.038. [DOI] [PubMed] [Google Scholar]
- 13.Calhoun VD, Adalı T, Pearlson G, Kiehl K. Neuronal chronom-etry of target detection: Fusion of hemodynamic and event related potential data. NeuroImage. 2006;30:544–553. doi: 10.1016/j.neuroimage.2005.08.060. [DOI] [PubMed] [Google Scholar]
- 14.Valdes-Sosa PA, Sanchez-Bornot JM, Sotero RC, Iturria-Medina Y, Aleman-Gomez Y, Bosch-Bayard J, Carbonell F, Ozaki T. Model driven EEG/fMRI fusion of brain oscillations. Human Brain Mapping. 2009;30:2701–2721. doi: 10.1002/hbm.20704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Mijovic B, Vanderperren K, Novitskiy N, Vanrumste B, Stiers P, den Bergh BV, Lagae L, Sunaert S, Wagemans J, Huffel SV, Vos MD. The “why” and “how” of joint ICA: Results from a visual detection task. NeuroImage. 2012;60(2):1171–1185. doi: 10.1016/j.neuroimage.2012.01.063. [DOI] [PubMed] [Google Scholar]
- 16.Youssofzadeh V, Faye I, Malik AS, Reza F, Kamel N, Abdullah JM. Comparison of independent component analysis algorithms for EEG-fMRI data fusion. 4th International Conference on Intelligent and Advanced Systems (ICIAS) 2012 Jun;2:676–679. [Google Scholar]
- 17.Gibson RM, Fernández-Espejo D, Gonzalez-Lara LE, Kwan BY, Lee DH, Owen AM, Cruse D. Multiple tasks and neuroimaging modalities increase the likelihood of detecting covert awareness in patients with disorders of consciousness. Front Hum Neurosci. 2014;8(950):1–9. doi: 10.3389/fnhum.2014.00950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Sui J, He H, Pearlson GD, Adalı T, Kiehl KA, Yu Q, Clark VP, Castro E, White T, Mueller BA, Ho BC, Andreasen NC, Calhoun VD. Three-way (N-way) fusion of brain imaging data based on mCCA+jICA and its application to discriminating schizophrenia. NeuroImage. 2013;66:119–132. doi: 10.1016/j.neuroimage.2012.10.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Eshaghi A, Riyahi-Alam S, Saeedi R, Roostaei T, Nazeri A, Aghsaei A, Doosti R, Ganjgahi H, Bodini B, Shakourirad A, Pakravan M, Ghana’ati H, Firouznia K, Zarei M, Azimi AR, Sahraian MA. Classification algorithms with multi-modal data fusion could accurately distinguish neuromyelitis optica from multiple sclerosis. NeuroImage: Clinical. 2015;7:306–314. doi: 10.1016/j.nicl.2015.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Calhoun VD, Adalı T, Kiehl KA, Astur R, Pekar JJ, Pearlson GD. A method for multitask fMRI data fusion applied to schizophrenia. Human Brain Mapping. 2006;27:598–610. doi: 10.1002/hbm.20204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fitzgerald PB, Srithiran A, Benitez J, Daskalakis ZZ, Oxley TJ, Kulkarni J, Egan GF. An fMRI study of prefrontal brain activation during multiple tasks in patients with major depressive disorder. Human Brain Mapping. 2008;29(4):490–501. doi: 10.1002/hbm.20414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Michael AM, Baum SA, Fries JF, Ho BC, Pierson RK, Andreasen NC, Calhoun VD. A method to fuse fMRI tasks through spatial correlations: Applied to schizophrenia. Human Brain Mapping. 2009;30(8):2512–2529. doi: 10.1002/hbm.20691. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ramezani M, Marble K, Trang H, Johnsrude IS, Abolmaesumi P. Joint sparse representation of brain activity patterns in multi-task fMRI data. IEEE Transactions on Medical Imaging. 2015;34(1):2–12. doi: 10.1109/TMI.2014.2340816. [DOI] [PubMed] [Google Scholar]
- 24.Calhoun VD, Adalı T, Giuliani NR, Pekar JJ, Kiehl KA, Pearlson GD. Method for multimodal analysis of independent source differences in schizophrenia: Combining gray matter structural and auditory oddball functional data. Human Brain Mapping. 2006;27(1):47–62. doi: 10.1002/hbm.20166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Brookings T, Ortigue S, Grafton S, Carlson J. Using ICA and realistic BOLD models to obtain joint EEG/fMRI solutions to the problem of source localization. NeuroImage. 2009;44(2):411–420. doi: 10.1016/j.neuroimage.2008.08.043. [DOI] [PubMed] [Google Scholar]
- 26.Mangalathu-Arumana J, Beardsley SA, Liebenthal E. Within-subject joint independent component analysis of simultaneous fMRI/ERP in an auditory oddball paradigm. NeuroImage. 2012;60(4):2247–2257. doi: 10.1016/j.neuroimage.2012.02.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yuan Z, Ye J. Fusion of fNIRS and fMRI data: identifying when and where hemodynamic signals are changing in human brains. Frontiers in Human Neuroscience. 2013;7:1–9. doi: 10.3389/fnhum.2013.00676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Adalı T, Levin-Schwartz Y, Calhoun VD. Multi-modal data fusion using source separation: Two effective models based on ICA and IVA and their properties. Proceedings of the IEEE. 2015;103(9):1478–1493. doi: 10.1109/JPROC.2015.2461624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ramezani M, Abolmaesumi P, Marble K, Trang H, Johnsrude I. Fusion analysis of functional MRI data for classification of individuals based on patterns of activation. Brain Imaging and Behavior. 2014;9(2):149–161. doi: 10.1007/s11682-014-9292-1. [DOI] [PubMed] [Google Scholar]
- 30.Adalı T, Levin-Schwartz Y, Calhoun VD. Multi-modal data fusion using source separation: Application to medical imaging. Proceedings of the IEEE. 2015;103(9):1494–1506. doi: 10.1109/JPROC.2015.2461624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lui S, Li T, Deng W, Jiang L, Wu Q, Tang H, Yue Q, Huang X, Chan RC, Collier DA, Meda SA, Pearlson G, Mechelli A, Sweeney JA, Gong Q. Short-term effects of antipsychotic treatment on cerebral function in drug-naive first-episode schizophrenia revealed by resting state functional magnetic resonance imaging. Archives of General Psychiatry. 2010;67(8):783–792. doi: 10.1001/archgenpsychiatry.2010.84. [DOI] [PubMed] [Google Scholar]
- 32.Ho BC, Andreasen NC, Ziebell S, Pierson R, Magnotta V. Long-term antipsychotic treatment and brain volumes: a longitudinal study of first-episode schizophrenia. Archives of General Psychiatry. 2011;68(2):128–137. doi: 10.1001/archgenpsychiatry.2010.199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Meng X, Jiang R, Lin D, Bustillo J, Jones T, Chen J, Yu Q, Du Y, Zhang Y, Jiang T, Sui J, Calhoun VD. Predicting individualized clinical measures by a generalized prediction framework and multimodal fusion of MRI data. NeuroImage. 2017;145:218–229. doi: 10.1016/j.neuroimage.2016.05.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Strother SC, Anderson J, Hansen LK, Kjems U, Kustra R, Sidtis J, Frutiger S, Muley S, LaConte S, Rottenberg D. The quantitative evaluation of functional neuroimaging experiments: the NPAIRS data analysis framework. NeuroImage. 2002;15(4):747–771. doi: 10.1006/nimg.2001.1034. [DOI] [PubMed] [Google Scholar]
- 35.Kim T, Eltoft T, Lee TW. Independent vector analysis: An extension of ICA to multivariate components. In: Rosca J, Erdogmus D, Prncipe J, Haykin S, editors. Independent Component Analysis and Blind Signal Separation, ser Lecture Notes in Computer Science. Vol. 3889. Springer; Berlin/Heidelberg: 2006. pp. 165–172. [Google Scholar]
- 36.SPM5. Statistical Parametric Mapping. 2011 http://www.fil.ion.ucl.ac.uk/spm/software/spm5.
- 37.Ramezani M, Rasoulian A, Hollenstein T, Harkness K, Johnsrude I, Abolmaesumi P. Joint source based analysis of multiple brain structures in studying major depressive disorder. Proc SPIE. 2014;9034:1–6. [Google Scholar]
- 38.Cordes D, Nandy RR. Estimation of the intrinsic dimensionality of fMRI data. NeuroImage. 2006;29(1):145–154. doi: 10.1016/j.neuroimage.2005.07.054. [DOI] [PubMed] [Google Scholar]
- 39.Li YO, Adalı T, Calhoun VD. Estimating the number of independent components for functional magnetic resonance imaging data. Human Brain Mapping. 2007;28(11):1251–1266. doi: 10.1002/hbm.20359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Xie X, Cao Z, Weng X, Jin D. Estimating intrinsic dimensionality of fMRI dataset incorporating an AR(1) noise model with cubic spline interpolation. Neurocomputing. 2009;72(4):1042–1055. [Google Scholar]
- 41.Chen S, Ross TJ, Chuang KS, Stein EA, Yang Y, Zhan W. A new approach to estimating the signal dimension of concatenated resting-state functional MRI data sets. Magnetic Resonance Imaging. 2010;28(9):1344–1352. doi: 10.1016/j.mri.2010.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Yourganov G, Chen X, Lukic AS, Grady CL, Small SL, Wernick MN, Strother SC. Dimensionality estimation for optimal detection of functional networks in BOLD fMRI data. NeuroImage. 2011;56(2):531–543. doi: 10.1016/j.neuroimage.2010.09.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hui M, Li R, Chen K, Jin Z, Yao L, Long Z. Improved estimation of the number of independent components for functional magnetic resonance data by a whitening filter. IEEE Journal of Biomedical and Health Informatics. 2013 May;17(3):629–641. doi: 10.1109/jbhi.2013.2253560. [DOI] [PubMed] [Google Scholar]
- 44.Levin-Schwartz Y, Song Y, Schreier PJ, Calhoun VD, Adalı T. Sample-poor estimation of order and common signal subspace with application to fusion of medical imaging data. NeuroImage. 2016;134:486–493. doi: 10.1016/j.neuroimage.2016.03.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bell A, Sejnowski T. An information maximization approach to blind separation and blind deconvolution. Neural Computation. 1995;7:1129–1159. doi: 10.1162/neco.1995.7.6.1129. [DOI] [PubMed] [Google Scholar]
- 46.Li XL, Adalı T. Independent component analysis by entropy bound minimization. IEEE Transactions on Signal Processing. 2010 Dec;58(10):5151–5164. [Google Scholar]
- 47.Calhoun VD, Potluru VK, Rhlypo R, Silva RF, Pearlmutter BA, Caprihan A, Plis SM, Adalı T. Independent component analysis for brain fMRI does indeed select for maximal independence. PLoS ONE. 2013;8(10):1–8. doi: 10.1371/journal.pone.0073309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Hotelling H. Relations between two sets of variates. Biometrika. 1936;28(3/4):321–377. [Google Scholar]
- 49.Kettenring JR. Canonical analysis of several sets of variables. Biometrika. 1971;58(3):433–451. [Google Scholar]
- 50.Borga M, Friman O, Lundberg P, Knutsson H. A canonical correlation approach to exploratory data analysis in fMRI. Proceedings of the ISMRM Annual Meeting; Honolulu, Hawaii. May, 2002. p. 1. [Google Scholar]
- 51.Li YO, Adalı T, Wang W, Calhoun VD. Joint blind source separation by multiset canonical correlation analysis. IEEE Transactions on Signal Processing. 2009;57(10):3918–3929. doi: 10.1109/TSP.2009.2021636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Rustandi I, Just MA, Mitchell TM. Integrating multiple-study multiple-subject fMRI datasets using canonical correlation analysis. Proc MICCAI 2009 Workshop: Statist Model Detection Issues in Intra-and Inter-Subject Functional MRI Data Anal. 2009 Sep;:1–9. [Google Scholar]
- 53.Deleus F, Van Hulle MM. Functional connectivity analysis of fMRI data based on regularized multiset canonical correlation analysis. Journal of Neuroscience methods. 2011;197:143–157. doi: 10.1016/j.jneumeth.2010.11.029. [DOI] [PubMed] [Google Scholar]
- 54.Zhang Y, Zhou G, Jin J, Wang M, Wang X, Cichocki A. L1-regularized multiway canonical correlation analysis for SSVEP-based BCI. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2013 Nov;21(6):887–896. doi: 10.1109/TNSRE.2013.2279680. [DOI] [PubMed] [Google Scholar]
- 55.Chen X, Wang ZJ, McKeown MJ. A three-step multimodal analysis framework for modeling corticomuscular activity with application to Parkinson’s disease. IEEE Journal of Biomedical and Health Informatics. 2014 Jul;18(4):1232–1241. doi: 10.1109/JBHI.2013.2284480. [DOI] [PubMed] [Google Scholar]
- 56.Adalı T, Anderson M, Fu GS. Diversity in independent component and vector analyses: Identifiability, algorithms, and applications in medical imaging. IEEE Signal Processing Magazine. 2014 May;31(3):18–33. [Google Scholar]
- 57.Lee JH, Lee TW, Jolesz FA, Yoo SS. Independent vector analysis (IVA): Multivariate approach for fMRI group study. NeuroImage. 2008;40(1):86–109. doi: 10.1016/j.neuroimage.2007.11.019. [DOI] [PubMed] [Google Scholar]
- 58.Michael A, Anderson M, Miller R, Adalı T, Calhoun VD. Preserving subject variability in group fMRI analysis: Performance evaluation of GICA versus IVA. Frontiers in Systems Neuroscience. 2014;8:106–123. doi: 10.3389/fnsys.2014.00106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lee JH, Lee TW, Jolesz FA, Yoo SS. Independent vector analysis (IVA) for group fMRI processing of subcortical area. International Journal of Imaging Systems and Technology. 2008;18(1):29–41. doi: 10.1002/ima.20141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ma S, Calhoun VD, Phlypo R, Adalı T. Dynamic changes of spatial functional network connectivity in healthy individuals and schizophrenia patients using independent vector analysis. NeuroImage. 2014;90:196–206. doi: 10.1016/j.neuroimage.2013.12.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Gopal S, Miller RL, Michael A, Adalı T, Cetin M, Rachakonda S, Bustillo JR, Cahill N, Baum SA, Calhoun VD. Spatial variance in resting fMRI networks of schizophrenia patients: an independent vector analysis. Schizophrenia Bulletin. 2016;42(1):152–160. doi: 10.1093/schbul/sbv085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Kiehl KA, Liddle PF. An event-related functional magnetic resonance imaging study of an auditory oddball task in schizophrenia. Schizophrenia Research. 2001;48(2):159–171. doi: 10.1016/s0920-9964(00)00117-1. [DOI] [PubMed] [Google Scholar]
- 63.Gollub RL, Shoemaker JM, King MD, White T, Ehrlich S, Sponheim SR, Clark VP, Turner JA, Mueller BA, Magnotta V, O’Leary D, Ho BC, Brauns S, Manoach DS, Seidman L, Bustillo JR, Lauriello J, Bockholt J, Lim KO, Rosen BR, Schulz SC, Calhoun VD, Andreasen NC. The MCIC collection: a shared repository of multi-modal, multi-site brain image data from a clinical investigation of schizophrenia. Neuroinformatics. 2013;11(3):367–388. doi: 10.1007/s12021-013-9184-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Clark VP, Fannon S, Lai S, Benson R, Bauer L. Responses to rare visual target and distractor stimuli using event-related fMRI. Journal of Neurophysiology. 2000;83(5):3133–3139. doi: 10.1152/jn.2000.83.5.3133. [DOI] [PubMed] [Google Scholar]
- 65.Clark VP. A history of randomized task designs in fMRI. NeuroImage. 2012;62(2):1190–1194. doi: 10.1016/j.neuroimage.2012.01.010. [DOI] [PubMed] [Google Scholar]
- 66.Schouten TM, Loitfelder M, de Vos F, Seiler S, van der Grond J, Lechner A, Hafkemeijer A, Möller C, Schmidt R, de Rooij M, Romboutss SARB. Combining anatomical, diffusion, and resting state functional magnetic resonance imaging for individual classification of mild and moderate Alzheimer’s disease. NeuroImage: Clinical. 2016 Jan;11:46–51. doi: 10.1016/j.nicl.2016.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Du W, Levin-Schwartz Y, Fu GS, Ma S, Calhoun VD, Adalı T. The role of diversity in complex ICA algorithms for fMRI analysis. Journal of Neuroscience Methods. 2016;264:129–135. doi: 10.1016/j.jneumeth.2016.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Himberg J, Hyvarinen A. ICASSO: software for investigating the reliability of ICA estimates by clustering and visualization. IEEE 13th Workshop on Neural Networks for Signal Processing. 2003 Sep;:259–268. [Google Scholar]
- 69.Sui J, Adalı T, Pearlson G, Yang H, Sponheim SR, White T, Calhoun VD. A CCA+ICA based model for multi-task brain imaging data fusion and its application to schizophrenia. NeuroImage. 2010;51:123–134. doi: 10.1016/j.neuroimage.2010.01.069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Kim DI, Mathalon DH, Ford JM, Mannell M, Turner JA, Brown GG, Belger A, Gollub R, Lauriello J, Wible C, OLeary D, Lim K, Toga A, Potkin SG, Birn F, Calhoun VD. Auditory oddball deficits in schizophrenia: an independent component analysis of the fMRI multisite function BIRN study. Schizophrenia Bulletin. 2009;35(1):67–81. doi: 10.1093/schbul/sbn133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Schneider F, Habel U, Reske M, Kellermann T, Stöcker T, Shah NJ, Zilles K, Braus DF, Schmitt A, Schlösser R, Wagner M, Frommann I, Kircher T, Rapp A, Meisenzahl E, Ufer S, Ruhrmann S, Thienel R, Sauer H, Henn FA, Gaebel W. Neural correlates of working memory dysfunction in first-episode schizophrenia patients: an fMRI multi-center study. Schizophrenia Research. 2007;89(1):198–210. doi: 10.1016/j.schres.2006.07.021. [DOI] [PubMed] [Google Scholar]
- 72.Deserno L, Sterzer P, Wüstenberg T, Heinz A, Schlagenhauf F. Reduced prefrontal-parietal effective connectivity and working memory deficits in schizophrenia. Journal of Neuroscience. 2012;32(1):12–20. doi: 10.1523/JNEUROSCI.3405-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Gur RE, Turetsky BI, Loughead J, Snyder W, Kohler C, Elliott M, Pratiwadi R, Ragland JD, Bilker WB, Siegel SJ, Kanes SJ, Arnold SE, Gur RC. Visual attention circuitry in schizophrenia investigated with oddball event-related functional magnetic resonance imaging. American Journal of Psychiatry. 2007;164(3):442–449. doi: 10.1176/ajp.2007.164.3.442. [DOI] [PubMed] [Google Scholar]
- 74.Liu J, Pearlson G, Windemuth A, Ruano G, Perrone-Bizzozero NI, Calhoun VD. Combining fMRI and SNP data to investigate connections between brain function and genetics using parallel ICA. Human Brain Mapping. 2009;30(1):241–255. doi: 10.1002/hbm.20508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Whitfield-Gabrieli S, Thermenos HW, Milanovic S, Tsuang MT, Faraone SV, McCarley RW, Shenton ME, Green AI, Nieto-Castanon A, LaViolette P, Wojcikb J, Gabrielia JDE, Seidman LJ. Hyperactivity and hyperconnectivity of the default network in schizophrenia and in first-degree relatives of persons with schizophrenia. Proceedings of the National Academy of Sciences. 2009;106(4):1279–1284. doi: 10.1073/pnas.0809141106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Sears LL, Andreasen NC, Oleary DS. Cerebellar functional abnormalities inschizophrenia are suggested by classical eyeblink conditioning. Biological Psychiatry. 2000;48(3):204–209. doi: 10.1016/s0006-3223(00)00247-x. [DOI] [PubMed] [Google Scholar]
- 77.Svensén M, Kruggel F, Benali H. ICA of fMRI group study data. NeuroImage. 2002;16(3):551–563. doi: 10.1006/nimg.2002.1122. [DOI] [PubMed] [Google Scholar]
- 78.Zhang D, Shen D, Alzheimer’s Disease Neuroimaging Initiative Predicting future clinical changes of MCI patients using longitudinal and multimodal biomarkers. PloS One. 2012;7(3):1–15. doi: 10.1371/journal.pone.0033182. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
