Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Mar 1.
Published in final edited form as: J Neurosci Methods. 2022 Jan 11;369:109478. doi: 10.1016/j.jneumeth.2022.109478

A novel 5D brain parcellation approach based on spatio-temporal encoding of resting fMRI data from deep residual learning

Behnam Kazemivash 1, Vince D Calhoun 2
PMCID: PMC9394484  NIHMSID: NIHMS1772872  PMID: 35031344

Abstract

Objective:

Brain parcellation is an essential aspect of computational neuroimaging research and deals with segmenting the brain into (possibly overlapping) sub-regions employed to study brain anatomy or function. In the context of functional parcellation, brain organization which is often measured via temporal metrics such as coherence, is highly dynamic. This dynamic aspect is ignored in most research, which typically applies anatomically based, fixed regions for each individual, and can produce misleading results.

Methods:

In this work, we propose a novel spatio-temporal-network (5D) brain parcellation scheme utilizing a deep residual network to predict the probability of each voxel belonging to a brain network at each point in time.

Results:

We trained 53 4D brain networks and evaluate the ability of these networks to capture spatial and temporal dynamics as well as to show sensitivity to individual or group-level variation (in our case with age).

Conclusion:

The proposed system generates informative spatio-temporal networks that vary not only across individuals but also over time and space.

Significance:

The dynamic 5D nature of the developed approach provides a powerful framework that expands on existing work and has potential to identify novel and typically ignored findings when studying the healthy and disordered brain.

Keywords: Brain Parcellation, Residual Deep Neural Network, ICA, fMRI, Neuroimaging

I. INTRODUCTION

THE brain is incredibly complex and the most sophisticated organ in the body, comprised of copious neurons collaborating precisely to control perception, emotion, intelligence, creativity and general day-to-day human life activities [1, 2]. It is composed of different anatomic subparts including cerebrum, cerebellum and brainstem which are anatomically linked and functionally coupled to conduct a specific cognitive function or produce a behavioral response to external stimulus [3, 4].

Computational neuroimaging is an interdisciplinary research domain that deploys many fields including visual neuroscience, computer science and psychology for investigating brain organization and function and for developing computational theories [1, 5]. Brain parcellation as a branch of computational neuroimaging seeks to segment the brain into a mosaic of anatomic or functionally distinct regions which are spatially homogeneous [6, 7]. Brain parcellation has a significant role in exploring the connectome, a comprehensive brain map of neural connections, and can be utilized as a prior for studies of brain disorders [8].

Due to the natural diversity of brain shape and cortical folding (gyrification), as well as the cytoarchitectonic variations which are not visible via macroscopic imaging approaches like magnetic resonance imaging (MRI), proposing an efficient and rigorous parcellation scheme which reflects the functional and structural variability among individuals can significantly advance the field [9, 10]. Existing parcellation approaches can be divided into atlas (anatomy)-based and connectivity (function)-based approaches [11]. Atlas-based approaches are widely used and typically segment the brain by applying a predefined anatomical template on the input image and generating a set of regions over associated template. Despite the popularity of these methods, there are drawbacks [12, 13]. Obviously, these methods rely heavily on the given template and do not take natural variance of the brain into account for different subjects beyond the geometric warping of the fixed atlas into a standardized space using either voxel-based or surface-based approaches. The spatial registration procedures exploited in these techniques can be computationally expensive [14, 15].

Connectivity-based methods, on the other hand, segment the brain into temporally coherent portions of homogeneous patterns considering similarity metrics. Choosing a suitable similarity metric is a critical step in connectivity-based methods and can significantly affect the result [16, 17]. Independent component analysis (ICA) is a type of connectivity-based approach [18]. ICA is widely used and provide a source-based technique that can adapt to the data at hand. ICA-based methods adapt to an individual brain, but typically assume a fixed spatial pattern for each network over time within a given person. Changes over time are captured by a single timecourse which modulates the contribution of the network to each point in time. Furthermore, ICA-based approaches commonly exploit free parameters such as the number of components which can be used to model the brain at different spatial scales [19]. Another advantage of ICA, relative to most other parcellation approaches, is the components have values at all voxels, thus allowing for overlap. This facilitates modeling of voxels which participate in multiple functional networks.

There is a considerable amount of research studying functional connectivity and brain dynamics for estimating and visualizing synchronized interactions of focal or distributed brain regions through time and linking of these to cognition and behavior in the healthy and disordered brain [20, 21]. There are also many different ways to model brain dynamism including spatial, temporal, and spatio-temporal dynamics, each of which provide unique and relevant information on brain activity and connectivity [22]. Temporal dynamic approaches to brain connectivity deals with studying time-varying properties of functional magnetic resonance imaging (fMRI) data by evaluating transient changes in temporally correlated or mutually informed information between fixed brain regions or spatial nodes [23]. More recently there have been recognition of the need for modeling changes in the spatial nodes/networks over time [22, 24] including changes in shape, size or translation of active regions. If we consider both spatial and temporal features for a given subject simultaneously then spatio-temporal dynamics is taken into account [25].

While abundant research has been conducted on brain parcellation, to our knowledge, none of them has modeled the full 4D nature of human brain function, allowing for both temporal and spatial changes over time. The computational complexity for such an approach can be challenging, but the improved efficiency of machine learning algorithms provides a substantial opportunity. We herein propose a novel multi-model system which adopts deep residual network structure to predict brain networks for fMRI images which can be used for tracking spatio-temporal dynamics over space, time, and network, resulting in a 5-dimensional parcellation (i.e., 4D spatio-temporal information for each brain network).

The remaining text is organized as follows. Section II reviews related scientific literature followed by theoretical background in section III. Also, section IV discusses details of the proposed method and continues with results in section V. Next, we have discussion section VI to analyze experimental results and finally, we have conclusion in section VII.

II. RELATED WORK

The idea of employing predefined templates in brain parcellation or taking advantages of mathematical similarity metrics has been widely investigated in the neuroimaging and neuroscience community. Machine learning algorithms being applied in brain parcellation have shown good results [26]. Recently, Liu et al. presented a new multi-atlas adversarial fully convolutional network (FCN) [27] for brain parcellation in which the generative model is an FCN merged with brain atlases information and the discriminative model is a convolutional neural network with multi-scale deep features. Bonmati et al. implemented a novel and efficient framework based on information theory that models brain regions as a random walk on the connectome by applying a Markov process in which the different nodes refer to brain regions. They also used an agglomerative information bottleneck technique to cluster functional or anatomical patches by minimizing loss of information through computing mutual information as the similarity metric [28].

In another approach, Thyreau et al. proposed a novel and efficient scheme bridging atlas-warping techniques and the precision of landmark localization in a deep learning model, leading to meaningful and interesting results [29]. Moreover, Luo et al. conducted research adopting an eigen clustering technique for brain parcellation utilizing the precuneus–cortical functional connectivity derived from fMRI images and showed that it is robust to noise [30].

In other work, Zhao et al. proposed a two-step parcellation scheme which maps resting fMRI (rs-fMRI) signals to cortical surface and then registers that into standard space. For the second step, the system estimates the similarity between vertices based on the correlation of rs-fMRI time series. This approach has shown acceptable results based on standard metrics [31]. Similarly, Boukhdhir et al. presented a novel scheme to detect dynamic states of segments; In their method, a clustering analysis has been executed to regroup short time windows into states with similar seed patches [32].

In addition to studying relevant work, Graham et al. [33] proposed a novel brain parcellation scheme by generating a hierarchical classification tree which is adapted from a segmentation task and extending it for predicting decisions by modeling an uncertainty factor for each of them. However, none of the above approaches model the full 4D aspect of brain functional connectivity.

III. THEORETICAL BACKGROUND

Our brain parcellation scheme exploits ICA output to extract components that are used as prior information to train a deep residual model in a supervised manner. ICA extracts maximally spatially independent maps from the fMRI data and their associated time courses. The data, for a given fMRI image X which is time-by-space, are decomposed into a time course matrix A which is time-by-component and the spatial maps S which is component-by-voxel, written as

X=AS (1)

Here, X is a 4D fMRI image (time by volume size), A is the time course matrix which size is the length of time points by component number and finally S is the spatial maps matrix with size of component number by voxel size. This can also be done at the group level as in [34]. We leverage a recently proposed fully automated spatially constrained ICA pipeline called NeuroMark [35] which enables us to estimate brain functional network from functional magnetic resonance imaging (fMRI) data that can be used to link brain network abnormalities among different datasets, studies, and disorders. As mentioned, the ICA decomposition does not explicitly model temporal variation of the spatial maps. We can utilize this pipeline however by taking the spatial maps as priors to initialize the model.

IV. METHOD

We propose a novel predictive model that encodes spatio-temporal dynamics to estimate individual subject brain network parcellations that are linked across subjects as well. Accordingly, we frame the approach as a regression problem by predicting the probability of each voxel belonging to different brain networks. Each brain network is derived from a combination of temporal and spatial ICA components computed for a given subject using the approach discussed in the theoretical section. Next, we train a deep residual model for each of the brain networks via a supervised approach by using the fMRI time series images as input and generating labels as the target. In terms of novelty, the proposed scheme utilizes a deep learning structure to encode a set of 4D fMRI images into network specific brain parcels which vary through time as shown in figure 1. To do this, for each network we apply the pre-trained model to each timepoint in a repetitive manner to achieve the entire 4D network. We do this for all networks, resulting in a full 5D parcellation of the brain.

Fig. 1.

Fig. 1.

General concept of ICA-based methods versus the proposed approach. In ICA approach, we get separate components for time courses and spatial maps, but the proposed system generates a 5D output which encodes spatio-temporal dynamics of the brain and enables us to study variability in different network over time or subjects. It is also mentioned in the figure that ICA components utilized as prior to train the proposed model.

A. Label Generation

Generating labels is a critical step in proposed approach due to our supervised model training. First, we applied ICA on fMRI images each of which are a 4D tensor with one temporal axis and 3 dimensions for indicating spatial coordinates. By applying ICA and extracting components, we have k time courses and spatial maps separately where a time course is a vector with size t equal to the length of time axis in the original fMRI image, and a spatial map is a vector of which the size is product of spatial coordinates, the voxel size. Then, we compute the outer product of time courses and spatial maps for all k components in order to generate full 4D tensors indicating each brain network over time and is mandatory for training our model.

Ni=TCiSMi (2)

where, TCi is a specific time course, SMi refers to a given spatial map and is the outer product operator. Clearly, Ni is a matrix with size of t-by-voxel size. We then build up a 3D tensor by repeating the procedure for all k components. Next, we generate probabilistic maps by applying the following equation:

PMi=|Ni|/i=1k|Ni|

Here, PM shows the probabilistic map, and the dominator is the elementwise summation of absolute values of N which forms a matrix with same shape as N. This procedure guarantees all elements range between 0 and 1 because we divide each element with norm-1 of a vector that includes different values of each element in k maps.

Finally, we transform each probability network into a 4D tensor by deploying a predefined mask with the same shape as the input fMRI image.

B. Model Configuration

Our regression model is configured as a deep residual network with 36 layers including conv3D, transposed Conv3D, batch normalization, max pooling and drop out layers which are grouped as encoding and decoding blocks and finally skip connections inside and outside of each block. The model structure has 2 focal sections of encoder and decoder sections; The encoder begins by a conv3D layer with output channel size of 64 and kernel size of 3 followed by sigmoid function. Next, we have residual encoding block with 3 volumetric convolution and batch normalization layers with same out-channel size of 32 and also both of last conv layers use padding size of 1. Also, we apply sigmoid function on each couple of conv and batch norm layers’ output.

Furthermore, we have a 3D max pooling layer just after each encoding block with stride of 1 and same kernel size as convolution layers. The remaining encoding blocks have identical configuration and dependencies as first block except out-channel size of 16 and 8. The last layer in encoder section of model is a 3D dropout layer with ratio of 0.5. On the decoder side, we have 2 consecutive decoding block each of which has a couple of 3D conv transposed layers followed by batch normalization layers and a sigmoid activation function immediately after this layer. The sizes of the out-channels are, respectively, 16 and 32.

We also have a dropout layer with ratio of 0.5. Subsequently, there is another decoding block with an out-channel size of 64 and similar remaining parameters. The last layer is a 3D conv transposed layer with the out-channel size of 1 and kernel size of 3. Furthermore, we applied mean squared error as a loss function in the supervised training procedure as shown in the equation below:

L(h,(x,y))=def1n×i=1n(h(xi)yi)2

Where h denotes the hypothesis, (x,y) is set of input-label tuples. The schematic diagram of the model structure contains encoding and decoding blocks along with skip connections, as shown in figure 2.

Fig. 2.

Fig. 2.

Schematic diagram of model organization with detail structure of (b) encoding block and (c) decoding block. The base model has 36 layers including conv3D, max pooling, drop out, batch norm and conv transposed layers grouped into encoding/decoding blocks. Data flow and skip connections between different layers are shown by black arrows.

C. Model input templates

For the spatial priors, we utilize the NeuroMark templates [36]. A template of replicable independent components (ICs) constructed after spatially matching correlated group-level ICs between two large (N>800) healthy control fMRI datasets, genomics superstructure project (GSP) and human connectome project (HCP). The template was then used as priors for a spatially constrained ICA algorithm applied to each subject individually to identify 53 functionally relevant resting-state networks (RSNs) for each individual that are maximally spatially independent. Each RSN consists of one spatial map, with a size of 53×63×52 voxels, and its associated time course. RSNs are grouped into seven domains, namely subcortical (SC), auditory (AU), sensory motor (SM), visual (VI), cognitive control (CC), default mode (DM), and cerebellar (CB), by function similarity.

D. fMRI data: Acquisition and preprocessing

To evaluate preliminary results, we applied our approach to data from the UK Biobank study (six subjects to evaluate the ability of our approach to capture variability within subjects, and forty subjects to evaluate the ability of our approach to capture changes related to a subject variable, age). Participants were scanned once by a 3-Tesla (3T) Siemens Skyra scanner with a 32-channel receive head coil, acquired all in one site. A gradient-echo echo planar imaging (GE-EPI) paradigm was used to collect/obtain resting-state fMRI scans. The EPI-based acquisition parameters include multiband acceleration factor of 8 (i.e., eight slices were acquired simultaneously), no iPAT, fat saturation, flip angle (FA)=52°, spatial resolution = 2.4×2.4×2.4mm, repeat time (TR)=0.735s, echo time (TE)=39ms, and 490 volumes. Subjects were instructed to stare at a crosshair passively and remain relaxed, not thinking about anything, during the six minutes and ten second resting-state scanning period.

The preprocessing steps performed by UK Biobank are as follows. An intra-modal motion correction tool, MCFLIRT [37], was applied to minimize the distortions due to head motions. Grand-mean intensity normalization scaled the entire 4D dataset by a single multiplicative factor to facilitate comparability of brain scans between subjects. The data were filtered by a high-pass temporal filter (Gaussian-weighted least squares straight line fitting, with σ = 50.0 s) to remove residual temporal drift. Geometric distortions of EPI scans were corrected by using FSL’s Topup tool [38]. EPI unwarping is followed by a gradient distortion correction unwarping phase. More details on the imaging protocol and preprocessing steps can be found in [39].

V. RESULTS

We implemented the entire system including the model, preprocessing and post-processing procedures in python 3.6 mainly using the Pytorch [40], NumPy and Nilearn libraries. All experiments executed on a TESLA V100 GPU with 32 GB of dedicated RAM. We trained 53 models each of which grouped into a specific domain including cerebellar, sensory-motor, default mode, auditory, subcortical, cognitive control and visual networks. Our model benefits from the Adam optimizer with an adaptive learning rate of 0.00001 and step size of 5 for optimizing the loss function and 200 epochs for training phase.

We also used an early stopping technique to avoid overfitting and to improve the generalization of the trained model. Furthermore, to evaluate the ability of our approach to capture within and between subject variability, we trained our model using data from 3 fMRI subjects from the UK Biobank’s dataset [41] each of which has 490 timepoints; A total of 1470 volumes with dimensions of (53,63,52) as input samples (dataset), were normalized using a min-max method. We split the dataset into training, validation and test dataset by grouping 70, 10 and 20 percent of the entire dataset, respectively. In other words, we have 1030, 150 and 290 volumes for each of training, validation and test sets. It is worthwhile to mention that all sets do not have any sample in common.

Finally, we trained each of 53 models by feeding input fMRI images (each of volumes as an individual sample) and relevant probability network, extracted from ICA as priors based on described configuration. Moreover, training and validation losses converged to low values after primary fluctuations as shown in Figure 3 for a sample subcortical network. Consequently, the trained models show consistent behavior facing unseen samples due to the average mean squared error for test set which ranges between 29 to 33 for all networks and are located in reasonable range, although some of them are a bit higher than other ones as a consequence of applying same model parameters for all of them. We can easily obtain a similar error range for all networks by applying specific learning and decay rates for each of them separately.

Fig. 3.

Fig. 3.

The upper plot shows the Mean Squared Error loss on training and validation set for one of subcortical networks (network 1) which emphasizes on the convergences of model loss for both errors. Moreover, both errors are minimized by desirable low values. Furthermore, lower plot depicts the averaged MSE loss on test set for all 53 models each of which is related to a specific brain network. However, some of models have higher averaged MSE than others that’s mainly consequence of applying same learning rate and also decay rate for all of them instead of specific parameters but all of models’ averaged losses fluctuate in a reasonable range.

Results suggest the proposed multi-model system shows meaningful parcels, not only facilitating comparison among different subjects but also highlighting individual variability in decompositions. For example, comparing the output for one of the trained models for different subjects, we show the sensory-motor network in figure 4, which clearly emphasizes the fact that the model captures variability among subjects in the networks (in contrast to fixed atlas-based approaches); We can see smooth changes around active brain area on the output of a sensory-motor network for different subjects at timepoint 100 in figure 4 and it is worth mentioning that we also see variation along time as well, that is the spatial patterns vary within an individual over time. Moreover, the magnitude of active region varies from subject to subject or even from time point to time point that indicates ability of the trained model to capture dynamics.

Fig. 4.

Fig. 4.

Initial differences on the output (3D image is the mean over time within a 4D sensory-motor network) for 6 different subjects on a random timepoint (timepoint 100), highlighting the ability of our approach to capture individual variability. The “pb” unit in the figure denotes probability of each voxel belonging to this brain network and the magnitude of active region is shown on top of each image which shows different values for each of subjects.

Generally, our trained models show remarkable performance and reasonable results which can be utilized for studying healthy or disordered brain function. We depict the output of all 53 networks for a random subject by computing the average of timepoints and applying z-score to have better representation in figure 5. As is clear from the output, all trained models have extracted biologically reasonable brain regions for each of the brain networks as expected.

Fig. 5.

Fig. 5.

The model output for all 53 brain networks including subcortical, auditory, sensory-motor, visual, cognitive control, default mode and cerebellar networks given a sample subject. All figures are being applied different thresholds to have more informative display.

A. Spatial Variability

One of the important features and goals of the proposed approach is to capture spatial variability over time within each network for different subjects. We next summarize the spatial variability feature in three ways. First, we processed fMRI data with our model and applied z-scoring on the model output. Next, we computed timepoint-wise differences of the previous output and summed up these differences, to capture variability over time within each voxel of a given network. Finally, we apply this to the multiple networks. We also compute the absolute timepoint-wise differences to evaluate the regions which are showing higher or lower variability regardless of sign.

The spatial variability for the sensory motor, default mode, auditory, subcortical, cerebellar and visual networks is show in figure 6. One important observation is we see that spatial variability is highlighting regions that fall within or near the thresholded network boundaries on the left but are different from those of the overall mean activity, showcasing a key advantage of our approach to provide additional insights into brain activity. For example, the absolute variation in the sensorimotor network highlights the sensorimotor strip, most of which is not captured by the amplitude (presumably because it fluctuations between positive and negative connectivity which averages to a low value over time). This highlights the additional information that can be captured when accounting for spatial variation. Another interesting example, the default mode network show very high variability in the posterior cingulate, but not in the anterior (medial frontal) portion of the network. Regarding variation, the auditory network shows two distinct clusters showing positive and negative variation. These are only a few observations which highlight the (usually ignored) information that can be captured by our proposed approach.

Fig. 6.

Fig. 6.

Spatial variability in sample subcortical, visual, auditory, cerebellar, default mode and sensory motor networks for a given subject. First column shows the model output (3D mean amplitude), second column is the sum of the absolute differences in time and third one shows the sum of the differences in time. All variation plots are in the same coordinates with original model output to make a fair comparison. Interestingly, the spatial variability is in some cases, most informative in regions that are highly relevant to the given network domain, but which are not captured in the average output.

B. Age Effects

Furthermore, to study the ability of the model to capture group differences, we explored the capability of our trained models to capture age differences between subjects. To do this, we selected 20 young subjects who were less 40 years old and 20 old subjects older than 60 and computed output of the model for all of them. To summarize the captured spatial dynamics within one network (the sensorimotor network), we applied a k-means algorithm with 5 segments as the primary number of clusters using all 40 outputs. This provides us with groups of voxels which show similar spatial variability. Next, we measured the magnitude of each cluster for each subject and computed the group differences between the old and young groups for each cluster. Results showed that young subjects have higher magnitude than older subjects in all clusters.

The clustered regions for the sensory-motor network along with magnitude of the cluster showing the largest age difference (pink color) is shown in figure 7. Note that clusters with pink, green, yellow, black and blue colors, respectively, include voxels with higher to lower probability values with voxel sizes of 857, 203, 408, 402, 34860.

Fig. 7.

Fig. 7.

The upper figure shows clustered regions for output of a sensory-motor network after applying a threshold to study age effects on young vs old groups of subjects (Group differences). All clusters are marked using different colors and pink, green, yellow, black and blue clusters, respectively, contain voxels with higher to lower probabilities. The lower figure indicates the two-sample T-Test score for young vs old subjects for all clusters. According to results of this network, the T-value for young vs old group difference shows higher mean values for younger subjects with significant differences in the pink cluster (p<0.004).

Furthermore, we computed the two sample T-Test score for each cluster as a metric to evaluate differences in model output for young versus old subjects as is shown in figure 7. In this network, the pink cluster showed a significant difference between young and old (p < 0.004) whereas the lowest score belongs to the blue region, comprising the main and largest portion of the network. Important the magnitude of these changes is quite variable as are the spatial locations of the changes, which tend to focus on the boundaries of the network. Such information would be completely ignored in a typical approach that does not focus on spatial dynamic information. Clearly, this experiment highlights ability of the model to capture changes in spatial variability in different types of groups like young and old subjects.

C. Temporal Variability

The proposed approach also enables us to study functional dynamicity in the activity of brain networks by encoding temporal variation through different time-points. This is a salient property that can open up wide range of new studies to characterize the full spatiotemporal information in these brain networks and use them to study the healthy and disordered brain. Hereupon, we have studied preliminary evidence of temporal variation on the model’s output by computing magnitude, mean and standard deviation of active regions and show they exhibit significant fluctuations over time as is shown in figure 8 for a sample visual network.

Fig. 8.

Fig. 8.

The figure shows the oscillations of magnitude (blue), mean (green) and standard deviation (red) of active region in a visual network for a given subject over 490 time points

Another approach typically used in ICA methods is to measure temporal variability or changes over time in the temporal coherence among voxels, which can be captured with metrics such as functional network connectivity (FNC) [42]. To do this, we grouped all networks into subcortical, auditory, sensory-motor, visual, cognitive control, default mode and cerebellar categories and then computed the cross correlation of all 53 networks over all timepoints and show the result as a heatmap in figure 9.

Fig. 9.

Fig. 9.

The heatmap shows the correlation between all 53 networks grouped into Subcortical (1–5), Auditory (6–7), Sensory-Motor (8–16), Visual (17–25), Cognitive Control (26–42), Default Mode (43–49) and Cerebellar (50–53) respective categories for a given subject (Static FNC).

The results exhibit the expected modularity, with generally higher correlation between networks in the same group like network 44 and 46 belonging to the default mode category with ratio of 0.37 or network 18 and 20 from visual category with ratio 0.41. We also see negative correlation between different groups like network 20 (visual network) and network 26 (cognitive control) with ratio −0.2.

Furthermore, we also illustrate changes in this FNC over time (i.e., dynamic FNC [43]) to provide additional insight into the model behavior regarding encoding temporal variation. To do this, we computed the cross-correlation between all networks using a sliding window technique with window size of 50 and overlapping ratio of 10. For this analysis, we computed the correlation between all networks within the first 50 timepoints, then we moved the sliding window 40 timepoints forward to repeat the calculation using the next batch of timepoints by shifting the window by 10 timepoints from the previous one. Next, we proceeded with remaining timepoints which resulted in 12 distinct heatmaps, vary over the time and clearly shows dynamicity of functional connectivity network as is shown in figure 10. Given that we now have access to the full 5D space-time-network information, this can be further exploited in future studies to evaluate dynamics over time or space or both.

Fig. 10.

Fig. 10.

Dynamic FNC is shown by 12 heatmaps computed by sliding window technique for a sample subject and each of heatmaps shows the correlation between all networks for 50 timepoints that overlap 10 timepoints with the previous one ordered left to right and top to bottom. A dynamic FNC approach emphasizes variation of functional connectivity over time which provide an important tool for capturing temporal variation in functional connectivity (i.e., the chronnectome [19]), encoded by our model.

VI. DISCUSSION

We introduce a novel approach to perform a full 5D parcellation of functional activity, thus preserving the, typically ignored, spatiotemporal behavior of each intrinsic network. We implemented our approach by training a residual network to predict the probability of each voxel belonging to each brain network using ICA priors. Results show our model not only generates reasonable maps of brain segments but is also able to captures spatiotemporal dynamics of the brain, providing a rich feature space useful for further analysis.

Preliminary results show that the proposed model captures differences among subjects, spatial variability within networks, and temporal coupling among different networks. Hence, we see obvious differences in model output for different subjects as is shown in figure 4 for a sample sensory-motor network. Results show that the magnitude of active region vary from subject to subject along with smooth changes in boundaries and shape of active region. Moreover, our approach is able to encode the full spatiotemporal patterns while preserving within and between subject information. This provides a new window into brain activity, highlighting information that is missed by approaches that ignore spatial dynamics. For example, if we focus on the sensory-motor sagittal view of absolute differences plot, we see 3 hot spots located along the SM border that exhibit high variability (temporal fluctuations) over time. This information is not visible in the mean activity, e.g., the original model output. We also see a source of high variability in the auditory network coronal view in the absolute differences plot which is shown in the mean map. This may suggest the network is transiently linking to other network such as the default mode network and should be explored in further studies with larger numbers of subjects. Additionally, exploration of the temporal variation shows high correlation within the same network categories. It is worth mentioning that we observe notable correlation between two different categories (domains) as well, for example, network 7 and 8 from the auditory and sensory-motor domains show high inter-domain correlation, but we also see negative/inverse correlation for some other networks which demonstrate the fact that an increase in activity of a network is linked to a decrease in other network like network 19 from visual category with network 31 from cognitive control which show an inverse activity. We also studied dynamic FNC to evaluate the ability of the model to capture temporal variation in the coupling between networks. Accordingly, we observe significant changes in correlation or anti-correlation between different networks over time as shown in figure 10 and is a consequence of model capability in encoding temporal variations. Together these results highlight the richness of the estimated 5D output and its potential to be applied to study various aspects of the healthy and disordered brain.

A wide range of methods from other research domains have been adapted to study brain connectivity, including the Spatio-Temporal Graph Convolution (ST-CGN) [25] which took advantage of a graph convolutional network to model the non-stationary properties of functional connectivity by learning the importance of edges between brain networks and showed 80 percent accuracy in predicting gender and more than 75 percent on predicting age using the encoded temporal dynamics. However, this model, like most others, did not include spatial variability as a consequence of using a predefined atlas for extracting ROIs. Ignoring spatial and temporal variability may lead to incomplete or even incorrect conclusions and is a motivation for our approach that generates probabilistic maps which not only encode temporal dynamics but also capture spatial variability (e.g., size, shape, translation) between subjects. Similarly, a study of rs-network dynamics employed for schizophrenia diagnosis [44] trained a support vector machine (SVM) with functional connectivity data to classify subjects into patient/control showed 90 percent of accuracy in classification task, however this approach also did not model the spatial or spatio-temporal dynamics. Functional connectivity data has also been used to study Alzheimer’s disease [45] by training an end-to-end deep recurrent neural network with rs-fMRI data. This study resulted in 90 percent accuracy in classification and highlights the importance of spatio-temporal dynamics in biomarker studies, however the end-to-end model also comes with a limitation in that does not naturally enable separation of the mixture of spatial and temporal variabilities. We attempted to address these limitations with our proposed parcellation model which blends together the strengths of deep learning and blind source separation to enhance interpretability with a focus on brain networks. Our proposed model also has a strength relative to typical methods in brain parcellation since it encodes spatio-temporal dynamics into 5D probabilistic maps which can be used for further studies on brain functionality.

VII. CONCLUSION

We proposed a novel 5D brain parcellation method using a deep residual network structure which is able to capture spatio-temporal dynamics of the brain. Our method goes beyond existing brain parcellation techniques by providing information about the full spatio-temporal variation over space, time, and networks. We utilized ICA priors to train a model with deep residual structure in order to predict probability of each voxel in a given timepoint belonging to each brain network and obtained a full 5D result after additional post processing steps. According to the results, the proposed approach provides output which is sensitive to individual variations as demonstrated by comparing same network and timepoint for different subjects; Moreover, we studied model performance in differentiating two groups of young and old subjects which showed statistically significant differences in model output between two groups within one active region (pink cluster). This provides evidence that the spatial dynamic aspects of the data are sensitive to different aspects of the subjects, and supports continued work in capturing such information in future studies. We also studied spatial variation via computing timepoint wise differences of model output that reveals regions which simultaneously fall within network boundaries but are different from those of the overall mean activity. This is a key advantage of our approach to provide additional insight into brain activity. Furthermore, we evaluated temporal interactions among networks by computing functional network connectivity (FNC) by grouping all networks into the various domains (subcortical, auditory, sensory-motor, visual, cognitive control, default mode and cerebellar categories) and observed the expected modular output, including generally higher correlation within each domain. We believe that the key feature of the proposed approach in encoding spatio-temporal dynamics of brain networks provides broad applicability to study brain disorders, behavior and cognition, and also group differences. We are planning multiple future studies along these lines.

Highlights.

  • A novel 5D brain parcellation scheme was proposed

  • Proposed system enables us to capture spatial and temporal dynamics of brain

  • Proposed system generates informative spatio-temporal networks that are sensitive to individuals and group-level variation

Footnotes

CRediT author statement

Behnam Kazemivash: Conceptualization, Methodology, Software, Writing- Original draft preparation, Visualization, Investigation.

Vince D. Calhoun: Supervision, Writing- Reviewing and Editing, Conceptualization

Declaration of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Contributor Information

Behnam Kazemivash, Department of Computer Science, Georgia State University, Atlanta, GA 30332 and also with the Tri-Institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS), Atlanta, GA 30303 USA.

Vince D. Calhoun, Tri-Institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS), Georgia State University, Georgia Institute of Technology, and Emory University, Atlanta GA 30303.

REFERENCES

  • [1].Eickhoff SB, Yeo BT, & Genon S.(2018). Imaging-based parcellations of the human brain. Nature Reviews Neuroscience, 19(11), 672–686. [DOI] [PubMed] [Google Scholar]
  • [2].Xu J, Potenza MN, Calhoun VD, Zhang R, Yip SW, Wall JT, ... & Moran JM. (2016). Large-scale functional network overlap is a general property of brain functional organization: reconciling inconsistent fMRI findings from general-linear-model-based analyses. Neuroscience & Biobehavioral Reviews, 71, 83–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Du Y, Liu J, Sui J, He H, Pearlson GD, & Calhoun VD (2014, August). Exploring difference and overlap between schizophrenia, schizoaffective and bipolar disorders using resting-state brain functional networks. In 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (pp. 1517–1520). IEEE; [DOI] [PubMed] [Google Scholar]
  • [4].Luo Z, Zeng LL, Qin J, Hou C, Shen H, & Hu D.(2020). Functional parcellation of human brain precuneus using density-based clustering. Cerebral Cortex, 30(1), 269–282. [DOI] [PubMed] [Google Scholar]
  • [5].Petrides M.(2012). The human cerebral cortex: an MRI atlas of the sulci and gyri in MNI stereotaxic space. Elsevier/Academic Press. [Google Scholar]
  • [6].Dillon K, & Wang YP (2020). Resolution-based spectral clustering for brain parcellation using functional MRI. Journal of neuroscience methods, 335, 108628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Hu Y, Li X, Wang L, Han B, & Nie S.(2020). T-distribution stochastic neighbor embedding for fine brain functional parcellation on rs-fMRI. Brain Research Bulletin, 162, 199–207. [DOI] [PubMed] [Google Scholar]
  • [8].Gorbach NS, Schütte C, Melzer C, Goldau M, Sujazow O, Jitsev J, ... & Tittgemeyer M. (2011). Hierarchical information-based clustering for connectivity-based cortex parcellation. Frontiers in neuroinformatics, 5, 18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Arslan S, Ktena SI, Makropoulos A, Robinson EC, Rueckert D, & Parisot S.(2018). Human brain mapping: A systematic comparison of parcellation methods for the human cerebral cortex. Neuroimage, 170, 5–30. [DOI] [PubMed] [Google Scholar]
  • [10].Dohmatob E, Richard H, Pinho AL, & Thirion B.(2021). Brain topography beyond parcellations: local gradients of functional maps. NeuroImage, 229, 117706. [DOI] [PubMed] [Google Scholar]
  • [11].de Reus MA, & Van den Heuvel MP (2013). The parcellation-based connectome: limitations and extensions. Neuroimage, 80, 397–404. [DOI] [PubMed] [Google Scholar]
  • [12].Sun L, Shao W, Wang M, Zhang D, & Liu M.(2019). High-order feature learning for multi-atlas based label fusion: Application to brain segmentation with MRI. IEEE Transactions on Image Processing, 29, 2702–2713. [DOI] [PubMed] [Google Scholar]
  • [13].Abdedayem F, Kallel F, Chaabane M, Hamida AB, & Sellami L.(2020, September). fMRI Imaging Based Human Brain Parcellation Methods: A review. In 2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) (pp. 1–5). IEEE. [Google Scholar]
  • [14].Wu G, Wang Q, Zhang D, Nie F, Huang H, & Shen D.(2014). A generative probability model of joint label fusion for multi-atlas based brain segmentation. Medical image analysis, 18(6), 881–890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Cabezas M, Oliver A, Lladó X, Freixenet J, & Cuadra MB (2011). A review of atlas-based segmentation for magnetic resonance brain images. Computer methods and programs in biomedicine, 104(3), e158–e177. [DOI] [PubMed] [Google Scholar]
  • [16].Nanetti L, Cerliani L, Gazzola V, Renken R, & Keysers C.(2009). Group analyses of connectivity-based cortical parcellation using repeated k-means clustering. Neuroimage, 47(4), 1666–1677. [DOI] [PubMed] [Google Scholar]
  • [17].Eickhoff SB, Thirion B, Varoquaux G, & Bzdok D.(2015). Connectivity‐based parcellation: Critique and implications. Human brain mapping, 36(12), 4771–4792. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Calhoun VD, & Adali T.(2012). Multisubject independent component analysis of fMRI: a decade of intrinsic networks, default mode, and neurodiagnostic discovery. IEEE reviews in biomedical engineering, 5, 60–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Iraji A, Faghiri A, Lewis N, Fu Z, DeRamus T, Qi S, ... & Calhoun V.(2019, September). Ultra-high-order ICA: an exploration of highly resolved data-driven representation of intrinsic connectivity networks (sparse ICNs). In Wavelets and Sparsity XVIII (Vol. 11138, p. 111380I). International Society for Optics and Photonics. [Google Scholar]
  • [20].Lurie DJ, Kessler D, Bassett DS, Betzel RF, Breakspear M, Kheilholz S, ... & Poldrack RA. (2020). Questions and controversies in the study of time-varying functional connectivity in resting fMRI. Network Neuroscience, 4(1), 30–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [21].Cheng H, Zhu H, Zheng Q, Liu J, & He G.(2020). Functional parcellation of the hippocampus by semi-supervised clustering of resting state fMRI data. Scientific Reports, 10(1), 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Iraji A, Miller R, Adali T, & Calhoun VD (2020). Space: A missing piece of the dynamic puzzle. Trends in Cognitive Sciences, 24(2), 135–149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Calhoun VD, Miller R, Pearlson G, & Adalı T.(2014). The chronnectome: time-varying connectivity networks as the next frontier in fMRI data discovery. Neuron, 84(2), 262–274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Fan L, Zhong Q, Qin J, Li N, Su J, Zeng LL, ... & Shen H. (2021). Brain parcellation driven by dynamic functional connectivity better capture intrinsic network dynamics. Human brain mapping, 42(5), 1416–1433. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Gadgil S, Zhao Q, Pfefferbaum A, Sullivan EV, Adeli E, & Pohl KM (2020, October). Spatio-Temporal Graph Convolution for Resting-State fMRI Analysis. In International Conference on Medical Image Computing and Computer-Assisted Intervention (pp. 528–538). Springer, Cham. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Kim J, Calhoun VD, Shim E, & Lee JH (2016). Deep neural network with weight sparsity control and pre-training extracts hierarchical features and enhances classification performance: Evidence from whole-brain resting-state functional connectivity patterns of schizophrenia. Neuroimage, 124, 127–146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Liu X, Zhao H, Zhang S, & Tan Z.(2019, April). Brain Image Parcellation Using Multi-Atlas Guided Adversarial Fully Convolutional Network. In 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019) (pp. 723–726). IEEE. [Google Scholar]
  • [28].Bonmati E, Bardera A, & Boada I.(2017). Brain parcellation based on information theory. Computer Methods and Programs in Biomedicine, 151, 203–212. [DOI] [PubMed] [Google Scholar]
  • [29].Thyreau B, & Taki Y.(2020). Learning a cortical parcellation of the brain robust to the MRI segmentation with convolutional neural networks. Medical Image Analysis, 61, 101639. [DOI] [PubMed] [Google Scholar]
  • [30].Luo Z, Zeng LL, Qin J, Hou C, Shen H, & Hu D.(2020). Functional parcellation of human brain precuneus using density-based clustering. Cerebral Cortex, 30(1), 269–282. [DOI] [PubMed] [Google Scholar]
  • [31].Zhao J, Tang C, & Nie J.(2020). Functional Parcellation of Individual Cerebral Cortex Based on Functional MRI. Neuroinformatics, 18(2), 295–306. [DOI] [PubMed] [Google Scholar]
  • [32].Boukhdhir A, Zhang Y, Mignotte M, & Bellec P.(2020). Unraveling reproducible dynamic states of individual brain functional parcellation. bioRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [33].Graham MS, Sudre CH, Varsavsky T, Tudosiu PD, Nachev P, Ourselin S, & Cardoso MJ (2020). Hierarchical brain parcellation with uncertainty. In Uncertainty for Safe Utilization of Machine Learning in Medical Imaging, and Graphs in Biomedical Image Analysis (pp. 23–31). Springer, Cham. [Google Scholar]
  • [34].Calhoun VD, Adali T, Pearlson GD, & Pekar JJ (2001). A method for making group inferences from functional MRI data using independent component analysis. Human brain mapping, 14(3), 140–151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Du Y, Fu Z, Sui J, Gao S, Xing Y, Lin D, ... & Hong E.(2019). NeuroMark: a fully automated ICA method to identify effective fMRI markers of brain disorders. medRxiv, 19008631. [Google Scholar]
  • [36].Du Y, Fu Z, Sui J, Gao S, Xing Y, Lin D, ... & Alzheimer’s Disease Neuroimaging Initiative. (2020). NeuroMark: An automated and adaptive ICA based pipeline to identify reproducible fMRI markers of brain disorders. NeuroImage: Clinical, 28, 102375. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [37].Jenkinson M, Bannister P, Brady M, & Smith S.(2002). Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage, 17(2), 825–841. [DOI] [PubMed] [Google Scholar]
  • [38].Andersson JL, Skare S, & Ashburner J.(2003). How to correct susceptibility distortions in spin-echo echo-planar images: application to diffusion tensor imaging. Neuroimage, 20(2), 870–888. [DOI] [PubMed] [Google Scholar]
  • [39].Miller KL, Alfaro-Almagro F, Bangerter NK, Thomas DL, Yacoub E, Xu J, ... & Smith SM (2016). Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nature neuroscience, 19(11), 1523–1536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [40].Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, ... & Desmaison A.(2019). Pytorch: An imperative style, high-performance deep learning library. In Advances in neural information processing systems (pp. 8026–8037). [Google Scholar]
  • [41].Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, ... & Cortes A. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203–209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [42].Jafri MJ, Pearlson GD, Stevens M, & Calhoun VD (2008). A method for functional network connectivity among spatially independent resting-state components in schizophrenia. Neuroimage, 39(4), 1666–1681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [43].Allen EA, Damaraju E, Plis SM, Erhardt EB, Eichele T, & Calhoun VD (2014). Tracking whole-brain connectivity dynamics in the resting state. Cerebral cortex, 24(3), 663–676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [44].Kottaram A, Johnston L, Ganella E, Pantelis C, Kotagiri R, & Zalesky A.(2018). Spatio‐temporal dynamics of resting‐state brain networks improve single‐subject prediction of schizophrenia diagnosis. Human Brain Mapping, 39(9), 3663–3681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [45].Wang M, Lian C, Yao D, Zhang D, Liu M, & Shen D.(2019). Spatial-temporal dependency modeling and network hub detection for functional MRI analysis via convolutional-recurrent network. IEEE Transactions on Biomedical Engineering, 67(8), 2241–2252. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES