Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

Research Square logoLink to Research Square
[Preprint]. 2023 Dec 13:rs.3.rs-3740218. [Version 1] doi: 10.21203/rs.3.rs-3740218/v1

Voxel-wise Fusion of Resting fMRI Networks and Gray Matter Volume for Alzheimer’s Disease Classification using Deep Multimodal Learning

Vaibhavi S Itkyal 1, Anees Abrol 2, Theodore J LaGrow 3, Alex Fedorov 4, Vince D Calhoun 5
PMCID: PMC10760243  PMID: 38168287

Abstract

Alzheimer’s disease (AD) is a prevalent neurodegenerative disorder requiring accurate and early diagnosis for effective treatment. Resting-state functional magnetic resonance imaging (rs-fMRI) and gray matter volume analysis from structural MRI have emerged as valuable tools for investigating AD-related brain alterations. However, the potential benefits of integrating these modalities using deep learning techniques remain unexplored. In this study, we propose a novel framework that fuses composite images of multiple rs-fMRI networks (called voxelwise intensity projection) and gray matter segmentation images through a deep learning approach for improved AD classification. We demonstrate the superiority of fMRI networks over commonly used metrics such as amplitude of low-frequency fluctuations (ALFF) and fractional ALFF in capturing spatial maps critical for AD classification. We use a multi-channel convolutional neural network incorporating the AlexNet dropout architecture to effectively model spatial and temporal dependencies in the integrated data. Extensive experiments on the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset of AD patients and cognitively normal (CN) validate the efficacy of our approach, showcasing improved classification performance of 94.12% test accuracy and an area under the curve (AUC) score of 97.79 compared to existing methods. Our results show that the fusion results generally outperformed the unimodal results. The saliency visualizations also show significant differences in the hippocampus, amygdala, putamen, caudate nucleus, and regions of basal ganglia which are in line with the previous neurobiological literature. Our research offers a novel method to enhance our grasp of AD pathology. By integrating data from various functional networks with structural MRI insights, we significantly improve diagnostic accuracy. This accuracy is further boosted by the effective visualization of this combined information. This lays the groundwork for further studies focused on providing a more accurate and personalized approach to AD diagnosis. The proposed framework and insights gained from fMRI networks provide a promising avenue for future research in deep multimodal fusion and neuroimaging analysis.

Introduction

There is a dynamic loop between brain structure and brain function which comprises the neural basis of cognition, learning and plasticity (V. Calhoun, 2018; Segall et al., 2012; Sui et al., 2014; Zatorre et al., 2012). As individuals undergo the natural process of aging, the human brain experiences alterations in structure and cognitive capabilities. Notably, rapid transformations in both structure and function, accompanied by symptoms such as dementia, cognitive decline, and the presence of biomarkers such as tau and beta-amyloid deposition, signify the onset of Alzheimer’s Disease (AD) (Beason-Held et al., 2013; Hersi et al., 2017). However, relying solely on tau and β-amyloid levels for predictions related to AD or cognitive outcomes is insufficient for accurate diagnosis and prognosis (Khoury & Ghossoub, 2019), (Hardy & Higgins, 1992; J. C. Lee et al., 2019). The degeneration of the brain associated with Alzheimer’s disease commences years before the manifestation of clinical symptoms. Recent advancements in positron emission tomography (PET) used to visualize the deposition of amyloid and tau have enhanced our ability to detect Alzheimer’s disease during its preclinical and prodromal stages. Issues arise as this approach is encumbered by high costs, ionizing radiation, and necessitates specialized tracers and equipment (Hansson et al., 2019). Given that Alzheimer’s disease affects a significant portion of the global population, early and precise AD diagnosis is imperative to facilitate timely interventions and enhance patient well-being (Cummings et al., 2022; Dubois et al., 2021; L.-K. Huang et al., 2020). To develop effective and clinically relevant early detection methodologies, it is vital to distinguish prodromal AD, which is often studied through the lens of mild cognitive impairment (MCI) (Liss et al., 2021; Sabbagh et al., 2020; Wee et al., 2012). MCI is characterized by individuals affected by dementia with high propensity to progress to more severe forms of dementia. The Alzheimer’s Disease Neuroimaging Initiative (ADNI) has generated and made publicly available a wide array of data, including imaging, clinical, genetic, and various biomarkers, from individuals categorized as being cognitively normal (CN), MCI and AD. ADNI is a multisite study which uses standardized protocols and quality control measures to minimize variations across sites and enhance the reliability of neuroanatomical change detection (Jack Jr. et al., 2008). The study collected both structural magnetic resonance imaging (sMRI) and resting-state functional magnetic resonance imaging (rs-fMRI) data, however most prior work has focused on sMRI and there is much less work focused on joint analysis of sMRI and fMRI data. Ergo, there is untapped potential for harnessing information provided by both sMRI and rs-fMRI for the purpose of differentiating AD, CN and MCI.

Structural imaging techniques, exemplified by the analysis of voxelwise gray matter volume, have showcased their efficacy in identifying AD-related anatomical modifications (Dukart et al., 2013). In parallel, rs-fMRI has been studied as a way to elucidate the functional brain changes associated with AD (Agosta et al., 2012). Prior studies have focused on features such as amplitude of low-frequency fluctuation (ALFF) and fractional amplitude of low-frequency fluctuation (fALFF) to capture spatial patterns within rs-fMRI data (Abrol, Hassanzadeh, et al., 2021; Fedorov et al., 2021), (Turner et al., 2013). ALFF, while useful, has the drawback that it collapses all the information into a spectral power band, thus losing information about widely studied brain networks though some information is still retained (V. D. Calhoun & Allen, 2013) and it can be more sensitive to artifacts that impact the subcortical and periventricular regions disproportionately (Zuo et al., 2010). Overall, structural measures tend to show more sensitivity to AD related changes and are much more widely used as a result. However, no work to our knowledge has evaluated the use of input feature which preserve resting brain networks at the voxelwise level.

In this work we explore novel avenues for representing and synthesizing spatial information from intrinsic networks extractive from rs-fMRI datasets and combining it with gray matter maps. The progression of AI-driven solutions within clinical neuroimaging and the broader healthcare landscape provides the promise of more flexible and powerful models. Despite the emergence of innovative models, most studies have overlooked the integration of multimodal and data fusion techniques (V. D. Calhoun & Sui, 2016; Sui et al., 2012, 2014). Furthermore, investigations into functional dynamics, particularly concerning rs-fMRI datasets, for the comprehension and representation of spatial information, have been limited (for a comprehensive review please see the following(Hutchison et al., 2013; Keilholz et al., 2017; Lurie et al., n.d.)). This gap signifies missed opportunities on multiple fronts: (i) Converting and summarizing functional insights from rs-fMRI data into structural maps, and (ii) Employing deep multimodal neural architectures for the classification of AD, MCI, and CN subjects, along with the utilization of interpretability methods to discern influential features that impact model predictions.

In this study, we introduce an innovative deep learning framework based on a 3D convolutional neural network (CNN) architecture, drawing inspiration from the well-established AlexNet (Abrol, Fu, et al., 2021). Our model exhibits the capability to effectively differentiate between AD, MCI and CN subjects, employing the ADNI dataset (total subjects = 730). Our methodology is designed to harness complementary insights from both modalities: it exploits functional connectivity patterns derived from rs-fMRI and anatomical characteristics obtained through the analysis of gray matter volume. Through voxelwise fusion of these modalities, we reinforce the discriminative capacity of our classification model, consequently enhancing the accuracy of AD diagnosis. Our approach hinges on voxel-wise data fusion of rs-fMRI networks and gray matter volume, facilitated by multimodal deep learning. Rigorous cross-validation experiments substantiate our method’s performance is favorable against existing techniques. Our proposed approach attains an impressive test accuracy of 94.12% when distinguishing between AD and CN within the ADNI dataset (total subjects = 466). Notably, our deep learning model exhibits high accuracy and robustness across multiple experimental scenarios. One of main areas of study in fMRI is focused on resting networks, each of which produced a spatial map of connectivity (functional coupling) at each voxel. However most approaches do not leverage these multiple networks. Here, we introduce a way to incorporate the most important information across a large number of resting networks via creating voxel-wise intensity projection (VIP) networks, generated via independent component analysis (ICA) i.e., ICA followed by VIP i.e., iVIP, compared to conventional metrics such as ALFF and fALFF for AD classification. Moreover, our results showcase the enriched and informative spatial patterns offered by the representation of our fMRI networks, facilitating the precise localization of AD-related alterations. Our findings underscore the substantial promise of CNN-based models for automating early Alzheimer’s disease diagnosis across various stages, signifying the potential to deliver valuable insights into the intricate neurobiological mechanisms underpinning AD. This, in turn, may contribute to early and accurate diagnosis, and thus pave the way for personalized treatment strategies.

Methods

Previous studies (Egorova et al., 2017; Reyes et al., 2016; Turner et al., 2013 -j. Wang et al., 2016; Zou et al., 2008) have used measures such as ALFF and fALFF for multimodal deep learning. In our paper, we present a new technique named iVIP. This method calculates a voxel-by-voxel summary of resting fMRI networks, similar to those derived through Independent Component Analysis (ICA). We then explore the potential for using iVIP for classification purposes in the ADNI data, and compare sMRI only, fMRI only, and fused fMRI and sMRI scenarios. To do this we utilize a modified version of the 3D CNN AlexNet3D dropout architecture (Krizhevsky et al., 2017). We apply our approach to the ADNI dataset. We perform a two-way (AD vs CN) and three-way (AD vs MCI vs CN) classification task and several experiments to fuse the sMRI and iVIP modalities in our deep learning architecture. In addition, to visualize and determine the relevant brain regions that the network uses for classification, we compute saliency maps from the deep learning model and investigate voxelwise differences between classes. Figure 1 depicts an overview of the proposed approach for performing multimodal deep learning classification using the ADNI dataset.

Figure 1.

Figure 1

An overview of the data processing and prediction pipeline for the unimodal and multimodal analysis that was conducted in our study. We used the sMRI and rs-fMRI from the ADNI dataset and preprocessed sMRI using SPM 12 and used GIFT toolbox for computing ICNs. After that we computed iVIPs from the ICNs and used them as a different modality with sMRI for our classification predictions. We did a two-way (AD vs CN) and three-way classification (AD vs MCI vs CN). We then compared our results quantitatively as well as using saliency maps for all the experiments.

Data

The data utilized in this study comprises imaging and diagnostic information sourced from the Alzheimer’s Disease Neuroimaging Initiative (ADNI). The research protocols of the ADNI study were subject to approval by the institutional review boards at all participating centers, and this approval process is meticulously detailed in the official document, which can be accessed via this link: ADNI Approval Documentation.

Prior to inclusion in the study, all individuals provided written, informed consent in full compliance with the ethical principles set forth in the Declaration of Helsinki. Furthermore, this study received explicit approval from the institutional review boards at each research site that was involved in the investigation.

It is imperative to note that all research methodologies adhered rigorously to the established guidelines and ethical standards. The sMRI scans (specifically T1 MRIs) and rs-fMRI scans were procured from the ADNI data portal (https://adni.loni.usc.edu/). For the purposes of this study, sMRI and rs-fMRI data obtained from the ADNI dataset were exclusively derived from the initial visit scans, encompassing a total of 730 subjects. This cohort consisted of 383 individuals classified as CN, 83 subjects diagnosed with AD, and 264 individuals with MCI. The Table 1 summarizes the ADNI data used for two-way vs three-way classification as well as notes the ratio of male and female (M/F) participants used in the study.

Table 1.

Summary of the ADNI data used for classification. 8-fold cross-validation was used for both two-way and three-way classification.

#Subjects M/F Total subjects AD MCI CN
Two-way classification (AD vs CN) 188/278 466 83 - 383
Three-way classification (AD vs MCI vs CN) 330/400 730 83 264 383

Data Preprocessing

In this study, we utilized sMRI and rs-fMRI data from the ADNI dataset. We focused on the initial visit scans of subjects for whom sMRI, rs-fMRI, and diagnosis or research group information were available and utilized for the subsequent analysis. Data were first preprocessed as in prior work (Abrol et al., 2020), briefly summarized below.

sMRI preprocessing:

The preprocessing of the sMRI data commenced with spatial normalization and segmentation of tissue probability maps. This segmentation specifically targeted gray matter, white matter, and cerebral spinal fluid, employing the modulated normalization algorithm within SPM 12. Subsequently, the gray matter images underwent transformation to a standardized space, modulation, and were subjected to smoothing using a Gaussian kernel featuring a full width at half maximum (FWHM) of 6 mm. The resultant preprocessed gray matter volume images were characterized by voxel dimensions of 121 × 145 × 121 and a voxel size of 1.5 × 1.5 × 1.5 mm³ in spatial terms.

To ensure data quality, we conducted a thorough quality control (QC) assessment of the preprocessed sMRI and fMRI datasets. This included the exclusion of images exhibiting low correlation with individual and/or group level masks. For the fMRI data, images manifesting high head motion were meticulously discarded to eliminate potential confounding effects on functional connectivity.

fMRI preprocessing:

The preprocessing of the fMRI data was executed through an SPM12 pipeline encompassing rigid body motion correction, slice-timing correction, warping to the standard MNI space employing the EPI template, resampling to isotropic voxels of (3mm)³, and subsequent Gaussian smoothing with a FWHM of 6mm. Leveraging fully automated spatially constrained ICA (scICA) with the Neuromark_fMRI_1.0 template (Du et al., 2020), we obtained 53 spatially independent components characterized by a high correlation threshold for multi-dataset alignment. These components were pre-computed and made accessible via the GIFT toolbox (https://trendscenter.org/software/gift/).

All maps, encompassing sMRI and rs-fMRI processed data, were transformed to the standard MNI space. Subsequently, they were resampled to isotropic voxels of (3mm)³, yielding a consistent voxel dimension of 53 × 63 × 52 voxels within each map. Gaussian smoothing, featuring a FWHM of 6mm, was applied as the final step in the data preparation pipeline.

We estimated 53 intrinsically connected networks (ICNs) via the scICA Neuromark_fMRI_1.0 template followed by computation of the iVIPs. The ICNs are subject specific multi-network ICA networks reflecting synchronized activity (Du et al., 2020). To extract the most important features of the data, we computed the maximum value across the 53 ICNs for each voxel, representing it as max(ICN). Additionally, we computed both the absolute minimum and maximum of the absolute values of the 53 ICNs for each voxel, denoted as abs(min(ICN)) and max(abs(ICN)), respectively. These post-processed 3D iVIP images were employed as input, either unimodally or multimodally, in our neural network model.

In order to benchmark our novel representation of rs-fMRI spatial maps, we conducted a comparative analysis with the widely used ALFF and fALFF approaches, using the preprocessing approach from this previous work (Abrol, Hassanzadeh, et al., 2021). These traditional metrics have been recognized for their data reduction capabilities while preserving voxel dimensionality (Abrol, Hassanzadeh, et al., 2021; V. Calhoun, 2018; Turner et al., 2013).

Multimodal Deep Convolution Neural Network

A multi-channel version of a 3D CNN inspired from AlexNet (Krizhevsky et al., 2017) was used for classification tasks. Our model is a specialized deep learning architecture tailored for 3D multimodal deep learning which has been designed to address the heightened challenges posed by the curse of dimensionality prevalent in neuroimaging datasets (Alzubaidi et al., 2021; Singh et al., 2020). This model features a sequence of 3D convolutional layers designed to adapt to diverse data modalities, batch normalization for training stability, rectified linear units (ReLU) for nonlinear feature extraction, and adaptive average pooling for spatial adaptability. We have also incorporated dropout layers to prevent overfitting. The model takes three-dimensional data, fMRI or sMRI images, as input, represented as a sequence of 3D volumes. The convolutional layers capture spatial patterns and features in the input volumes at different levels of abstraction. The model progressively reduces the spatial dimensions of the input through max pooling operations. Dropout regularization is applied to mitigate overfitting (Srivastava et al., 2014). The output of the convolutional layers is flattened and passed through a fully connected classifier. The classifier consists of two fully connected layers with ReLU activation and dropout, followed by a final linear layer that produces the model’s predictions. The significance of this model lies in its remarkable aptitude for both two-way (AD vs CN) and three-way classification tasks (AD vs MCI vs CN). Moreover, the final layers of the model include fully connected layers, allowing for the translation of feature representations into classification or regression predictions. The architecture can be customized to accommodate the number of modalities, classes, and channels.

Experiments

For unimodal and multimodal deep learning, we performed three experiments.

Experiment 1:

We used a single channel CNN with one modality at a time – i.e., sMRI or max(ICN) or abs(min(ICN)) or max(abs(ICN)). We also did a unimodal comparison with the existing state-of-the art rs-fMRI spatial map representation (such as ALFF and fALFF) with the iVIP. The inputs for the 1 channel CNN are visualized in Fig. 2.

Figure 2.

Figure 2

Processed sMRI and rs-fMRI which were used as the inputs for the 1 and 2 channel CNN in this study. This is a montage representation of the inputs that were used in Experiment 1 and Experiment 2. Note that the iVIP is a general/umbrella term for max(ICN), abs(min(ICN)) or max(abs(ICN)).

Experiment 2:

We used a two-channel CNN with one modality (iVIP or sMRI) in each channel. Thus, we had channel 1 as the sMRI and channel 2 included either max(ICN) or abs(min(ICN)) or max(abs(ICN)). The different inputs for the 2 channel CNN are depicted in Fig. 2.

Experiment 3:

We used a three channel CNN with fused modalities. Channel 1 was the square of sMRI, channel 2 was the product of the two modalities i.e., sMRI * iVIP whereas channel 3 was the square of the iVIP modality. Note here the iVIP modality was either max(ICN), abs(min(ICN)) or max(abs(ICN)). The inputs in our different 3 channel CNN are visualized in Fig. 4.

Figure 4.

Figure 4

Inputs for 3 channel 3D CNN. These are the different inputs which were used as different channels in the 3 channel CNN. We used sMRI*sMRI as channel 1, sMRI*iVIP as channel 2 and iVIP*iVIP as the channel 3. Note iVIP varied in each case between max(ICN), abs(min(ICN)) or max(abs(ICN)).

DL training

We conducted training and testing procedures for the deep learning architectures on an NVIDIA CUDA parallel computing platform. This platform utilized two Intel(R) Xeon(R) Gold 6230 CPUs operating at 2.10 GHz, and it was hosted on the TReNDS slurm-managed cluster. This cluster featured nodes equipped with 4 NVIDIA Tesla V100 SXM2 32 GB GPUs, enabling efficient parallel processing. We leveraged the GPU-accelerated NVIDIA CUDA toolkit (cudatoolkit), CUDA Deep Neural Network (cudnn), and PyTorch tensor libraries to enhance computational performance. For optimizing the models, we adopted the Adam algorithm, as implemented in the torch.optim package. This choice was motivated by its computational efficiency, minimal memory requirements, and suitability for tasks involving high-dimensional parameter spaces. In our custom code, we also made use of several essential packages, including nipy, scipy, numpy, nibabel, and pandas. These packages played crucial roles in basic image processing and read-write operations. In all experiments, a learning rate scheduler callback was employed. It dynamically reduced the learning rate by a factor of 0.5 when the validation accuracy metric reached a plateau. To mitigate overfitting and enhance generalization performance in the testing phase, early stopping was implemented. It had a patience level of 20 epochs for the classification tasks and 40 epochs for the regression tasks. Cross-entropy loss function was used for classification. To ensure a fair and consistent comparison of time complexity in the deep learning runs, each run was allocated a maximum of 8 CPU threads.

We performed 8-fold cross-validation (CV) for each experiment. Splitting of the train and test set was done separately for two-way and three-way classification using the stratified Monte Carlo i.e., randomized and repetitive sub-sampling CV. The data for the binary classification (AD vs CN) was partitioned into training (350 scans), validation (58 scans), and test sets (58 scans). For the three-way classification (AD vs MCI vs CN), 540 scans were allocated to the training set, while both the test and validation cohorts received 95 scans each. We also ensured stratification of the data during train and test splitting based on the two (AD vs CN) and three (AD vs MCI vs CN) classes respectively. For each iteration, the training, validation, and test samples were selected exactly once, ensuring consistency in the comparison by maintaining their uniformity across various experiments. In each repetition, hyperparameter tuning was conducted using the validation dataset, and the reported performance metrics were assessed using the independent held-out test set. A mask was computed for each subject with a threshold of sMRI voxels > 0.03. This mask was used for the same subject’s sMRI as well as iVIP. After masking we applied min-max normalization to ensure that each CNN channel has values between [0,1]. Hence the major difference between the maps is that the CN have a higher maximum value than the AD. For the Experiment 3 where we computed product, square; we ensured that the min-max normalization was done (as it ensures equal distribution of the modalities in the result) after the computation of each modality as well as after computing the product and square of the fused modalities. All the three experiments were computed separately for two (AD vs CN) and three (AD vs MCI vs CN) way classification.

Hyperparameters

Based on prior work (Abrol, Fu, et al., 2021), We performed an exhaustive hyperparameter tuning for different learning rate (0.1, 0.01, 0.001, 0.0001 and 0.00001), batch sizes (4, 8, 16, 32, 64) as well as combination of kernel size-stride length-padding (5–2-0 or 3–2-1). We tuned values for each experiment and reported the quantitative results for the optimized hyperparameters for each experiment.

Saliency and voxel-wise statistical analysis

Saliency maps were generated for individual test subjects through the application of guided back-propagation (Simonyan et al., 2014) on the trained neural network architecture. In the context of multimodal architectures, it’s essential to emphasize that these saliency maps are computed separately for each input feature map. To facilitate a comparative analysis of saliency the maps derived from all the subjects in each experiment were smoothed with a FWHM = 10 and then averaged. This procedure was carried out for both two-way and three-way classification.

Results

Our study leveraged sMRI and rs-fMRI data from the ADNI dataset each associated with one of three cognitive status categories: AD or MCI or CN. To facilitate classification, two distinct tasks were undertaken: a binary classification (AD vs. CN) and a ternary classification (AD vs. MCI vs. CN) using our 3D CNN architecture. A total of 466 subjects were used for the binary classification. For the three-way classification, a total of 730 subjects were used. Importantly, only a single scan per subject was utilized in both binary and ternary classification analyses. Please refer to Table 1 for more details about the data. The study employed a specialized 3D CNN model tailored for multiclass classification, designed to excel in distinguishing AD, MCI or CN statuses based on sMRI and fMRI data, serving as the primary tool for predictive performance and subsequent analyses.

ALFF/fALFF vs iVIP

Our proposed approach utilized a novel feature called iVIP which used intensity projection to create a voxelwise summary of 53 spatial ICN networks using max(ICN) or abs(min(ICN)) or max(abs(ICN)) was compared with the widely used ALFF/fALFF methods. Table 2 shows the test accuracy as well as balanced test accuracy comparison over an 8-fold cross validation as a part of the Experiment 1. We used a repeated stratified sub-sampling procedure for these unimodal comparisons. The Table 2 clearly indicates that the iVIP outperforms the ALFF and fALFF measures quantitatively. We note these findings, particularly from the results of the two-way classification that the performance of the iVIP (max(ICN) or abs(min(ICN)) or max(abs(ICN))), is higher than the corresponding ALFF/fALFF results. The same 3D CNN architecture was used for the one channel classification and the hyperparameters were tuned separated for each modality (iVIP or fALFF or ALFF).

Table 2.

Comparing prediction accuracies of our novel representation (iVIP) with the established ALFF and fALFF computations for the rs-fMRI dataset. The values in the table are represented in the format: mean ± standard deviation. The maximum values across different fMRI measures (i.e., for each column) for several accuracy or AUC scores are bold.

Two-way classification (AD vs CN) Three-way classification (AD vs MCI vs CN)
Different fMRI measures Test Accuracy Balanced Test Accuracy AUC Test Accuracy Balanced Test Accuracy AUC
ALFF 80.25 ± 0.35 64.44 ± 1.6 70.01 ± 0.19 50.01 ± 0.13 36.12 ± 0.21 49.96 ± 0.39
fALFF 80.8 ± 0.5 60.71 ±3.5 71.96 ± 0.3 46.95 ± 0.24 40.82 ± 0.25 52.13 ± 0.01
max(ICN) 84.81 ± 0.2 71.6 ± 0.95 83.58 ± 0.53 51.38 ± 0.2 40.97 ± 0.15 59.83 ± 0.04
abs(min(ICN)) 84.05 ± 0.18 66.6 ± 0.63 79.69 ± 0.29 49.96 ± 0.21 38.75 ± 0.15 60.26 ± 0.09
max(abs(ICN)) 85.02 ± 0.17 71.47 ± 0.64 83.4 ± 0.49 50.94 ± 0.17 40.72 ± 0.15 60.96 ± 0.9

Results (Table 2) also showed that the DL models from iVIP outperform the corresponding ALFF/fALFF models with the highest test accuracy of 85.02% by max(abs(ICN)) for the AD vs CN classification and 51.38% by max(ICN) for the AD vs MCI vs CN classification. We also note that the AUC value for two-way and three-way classification is higher for all the iVIPs than the corresponding ALFF/fALFF AUC. The highest AUC for two-way classification was noted by the max(ICN) at a value of 83.58 whereas for the three-way classification we observe that max(abs(ICN)) with a value of 60.96 outperforms the other modalities. A chi-squared test on the test accuracies showed, for all different fMRI measures, iVIP was significantly higher than ALFF/fALFF (iVIP vs p-ALFF = 0.000008 and iVIP vs p-fALFF = 0.000001). This was also the case for the max(ICN) vs ALFF/fALFF (p = 0.000048) as well as for abs(min(ICN)) vs ALFF/fALFF (p < 0.000006). For the three-way classification i.e., AD vs MCI vs CN, iVIP (only max(ICN) and max(abs(ICN))) also significantly outperforms fALFF/ALFF measures (p = 0.000001). Whereas on performing the chi-squared test on the AUC values for the iVIP vs respective fALFF/ALFF values, we determined iVIP outperformed ALFF/fALFF measures.

Based on our quantitative comparisons we find that the proposed iVIP measures sigificantly outperforms ALFF/fALFF for two-way and three-way classification. iVIP’s test and balanced accuracy scores are higher or similar to ALFF’s in some cases but overall, we report a significant difference in the AUC values. Thus, we used iVIP measures in our further multimodal analysis of Experiment 2 and Experiment 3.

Quantitative analysis: performance comparison

We performed two-way and three-way classification for Experiment 2 and 3. iVIP was used in the multimodal models which had sMRI and rs-fMRI data. We used a single channel 3D CNN network for Experiment 1, and 2-channel and 3-channel 3D CNN networks for Experiment 2 and 3, respectively. Our Experiment 2 results are summarized under the rows labeled ‘sMRI + iVIP’ whereas our Experiment 3 results are summarized under the rows labeled as ‘Fused’. Tables 3 and 4 summarizes our findings for two-way and three-way classification results respectively.

Table 3.

Performance for the two-way classification experiments (AD vs CN) with 466 subjects

Test Accuracy Balanced Test Accuracy F1 Precision Recall AUC
sMRI 91.18 ± 0.13 86.36 ± 0.69 76.67 ± 1.15 76.31 ± 2.28 81 ± 2.85 96.26 ± 0.09
max(ICN)
iVIP 84.81 ± 0.2 71.6 ± 0.95 49.91 ± 2.2 53.41 ± 5.28 52.3 ± 3.99 83.58 ± 0.53
sMRI + iVIP 91.88 ± 0.18 84.28 ± 0.9 76.37 ± 1.6 77.28 ± 2.62 80.23 ± 3.6 96.88 ± 0.08
Fused 92.38 ± 0.18 86.09 ± 0.64 77.74 ± 1.54 74.37 ± 3.15 81.61 ± 2.34 96.9 ± 0.18
abs(min(ICN))
iVIP 84.05 ± 0.18 66.6 ± 0.63 42.22 ± 2.5 48.71 ± 5.2 41 ± 2.8 79.69 ± 0.29
sMRI + iVIP 93.31 ± 0.11 88.43 ± 0.67 78.52 ± 0.88 79.99 ± 1.5 80.4 ± 2.86 97.11 ± 0.1
Fused 94.12 ± 0.13 88.29 ± 0.64 78.75 ± 1.65 81.36 ± 3.4 83.63 ± 2.69 97.79 ± 0.18
max(abs(ICN))
iVIP 85.02 ± 0.17 71.47 ± 0.64 49.34 ± 1.25 54.37 ± 3.45 51.15 ± 3.26 83.4 ± 0.49
sMRI + iVIP 91.81 ± 0.25 83.23 ± 1.03 72.1 ± 1.96 79.53 ± 2.69 69.64 ± 3.85 95.59 ± 0.15
Fused 92.74 ± 0.13 85.06 ± 0.56 74.84 ± 1.24 80.56 ± 2.7 73.47 ± 2.31 95.02 ± 0.2

Note: In Table 3 and 4, ‘Fused’ indicates a combined version of sMRI and iVIP in the following way - sMRI*sMRI + sMRI * iVIP + iVIP * iVIP. Note that different channels in the CNN are separated by a ‘+’ sign. The values in the table are represented in the format: mean ± standard deviation. The maximum values across different models (i.e., for each column) for accuracy, F1, precision, recall, AUC scores are bold.

Table 4.

Performance for the three-way classification experiments (AD vs MCI vs CN) for 730 subjects

Test Accuracy Balanced Test Accuracy F1 Precision Recall AUC
sMRI 58.08 ± 0.09 54.85 ± 0.19 56.70 ± 0.2 63.16 ± 0.33 54.85 ± 0.19 62.26 ± 0.63
max(ICN)
iVIP 51.38 ± 0.2 40.97 ± 0.15 39.81 ± 0.26 51.51 ± 1.73 40.97 ± 0.16 59.83 ± 0.04
sMRI + iVIP 60.31 ± 0.06 55.39 ± 0.24 56.09 ± 0.2 63.59 ± 0.25 55.39 ± 0.24 66.69 ± 0.11
Fused 60.72 ± 0.19 56.95 ± 0.22 57.70 ± 0.24 64.79 ± 0.36 55.95 ± 0.22 66.96 ± 0.19
abs(min(ICN))
iVIP 49.96 ± 0.21 38.75 ± 0.15 37.32 ± 0.21 51.31 ± 1.96 38.75 ± 0.15 60.26 ± 0.09
sMRI + iVIP 62.68 ± 0.13 58.65 ± 0.32 58.28 ± 0.37 65.36 ± 0.52 58.65 ± 0.33 66.56 ± 0.19
Fused 62.55 ± 0.15 58.78 ± 0.34 59.10 ± 0.43 63.37 ± 0.49 56.78 ± 0.36 67.06 ± 0.21
max(abs(ICN))
iVIP 50.94 ± 0.17 40.72 ± 0.15 39.56 ± 0.23 50.78 ± 1.63 40.72 ± 0.16 60.96 ± 0.9
sMRI + iVIP 60.5 ± 0.15 51.73 ± 0.21 56.09 ± 0.25 64.71 ± 0.37 55.73 ± 0.21 67.26 ± 0.63
Fused 60.46 ± 0.17 51.55 ± 0.29 57.25 ± 0.32 63.19 ± 0.45 57.56 ± 0.2 68.31 ± 0.01

Results show unimodal sMRI outperforms unimodal iVIP, and, importantly, the fused models outperform both unimodal models. That is, sMRI + iVIP as well as the fused multimodal model (sMRI*sMRI + sMRI * iVIP + iVIP * iVIP) perform the best. The highest performing test accuracy (94.12%) for the two-way classification is from the fused model with abs(min(ICN)). The fused model with abs(min(ICN)) test accuracy is also significantly higher than the corresponding unimodal measures (p < 0.05).

Our results from Table 3 also show that the fused models are significantly better (p < 0.05) than the corresponding unimodal models for two-way classification in all cases. We also note that the AUC value of 97.79, is the highest for the fused model of the abs(min(ICN)).

Similar to Table 3, we observe that the unimodal sMRI performs better than the unimodal iVIP, but we see a significant increase in the results from the multimodal models i.e., sMRI + iVIP and the fused multimodal model (sMRI*sMRI + sMRI * iVIP + iVIP * iVIP). The highest performing test accuracy (62.68%) for the three-way classification is from the multimodal model i.e., sMRI + abs(min(ICN)). Statistical significance analysis shows that that for the 730 subjects in the three-way classification, all our Experiment 3 results i.e., the ‘fused’ model, are significantly better than the corresponding unimodal models based on their test accuracy scores (p < 0.05). We also note that all our multimodal model’s AUC scores are significantly higher than the unimodal model’s scores with the highest AUC as 68.31 for the three-way classification using the fused model with the max(abs(ICN)).

Qualitative Analysis: Visualization of Saliency Maps

We extracted the saliency maps, using guided back-propagation on a trained neural network, for individual test subjects. We visualised these saliency maps after smoothing, averaging and thresholding to enable a comparative analysis in the context of different experiments for both two-way (Fig. 5) and three-way (Fig. 6) classification experiments.

Figure 5.

Figure 5

Visualisation of saliency maps for two-way classification (AD vs CN). A) Experiment 1 i.e only one modality (sMRI or max(ICN) or abs(min(ICN)) or max(abs(ICN))) was used at a time for predicting the classes. B) Experiment 2 i.e two modalities were used at the same time for predicting the classes via a 2-channel 3D CNN. The first channel was the sMRI modality. The second channel varied for different iVIP. C) Experiment 3 saliency maps i.e from a 3-channel 3D CNN where each channel as sMRI*sMRI or sMRI*iVIP or iVIP*iVIP. Note: All the maps were constructed from the saliency values after masking, smoothing and averaging them.

Figure 6.

Figure 6

Visualisation of saliency maps for three-way classification (AD vs MCI vs CN). A) In Experiment 1, individual modalities such as sMRI, max(ICN), abs(min(ICN)), or max(abs(ICN)) were utilized independently to predict classes. B) Experiment 2 involved using two modalities simultaneously in a 2-channel 3D CNN. The first channel represented the sMRI modality, while the second channel varied based on different iVIP. C) In Experiment 3, saliency maps were generated using a 3-channel 3D CNN. Each channel represented combinations such as sMRI*sMRI, sMRI*iVIP, or iVIP*iVIP. All saliency maps were obtained after masking, smoothing, and averaging.

Age-related alterations impact diverse brain systems, influencing memory and cognitive control. Our saliency maps from the two-way classification (Fig. 5) indicate that in Experiment 1 and 2, the sMRI maps show the areas of hippocampus, amygdala, and caudate nucleus. We see that from the visualization of max(ICN) we see major differences in the cingulate cortex, thalamus and caudate nucleus; whereas the abs(min(ICN)) show differences in the thalamus, cingulate cortex and caudate nucleus. The max(abs(ICN)) only shows major differences in the thalamus and caudate nucleus. But for the three-way classification results besides the same regions from the two-way classification for Experiment 1 and 2, we observe additional regions with major differences. For sMRI the cingulate cortex shows variations but major differences in the putamen, caudate nucleus and globus pallidus are observed for the abs(min(ICN)) and max(abs(ICN)) modality. For experiment 3, in two-way classification results we observe variations in the hippocampus, caudate nucleus, subthalamic nucleus, substantia niagra, amygdala, putamen, and thalamus for the three channels (i.e sMRI*sMRI, sMRI*iVIP and iVIP*iVIP). Whereas for the three-way classification results we observe that the major differences are from hippocampus from all the three channels followed by thalamus, amygdala and subthalamic nucleus.

Discussion

Our research work provides a novel method for the representation of the 4D rs-fMRI dataset in spatial maps which summaries the different functional networks (ICNs) using the iVIP, followed by a unimodal or multimodal deep 3D CNN classifier. The proposed iVIP approach for voxel-wise intensity projections of the ICNs summarizes the spatial information from the 4D rs-fMRI dataset using sc-ICA. After, iVIP the max(ICN), abs(min(ICN)) and max(abs(ICN)) basically were computed to identify the voxels with the most activating ICN, most absent or least activating ICN, most extreme ICN which is regardless of its direction. For two-way as well as three-way classification, the test accuracy as well as AUC scores for iVIP were the highest. Our quantitative results evidently indicated that the iVIP method outperformed previously and more commonly used method for spatial map representation i.e., ALFF and fALFF. Using iVIP we are able to summarize the most important contribution of the resting networks, hence we further used the iVIP representation along with sMRI, unimodally as well as multimodally to classify the research group categories (AD vs CN or AD vs MCI vs CN) from the ADNI dataset.

Most of the best performing models from previous work pertaining image classification using MRI datasets used AlexNet architecture (Abrol, Fu, et al., 2021; Naz et al., 2022; Rahim et al., 2023), hence we used our model which was an adapted version of the AlexNet model. We performed several unimodal as well as multimodal classification experiments with fused modalities as well as multichannel 3D CNN representation to predict the research classes. Quantitative results (e.g., test accuracy scores) show that the fused multimodal model from Experiment 3 performed significantly better than the respective unimodal models for two-way classification (AD vs CN) as well as three-way classification (AD vs MCI vs CN). Our best performing model from the two-way classification had the highest test accuracy of 94.12% and AUC of 97.79 from the fused model. The highest accuracy for the three-way classification results was 62.68% with an AUC of 68.31 from the fused model. Our fused models allow us to incorporate the most critical resting state networks fused with the sMRI. Thus, our results show a high classification accuracy with multimodal models generally outperforming the unimodal models.

Aging affects regions such as the hippocampus, putamen, middle temporal, post-central, superior, and mid-frontal gyri, consistent with prior research on age-related changes in gray matter volume and functional connectivity (Salami et al., 2012). Our saliency results for sMRI indicate major differences in the hippocampus, subthalamic nucleus, amygdala, and caudate nucleus which is in line with the neurobiological literature (de Jong et al., 2008; Heimer, 1983; Selden et al., 1994; Z. Wang et al., 2016). Whereas our iVIP modality shows major differences in the cingulate cortex, subthalamic nucleus, substantia niagra, thalamus, caudate nucleus, putamen, globus pallidus which is also in line with the literature for functional changes in the brain due to AD (Amanzio et al., 2011; Botzung et al., 2019; de Jong et al., 2008; Grahn et al., 2008; Heimer, 1983; C. Huang et al., 2002; P.-L. Lee et al., 2020; Lehéricy et al., 1991; Mattila et al., 2002; Selden et al., 1994). Also, we see that the fused model i.e., Experiment 3 for two-way classification gives insights into more relevant regions such as the subthalamic nucleus, substantia niagra, amygdala, putamen besides the other regions that are relevant for the diagnosis of AD but missed in the unimodal models or the non-fused models. Hence, on examining of saliency results, we demonstrate the potential of deep learning techniques to yield more nuanced and varied interpretations of significant high-dimensional neuroimaging features. In summary, our findings suggest that the patterns of discrimination captured by deep learning models in diagnostic classification align with established neurobiological knowledge regarding the brain regions that experience both structural and functional changes in Alzheimer’s disease.

Thus, our proposed multimodal disease classification framework based on 3D CNN in this study shows the potential for a unified, end-to-end multimodal fusion approach as our multimodal models significantly performed better than the corresponding unimodal models. Our approach is adaptable and could be used to train data, whether it’s raw or preprocessed, and features from all modalities. Our multimodal framework offers improved tuning and optimization of network weights compared to the unimodal training approach employed in the previous studies (Tajammal et al., 2023; Tufail et al., 2020). Nevertheless, our initial findings provide substantial evidence of the advantages of disease classification through multimodal fusion using deep learning, combining sMRI and iVIP spatial maps that summarize rs-fMRI ICN.

Our study has some limitations, firstly that the maximum number of subjects used for analysis was 730, which, while large enough to train the current models and provide some confidence in the results, it does not fully leverage the potential of nonlinear models. Our future research would hence aim at leveraging other AD datasets as well as incorporating multiple scans from the same subject. Secondly, we used voxel-wise saliency maps for visualization which are sometimes sensitive to the hyperparameters of the 3D neural network and the initial parameters of the network. Although our saliency maps were computed for each subject across several folds which would have combatted this limitation, in our future work we will employ multiple other alternative interpretability methods such as SHapley Additive exPlanations (SHAP) (Lundberg & Lee, 2017). A notable constraint in our investigation is the absence of a comparison among various deep learning architectures. Instead, we opted for a comparison with the widely acknowledged and high-performing AlexNet models from comparative literature which demonstrated best results with this model (Abrol, Fu, et al., 2021; Naz et al., 2022; Rahim et al., 2023). This decision was made to mitigate biases arising from diverse architectures and ensure a more robust evaluation of our results with the existing state-of-the-art architecture for the 3D image classification of the AD dataset. This provides us for an opportunity for future research which could encompass a comprehensive comparison of various deep learning architectures to determine the most suitable framework for diagnostic classification and predictive tasks in the realm of neuroimaging data analysis. Additionally, we are keen to explore the progression of AD using longitudinal datasets and expanding the scope by incorporating multiple AD datasets such as the Open Access Series of Imaging Studies (OASIS) (LaMontagne et al., 2019) in our future work. Although, our current study primarily focuses on classification applications, the adaptability of this framework to study and predict important continuous variables such as brain age and clinical assessments based on regression is readily feasible.

Figure 3.

Figure 3

Illustration of the multimodal 3D CNN architecture which we used in our analysis. The structural and functional features from Figure 2 were used as inputs for one channel or multichannel input in the network. The network would predict a class of the research group i.e., AD or CN or MCI depending on whether it was a two-way or three-way classification task.

Funding

This work was supported by NIH (Grant number: RF1AG063153) and NSF (Grant number: 2112455). Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12–2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.

Footnotes

Competing Interests

The authors have no relevant financial or non-financial interests to disclose.

Ethics approval

The research protocols of the ADNI study were subject to approval by the institutional review boards at all participating centers, and this approval process is meticulously detailed in the official document, which can be accessed via this link: ADNI Approval Documentation. Furthermore, this study received explicit approval from the institutional review boards at each research site that was involved in the investigation. It is imperative to note that all research methodologies adhered rigorously to the established guidelines and ethical standards.

Consent to participate

Prior to inclusion in the study, all individuals provided written, informed consent in full compliance with the ethical principles set forth in the Declaration of Helsinki.

Consent to publish

The authors affirm that human research participants provided informed consent.

Contributor Information

Vaibhavi S. Itkyal, Emory University

Anees Abrol, Georgia State University.

Theodore J. LaGrow, Georgia Institute of Technology

Alex Fedorov, Emory University.

Vince D. Calhoun, Georgia State University

Data Availability

The datasets used, processed, and analyzed during our study current study are available in the repository, Alzheimer’s Disease Neuroimaging Initiative (ADNI). The sMRI scans (specifically T1 MRIs) and rs-fMRI scans were procured from the ADNI data portal (https://adni.loni.usc.edu/). Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). For up-to-date information, see www.adni-info.org.

References

  • 1.Abrol A., Bhattarai M., Fedorov A., Du Y., Plis S., & Calhoun V. (2020). Deep residual learning for neuroimaging: An application to predict progression to Alzheimer’s disease. Journal of Neuroscience Methods, 339, 108701. 10.1016/j.jneumeth.2020.108701 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Abrol A., Fu Z., Salman M., Silva R., Du Y., Plis S., & Calhoun V. (2021). Deep learning encodes robust discriminative neuroimaging representations to outperform standard machine learning. Nature Communications, 12(1), Article 1. 10.1038/s41467-020-20655-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Abrol A., Hassanzadeh R., Plis S., & Calhoun V. (2021). Deep learning in resting-state fMRI. 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 3965–3969. 10.1109/EMBC46164.2021.9630257 [DOI] [PubMed] [Google Scholar]
  • 4.Agosta F., Pievani M., Geroldi C., Copetti M., Frisoni G. B., & Filippi M. (2012). Resting state fMRI in Alzheimer’s disease: Beyond the default mode network. Neurobiology of Aging, 33(8), 1564–1578. 10.1016/j.neurobiolaging.2011.06.007 [DOI] [PubMed] [Google Scholar]
  • 5.Alzubaidi L., Zhang J., Humaidi A. J., Al-Dujaili A., Duan Y., Al-Shamma O., Santamaría J., Fadhel M. A., Al-Amidie M., & Farhan L. (2021). Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. Journal of Big Data, 8(1), 53. 10.1186/s40537-021-00444-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Amanzio M., Torta D. M. E., Sacco K., Cauda F., D’Agata F., Duca S., Leotta D., Palermo S., & Geminiani G. C. (2011). Unawareness of deficits in Alzheimer’s disease: Role of the cingulate cortex. Brain, 134(4), 1061–1076. 10.1093/brain/awr020 [DOI] [PubMed] [Google Scholar]
  • 7.Beason-Held L. L., Goh J. O., An Y., Kraut M. A., O’Brien R. J., Ferrucci L., & Resnick S. M. (2013). Changes in Brain Function Occur Years before the Onset of Cognitive Impairment. The Journal of Neuroscience, 33(46), 18008–18014. 10.1523/JNEUROSCI.1402-13.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Botzung A., Philippi N., Noblet V., Loureiro de Sousa P., & Blanc F. (2019). Pay attention to the basal ganglia: A volumetric study in early dementia with Lewy bodies. Alzheimer’s Research & Therapy, 11(1), 108. 10.1186/s13195-019-0568-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Calhoun V. (2018). Data-driven approaches for identifying links between brain structure and function in health and disease. Dialogues in Clinical Neuroscience, 20(2), 87–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Calhoun V. D., & Allen E. (2013). Extracting intrinsic functional networks with feature-based group independent component analysis. Psychometrika, 78(2), 243–259. 10.1007/s11336-012-9291-3 [DOI] [PubMed] [Google Scholar]
  • 11.Calhoun V. D., & Sui J. (2016). Multimodal Fusion of Brain Imaging Data: A Key to Finding the Missing Link(s) in Complex Mental Illness. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 1(3), 230–244. 10.1016/j.bpsc.2015.12.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cummings J., Lee G., Nahed P., Kambar M. E. Z. N., Zhong K., Fonseca J., & Taghva K. (2022). Alzheimer’s disease drug development pipeline: 2022. Alzheimer’s & Dementia: Translational Research & Clinical Interventions, 8(1), e12295. 10.1002/trc2.12295 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.de Jong L. W., van der Hiele K., Veer I. M., Houwing J. J., Westendorp R. G. J., Bollen E. L. E. M., de Bruin P. W., Middelkoop H. A. M., van Buchem M. A., & van der Grond J. (2008). Strongly reduced volumes of putamen and thalamus in Alzheimer’s disease: An MRI study. Brain, 131(12), 3277–3285. 10.1093/brain/awn278 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Du Y., Fu Z., Sui J., Gao S., Xing Y., Lin D., Salman M., Abrol A., Rahaman M. A., Chen J., Hong L. E., Kochunov P., Osuch E. A., & Calhoun V. D. (2020). NeuroMark: An automated and adaptive ICA based pipeline to identify reproducible fMRI markers of brain disorders. NeuroImage: Clinical, 28, 102375. 10.1016/j.nicl.2020.102375 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Dubois B., Villain N., Frisoni G. B., Rabinovici G. D., Sabbagh M., Cappa S., Bejanin A., Bombois S., Epelbaum S., Teichmann M., Habert M.-O., Nordberg A., Blennow K., Galasko D., Stern Y., Rowe C. C., Salloway S., Schneider L. S., Cummings J. L., & Feldman H. H. (2021). Clinical diagnosis of Alzheimer’s disease: Recommendations of the International Working Group. The Lancet. Neurology, 20(6), 484–496. 10.1016/S1474-4422(21)00066-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Dukart J., Kherif F., Mueller K., Adaszewski S., Schroeter M. L., Frackowiak R. S. J., Draganski B., & Initiative, for the A. D. N. (2013). Generative FDG-PET and MRI Model of Aging and Disease Progression in Alzheimer’s Disease. PLOS Computational Biology, 9(4), e1002987. 10.1371/journal.pcbi.1002987 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Egorova N., Veldsman M., Cumming T., & Brodtmann A. (2017). Fractional amplitude of low-frequency fluctuations (fALFF) in post-stroke depression. NeuroImage: Clinical, 16, 116–124. 10.1016/j.nicl.2017.07.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Fedorov A., Sylvain T., Geenjaar E., Luck M., Wu L., DeRamus T. P., Kirilin A., Bleklov D., Calhoun V. D., & Plis S. M. (2021). Self-Supervised Multimodal Domino: In Search of Biomarkers for Alzheimer’s Disease. 2021 IEEE 9th International Conference on Healthcare Informatics (ICHI), 23–30. 10.1109/ICHI52183.2021.00017 [DOI] [Google Scholar]
  • 19.Grahn J. A., Parkinson J. A., & Owen A. M. (2008). The cognitive functions of the caudate nucleus. Progress in Neurobiology, 86(3), 141–155. 10.1016/j.pneurobio.2008.09.004 [DOI] [PubMed] [Google Scholar]
  • 20.Hansson O., Lehmann S., Otto M., Zetterberg H., & Lewczuk P. (2019). Advantages and disadvantages of the use of the CSF Amyloid β (Aβ) 42/40 ratio in the diagnosis of Alzheimer’s Disease. Alzheimer’s Research & Therapy, 11(1), 34. 10.1186/s13195-019-0485-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hardy J. A., & Higgins G. A. (1992). Alzheimer’s Disease: The Amyloid Cascade Hypothesis. Science, 256(5054), 184. [DOI] [PubMed] [Google Scholar]
  • 22.Heimer L. (1983). Basal Ganglia. In Heimer L. (Ed.), The Human Brain and Spinal Cord: Functional Neuroanatomy and Dissection Guide (pp. 199–209). Springer; US. 10.1007/978-1-4684-0150-9_15 [DOI] [Google Scholar]
  • 23.Hersi M., Irvine B., Gupta P., Gomes J., Birkett N., & Krewski D. (2017). Risk factors associated with the onset and progression of Alzheimer’s disease: A systematic review of the evidence. NeuroToxicology, 61, 143–187. 10.1016/j.neuro.2017.03.006 [DOI] [PubMed] [Google Scholar]
  • 24.Huang C., Wahlund L.-O., Svensson L., Winblad B., & Julin P. (2002). Cingulate cortex hypoperfusion predicts Alzheimer’s disease in mild cognitive impairment. BMC Neurology, 2(1), 9. 10.1186/1471-2377-2-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Huang L.-K., Chao S.-P., & Hu C.-J. (2020). Clinical trials of new drugs for Alzheimer disease. Journal of Biomedical Science, 27(1), 18. 10.1186/s12929-019-0609-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Hutchison R. M., Womelsdorf T., Allen E. A., Bandettini P. A., Calhoun V. D., Corbetta M., Della Penna S., Duyn J. H., Glover G. H., Gonzalez-Castillo J., Handwerker D. A., Keilholz S., Kiviniemi V., Leopold D. A., de Pasquale F., Sporns O., Walter M., & Chang C. (2013). Dynamic functional connectivity: Promise, issues, and interpretations. NeuroImage, 80, 360–378. 10.1016/j.neuroimage.2013.05.079 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Improved Detection of Substantia Nigra Pathology in Alzheimer’s Disease—Schneider Julie A., Bienias Julia L., Gilley David W., Kvarnberg David E., Mufson Elliott J., Bennett David A., 2002. (n.d.). Retrieved November 19, 2023, from 10.1177/002215540205000111 [DOI] [PubMed]
  • 28.Jack C. R. Jr., Bernstein M. A., Fox N. C., Thompson P., Alexander G., Harvey D., Borowski B., Britson P. J., Whitwell L., J., Ward C., Dale A. M., Felmlee J. P., Gunter J. L., Hill D. L. G., Killiany R., Schuff N., Fox-Bosetti S., Lin C., Studholme C., … Weiner M. W. (2008). The Alzheimer’s disease neuroimaging initiative (ADNI): MRI methods. Journal of Magnetic Resonance Imaging, 27(4), 685–691. 10.1002/jmri.21049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Keilholz S., Caballero-Gaudes C., Bandettini P., Deco G., & Calhoun V. (2017). Time-Resolved Resting-State Functional Magnetic Resonance Imaging Analysis: Current Status, Challenges, and New Directions. Brain Connectivity, 7(8), 465–481. 10.1089/brain.2017.0543 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Khoury R., & Ghossoub E. (2019). Diagnostic biomarkers of Alzheimer’s disease: A state-of-the-art review. Biomarkers in Neuropsychiatry, 1, 100005. 10.1016/j.bionps.2019.100005 [DOI] [Google Scholar]
  • 31.Krizhevsky A., Sutskever I., & Hinton G. E. (2017). ImageNet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84–90. 10.1145/3065386 [DOI] [Google Scholar]
  • 32.LaMontagne P. J., Benzinger T. L., Morris J. C., Keefe S., Hornbeck R., Xiong C., Grant E., Hassenstab J., Moulder K., Vlassenko A. G., Raichle M. E., Cruchaga C., & Marcus D. (2019). OASIS-3: Longitudinal Neuroimaging, Clinical, and Cognitive Dataset for Normal Aging and Alzheimer Disease (p. 2019.12.13.19014902). medRxiv. 10.1101/2019.12.13.19014902 [DOI] [Google Scholar]
  • 33.Lee J. C., Kim S. J., Hong S., & Kim Y. (2019). Diagnosis of Alzheimer’s disease utilizing amyloid and tau as fluid biomarkers. Experimental & Molecular Medicine, 51(5), 53. 10.1038/s12276-019-0250-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Lee P.-L., Chou K.-H., Chung C.-P., Lai T.-H., Zhou J. H., Wang P.-N., & Lin C.-P. (2020). Posterior Cingulate Cortex Network Predicts Alzheimer’s Disease Progression. Frontiers in Aging Neuroscience, 12. 10.3389/fnagi.2020.608667 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Lehéricy S., Hirsch E. C., Hersh L. B., & Agid Y. (1991). Cholinergic neuronal loss in the globus pallidus of Alzheimer disease patients. Neuroscience Letters, 123(2), 152–155. 10.1016/0304-3940(91)90918-J [DOI] [PubMed] [Google Scholar]
  • 36.Liss J. L., Seleri Assunção S., Cummings J., Atri A., Geldmacher D. S., Candela S. F., Devanand D. P., Fillit H. M., Susman J., Mintzer J., Bittner T., Brunton S. A., Kerwin D. R., Jackson W. C., Small G. W., Grossberg G. T., Clevenger C. K., Cotter V., Stefanacci R., … Sabbagh M. N. (2021). Practical recommendations for timely, accurate diagnosis of symptomatic Alzheimer’s disease (MCI and dementia) in primary care: A review and synthesis. Journal of Internal Medicine, 290(2), 310–334. 10.1111/joim.13244 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Lundberg S., & Lee S.-I. (2017). A Unified Approach to Interpreting Model Predictions (arXiv:1705.07874). arXiv. 10.48550/arXiv.1705.07874 [DOI] [Google Scholar]
  • 38.Lurie D. J., Kessler D., Bassett D. S., Betzel R. F., Breakspear M., Kucyi A., Liégeois R., Lindquist M. A., McIntosh A. R., Shine J. M., Thompson W. H., Bielczyk N. Z., Douw L., Kraft D., Miller R. L., Muthuraman M., Pasquini L., Razi A., Vidaurre D., … Calhoun V. D. (n.d.). On the nature of resting fMRI and time-varying functional connectivity. [DOI] [PMC free article] [PubMed]
  • 39.Mattila P., Togo T., & Dickson D. W. (2002). The subthalamic nucleus has neurofibrillary tangles in argyrophilic grain disease and advanced Alzheimer’s disease. Neuroscience Letters, 320(1), 81–85. 10.1016/S0304-3940(02)00006-X [DOI] [PubMed] [Google Scholar]
  • 40.Naz S., Ashraf A., & Zaib A. (2022). Transfer learning using freeze features for Alzheimer neurological disorder detection using ADNI dataset. Multimedia Systems, 28(1), 85–94. 10.1007/s00530-021-00797-3 [DOI] [Google Scholar]
  • 41.Rahim N., El-Sappagh S., Ali S., Muhammad K., Del Ser J., & Abuhmed T. (2023). Prediction of Alzheimer’s progression based on multimodal Deep-Learning-based fusion and visual Explainability of time-series data. Information Fusion, 92, 363–388. 10.1016/j.inffus.2022.11.028 [DOI] [Google Scholar]
  • 42.Reyes A., Thesen T., Wang X., Hahn D., Yoo D., Kuzniecky R., Devinsky O., & Blackmon K. (2016). Resting-state functional MRI distinguishes temporal lobe epilepsy subtypes. Epilepsia, 57(9), 1475–1484. 10.1111/epi.13456 [DOI] [PubMed] [Google Scholar]
  • 43.Sabbagh M. N., Boada M., Borson S., Chilukuri M., Doraiswamy P. M., Dubois B., Ingram J., Iwata A., Porsteinsson A. P., Possin K. L., Rabinovici G. D., Vellas B., Chao S., Vergallo A., & Hampel H. (2020). Rationale for Early Diagnosis of Mild Cognitive Impairment (MCI) supported by Emerging Digital Technologies. The Journal of Prevention of Alzheimer’s Disease, 7(3), 158–164. 10.14283/jpad.2020.19 [DOI] [PubMed] [Google Scholar]
  • 44.Salami A., Eriksson J., & Nyberg L. (2012). Opposing effects of aging on large-scale brain systems for memory encoding and cognitive control. The Journal of Neuroscience: The Official Journal of the Society for Neuroscience, 32(31), 10749–10757. 10.1523/JNEUROSCI.0278-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Segall J., Allen E., Jung R., Erhardt E., Arja S., Kiehl K., & Calhoun V. (2012). Correspondence between structure and function in the human brain at rest. Frontiers in Neuroinformatics, 6. 10.3389/fninf.2012.00010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Selden N., Geula C., Hersh L., & Mesulam M.-M. (1994). Human striatum: Chemoarchitecture of the caudate nucleus, putamen and ventral striatum in health and Alzheimer’s disease. Neuroscience, 60(3), 621–636. 10.1016/0306-4522(94)90491-X [DOI] [PubMed] [Google Scholar]
  • 47.Simonyan K., Vedaldi A., & Zisserman A. (2014). Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps (arXiv:1312.6034). arXiv. 10.48550/arXiv.1312.6034 [DOI] [Google Scholar]
  • 48.Singh S. P., Wang L., Gupta S., Goli H., Padmanabhan P., & Gulyás B. (2020). 3D Deep Learning on Medical Images: A Review. Sensors (Basel, Switzerland), 20(18), 5097. 10.3390/s20185097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Srivastava N., Hinton G., Krizhevsky A., Sutskever I., & Salakhutdinov R. (2014). Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Journal of Machine Learning Research, 15(56), 1929–1958. [Google Scholar]
  • 50.Sui J., Huster R., Yu Q., Segall J. M., & Calhoun V. D. (2014). Function–structure associations of the brain: Evidence from multimodal connectivity and covariance studies. NeuroImage, 102, 11–23. 10.1016/j.neuroimage.2013.09.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Sui J., Yu Q., He H., Pearlson G., & Calhoun V. (2012). A Selective Review of Multimodal Fusion Methods in Schizophrenia. Frontiers in Human Neuroscience, 6. 10.3389/fnhum.2012.00027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Tajammal T., Khurshid S. K., Jaleel A., Qayyum Wahla S., & Ziar R. A. (2023). Deep Learning-Based Ensembling Technique to Classify Alzheimer’s Disease Stages Using Functional MRI. Journal of Healthcare Engineering, 2023, e6961346. 10.1155/2023/6961346 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Tufail A. B., Ma Y.-K., & Zhang Q.-N. (2020). Binary Classification of Alzheimer’s Disease Using sMRI Imaging Modality and Deep Learning. Journal of Digital Imaging, 33(5), 1073–1090. 10.1007/s10278-019-00265-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Turner J., Damaraju E., Van Erp T., Mathalon D., Ford J., Voyvodic J., Mueller B., Belger A., Bustillo J., McEwen S., Potkin S., BIRN F. I., & Calhoun V. (2013). A multi-site resting state fMRI study on the amplitude of low frequency fluctuations in schizophrenia. Frontiers in Neuroscience, 7. 10.3389/fnins.2013.00137 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Wang J. -j., Chen X., Sah S. K., Zeng C., Li Y. -m., Li N., Liu M. -q., & Du S. -l. (2016). Amplitude of low-frequency fluctuation (ALFF) and fractional ALFF in migraine patients: A resting-state functional MRI study. Clinical Radiology, 71(6), 558–564. 10.1016/j.crad.2016.03.004 [DOI] [PubMed] [Google Scholar]
  • 56.Wang Z., Zhang M., Han Y., Song H., Guo R., & Li K. (2016). Differentially disrupted functional connectivity of the subregions of the amygdala in Alzheimer’s disease. Journal of X-Ray Science and Technology, 24(2), 329–342. 10.3233/XST-160556 [DOI] [PubMed] [Google Scholar]
  • 57.Wee C.-Y., Yap P.-T., Zhang D., Denny K., Browndyke J. N., Potter G. G., Welsh-Bohmer K. A., Wang L., & Shen D. (2012). Identification of MCI Individuals Using Structural and Functional Connectivity Networks. Neuroimage, 59(3), 2045–2056. 10.1016/j.neuroimage.2011.10.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Zatorre R. J., Fields R. D., & Johansen-Berg H. (2012). Plasticity in gray and white: Neuroimaging changes in brain structure during learning. Nature Neuroscience, 15(4), Article 4. 10.1038/nn.3045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Zou Q.-H., Zhu C.-Z., Yang Y., Zuo X.-N., Long X.-Y., Cao Q.-J., Wang Y.-F., & Zang Y.-F. (2008). An improved approach to detection of amplitude of low-frequency fluctuation (ALFF) for resting-state fMRI: Fractional ALFF. Journal of Neuroscience Methods, 172(1), 137–141. 10.1016/j.jneumeth.2008.04.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Zuo X.-N., Di Martino A., Kelly C., Shehzad Z. E., Gee D. G., Klein D. F., Castellanos F. X., Biswal B. B., & Milham M. P. (2010). The oscillating brain: Complex and reliable. NeuroImage, 49(2), 1432–1445. 10.1016/j.neuroimage.2009.09.037 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The datasets used, processed, and analyzed during our study current study are available in the repository, Alzheimer’s Disease Neuroimaging Initiative (ADNI). The sMRI scans (specifically T1 MRIs) and rs-fMRI scans were procured from the ADNI data portal (https://adni.loni.usc.edu/). Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). For up-to-date information, see www.adni-info.org.


Articles from Research Square are provided here courtesy of American Journal Experts

RESOURCES