Abstract
Resting-state functional MRI (rs-fMRI), coupled with advanced multivariate time-series analysis methods such as Granger causality, is a promising tool for the development of novel functional connectivity biomarkers of neurologic and psychiatric disease. Recently large-scale Granger causality (lsGC) has been proposed as an alternative to conventional Granger causality (cGC) that extends the scope of robust Granger causal analyses to high-dimensional systems such as the human brain. In this study, lsGC and cGC were comparatively evaluated on their ability to capture neurologic damage associated with HIV-associated neurocognitive disorders (HAND). Functional brain network models were constructed from rs-fMRI data collected from a cohort of HIV+ and HIV− subjects. Graph theoretic properties of the resulting networks were then used to train a support vector machine (SVM) model to predict clinically relevant parameters, such as HIV status and neuropsychometric (NP) scores. For the HIV+/− classification task, lsGC, which yielded a peak area under the receiver operating characteristic curve (AUC) of 0.83, significantly outperformed cGC, which yielded a peak AUC of 0.61, at all parameter settings tested. For the NP score regression task, lsGC, with a minimum mean squared error (MSE) of 0.75, significantly outperformed cGC, with a minimum MSE of 0.84 (p < 0.001, one-tailed paired t-test). These results show that, at optimal parameter settings, lsGC is better able to capture functional brain connectivity correlates of HAND than cGC. However, given the substantial variation in the performance of the two methods at different parameter settings, particularly for the regression task, improved parameter selection criteria are necessary and constitute an area for future research.
Keywords: Resting-state fMRI, functional connectivity, large-scale Granger causality, support vector machine, graph theory, HIV-Associated Neurocognitive Disorders (HAND)
1. INTRODUCTION
HIV-associated neurocognitive disorders (HAND) are responsible for an array of cognitive, motor, and behavioral symptoms in over 25% of individuals with HIV,1 including those with optimal viral control using highly active antiretroviral therapy (HAART).2 Currently, the standard for HAND diagnosis and disease monitoring is an extensive battery of neuropsychometric (NP) tests, which are susceptible to confounding factors such as socioeconomic status and learning effects.3,4 In addition, NP tests are neither sensitive nor specific for HAND and cannot detect the presence of HAND before the onset of neurologic and psychiatric signs of damage. Resting-state functional MRI (rs-fmRI) has been shown to detect patterns of altered brain connectivity in a variety of disease states, including autism spectrum disorder,5 schizophrenia,6 and Alzheimer disease.7,8 In many such studies, conventional Granger causality (cGC) is used as a method to quantify directed information transfer between different regions of the brain on the basis of rs-fMRI time-series. Though cGC has offered promising initial results, it suffers from the mathematical limitations of its underlying autoregressive model, which limit its application for the analysis of complex high-dimensional systems such as the brain. Large-scale Granger causality (lsGC), which extends the scope of cGC with the addition of an embedded dimension reduction step, was specifically designed to circumvent this so-called curse of dimensionality and has recently been shown to perform better than cGC in a machine learning classification task of HIV+ and HIV− subjects in which functional brain networks were used as input features.9 In this study, a quantitative retrospective analysis was performed on a rs-fMRI dataset obtained from a larger cohort of HIV+ and HIV− subjects with the objective of distinguishing HIV+ and HIV− subjects and predicting overall scores of NP tests. This work continues our group’s application of big data analysis in biomedical imaging by means of advanced pattern recognition and machine learning methods.9–13
2. DATA
2.1 Functional MRI Acquisition
The study participants were comprised of 20 healthy HIV− subjects with no history of severe psychiatric or neurologic disease (9 females; mean age 42 years; range 21-60 years) and 14 HIV+ subjects (5 females; mean age 46 years; range 23-71 years) with symptoms of HAND. In the HIV+ cohort, 4 subjects underwent neuroimaging and neuropsychometric evaluation twice. All subjects provided written consent, in accordance with the study protocol approved by the Institutional Review Board of the University of Rochester. The scan was acquired using a 3.0 Tesla, Siemens Magnetom TrioTim scanner, using the following fMRI scan parameters: Echo time (TE) = 23 ms, repetition time (TR) = 1650 ms, flip angle (FA) = 84°, 96 × 96 acquisition matrix. A total of 250 temporal scan volumes with 25 slices at a slice distance of 5 mm were acquired from each subject. During the fMRI acquisition period, the subjects were instructed to lie still in the scanner with eyes closed. High-resolution structural T1-weighted magnetization-prepared rapid gradient echo (MPRAGE) scans were also collected and used for registration.
2.2 Neuropsychometric testing
Neuropscyhometric (NP) testing was performed on all subjects using a standard battery for six cognitive domains: executive function, specialized function, motor ability, learning, memory, and attention. Scores from individual domains were combined into an overall score, which was normalized as a z-score.
2.3 Preprocessing
Preprocessing steps were performed using C-PAC14 and its underlying dependencies in the FMRIB Software Library (FSL, version 5.0, http://fsl.fmrib.ox.ac.uk/).15 The first 10 brain volumes acquired for each subject were discarded to eliminate the effects of initial saturation of the blood oxygen level dependent (BOLD) fMRI signal. On the 240 remaining volumes, head motion correction, slice timing correction, brain extraction, and registration to the standard Montreal Neurological Institute (MNI152) atlas space16,17 was performed. Removal of the whole brain time-series was done using a nuisance regressor. In addition, high-pass filtering was performed to remove signal drifts and the voxel time-series were normalized to unit standard deviation and zero mean.
3. METHODS
3.1 Large-scale Granger causality
Large-scale Granger causality9 is a multivariate and directional method that is built on the principle of Granger causality (GC),18 in which time-series A is said to Granger-cause time-series B if the inclusion of information from time-series A in a predictive autoregressive model of time-series B decreases the variance of the residuals between the predicted and actual time-series B. Unlike conventional GC (cGC), however, lsGC can be extended to high-dimensional systems due to the embedding of a dimension reduction step using principal components analysis.
3.2 Parameter selection for Granger causality analysis
To perform lsGC, two model parameters must be chosen: number of retained principal components c and autoregressive order ρ. Parameter selection was informed by the Akaike Information Criterion (AIC)19 and Bayesian Information Criterion (BIC)20 as well as by a previous lsGC rs-fMRI study.10 The following set of lsGC parameters were chosen: ρ = {1, 2, 3, 4, 5}, c = {1, 2}. For cGC, ρ = {1, 2} was evaluated, as orders greater than 2 could not be performed due to the mathematical limitations of cGC (c is not relevant for cGC, as there is no dimension reduction step).
3.3 Network property computation
90 regions of interest (ROIs) were defined according to the Automated Anatomical Labeling (AAL)21 template and the mean time-series from each ROI was extracted. For each subject, a 90 × 90 weighted affinity matrix was generated using lsGC and cGC with each parameter setting. Negative entries as well as entries on the main diagonal of the matrix were set to 0. The matrix was then binarized by setting the top 30% of entries to 1 and the bottom 70% of the entries to 0. This matrix is thus an equivalent to a network graph, in which ROIs are represented as vertices and the edges between them are denoted by the nonzero entries of the matrix.
The following local properties were calculated for each node of each subject’s brain network. In-degree: The sum of all incoming edges for a particular vertex. Out-degree: The sum of all outgoing edges for a particular vertex. Clustering coefficient: The fraction of connected triples centered on a node that are also triangles (a connected triple is a set of three vertices for which one vertex has direct incoming or outgoing edges to the other two; a triangle is a set of three vertices that are all directly connected by either outgoing or incoming edges). Betweenness centrality: The fraction of all shortest paths in a network that pass through a particular vertex22 (the shortest path between vertex i and vertex j is the minimum number of edges that must be traversed to travel from i to j; if such a path does not exist for a given pair of vertices, as might happen when a network becomes disconnected, then the maximum finite path length for the entire network is assigned as the shortest path length for that pair of vertices).
The following global properties were also calculated for each subject’s network. Degree variance: the variance of the incoming or outgoing edges of all vertices in a network. Modularity: The extent to which a network be partitioned into Louvain community detection algorithm-defined modules,23 which have high intramodular connectivity and low intermodular connectivity.22 Small-worldness: The ratio of the random network-normalized global clustering coefficient and the random network-normalized shortest path length.24 Degree distribution-matched random networks were produced for each empirical brain network by iteratively rewiring each edge approximately 10 times. Motif frequency: the frequency of occurrence of 13 three-vertex motifs.25 Generation of the random networks and calculation of the network parameters were performed using the Brain Connectivity Toolbox (brain-connectivity-toolbox.net)26 in MATLAB (2015b, The MathWorks, Natick, MA).
3.4 Machine learning with support vector machines
Each brain network property was normalized across all subjects to have a mean of 0 and a standard deviation of 1. All network properties and demographic factors (age and gender) for each subject were then concatenated into a single feature vector and used as the input features for a support vector machine27 (SVM) model. To train and test the model, a 10-fold iterative cross-validation scheme was employed, in which the data were split into training (90%) and testing sets (10%). An SVM model was generated using the training set and its performance was assessed on the testing set. The outputs were either the classification of each subject as HIV+ or HIV− or the prediction of the normalized NP score for each subject. For the HIV+/− classification task, performance on the testing set was assessed using accuracy of classification and the area under the receiver operating characteristic curve (AUC), in which an AUC of 1 indicates a perfect classifier while an AUC of 0.5 indicates a classifier that performs no better than random chance. For the NP score regression, performance was assessed using the mean squared error (MSE). This scheme was iterated 100 times and the mean and standard error of the resulting AUC, accuracy, or MSE distributions were computed.
4. RESULTS
Functional brain network properties were used as input features to classify HIV− and HIV+ subjects with SVM, and an iterative 10-fold cross-validation scheme was employed to train and test the model. The resulting classification accuracies and AUCs were highly dependent on the specific variant of Granger causality (GC) employed. As can be seen in Fig. 2, the AUCs of lsGC at c = 1 and c = 2 are significantly above those of even the peak AUC of cGC. Additionally, the classification performance of lsGC at c = 2 is generally greater than that at c = 1.
4.1 Neuropsychometric score regression
In addition to the HIV+/− classification task depicted in Fig. 2, functional brain network properties were also used with SVM to predict the overall NP score for each subject. A cross-validation scheme similar to that employed in the classification task was used, with one major difference: HIV status of subjects was considered an additional feature for the regression task. The results of this analysis are shown in Fig. 3. For all orders considered, lsGC at c = 1 performs as well as or better than the other methods. Peak performance of cGC occurs at ρ = 2, where it performs better than lsGC at c = 2. LsGC at c = 2 appears extremely sensitive to the choice of ρ, as it initially decreases sharply, then increases gradually with ρ. It can be seen that at ρ = 1, lsGC generally outperforms cGC. As order is increased to 3, the performance of lsGC at c = 1 and c = 2 improves beyond the peak performance of cGC. At ρ > 3, however, lsGC at c = 2 starts to give progressively higher values of the MSE.
5. NEW AND BREAKTHROUGH WORK
Here, large-scale Granger causality was used to generate functional brain networks that served as the basis for predicting clinically meaningful properties of human subjects, including HIV status and NP scores using a machine learning system trained with lsGC-derived network properties; these measures may capture additional levels of complexity not available with the raw connectivity matrices, which serve as the basis for traditional multi-voxel pattern analysis (MVPA) approaches. The high AUC and accuracy values obtained with this approach suggest that it is a viable method of extracting meaningful information from high-dimensional brain networks. Additionally, the application of functional brain network measures into a machine learning system to predict a real-valued variable in a regression task, i.e. NP score prediction, is novel. This contribution is significant, as it represents an extension of the scope of machine learning in rs-fMRI; such a regression system has the ability to detect nuanced patterns of neurologic damage in a continuum, which may more accurately reflect the underlying biological reality than binary disease vs. healthy classification.
6. CONCLUSION
A comparative analysis of cGC and lsGC for predicting HIV status and NP scores of human subjects on the basis of resting-state brain connectivity network properties is presented here. An iterative cross-validation procedure showed that lsGC (peak accuracy 0.76; peak AUC 0.83) significantly outperformed cGC (peak accuracy 0.59; peak AUC 0.61) in the HIV status classification task at nearly all parameter settings tested. In the NP score regression task, the performance of lsGC and cGC was highly dependent on parameter settings, though the peak performance of lsGC (MSE 0.75) was significantly higher than that of cGC (MSE 0.84). These results show that lsGC has the potential to augment resting-state functional connectivity analyses conventionally performed using cGC, though optimal parameter settings for lsGC, particularly for the prediction of NP scores, remains a topic for future research.
Acknowledgments
This research was funded by the National Institutes of Health (NIH) Award R01-DA-034977 and NIH training grant T32-GM-007356. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Health. This work was conducted as a Practice Quality Improvement (PQI) project related to American Board of Radiology (ABR) Maintenance of Certificate (MOC) for Prof. Dr. Axel Wismüller.
Footnotes
This work is not being and has not been submitted for publication or presentation elsewhere.
References
- 1.Sacktor N, Skolasky RL, Seaberg E, Munro C, Becker JT, Martin E, Ragin A, Levine A, Miller E. Prevalence of HIV-associated neurocognitive disorders in the Multicenter AIDS Cohort Study. Neurology. 2016 Jan;86:334–340. doi: 10.1212/WNL.0000000000002277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Robertson KR, Smurzynski M, Parsons TD, Wu K, Bosch RJ, Wu J, McArthur JC, Collier AC, Evans SR, Ellis RJ. The prevalence and incidence of neurocognitive impairment in the HAART era. AIDS (London, England) 2007 Sep;21:1915–1921. doi: 10.1097/QAD.0b013e32828e4e27. [DOI] [PubMed] [Google Scholar]
- 3.Valcour VG. Evaluating cognitive impairment in the clinical setting: practical screening and assessment tools. Topics in antiviral medicine. 2011;19(5):175–180. [PMC free article] [PubMed] [Google Scholar]
- 4.Overton ET, Kauwe JSK, Paul R, Tashima K, Tate DF, Patel P, Carpenter CCJ, Patty D, Brooks JT, Clifford DB. Performances on the CogState and standard neuropsychological batteries among HIV patients without dementia. AIDS and Behavior. 2011;15(8):1902. doi: 10.1007/s10461-011-0033-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Weng SJ, Wiggins JL, Peltier SJ, Carrasco M, Risi S, Lord C, Monk CS. Alterations of resting state functional connectivity in the default network in adolescents with autism spectrum disorders. Brain Research. 2010 Feb;1313:202–214. doi: 10.1016/j.brainres.2009.11.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lynall ME, Bassett DS, Kerwin R, McKenna PJ, Kitzbichler M, Muller U, Bullmore E. Functional Connectivity and Brain Networks in Schizophrenia. The Journal of Neuroscience. 2010 Jul;30:9477–9487. doi: 10.1523/JNEUROSCI.0333-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Sanz-Arigita EJ, Schoonheim MM, Damoiseaux JS, Rombouts SARB, Maris E, Barkhof F, Scheltens P, Stam CJ. Loss of Small-World Networks in Alzheimer’s Disease: Graph Analysis of fMRI Resting-State Functional Connectivity. PLOS ONE. 2010 Nov;5:e13788. doi: 10.1371/journal.pone.0013788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Khazaee A, Ebrahimzadeh A, Babajani-Feremi A. Classification of patients with MCI and AD from healthy controls using directed graph measures of resting-state fMRI. Behavioural Brain Research. 2016 doi: 10.1016/j.bbr.2016.06.043. [DOI] [PubMed] [Google Scholar]
- 9.Dsouza AM, Abidin AZ, Leistritz L, Wismüller A. Exploring Connectivity with Large-Scale Granger Causality on Resting-State Functional MRI. Journal of Neuroscience Methods. 2017:1–32. doi: 10.1016/j.jneumeth.2017.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.D’Souza AM, Abidin AZ, Leistritz L, Wismüller A. Large-scale Granger causality analysis on resting-state functional MRI. 2016;9788:97880L. doi: 10.1117/12.2217264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.D’Souza AM, Abidin AZ, Nagarajan MB, Wismüller A. Mutual connectivity analysis (MCA) using generalized radial basis function neural networks for nonlinear functional connectivity network recovery in resting-state functional MRI. 2016;9788 doi: 10.1117/12.2216900. 97880K-9788-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.DSouza AM, Abidin AZ, Wismüller A. Investigating changes in resting-state connectivity from functional MRI data in patients with HIV associated neurocognitive disorder using MCA and machine learning. 2017;10137 doi: 10.1117/12.2254189. 101371C-10137-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Abidin AZ, DSouza AM, Nagarajan MB, Wang L, Qiu X, Schifitto G, Wismüller A. Alteration of brain network topology in HIV-associated neurocognitive disorder: A novel functional connectivity perspective. NeuroImage: Clinical. 2018;17:768–777. doi: 10.1016/j.nicl.2017.11.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Craddock C, Sikka S, Cheung B, Khanuja R, Ghosh SS, Yan C, Li Q, Lurie D, Vogelstein J, Burns R. Towards automated analysis of connectomes: The configurable pipeline for the analysis of connectomes (c-pac) Front Neuroinform. 2013;42 [Google Scholar]
- 15.Smith SM, Jenkinson M, Woolrich MW, Beckmann CF, Behrens TEJ, Johansen-Berg H, Bannister PR, De Luca M, Drobnjak I, Flitney DE, Niazy RK, Saunders J, Vickers J, Zhang Y, De Stefano N, Brady JM, Matthews PM. Advances in functional and structural MR image analysis and implementation as FSL. NeuroImage. 2004;23(Suppl 1):S208–19. doi: 10.1016/j.neuroimage.2004.07.051. [DOI] [PubMed] [Google Scholar]
- 16.Mazziotta J, Toga A, Evans A, Fox P, Lancaster J, Zilles K, Woods R, Paus T, Simpson G, Pike B, Holmes C, Collins L, Thompson P, MacDonald D, Iacoboni M, Schormann T, Amunts K, Palomero-Gallagher N, Geyer S, Parsons L, Narr K, Kabani N, Goualher GL, Boomsma D, Cannon T, Kawashima R, Mazoyer B. A probabilistic atlas and reference system for the human brain: International Consortium for Brain Mapping (ICBM) Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences. 2001 Aug;356:1293–1322. doi: 10.1098/rstb.2001.0915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mazziotta J, Toga A, Evans A, Fox P, Lancaster J, Zilles K, Woods R, Paus T, Simpson G, Pike B, Holmes C, Collins L, Thompson P, MacDonald D, Iacoboni M, Schormann T, Amunts K, Palomero-Gallagher N, Geyer S, Parsons L, Narr K, Kabani N, Le Goualher G, Feidler J, Smith K, Boomsma D, Pol HH, Cannon T, Kawashima R, Mazoyer B. A Four-Dimensional Probabilistic Atlas of the Human Brain. Journal of the American Medical Informatics Association. 2001 Sep;8:401–430. doi: 10.1136/jamia.2001.0080401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Granger CWJ. Investigating Causal Relations by Econometric Models and Cross-spectral Methods. Econometrica. 1969;37(3):424–438. [Google Scholar]
- 19.Akaike H. A new look at the statistical model identification. IEEE Transactions on Automatic Control. 1974 Dec;19:716–723. [Google Scholar]
- 20.Schwarz G. Estimating the Dimension of a Model. 1978:461–464. [Google Scholar]
- 21.Tzourio-Mazoyer N, Landeau B, Papathanassiou D, Crivello F, Etard O, Delcroix N, Mazoyer B, Joliot M. Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage. 2002;15(1):273–289. doi: 10.1006/nimg.2001.0978. [DOI] [PubMed] [Google Scholar]
- 22.Newman MEJ. The structure and function of complex networks. Society for industrial and applied mathematics review. 2003 [Google Scholar]
- 23.Blondel VD, Jean-Loup G, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment. 2008;2008(10):P10008. [Google Scholar]
- 24.Watts DJ, Strogatz SH. Collective dynamics of’small-world’ networks. Nature. 1998;393(6684):440–442. doi: 10.1038/30918. [DOI] [PubMed] [Google Scholar]
- 25.Sporns O, Kötter R. Motifs in Brain Networks. PLOS Biology. 2004 Oct;2:e369. doi: 10.1371/journal.pbio.0020369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Rubinov M, Sporns O. Complex network measures of brain connectivity: Uses and interpretations. NeuroImage. 2010 Sep;52:1059–1069. doi: 10.1016/j.neuroimage.2009.10.003. [DOI] [PubMed] [Google Scholar]
- 27.Cortes C, Vapnik V. Support-vector networks. Machine learning. 1995;20(3):273–297. [Google Scholar]