Abstract
HIV‐Associated Neurocognitive Disorder (HAND) is the most common constellation of cognitive dysfunctions in chronic HIV infected patients age 60 or older in the U.S. Only few published methods assist in distinguishing HAND from other forms of age‐associated cognitive decline, such as Mild Cognitive Impairment (MCI). In this report, a data‐driven, nonparameteric model to identify morphometric patterns separating HAND from MCI due to non‐HIV conditions in this older age group was proposed. This model enhanced the potential for group separation by combining a smaller, longitudinal data set containing HAND samples with a larger, public data set including MCI cases. Using cross‐validation, a linear model on healthy controls to harmonize the volumetric scores extracted from MRIs for demographic and acquisition differences between the two independent, disease‐specific data sets was trained. Next, patterns distinguishing HAND from MCI via a group sparsity constrained logistic classifier were identified. Unlike existing approaches, our classifier directly solved the underlying minimization problem by decoupling the minimization of the logistic regression function from enforcing the group sparsity constraint. The extracted patterns consisted of eight regions that distinguished HAND from MCI on a significant level while being indifferent to differences in demographics and acquisition between the two sets. Individually selecting regions through conventional morphometric group analysis resulted in a larger number of regions that were less accurate. In conclusion, simultaneously analyzing all brain regions and time points for disease specific patterns contributed to distinguishing with high accuracy HAND‐related impairment from cognitive impairment found in the HIV uninfected, MCI cohort. Hum Brain Mapp 37:4523–4538, 2016. © 2016 Wiley Periodicals, Inc.
Keywords: sparsity, logistic regression, HIV‐associated neurocognitive disorder, mild cognitive impairment, MRI
INTRODUCTION
HIV‐Associated Neurocognitive Disorder (HAND) affects upward of 50% of HIV infected (HIV+) individuals [Heaton et al., 2011; McArthur et al., 2010] and is linked to decreased everyday function, reduced adherence to therapy, and increased mortality [Ettenhofer et al., 2009; Heaton et al., 2004, 2011;]. Distinguishing HAND from other forms of age‐associated cognitive compromise, such as those broadly termed Mild Cognitive Impairment (MCI) and often due to Alzheimer's disease [Brew et al., 2009; Cohen et al., 2015], is an emerging clinical dilemma as the neuropathogenic mechanisms associated with HIV and aging are mostly unknown. The success of combination antiretroviral therapy (cART) has led to a rapid rise in the number of chronic HIV+ patients over the age of 60 in the United States [Kirk and Goetz, 2009]. Aging HAND patients are at risk of also developing age‐related cognitive decline due to Alzheimer's disease. However, clinicians currently do not have assessment tools to perform a differential diagnosis on HAND patients for early treatment of age‐associated cognitive impairments.
To enhance separation of these overlapping conditions, an increasing number of neuroimaging studies have been investigating the independent and interactive effects of HIV and aging [Ances and Hammoud, 2014; Chang et al., 2013; Cysique et al., 2013; Cysique and Brew, 2014; Holt et al., 2012; Liu and Press, 2013; Pfefferbaum et al., 2014; Thomas et al., 2013; Towgood et al. 2012]. However, imaging studies published on aging with chronic HIV infection limited analysis to parametric models assuming that groups comprising subjects with similar demographics are homogeneous [Ances et al., 2012; Cardenas et al., 2009; Chang et al., 2011; Cysique et al., 2013; Jahanshad et al., 2012; Liu and Press, 2013; McMurtray et al., 2008; Nir et al., 2014; Sullivan and Pfefferbaum, 2003; Thomas et al., 2013]. The homogeneity of variance assumption was generally not met due to the notable variation of brain anatomy even in the normal population [Pfefferbaum et al., 2013]; thus most ongoing investigations are limited to small sizes of carefully selected samples. In addition, earlier studies were restricted to individuals under the age of 60 limiting the relevance of the findings to older cohorts, who are at increased risk for other neurodegenerative disorders [Valcour et al., 2004]. To identify morphometric patterns distinguishing HAND from MCI in this older age group, we now propose a data‐driven, nonparameteric model that enhances the potential for discovery by complementing a smaller, longitudinal MRI data set containing HAND samples from the UCSF HIV Over 60 Cohort Study (UHES) with a larger, publicly available data set from the Alzheimer's Disease Neuroimaging Initiative (ADNI), which includes MCI samples.
Identifying morphometric group differences based on independently collected MR data sets were subject to differences in MR acquisition, clinical scoring, and demography characteristics [Focke et al., 2011; Han et al., 2006; Jovicich et al., 2013; Simmons et al., 2011; Stonnington et al., 2008; Westman et al., 2011]. Commonly, a general additive model has been applied to the morphometric measurements to regress out the cohort specific factors [Bartsch et al., 2014; Deoni et al., 2008; van Erp et al., 2015; Pfefferbaum et al., 2016]. For each region, the residual scores were then applied to statistical tests to investigate morphometric group differences, such as in [Pfefferbaum et al., 2013]. To refine analysis on an individual subject level, the medical imaging community has frequently relied on sparse classifiers [Rosa et al., 2015; Xiang et al., 2014]. From the large set of image features, sparse classifiers pick a few scores informative with respect to distinguishing cohorts under investigation. This approach enabled extraction of disease specific patterns from a relatively small number of samples [Liu et al., 2012; Ye et al., 2012], that is, the number of samples was much smaller than the number of measurements extracted from the images.
The selection process of sparse classifiers counts the number of features deemed as influential via the l 0‐“norm” and then determines a solution so that the count is below a predefined threshold [Carroll et al., 2009; Lv et al., 2015; Rao et al., 2011; Yamashita et al., 2008]. A generalization of this concept are group sparsity models, which enforce sparsity on groups of image features [Ng et al., 2010; Ryali et al., 2010; Wu et al., 2010]. Grouping can be used to model application specific constraints, such as the similar effect of MCI and HAND on each cerebral hemisphere. To solve the underlying minimization problem, however, these methods relaxed the feature selection process from (group) cardinality constraints to weighting features by replacing, for example, the l 0‐“norm” with the l 1‐norm or in the case of group cardinality constraints with the l 2‐norm [Friedman et al., 2010; Li et al., 2012; Meier et al., 2008; Yuan et al., 2012]. The solution of those methods related to the original sparse problem only under specific assumptions, for example, compressed sensing [Candès et al., 2006], which generally do not hold for medical image applications. In addition, the number of measures selected by the classifier depended on the training data due to the soft selection scheme. Confining selection to a predefined number required choosing measures whose weight was above a certain threshold. This practice, however, has been controversial because classification depends on the measures below the threshold, and the relevance of those weights with respect to the effects of disease on morphometric features were unclear [Haufe et al., 2014; Sabuncu 2014]. Alternatively, the upper bound associated with the sparse constraint was tuned with respect to a training data set so the classifier returned the wanted number of measures [Ma and Huang, 2008; Vounou et al., 2012; Zhang and Shen, 2012; Zhang et al., 2012]. The tuning, however, was data dependent, that is, the upper bound generally needed to be adjusted for each training set to enforce the same number of scores to be selected. Comparing patterns of different subsets of even the same data set, that is, folds, is thus not trivial as each pattern is the solution to a minimization problem, whose sparsity constraint is unique to a fold. Our approach [Zhang et al., 2016] avoided these issues by working on the original, group sparsity constrained, logistic classification model defined by the l 0‐“norm.” By determining a local solution to this non‐convex and non‐continuous minimization problem, our method directly selected patterns (without thresholding or changing upper bounds) that were potentially meaningful image markers for distinguishing diseases. In [Zhang et al., 2016], we compared the accuracy of our algorithm to a model constrained by (non‐group) sparsity, and an approach that relaxed sparsity constraints via the l 1‐ or l 2‐norm and then determined the optimal global solutions for the relaxed model. All methods were applied to a data set containing a ground‐truth diagnosis, that is, the cine MRI images of adults with and without reconstructive surgery of tetralogy of Fallot (TOF) during infancy. In accordance with the cardiac literature, our method correctly identified the right ventricle to be most impacted by TOF and generally obtained statistically significant higher classification accuracy than the alternative approaches.
To identify image markers distinguishing MCI from HAND in patients of age 60 and older, this article describes our findings when applying our group sparsity constraint, logistic classifier to a dataset, which combined the UHES data set containing aging HAND cases (n = 15) and a matched healthy control group (n = 21) with a larger, publicly available, and demographically matched subset of the ADNI project consisting of MCI cases (n = 80) and healthy controls (n = 26) (see also Fig. 1). Each sample was characterized by the region‐of‐interest (ROI) specific volume scores of baseline and follow‐up. As the data associated with each disease were collected in separate studies, we regressed out demographic and acquisition differences between UHES and ADNI from the volume scores to ensure that the patterns identified by our algorithm were impartial to those differences. We then used the grouping capabilities of our method to combine the regressed scores across hemispheres and time points associated with the same ROI to account for the interdependencies of ROI‐specific measures, that is, the change between baseline and follow‐up scores of the same ROI and subject was small (average correlation across all regions r = 0.93) and neither HAND nor MCI expected to have an asymmetric effect on brain structure.
Figure 1.

Matching demographic of cohorts of the ADNI and UHES data set to the HAND samples. [Color figure can be viewed at http://wileyonlinelibrary.com]
To the best of our knowledge, group sparsity was the simplest mechanism with respect to logistic regression models to enforce the assumptions of bilateral and temporal consistency. Enforcing these assumptions in (non‐group) sparsity constraint models, such as [Ryali et al., 2010], required training the model on summary scores of our regional volume measures, for example, compute the average volume of a region across time and brain hemispheres. This would have reduced the resolution of the scores available to our method making our approach less reliable in general. Alternatively, the simpler, sparsity constraint model could have been trained on the original volume scores and ignored the assumptions. However, the resulting patterns would then be difficult to interpret clinically as our method most likely would have selected only one hemisphere of an impacted structure as the other hemisphere of that region would not add significantly more information to automatically distinguish both diseases. A similar argument applied to regions selected at baseline or follow‐up. The need for structured sparsity constraint beyond group sparsity, such as tree structured sparsity [Chen and Huang, 2014], would have been useful, for example, if our analysis included scores across several regions, such as the volume of the frontal lobe. However, this would have also increased the complexity of our model making it more difficult to solve, that is, the convergence of our method was unknown.
We measured the accuracy of our algorithm on this data set via five‐fold cross‐validation and then inferred from the resulting confusion matrix the significance of the sparse, regional patterns (or image biomarkers) for differentiating MCI from HAND. We first parameterized a linear regression model based on the healthy controls of the training set and then applied it to all samples to reduce the prevalence of factors associated with differences in data collection while aiming to preserve the differences between MCI and HAND. Based on the resulting residual score, we trained the group‐sparsity constrained logistic classifier on the diseased samples. To avoid biasing the results toward a pattern with a specific number of regions, we automatically determined the number of regions to be selected via parameter exploration. On the test set, we then measured the significance of the patterns extracted by the resulting ensemble of classifiers [Rokach, 2010] in distinguishing the diseased samples from both data sets. We checked the neutrality of those patterns to differences in data collection between UHES and ADNI by computing their accuracy in correctly assigning healthy controls to the two data sets.
Our approach was to identify anatomical patterns distinguishing HAND from MCI via the following novel study design:
We examined the differences between two diseases by fusing two independently collected data sets, each only containing cases of one disease. We ensured that the significant findings of our analysis represented differences between the two disease groups by demonstrating that the patterns were impartial, that is, not significant, to differences between control groups of the two data sets.
Using cross‐validation, we parameterized a linear regression model for data harmonization based on the healthy controls of the training set and then applied it to test sets for further analysis. Although this method has been common practice in the machine learning community, most neuroimaging studies configured those linear models on a subset of controls that were then also used for analysis [Bartsch et al., 2014; Deoni et al., 2008; van Erp et al., 2015]. Using the same data for training and testing models, however, increased the risk for reporting overly optimistic findings [Coombes et al., 2007; Taylor et al., 2008].
We proposed an algorithm that determined a group sparsity constrained local minimum to a logistic classification problem. Extending [Zhang and Pohl, 2015; Zhang et al., 2016], we applied the approach to neuroimaging data for the first time. The approach simultaneously searched all regions and time points for patterns distinguishing HAND from MCI. In doing so, we identified patterns that were more accurate than those defined by individually selecting regions via more common morphometric analysis.
MATERIALS AND METHODS
Samples
The Memory and Aging Center at University of California, San Francisco, provided imaging data of 15 individuals diagnosed with HIV‐Associated Neurocognitive Disorder (HAND) and 22 healthy subjects. Informed consent was obtained from all participants, and the experimental protocols were approved by the institutional ethics board. All subjects were between the ages of 60 and 70. Two MRI scans were acquired of each subject, with the scans at least 10 months apart, using 3T Siemens Magnetom Trio Tim scanners. Each subject underwent standardized comprehensive neurological and medical evaluations, including a complete neurological examination, medical history with HIV history, medications, family history, social history, laboratory tests, and a neuropsychological testing battery augmented with tests of psychomotor (grooved pegboard task) and motor speed (finger tapping) for greater sensitivity in HIV. The neuropsychological battery tapped multiple domains including memory, executive function, psychomotor speed, visuospatial and motor abilities, and attention. A proxy informant interview was completed by a trained technician using the Clinical Dementia Rating scale (CDR) [Morris, 1993].
All samples were reviewed in consensus conference meetings that included a behavioral neurologist, an HIV specialist and a neuropsychologist. This team was expert at distinguishing Alzheimers Disease from other disorders as it also reviewed cases for the UCSF Alzheimers Disease Research Center on a ragular basis. Alzheimers disease criteria and HAND criteria (guided by 2007 (Frascati) criteria [Antinori et al., 2007]) were applied based on history, cognitive testing, neurological examination, and functional testing. None of the UHES cases had clinical, neurological or cognitive testing consistent with MCI due to Alzheimer's disease.
All HAND subjects had a CDR score of ≤0.5. At baseline, their mean duration of HIV infection was 19.6 years, the mean CD4 count was 546, the mean CD4 nadir score was 162, 94% of the HAND subjects were taking ART, and for 80% of them the viral load was undetectable. The group of healthy controls was matched to the HAND group with respect to age (at baseline), gender, and “Years between Baseline and Follow‐Up,” which was the difference in years between the follow‐up and baseline visit across the samples of each cohort (see also the cohort specific mean and standard deviation listed in Table 1). We referred to these data as UHES.
Table 1.
Demographic information of the two data sets
| UHES | ADNI | ||||
|---|---|---|---|---|---|
| HAND | Control | MCI | Control | ||
| Siemens | GE | ||||
| Number of subjects | 15 | 21 | 80 | 18 | 8 |
| Gender (M/F) | 14 /1 | 19/2 | 70 /10 | 15 /3 | 7/1 |
| Age (mean ± SD) | 63.8 ± 3.3 | 65.1 ± 3.2 | 65.1 ± 3.2 | 65.2 ± 3.2 | 65.6 ± 3.2 |
| Years difference between Baseline and Follow‐up (mean±SD) | 1.9 ± 0.9 | 2.1 ± 0.9 | 2.1 ± 0.9 | 2.3 ± 1.2 | 1.9 ± 0.7 |
From the ADNI project, we downloaded data of 148 Cognitive Impairment cases, 116 healthy controls scanned on 3T Siemens machines, and 96 healthy cases scanned on 3T GE scanner. Selection criteria were age range of 60–70, at least two MRI scanning sessions, scanners had to be acquired on a 3T GE scanner or 3T Siemens scanner. With respect to the cognitive impaired cases, we further confined the sample set to those with CDR ≤ 0.5 and that were diagnosed with MCI (Early Mild Cognitive Impairment, Mild Cognitive Impairment, or Impairment).1 MCI patients reported a subjective memory concern either autonomously or via an informant or clinician. They had testing impairment in memory, however, they had no significant levels of impairment in other cognitive domains, essentially preserved activities of daily living and there were no signs of dementia. HIV infection was an exclusion criteria from the ADNI study.2 Note, while a small subset of samples of the ADNI project received amyloid testing for the diagnosis of Alzheimer's disease, all cases were diagnosed by experts comparable to those in the UHES projects. As shown in Figure 1, we then matched the three groups to the HAND cases of the UHES data set with respect to age, gender, and time between baseline and follow‐up scans. The resulting matched ADNI data set used for our analysis consisted of 80 MCI, 18 controls scanned on 3T Siemens, and 8 controls scanned on 3T GE machines.
Data Acquisition
All MRIs were acquired on 3T scanners. For the UHES data, a Siemens Megnetom Trio Tim scanner acquired a magnetization‐prepared 180° radio‐frequency pulses and rapid gradient‐echo (MPRAGE) sequence (TR = 2,300 ms, TI = 900 ms, TE = 2.98 ms, flip angle = 9, matrix = 256 × 256, FOV = 256 mm, slice dimensions = 1.0 × 1.0 × 1.0 mm, 160 slices). All ADNI data was acquired using the standardized ADNI acquisition protocol.3 3T Siemens scanners acquired a MPRAGE sequence (TR = 2,300 ms, TI = 900 ms, TE = 2.98 ms, flip angle = 9, matrix = 256 × 256, FOV = 256 mm, slice dimensions = 1.0 × 1.0 × 1.2 mm, 160 slices). GE scanners acquired an Inversion Recovery‐SPoiled Gradient Recalled (IR‐SPGR) echo sequence (TI = 400 ms, flip angle = 11, matrix = 256 × 256, FOV = 240 mm, slice dimensions = 1.0 × 1.0 × 1.2 mm, 200 slices).
Image Processing
Image processing involved bias field correction via the Computational Morphometry Toolkit (CMTK) [Rohlfing, 2009]. Skull stripping was the result of majority voting [Rohlfing et al., 2004] applied to the maps extracted by the Robust Brain Extraction (ROBEX) method [Iglesias et al., 2011], FSL BET [Smith, 2002], and AFNI [Cox, 1996] from the native and bias field corrected T 1‐weighted images. Based on the SRI24 longitudinal analysis platform [Sullivan et al., 2011a], the follow‐up, biased‐corrected T 1‐weighted image was non‐rigidly registered to the corresponding baseline scan using the non‐rigid deformation between the skull‐stripped images as initialization and coarse deformation field for encoding. Afterward, the volumes of supratentorial (svol), ventricular, cortical and cerebellar structures were extracted from the images, where the regions of interest (ROIs) were defined according to the non‐rigidly aligned SRI24 atlas [Rohlfing et al., 2010]. For each subject, this process resulted in 206 volume scores describing the corresponding brain morphometry across both time points.
Identifying Disease Specific Patterns
From those regional volume scores, we automatically extracted patterns separating HAND from MCI measuring their significance via five‐fold cross validation. Each fold of the cross‐validation was composed of 20% of the MCI cases, 20% of the ADNI controls scanned under GE, 20% of the ADNI controls scanned under Siemens, 20% of the UHES controls, and 20% of the HAND cases. With respect to each fold, we first harmonized the data to reduce the potential effect on pattern extraction due to differences between the ADNI and UHES data set associated with acquisition, demographics (e.g., gender or age) or healthy morphometric variation (i.e., svol at baseline). We then extracted the patterns from the harmonized scores via our group‐cardinality constrained, logistic regression solver [Zhang et al., 2016]. Finally, we computed the accuracy and significance of the method and corresponding patterns. We now briefly describe our approach with the mathematical details given in the Appendix. The logistic regression software used for this analysis can be downloaded via https://dx.doi.org/10.6084/m9.figshare.3398332 or its current version via https://github.com/sibis-platform/PDLG.
We harmonized the regional volume scores by fitting a linear regression model to the baseline and follow‐up measures of the controls of the training data and then computing the residual by applying the model to all samples. Measuring the quality of the residuals in harmonizing the volume scores was defined with respect to the null hypothesis that our group‐sparsity constrained logistic regression model was not better than chance in correctly assigning healthy controls to the two data sets. Testing of the null hypothesis will be reviewed at the end of this section.
Next, we applied our algorithm for logistic regression problems constrained by group‐cardinality [Zhang et al., 2016] to the residual volume scores. Logistic regression identified the residual volume scores, that is, ROIs, relevant for distinguishing the two diseases while the group‐cardinality encoded constraints specific to our proposed application. Specifically, we note that the change between baseline and follow‐up scores of the same ROI and subject was small, that is, highly correlated (r = 0.93). In addition, current medical literature is consistent in reporting absence of asymmetry in the disease effects on brain anatomy. Thus, we assumed that a residual score specific to a single time point and brain hemisphere was less informative for distinguishing the two diseases than the group of scores of that ROI across time points and hemispheres. Combining residual scores of ROI's left and right hemisphere at baseline and follow‐up resulted in 52 groups. From those 52 groups, our method selected subsets, which we also refer to as patterns, as most informative with respect to distinguishing MCI from HAND. Note, this sparsity model is motivated by the specific experimental setup forcing our method to select all residual scores associated with a region (across both hemispheres and time) or not. One might choose a different grouping of scores when studying other image features or diseases, such as for disorders asymmetrically impacting brain morphometry a non‐bilateral grouping would be more suitable.
To avoid biases introduced in our findings by manually fixing the size “g” of the pattern, that is, the only parameter to be set in our approach, we automatically determined “g” via parameter exploration confined to the residual scores of the MCI and HAND cases of the training set. We defined the parameter space of “g” by selecting values between the smallest pattern (consisting of more than one group) and 33% of the groups, that is, . For each parameter g, we defined a group‐cardinality constrained logistic regressor by computing the optimal patterns (i.e., weights of the regressor — see Equation (A9) of the Appendix for definition). We computed the normalized accuracy (nAcc) of each regressor (or pattern) with respect to correct assignment of the disease training data to the UHES or ADNI data set, that is, we separately computed the accuracy for each cohort and then averaged their values to account for the imbalance in cohort size. On our training data, parameter exploration did not return a single optimal cardinality setting as the training accuracy of multiple regressors was 100%. We instead combined all regressors into a single ensemble of classifiers [Rokach, 2010] using the nAcc score of each regressor as the weight of the regressor in the final decision of the ensemble. In other words, the ensemble determined the label of a subject by the weighted average across the set of regressors, all of which were unique with respect to the group‐cardinality constraint. Once trained, we computed the residual scores of all the test subjects (control and disease) and recorded the ensemble's assignment of the test sample to the ADNI or UHES data set.
After testing each fold, we computed the nAcc score of our approach in correctly assigning the diseased cohorts of the test data to the ADNI or UHES data set. We also computed the p‐value of the ensembles by applying the two‐sided Fisher Exact Test [Fisher, 1935] to the corresponding confusion matrix. This calculation tested the null hypothesis that a classifier is not better than chance in correctly assigning samples to the two data sets. We repeated those computations for the controls. If the P‐value associated with the disease cohorts were significant (P ≤ 0.001) and non‐significant for the controls, then we interpreted the results generated by our approach as being informative with respect to separating HAND cases from those with MCI and impartial to difference between ADNI and UHES data.
In this scenario, we viewed patterns highly informative for separating the two cohorts as those who were part of the patterns identified by a majority of applicable regressors. As patterns extracted by regressors with larger cardinality constraints could not be sub‐patterns of those extracted by regressors with smaller cardinality constraints, we defined the number of applicable regressors as those whose cardinality constraint was at last as high as the size of the pattern. Finally, we computed the frequency of appearance of each pattern, that is, the number of times a pattern was part of ones extracted by regressors divided by the number of applicable regressors, and focused further analysis on those with frequency above 50%.
Regions not selected by our approach could still be affected by either disorder. Exclusion of a region by our method indicated that the region did not contain adequate complementary information to enhance the identified pattern. Nonetheless, that hypothetical region could be part of another pattern distinguishing MCI from HAND, although that pattern might not be so robust as the one identified by our approach; alternatively, inclusion of that region might expand the size of the pattern beyond that permitted by the sparsity constraint model.
RESULTS
We tested the proposed approach by comparing the corresponding accuracy scores and P‐values to alternative approaches.
Classification Results
Table 2 summarizes the nAcc scores of applying our approach to the original volume scores (omitting harmonization) and the residual volume scores (with harmonization). The table specifies those scores with respect to distinguishing the disease cohorts and the controls from both data sets. Furthermore, the table shows the results of training and testing the regression model with three different data configurations: one just on the baseline score (T = 1), one just on the follow‐up data (T = 1), and one using the data from both time points (T = 2) (as proposed in the previous section). Scores in bold refer to the experiments in which P‐values were significantly different (P ≤ 0.001) from randomly assigning samples to each data set.
Table 2.
Accuracy scores in % for different implementations
| Implementation | Set | Baseline | Follow‐up | Baseline + Follow‐up |
|---|---|---|---|---|
|
Original volume scores (no harmonization) |
Disease | 73.8% | 70.8% | 77.7% |
| Control | 82.8% | 77.6% | 79.9% | |
|
Residual scores (with harmonization) |
Disease | 64.2% | 81.5% | 88.1% |
| Control | 61.8% | 60.9% | 61.4% |
Scores in bold refer to experiments in which P‐values were significantly different (P < 0.001) from randomly assigning samples to each data set.
When omitting data harmonization, the ensemble was more accurate in distinguishing the controls than the disease from both data sets. The opposite was true when applying our approach to the (harmonized) residual scores. Now, the scores in distinguishing the control groups from both data sets were not statistically significant, while separating cases of HAND from MCI was significant for those experiments including measurements from follow‐up visits. Distinguishing HAND from MCI relying solely on the follow‐up measures resulted in a nAcc score of 81.5% (vs. 64.2% when only using baseline measures). Combining baseline and follow‐up measurements resulted in the most accurate separation (88.1%) between the two cohorts while the accuracy in distinguishing the control groups (61.4%, P‐value = 0.1) was within the range defined by the two cross‐sectional experiments.
Pattern Analysis
We now further review the patterns obtained by the most accurate ensemble, which was based on residual scores of the baseline and follow‐up visits of all 102 regions. In our experiment, the ensembles of classifiers identified 78 unique patterns, whose frequencies were plotted in Figure 2. Of those, the frequency of eight patterns were above 50%. The 8 patterns comprised eight bilateral regions (see also Fig. 3): medial orbitofrontal cortex, cerebellum VIII, parahippocampal gyrus, precuneus, superior temporal pole, precentral gyrus, cerebellum VIIb, and inferior temporal gyrus.
Figure 2.

Frequency of patterns across all regressors of the ensemble. [Color figure can be viewed at http://wileyonlinelibrary.com]
Figure 3.

Three‐dimensional models of the selected regions. [Color figure can be viewed at http://wileyonlinelibrary.com]
If we reduced the threshold to patterns with frequency above 20%, that is, the frequency before the curve in Figure 2 flattens out, then nine more patterns were included in the analysis. However, these extra patterns only contained two more regions: fusiform cortex and postcentral gyrus. These two regions appeared only once across those patterns and across all patterns no more than 25 times, which was much fewer than the appearance of any of the other eight regions (46–71 times). We concluded that the first eight patterns with the associated eight regions were most relevant with respect to distinguishing HAND from MCI.
Table 3 lists the eight selected regions in the order of their number of appearances across all 78 patterns, the regions associated with each pattern, and the accuracy of each pattern (last two rows) and region (last two columns) in distinguishing the cohorts. For each region and pattern, we computed the accuracy and P‐value (two‐sided Fischer Exact Test) via five‐fold cross‐validation of the logistic regression (without any cardinality constraints) applied to the corresponding residual scores.
Table 3.
Patterns with their relative frequency being above 50%
| Regions | % of Appearn. | Patterns | nAcc(%) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 45 | 6 | 7 | 8 | Dis. | Ctr. | ||
| Medial orbitofrontal cortex | 88.8 | x | x | xx | x | 64.8 | 54.8 | |||
| Cerebellum 8 | 83.8 | x | x | x | x | x | x | 61.5 | 51.4 | |
| Parahippocampal gyrus | 75.0 | x | x | x | x | x | 52.1 | 50.0 | ||
| Precuneus | 75.0 | x | x | x | x | 65.4 | 50.5 | |||
| Superior temporal pole | 66.3 | x | x | 58.8 | 48.5 | |||||
| Precentral gyrus | 62.5 | x | 63.5 | 55.2 | ||||||
| Cerebellum 7b | 62.5 | x | x | x | 68.1 | 53.8 | ||||
| Inferior temporal gyrus | 57.5 | x | 65.4 | 57.1 | ||||||
| nAcc for Disease groups (%) | 70.2 | 70.8 | 90.8 | 76.3 87.5 | 88.1 | 68.3 | 70.2 | |||
| nAcc for Control groups(%) | 54.8 | 51.8 | 44.6 | 53.3 44.1 | 52.3 | 53.8 | 54.2 | |||
Scores in bold referred to the experiments whose P‐values were significant (P < 0.001).
The P‐value in distinguishing the controls of the two data sets was not significant (i.e., P > 0.001) for any pattern or region. By contrast, all patterns except Pattern 7 were significant in distinguishing the disease groups, whereas the disease specific classification accuracy for regions was never higher than 68.1%, the lowest accuracy across all patterns was 68.3% for Pattern 7 and the highest was 90.8% for Pattern 3. The three patterns with the highest accuracy (Pattern 3, 5, 6) were also the only ones that included the three regions most frequently identified: the medial orbitofrontal cortex, cerebellum VIII, and parahippocampal gyrus. Interestingly, our experiments relying on just each region individually reported non‐significant P‐values with respect to distinguishing the two disease cohorts. The three regions that resulted in significant P‐values were the cerebellum VIIb, precuneus, and inferior temporal gyrus. The remaining two of the eight regions identified by the top eight patterns, the superior temporal pole and the precentral gyrus, neither had significant P‐values nor emerged as part any of the top three patterns.
Morphometric Analysis
To compare our findings on the eight regions with more conventional morphometric volumetric analysis, we parameterized the linear regression model of Section Data Harmonization based on the baseline volume scores of all controls and computed the residuals for all volume scores (including disease and follow‐up visit). For each of the eight regions, we computed the slopes between follow‐up and baseline residual scores. Table 4 lists the corresponding p‐values when applying the Mann–Whitney U test [Mann and Whitney, 1947] to those slopes. Entries in bold were significantly different (P ≤ 0.001) between the corresponding cohorts. For those regions, Figure 4 showed the box plots of the slopes based on the residual volume scores for all cohorts.
Table 4.
P‐values of Mann–Whitney U tests based on volume change (slope)
| Regions |
HAND left |
vs. MCI right | HAND vs. left |
Control right |
MCI vs. left |
Control right |
|---|---|---|---|---|---|---|
| Medial orbitofrontal cortex | 0.084 | 0.003 | 0.038 | 0.005 | 0.770 | 0.813 |
| Cerebellum 8 | 0.082 | <0.001 | 0.285 | 0.005 | 0.246 | 0.108 |
| Parahippocampal gyrus | 0.034 | 0.988 | 0.028 | 0.178 | 0.524 | 0.031 |
| Precuneus | 0.003 | 0.043 | <0.001 | 0.010 | 0.417 | 0.609 |
| Superior temporal pole | 0.295 | 0.010 | 0.118 | 0.002 | 0.381 | 0.386 |
| Precentral gyrus | 0.001 | 0.002 | 0.005 | 0.001 | 0.537 | 0.755 |
| Cerebellum 7b | 0.015 | <0.001 | 0.005 | 0.001 | 0.254 | 0.066 |
| Inferior temporal gyrus | 0.003 | 0.010 | 0.003 | 0.020 | 0.863 | 0.616 |
Values in bold were significant (P < 0.001).
Figure 4.

The slopes of the residual scores for regions with significant P‐value. [Color figure can be viewed at http://wileyonlinelibrary.com]
The trajectory for none of the regions was significantly different between MCI and controls. However, the trajectories of three regions were significantly different for HAND versus MCI (right cerebellum VIIb, right cerebellum VIII, left precentral) and HAND versus controls (left precuneus, right precentral, right cerebellum VIIb). For these regions, subjects with HAND aged significantly faster than the corresponding other cohorts (see also Fig. 4). For the other regions, the residual scores regressed much faster for the HAND cases than the other groups. However, the change was not significant. The cerebellum VIIb was the only region that significantly distinguished HAND from MCI and HAND from controls. It was also the only region that overlapped with the significant findings of the region specific classification of the previous section.
DISCUSSION
Without data harmonization, the classifier encoded differences in data collection between the ADNI and the UHES data set as it was more accurate in distinguishing controls than disease groups. This was not the case for the classifier based on the (harmonized) residual scores, which identified patterns distinguishing HAND from MCI on a significant level while not being significantly affected by differences in data collection (for experiment based on baseline and follow‐up P = 0.1). Identifying such a bias in our patterns would require reapplying our method to a larger data set. Based on our experience from Zhang et al. [2016] and the results of Table 2, training our approach on a larger data set generally increases its accuracy in distinguishing MCI from HAND, that is, results in improved patterns. These improved patterns might also bring the accuracy associated with the control groups closer to 50%, which would then confirm again that our analysis is unbiased toward differences in data collection between the two sets.
When relying solely on follow‐up scores, the accuracy in distinguishing HAND from MCI was higher than just based on the ones for the baseline scans reflecting the understanding that the underlying patterns were more distinct as both degeneration progressed. Combining baseline and follow‐up measures resulted in the most accurate separation, which was significantly better (P < 0.001) than solely relying on the baseline scores according to the Delongs test [DeLong et al., 1988]. These findings underline the strength of trajectories extracted from the longitudinal analysis over cross‐sectional volume scores in identifying differences between HAND and MCI. Our experiments also showed that each of the eight highest ranked patterns was also more informative in differentiating cases of HAND from MCI than each of the eight regions associated with those patterns as the disease specific classification accuracy of the patterns was always higher than the accuracy scores of any single region. In other words, the significance of the identified image biomarkers, that is, patterns, for distinguishing MCI from HAND is much higher than differences based solely on the individual regions within those patterns.
Of the three most frequently identified regions (medial orbitofrontal cortex, cerebellum VIII, and parahippocampal), the medial orbitofrontal cortex, which is involved in the cognitive processing of decision‐making, had been found to be abnormally small in MCI and Alzheimer's patients 4 years before the occurrence of any cognitive symptoms [Tondelli et al., 2012]. Nothing had been reported on the role of MCI on the cerebellum VIII or more generally the cerebellum, which is known to subserve motor tasks and likely be involved in selective cognitive functions commonly associated with the frontal lobe functions [Schmahmann and Pandya, 2008]. However, in comparison to healthy controls, higher atrophy rates in the cerebellum have been published for HIV+ [Chang et al., 2011; Elsheikh et al., 2010; Klunder et al., 2008; Sullivan et al., 2011b]. Finally, heightened atrophy in the parahippocampal gyrus, which contributes to memory encoding and retrieval, has been reported in HIV+ [Maki et al., 2009; Wilson et al., 2015] and MCI cohort [Pantel et al., 2003].
Next, we reviewed the literature for the three regions associated with significant P‐values, that is, cerebellum VIIb, precuneus, and inferior temporal gyrus. Whereas volume deficits of the cerebellum VIIb had not been associated with the functional deficits of HAND or MCI, we noted above the higher atrophy rates in the cerebellum of HIV+ subjects. The precuneus, however, which is associated with episodic memory and consciousness, is one of the few structures known to atrophy faster in MCI subjects than healthy controls [Pihlajamaki et al., 2009; Tondelli et al., 2012]. The inferior temporal gyrus has also been reported to have higher atrophy rates in MCI and HIV exhibiting cognitive decline [Küper et al., 2011; McDonald et al., 2012]. This region is associated with high level visual processing, such as the shape recognition [Martin et al., 1996]. Finally, the regions that neither had significant P‐values nor emerged as part any of the top three patterns, that is, the superior temporal pole and the precentral gyrus, were also the only regions where neither MCI nor HIV had been associated significant volume deficits.
From our morphometric analysis on those eight regions, we concluded that the patterns found by our logistic regression approach were mostly influenced by HAND as significant differences in regional trajectories could only be found in group comparisons including this disease. With respect to at least one hemisphere of four regions, HAND had significantly faster tissue loss than the other cohorts (see also Fig. 4). These findings were not only consistent with the medical literature but also with the diminished function observed in HAND samples. The cerebellum VIIb was the only region where the significant findings of the morphometric analysis agreed with those of the logistic classifier. We thus concluded that common morphometric analysis might not directly translate to decision making on an individual subject level. To investigate this possibility further, we computed the P‐value of all 102 regions. We then trained a logistic regressor on those 14 regions, whose slope was significantly different between MCI and HAND at least for one hemisphere. The 14 regions were the middle frontal gyrus, rolandic operculum, anterior cingulate gyrus, calcarine sulcus, lingual gyrus, middle occipital, fusiform gyrus, inferior parietal lobule, angular gyrus, transverse temporal gyri, crus I of cerebellar hemisphere, crus II of cerebellar hemisphere, lobule VIIB of cerebellar hemisphere, and lobule VIII of cerebellar hemisphere. Five‐fold cross‐validation resulted in a nAcc score of 73.8% for distinguishing HAND from MCI, whereas for the control group it was 56.1%. In comparison, when we trained the classifier on the eight regions selected by the eight top patterns, then the accuracy increased to 94.8% for distinguishing the disease cohorts while the accuracy dropped to 47.5% for the control groups. These results underline the importance of selecting patterns of regions (as it was done by our group cardinality constrained logistic regressor) instead of individual regions when separating HAND from MCI.
CONCLUSION
We proposed a group sparsity constrained logistic classifier to identify morphometric patterns separating HAND from MCI in patients between the ages of 60 and 70. Confining classifiers to sparsity constrained solutions has been popular in medical image analysis as they reduce the risk of overfitting of the classifier on small sample sizes, which is a common characteristic for neuroimaging studies. These classifiers approximated the sparse solution by changing the underlying regularization from counting non‐zero weights via the l 0‐“norm” to analyzing the weights' absolute values by applying the l 1‐norm or, in case of group sparsity constrained problems, the l 2‐norm. These approximations have brought in question the resulting solutions with respect to the original problem. We therefore suggested herein an algorithm that directly solves the original, sparse problem.
We applied our proposed classifier to a data set containing 15 cases of HAND provided by the UHES study and 80 age matched MCI cases downloaded from the ADNI project. Using cross‐validation, we trained a linear model on healthy controls of both studies to harmonize the volumetric scores for demographic and acquisition differences across the two data sets while preserving disease specific pattern distinguishing HAND from MCI. We identified patterns of various sizes by training our proposed group sparsity constrained logistic classifier on samples of both diseases. During testing, we then showed that data harmonization was successful as the accuracy of those patterns was non‐significant in assigning healthy controls to the correct data set. However, the patterns were significant in correctly assigning disease cases to HAND or MCI with the longitudinal study design being more accurate than the cross‐sectional analysis.
The most frequently identified patterns were composed of a total of eight regions. Our findings were largely in line with the literature, which reported on some of these regions as having measured volume deficits in HIV subjects, being significant for characterizing the early onset of MCI, or predicting subjects with MCI converting to Alzheimer's disease. Furthermore, our morphometric group analysis revealed that HAND had significantly faster tissue loss than MCI or controls in at least one hemisphere in half of those regions, that is, the right cerebellum VIIb, right cerebellum VIII, and left precentral with respect to MCI and the left precuneus, right precentral, right cerebellum VIIb with respect to controls.
Using morphometric group analysis to individually select regions resulted in a pattern that was larger in size and less informative than the most frequently identified patterns by our proposed approach. We therefore conclude that simultaneously analyzing all brain regions and time points for disease specific patterns, as done by our group sparsity constrained logistic classification, enhances the ability to distinguish HAND from cognitive impairment found in the non‐HIV infected population, such as the MCI cohort.
ACKNOWLEDGMENTS
With respect to the ADNI data, collection and sharing for this project was funded by the Alzheimers Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: Abbott; Alzheimers Association; Alzheimers Drug Discovery Foundation; Amorfix Life Sciences Ltd.; AstraZeneca; Bayer HealthCare; BioClinica, Inc.; Biogen Idec Inc.; Bristol‐Myers Squibb Company; Eisai Inc.; Elan Pharmaceuticals Inc.; Eli Lilly and Company; F. Hoffmann‐La Roche Ltd and its affiliated company Genentech, Inc.; GE Healthcare; Innogenetics, N.V.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Medpace, Inc.; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Servier; Synarc Inc.; and Takeda Pharmaceutical Company. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (http://www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimers Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of California, Los Angeles.
MODEL AND NUMERICAL SOLUTION TO PATTERN EXTRACTION
We extracted patterns separating HAND from MCI by first applying logistic regression constrained by group‐cardinality [Zhang et al., 2016] to harmonized volume scores. We harmonized the regional volume scores via linear regression. For each ROI “i,” the linear regression model was defined by the following factors for subject “s” at time point “t” the volume score , the baseline svol score , and encoding the acquiring scanner (see Table 5 for details), and the age at scanning . More specifically, the following linear regression model was fitted to the control group:
| (A1) |
Table 5.
Encoding for scanners
| Scanner | uADNI‐G | uADNI‐S |
|---|---|---|
| ADNI Siemens | 0 | 1 |
| ADNI GE | 1 | 0 |
| UHES Siemens | 0 | 0 |
Based on this fitting, the residuals were computed for all subjects (including those with MCI or HAND)
| (A2) |
Based on the residual volume scores , our group‐cardinality constrained logistic regression model aims to automatically identify the regional patterns distinguishing the two diagnosis. To define the model, we assumed (without loss of generality) training of the model was based on N = 76 samples of the MCI and HAND cases. encoded the assignment of the N samples to the two data sets, where was +1 if subject “s” was a part of UHES, that is, HAND, and −1 otherwise. The input of the model were the residual volume scores
of R = 102 ROIs extracted over T = 2 timepoints. Combining r s with b s, we defined the label weighted feature vectors . The model then encoded the influence of each element in as in correctly classifying the subject “s” via the weight vector “w.”
To determine the optimal with respect to separating the two cohorts while being consistent across time and hemisphere, we coupled with each ROI a group , which consisted of the indexes associated with the ROI's left and right hemisphere at baseline and follow‐up. This resulted in M = 52 groupings . We then used those groupings to define , the subvector of w indexed by , and , the l 2‐norm of that grouping. Based on those definitions, the group cardinality of w was then simply the number of nonzero components of
| (A3) |
which was formally determined via the l 0‐“norm” . For our model to identify meaningful patterns, we further assumed that no more than groups with differentiate cases of HAND from MCI, that is,
| (A4) |
Having defined the group cardinality constraint, we assumed that the optimal was the result of minimizing the average logistic loss function, which, for label weight scalar v and w, was defined as
| (A5) |
with
| (A6) |
being the logistic function. In other words, our model was fully described by the following minimization problem
| (A7) |
Alternatively to choose ROIs across time and hemispheres, one could have selected individual residual volume scores of w by replacing with . The result would have been a sparse logistic regression model, which, however ignored our early assumptions of the dependency of regional scores across time and hemispheres.
We defined a possible pattern for distinguishing the two diseases as the set of ROIs whose corresponding values in were non‐zero. In other words, we were only interested in ROIs selected by our model rather than the absolute values of , which, according to [Haufe et al., 2014; Sabuncu, 2014], should not be used to study the impact of diseases on brain anatomy. To find a numerical solution for , we computed a local minimum to Eq. (A7) according to [Zhang et al., 2016] by decoupling the minimization of from the sparsity constraint
Specifically, we introduced the auxiliary variable as an unconstrained approximation to the sparse solution by rewriting Eq. (A7) as
| (A8) |
We iteratively solved Eq. (A8) via penalty decomposition (PD), that is, we approximated via the local minimum of the following non‐convex problem defined with respect to the penalty parameter :
| (A9) |
Initializing at 0.1 (according to [Lu and Zhang, 2013]), PD iterated between determining a local minimum of the above equation via Block Coordinate Descent (BCD) and increased by a factor of (set according to [Lu and Zhang, 2013]) until convergence, that is,
| (A10) |
which was the termination criterion for PD suggested in [Lu and Zhang, 2013].
BCD alternated between minimizing Eq. (A9) with fixed w and by fixing v and y. When w was set to , finding the minimum with respect to v and y turned into solving the smooth and convex problem
| (A11) |
via gradient descent. In turn, minimizing the objective function just with respect to w, that is,
| (A12) |
could now be solved in closed form. To do so, we sorted in descending order and recorded the order of the indexes as , i.e.,
| (A13) |
From Eq. (A12), had to be equal to for the first “g” groups and otherwise zero, i.e., for and otherwise .4 In theory, this solution was not unique as it could happen that so that an alternative solution was defined by for and otherwise . However, we had not experienced this issue in practice.
Proving the convergence of PD to a local minimum of Eq. (A7) was non‐trivial (see also [Zhang et al., 2016]) as Eq. (A9) required a search within the group‐sparsity constrained space , which was non‐convex and non‐continuous. However, one could generalize the findings of [Lu and Zhang, 2013] to group sparsity to gain a better understanding of the convergence properties of the approach. Specifically, at each iteration of PD, the BCD approach generated a sequence of . Assuming that the sequence converged to an accumulation point , then was also a local minimum of Eq. (A11). Furthermore, PD produced a sequence of . If the sequence converged to an accumulation point and then was a local minimum of Eq. (A7). In our experiments, these conditions where always fulfilled so that convergence of our implementation was not an issue and proving convergence for the more general case of was not needed. Also note, our algorithm can only determine a local solution. We showed in [Zhang et al. (2016)] that this local solution is superior to the global solution of an existing method that relaxes the group sparsity constraint.
CONFLICTS OF INTEREST
In the last year, Dr. Valcour has served as a consultant for ViiV Healthcare and Merck for advice related to HIV and aging.
Footnotes
See http://adni.loni.usc.edu/methods/documents/mri-protocols/ for further detail.
The proof was provided by the supplemental information of [Zhang et al., 2016].
REFERENCES
- Ances BM, Hammoud DA (2014): Neuroimaging of HIV‐associated neurocognitive disorders (HAND). Curr Opin HIV AIDS 9:545–551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ances BM, Ortega M, Vaida F, Heaps J, Paul R (2012): Independent effects of HIV, aging, and HAART on brain volumetric measures. J Acquir Immune Defic Syndr 59:469–477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Antinori A, Arendt G, Becker J, Brew B, Byrd D, Cherner M, Clifford D, Cinque P, Epstein L, Goodkin K, Gisslen M, Grant I, Heaton R, Joseph J, Marder K, Marra C, McArthur J, Nunn M, Price R, Pulliam L, Robertson K, Sacktor N, Valcour V, Wojna V (2007): Updated research nosology for hiv‐associated neurocognitive disorders. Neurology 69:1789–1799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartsch H, Thompson WK, Jernigan TL, Dale AM (2014): A web‐portal for interactive data exploration, visualization, and hypothesis testing. Front Neuroinform 8:25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brew BJ, Crowe S, Landay A, Cysique LA, Guillemin G (2009): Neurodegeneration and ageing in the HAART era. J Neuroimmune Pharmacol 4:163–174. [DOI] [PubMed] [Google Scholar]
- Candès EJ, Romberg J, Tao T (2006): Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans Inform Theory 52:489–509. [Google Scholar]
- Cardenas VA, Meyerhoff DJ, Studholme C, Kornak J, Rothlind J, Lampiris H, Neuhaus J, Grant RM, Chao LL, Truran D, Weiner MW (2009): Evidence for ongoing brain injury in human immunodeficiency virus‐positive patients treated with antiretroviral therapy. J Neurovirol 15:324–333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carroll MK, Cecchi GA, Rish I, Garg R, Rao AR (2009): Prediction and interpretation of distributed neural activity with sparse models. NeuroImage 44:112–122. [DOI] [PubMed] [Google Scholar]
- Chang L, Andres M, Sadino J, Jiang C, Nakama H, Miller E, Ernst T (2011): Impact of apolipoprotein E ε4 and HIV on cognition and brain atrophy: Antagonistic pleiotropy and premature brain aging. NeuroImage 58:1017–1027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chang L, Holt JL, Yakupov R, Jiang CS, Ernst T (2013): Lower cognitive reserve in the aging human immunodeficiency virus‐infected brain. Neurobiol Aging 34:1240–1253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen C, Huang J (2014): The benefit of tree sparsity in accelerated MRI. Med Image Anal 18:834–842. [DOI] [PubMed] [Google Scholar]
- Cohen RA, Seider TR, Navia B (2015): HIV effects on age‐associated neurocognitive dysfunction: Premature cognitive aging or neurodegenerative disease? Alzheimer's Res Ther 7:1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coombes KR, Wang J, Baggerly KA (2007): Microarrays: Retracing steps. Nat Med 13:1276–1277. [DOI] [PubMed] [Google Scholar]
- Cox RW (1996): AFNI: Software for analysis and visualization of functional magnetic resonance neuroimages. Comput Biomed Res 29:162–173. [DOI] [PubMed] [Google Scholar]
- Cysique LA, Brew BJ (2014): The effects of HIV and aging on brain functions: Proposing a research framework and update on last 3 years' findings. Curr Opin HIV AIDS 9:355–364. [DOI] [PubMed] [Google Scholar]
- Cysique LA, Moffat K, Moore DM, Lane TA, Davies N, Carr A, Brew BJ, Rae C (2013): HIV, vascular and aging injuries in the brain of clinically stable HIV‐infected adults: A 1H MRS study. PloS One 8:e61738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- DeLong E, DeLong D, Clarke‐Pearson D (1988): Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44:837–845. [PubMed] [Google Scholar]
- Deoni SC, Williams SC, Jezzard P, Suckling J, Murphy DG, Jones DK (2008): Standardized structural magnetic resonance imaging in multicentre studies using quantitative T1 and T2 imaging at 1.5T. NeuroImage 40:662–671. [DOI] [PubMed] [Google Scholar]
- Elsheikh BH, Maher WE, Kissel JT (2010): Cerebellar atrophy associated with human immunodeficiency virus infection. Arch Neurol 67:634–635. [DOI] [PubMed] [Google Scholar]
- Ettenhofer ML, Hinkin CH, Castellon SA, Durvasula R, Ullman J, Lam M, Myers H, Wright MJ, Foley J (2009): Aging, neurocognition, and medication adherence in HIV infection. Am J Geriatr Psychiatry 17:281–290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fisher RA (1935): The logic of inductive inference. J R Soc Ser a 98:38–54. [Google Scholar]
- Focke NK, Helms G, Kaspar S, Diederich C, Tóth V, Dechent P, Mohr A, Paulus W (2011): Multi‐site voxel‐based morphometry—not quite there yet. NeuroImage 56:1164–1170. [DOI] [PubMed] [Google Scholar]
- Friedman J, Hastie T, Tibshirani R (2010): A note on the group Lasso and a sparse group Lasso. arXiv preprint arXiv:1001.0736.
- Han X, Jovicich J, Salat D, van der Kouwe A, Quinn B, Czanner S, Busa E, Pacheco J, Albert M, Killiany R, Maguire P, Rosas D, Makris N, Dale A, Dickerson B, Fischl B (2006): Reliability of MRI‐derived measurements of human cerebral cortical thickness: The effects of field strength, scanner upgrade and manufacturer. NeuroImage 32:180–194. [DOI] [PubMed] [Google Scholar]
- Haufe S, Meinecke F, Görgen K, Dähne S, Haynes JD, Blankertz B, Bießmann F (2014): On the interpretation of weight vectors of linear models in multivariate neuroimaging. NeuroImage 87:96–110. [DOI] [PubMed] [Google Scholar]
- Heaton RK, Marcotte TD, Mindt MR, Sadek J, Moore DJ, Bentley H, McCutchan JA, Reicks C, Grant I (2004): The impact of HIV‐associated neuropsychological impairment on everyday functioning. J Int Neuropsychol Soc 10:317–331. [DOI] [PubMed] [Google Scholar]
- Heaton RK, Franklin DR, Ellis RJ, McCutchan JA, Letendre SL, LeBlanc S, Corkran SH, Duarte NA, Clifford DB, Woods SP, Collier AC, Marra CM, Morgello S, Mindt MR, Taylor MJ, Marcotte TD, Atkinson JH, Wolfson T, Gelman BB, McArthur JC, Simpson DM, Abramson I, Gamst A, Fennema‐Notestine C, Jernigan TL, Wong J, Grant I (2011): HIV‐associated neurocognitive disorders before and during the era of combination antiretroviral therapy: Differences in rates, nature, and predictors. J Neurovirol 17:3–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holt JL, Kraft‐Terry SD, Chang L (2012): Neuroimaging studies of the aging HIV‐1‐infected brain. J Neurovirol 18:291–302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iglesias JE, Liu CY, Thompson PM, Tu Z (2011): Robust brain extraction across datasets and comparison with publicly available methods. IEEE Trans Med Imag 30:1617–1634. [DOI] [PubMed] [Google Scholar]
- Jahanshad N, Valcour VG, Nir TM, Kohannim O, Busovaca E, Nicolas K, Thompson PM (2012): Disrupted brain networks in the aging HIV+ population. Brain Connect 2:335–344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jovicich J, Marizzoni M, Sala‐Llonch R, Bosch B, Bartrés‐Faz D, Arnold J, Benninghoff J, Wiltfang J, Roccatagliata L, Nobili F, T Hensch, A Tränkner, P Schönknecht, M Leroy, R Lopes, R Bordet, V Chanoine, J‐P Ranjeva, M Didic, H Gros‐Dagnac, P Payoux, G Zoccatelli, F Alessandrini, A Beltramello, N Bargalló, O Blin, GB Frisoni, The PharmaCog Consortium. (2013): Brain morphometry reproducibility in multi‐center 3T MRI studies: A comparison of cross‐sectional and longitudinal segmentations. NeuroImage 83:472–484. [DOI] [PubMed] [Google Scholar]
- Kirk JB, Goetz MB (2009): Human immunodeficiency virus in an aging population, a complication of success. J Am Geriatr Soc 57:2129–2138. [DOI] [PubMed] [Google Scholar]
- Klunder AD, Chiang MC, Dutton RA, Lee SE, Toga AW, Lopez OL, Aizenstein HJ, Becker JT, Thompson PM (2008): Mapping cerebellar degeneration in HIV/AIDS. Neuroreport 19:1655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Küper M, Rabe K, Esser S, Gizewski E, Husstedt I, Maschke M, Obermann M (2011): Structural gray and white matter changes in patients with HIV. J Neurol 258:1066–1075. [DOI] [PubMed] [Google Scholar]
- Li S, Yin H, Fang L (2012): Group‐sparse representation with dictionary learning for medical image denoising and fusion. IEEE Trans Biomed Eng 59:3450–3459. [DOI] [PubMed] [Google Scholar]
- Liu A, Press DZ (2013): Does HIV age your brain? Neurology 80:1178–1179. [DOI] [PubMed] [Google Scholar]
- Liu M, Zhang D, Shen D (2012): Ensemble sparse classification of Alzheimer's disease. NeuroImage 60:1106–1116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu Z., Zhang Y. (2013): Sparse approximation via penalty decomposition methods. SIAM Journal on Optimization 23:2448–2478. [Google Scholar]
- Lv J, Jiang X, Li X, Zhu D, Chen H, Zhang T, Zhang S, Hu X, Han J, Huang H, Zhang J, Guo L, Liu T (2015): Sparse representation of whole‐brain fMRI signals for identification of functional networks. Med Image Anal 20:112–134. [DOI] [PubMed] [Google Scholar]
- Ma S, Huang J (2008): Penalized feature selection and classification in bioinformatics. Brief Bioinform 9:392–403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maki P, Cohen M, Weber K, Little D, Fornelli D, Rubin L, Perschler P, Gould F, Martin E (2009): Impairments in memory and hippocampal function in HIV‐positive vs HIV‐negative women a preliminary study. Neurology 72:1661–1668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mann HB, Whitney DR (1947): On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat 18:50–60. [Google Scholar]
- Martin A, Wiggs CL, Ungerleider LG, Haxby JV (1996): Neural correlates of category‐specific knowledge. Nature 379:649–652. [DOI] [PubMed] [Google Scholar]
- McArthur JC, Steiner J, Sacktor N, Nath A (2010): Human immunodeficiency virus‐associated neurocognitive disorders: Mind the gap. Ann Neurol 67:699–714. [DOI] [PubMed] [Google Scholar]
- McDonald CR, Gharapetian L, McEvoy LK, Fennema‐Notestine C, Hagler DJ, Holland D, Dale AM (2012): Relationship between regional atrophy rates and cognitive decline in mild cognitive impairment. Neurobiol Aging 33:242–253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McMurtray A, Nakamoto B, Shikuma C, Valcour V (2008): Cortical atrophy and white matter hyperintensities in HIV: The Hawaii aging with HIV cohort study. J Stroke Cerebrovasc Dis 17:212–217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meier L, Van De Geer S, Bühlmann P (2008): The group Lasso for logistic regression. J R Soc Ser B 70:53–71. [Google Scholar]
- Morris JC (1993): The clinical dementia rating (cdr): current version and scoring rules. Neurology 43:2412–2414. [DOI] [PubMed] [Google Scholar]
- Ng B, Vahdat A, Hamarneh G, Abugharbieh R (2010): Generalized sparse classifiers for decoding cognitive states in fMRI In: Machine Learning in Medical Imaging. Vol. 6357 of Lecture Notes in Computer Science. pp. 108–115. [Google Scholar]
- Nir TM, Jahanshad N, Busovaca E, Wendelken L, Nicolas K, Thompson PM, Valcour VG (2014): Mapping white matter integrity in elderly people with HIV. Hum Brain Mapp 35:975–992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pantel J, Kratz B, Essig M, Schröder J (2003): Parahippocampal volume deficits in subjects with aging‐associated cognitive decline. Am J Psychiatry 160:379–382. [DOI] [PubMed] [Google Scholar]
- Pfefferbaum A, Rohlfing T, Rosenbloom MJ, Chu W, Colrain IM, Sullivan EV (2013): Variation in longitudinal trajectories of regional brain volumes of healthy men and women (ages 10 to 85years) measured with atlas‐based parcellation of MRI. NeuroImage 65:176–193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pfefferbaum A, Rogosa DA, Rosenbloom MJ, Chu W, Sassoon SA, Kemper CA, Deresinski S, Rohlfing T, Zahr NM, Sullivan EV (2014): Accelerated aging of selective brain structures in human immunodeficiency virus infection: A controlled, longitudinal magnetic resonance imaging study. Neurobiol Aging 35:1755–1768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pfefferbaum A, Rohlfing T, Pohl KM, Lane B, Chu W, Kwon D, Nichols BN, Brown SA, Tapert SF, Cummins K, Thompson WK, Brumback T, Meloy MJ, Jernigan TL, Dale A, Colrain IM, Baker FC, Prouty D, De Bellis MD, Voyvodic JT, Clark DB, Luna B, Chung T, Nagel BJ, Sullivan EV (2016): Adolescent development of cortical and white matter structure in the NCANDA sample: Role of sex, ethnicity, puberty, and alcohol drinking. Cerebr Cortex 26:4101–4121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pihlajamaki M, Jauhiainen AM, Soininen H (2009): Structural and functional MRI in mild cognitive impairment. Curr Alzheimer Res 6:179–185. [DOI] [PubMed] [Google Scholar]
- Rao A, Lee, Y , Gass A, Monsch A (2011): Classification of Alzheimer's disease from structural MRI using sparse logistic regression with optional spatial regularization In: Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE. IEEE, pp. 4499–4502. [DOI] [PubMed] [Google Scholar]
- Rohlfing T (2009): User guide to the computational morphometry toolkit. Insight J http://hdl.handle.net/10380/3140. [Google Scholar]
- Rohlfing T, Russakoff DB, Maurer Jr CR (2004): Performance‐based classifier combination in atlas‐based image segmentation using Expectation‐Maximization parameter estimation. IEEE Trans Med Imag 23:983–994. [DOI] [PubMed] [Google Scholar]
- Rohlfing T, Zahr NM, Sullivan EV, Pfefferbaum A (2010): The SRI24 multichannel atlas of normal adult human brain structure. Hum Brain Mapp 31:798–819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rokach L (2010): Ensemble‐based classifiers. Artif Intell Rev 33:1–39. [Google Scholar]
- Rosa MJ, Portugal L, Hahn T, Fallgatter AJ, Garrido MI, Shawe‐Taylor J, Mourao‐Miranda J (2015): Sparse network‐based models for patient classification using fMRI. NeuroImage 105:493–506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ryali S, Supekar K, Abrams DA, Menon V (2010): Sparse logistic regression for whole‐brain classification of fMRI data. NeuroImage 51:752–764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sabuncu M (2014): A universal and efficient method to compute maps from image‐based prediction models In: Medical Image Computing and Computer‐Assisted Intervention – MICCAI 2014. Vol. 8675 of Lecture Notes in Computer Science. pp. 353–360. [DOI] [PubMed] [Google Scholar]
- Schmahmann JD, Pandya DN (2008): Disconnection syndromes of basal ganglia, thalamus, and cerebrocerebellar systems. Cortex 44:1037–1066. (special Issue on 'Brain Hodology ‐ Revisiting disconnection approaches to disorders of cognitive function). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simmons A, Westman E, Muehlboeck S, Mecocci P, Vellas B, Tsolaki M, Kłoszewska I, Wahlund LO, Soininen H, Lovestone S, Evans A, Spenger C (2011): The AddNeuroMed framework for multi‐centre MRI assessment of Alzheimer's disease: Experience from the first 24 months. Int J Geriatr Psychiatry 26:75–82. [DOI] [PubMed] [Google Scholar]
- Smith SM (2002): Fast robust automated brain extraction. Hum Brain Mapp 17:143–155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stonnington CM, Tan G, Klöppel S, Chu C, Draganski B, Jack CR, Chen K, Ashburner J, Frackowiak RS (2008): Interpreting scan data acquired from multiple scanners: A study with Alzheimer's disease. NeuroImage 39:1180–1185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sullivan EV, Pfefferbaum A (2003): Diffusion tensor imaging in normal aging and neuropsychiatric disorders. Eur J Radiol 45:244–255. [DOI] [PubMed] [Google Scholar]
- Sullivan EV, Pfefferbaum A, Rohlfing T, Baker FC, Padilla ML, Colrain IM (2011a): Developmental change in regional brain structure over 7 months in early adolescence: Comparison of approaches for longitudinal atlas‐based parcellation. NeuroImage 57:214–224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sullivan EV, Rosenbloom MJ, Rohlfing T, Kemper CA, Deresinski S, Pfefferbaum A (2011b): Pontocerebellar contribution to postural instability and psychomotor slowing in HIV infection without dementia. Brain Imaging Behav 5:12–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taylor JM, Ankerst DP, Andridge RR (2008): Validation of biomarker‐based risk prediction models. Clin Cancer Res 14:5977–5983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomas JB, Brier MR, Snyder AZ, Vaida FF, Ances BM (2013): Pathways to neurodegeneration effects of HIV and aging on resting‐state functional connectivity. Neurology 80:1186–1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tondelli M, Wilcock GK, Nichelli P, Jager CAD, Jenkinson M, Zamboni G (2012): Structural MRI changes detectable up to ten years before clinical Alzheimer's disease. Neurobiol Aging 33:825–836. [DOI] [PubMed] [Google Scholar]
- Towgood KJ, Pitkanen M, Kulasegaram R, Fradera A, Kumar A, Soni S, Sibtain NA, Reed L, Bradbeer C, Barker GJ, Kopelman MD (2012): Mapping the brain in younger and older asymptomatic HIV‐1 men: Frontal volume changes in the absence of other cortical or diffusion tensor abnormalities. Cortex 48:230–241. [DOI] [PubMed] [Google Scholar]
- Valcour V, Shikuma C, Shiramizu B, Watters M, Poff P, Selnes O, Holck P, Grove J, Sacktor N (2004): Higher frequency of dementia in older HIV‐1 individuals the Hawaii aging with HIV‐1 cohort. Neurology 63:822–827. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Erp TGM, Hibar DP, Rasmussen JM, Glahn DC, Pearlson GD, Andreassen OA, Agartz I, Westlye LT, Haukvik UK, Dale AM, Melle I, Hartberg CB, Gruber O, Kraemer B, Zilles D, Donohoe G, Kelly S, McDonald C, Morris DW, Cannon DM, Corvin A, Machielsen MWJ, Koenders L, de Haan L, Veltman DJ, Satterthwaite TD, Wolf DH, Gur RC, Gur RE, Potkin SG, Mathalon DH, Mueller BA, Preda A, Macciardi F, Ehrlich S, Walton E, Hass J, Calhoun VD, Bockholt HJ, Sponheim SR, Shoemaker JM, van Haren NEM, Pol HEH, Ophoff RA, Kahn RS, Roiz‐Santianez R, Crespo‐Facorro B, Wang L, Alpert KI, Jonsson EG, Dimitrova R, Bois C, Whalley HC, McIntosh AM, Lawrie SM, Hashimoto R, Thompson PM, Turner JA (2015): Subcortical brain volume abnormalities in 2028 individuals with schizophrenia and 2540 healthy controls via the enigma consortium. Mole Psychiatry 21:547–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vounou M, Janousova E, Wolz R, Stein JL, Thompson PM, Rueckert D, Montana G (2012): Sparse reduced‐rank regression detects genetic associations with voxel‐wise longitudinal phenotypes in Alzheimer's disease. NeuroImage 60:700–716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Westman E, Simmons A, Muehlboeck JS, Mecocci P, Vellas B, Tsolaki M, Kłoszewska I, Soininen H, Weiner MW, Lovestone S, Spenger C, Wahlund LO (2011): AddNeuroMed and ADNI: Similar patterns of Alzheimer's atrophy and automated MRI classification accuracy in europe and north america. NeuroImage 58:818–828. [DOI] [PubMed] [Google Scholar]
- Wilson TW, Heinrichs‐Graham E, Becker KM, Aloi J, Robertson KR, Sandkovsky U, White ML, O'Neill J, Knott NL, Fox HS, Swindells S (2015): Multimodal neuroimaging evidence of alterations in cortical structure and function in HIV‐infected older adults. Hum Brain Mapp 36:897–910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu F, Yuan Y, Zhuang Y (2010): Heterogeneous feature selection by group Lasso with logistic regression Proceedings of the International Conference on Multimedia, pp. 983–986. [Google Scholar]
- Xiang S, Yuan L, Fan W, Wang Y, Thompson PM, Ye J (2014): Bi‐level multi‐source learning for heterogeneous block‐wise missing data. NeuroImage 102:192–206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yamashita O, Sato Ma, Yoshioka T, Tong F, Kamitani Y (2008): Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns. NeuroImage 42:1414–1429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ye J, Farnum M, Yang E, Verbeeck R, Lobanov V, Raghavan N, Novak G, DiBernardo A, Narayan VA (2012): Sparse learning and stability selection for predicting mci to ad conversion using baseline adni data. BMC Neurol 12:1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yuan L, Wang Y, Thompson PM, Narayan VA, Ye J (2012): Multi‐source feature learning for joint analysis of incomplete multiple heterogeneous neuroimaging data. Neuroimage 61:622–632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Y, Pohl KM (2015): Solving logistic regression with group cardinality constraints for time series analysis Medical Image Computing and Computer‐Assisted Intervention – MICCAI 2015. Volume 9351 of the series Lecture Notes in Computer Science, pp. 459–466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang D, Shen D (2012): Multi‐modal multi‐task learning for joint prediction of multiple regression and classification variables in Alzheimer's disease. NeuroImage 59:895–907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang S, Zhan Y, Metaxas DN (2012): Deformable segmentation via sparse representation and dictionary learning. Med Image Anal 16:1385–1396. [DOI] [PubMed] [Google Scholar]
- Zhang Y, Kwon D, Pohl KM (2016): Computing group cardinality constraint solutions for logistic regression problems. Med Image Anal, 35:58–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
