Skip to main content
NeuroImage : Clinical logoLink to NeuroImage : Clinical
. 2024 Aug 10;43:103650. doi: 10.1016/j.nicl.2024.103650

Prognostic enrichment for early-stage Huntington’s disease: An explainable machine learning approach for clinical trial

Mohsen Ghofrani-Jahromi a, Govinda R Poudel b, Adeel Razi a, Pubu M Abeyasinghe a, Jane S Paulsen c, Sarah J Tabrizi d, Susmita Saha a, Nellie Georgiou-Karistianis a,
PMCID: PMC11367643  PMID: 39142216

Graphical abstract

graphic file with name ga1.jpg

Keywords: Huntington’s Disease, Biomarkers, Neuroimaging, Stratification, Clinical Trials, Machine Learning

Highlights

  • A novel machine learning pipeline for predicting ventricular enlargement in Huntington’s Disease.

  • Genetic, cognitive, motor, and imaging-derived biomarkers are integrated to achieve more accurate and personalized prognosis.

  • Increased homogeneity of recruited participants for clinical trials in early stages of Huntington’s Disease.

  • Explainable AI applied for clinical reliability of the proposed stratification model.

Abstract

Background

In Huntington’s disease clinical trials, recruitment and stratification approaches primarily rely on genetic load, cognitive and motor assessment scores. They focus less on in vivo brain imaging markers, which reflect neuropathology well before clinical diagnosis. Machine learning methods offer a degree of sophistication which could significantly improve prognosis and stratification by leveraging multimodal biomarkers from large datasets. Such models specifically tailored to HD gene expansion carriers could further enhance the efficacy of the stratification process.

Objectives

To improve stratification of Huntington’s disease individuals for clinical trials.

Methods

We used data from 451 gene positive individuals with Huntington’s disease (both premanifest and diagnosed) from previously published cohorts (PREDICT, TRACK, TrackON, and IMAGE). We applied whole-brain parcellation to longitudinal brain scans and measured the rate of lateral ventricular enlargement, over 3 years, which was used as the target variable for our prognostic random forest regression models. The models were trained on various combinations of features at baseline, including genetic load, cognitive and motor assessment score biomarkers, as well as brain imaging-derived features. Furthermore, a simplified stratification model was developed to classify individuals into two homogenous groups (low risk and high risk) based on their anticipated rate of ventricular enlargement.

Results

The predictive accuracy of the prognostic models substantially improved by integrating brain imaging features alongside genetic load, cognitive and motor biomarkers: a 24 % reduction in the cross-validated mean absolute error, yielding an error of 530 mm3/year. The stratification model had a cross-validated accuracy of 81 % in differentiating between moderate and fast progressors (precision = 83 %, recall = 80 %).

Conclusions

This study validated the effectiveness of machine learning in differentiating between low- and high-risk individuals based on the rate of ventricular enlargement. The models were exclusively trained using features from HD individuals, which offers a more disease-specific, simplified, and accurate approach for prognostic enrichment compared to relying on features extracted from healthy control groups, as done in previous studies. The proposed method has the potential to enhance clinical utility by: i) enabling more targeted recruitment of individuals for clinical trials, ii) improving post-hoc evaluation of individuals, and iii) ultimately leading to better outcomes for individuals through personalized treatment selection.

1. Introduction

Huntington's disease (HD) is an inherited disorder characterized by a prolonged asymptomatic phase of insidious neuropathological brain tissue damage, followed by episodes of behavioural complications, and ultimately progressive decline in cognitive and motor functions (Ghosh et al., 2017). It is caused by an expansion of the cytosine-adenine-guanine (CAG) trinucleotide repeat in a mutated huntingtin gene (mHTT), which leads to aggregation of toxic proteins in the brain (Haider et al., 2017). HD treatments, to date, are palliative and an effective disease-modifying treatment is yet to be found. The pre-symptomatic (pre-HD) stage of the disease is an important period during which a preventive intervention against brain tissue deterioration could be mostly effective (Kim et al., 2021, Jamwal et al., 2020).

A significant challenge in HD drug trials stems from the heterogeneity among HD gene expansion carriers (HDGECs), highlighting a crucial need for improved stratification to categorize trial participants into more homogenous groups (Paulsen et al., 2019). A solution, advocated by the FDA, is prognostic enrichment (Fda, 2019, Maheux et al., 2023). which aims to strategically identify and include individuals with higher risk of experiencing severe disease progression. This results in more consistent responses to experimental treatments thereby facilitating the ability to detect treatment effects (Oxtoby et al., 2022).

Currently, stratification of individuals in HD clinical trials mostly relies on genetic load, calculated by CAG and age product known as the CAP score (Warner et al., 2022), as well as cognitive, motor, and functional assessment scores that are quantitatively encapsulated in the composite Unified Huntington’s Disease Rating Scale (cUHDRS) (Estevez-Fraga et al., 2021). Another weighted combination of these biomarkers is proposed in the normalized prognostic index (PIN) (Long et al., 2017), specifically tailored to enrich a recruited population by providing a predictive assessment of future motor diagnosis risk. Although the number of CAG repeats is a significant determinant of HD onset and progression rate, the specific effect differs broadly among individuals (Reilmann et al., 2019). In recent years, advanced assays for accurately measuring biofluids in the blood and cerebrospinal fluid have promised objective biomarkers (McColgan et al., 2022), and genetic modifiers of CAG repeats are continually investigated (Lee et al., 2022, Tabrizi et al., 2022). Nevertheless, in vivo brain imaging remains a reliable source of information that can reflect the severity of the latent neuropathologic state of individuals with HD (Kinnunen et al., 2021). PREDICT-HD (Paulsen et al., 2014, Ciarochi et al., 2016), TRACK-HD (Tabrizi et al., 2013), and IMAGE-HD (Poudel et al., 2015, Poudel et al., 2019) are longitudinal observational imaging studies that aimed to identify sensitive biomarkers (Paulsen et al., 2008), predict onset (Langbehn et al., 2010), profile the natural history (Abeyasinghe et al., 2021), and improve endpoints for HD clinical trials (Paulsen et al., 2010). These studies have generated a wealth of information, now being leveraged for other investigations. For example, built upon these studies and others, the recently developed data-driven HD Integrated Staging System (HD-ISS) (Tabrizi et al., 2022) has recognized the volumes of caudate and putamen as key biomarkers of pathogenesis. This has opened up the possibility of recruiting HDGECs, both pre-HD and symptomatic HD (symp-HD), for preventive trials by using HD-ISS stages 0, 1, and 2, that precede traditional Shoulson-Fahn stages (now incorporated in HD-ISS stage 3) (Shoulson and Fahn, 1979). However, HD-ISS serves as a first-stage classifier and further granular differentiation of HDGECs is required.

While the consideration of caudal and putaminal volumes in HD-ISS stage 1 is a major leap, brain deterioration in HD is not limited to the striatum, and other subcortical structures and cortical regions are also impacted during the disease course (Johnson et al., 2021, Tan et al., 2021, Van Den Bogaard et al., 2011). Generally, atrophy in deep and cortical regions is accompanied by a quadratic-like (Mofrad et al., 2021, Bethlehem et al., 2022) enlargement of the lateral ventricles, which are filled with cerebrospinal fluid (CSF) in order to compensate for tissue loss and preserve total intracranial volume (ICV) (Nestor et al., 2008, Mak et al., 2017). As a result, ventricular volume has been commonly used as a surrogate measure of brain atrophy in HD clinical trials (Tabrizi et al., 2019, UniQure Biopharma, 2019, Novartis-Pharmaceuticals, 2021) and other brain disorders (Petracca et al., 2121, Apostolova et al., 2013). One study found that incorporating the volume of lateral ventricles, as one of the inputs of a classification model that aimed for HD stage identification, significantly improved the discriminative power, as compared to the same model being trained solely with features derived from the striatum (Kohli et al., 2022, Kohli et al., 2023).

During the past decade, alongside the abundance of multi-modal data in neurodegenerative disorders, there has been a proliferation of artificial intelligence (AI) models for clinical support in diagnosis, prognosis, and treatment management (Bron et al., 2022, Kopf and Claassen, 2021, Mestre, 2022, Amirmoezzi et al., 2019). Machine learning models have proved the feasibility of: predicting the future motor state of pre-HD individuals from baseline scans (Castro et al., 2017); introducing new disease states within its course (Mohan et al., 2022); and improvements for phase II or III clinical trials (Koval et al., 2022). Moreover, predictive models of neurodegenerative conditions have become more clinically viable by incorporating Explainable AI, which is crucial for providing transparency and understanding of the underlying processes within such models, thereby increasing trust in their predictions (Schulz et al., 2020, de Jong et al., 2021, Antoniadi et al., 2022, Junaid et al., 2023, Rasheed et al., 2022).

This study presents a machine learning pipeline based on random forests (Strobl et al., 2009, Ghazaleh et al., 2021) to predict the enlargement rate of lateral ventricles within a 3-year time window. We took advantage of the nonlinear growth of lateral ventricles to identify fast-progressor HDGECs, especially in HD-ISS stages 1, and 2. For this purpose, we conducted a search for a novel scale aimed at stratifying moderate and fast progressor groups into more granular levels of individual risk. We hypothesized that random forests could find homogenous subspaces spanned by features from HD genetic load, cognitive, motor, and volumetric imaging biomarkers to achieve more accurate predictions of brain atrophy. The clinical utility of our proposed approach is to provide: 1) an HD-specific and early-stage-targeted prognostic enrichment and subsequent stratification tool for enhancing sample homogeneity based on predicted brain atrophy, facilitating the selection of participants for future HD clinical trials, 2) a patient-specific prognostic model enabling individualized post-hoc assessment of treatment effects, and 3) inter-variable relationships among biomarkers of HD progression.

2. Materials and methods

2.1. Data preparation

We used data from previously published cohorts: PREDICT (Paulsen et al., 2013), TRACK (Tabrizi et al., 2009), TrackON (Klöppel et al., 2015), and IMAGE (Jfd et al., 2013). The datasets involved repeated visits, with structural MRI scans being acquired along with genetic, cognitive, motor and functional assessments. Table 1 presents a summary of the baseline data regarding demographic information, CAG repeats, cognitive and motor assessments, as well as the length of study engagement (years), separated by cohorts. We limited our analyses to images obtained exclusively from 3Tesla scanners. Pre-processing and quality control were conducted similarly to our earlier study (Abeyasinghe et al., 2021), in which FreeSurfer (Reuter et al., 2010) was employed for subcortical segmentation, cortical parcellation and extraction of imaging-derived features. Volumes of deep and cortical regions were divided by ICV, while cortical thickness measurements were used without normalization as suggested in the literature (Westman et al., 2013). Longitudinal change of hemispheric differences in HD-related atrophy is demonstrated to be limited (Wijeratne et al., 2020), therefore, within our analyses we merged the volumes of subcortical structures from both hemispheres.

Table 1.

Demographic characteristics of Huntington’s Disease individuals at baseline.


TRACK
PREDICT
IMAGE
Aggregated Data
pre-HD symp-HD pre-HD symp-HD pre-HD symp-HD pre-HD symp-HD
N 156 111 106 12 35 31 297 154
%Female 55 % 55 % 59 % 75 % 60 % 35 % 58 % 53 %
Age 40.2 (9.0) 49.6 (9.4) 42.5 (12.4) 52.0 (11.7) 41.8 (9.0) 52.6 (9.4) 41.2 (10.5) 50.4 (9.6)
CAG 43.2 (2.4) 43.4 (2.8) 42.2 (2.5) 42.4 (2.8) 42.2 (1.9) 42.9 (2.1) 42.7 (2.5) 43.2 (2.7)
TMS 3.1 (2.8) 23.7 (10.8) 5.3 (5.2) 25.7 (14.1) 0.8 (1.1) 15.2 (6.8) 3.6 (4.0) 22.9 (11.0)
SDMT 51.6 (10) 34.0 (10.3) 52.1 (10.8) 39.2 (12.3) 52.7 (9.2) 42.1 (11.0) 51.9 (10.3) 35.6 (10.9)
SWR 100 (16.8) 77.9 (19.5) 99.2 (15.8) 77.7 (29) 103 (17.2) 93.9 (17.9) 100 (16.5) 80 (20.8)
LVER 680 (6 1 7) 2166 (1521) 707 (8 4 7) 2041 (1284) 532 (6 4 1) 2100 (1296) 673 (7 1 0) 2143 (1453)
Engagement 3.78 (1.85) 2.76 (1.45) 2.47 (0.47)

Mean (SD) for quantitative variables and count or percentage for categorical variables. CAG: cytosine-adenine-guanine repeats, TMS: Total Motor Score, SDMT: Symbol Digit Modalities Test, SWR: Stroop Word Reading test, LVER: Lateral Ventricular Enlargement Rate (mm3/year). Engagement refers to the duration, in years, between the initial and final MRI acquisition.

2.2. Input feature domains

Initially, the method of Minimum Redundancy Maximum Relevance (Peng et al., 2005) was used to identify key predictor domains and remove those features that were the least informative in association with ventricular volume. At this step, cortical surface area and curvature were excluded from the feature set. The remaining features, categorized into seven domains, are depicted in Fig. 1A. The features in each domain were not combined into a single parameter. Instead, to assess the impact of each domain on model performance, we sequentially incorporated them into the complete input feature set and recorded performance metrics. These feature domains comprised the CAP score, calculated as [Age × (CAG-35.5)] (Mason et al., 2018), UHDRS-TMS, cognitive biomarkers including symbol digit modalities test (SDMT) and Stroop Word Reading Test (SWR) (Poudel et al., 2014, Andre et al., 2014), together with 162 imaging derived features grouped into subcortical, cortical, and ventricular volumes (see Supplementary Table S1 for details). The hallmark structures affected in HD (Tan et al., 2021), including caudate, putamen, pallidum, thalamus, hippocampus, amygdala, accumbens, and corpus callosum were grouped together as the subcortical domain. Features related to the ventricular system involved the volumes of the lateral, inferior lateral, the 3rd and the 4th ventricles, and CSF. Our analyses showed that the choroid plexus was significantly larger in the later HD-ISS stages (see section 3 in the supplementary materials), so we included this feature in this group as well. We categorized both the volumes and thicknesses of 34 cortical regions as cortical morphometry features, each of which present bilaterally, based on the Desikan-Killiany atlas (Desikan et al., 2006). The volume of white matter hypo-intensities on T1-weighted images, which has been reported to be strongly correlated with T2-weighted hyper-intensities and to be also associated with age and pathology (Wei et al., 2019), was included as a feature domain in our analyses. Supplementary Fig. S1 illustrates how this feature is distributed across different groups and cohorts in this study.

Fig.1.

Fig.1

An overview of the pipeline for training the machine learning models for prognosis and stratification. (A) Within the sequential feature selection, biomarkers from different domains were incrementally added to the input feature set and fed into the models for training. (B) The random forest models were trained and then evaluated on an independent test set for each subset of selected features. (C) For each individual, the rate of forthcoming lateral ventricular enlargement was calculated using linear regression analysis and selected as the ground truth for the prognostic random forest regression model. (D) To stratify the population into two homogenous groups, a classification task was defined as forecasting whether an individual’s prospective rate of ventricular enlargement would be below or over a threshold of 750 mm3/year, which was set to be close to the median value to avoid class imbalance.

2.3. Prognostic model of lateral ventricular enlargement

The forthcoming lateral ventricular enlargement rate (LVER) over time was considered as an indicator of brain tissue loss (Lundervold et al., 2019). It was calculated using linear regression analysis for each individual separately, as schematically shown in Fig. 1C. Subsequently, LVER served as the dependent variable for training random forest regression models, using scikit-learn, a python library for machine learning (Pedregosa et al., 2011). Clinical trials are typically conducted over periods not exceeding 4 years (Estevez-Fraga et al., 2024, Kinnunen et al., 2021). For those individuals in PREDICT and TRACK, who participated in the studies for periods longer than 4 years, the longitudinal data was divided into two shorter segments, each segment having its own baseline features and corresponding calculated LVER value. However, it is noteworthy that this decision was made to obtain more consistent LVER calculations; our post-hoc analyses revealed that separation into segments did not significantly impact the model accuracy. For more details, see Section 17 in the supplementary materials. Extra care was taken to ensure exclusive allocation of these segments to either training or test sets. This technique yielded 537 input–output pairs, 80 % allocated for training, 10 % for validation, and 10 % for test.

To examine the prognostic value of each feature domain, we employed a wrapper-based sequential forward feature selection approach (Ruiz et al., 2006), where each domain was incrementally added to the input feature subset of the training dataset. The model was trained and tuned at each step, and the effect on the reduction of the prediction error in the prognostic model was recorded for both a 10-fold cross-validation setting and an independent unseen test set. Seven prognostic models were trained, each with its hyperparameters being tuned. Age and sex were included in the input feature set for all models to account for their potential confounding effect; similarly, ICV was also included in models V, VI, and VII which had imaging features.

Both feature selection and hyperparameter tuning processes were carried out using only the training dataset, with the test set remaining untouched. The number of constituent decision tree estimators in the random forest was 300 and the tuned hyperparameters included the maximum number of leaf nodes, minimum number of samples in each leaf, and the maximum number of samples and features being used for training.

2.4. Stratification model for moderate and fast progression

To evaluate the predictive power of the proposed pipeline, we assessed its capability to stratify the population in the pooled dataset into two groups: moderate and fast progressors, based on their anticipated brain atrophy. The distribution of ground truth LVERs across the pooled dataset is illustrated in Fig. 1D, with a detailed depiction of the distribution over HD-ISS stages provided in Supplementary Fig. S2. We selected the median LVER (750 mm3/year) as the reference threshold for classification. These stratification models were trained for prognostic enrichment in clinical trials using baseline features used for the prognostic models, but with the aim of identifying individuals who are more likely to exhibit fast disease progression and screening out those projected to experience moderate progression. The assumption here is that the HD continuum is sufficiently represented by the 537 baseline samples in the pooled dataset and that a random forest classifier could partition the space spanned by the input features into homogenous subspaces not relying on the individuals’ motor diagnosis, but rather based on the expected rate of pathologic neurodegeneration.

We used Shapley additive explanations (SHAP) (Lundberg and Lee, 2017) to reveal the basis for the classification outcomes and to demonstrate the contribution of input features, their importance, and interactions.

3. Results

3.1. Evaluation of the prognostic model

The prognostic model was repeatedly trained and evaluated while the input feature set sequentially incorporated the seven defined domains. Fig. 2 illustrates how the inclusion of each domain reduced the mean absolute error (MAE) in the predictions. The first three models, highlighted by yellow, were fed with features conventionally used for recruitment in clinical trials (CAP, SDMT, SWR, and TMS). By incrementally appending the imaging-derived features to the input set, at each step, a sharp decline was seen in the prediction error. For instance, by including the volume of white matter hypo-intensities, the cross-validated prediction error reduced by 3 %. The reduction was 24.6 % when comparing Model VI to Model I (refer to Supplementary Table S2). The error bars represent the standard deviation of MAE in a 10-fold cross validation setting. Alongside the improvement in predictive power, the uncertainty of the models, measured by coefficients of variation in MAE, also dropped: 0.13 for model I but 0.09 for model VI. This highlights how objectively measured imaging biomarkers compensated for the interrater variability and subjectivity of cognitive and motor assessments (Scheid et al., 2022). A similar trend of error reduction was seen in the unseen test set, shown in orange. It is worth noting that the hyperparameters were tuned solely to prevent overfitting to the training data, a common issue when random forests are used for regression tasks. This tuning procedure did not significantly affect evaluation metrics. A regularized linear regression model with considerably fewer hyperparameters showed a similar trend of MAE reductions, but with a smaller difference between train and test metrics (Supplementary Fig. S3).

Fig.2.

Fig.2

Sequential integration of biomarker domains into the feature set. By successively integrating each domain of biomarkers in the input features set, the error and the uncertainty of the prognostic models dropped. Mean absolute error is computed by taking average of deviations between true and predicted values of lateral ventricular enlargement rate in the train, validation, and test set. The test set comprised 53 samples, unseen during training, and was used to assess the generalizability of the model.

3.2. Performance of the stratification model

Stratification models were trained with the aim of segregating the pooled dataset into two groups of moderate and fast progressors. A classifier with input features and hyperparameters similar to prognostic model VI, could achieve the maximum accuracy of 0.77 on the independent test set and 0.81 within cross validation (precision = 0.83, recall = 0.80). The choice of the stratification threshold can be customized based on the specific clinical requirements. Supplementary Fig. S6 illustrates the relationships between the selected threshold and specificity, sensitivity, and accuracy which could be useful in fine-tuning the model's performance and selecting an optimal threshold that balances the specificity-sensitivity trade-off according to a study's goals. This allows selection of the threshold based on whether the focus in enrichment is to screen out individuals with moderate progression or prioritize the identification of fast progressors for inclusion.

Fig. 3 displays the SHAP values of the stratification counterpart of prognostic model VII, where the features are ranked according to their importance. The left and right ends of the spectrum represent moderate and fast LVERs, respectively. Red tones indicate higher values of each specific feature, while blue tones indicate lower values. The results validate the model's ability to learn and align with the clinical research findings regarding the pathological HD processes, including ventricular enlargement, subcortical atrophy, cortical thinning, and alterations in cognitive and motor scores. For example, higher TMS values associate with faster rates of ventricular enlargement; however, as compared to the red tones, blue tones have an extensively narrower distribution. This is due to the fact that TMS values for most individuals in the early stages of disease are near zero (Table 1), resulting in a skewness in TMS distribution known as the floor effect (Abreu et al., 2021). In the proposed pipeline, this issue is compensated by the contribution of features in rows above TMS, i.e., the ventricular, putaminal, and accumbal volumes (Fig. 3), which exhibit more symmetrical distributions compared to TMS and SDMT.

Fig.3.

Fig.3

Shapley additive explanations computed for the stratification model. Features are ranked based on their contribution to class predictions made by the trained model. Each feature is represented by a spectrum ranging from blue to red, with blue tones indicating lower feature values and red tones indicating higher values. The left and right ends of each spectrum correspond to moderate and fast lateral ventricular enlargement. The results demonstrate how imaging biomarkers can enhance the predictive power of conventional clinical and cognitive scores, allowing for more accurate homogenization and stratification of Huntington’s disease cohorts. (Inf-Lat-Vent: Inferior Lateral Ventricle, CC_central: central subdivision of corpus callosum).

3.3. Prognostic enrichment based on lateral ventricular enlargement rate

Fig. 4 illustrates the relationship between the baseline PIN score calculated using the formula given by Long et al. (Long et al., 2017) and the ground truth values of LVER. There exists extensive variability in LVER values across any specific range of PIN scores. This is due to the fact that the enlargement of the ventricular volume generally follows a quadratic-like pattern compared to caudate or putamen that atrophy linearly (Hobbs et al., 2010). For details, see Section 16 in the supplementary materials. In a trial that targets a specific range of PIN scores, the characteristics of recruited participants may become more homogenous by narrowing down participant selection based on the anticipated values of LVER. This can be achieved by employing the proposed pipeline which enables training and validating both the prognostic and the stratification models using any combination of the input feature domains. In a scenario, where imaging features were fully absent in the input feature set, the stratification model achieved a cross-validated accuracy of 0.72. Conversely, feeding the model exclusively with the imaging-derived features resulted in a higher cross-validated accuracy of 0.81. Refer to Supplementary Table S2 and Fig. S10 for details of the model evaluation.

Fig.4.

Fig.4

Association between baseline Normalized Prognostic Index (PIN) and the forthcoming lateral ventricular enlargement rates (LVER). While the values of LVER are associated with the PIN scores, there exists extensive variability in LVER values across any specific range of PIN scores. For instance, PIN scores near zero encompass a wide range of 100 to 3000 mm3/year for forthcoming LVER. In clinical trials, where participants are primarily stratified based on their PIN scores, recruitment can be further enriched by employing the proposed models to select individuals with narrower ranges of predicted LVER, thereby achieving a more homogeneous population. The size of the circles corresponds to the volume of lateral ventricles at baseline, normalized by intracranial volume.

A probabilistic perspective of the stratification model can be attained by quantifying its certainty in determining whether an individual with HD has surpassed the LVER threshold of 750 mm3/year. This is determined by the consensus among decision trees within a random forest. Higher certainties signify a more robust consensus among the trees, indicating a greater inclination towards classifying an individual as surpassing the threshold. Fig. 5 compares the model’s certainty, denoted as P(LVER>750 mm3/year), with the PIN score. Pearson’s kurtosis (K) and Shannon’s entropy (H) are used to quantify the distribution characteristics. Lower kurtosis and higher entropy correspond to a greater level of distinction among the samples and less concentration around the mean. While the PIN score demonstrates a high level of distinction among individuals in HD-ISS stage 3, characterized by lower kurtosis and higher entropy, it lacks the ability to strongly discriminate between individuals in earlier stages. Conversely, the proposed probabilistic metric offers extensive distinction among individuals in HD-ISS stages 2 or 1, albeit at the expense of reduced discriminatory power in HD-ISS stage 3.

Fig.5.

Fig.5

Comparative Analysis of Prognostic Metrics within HD-ISS Stages. Here, in a leave-one-out setting, for each individual, the prognostic model is trained using imaging data from other individuals and then employed for prediction. Baseline samples are stratified by HD-ISS stages. In stages 1 and 2, the proposed prognostic metric, P(LVER>750 mm3/year), exhibits a broader and flatter distribution than PIN (lower kurtosis). Equivalently, higher entropy indicates more diverse and less concentrated values, suggesting the applicability of P(LVER>750 mm3/year) for prognostic enrichment in stages 1 and 2. The proposed prognostic metric does not provide value over PIN for stage 0 and can be marginally beneficial in stage 3 for screening out slow-progressors.

3.4. Associations between cortical degeneration and ventricular enlargement

A comparison between prognostic models VI and VII suggests that incorporating features related to cortical morphometry may not significantly enhance the predictive power of the models. Nevertheless, model explanations demonstrated that several cortical morphometry features are informative for discriminating between moderate and fast disease progression.

In two separate analyses, we trained the models exclusively based on cortical measurements. Once fed with cortical thickness features, the cross-validated accuracy was 0.73 ± 0.04 (precision = 0.74, recall = 0.77), whereas it was 0.68 ± 0.07 (precision = 0.71, recall = 0.69) when trained using cortical volumes.

The thicknesses of the lateral occipital, superior temporal and superior parietal and the volumes of lateral occipital, the left pars opercularis and superior parietal cortices were the most important cortical morphometry features, consistent with previous research (Johnson et al., 2021). Degenerated cortical regions ranked by their contributions to model decisions based on SHAP are shown in Fig. 6. Regions depicted in darker tones associate with higher predictive power of the forthcoming atrophy rate and could potentially serve as monitoring biomarkers (FDA-NIH Biomarker Working Group, 2016) within trials similar to PRECREST (Rosas et al., 2014) that aim to slow cortical thinning.

Fig.6.

Fig.6

Cortical regions predictive of moderate and fast progression. Darker tones correspond to (A) cortical thicknesses and (B) cortical volumes of regions that had higher impact in predicting fast ventricular enlargement. These regions may potentially serve as monitoring biomarkers to assess treatment response in trials that aim to halt cortical atrophy.

4. Discussion

We developed a novel machine learning pipeline fed with multiple combinations of baseline HD biomarkers. We trained and validated regression and classification random forests for prognosis and stratification, respectively. Leveraging longitudinal brain scans alongside conventional HD biomarkers from PREDICT, TRACK, TRACKON and IMAGE, we focused on the accelerated enlargement of lateral ventricles over time as a proxy measure of brain atrophy. We systematically incorporated seven different domains of biomarkers into the input feature set, one at a time. This sequential integration allowed us to evaluate and rank the contribution of each domain to the improvement of predictive accuracy of the models. Particularly, the inclusion of imaging-derived features resulted in significant reductions in both errors and variances of predictions.

We adhered to the latest guidelines that recommend to view the disease course in neurodegenerative disorders as a continuum (JNPD Research and Innovation, 2019), and utilized random forests to classify participants into homogeneous groups of moderate and fast progressors based on their forthcoming brain tissue loss rather than their clinical motor diagnosis. This approach allowed the trained models to extrapolate HD stages without relying on data from neurologically normal individuals, which may introduce additional factors in the model that are related to neurodevelopmental aspects of HD and thus not specific to its neurodegenerative process (Van Der Plas et al., 2020, Kubera et al., 2018). Müller and colleagues proposed a disease progression score derived from a multiplicative and divisive relationship among the volumes of the parietal lobe, striatum, and lateral ventricles (Müller et al., 2018). The proposed model in this study also seeks a nonlinear relationship, however among all input features and LVER to identify HD individuals who are more likely to exhibit considerable ventricular expansion in future. What distinguishes the present study from similar previous data-driven works is that the proposed model is trained exclusively with features from HDGECs, without incorporating data from healthy controls. This choice is grounded in the clinical trial context, where non-gene carriers are not part of the study population and their inclusion in computational modelling schemes does not necessarily guarantee higher accuracy when the goal is segregating HDGECs into homogeneous subgroups. This is different from the common event-based approaches used in modelling the progression of neurodegenerative disorders, including HD (Müller et al., 2018, Wijeratne et al., 2021, Castro et al., 2022, Wijeratne et al., 2023), which is to view the disease course as consisting of two major distinct states: an initial state, where HDGECs are assumed to exhibit characteristic biomarkers similar to healthy individuals, and a subsequent state within which biomarker trajectories in HDGECs are expected to deviate from healthy individuals. The results of HD-YAS study (Scahill et al., 2020, Scahill et al., 2022) suggest that the former assumption holds true for cognitive and motor assessments. However, this may not be the case for cortical measurements, which have been presumed to be affected by the neurodevelopmental component in HD (Tan et al., 2021). An example is presented in Supplementary Fig. S7, where insular thickness was found to be significantly greater in pre-HD group not only compared to those with symptomatic HD, but also to healthy controls. Therefore, in order to enable a machine learning model to capture a holistic view of the dynamics of cortical changes throughout the HD course, it is essential to provide it with data that comprise a broader age range than the aggregated datasets used in this study and also more complicated features extracted from the cortex (Stoebner et al., 2023, Shishegar et al., 2021).

Improving the homogeneity of trial populations is crucial for obtaining reliable results in phase II or III clinical trials, which can be achieved by recruiting participants with less dissimilar prognosed disease progression (Maheux et al., 2023, Oxtoby et al., 2022). We selected random forests as our modelling approach, since their intrinsic design involves recursively partitioning a high-dimensional feature space by minimizing an impurity function in each of the nodes, ultimately resulting in input samples being clustered in optimized subdivisions with similar target values (Strobl et al., 2009), reminiscent of the goal of stratification in clinical applications. We took advantage of the non-parametric architecture of random forests to predict short-term brain atrophy in contrast to previous literature that used parametric models to fit predefined long-term trajectories to the biomarkers in individuals (Koval et al., 2022).

It is likely that while an individual in HD-ISS stage 1 does not exhibit noticeable motor or cognitive problem, their prognosed brain atrophy is higher than others at the same stage. On the other hand, a common practice in interventional clinical trials is to select participants from previously available cohorts, such as Enroll-HD (Long et al., 2023), where imaging data may be unavailable. We ranked the importance of conventional motor and cognitive biomarkers of HD progression with respect to their contributions to the decisions made by the stratification model. Our analysis revealed that the PIN had a higher importance than the CAP, cUHDRS, and their individual components (Supplementary Fig. S9). These findings are consistent with previous studies that reported on the predictive value of the PIN score (Langbehn et al., 2022). Given that PIN has also shown strong association with neurofilament light in HD (Parkin et al., 2022), we suggest that our proposed metric for prognostic enrichment could serve as a complementary tool along with the HD-ISS, after recruited participants are PIN-stratified. In such an arrangement, by leveraging information from imaging-derived features after MRI acquisition, the proposed models can further differentiate between individuals whose PIN scores suggest relatively similar prognosis (Fig.4, Fig.5). Moreover, the probability of anticipated LVER surpassing a certain threshold can be considered as a biomarker of HD progression, with higher probabilities highlighting more severe conditions. Such metrics that incorporate imaging information offer greater robustness, compared to composite scores that are based on cognitive and/or motor assessments that may be susceptible to fluctuations due to practice effect, rater subjectivity, or variability (Supplementary Fig. S11).

Inter-variable relationships among striatal and non-striatal degeneration, cortical thinning, and non-imaging biomarkers based on the value of each feature are presented by SHAP in Fig. 3. In an investigation of the spatiotemporal dynamics of cortical atrophy, Johnson et al. found that occipital and parietal regions undergo significantly higher rates of disease-related degeneration compared to frontal and temporal regions (Johnson et al., 2021). This is consistent with our findings, which indicated that parietal and occipital regions are more predictive of ventricular enlargement rates (Fig. 6).

A published post-hoc analysis of the participants in the GENERATION-HD1 tominersen study revealed that they were a decade older with respect to their brain age (Hawellek et al., 2022). Furthermore, the volumes of the lateral ventricles and choroid plexus have shown to serve as one of the dominant aspects of brain aging (Bethlehem et al., 2022, Smith et al., 2020). Our study confirms the implication of ventricular system as an informative biomarker of accelerated brain atrophy in HD progression. Accordingly, a surrogate endpoint for experimental therapeutics could be the inhibition of pathologic ventricular enlargement, particularly in earlier disease stages. Nevertheless, the applicability of ventricular volume as a measure of trial efficacy is constrained by intervention-induced brain inflammation (Tabrizi et al., 2019). One limitation of using the proposed prognostic model for evaluating treatment effects is that ventricular enlargement may be confounded by processes other than atrophy when the intervention involves intrathecal administration (McColgan et al., 2023). Further investigation of the proposed model with real-world data from clinical trials is essential. In particular, pre- and post-intervention LVER estimations should be compared, with contributions from each imaging-derived feature analysed using SHAP waterfall plots (see supplementary Figures S12 and S13). This potential functionality could facilitate post-hoc trial analysis by enabling a simultaneous comparison of changes in imaging-derived features alongside their corresponding SHAP values in model explanations. Utilizing these subject-specific estimations may also help to minimize, if not eliminate, the requirement for a large number of placebo-receiving participants, addressing the longstanding ethical issue of placebo arms within clinical trials and potentially improving success rates (Largent et al., 2022).

Ventricular expansion due to normal aging is inevitable and may confound the estimations, thereby limiting the applicability of the proposed models. On the other hand, it has been reported that conventional approaches to confound removal may lead to unintended leakage of data resulting in unrealistic improved performance and hindered reproducibility (Hamdan et al., 2022). In order to avoid this issue while accounting for normal aging, we trained a separate prognostic model using imaging domain features from healthy control individuals to decompose the effects of normal and pathogenic neurodegeneration and obtain a “lower bound” for the anticipated ventricular expansion (Schwarz, 2021), which could potentially be used to evaluate if a clinical trial has been successful in slowing brain atrophy (Supplementary Fig. S8).

The training dataset was not adjusted beyond the quality check of the automated brain segmentation. Even instances with negative ventricular change over time were included in the training set to ensure model generalizability. Multi-site participation is typically required for HD clinical trials to achieve sufficient statistical power. Volumetric MRI has shown to be less susceptible to multisite discrepancies than diffusion and functional MRI (Gregory et al., 2020). We also conducted out-of-sample testing for all configurations of using dissimilar cohorts for training and testing and found no significant bias towards a specific cohort (Supplementary Table S8). The use of random forests, which are composed of a large number of decision tree estimators ensembled together, makes the stratification model robust to measurement noise. However, for such AI-based systems to be used in clinical research, it is crucial that all image segmentations be thoroughly reviewed by domain experts to ensure clinical reliability. This is especially important as these systems are typically trained and tuned by non-clinicians.

Another limitation of this study is the absence of biofluid biomarkers in the dataset, which could potentially enhance the accuracy of the prognostic model and increase the sensitivity of the stratification model for identifying pre-HD fast progressors. Furthermore, it is worth exploring whether the inclusion of features related to diffusion or resting-state functional MRI (Polosecki et al., 2020) could improve the models' performance. Another limitation is that the calculation of ground truth values required at least two longitudinal brain scans, which led to the exclusion of several individuals from analysis. However, more advanced techniques, such as generative models, can incorporate data from subjects with only one brain scan as well. Additionally, recent studies have reported that deep learning-based brain segmentation tools (Kemenczky et al., 2022) may be less susceptible to head movement artifacts than traditional tools, such as Freesurfer.

In summary, we developed for the first time an explainable machine learning pipeline, based on various combinations of input features from baseline, to predict expected lateral ventricular enlargement within HD progression and to stratify individuals into groups with more similar prognosis. Our study shows that ventricular volume is an informative biomarker of accelerated brain atrophy in HD progression. We observed that the inclusion of imaging-derived features resulted in significant improvements in predictive accuracy. Our findings suggest that the proposed stratification model could serve as a complementary enrichment tool along with the HD-ISS, after recruited participants are stratified with respect to their PIN scores. This study provides insights into potential surrogate endpoints for experimental therapeutics in HD and could inform future clinical trial design.

Funding

TRACK-HD and TrackON-HD were supported by the CHDI Foundation, a not-for-profit organisation dedicated to finding treatments for Huntington’s disease. IMAGE-HD was supported by CHDI Foundation research agreement A-3433 and the National Health and Medical Research Council (NHMRC) Australia grant 606,650 (N.G.-K.). The PREDICT-HD study was funded by the NCATS and the NIH (NIH; R01—NS040068, U01—NS105509, U01—NS103475).

Data from the PREDICT-HD, TRACK-HD, and IMAGE-HD studies can be accessed through CHDI foundation upon request. The written scripts used for model development and evaluation in this study are available at: https://github.com/mghofrani/HD_prognostic_model.

CRediT authorship contribution statement

Mohsen Ghofrani-Jahromi: Writing – review & editing, Writing – original draft, Visualization, Validation, Methodology, Conceptualization. Govinda R. Poudel: Writing – review & editing, Validation, Supervision, Methodology, Conceptualization. Adeel Razi: Writing – review & editing, Supervision, Conceptualization. Pubu M. Abeyasinghe: Writing – review & editing, Visualization, Supervision, Project administration, Methodology, Data curation. Jane S. Paulsen: Writing – review & editing, Visualization, Validation. Sarah J. Tabrizi: Writing – review & editing, Validation, Methodology, Conceptualization. Susmita Saha: Writing – review & editing, Validation, Supervision, Methodology. Nellie Georgiou-Karistianis: Writing – review & editing, Writing – original draft, Validation, Supervision, Project administration, Methodology, Investigation, Funding acquisition, Data curation, Conceptualization.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

IMAGE-HD data used in this work was generously provided by the participants in the IMAGE-HD study and made available by Professor Nellie Georgiou-Karistianis, Principal Investigator. TRACK-HD and TrackON-HD data used in this work were generously provided by the participants in these studies and made available by Dr Sarah Tabrizi, Principal Investigator. PREDICT-HD data used in this work was generously provided by the participants in PREDICT-HD study and made available by the PREDICT-HD investigators and coordinators of the Huntington study group, Dr Jane S. Paulsen, Principal Investigator. The writers would like to acknowledge valuable inputs from Dr. Atefeh Zamani, University of New South Wales, Dr. Amir Saberi, AGL Energy, and Dr. Salman Ghofrani, Shiraz University.

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.nicl.2024.103650.

Appendix A. Supplementary data

The following are the Supplementary data to this article:

Supplementary Data 1
mmc1.docx (11.7MB, docx)

Data availability

The authors do not have permission to share data.

References

  1. Abeyasinghe P.M., Long J.D., Razi A., et al. Tracking Huntingtonʼs disease progression using motor, functional, cognitive, and imaging markers. Mov. Disord. 2021;36(10):2282–2292. doi: 10.1002/MDS.28650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Abreu D., Ware J., Georgiou-Karistianis N., et al. Utility of Huntington’s disease assessments by disease stage: floor/ceiling effects. Front. Neurol. 2021;12(July) doi: 10.3389/fneur.2021.595679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Amirmoezzi Y., Salehi S., Parsaei H., Kazemi K., Torabi J.A. A knowledge-based system for brain tumor segmentation using only 3D FLAIR images. Australas. Phys. Eng. Sci. Med. 2019;42:529–540. doi: 10.1007/s13246-019-00754-5. [DOI] [PubMed] [Google Scholar]
  4. Andre R., Scahill R.I., Haider S., Tabrizi S.J. Biomarker development for Huntington’s disease. Drug Discov. Today. 2014;19(7):972–979. doi: 10.1016/j.drudis.2014.03.002. [DOI] [PubMed] [Google Scholar]
  5. Antoniadi A.M., Galvin M., Heverin M., Wei L., Hardiman O., Mooney C. A clinical decision support system for the prediction of quality of life in ALS. J Pers Med. 2022;12(3) doi: 10.3390/jpm12030435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Apostolova L.G., Babakchanian S., Hwang K.S., et al. Ventricular atrophy and its clinical correlates in the imaging cohort from the ADCS MCI Donepezil/Vitamin E study. Alzheimer Dis. Assoc. Disord. 2013;27(2):174. doi: 10.1097/WAD.0b013e3182677b3d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bethlehem R.A.I., Seidlitz J., White S.R., et al. Brain charts for the human lifespan. Nature. 2022;604(7906):525–533. doi: 10.1038/s41586-022-04554-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bron E.E., Klein S., Reinke A., et al. Ten years of image analysis and machine learning competitions in dementia. Neuroimage. 2022;253 doi: 10.1016/j.neuroimage.2022.119083. [DOI] [PubMed] [Google Scholar]
  9. Castro E., Polosecki P., Rish I., et al. Baseline multimodal information predicts future motor impairment in premanifest Huntington’s disease. NeuroImage Clin. December 2017;2018(19):443–453. doi: 10.1016/j.nicl.2018.05.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Castro E., Polosecki P., Pustina D., Wood A., Sampaio C., Cecchi G.A. Predictive modeling of huntington’s disease unfolds thalamic and caudate atrophy dissociation. Mov. Disord. 2022;37(12):2407–2416. doi: 10.1002/mds.29219. [DOI] [PubMed] [Google Scholar]
  11. Ciarochi J.A., Calhoun V.D., Lourens S., et al. Patterns of co-occurring gray matter concentration loss across the huntington disease prodrome. Front. Neurol. 2016;7(SEP):1–14 doi: 10.3389/fneur.2016.00147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. de Jong J., Cutcutache I., Page M., et al. Towards realizing the vision of precision medicine: AI based prediction of clinical drug response. Brain. 2021;144(6):1738–1750. doi: 10.1093/brain/awab108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Desikan R.S., Ségonne F., Fischl B., et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage. 2006;31(3):968–980. doi: 10.1016/j.neuroimage.2006.01.021. [DOI] [PubMed] [Google Scholar]
  14. Estevez-Fraga C., Scahill R.I., Durr A., et al. Composite UHDRS correlates with progression of imaging biomarkers in huntington’s disease. Mov. Disord. 2021;36(5):1259–1264. doi: 10.1002/mds.28489. [DOI] [PubMed] [Google Scholar]
  15. Estevez-Fraga C., Tabrizi S.J., Wild E.J. Huntington’s disease clinical trials corner: march 2024. J. Huntingtons Dis. 2024;13(1):1–14. doi: 10.3233/JHD-240017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. FDA Enrichment strategies for clinical trials to support determination of effectiveness of human drugs and biological products guidance for industry. FDA Guid Ind. 2019;29(8):908–913. [Google Scholar]
  17. FDA-NIH Biomarker Working Group. BEST (Biomarkers, EndpointS, and other Tools) Resource. Food Drug Adm. Published online 2016. https://www.ncbi.nlm.nih.gov/books/NBK338448/. [PubMed]
  18. Ghazaleh N., Houghton R., Palermo G., Schobel S.A., Wijeratne P.A., Long J.D. Ranking the predictive power of clinical and biological features associated with disease progression in huntington’s disease. Front. Neurol. 2021;12(May):6–9. doi: 10.3389/fneur.2021.678484. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ghosh S., Sun Z., Li Y., et al. An exploration of latent structure in observational huntington’s disease studies. AMIA Jt Summits Transl. Sci. Proceedings AMIA Jt Summits Transl. Sci. 2017;2017:92–102. http://www.ncbi.nlm.nih.gov/pubmed/28815114 [PMC free article] [PubMed] [Google Scholar]
  20. Gregory S., Lohse K.R., Johnson E.B., et al. Longitudinal structural MRI in neurologically healthy adults. J. Magn. Reson. Imaging. 2020;52(5):1385–1399. doi: 10.1002/jmri.27203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Haider S., Wild E., Tabrizi S.J. Huntington’s disease and other choreas. Neurodegeneration. Published Online. 2017:248–283. [Google Scholar]
  22. Hamdan, S., Love, B.C., von Polier, G.G., et al., 2022. Confound-leakage: Confound Removal in Machine Learning Leads to Leakage. arXiv Prepr arXiv221009232. Published online 2022. [DOI] [PMC free article] [PubMed]
  23. Hawellek, D.J., Engemann, D.A., Holiga, S., Napolitano, A., Abaei, M., McColgan, P., 2022. E10 Cross-sectional exploration of the clinical utility of brain age as an imaging-based marker of disease pathology in Huntington’s disease (HD) based on the tominersen phase III trial generation HD 1. In: E: Imaging. Vol 93. BMJ Publishing Group Ltd; 2022:A35.1-A35. doi:10.1136/jnnp-2022-ehdn.86.
  24. Hobbs N.Z., Barnes J., Frost C., et al. Onset and progression of pathologic atrophy in Huntington disease: a longitudinal MR imaging study. Am. J. Neuroradiol. 2010;31(6):1036–1041. doi: 10.3174/ajnr.A2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Jamwal S., Elsworth J.D., Rahi V., et al. Expert review of neurotherapeutics gene therapy and immunotherapy as promising strategies to combat Huntington ’ s disease- associated neurodegeneration : emphasis on recent updates and future perspectives. Expert Rev. Neurother. 2020;20(11):1123–1142. doi: 10.1080/14737175.2020.1801424. [DOI] [PubMed] [Google Scholar]
  26. Jfd D., Egan G.F., Gray M.A., et al. Multi-Modal Neuroimaging in Premanifest and Early Huntington ’ s Disease: 18 Month Longitudinal Data from the IMAGE-HD Study. 2013;8(9):16–22. doi: 10.1371/journal.pone.0074131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. JNPD Research and Innovation Strategy.; 2019. https://www.neurodegenerationresearch.eu/initiatives/the-2018-jpnd-research-and-innovation-strategy/.
  28. Johnson E.B., Ziegler G., Penny W., et al. Archival report dynamics of cortical degeneration over a decade in Huntington ’ s disease. Biol. Psychiatry. 2021;89(8):807–816. doi: 10.1016/j.biopsych.2020.11.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Johnson E.B., Ziegler G., Penny W., et al. Dynamics of cortical degeneration over a decade in Huntington’s disease. Biol. Psychiatry. 2021;89(8):807–816. doi: 10.1016/j.biopsych.2020.11.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Junaid M., Ali S., Eid F., El-Sappagh S., Abuhmed T. Explainable machine learning models based on multimodal time-series data for the early detection of Parkinson’s disease. Comput. Methods Programs Biomed. 2023;234 doi: 10.1016/j.cmpb.2023.107495. [DOI] [PubMed] [Google Scholar]
  31. Kemenczky P., Vakli P., Somogyi E., et al. Effect of head motion-induced artefacts on the reliability of deep learning-based whole-brain segmentation. Sci. Rep. 2022;12(1):1–13. doi: 10.1038/s41598-022-05583-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Kim A., Lalonde K., Truesdell A., et al. New avenues for the treatment of huntington’s disease. Int. J. Mol. Sci. 2021;22(16) doi: 10.3390/ijms22168363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kinnunen, K.M., Mullin, A.P., Pustina, D., et al., 2021. Recommendations to Optimize the Use of Volumetric MRI in Huntington ’ s Disease Clinical Trials;12(October). doi:10.3389/fneur.2021.712565. [DOI] [PMC free article] [PubMed]
  34. Kinnunen K.M., Schwarz A.J., Turner E.C., et al. Volumetric MRI-based biomarkers in Huntington’s disease: an evidentiary review. Front. Neurol. 2021;12(September) doi: 10.3389/fneur.2021.712555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Klöppel S., Gregory S., Scheller E., et al. Compensation in preclinical Huntington’s disease: evidence from the track-On HD study. EBioMedicine. 2015;2(10):1420–1429. doi: 10.1016/j.ebiom.2015.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Kohli M., Pustina D., Warner J.H., et al. E13 Predicting Huntington’s disease state using structural MRI: it’s more than just the striatum. Published Online. 2022 [Google Scholar]
  37. Kohli M., Pustina D., Warner J.H., Alexander D.C., Scahill R.I. Predicting Huntington ’ s disease state with ensemble learning & sMRI : more than just the striatum. Published Online. 2023 [Google Scholar]
  38. Kopf A., Claassen M. Latent representation learning in biology and translational medicine. Patterns. 2021;2(3) doi: 10.1016/j.patter.2021.100198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Koval I., Dighiero-Brecht T., Tobin A.J., et al. Forecasting individual progression trajectories in Huntington disease enables more powered clinical trials. Sci. Rep. 2022;12(1):18928. doi: 10.1038/s41598-022-18848-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kubera K.M., Schmitgen M.M., Hirjak D., Wolf R.C., Orth M. Cortical neurodevelopment in pre-manifest Huntington’s disease. NeuroImage Clin. 2018;2019(23) doi: 10.1016/j.nicl.2019.101913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Langbehn D.R., Hayden M.R., Paulsen J.S., et al. CAG-repeat length and the age of onset in Huntington Disease (HD): a review and validation study of statistical approaches. Am. J. Med. Genet. Part B Neuropsychiatr. Genet. 2010;153(2):397–408. doi: 10.1002/ajmg.b.30992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Langbehn D.R., Fine E.M., Meier A., Hersch S. Utility of the Huntington’s disease prognostic index score for a perimanifest clinical trial. Mov. Disord. 2022;37(5):1040–1046. doi: 10.1002/mds.28944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Largent E.A., Walter S., Childs N., et al. Putting participants and study partners FIRST when clinical trials end early. Alzheimer’s Dement. 2022 doi: 10.1002/alz.12732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Lee J.M., Huang Y., Orth M., et al. Genetic modifiers of Huntington disease differentially influence motor and cognitive domains. Am. J. Hum. Genet. 2022;109(5):885–899. doi: 10.1016/j.ajhg.2022.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Long J.D., Langbehn D.R., Tabrizi S.J., et al. Validation of a prognostic index for Huntington’s disease. Mov. Disord. 2017;32(2):256–263. doi: 10.1002/mds.26838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Long J.D., Gantman E.C., Mills J.A., et al. Applying the Huntington’s disease integrated staging system (HD-ISS) to observational studies. J. Huntingtons Dis. 2023;12(1):57–69. doi: 10.3233/JHD-220555. [DOI] [PubMed] [Google Scholar]
  47. Lundberg S.M., Lee S.I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process Syst. 2017;30 [Google Scholar]
  48. Lundervold, A.J., Vik, A., Lundervold, A., 2019. Lateral ventricle volume trajectories predict response inhibition in older age—a longitudinal brain imaging and machine learning approach. Jäncke L, ed. PLoS One; 14(4):e0207967. doi:10.1371/journal.pone.0207967. [DOI] [PMC free article] [PubMed]
  49. Maheux E., Koval I., Ortholand J., et al. Forecasting individual progression trajectories in Alzheimer’s disease. Nat. Commun. 2023;14(1):761. doi: 10.1038/s41467-022-35712-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Mak E., Su L., Williams G.B., et al. Longitudinal whole-brain atrophy and ventricular enlargement in nondemented Parkinson’s disease. Neurobiol. Aging. 2017;55:78–90. doi: 10.1016/j.neurobiolaging.2017.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Mason S.L., Daws R.E., Soreq E., et al. Predicting clinical diagnosis in Huntington’s disease: an imaging polymarker. Ann. Neurol. 2018;83(3):532–543. doi: 10.1002/ana.25171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. McColgan P, Gregory S, Zeun P, et al. Neurofilament light associated connectivity in young-adult Huntington’s disease is related to neuronal genes. Brain. Published online June 27, 2022:awac227. doi:10.1093/brain/awac227. [DOI] [PMC free article] [PubMed]
  53. McColgan P., Thobhani A., Boak L., et al. Tominersen in adults with manifest Huntington’s disease. N. Engl. J. Med. 2023;389(23):2203–2205. doi: 10.1056/NEJMc2300400. [DOI] [PubMed] [Google Scholar]
  54. Mestre T.A. Using big data in movement disorders: disease states and progression in Huntington’s disease. Mov. Disord. 2022;37(3):441–443. doi: 10.1002/mds.28943. [DOI] [PubMed] [Google Scholar]
  55. Mofrad S.A., Lundervold A., Lundervold A.S. A predictive framework based on brain volume trajectories enabling early detection of Alzheimer’s disease. Comput. Med. Imaging Graph. 2021;90 doi: 10.1016/j.compmedimag.2021.101910. [DOI] [PubMed] [Google Scholar]
  56. Mohan A., Sun Z., Ghosh S., et al. A machine-learning derived Huntington’s disease progression model: insights for clinical trial design. Mov. Disord. 2022;37(3):553–562. doi: 10.1002/mds.28866. [DOI] [PubMed] [Google Scholar]
  57. Müller H.P., Huppertz H.J., Dreyhaupt J., et al. Combined cerebral atrophy score in Huntington’s disease based on atlas-based MRI volumetry: sample size calculations for clinical trials. Parkinsonism Relat. Disord. 2018;2019(63):179–184. doi: 10.1016/j.parkreldis.2019.02.004. [DOI] [PubMed] [Google Scholar]
  58. Nestor S.M., Rupsingh R., Borrie M., et al. Ventricular enlargement as a possible measure of Alzheimer’s disease progression validated using the Alzheimer’s disease neuroimaging initiative database. Brain. 2008;131(Pt 9):2443–2454. doi: 10.1093/brain/awn146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Novartis-Pharmaceuticals A. Dose range finding study with open-label extension to evaluate the safety of Oral LMI070/branaplam in early manifest huntington’s disease (VIBRANT-HD) Published Online. 2021 https://clinicaltrials.gov/ct2/show/NCT05111249 [Google Scholar]
  60. Oxtoby N.P., Shand C., Cash D.M., Alexander D.C., Barkhof F. Targeted screening for alzheimer’s disease clinical trials using data-driven disease progression models. Front. Artif. Intell. 2022;5(May):1–9. doi: 10.3389/frai.2022.660581. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Parkin G.M., Corey-Bloom J., Long J.D., Snell C., Smith H., Thomas E.A. Associations between prognostic index scores and plasma neurofilament light in Huntington’s disease. Parkinsonism Relat. Disord. 2022;97:25–28. doi: 10.1016/j.parkreldis.2022.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Paulsen J.S., Langbehn D.R., Stout J.C., et al. Detection of Huntington’s disease decades before diagnosis: the Predict-HD study. J. Neurol. Neurosurg. Psychiatry. 2008;79(8):874–880. doi: 10.1136/JNNP.2007.128728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Paulsen J.S., Wang C., Duff K., et al. Challenges assessing clinical endpoints in early Huntington disease. Mov. Disord. 2010;25(15):2595–2603. doi: 10.1002/mds.23337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Paulsen J.S., Smith M.M., Long J.D., Investigators P.H.D. Group C of the HS. Cognitive decline in prodromal Huntington disease: implications for clinical trials. J. Neurol. Neurosurg. Psychiatry. 2013;84(11):1233–1239. doi: 10.1136/jnnp-2013-305114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Paulsen J.S., Long J.D., Ross C.A., et al. Prediction of manifest Huntington’s disease with clinical and imaging measures: a prospective observational study. Lancet Neurol. 2014;13(12):1193–1201. doi: 10.1016/S1474-4422(14)70238-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Paulsen J.S., Lourens S., Kieburtz K., Zhang Y. Sample enrichment for clinical trials to show delay of onset in huntington disease. Mov. Disord. 2019;34(2):274–280. doi: 10.1002/mds.27595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Pedregosa F., Varoquaux G., Gramfort A., et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]
  68. Peng H., Long F., Ding C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 2005;27(8):1226–1238. doi: 10.1109/TPAMI.2005.159. [DOI] [PubMed] [Google Scholar]
  69. Petracca, M., Pontillo, G., Cocozza, S., et al., 2020. Estimating brain atrophy in clinical settings: linear measures of ventricular enlargement for evaluation of disease activity and long-term disability progression in Multiple Sclerosis (2121). Published online 2020.
  70. Polosecki P., Castro E., Rish I., et al. Resting-state connectivity stratifies premanifest Huntington’s disease by longitudinal cognitive decline rate. Sci. Rep. 2020;10(1) doi: 10.1038/s41598-020-58074-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Poudel G.R., Stout J.C., Domínguez D.J.F., et al. White matter connectivity reflects clinical and cognitive status in Huntington’s disease. Neurobiol. Dis. 2014;65:180–187. doi: 10.1016/j.nbd.2014.01.013. [DOI] [PubMed] [Google Scholar]
  72. Poudel G.R., Stout J.C., Churchyard A., Chua P., Egan G.F., Georgiou-Karistianis N. Longitudinal change in white matter microstructure in Huntington’s disease: the IMAGE-HD study. Neurobiol. Dis. 2015;74:406–412. doi: 10.1016/j.nbd.2014.12.009. [DOI] [PubMed] [Google Scholar]
  73. Poudel G.R., Harding I.H., Egan G.F., Georgiou-Karistianis N. Network spread determines severity of degeneration and disconnection in Huntington’s disease. Hum. Brain Mapp. 2019;40(14):4192–4201. doi: 10.1002/hbm.24695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Rasheed K., Qayyum A., Ghaly M., Al-Fuqaha A., Razi A., Qadir J. Explainable, trustworthy, and ethical machine learning for healthcare: a survey. Comput. Biol. Med. 2022;149 doi: 10.1016/j.compbiomed.2022.106043. [DOI] [PubMed] [Google Scholar]
  75. Reilmann R., McGarry A., Grachev I.D., et al. Safety and efficacy of pridopidine in patients with Huntington’s disease (PRIDE-HD): a phase 2, randomised, placebo-controlled, multicentre, dose-ranging study. Lancet Neurol. 2019;18(2):165–176. doi: 10.1016/S1474-4422(18)30391-0. [DOI] [PubMed] [Google Scholar]
  76. Reuter M., Rosas H.D., Fischl B. Highly accurate inverse consistent registration: a robust approach. Neuroimage. 2010;53(4):1181–1196. doi: 10.1016/j.neuroimage.2010.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Rosas H.D., Doros G., Gevorkian S., et al. PRECREST: a phase II prevention and biomarker trial of creatine in at-risk Huntington disease. Neurology. 2014;82(10):850–857. doi: 10.1212/WNL.0000000000000187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Ruiz R., Riquelme J.C., Aguilar-Ruiz J.S. Incremental wrapper-based gene selection from microarray data for cancer classification. Pattern Recogn. 2006;39(12):2383–2392. [Google Scholar]
  79. Scahill R.I., Zeun P., Osborne-Crowley K., et al. Biological and clinical characteristics of gene carriers far from predicted onset in the Huntington’s disease Young Adult Study (HD-YAS): a cross-sectional analysis. Lancet Neurol. 2020;19(6):502–512. doi: 10.1016/S1474-4422(20)30143-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Scahill R.I., Sahakian B.J., Robbins T.W., et al. E01 The HD young adult study 2: longitudinal follow up. Published Online. 2022 [Google Scholar]
  81. Scheid B.H., Aradi S., Pierson R.M., et al. Predicting severity of Huntington’s disease with wearable sensors. Front Digit Heal. 2022:4. doi: 10.3389/fdgth.2022.874208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Schulz M.A., Yeo B.T.T., Vogelstein J.T., et al. Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets. Nat. Commun. 2020;11(1) doi: 10.1038/s41467-020-18037-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Schwarz A.J. The use, standardization, and interpretation of brain imaging data in clinical trials of neurodegenerative disorders. Neurotherapeutics. 2021;18(2):686–708. doi: 10.1007/s13311-021-01027-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Shishegar R., Pizzagalli F., Georgiou-Karistianis N., Egan G.F., Jahanshad N., Johnston L.A. A gyrification analysis approach based on Laplace Beltrami eigenfunction level sets. Neuroimage. 2021;229 doi: 10.1016/j.neuroimage.2021.117751. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Shoulson I., Fahn S. Huntington disease: Clinical care and evaluation. Neurology. 1979;29(1):1. doi: 10.1212/WNL.29.1.1. [DOI] [PubMed] [Google Scholar]
  86. Smith S.M., Elliott L.T., Alfaro-Almagro F., et al. Brain aging comprises many modes of structural and functional change with distinct genetic and biophysical associations. Elife. 2020:9. doi: 10.7554/eLife.52677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Stoebner Z.A., Hett K., Lyu I., et al. Comprehensive shape analysis of the cortex in Huntington’s disease. Hum. Brain Mapp. 2023;44(4):1417–1431. doi: 10.1002/hbm.26125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Strobl C., Malley J., Tutz G. An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychol. Methods. 2009;14(4):323–348. doi: 10.1037/a0016973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Tabrizi S.J., Langbehn D.R., Leavitt B.R., et al. Biological and clinical manifestations of Huntington’s disease in the longitudinal TRACK-HD study: cross-sectional analysis of baseline data. Lancet Neurol. 2009;8(9):791–801. doi: 10.1016/S1474-4422(09)70170-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Tabrizi S.J., Scahill R.I., Owen G., et al. Predictors of phenotypic progression and disease onset in premanifest and early-stage Huntington’s disease in the TRACK-HD study: analysis of 36-month observational data. Lancet Neurol. 2013;12(7):637–649. doi: 10.1016/S1474-4422(13)70088-7. [DOI] [PubMed] [Google Scholar]
  91. Tabrizi S.J., Leavitt B.R., Landwehrmeyer G.B., et al. Targeting Huntingtin expression in patients with Huntington’s disease. N. Engl. J. Med. 2019;380(24):2307–2316. doi: 10.1056/nejmoa1900907. [DOI] [PubMed] [Google Scholar]
  92. Tabrizi S.J., Schobel S., Gantman E.C., et al. A biological classification of Huntington’s disease: the integrated staging system. Lancet Neurol. 2022;21(7):632–644. doi: 10.1016/S1474-4422(22)00120-X. [DOI] [PubMed] [Google Scholar]
  93. Tabrizi S.J., Estevez-Fraga C., van Roon-Mom W.M.C., et al. Potential disease-modifying therapies for Huntington’s disease: lessons learned and future opportunities. Lancet Neurol. 2022;21(7):645–658. doi: 10.1016/S1474-4422(22)00121-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Tan B., Shishegar R., Poudel G.R., Fornito A., Georgiou-Karistianis N. Cortical morphometry and neural dysfunction in Huntington’s disease: a review. Eur. J. Neurol. 2021;28(4):1406–1419. doi: 10.1111/ene.14648. [DOI] [PubMed] [Google Scholar]
  95. UniQure Biopharma B.V. Safety and proof-of-concept (POC) Study with AMT-130 in adults with early manifest Huntington disease. Published Online. 2019 [Google Scholar]
  96. Van Den Bogaard S.J.A., Dumas E.M., Acharya T.P., et al. Early atrophy of pallidum and accumbens nucleus in Huntington’s disease. J. Neurol. 2011;258(3):412–420. doi: 10.1007/s00415-010-5768-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Van Der Plas E., Schultz J.L., Nopoulos P.C. The neurodevelopmental hypothesis of Huntington’s disease. J. Huntingtons Dis. 2020;9(3):217–229. doi: 10.3233/JHD-200394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Warner J.H., Long J.D., Mills J.A., et al. Standardizing the CAP score in huntington’s disease by predicting age-at-onset. J Huntingtons Dis. 2022;11(2):153–171. doi: 10.3233/JHD-210475. [DOI] [PubMed] [Google Scholar]
  99. Wei K., Tran T., Chu K., et al. White matter hypointensities and hyperintensities have equivalent correlations with age and CSF β-amyloid in the nondemented elderly. Brain Behav. 2019;9(12):e01457. doi: 10.1002/brb3.1457. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Westman E., Aguilar C., Muehlboeck J.S., Simmons A. Regional magnetic resonance imaging measures for multivariate analysis in Alzheimer’s disease and mild cognitive impairment. Brain Topogr. 2013;26(1):9–23. doi: 10.1007/s10548-012-0246-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Wijeratne P.A., Johnson E.B., Eshaghi A., et al. Robust markers and sample sizes for multicenter trials of huntington disease. Published Online. 2020:751–762. doi: 10.1002/ana.25709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Wijeratne P.A., Johnson E.B., Gregory S., et al. A Multi-study model-based evaluation of the sequence of imaging and clinical biomarker changes in Huntington’s disease. Front Big Data. 2021;4(August):1–8. doi: 10.3389/fdata.2021.662200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Wijeratne P.A., Eshaghi A., Scotton W.J., et al. The temporal event-based model: learning event timelines in progressive diseases. Imaging Neurosci. 2023 doi: 10.1162/imag_a_00010. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data 1
mmc1.docx (11.7MB, docx)

Data Availability Statement

The authors do not have permission to share data.


Articles from NeuroImage : Clinical are provided here courtesy of Elsevier

RESOURCES