Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Feb 1.
Published in final edited form as: Mov Disord. 2016 Nov 28;32(2):256–263. doi: 10.1002/mds.26838

Validation of a Prognostic Index for Huntington's Disease

Jeffrey D Long 1, Douglas R Langbehn 2, Sarah J Tabrizi 3, Bernhard G Landwehrmeyer 4, Jane S Paulsen 5, John Warner 6, Cristina Sampiao 7
PMCID: PMC5318276  NIHMSID: NIHMS819166  PMID: 27892614

Abstract

Background

The aim was to develop a prognostic index for motor diagnosis in Huntington's disease and examine its predictive performance in external observational studies.

Methods

The pre-diagnosis Neuro-biological Predictors of Huntington's Disease study (N=945 gene-positive) was used to select a Cox regression model for computing a prognostic index. Cross-validation was used for selecting a model with good internal validity performance using the research sites as natural splits of the data set. Then the external predictive performance was assessed using pre-diagnosis data from three additional observational studies, The Cooperative Huntington Observational Research Trial (N=358), TRACK-HD (N=118), and REGISTRY (N=480).

Results

Model selection yielded a prognostic index computed as the weighted combination of the UHDRS total motor score, symbol digit modalities test, baseline age, and cytosine-adenine-guanine expansion. External predictive performance was very good for the first two of the three studies, with the third being a much more progressed cohort than the other studies. The databases were pooled and a final Cox regression model was estimated. The regression coefficients were scaled to produce the prognostic index for Huntington's disease, and a normed version, which is scaled relative to a 10-year 50% probability of motor diagnosis.

Conclusion

The positive results of this comprehensive validity analysis provide evidence that the prognostic index is generally useful for predicting Huntington's disease progression in terms of risk of future motor diagnosis. The variables for the index are routinely collected in on-going observational studies and the index can be used to identify cohorts for clinical trial recruitment.

Keywords: Huntington's disease, motor diagnosis, prognostic index, disease progression, external validation

Introduction

Huntington's disease (HD) is an inherited progressive neuro-degenerative disorder caused by a cytosine-adenine-guanine (CAG) repeat expansion in the HTT gene. HD signs and symptoms include motor, cognitive, and psychiatric features. There are no current treatments to slow the progression of the disease, but early-phase clinical trials have commenced with the ultimate goal of changing the disease course1.

Resources for trial recruitment include clinical research platforms, such as Enroll-HD2. Enroll-HD has an embedded observational study with regular visits at which clinical data are collected. Characterizing the progression level of candidate participants is essential for helping to recruit individuals for whom a treatment has measurable efficacy.

Pre-diagnosis progression has traditionally been characterized by a combination of age and CAG expansion because of the well-known association with the timing of motor onset35. Additional work has shown that the prediction of motor diagnosis, defined as the highest examiner rating on the UHDRS diagnostic confidence level (DCL)6, is enhanced by considering clinical variables along with the influence of CAG length7.

Recently, data from several HD observational studies has become available: PREDICT-HD8, COHORT9, TRACK-HD10, and REGISTRY11. The databases provide a unique opportunity to develop a prognostic index for HD (PIHD) and externally validate its predictive performance. Developing a PIHD that can be used across studies would be useful for estimating progression levels in pre-diagnosis HD and recruiting appropriate candidates for clinical trials. For example, the PIHD can be useful when researchers want to recruit individuals from disparate sources who are just a few years away from estimated diagnosis.

In this study, the pre-diagnosis PREDICT-HD database will be used to develop a PIHD using CAG, baseline age (age at visit entry), and UHDRS motor and cognitive variables that are common among the studies. Imaging and other technology intensive variables will be omitted for the pragmatic reasons that they are not measured in all studies and existing registries like Enroll-HD do not have such variables. Once the PIHD is developed with the PREDICT-HD database, the PIHD will be externally validated in the remaining studies by predicting the timing of motor diagnosis in the remaining pre-diagnosis gene-positive HD individuals.

Methods

Study population

Neuro-biological Predictors of Huntington's Disease (PREDICT-HD) is a longitudinal observational study of pre-diagnosis HD with 32 sites in six countries (AUS, CAN, DEU, ESP, GBR, USA)8,1215 with data collected 2002-2014. TRACK-HD is a longitudinal prospective observational study of pre-diagnosis and early HD with four sites in four countries (CAN, FRA, GBR, NE)10,16,17 with data collected 2008-2011. The Cooperative Huntington Observational Research Trial (COHORT) is a longitudinal observational study of HD or at-risk individuals with 38 sites in three countries (AUS, CAN, USA)9 with data collected 2006-2011. REGISTRY is a longitudinal observational study that includes pre-diagnosis HD, manifest (diagnosed) HD, and at-risk individuals, with a total of 150 mostly European sites11,18 with data collected 2004-2012.

HD gene carriers who did not have a motor diagnosis at baseline (study entry) were used for the analysis. Motor diagnosis was defined as DCL = 4, which is the highest rating and indicates that the examiner has ≥ 99% confidence that the patient exhibits unequivocal HD motor signs6. Additional inclusion criteria were ≥ 18 years of age (REGISTRY did not exclude Juvenile HD), a lab-confirmed CAG ≥ 36 (CAG range among studies was 36-66), and complete data on the variables for the analysis. Sample sizes and descriptive statistics for key variables at baseline after applying the inclusion criteria are shown in Table 1.

Table 1.

Descriptive statistics for variables measured at study entry for the pre-diagnosis participants meeting the analysis inclusion criteria. Mean (SD) for quantitative variables and count (percentage) for categorical variables.

PREDICT COHORT TRACK REGISTRY
N 945 358 118 480
Sites 32 38 4 89
Site Sample Size 28.64 (16.42) 9.42 (8.89) 29.50 (0.58) 5.39 (6.02)
Female 605 (64) 225 (63) 64 (54) 257 (54)
Follow-up(Years) 4.83 (2.95) 2.06 (1.02) 2.71 (0.64) 2.34 (2.13)
Motor DX 225 (24) 83 (23) 21 (18) 319 (66)
Age 40.40 (10.36) 42.06 (12.51) 40.80 (8.86) 45.19 (12.18)
CAG 42.42 (2.68) 42.41 (2.81) 43.14 (2.41) 43.91 (3.73)
DCL=3 44 (5) 46 (13) 2 (2) 136 (28)
TMS 4.93 (5.29) 6.20 (8.39) 2.53 (1.68) 12.07 (15.61)
STROOP 98.88 (17.37) 90.62 (19.87) 99.67 (16.10) 79.41 (24.86)
SDMT 50.64 (11.57) 44.93 (12.11) 51.41 (10.24) 38.48 (17.01)

Note. Motor DX = prospective motor diagnosis (DCL = 4); CAG = cytosine-adenine-guanine expansion; TMS = total motor score; STROOP = Stroop word, SDMT = symbol digit modalities test.

Study activities were reviewed and approved by institutional review boards (PREDICT-HD, COHORT) or local ethics committees (TRACK-HD, REGISTRY). Informed consent procedures were carried out for each participant, and signed consents for participation and the distribution of de-identified data for collaborative research were obtained.

Statistical analysis

The first analysis used the PREDICT-HD database for variable selection. Ten pre-planned Cox regression models were specified based on previous work with PREDICT-HD7. The models were specified to represent different levels of parsimony, with the full model (Model 1) having the most predictors (see Figure 1). All models used the predictor scores at baseline to predict the hazard of motor diagnosis (the first occurrence of DCL = 4). A method to reduce bias in model selection is cross-validation (CV), which is often used with several splits of data in order to reduce variance. The participating research sites in PREDICT-HD provided natural and convenient splits (subgroups of participants). Thus, leave-one-site-out CV (LOSO-CV) was used for model selection19. For LOSO-CV, each Cox model was estimated using all the sites but one, and then the performance of the model for predicting the timing of motor diagnosis was evaluated in the omitted site. The process was repeated leaving each site out in turn, resulting in a predictive performance index for all 32 sites. For each estimated Cox model, the PIHD was computed as the weighted combination of the regression weights and the predictor scores. Focus was on prediction, so the proportional hazards assumption was not a primary concern. However, Schoenfeld residuals showed general consistency with the proportional hazards assumption for the main model discussed below.

Fig. 1.

Fig. 1

Boxplots and individual values (circles) of Harrell's C for 10 models among the PREDICT sites. TMS, total motor score; STROOP, Stroopword; SDMT, symbol digit modalities test; AGE, baseline age; CAP5Age × (CAG - 34).

Predictive performance was assessed using the integrated Brier score20, Harrell's C (concordance) statistic21, and a C statistic weighted for censoring22. The indexes provided similar model evaluations and we focus on Harrell's C because it is the most commonly reported (extended results are presented in the appendix). C indexes the ability of the PIHD to discriminate between individuals with longer time to motor diagnosis and those with shorter time. C is similar to the area under the ROC curve in sensitivity/specificity analysis, but it accounts for the right-censoring due to dropout and study termination. C = 0.5 indicates no better than chance prediction of the timing of motor diagnosis, whereas C = 1 is perfect prediction (each pair of individuals is correctly ordered in terms of timing of diagnosis). A guideline for the evaluation of the magnitude of C is provided by a survey of values found in external validation studies in oncology and cardiovascular disease23. The survey showed mean C = 0.78, and we consider this value to be the benchmark for typical and acceptable predictive performance.

The second analysis was an external validation that included all the databases and used the model selected in the first analysis. To provide a benchmark for comparison, internal validation for each study was performed using LOSO-CV, and the mean C was computed among sites. For the external validation, the weights estimated from PREDICT-HD were used to compute the PIHD in each other study, then C was computed. The Cox regression weights and standard errors (SEs) of the PIHD predictors were estimated independently in each study (values were multiplied by 1000 to avoid very small numbers). A significance test of the difference between a study's regression weight and the corresponding PREDICT-HD regression weight was computed using a multiple-group analysis based on dummy coding with combined databases24.

Risk groups for motor diagnosis were formed by using the quartiles of the PIHD as computed in PREDICT-HD applied to all the studies. In PREDICT-HD the quartiles divided the PIHD distribution into four equal groups (25% each). These same quartiles were used in the other studies, but they did not necessarily divide the other distributions into quarters. Survival was defined as the probability of not having a motor diagnosis, and survival curves were estimated using cubic splines25 to provide smooth curves over the follow-up period. Similar cubic spline survival curves and bootstrapped 95% CIs were constructed for the pooled data as described below.

Results

Model selection results using only the PREDICT-HD database are shown in Figure 1. The figure consists of boxplots of distributions of Harrell's C among the sites for the 10 models. Predictive performance varied by site, with some sites showing worse than random performance for some models (C < 0.5), and other sites showing perfect performance (C = 1.0). Several models had similar good performance (Model 1 through Model 6), but the summary statistics slightly favored Model 4. Model 4 had the largest median (C = 0.87) and the largest lower and upper quartiles (25% = 0.82, 75% = 0.93) indicating its C distribution had the closest proximity to perfect prediction (shifted closest to the top of Figure 1). Similar results were shown for the Brier score and the weighted C, and Model 4 had the best aggregate rank among all the measures (see appendix). Therefore, we considered Model 4 to be an adequate model to carry forward for further consideration (other models could have been carried forward as well). In all subsequent analysis, the PIHD was computed as the weighted composite of the UHDRS total motor score (TMS), symbol digit modalities test (SDMT), and the CAG-Age Product (CAP)5, with the latter being CAP = Age at baseline × (CAG – 34).

External validation results using C are shown in Table 2. For COHORT and TRACK-HD, the PIHD external predictive performance (CEXT) was very similar to the internal performance (CINT). For example, COHORT had internal performance of CINT = 0.85 and external performance of CEXT = 0.84. The REGISTRY study showed low internal predictive performance (CINT = 0.64) and even lower external performance (CEXT = 0.56). The Cox regression weights for TRACK were larger in absolute value than their PREDICT-HD counterparts, and much more so for TMS. However, the estimated SEs were also large and there were no statistically significant differences. (Entry criteria for the TRACK-HD study included a maximum screening TMS of 5, and the restricted range may be related to these estimates.) The COHORT SDMT weight was significantly smaller than the PREDICT-HD weight, but the differences for TMS (larger) and CAP (smaller) were not significant. Each REGISTRY regression weight was significantly smaller in absolute value than the corresponding PREDICT-HD weight, which was indicative of the general weakness of the predictors in the REGISTRY database.

Table 2.

External validation results: C statistic (internal and external validation), Cox regression weights (SEs), and tests of difference with the PREDICT weights.

Study CINT CEXT TMSa SDMTa CAPa
PREDICT 0.85 56.93 (9.44) -35.01 (7.02) 7.63 (0.79)
COHORT 0.85 0.84 58.25 (12.71) -22.45*** (13.09) 4.46 (1.41)
TRACK 0.79 0.79 219.99 (121.86) -61.45 (27.21) 9.23 (5.39)
REGISTRY 0.64 0.56 0.43*** (5.27) -5.56*** (5.05) 2.33** (0.45)

Note.

a

Regression weights and SEs are multiplied by 1000 to avoid small numbers;

CINT = internal validation Harrell's C; CEXT = external validation;

***

p<.001,

**

p<.01,

*

p<.05;

TMS = total motor score; SDMT = symbol digit modalities test; CAP = Age × (CAG - 34).

Figure 2 shows the smooth survival curves for the motor diagnosis risk groups along with the counts and percentages of group membership (survival is the probability of not being diagnosed). The risk gradient of the survival curves displayed by PREDICT-HD (upper left panel) was similar for COHORT (upper right) and TRACK-HD (lower left) for the early years (up to year 3) in the sense that the risk curves were in the same order from top to bottom (Low ≥ Mid-Low > Mid-High > High). In contrast, the curve ordering for REGISTRY (lower right) was inconsistent, as there were several order changes (crossing lines) in the early years among the three lower risk groups. However, similar to the other studies, the REGISTRY High risk group curve was consistently the lowest over time (there was some early overlapping with Mid-High). By design, the PIHD quartiles define four equal groups for PREDICT-HD, but the same PREDICT-HD PIHD cutoffs did not result in equal groups in the other studies. However, COHORT and TRACK-HD had a similar 50% split as PREDICT-HD into lower risk of motor diagnosis (Low + Mid-Low) and higher risk of motor diagnosis (Mid-High + High). On the other hand, REGISTRY had more skewed classification percentages, with 82% of the sample classified as having a higher risk of motor diagnosis.

Fig. 2.

Fig. 2

Cubic spline survival curves for risk groups of motor diagnosis defined by the PREDICT-HD study cutoffs. Counts (percentages) are shown for each group.

Pooling among databases is justified when baseline hazards (and survival curves) are similar26. Cubic spline estimates of the baseline hazards are shown in the appendix. Similar to Figure 2, there was reasonable similarity among PREDICT-HD, COHORT, and TRACK-HD. Thus, the three databases were combined, and a final Cox regression model was estimated. Based on the estimated Cox regression weights (multiplied by 1000 and rounded for simplicity), the formula is computed as

PIHD=51×TMS+(34)×SDMT+7×Age×(CAG34).

Larger values index greater risk of motor diagnosis, and hence, greater predicted progression. For example, if a person has TMS = 10, SDMT = 90, age = 40, and CAG = 42, then PIHD = -310, whereas a person with TMS = 15, SDMT = 70, age = 50, and CAG = 42 has PIHD = 1185. The PIHD was “normalized” (scaled) to enhance interpretation. The survival curve was estimated for each PIHD based on the fitted model. For the 10-year time point, the PIHD value associated with a 0.5 survival probability was selected as the centering constant (= 883), and the SD of the PIHD distribution was selected as the scaling constant (= 1044). The prognostic index normed for HD (PINHD) is

PINHD=PIHD8831044.

PINHD is in SD units and is interpreted relative to 50% 10-year survival. PINHD < 0 indicates greater than 50% 10-year survival, and PINHD > 0 indicates less than 50% 10-year survival. From the example above, the person with PIHD = -310 has PINHD = -1.14 that is 1.14 SDs below the 50% 10-year survival mark, and the person with PIHD = 1185 has PINHD = 0.29 that is 0.29 SDs above the mark. Figure 3 shows the survival curves and bootstrapped 95% CIs for different ranges of PINHD defined by a lower cutoff value. For example, a lower cutoff of 1 defines the right-tail range of PINHD ≥ 1 for the pooled data. For this cohort, the figure indicates that 40% are expected to have a motor diagnosis a little after 2 years (60% remain pre-diagnosis), and 70% are expected to have a motor diagnosis by 4 years (30% remain pre-diagnosis).

Fig. 3.

Fig. 3

Cubic spline survival curves (95% CIs) for ranges of the prognositc index normed for Huntington's disease (PINHD). Curves are based on pooling PREDICT-HD, TRACK-HD, and COHORT.

Discussion

To our knowledge, this is the first external validation of a prognostic index for motor diagnosis (conversion) in pre-diagnosis HD. External validation is the gold standard for reproducibility27,28, and the very good predictive performance in two out of the three external studies underscores the general usefulness of the PIHD. The external predictive performance was slightly higher than that found in similar validation studies23.

The poor performance in the REGISTRY study can be explained by a lack of variability in progression level. The REGISTRY cohort was much more progressed at baseline, with many individuals having clinical scores in the ranges typically associated with diagnosis. The lack of progression variability resulted in the PIHD variables (TMS, SDMT and CAP) having limited predictive power. Like the other studies, REGISTRY had a standard protocol, but it was the only study to enroll individuals less than 18 years of age, it had the largest variety of languages (13), the largest number of sites (see Table 1), and all the sites were European for our analysis.

Though the predictive performance of the PIHD was poor for the REGISTRY study, this does not necessary threaten the validity of the PIHD. Consistent with conventional benchmarks of motor and cognitive performance, the PIHD was successful in correctly classifying most of the REGISTRY individuals as having a high risk of motor diagnosis.

The PIHD or its normed version (PINHD) can be used to predict progression, with higher scores indicating greater risk of motor diagnosis. For clinical purposes, predicted progression provides general information about the current status and course of the disease. The survival curves indicate what happens to an individual's cohort with a common PIHD. Such information can be useful for treatment strategies and life planning.

For research purposes, PIHD provides a more accurate index. Progression is commonly indexed by the CAG-Age Product (CAP), which is a type of “burden score” of age and CAG expansion that has several variants35,29. Our results show that when CAP is supplemented with the TMS and SDMT, predictive performance increases. The added variables include one of the core components of the UHDRS, the summation of motor signs (TMS), and the other is one of the three UHDRS cognitive tests (SDMT) that also has a motor component.

Because PINHD is easy to compute with commonly measured variables, it is useful for recruiting cohorts of individuals for clinical trials from among those who are registered in on-going observational studies, such as Enroll-HD. The anonymized Enroll-HD database is available to researchers online (https://www.enroll-hd.org/for-researchers/access-data/), and the PINHD can be computed using an individual's last wave of data. There is an optional consent for participation in future research studies, which has a very high rate of endorsement (> 90%)2. One use of PINHD is to identify consenting individuals who have progression levels appropriate for a clinical trial. If a preventative trial requires participants to be many years from motor diagnosis, then a certain bottom percent (e.g., 20%) of individuals with the smallest PINHD values might be selected from Enroll-HD. More finely tuned selection is possible if the researcher can identify the desired proportion of converters at the end of the proposed study. Each PINHD value (or range) is associated with a cohort that has a specific expected rate of conversion over a given time interval. After deciding upon the non-treated survival curve that is appropriate for the clinical trial, the potential pool of candidates can be identified by the associated PINHD value(s). For example, an intervention that targets relatively advanced progression might require the untreated group to have 40% survival (60% conversion) at a study terminus of 3.5 years. Figure 2 indicates that the appropriate survival curve is associated with the range of PINHD ≥ 1. Consenting individuals from Enroll-HD whose PINHD falls within this range might be candidates for recruitment.

In addition to identifying whom to recruit, PINHD is useful for determining the sample size when the endpoint of the clinical trial is conversion itself (time to DCL = 4). In such trials, the goal is to determine if motor diagnosis is delayed by treatment relative to placebo. The expected number of events and the hypothesized treatment effect on the hazard ratio are the key determinants of sample size for survival-based clinical trials30. The survival curve for a PINHD can be used to estimate the number of expected conversions in the (untreated) placebo group at study end.

PINHD can also be informative for recruitment when the trial is not directly concerned with survival, as when change in TMS is the outcome. Based on the analysis of large observational studies, it is well established that the rate of deterioration for imaging and clinical variables accelerates as they increasingly depart from control-like initial levels10,15,17,31. The rate of deterioration also increases with CAP32. Because PINHD is based on clinical variables and CAP, deterioration of clinically relevant outcome measures is expected to accelerate as PINHD increases. Consider the example of change in ICV-corrected putamen volume × 1000 (the scaling is used to avoid small numbers) using the PREDICT-HD database. We found that PINHD = -1 was associated with an annual rate of decrease of β = -0.0627 (95% CI = [-0.0714, -0.0539]), and PINHD = 1 was associated with the greater rate of β = -0.0881 (95% CI = [-0.0987, -0.0774]).

An ancillary analysis with results presented in the appendix showed that the weights of the PIHD and its predictive performance were similar when estimated separately at seven annual follow-up visits. Thus, there is evidence that the PIHD will not change much when computed over a several-year span. The implication for recruitment is that if individuals have repeated observations, then only the last wave of data might be sufficient for predicting progression level.

The emphasis of this study was on developing a prognostic index with variables that are routinely collected in observational studies. Potentially important predictors, such as putamen volume, were intentionally excluded because COHORT and REGISTRY did not have imaging, and neither does Enroll-HD. Recent results using PREDICT-HD7 show that imaging variables were predictive of motor diagnosis when considered in isolation. However, when important clinical variables (e.g., TMS, SDMT) and genetic variables (i.e., CAP) were already in the model, the boost in predictive power was modest when imaging variables were added. Therefore, the exclusion of imaging variables does not substantially lower the predictive power of the PIHD. A potentially more powerful prognostic index could be developed using a combination of newly developed or newly discovered imaging, clinical, genetic, or wet biomarker variables.

A caveat regarding the analysis is that only individuals without a motor diagnosis at baseline were analyzed. This resulted in the exclusion of diagnosed participants. There is a potential for selection bias if the included and excluded individuals were representative of different populations.

The databases used in this study may not represent the more general clinic population. PREDICT-HD and TRACK included participants who had pre-manifest genetic testing. Individuals who have such testing may be less than 10% of the HD population in North America31 (though the percentage is probably higher in Europe and elsewhere).

Another qualification is the overlap of participants for COHORT and PREDICT-HD. The data from these studies was concurrently collected and they had many sites in common. Because of patient confidentiality and the anonymized data, it is not possible to identify unique participants. Nonetheless, we did use birth year, CAG length, and gender to examine commonalities among the studies. Approximately a third of the COHORT subjects could not be mapped to the PREDICT-HD subjects (overlap is probably 75% at most).

We caution that the PINHD computed for one person does not necessarily predict progression for that individual. The survival curves of the models and the figures indicate the risk of motor diagnosis for a cohort with a common PINHD. Due to individual variability, prediction for a single person is difficult, and the survival curve for a person's cohort may have little bearing on their actual risk for motor diagnosis32.

Finally, our findings demonstrate that a relatively simple risk score computed on readily available variables measured at a single visit has reasonable generalizability and usefulness for predicting progression towards HD motor diagnosis, and thus, may be valuable for targeted recruitment for clinical trials. Further analysis will be needed to assess if usefulness is improved adding newly developed variables.

Supplementary Material

Supp Appendix2

Figure A1. Harrell's C and Cox regression weights for PREDICT-HD follow-up visits.

Figure A2. Estimated baseline hazards based on the pooled data.

Table A1. Extended model selection results. Harrell's C, C with inverse censoring weights, integrated Brier score, mean standardized rank, and p-value of aggregated rank. Trimmed means among sites are reported (20% trimming).

Acknowledgments

Samples and/or data from the COHORT Study, which received support from HP Therapeutics, Inc., were used in this study. We thank the Huntington Study Group COHORT investigators and coordinators who collected data and/or samples used in this study, as well as participants and their families, who made this work possible.

TRACK-HD was supported by the CHDI Foundation, a not-for-profit organization dedicated to finding treatments for Huntington's disease. We thank the TRACK-HD study participants and their families. We also acknowledge the support of the National Institute for Health Research University College London Hospitals Biomedical Research Centre and the Manchester Biomedical Research Centre.

PREDICT-HD was supported by the US National Institutes of Health (NIH) under the following grants: 5R01NS040068, 5R01NS054893, 1S10RR023392, 1U01NS082085, 5R01NS050568, 1U01NS082083, and 2UL1TR000442-06 (JS Paulsen principal investigator). This research was also supported by CHDI Foundation grant A3917, and the National Alliance for Medical Image Computing, which provided general data collection/analysis support. We thank the staff at the PREDICT-HD sites, the study participants, the National Research Roster for Huntington Disease Patients and Families, the Huntington's Disease Society of America, and the Huntington Study Group.

We thank the REGISTRY participants and families, CHDI, EHDN, and the REGISTRY investigators of EHDN.

Footnotes

Financial Disclosures: Dr. Long has a consulting agreement with NeuroPhage, is a paid consultant for Roche Pharma and Azevan Pharmaceuticals, and receives funding from CHDI, Michael J. Fox Foundation, and NIH.

Dr. Langbehn received research funding from CHDI, from the Neurology Institute of the University college of London, and from NIH/NIDA via grant 5R01AA021165-03. He is also a paid statistical consultant for Roche and Voyager pharmaceutical companies for Huntington's disease clinical trial design.

Dr. Tabrizi has has served on advisory boards or had consultancies with F. Hoffmann-La Roche Ltd, Ionis Pharmaceuticals, Ixico Technologies, Shire Human Genetic Therapies, Takeda Pharmaceuticals International and TEVA Pharmaceuticals. All honoraria paid for these consultancies and advisory boards go to UCL, Dr. Tabrizi's employer.

Dr. Landwehrmeyer has provided consulting services, advisory board functions, clinical trial services and/or lectures for Allergan, Alnylam, Aventis, Amarin, AOP Orphan Pharmaceuticals AG, Bayer Pharma AG, Desitin, Genzyme, GlaxoSmithKline, Hoffmann-La Roche, Ipsen, ISIS Pharma, Lundbeck, Neurosearch Inc, Medesis, Medivation, Medtronic, Novartis, Pfizer, Prana Biotechnology, Raptor Pharmaceuticals, Sangamo/Shire, Sanofi-Aventis, Siena Biotech, Temmler Pharma GmbH, Trophos, and TEVA and has received research grant support from the CHDI Foundation, the Bundesministerium f?r Bildung und Forschung (BMBF), the Deutsche Forschungsgemeinschaft (DFG), the European Commission (EU-FP7). His study site Ulm has received compensation in the context of the observational REGISTRY-Study of European Huntington's Disease Network (EHDN).

Dr. Paulsen has served on an advisory board for Lundbeck LLC, has a consulting agreement with ProPhase LLC, and has support from CHDI and NIH (5RO1NS040068, 5R01NS054893).

Dr. Warner is supported by CHDI Management/CHDI Foundation.

Dr. Sampaio is supported by CHDI Management/CHDI Foundation, has consultancy agreements in the last 24 months with Deerfield GSK, Neuroderm, Neurotrope, Nestle, Stealth, vTV therapeutics, and has collected honorariuns from the International Parkinson Disease and Movement Disorders Society.

Contributor Information

Jeffrey D. Long, University of Iowa, Departments of Psychiatry, Biostatistics.

Douglas R. Langbehn, University of Iowa, Departments of Psychiatry, Biostatistics.

Sarah J. Tabrizi, UCL Huntington's Disease Centre, Dept of Neurodegenerative Disease, University College London Institute of Neurology.

Bernhard G. Landwehrmeyer, CHDI Foundation, University of Ulm, Department of Neurology.

Jane S. Paulsen, University of Iowa, Departments of Psychiatry, Psychology, Neurology.

John Warner, CHDI Foundation.

Cristina Sampiao, CHDI Foundation, University of Lisbon, Laboratory of Clinical Pharmacology and Therapeutics.

References

  • 1.Wild EJ, Tabrizi SJ. Targets for future clinical trials in Huntington's disease: What's in the pipeline. Movement Disorders. 2014;29:1434–1445. doi: 10.1002/mds.26007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Landwehrmeyer BG, Fitter-Attas C, Giuliano J, et al. Data analytics from Enroll-HD, a global clinical research platform for Huntington's disease. Movement Disorder Clinical Practice. 2016 doi: 10.1002/mdc3.12388. Epub. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Langbehn DR, Hayden MR, Paulsen JS. CAG-repeat length and the age of onset in Huntington Disease (HD): A review and validation study of statistical approaches. American Journal of Medical Genetics, Part B. 2010;153:397–408. doi: 10.1002/ajmg.b.30992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Penney JB, Vonsattel JP, MacDonald ME, Gusella JF, Myers RH. CAG repeat number governs the development rate of pathology in Huntington's disease. Annals of Neurology. 1997;41:689–692. doi: 10.1002/ana.410410521. [DOI] [PubMed] [Google Scholar]
  • 5.Zhang Y, Long JD, Mills JA, Warner JH, Lu W, Paulsen JS. Indexing disease progression at study entry with individuals at-risk for Huntington disease. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics. 2011;156:751–763. doi: 10.1002/ajmg.b.31232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Huntington Study Group. Unified Huntington's Disease Rating Scale: Reliability and-consistency. Movement Disorders. 1996;11:136–142. doi: 10.1002/mds.870110204. [DOI] [PubMed] [Google Scholar]
  • 7.Long JD, Paulsen JS. Multivariate prediction of motor diagnosis in Huntington disease: 12 years of PREDICT-HD. Movement Disorders. 2015;12:1664–1672. doi: 10.1002/mds.26364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Paulsen J, Long J, Ross C, et al. Prediction of manifest Huntington's disease with clinical and imaging measures: A prospective observational study. Lancet Neurology. 2014;13:1193–1201. doi: 10.1016/S1474-4422(14)70238-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Dorsey ER the Huntington Study Group COHORT Investigators. Characterization of a large group of individuals with huntington disease and their relatives enrolled in the COHORT study. PLoS ONE. 2012;7:1–10. doi: 10.1371/journal.pone.0029522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tabrizi SJ, Scahill RI, Owen G, et al. Predictors of phenotypic progression and disease onset in premanifest and early-stage Huntington's disease in the TRACK-HD study: Analysis of 36-month observational data. Lancet Neurology. 2013;12:637–649. doi: 10.1016/S1474-4422(13)70088-7. [DOI] [PubMed] [Google Scholar]
  • 11.Handley O, Landwehrmeyer B REGISTRY Steering Committee and the EHDN REGISTRY Investigators. European huntington's disease network registry: Current status. Journal of Neurology, Neurosurgery and Psychiatry. 2012;83:A47. [Google Scholar]
  • 12.Paulsen JS, Hayden M, Stout JC, et al. Preparing for preventive clinical trials: The Predict-HD study. Archives of Neurology. 2006;63:883–890. doi: 10.1001/archneur.63.6.883. [DOI] [PubMed] [Google Scholar]
  • 13.Paulsen JS, Langbehn DR, Stout JC, et al. Detection of Huntington's disease decades before diagnosis: The PREDICT-HD study. Journal of Neurology, Neurosurgery & Psychiatry. 2008;79:874–880. doi: 10.1136/jnnp.2007.128728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Paulsen JS, Wang C, Duff K, et al. Challenges assessing clinical endpoints in early Huntington disease. Movement Disorders. 2010;15:2595–2603. doi: 10.1002/mds.23337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Paulsen JS, Long JD, Johnson HJ, et al. Clinical and biomarker changes in premanifest Huntington disease show trial feasibility: A decade of the PREDICT-HD study. Frontiers in Aging Neuroscience. 2014;6:1–11. doi: 10.3389/fnagi.2014.00078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Tabrizi SJ, Langbehn DR, Leavitt BR, et al. Biological and clinical manifestations of Huntington's disease in the longitudinal TRACK-HD study: Cross-sectional analysis of baseline data. Lancet Neurology. 2009;8:791–801. doi: 10.1016/S1474-4422(09)70170-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Tabrizi S, Scahill R, Durr A, et al. Biological and clinical changes in premanifest and early stage Huntington's disease in the TRACK-HD study: The 12-month longitudinal analysis. Lancet Neurology. 2011;10:31–42. doi: 10.1016/S1474-4422(10)70276-3. [DOI] [PubMed] [Google Scholar]
  • 18.Orth M, Handley OJ, Schwenke C, Landwehrmeyer B. Observing Huntington's disease: The European Huntington's disease network's REGISTRY. Journal of neurology, neurosurgery, and psychiatry. 2010;82:1409–1412. doi: 10.1136/jnnp.2010.209668. [DOI] [PubMed] [Google Scholar]
  • 19.König IR, Malley JD, Weimar C, Diener HC, Ziegler A. Practical experiences on the necessity of external validation. Statistics in Medicine. 2007;26:5499–5511. doi: 10.1002/sim.3069. [DOI] [PubMed] [Google Scholar]
  • 20.Gerds TA, Schumacher M. Consistent estimation of the expected Brier score in general survival models with right-censored event times. Biometrical Journal. 2006;48:1029–1040. doi: 10.1002/bimj.200610301. [DOI] [PubMed] [Google Scholar]
  • 21.Harrell FE, Califf RM, Pryor DB, Lee KL, Rosati RA. Evaluating the yield of medical tests. Journal of the American Medical Association. 1982;247:2543–2546. [PubMed] [Google Scholar]
  • 22.Uno H, Cai T, Pencina MJ, D'Agostino RB, Wei WL. On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics in Medicine. 2011;30:1105–1117. doi: 10.1002/sim.4154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Collins GS, Groot JA de, Dutton S, et al. External validation of multivariable prediction models: A systematic review of methodological conduct and reporting. BMC Medical Research Methodology. 2014;14:40–51. doi: 10.1186/1471-2288-14-40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Clogg CC, Petkova E, Haritou A. Statistical methods for comparing regression coefficients between models. American Journal of Sociology. 1995;100:1261–1312. [Google Scholar]
  • 25.Royston P, Parmar MKB. Restricted mean survival time: An alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome. BMC Medical Research Methodology. 2013;13:152–162. doi: 10.1186/1471-2288-13-152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Royston P, Altman DG. External validation of a cox prognostic model: Principles and methods. BMC Medical Research Methodology. 2013;13:33–48. doi: 10.1186/1471-2288-13-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Bouwmeester W, Zuithoff NPA, Mallett S, Geerlings MI, Vergouwe Y, et al. Reporting and methods in clinical prediction research: A systematic review. PLoS Medicine. 2012;9:1–13. doi: 10.1371/journal.pmed.1001221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Debray TPA, Vergouweb Y, Koffijberg H, Nieboer D, Steyerberg EW, Moons KGM. A new framework to enhance the interpretation of external validation studies of clinical prediction models. Journal of Clinical Epidemiology. 2015;68:279–289. doi: 10.1016/j.jclinepi.2014.06.018. [DOI] [PubMed] [Google Scholar]
  • 29.Ross CA, Aylward EH, Wild EJ, et al. Huntington disease: Natural history, biomarkers and prospects for therapeutics. Nature Reviews Neurology. 2014;10:204–216. doi: 10.1038/nrneurol.2014.24. [DOI] [PubMed] [Google Scholar]
  • 30.Machin D, Campbell MJ, Tan SB, Tan SH. Sample size tables for clinical studies. Hoboken, NJ: Wiley-Blackwell; 2009. [Google Scholar]
  • 31.Paulsen JS, Smith MM, Long JD. Cognitive decline in prodromal Huntington: Disease: Implications for clinical trials. Journal of Neurology, Neurosurgery, and Psychiatry. 2013;84:1233–1239. doi: 10.1136/jnnp-2013-305114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Long JD, Paulsen JS, Marder K, Zhang Y, Kim J, Mills JA. Tracking motor impairments in the progression of Huntington's disease. Movement Disorders. 2014;29:311–319. doi: 10.1002/mds.25657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Shoulson I, Young AB. Milestones in Huntington's disease. Movement Disorders. 2011;26:1127–1133. doi: 10.1002/mds.23685. [DOI] [PubMed] [Google Scholar]
  • 32.Henderson R, Keiding N. Individual survival time prediction using statistical models. Journal of Medical Ethics. 2005;31:703–706. doi: 10.1136/jme.2005.012427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Kolde R, Laur S, Adler P, Vilo J. Robust rank regression for gene list integration and meta-analysis. Bioinformatics. 2012;28:573–580. doi: 10.1093/bioinformatics/btr709. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp Appendix2

Figure A1. Harrell's C and Cox regression weights for PREDICT-HD follow-up visits.

Figure A2. Estimated baseline hazards based on the pooled data.

Table A1. Extended model selection results. Harrell's C, C with inverse censoring weights, integrated Brier score, mean standardized rank, and p-value of aggregated rank. Trimmed means among sites are reported (20% trimming).

RESOURCES