Skip to main content
Nature Portfolio logoLink to Nature Portfolio
. 2022 Sep 8;2(9):605–616. doi: 10.1038/s43588-022-00299-w

Identifying patterns in amyotrophic lateral sclerosis progression from sparse longitudinal data

Divya Ramamoorthy 1, Kristen Severson 2, Soumya Ghosh 2, Karen Sachs 1,3; Answer ALS, Jonathan D Glass 4, Christina N Fournier 4; Pooled Resource Open-Access ALS Clinical Trials Consortium; ALS/MND Natural History Consortium, Todd M Herrington 5,6, James D Berry 5, Kenney Ng 2, Ernest Fraenkel 1,
PMCID: PMC10766562  PMID: 38177466

Abstract

The clinical presentation of amyotrophic lateral sclerosis (ALS), a fatal neurodegenerative disease, varies widely across patients, making it challenging to determine if potential therapeutics slow progression. We sought to determine whether there were common patterns of disease progression that could aid in the design and analysis of clinical trials. We developed an approach based on a mixture of Gaussian processes to identify clusters of patients sharing similar disease progression patterns, modeling their average trajectories and the variability in each cluster. We show that ALS progression is frequently nonlinear, with periods of stable disease preceded or followed by rapid decline. We also show that our approach can be extended to Alzheimer’s and Parkinson’s diseases. Our results advance the characterization of disease progression of ALS and provide a flexible modeling approach that can be applied to other progressive diseases.

Subject terms: Translational research, Computational models, Amyotrophic lateral sclerosis


A machine learning approach is presented to identify dominant patterns in disease progression in amyotrophic lateral sclerosis (ALS), Alzheimer’s disease and Parkinson’s disease. The nonlinearity of ALS progression has important clinical implications.

Main

Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease with a complex pathophysiology resulting in heterogeneous symptoms and progression1,2. The median length of survival from symptom onset is approximately three years; however, some individuals survive decades with the disease3. Longitudinal functional clinical metrics have gained widespread use as a tool to measure ALS progression. These clinical metrics, such as the Revised ALS Functional Rating Scale (ALSFRS-R), are a proxy for disease progression. However, they are imperfect measures. Individuals are often evaluated at different stages of their disease, making comparisons across patients challenging4. While people with ALS invariably decline over time, some of the measures can increase for short durations or reach plateaus5. Additionally, the metrics of clinical disease progression are based on subjective assessments of patients’ daily functioning, such as the ability to climb stairs ‘normally’ or ‘slowly’, which introduces a potential source of error6. Furthermore, interventions such as percutaneous endoscopic gastronomy and non-invasive ventilation can affect these clinical metrics. The interconnectedness of function and the variability in the measurement of these clinical metrics present challenges in modeling ALS progression.

The heterogeneity of ALS makes it difficult to determine if a disease-modifying therapy is effectively slowing progression7,8. Traditional modeling approaches have dealt with the complexities in ALS by first assuming that ALS outcome measures, particularly the ALSFRS-R, progress in a linear fashion911. Many ALS clinical trials use changes in the linear slope of ALSFRS-R over time or changes in ALSFRS-R from baseline as primary endpoints1214. For example, edaravone was approved in the USA on the basis of a 2.5 ALSFRS-R point difference in decline between the treatment and control arms over 6 months (ref. 14), and the estimated effect from the ALS sodium phenylbutyrate–taurursodiol clinical trial was a change in slope of 0.42 points per month12. Large global crowdsourcing analyses designed to produce better models for clinical trials have also assumed a linear decline in ALSFRS-R15,16.

Despite the widespread use of linear models in predicting patient progression, there is evidence that ALS progression can be nonlinear and can vary across disease severity1719. Several approaches have been proposed to deal with nonlinearity in the context of clinical trials. For example, nonlinear parametric models, which assume a particular shape of the trajectory in advance, have been used to capture these complexities. One notable example is the D50 model, which represents the progression of ALS with a two-parameter sigmoid20,21. However, by requiring a particular parametric form, these models are restricted to identifying prespecified trajectory shapes17,18,2224, which may not represent the actual heterogeneity in disease progression patterns. Models such as the mixed model for repeated measures can be used in conjunction with unstructured time and covariance structures that reduce reliance on parametric assumptions; however, these models can suffer from being statistically underpowered, especially in clinical cohorts with sparse longitudinal data25.

Less attention has been paid to a more fundamental question: are there common patterns of clinical progression in ALS? If such patterns exist, they could be used to improve patient stratification, which can impact clinical trial planning8,26. Defining distinct clusters can also enable research aiming to identify disease mechanisms that contribute to modulating disease progression in ALS. Heterogeneity in the clinical progression of ALS may reflect environmental or genetic modifiers of disease, and robustly characterizing heterogeneity in progression patterns can aid in the search for these modifiers. Current clustering efforts for disease progression patterns are limited by requiring parametric assumptions in their cluster assignments or outcome measures15,19,24. There is a need for computational methods that can flexibly identify patient clusters with minimal assumptions.

To model the full complexity of ALS progression, we turned to computational methods that are more flexible than traditional parametric models. We propose a framework for aggregating patient trajectories into clusters using Gaussian processes27,28 and determining the number of clusters using a Dirichlet process mixture model29,30. We also modify this mixture of Gaussian processes model to incorporate prior clinical knowledge. For example, since patients with ALS are expected to decline over time, we incorporate monotonic biases into our model, which encourage declining trajectories to be identified but also allow for the detection of patterns that do not fit prior expectations.

We show that this method can improve the characterization of ALS progression patterns, identifying clusters of participants with similar trajectories from longitudinal clinical scores. The nonlinear progression patterns are robust to sparse data and consistent across study populations, and correspond to survival outcomes. While we focus on clinical ALS outcome measures, we also demonstrate that the method can analyze Alzheimer’s and Parkinson’s data. Our results provide an advance in modeling progression patterns in ALS and other diseases.

Results

Modeling approach

We developed a mixture of Gaussian processes model with strong inductive bias towards monotonic decline (MoGP) to characterize patterns in disease progression (Extended Data Fig. 1). The model leverages two Bayesian non-parametric methods: Gaussian process regression27,28 and Dirichlet process clustering29,30. Gaussian process regression does not require the specification of a particular functional form, but instead learns trajectories from data, enabling the model to capture a wide variety of possibly nonlinear progression patterns. Dirichlet process clustering can be used to identify clusters from data when it is difficult to specify an expected number of clusters a priori. The use of Dirichlet processes is motivated by the uncertainty in the existence and number of ALS progression subtypes and avoids restrictive modeling assumptions.

Extended Data Fig. 1. Model Workflow.

Extended Data Fig. 1

Input, training, and optimization of the Mixture of Gaussian Processes model.

Our approach includes notable improvements over previous MoGP models28,31,32, in that we incorporate clinical knowledge relevant to ALS progression. Specifically, we implement a monotonic inductive bias as well as clinically informed parameter priors for Gaussian process regression and Dirichlet process clustering components. Each component of the model is discussed in more detail in Methods.

Elucidating ALS disease progression trajectory patterns

We sought longitudinal ALSFRS-R scores from a wide range of sources. The model was evaluated on five study populations (Supplementary Tables 1 and 2). Three observational studies were used, Answer ALS (AALS) (https://www.answerals.org/), the Emory ALS Clinic database (EMORY)3 and the ALS/MND Natural History Consortium database (NATHIST)33, and two overlapping clinical trial datasets, the Pooled Resource Open-Access ALS Clinical Trials (PRO-ACT)34,35 and the Clinical Trial of Ceftriaxone in ALS (CEFT)13,36.

To characterize patterns in ALS progression, we first applied MoGP to PRO-ACT, which is the largest publicly available dataset of ALSFRS-R scores (Fig. 1 and Extended Data Fig. 2). The analysis identified diverse clusters, including some clusters identifying slow-progression populations (Fig. 1n) and others capturing faster-progression groups (Fig. 1r).

Fig. 1. Identifying trajectory clusters with varying patterns of decline, using a mixture of Gaussian processes model.

Fig. 1

The 24 largest clusters (out of 92) from PRO-ACT are shown. The first-year slope is calculated as the difference between 48 and the mean cluster score 1 yr after symptom onset, divided by the time from symptom onset. n indicates the number of ALS patients in each cluster. The shaded area indicates the 0.95 confidence interval.

Extended Data Fig. 2. Clusters spanning 90% of all individuals in PROACT.

Extended Data Fig. 2

The first year slope is calculated as the difference between 48 and the mean cluster score one year after symptom onset, divided by the time from symptom onset. N indicates the number of ALS patients in each cluster. Shaded area indicates 0.95 confidence interval.

Notably, in many cases, the patterns of decline were highly nonlinear, with some following sigmoidal (Fig. 1d,k), convex (Fig. 1m,u,v) and concave (Fig. 1o,q) curves. Linear patterns were also detected in some clusters (Fig. 1g,j,t). To estimate how well a linear model fit in the first year generalizes to subsequent timepoints, we computed the slope of the mean function of each cluster in the first year after symptom onset (‘first-year slope’). This is relevant to previous studies that utilize a first-year slope calculation, including previous ALS DREAM Challenges15,16. While this first-year slope closely reflects actual trajectories for the linear clusters (Fig. 1g,j,t), for others it is either an overestimation (Fig. 1i,k,l,o,x) or an underestimation (Fig. 1s,v), indicating nonlinearity in the trajectory pattern. These errors in the first-year estimations can be large; for instance, it overestimates the disease trajectory in cluster K by 24.20 ALSFRS-R points and underestimates the trajectory in cluster V by 9.48 ALSFRS-R points when both are evaluated 3 yr after symptom onset. This diversity highlights the complexity of progression trajectories in ALS. Analysis of other study populations (Extended Data Figs. 3–6) also revealed many clusters that were highly nonlinear.

Extended Data Fig. 3. Clusters spanning 90% of all individuals in AALS.

Extended Data Fig. 3

The first year slope is calculated as the difference between 48 and the mean cluster score one year after symptom onset, divided by the time from symptom onset. N indicates the number of ALS patients in each cluster. Shaded area indicates 0.95 confidence interval.

Identifying nonlinear patterns across heterogeneous studies

We compared the performance of MoGP against two other approaches, which assume that progression is linear. A standard model in the field is to calculate linear slopes fit to patient data with an onset anchor37 (slope model: SM). The slope model is fit to each patient separately and does not identify clusters. We also benchmarked our model against a mixture of Gaussian processes model with a linear kernel (linear kernel model: LKM). The LKM retains the ability to cluster trajectories using a Dirichlet process but does not allow for nonlinear functions, allowing us to separate the contribution of clustering and the assumption of linearity in our models.

For all study populations, the error was lower in the MoGP model than in the LKM and SM (Fig. 2a). Across the populations, using the MoGP reduced error by more than one ALSFRS-R point as compared with the LKM for at least 27.16% of participants; at least 8.33% of patients have an improvement in accuracy greater than two ALSFRS-R points (Supplementary Table 3). Importantly, the error of the MoGP was lower even though the LKM used a larger number of clusters to model the data (Supplementary Table 6). It is also notable that the MoGP, which identified clusters as large as 88 participants, was able to match or outperform the patient-specific SM (Fig. 2a and Supplementary Table 4), which would have been expected to markedly outperform MoGP if substantial nonlinear structure did not exist in the data. The results are replicated across the five different datasets, suggesting that complex nonlinearity is a common feature of ALS progression and is not a unique feature of a single dataset.

Fig. 2. Estimating nonlinearity of trajectories.

Fig. 2

a, Cumulative distribution function (CDF) of root mean squared error (RMSE) between a participant’s predicted cluster membership and cluster model mean. P values calculated with two-sided Kolmogorov–Smirnov two-sample tests between MoGP and LKM distributions, and between MoGP and SM distributions. b, A subset of nonlinear clusters from PRO-ACT visualized; n indicates the number of ALS patients per cluster. The shaded area indicates the 0.95 confidence interval.

Source data

The clusters with the most substantial nonlinearity often followed sigmoidal trajectory patterns, with varying inflection points (Fig. 2b). In some of these clusters, patients had slow progression for a period of time, followed by a consistent sharp decline. This pattern of progression appears consistent with a sudden loss of ability to carry out functions that we refer to as a ‘functional cliff’. In other cases, the pattern is more consistent with a rapid period of decline followed by a slower phase. Since there are many settings in which patient-specific parametric models are very useful, we compared our model with a patient-specific sigmoidal model (SG)20. Somewhat surprisingly, despite the fact that the MoGP models groups of patients, rather than individuals, MoGP outperforms a patient-specific sigmoid model by one or more ALSFRS-R points for 4.20–9.43% of patients across the studies (Supplementary Table 5). This indicates that, while a sigmoidal model captures much of the nonlinearity, it does not represent the full complexity of progression patterns.

MoGP clusters varied considerably in their rates of progression and the stability of their progression patterns. MoGP enables the characterization of each of these properties through the mean function slope and kernel function length-scale parameters respectively, both of which are learned and optimized through the training process. The model provides estimates for each of these parameters, and these can be used to approximate similarity between clusters depending on the desired clustering property (Extended Data Fig. 7).

Extended Data Fig. 7. Dominant ALS progression patterns, identified using length-scale and negative mean function slope.

Extended Data Fig. 7

Length-scale indicates trajectory stability; negative mean function slope corresponds to rate of progression. Learned model parameters from the PRO-ACT reference model are k-means clustered (Left plot; k=6, marker size corresponds to cluster size), with clusters ≥ N=5 visualized, and percentage of individuals that fall within each of the trajectory patterns are labeled (Right plots).

Source data

Clustering trajectories on the basis of the optimized slope and length-scale parameters reveals interesting patterns (Extended Data Fig. 7). The dominant clinical progression patterns in ALS are sigmoidal fast progression (Extended Data Fig. 7b, 17.48% of individuals), stable slow progression (Extended Data Fig. 7e, 17.38%), unstable slow progression (Extended Data Fig. 7f, 32.98%) and unstable medium progression (Extended Data Fig. 7d, 30.82%). As might have been expected, some types of progression were associated with specific sites of onset. Clusters with fast sigmoidal progression have the highest percentage of individuals with bulbar onset (30.14% of individuals), while those with stable slow progression have the highest percentage of individuals with limb onset (76.97%) (Supplementary Table 8).

Overall, the MoGP model promotes the ability to learn these complex disease progression trajectories better than currently used clinical models, while stratifying patients to reveal common patterns of disease.

Evaluating the robustness of the clusters to sparse data

As clinical data for ALS patients are often incomplete or sparse, we sought to evaluate MoGP performance in these settings. We tested robustness using PRO-ACT, which is the largest of our sources and is a compendium of data from several clinical trials. We also tested robustness using data from CEFT, which is a small clinical cohort within PRO-ACT that may be more representative of common clinical settings. We compared MoGP’s performance against LKM and SM.

We first evaluated the model’s ability to recreate randomly withheld data points (‘interpolation’). Across all interpolated tests for PRO-ACT, we found that the clusters identified by MoGP had lower reconstruction error than the LKM (Fig. 3a, P ≤ 1 × 10−4), and a lower error than the SM when 50% and 75% of training data are included (Fig. 3a, P ≤ 1 × 103). These trends persisted when compared with CEFT (Fig. 3c).

Fig. 3. Evaluating robustness of cluster assignments with sparse datasets.

Fig. 3

a,c, MoGP, LKM and SM were trained on interpolated data and RMSE was calculated between withheld data and the mean predicted trajectory. b,d, Models were trained on right-censored data. P values were calculated with a Wilcoxon signed-rank one-sided test. The box plot represents the interquartile range around the mean; whiskers indicate the proportion (1.5) of the interquartile range past the low and high quartiles to extend the plot whiskers. Points outside the whisker range represent outlier samples. Number of patients evaluated: a, n = 1,327 patients for all comparisons; b, 0.25 yr, n = 2,786; 0.5 yr, n = 2,465; 1 yr, n = 1,379; 1.5 yr, n = 261; 2 yr, n = 135; c, n = 228 for all comparisons; d, 0.25 yr, n = 453; 0.5 yr, n = 437; 1 yr, n = 323; 1.5 yr, n = 215; 2 yr, n = 130.

Source data

One of the most common uses for trajectory modeling is to predict future ALSFRS-R scores. We therefore evaluated the model’s ability to predict future ALSFRS-R scores for patients with right-censored data (‘prediction’). In clinical trials, these predictions are often made with the SM. For PRO-ACT, when only three or six months of data from baseline were provided, the SM and LMK were the most accurate (Fig. 3b). However, when one or more years of training data were provided, the MoGP model outperformed the LKM and SM (Fig. 3b, P ≤ 1 × 10−2, except for 1.5 yr, where P = 1.34 × 101 for SM), and more accurately predicted future disease progression by more than 0.22, 0.41 and 1.28 ALSFRS-R points at 1, 1.5 and 2 yr respectively. This trend was strengthened in CEFT, in which six months of training data were sufficient to see an improvement in progression forecasting (Fig. 3d, P ≤ 1 × 101).

For the majority of comparisons, the MoGP identified fewer clusters per mixture model than the SM or LKM, indicating that the lower reconstruction error was not due to overfitting of the cluster assignments (Supplementary Fig. 1).

Transferring trajectories across study populations

Because ALS is heterogeneous and characteristics of study populations can differ considerably, it is important to test whether trajectory models capture patterns that are consistent across populations. To answer this question, we trained MoGP on a large database (‘reference model’) and used it to predict patient trajectories in other study populations that varied in data collection frequency and follow-up period. We benchmarked the MoGP results against models in which both the test and training sets were derived from the same study population (‘study-specific models’). The study-specific models allow us to evaluate possible overfitting of the reference model. If the reference model was overfit, we would expect it to have a much higher error than the study-specific models.

We found that the reference model, trained on PRO-ACT, demonstrated strong performance on external datasets, indicating that the trajectory clusters are not overfit to the reference model data (Fig. 4a). Importantly, we found that for all test datasets the reference model outperformed the study-specific models (Fig. 4b, P = 0.0312 for AALS, CEFT, EMORY; P = 0.0625 for NATHIST). AALS had the lowest error when the reference model was used (2.16 ALSFRS-R points), followed by CEFT (2.25), EMORY (2.32) and NATHIST (2.59) (Fig. 4b). These errors were similar to the baseline error (1.88 ALSFRS-R points) when the reference model was tested on the held-out data from PRO-ACT, the study on which it was trained. We additionally benchmarked the model against a reference model for which the cluster labels were randomly shuffled. This randomized control had a mean error of 11.74 ALSFRS-R points, which was much higher than the errors of the reference models. Given that CEFT is a subset of PRO-ACT, it is interesting that the CEFT study-specific model had a higher error than its reference model counterpart; these results suggest that the larger size of the PRO-ACT dataset may allow it to capture trajectories more accurately. The reference model’s ability to outperform all of the study-specific models is strong evidence that the trajectory patterns identified by MoGP are transferable across ALS study populations.

Fig. 4. Assessing trajectory consistency across datasets.

Fig. 4

a, The reference model was trained on PRO-ACT and used to predict progression trajectories of participants in other datasets; the four largest reference model clusters are shown. b, Average test error between cluster mean function and participant ALSFRS-R scores, using the reference model and study-specific models. P values were calculated with a Wilcoxon signed-rank one-sided test. The error bars show the 0.95 confidence interval around the mean. N = 5 test–train splits.

Source data

Corresponding survival outcomes with trajectory clusters

Next, we evaluated if the MoGP clusters, which were trained only on ALSFRS-R data, were able to reflect the duration of patient survival from symptom onset to death. The results of the Kaplan–Meier analysis are presented in Fig. 5. Some clusters (Fig. 5c,e) reflected longer survival durations, with very few deaths recorded, while other clusters reflected shorter durations. For example, cluster D had a median survival of 2.90 yr from symptom onset, corresponding to faster progression (Fig. 5d,i). Of all pairwise combinations of clusters, 63.40% corresponded to differential survival outcomes when MoGP was used. By contrast, when LKM was used 50.99% of pairwise combinations of clusters corresponded to differential survival outcomes (P < 0.05). These results demonstrate that incorporating nonlinearity improves the correspondence of clusters to survival outcomes and provides evidence that these progression clusters are clinically relevant.

Fig. 5. Survival outcomes for trajectory clusters.

Fig. 5

ak, The five largest PRO-ACT clusters are shown with MoGP clusters (ae) and associated Kaplan–Meier survival curves (fj). n indicates the number of ALS patients in each cluster. The shaded area indicates the 0.95 confidence interval. k, The number of individuals at risk, censored and with recorded deaths observed at each time displayed.

Characterizing patterns of decline in alternative ALS measures

In addition to ALSFRS-R scores, there are other important clinical metrics that can be used to monitor ALS disease progression. One is forced vital capacity, which is a spirometer-based measure of lung function and has been used as an indicator of survival and disease progression38. Furthermore, while the ALSFRS-R total is commonly used as an aggregate measure, its component subscores measuring fine motor, gross motor, bulbar and respiratory function can also be analyzed to identify subscore-specific patterns. When we applied MoGP to forced vital capacity and ALSFRS-R subscores from PRO-ACT, we saw that the nonlinearity persisted in these domains. The nonlinear trajectories were particularly pronounced for forced vital capacity and bulbar function (Fig. 6).

Fig. 6. MoGP trajectory patterns for secondary endpoints of ALS disease progression.

Fig. 6

Measures include a, forced vital capacity and b, ALSFRS-R subscores (fine motor, gross motor, bulbar, respiratory domains). Trajectory colors for each panel are unrelated and correspond to the relative number of participants in each cluster, scaled from the largest to the smallest of the five largest clusters from PRO-ACT. Participants with minimal change in score (≤1 point) were excluded from the model. n indicates the number of ALS patients in each cluster.

A key advance of this work is the identification of clusters of patients, which can be used to investigate genetic or environmental causes that may underlie ALS progression. For instance, the C9orf72 repeat expansion is the most common of the known causes of ALS, and it is associated with faster-progression ALS, as indicated by reduced survival39,40. However, even among patients who share this common genetic cause of ALS, there is some evidence of uncharacterized heterogeneity in ALS progression patterns41. As an example use-case, we asked if MoGP can be used to stratify patients who carry this repeat expansion. We analyzed data from AALS, a study population with both clinical and molecular data available. The patients with the C9orf72 repeat expansion did not correspond to a single cluster, supporting the hypothesis of heterogeneous progression within this group. As more data accumulate, such analyses could aid in the search for genetic or environmental variables that modify the aggressiveness of C9orf72.

Revealing patterns in Alzheimer’s and Parkinson’s endpoints

The MoGP approach can be applied to functional rating scales that are widely used in other neurodegenerative diseases. We applied MoGP to the Alzheimer’s Disease Assessment Scale—Cognitive Subscale (ADAS-Cog-13 (refs. 42,43). The model showed a range of disease progression patterns, with varying severities of progression (Extended Data Fig. 8a and Supplementary Fig. 5). The majority of the largest clusters showed linear trajectories, in which the first-year slope appropriately captured later progression; clusters E and H, while largely linear, deviate from the first-year slope, showing counterexamples to this trend. It is noteworthy that the clusters varied substantially in the rates of conversion of mild cognitive impairment (MCI) to Alzheimer’s disease. Ninety percent of those in cluster F had an MCI diagnosis at baseline, compared with 5.26% in cluster G (Supplementary Table 9).

Extended Data Fig. 8. Identifying progression clusters from Alzheimer’s and Parkinson’s clinical measures.

Extended Data Fig. 8

Eight largest clusters are visualized. N indicates number of individuals in each cluster. The first year slope is calculated as: (mean cluster at one year – mean cluster score at initial value), divided by the time from the initial value.

Similarly to ALS and Alzheimer’s disease, Parkinson’s disease is heterogeneous in its symptom presentation and progression, which creates challenges in therapeutic discovery. Unlike ALS and Alzheimer’s disease, there are widely used medications for Parkinson’s disease that can provide symptomatic relief, although they do not slow or stop the progression of the disease44. We characterized patterns in motor decline by applying MoGP to Part III of the Movement Disorder Society—Unified Rating Scale (MDS-UPDRS)45 using only data from the ‘off state’, that is, when not affected by medications. MoGP identified a number of progression trajectories (Extended Data Fig. 8b and Supplementary Fig. 6), with some showing stability of motor scores (clusters C, F), while others showed clear motor function decline (clusters A, B, D).

Interesting trends emerged from this analysis. Over 90% of individuals in clusters with an unstable slow progression pattern (Supplementary Fig. 8 and Supplementary Table 10) had tremor-dominant (TD)46 Parkinson’s disease, as opposed to postural instability/gait difficulty (PIGD). In contrast with previous studies of the linearity of MDS-UPDRS scores47, our results also point to nonlinear complexity in some clusters (clusters C, E, G) (Extended Data Fig. 8b). These analyses demonstrate that MoGP’s flexibility enables it to characterize long-term heterogeneity in time-series metrics in a number of diverse clinical settings.

Discussion

The improved performance of an MoGP model over the slope and linear kernel models indicates that linear models are insufficient to capture the heterogeneity in ALS disease progression. While some patients do indeed have linear trajectories, a substantial portion of patients have nonlinear trajectories. Our work also finds that, while a simple parametric nonlinear model—a two-parameter sigmoid—is better than a linear model, it still fails to capture the full range of patient trajectory patterns, motivating the use of non-parametric models that can capture both linear and nonlinear trajectories.

Previous work has suggested that the functional cliff patterns seen here may be a result of inconsistencies in the ALSFRS-R or issues related to the ordinal scale used in ALSFRS-R as opposed to a linearly weighted interval scale48,49. However, the consistency of MoGP-identified patterns across different study populations suggests that the patterns are not the result of deficiencies in the ALSFRS-R. Critically, we also observed nonlinear patterns in vital capacity scores, which are measured independently of ALSFRS-R scores. These findings support the view that nonlinearity is reflective of changes in patient function and not problems in measurement. These findings also have implications for the analysis of clinical trials, many of which use ALSFRS-R and vital capacity metrics as primary or secondary endpoints. In many trajectory clusters, functional cliffs or sigmoidal patterns in disease progression may obscure the detection of therapeutic efficacy if linear models are used. Our results support accounting for nonlinearity when evaluating ALS clinical trial efficacy, with particular salience for clinical trials that are 1 yr in duration or longer.

Our work also demonstrates how existing clinical databases in ALS can be leveraged to enable the characterization of disease progression in sparse datasets from different study populations. A MoGP model trained on the PRO-ACT database accurately predicted trajectories for clinical datasets from AALS, EMORY, CEFT and NATHIST datasets. The transferability of MoGP-identified clusters across the datasets indicates that the trajectory cluster patterns are robust to batch effects due to clinician or site differences, and may reflect underlying disease processes. One of the properties of the Dirichlet process model underlying MoGP clustering is that it will naturally scale the number of identified clusters within a given dataset depending on the number of samples in that dataset; we can use a reference model on a clinical cohort of any size. This non-parametric property of the model underlies the difference in the total number of clusters found in the varying datasets. Conversely, when it is useful to analyze fewer clusters the trajectories can easily be grouped together on the basis of their mean slope and length scale, revealing dominant modes of disease progression. The identification of these clusters creates an opportunity to search for molecular, environmental or other factors that may modify disease progression.

As in many clinical studies, the datasets and therefore the progression patterns in this analysis are influenced by both selection bias and attrition bias. Selection bias refers to the sample of the population that is included in each study. Studies such as AALS, which require enrollment and consent to undergo additional monitoring, tend to be biased towards slower-progressing ALS. The EMORY dataset, which has a high percentage of enrollment from the clinic, is likely to be more reflective of a clinical population, although it reflects a group of patients with higher rates of progression on average. Overall though, observational studies tend to have less standardized frequencies of data collection and sparser measurements. On the flip side, clinical trial datasets typically collect extensive longitudinal data, but because of enrollment criteria can be skewed towards faster-progression individuals. The variation in ages of onset and prevalence of sites of onset differ across clinical cohorts, which can indicate additional potential selection biases. Other variables that can be used to evaluate selection bias but were partially missing or unavailable across our studies include diagnostic delay, forced vital capacity, frontotemporal dementia and C9orf72 status. Attrition bias also plays a strong role in ALS datasets, given the rapid pace of disease progression, with patient monitoring becoming increasingly difficult in late-stage disease; this bias may particularly affect the tail end of the identified trajectory patterns. Given the large sample size in our study, and the consistency of the patterns across datasets, we expect that we are sampling the clinical population as broadly as possible, although future work will involve determining the extent to which these trajectories remain consistent in new datasets.

Ultimately, by identifying clusters of patients who have similar disease progression trajectories, these models could be used to identify molecular correlates that may be associated with ALS progression subtypes. While this work focuses on ALSFRS-R and vital capacity, the field of ALS has identified a growing number of molecular biomarkers and clinical metrics in which progression is poorly understood50,51. This paper points to the complexity of disease progression in ALS and the necessity of more accurately accounting for heterogeneous trajectory patterns in clinical trial models and research studies.

Methods

Study populations

ALS data in this study were collected from five cohorts: PRO-ACT, CEFT, AALS, EMORY and NATHIST (Supplementary Tables 1 and 2). All scores used for this analysis are clinician reported. The populations varied in size, with PRO-ACT having the largest total number of participants (2,923 participants with at least three ALSFRS-R visits recorded). The populations differed in the median number of months followed (between 11 and 17 months) and the median frequency of clinical visits (between four and nine visits). The median slope between the populations also varied, with CEFT and EMORY having the fastest-progression populations (−0.84 and −0.89 ALSFRS-R points/month, respectively), and AALS having the slowest-progression population (−0.55 ALSFRS-R points/month). CEFT had a median of 16.80 months of follow-up, while PRO-ACT had a median of 11.95 months, indicating that CEFT participants likely comprise some of the longest subject records in PRO-ACT. The differences between the populations allowed us to measure the robustness of our model to data collection methods, frequency of clinical visits and duration of follow-up.

Modeling approach

We characterize disease progression in ALS using a framework for identifying trajectory patterns from longitudinal data. While previous work on disease progression modeling has focused on patient-specific prediction models16,52,53, a critical advance of this work is the characterization of distinct and large trajectory clusters. Furthermore, we provide a principled approach to characterizing the shapes of disease progression patterns in ALS, which leverages Bayesian non-parametric methods to minimize the number of assumptions that are required for regression models. We show that this method can flexibly be applied to a number of functional clinical measures for progressive diseases. Each component of the model is detailed further below. Further details, including the mathematical specification of the model, can be found in Supplementary notes.

The modeling approach of clustering over temporal progression patterns has been shown to improve the characterization of disease progression in other conditions. For example, Peterson et al. demonstrated the use of an autoregressive Gaussian process model for predicting metrics of Alzheimer’s progression54; however, the model made a fundamentally different assumption about the structure of the data—that there is a single global progression type, and that each patient follows a noisy version of this global progression type—which is an assumption that does not capture the full heterogeneity of ALS phenotypes. Furthermore, the model requires fixed time intervals of visits, which are not available in many clinical ALS datasets54. Zhao et al. present a related clustering approach in multiple sclerosis, although their model relies heavily on prior domain knowledge on how to group patients into subgroups, which has not as yet been clearly defined in ALS55. Other models, such as additive Gaussian process regression56, can be used to characterize patterns in time-series data, although they lack the ability to stratify patients into disease subtypes.

Gaussian process regression

Gaussian process regression allows the identification of nonlinear trajectory patterns while making minimal assumptions about the shape of the trajectory functions27,28. A Gaussian process is specified by a mean function and a covariance kernel. Because we expect ALS trajectories to be smooth functions with no discontinuities, our MoGP model uses a squared exponential kernel. The squared exponential kernel has two parameters: the signal variance, which determines the average distance of the function from the mean, and the length scale, which specifies the smoothness of the function. Each of these parameters is determined during the learning phase using the training data.

Monotonic inductive bias

Because ALS trajectories are expected to decline over time, we use a negative linear function in the Gaussian process models of MoGP. To further encourage declining trajectories, we modify the Dirichlet process clustering algorithm, such that an individual can only be placed in a cluster if their score at their initial visit is not substantially higher than the mean function of the current cluster at that point. We also impute an onset-anchor value, a maximum score of a clinical metric assigned to the date corresponding to symptom onset, which has been previously shown to improve prediction in ALS trajectories37.

Dirichlet process clustering

Dirichlet process mixtures29,30 can be used to identify clusters in data without needing to specify an expected number of clusters in advance. This unsupervised learning model begins by assuming that an infinite number of clusters can exist, and then narrows its prediction to a limited number of components best supported by the observed data. In our case, each mixture component is a function drawn from a Gaussian process. The resulting model clusters patient trajectories by probabilistically assigning them to those components that best explain them. The number of patients in each cluster is also learned from the model, and clusters can differ in size from each other. Through this data-driven approach, the algorithm can learn clusters of ALS patients who share disease progression patterns. The method can also predict the cluster membership and the disease progression pattern of a participant not included in the model, and provide an estimate of the confidence of this prediction.

Model evaluation

Evaluating trajectory nonlinearity

We evaluated how generalizable a linear model trained in the first year of disease progression is to subsequent data points, by calculating an anchored first-year slope. This was computed as the following: (48 − cluster mean function at 1 yr)/time from symptom onset. Anchoring indicates that a score of 48 (the maximum of the ALSFRS-R scale) is imputed at the time of symptom onset37. We compared our MoGP model against two benchmark linear models: an anchored slope model (SM), which is patient specific, and a mixture of Gaussian processes model with a linear kernel (LKM), which clusters patients using a linear parametric model. Additionally, to evaluate the extent to which a nonlinear parametric model represents ALS progression, we compared our model against a patient-specific two-parameter sigmoidal model (SG)20.

Because we are proposing the use of a data-driven model, we aimed to be as conservative as possible in removing patients from the dataset so as to not introduce additional selection bias. For this analysis, participants were excluded from the model if fewer than three complete ALSFRS-R visits were recorded, the first visit was more than 7 yr from symptom onset or an increase of more than six points in ALSFRS-R between consecutive visits was recorded (Supplementary Table 1). The ALSFRS-R is an updated version of the previously used ALSFRS metric, and includes additional questions measuring dyspnea, orthopnea and respiratory insufficiency6. The ALSFRS-R measure was used here because it is the current standard in clinical trial analysis1214. Seven years was selected as the point at which longitudinal data became sparse. Six ALSFRS-R points was selected because a jump such as this was unlikely to be seen unless there was a data-entry error.

For each model, the RMSE between a participant’s measured scores and their predicted cluster mean function were calculated. The RMSE was compared between the models; a lower RMSE indicates reduced error in that model and better model performance.

Robustness to sparse data

We simulated sparsity by withholding data and assessed the model’s ability to perform two tasks: (1) interpolation of ALSFRS-R scores for a patient with randomly withheld data points, and (2) forecasting future ALSFRS-R disease progression for patients with right-censored data. We tested this using PRO-ACT and CEFT. To have sufficient longitudinal measurements, for interpolation experiments we only included participants with ten or more longitudinal ALSFRS-R visits, and for prediction experiments we only included participants with four or more visits (Supplementary Table 7).

The reconstruction error for each participant was calculated using the RMSE between the original withheld data points and predicted values from the mean function for the participant’s trajectory cluster. This was done across all interpolated tests, in which 25%, 50% and 75% of clinic visits per patient were provided as training data, with selections randomly interspersed across visits. We additionally evaluated the ability of MoGP to predict future progression by using right-censored data with varying numbers of training data (including visits within 0.25, 0.5, 1, 1.5 and 2 yr since baseline visit).

Model generalizability

To evaluate whether clusters derived from one study population could be used to model external study populations, we trained a reference model and evaluated the transferability of this model to unseen ALS patient data. We predicted the cluster membership for each participant, and calculated the RMSE between the participant ALSFRS-R scores and the mean function of their predicted cluster.

We split all of our study populations into test and training datasets (60% train, 40% test; repeated across five randomly split test–train datasets). For our reference MoGP model, we used the training data from PRO-ACT, which was chosen because it contained the largest number of samples and is publicly available. For AALS, EMORY, CEFT and NATHIST, we used the training data from each study to train a separate model (study-specific model). For each study’s remaining test data, we predicted the trajectory function using the reference model and the study-specific model. To approximate the minimum error expected, we calculate the reconstruction error when the reference model is applied to the test set from PRO-ACT, the same study on which it is trained (‘baseline error’). We also benchmark against the error when cluster labels on the reference model are randomly shuffled (‘random cluster assignment’).

Relationship to alternative outcome measures

We calculated the Kaplan–Meier survival probability curves for the largest MoGP clusters identified from PRO-ACT. If no death was recorded, the participant was marked as censored using the latest date of a recorded ALSFRS-R score.

We trained MoGP models on forced vital capacity percentages (calculated as the maximum of three trials) and ALSFRS-R subscores (fine motor, gross motor, respiratory and bulbar domains). A maximum score of 100% was used for the forced vital capacity percentage model, and a maximum score of 12 was used for ALSFRS-R subscores.

We applied our method to ADAS-Cog-13 (refs. 42,43) from the Alzheimer’s Disease Neuroimaging Initiative (ADNI57). Individuals with a confirmed Alzheimer’s disease diagnosis at any point of the data collection were included in the model; this also included individuals who began the study with MCI and then converted to an Alzheimer’s disease diagnosis. To ensure sufficient longitudinal data, individuals with fewer than three longitudinal visits were excluded, with a total of 331 individuals included in the model. The correlation between the learned clusters and MCI to Alzheimer’s disease conversion was then calculated.

We also applied our method to the MDS-UPDRS45 scale from the Parkinson’s Progression Markers Initiative dataset58. In contrast to ALS and AD, for Parkinson’s disease there are medications that can mitigate symptoms although not long-term progression of the disease44. Because we were interested in characterizing progression patterns when not affected by medications, we focused on measurements of the MDS-UPDRS Part III in the off state, which is defined as either before the initiation of medication or after abstaining from medication for at least 12 h. Individuals with fewer than three longitudinal off-medication scores or a first visit more than 10 yr from symptom onset were excluded, with a total of 397 individuals included in the model. We calculated the correlation between the clusters and Parkinson’s disease subtypes of PIGD and TD, with the designation of PIGD/TD calculated following the method previously described by Stebbins et al.46. For the purpose of analyzing the Parkinson’s disease subtype correlation with cluster membership, we focused on individuals with a stable PIGD/TD designation (one that does not change over the course of the disease).

Statistics and reproducibility

To compare the cumulative distribution function of the RMSE between a participant’s predicted cluster membership and cluster model mean, P values were calculated with Kolmogorov–Smirnov two-sample two-sided tests. For interpolation and prediction experiments, to determine if a model error had decreased between the LKM or SM and the MoGP, a Wilcoxon signed-rank one-sided test was used. To assess trajectory consistency between reference models and study-specific models, a Wilcoxon signed-rank one-sided test was used. To calculate survival curves, a Kaplan–Meier estimator was used, with P values calculated via the logrank test with FDR correction. P values for cluster correlations were calculated using a hypergeometric test.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Supplementary information

Supplementary Information (2.9MB, pdf)

Supplementary Tables 1–10, Figs. 1–8, Algorithm 1, notes and Acknowledgements.

Reporting Summary (76.8KB, pdf)
Peer Review File (3.5MB, pdf)

Source data

Source Data Fig. 2 (462.7KB, xlsx)

Statistical source data.

Source Data Fig. 3 (2.5MB, xlsx)

Statistical source data.

Source Data Fig. 4 (12.6KB, xlsx)

Statistical source data.

Extended Data Fig. 7 (16.3KB, xlsx)

Statistical source data.

Acknowledgements

Data used in the preparation of this article were obtained from the PRO-ACT database, the ALS/MND Natural History Consortium, the Parkinson’s Progression Markers Initiative database and the ADNI database. This research includes the National Institute of Neurologic Disease and Stroke’s Archived Clinical Research data (Clinical Trial of Ceftriaxone in ALS, M. Cudkowicz, Massachusetts General Hospital) obtained from the NINDS Archived Clinical Research Datasets webpage. Additional information about the studies can be found in Supplementary Acknowledgements. The Answer ALS organization, ALS Finding a Cure and Packard Foundation supported the collection of the Answer ALS clinical dataset used in the manuscript. The Muscular Dystrophy Association contributed funding to the Emory ALS Clinic database that was included in this research. C.N.F. received funding from the Department of Veterans Affairs of Research and Development (IK2CX001595-02) and the Department of Defense (AL200156). K. Sachs received funding from the Muscular Dystrophy Association (award 574137). D.R. received funding from the NSF Gradate Research Fellowship Program (GRFP) and Siebel Scholars Fellowship. E.F. and D.R. received funding from Answer ALS, MIT–IBM Watson AI Lab (W1771646), the United States Army Medical Research Acquisition Activity (W81XWH-21-1-0245) and NIH (U54NS091046). T.M.H. received funding from the NIH/NINDS (K23NS099380). None of the organizations had any influence on the writing of the manuscript or the decision to submit it for publication.

Extended data

Extended Data Fig. 4. Clusters spanning 90% of all individuals in CEFT.

Extended Data Fig. 4

The first year slope is calculated as the difference between 48 and the mean cluster score one year after symptom onset, divided by the time from symptom onset. N indicates the number of ALS patients in each cluster. Shaded area indicates 0.95 confidence interval.

Extended Data Fig. 5. Clusters spanning 90% of all individuals in EMORY.

Extended Data Fig. 5

The first year slope is calculated as the difference between 48 and the mean cluster score one year after symptom onset, divided by the time from symptom onset. N indicates the number of ALS patients in each cluster. Shaded area indicates 0.95 confidence interval.

Extended Data Fig. 6. Clusters spanning 90% of all individuals in NATHIST.

Extended Data Fig. 6

The first year slope is calculated as the difference between 48 and the mean cluster score one year after symptom onset, divided by the time from symptom onset. N indicates the number of ALS patients in each cluster. Shaded area indicates 0.95 confidence interval.

Author contributions

D.R., K. Severson and S.G. contributed to model development and analyzed data. D.R., K. Severson, K.N. and E.F. contributed to project design. D.R. wrote the manuscript with input and revisions from all authors.

Peer review

Peer review information

Nature Computational Science thanks Mamede de Carvalho, Cassie S. Mitchell and Henk-Jan Westeneng for their contribution to the peer review of this work. This article has been peer reviewed as part of Springer Nature’s Guided Open Access initiative. Primary Handling Editor: Ananya Rastogi, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.

Data availability

A pretrained reference model for this study can be downloaded here: http://fraenkel.mit.edu/mogp

Source Data for Figs. 24 and Extended Data Fig. 7 are available with this manuscript. Source Data for Fig. 1 and Extended Data Fig. 2 are available as a Python object from http://fraenkel.mit.edu/mogp. Other source data are unavailable at this time because they contain patient-level clinical data; however, all figures can be generated using the code provided, after downloading the datasets listed below.

Clinical data for this study can be obtained from the following sources.

AALS (ClinicalTrials.gov identifier NCT02574390) is available for download in the Answer ALS data portal (data.answerals.org). PRO-ACT can be downloaded from the PRO-ACT database (https://nctu.partners.org/ProACT). CEFT (ClinicalTrials.gov identifier NCT00349622) can be downloaded from National Institute of Neurological Disorders and Stroke (NINDS) (https://www.ninds.nih.gov/Current-Research/Research-Funded-NINDS/Clinical-Research/Archived-Clinical-Research-Datasets). EMORY is restricted access at this time due to containing information that could compromise patient privacy, but available with permission from Dr. Jonathan Glass (jglas03@emory.edu) for legitimate research. Response to requests will be provided within two weeks, all data provided will be fully de-identified, a DUA will need to be established and the source data will need to be acknowledged in any publications. NATHIST is available from the ALS/MND Natural History Consortium (https://www.data4cures.org/requestingdata) with a summary of proposed data use, data elements requested and publication intent. The Parkinson’s Progression Markers Initiative can be downloaded, with a data use agreement, online application and compliance with publication policy (https://www.ppmi-info.org/access-data-specimens/download-data). Applications for data access are reviewed by the Data and Publications Committee within one week of receipt. ADNI can be downloaded through the LONI Image and Data Archive (https://adni.loni.usc.edu/data-samples/access-data/#access_data). Access is contingent on adherence to the ADNI Data Use Agreement and its publication policies. The application process includes the acceptance of a data use agreement and submission of an online application form. The application must include the investigator’s institutional affiliation and the proposed uses of the ADNI data. ADNI data may not be used for commercial products or redistributed in any way.

Code availability

We provide the Python code for the MoGP framework as well as a pretrained reference model that researchers can use to generate predictions of cluster membership and trajectory function from input patient data. We also provide a pip-installable Python package associated with this work (mogp). All code used for data processing, modeling and figure generation can be found at https://github.com/fraenkel-lab/mogp. Code is also deposited on Zenodo (license BSD 3-Clause; 10.5281/zenodo.6744399)59.

Competing interests

K.N., K. Severson and S.G. were employed by IBM Research during this project. K. Sachs consults for Modulo Bio Inc.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A list of authors and their affiliations appears at the end of the paper.

Contributor Information

Ernest Fraenkel, Email: Fraenkel-admin@mit.edu.

Answer ALS:

Emily G. Baxi, Alyssa N. Coyne, Elizabeth Mosmiller, Lindsey Hayes, Aianna Cerezo, Omar Ahmad, Promit Roy, Steven Zeiler, John W. Krakauer, Jonathan Li, Aneesh Donde, Nhan Huynh, Miriam Adam, Brook T. Wassie, Alex Lenail, Natasha Leanna Patel-Murray, Yogindra Raghav, Karen Sachs, Velina Kozareva, Stanislav Tsitkov, Tobias Ehrenberger, Julia A. Kaye, Leandro Lima, Stacia Wyman, Edward Vertudes, Naufa Amirani, Krishna Raja, Reuben Thomas, Ryan G. Lim, Ricardo Miramontes, Jie Wu, Vineet Vaibhav, Andrea Matlock, Vidya Venkatraman, Ronald Holewenski, Niveda Sundararaman, Rakhi Pandey, Danica-Mae Manalo, Aaron Frank, Loren Ornelas, Lindsey Panther, Emilda Gomez, Erick Galvez, Daniel Perez, Imara Meepe, Susan Lei, Louis Pinedo, Chunyan Liu, Ruby Moran, Dhruv Sareen, Barry Landin, Carla Agurto, Guillermo Cecchi, Raquel Norel, Sara Thrower, Sarah Luppino, Alanna Farrar, Lindsay Pothier, Hong Yu, Ervin Sinani, Prasha Vigneswaran, Alexander V. Sherman, S. Michelle Farr, Berhan Mandefro, Hannah Trost, Maria G. Banuelos, Veronica Garcia, Michael Workman, Richie Ho, Robert Baloh, Jennifer Roggenbuck, Matthew B. Harms, Carolyn Prina, Sarah Heintzman, Stephen Kolb, Jennifer Stocksdale, Keona Wang, Todd Morgan, Daragh Heitzman, Arish Jamil, Jennifer Jockel-Balsarotti, Elizabeth Karanja, Jesse Markway, Molly McCallum, Tim Miller, Ben Joslin, Deniz Alibazoglu, Senda Ajroud-Driss, Jay C. Beavers, Mary Bellard, Elizabeth Bruce, Nicholas Maragakis, Merit E. Cudkowicz, James Berry, Terri Thompson, Steven Finkbeiner, Leslie M. Thompson, Jennifer E. Van Eyk, Clive N. Svendsen, and Jeffrey D. Rothstein

Pooled Resource Open-Access ALS Clinical Trials Consortium:

Alexander Sherman

ALS/MND Natural History Consortium:

Christian Lunetta, David Walk, Ghazala Hayat, James Wymer, Kelly Gwathmey, Nicholas Olney, Senda Ajroud-Driss, Terry Heiman-Patterson, Ximena Arcila-Londono, Kenneth Faulconer, Ervin Sanani, Alex Berger, and Julia Mirochnick

Extended data

is available for this paper at 10.1038/s43588-022-00299-w.

Supplementary information

The online version contains supplementary material available at 10.1038/s43588-022-00299-w.

References

  • 1.Brown RH, Al-Chalabi A. Amyotrophic lateral sclerosis. N. Engl. J. Med. 2017;377:162–172. doi: 10.1056/NEJMra1603471. [DOI] [PubMed] [Google Scholar]
  • 2.Mandrioli J, et al. Heterogeneity in ALSFRS-R decline and survival: a population-based study in Italy. Neurol Sci. 2015;36:2243–2252. doi: 10.1007/s10072-015-2343-6. [DOI] [PubMed] [Google Scholar]
  • 3.Traxinger K, Kelly C, Johnson BA, Lyles RH, Glass JD. Prognosis and epidemiology of amyotrophic lateral sclerosis. Neurol. Clin. Pract. 2013;3:313–320. doi: 10.1212/CPJ.0b013e3182a1b8ab. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Proudfoot M, Jones A, Talbot K, Al-Chalabi A, Turner MR. The ALSFRS as an outcome measure in therapeutic trials and its relationship to symptom onset. Amyotroph. Lateral Scler. Frontotemporal Degener. 2016;17:414–425. doi: 10.3109/21678421.2016.1140786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bedlack RS, et al. How common are ALS plateaus and reversals? Neurology. 2016;86:808–812. doi: 10.1212/WNL.0000000000002251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Cedarbaum JM, et al. The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function. J. Neurol. Sci. 1999;169:13–21. doi: 10.1016/S0022-510X(99)00210-5. [DOI] [PubMed] [Google Scholar]
  • 7.Goyal NA, et al. Addressing heterogeneity in amyotrophic lateral sclerosis CLINICAL TRIALS. Muscle Nerve. 2020;62:156–166. doi: 10.1002/mus.26801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kiernan MC, et al. Improving clinical trial outcomes in amyotrophic lateral sclerosis. Nat. Rev. Neurol. 2021;17:104–118. doi: 10.1038/s41582-020-00434-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Armon C, et al. Linear estimates of disease progression predict survival in patients with amyotrophic lateral sclerosis. Muscle Nerve. 2000;23:874–882. doi: 10.1002/(SICI)1097-4598(200006)23:6&#x0003c;874::AID-MUS5&#x0003e;3.0.CO;2-U. [DOI] [PubMed] [Google Scholar]
  • 10.Elamin M, et al. Predicting prognosis in amyotrophic lateral sclerosis: a simple algorithm. J. Neurol. 2015;262:1447–1454. doi: 10.1007/s00415-015-7731-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Labra J, Menon P, Byth K, Morrison S, Vucic S. Rate of disease progression: a prognostic biomarker in ALS. J. Neurol. Neurosurg. Psychiatry. 2016;87:628–632. doi: 10.1136/jnnp-2015-310998. [DOI] [PubMed] [Google Scholar]
  • 12.Paganoni S, et al. Trial of sodium phenylbutyrate–taurursodiol for amyotrophic lateral sclerosis. N. Engl. J. Med. 2020;383:919–930. doi: 10.1056/NEJMoa1916945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Cudkowicz ME, et al. Safety and efficacy of ceftriaxone for amyotrophic lateral sclerosis: a multi-stage, randomised, double-blind, placebo-controlled trial. Lancet Neurol. 2014;13:1083–1091. doi: 10.1016/S1474-4422(14)70222-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Abe K, et al. Safety and efficacy of edaravone in well defined patients with amyotrophic lateral sclerosis: a randomised, double-blind, placebo-controlled trial. Lancet Neurol. 2017;16:505–512. doi: 10.1016/S1474-4422(17)30115-1. [DOI] [PubMed] [Google Scholar]
  • 15.Kueffner R, et al. Stratification of amyotrophic lateral sclerosis patients: a crowdsourcing approach. Sci. Rep. 2019;9:690. doi: 10.1038/s41598-018-36873-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Küffner R, et al. Crowdsourced analysis of clinical trial data to predict amyotrophic lateral sclerosis progression. Nat. Biotechnol. 2015;33:51–57. doi: 10.1038/nbt.3051. [DOI] [PubMed] [Google Scholar]
  • 17.Gordon PH, et al. Progression in ALS is not linear but is curvilinear. J. Neurol. 2010;257:1713–1717. doi: 10.1007/s00415-010-5609-1. [DOI] [PubMed] [Google Scholar]
  • 18.Thakore NJ, Lapin BR, Pioro EP, Pooled Resource Open-Access ALS Clinical Trials Consortium Trajectories of impairment in amyotrophic lateral sclerosis: insights from the Pooled Resource Open-Access ALS Clinical Trials cohort. Muscle Nerve. 2018;57:937–945. doi: 10.1002/mus.26042. [DOI] [PubMed] [Google Scholar]
  • 19.Ackrivo J, et al. Classifying patients with amyotrophic lateral sclerosis by changes in FVC. A group-based trajectory analysis. Am. J. Respir. Crit. Care Med. 2019;200:1513–1521. doi: 10.1164/rccm.201902-0344OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Poesen K, et al. Neurofilament markers for ALS correlate with extent of upper and lower motor neuron disease. Neurology. 2017;88:2302–2309. doi: 10.1212/WNL.0000000000004029. [DOI] [PubMed] [Google Scholar]
  • 21.Steinbach R, et al. Applying the D50 disease progression model to gray and white matter pathology in amyotrophic lateral sclerosis. NeuroImage Clin. 2020;25:102094. doi: 10.1016/j.nicl.2019.102094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Gomeni R, Fava M. Consortium TPROAACT. Amyotrophic lateral sclerosis disease progression model. Amyotroph. Lateral Scler. Frontotemporal Degener. 2014;15:119–129. doi: 10.3109/21678421.2013.838970. [DOI] [PubMed] [Google Scholar]
  • 23.Ong ML, Tan PF, Holbrook JD. Predicting functional decline and survival in amyotrophic lateral sclerosis. PLoS ONE. 2017;12:e0174925. doi: 10.1371/journal.pone.0174925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Tang M, et al. Model-based and model-free techniques for amyotrophic lateral sclerosis diagnostic prediction and patient clustering. Neuroinformatics. 2019;17:407–421. doi: 10.1007/s12021-018-9406-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Bell ML, Rabe BA. The mixed model for repeated measures for cluster randomized trials: a simulation study investigating bias and type I error with missing continuous data. Trials. 2020;21:148. doi: 10.1186/s13063-020-4114-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Berry JD, et al. Improved stratification of ALS clinical trials using predicted survival. Ann. Clin. Transl. Neurol. 2018;5:474–485. doi: 10.1002/acn3.550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Rasmussen, C. E. & Williams, C. K. I. Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) (MIT Press, 2005).
  • 28.Rasmussen CE, Ghahramani Z. Infinite Mixtures of Gaussian Process Experts. Adv. Neural Inf. Process. Syst. 2002;14:881–888. [Google Scholar]
  • 29.Lo AY. On a class of Bayesian nonparametric estimates: I. Density estimates. Ann. Stat. 1984;12:351–357.. doi: 10.1214/aos/1176346412. [DOI] [Google Scholar]
  • 30.Escobar MD, West M. Bayesian density estimation and inference using mixtures. J. Am. Stat. Assoc. 1995;90:577–588. doi: 10.1080/01621459.1995.10476550. [DOI] [Google Scholar]
  • 31.Tresp V. Mixtures of Gaussian processes. Adv. Neural Inf. Process. Syst. 2000;13:7. [Google Scholar]
  • 32.Ross JC, Dy JG. Nonparametric mixture of Gaussian processes with constraints. Proc. Mach. Learn. Res. 2013;28:1346–1354. [Google Scholar]
  • 33.Center for Innovation and Bioinformatics (CIB) (ALS/MND Natural History Consortium, accessed 9 April 2022); https://www.data4cures.org/natural-history-consortium
  • 34.Atassi N, et al. The PRO-ACT database. Neurology. 2014;83:1719–1725. doi: 10.1212/WNL.0000000000000951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Pooled Resource Open-Access ALS Clinical Trials Database (PRO-ACT) Data Sets (ALS Association and Neurological Clinical Research Institute, accessed 4 December 2020); https://ncri1.partners.org/ProACT/Data/Index/1
  • 36.MD MEC. Clinical trial ceftriaxone in subjects with amyotrophic lateral sclerosis (ALS). ClinicalTrials.gov https://clinicaltrials.gov/ct2/show/NCT00349622 (Accessed 7 April 2022).
  • 37.Karanevich A, He J, Gajewski BJ. Using an anchor to improve linear predictions with application to predicting disease progression. Rev. Colomb. Estad. 2018;41:137–155. doi: 10.15446/rce.v41n2.68535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Czaplinski A, Yen AA, Appel SH. Forced vital capacity (FVC) as an indicator of survival and disease progression in an ALS clinic population. J. Neurol. Neurosurg. Psychiatry. 2006;77:390–392. doi: 10.1136/jnnp.2005.072660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Umoh ME, et al. Comparative analysis of C9orf72 and sporadic disease in an ALS clinic population. Neurology. 2016;87:1024–1030. doi: 10.1212/WNL.0000000000003067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Irwin DJ, et al. Cognitive decline and reduced survival in C9orf72 expansion frontotemporal degeneration and amyotrophic lateral sclerosis. J. Neurol. Neurosurg. Psychiatry. 2013;84:163–169. doi: 10.1136/jnnp-2012-303507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Floeter MK, et al. Disease progression in C9orf72 mutation carriers. Neurology. 2017;89:234–241. doi: 10.1212/WNL.0000000000004115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kueper JK, Speechley M, Montero-Odasso M. The Alzheimer’s Disease Assessment Scale—Cognitive Subscale (ADAS-Cog): modifications and responsiveness in pre-dementia populations. A narrative review. J. Alzheimeras Dis. 2018;63:423–444. doi: 10.3233/JAD-170991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Mohs RC, et al. Development of cognitive instruments for use in clinical trials of antidementia drugs: additions to the Alzheimer’s Disease Assessment Scale that broaden its scope. The Alzheimer’s Disease Cooperative Study. Alzheimer Dis. Assoc. Disord. 1997;11:S13–S21. doi: 10.1097/00002093-199700112-00003. [DOI] [PubMed] [Google Scholar]
  • 44.Armstrong MJ, Okun MS. Diagnosis and treatment of Parkinson disease: a review. J. Am. Med. Assoc. 2020;323:548–560. doi: 10.1001/jama.2019.22360. [DOI] [PubMed] [Google Scholar]
  • 45.Goetz CG, et al. Movement Disorder Society-sponsored revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS): process, format, and clinimetric testing plan. Mov. Disord. 2007;22:41–47. doi: 10.1002/mds.21198. [DOI] [PubMed] [Google Scholar]
  • 46.Stebbins GT, et al. How to identify tremor dominant and postural instability/gait difficulty groups with the movement disorder society unified Parkinson’s disease rating scale: comparison with the unified Parkinson’s disease rating scale. Mov. Disord. 2013;28:668–670. doi: 10.1002/mds.25383. [DOI] [PubMed] [Google Scholar]
  • 47.Holden SK, Finseth T, Sillau SH, Berman BD. Progression of MDS‐UPDRS scores over five years in de novo Parkinson disease from the Parkinson’s Progression Markers Initiative cohort. Mov. Disord. Clin. Pract. 2017;5:47–53. doi: 10.1002/mdc3.12553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Andres PL, et al. Fixed dynamometry is more sensitive than vital capacity or ALS rating scale. Muscle Nerve. 2017;56:710–715. doi: 10.1002/mus.25586. [DOI] [PubMed] [Google Scholar]
  • 49.Fournier CN, et al. Development and validation of the Rasch-Built Overall Amyotrophic Lateral Sclerosis Disability Scale (ROADS) JAMA Neurol. 2020;77:480–488. doi: 10.1001/jamaneurol.2019.4490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Simon NG, et al. Quantifying disease progression in amyotrophic lateral sclerosis. Ann. Neurol. 2014;76:643–657. doi: 10.1002/ana.24273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Huang F, et al. Longitudinal biomarkers in amyotrophic lateral sclerosis. Ann. Clin. Transl. Neurol. 2020;7:1103–1116. doi: 10.1002/acn3.51078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Taylor AA, et al. Predicting disease progression in amyotrophic lateral sclerosis. Ann. Clin. Transl. Neurol. 2016;3:866–875. doi: 10.1002/acn3.348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Kimura F, et al. Progression rate of ALSFRS-R at time of diagnosis predicts survival time in ALS. Neurology. 2006;66:265–267. doi: 10.1212/01.wnl.0000194316.91908.8a. [DOI] [PubMed] [Google Scholar]
  • 54.Peterson, K., Rudovic, O., Guerrero, R. & Picard, R. W. Personalized Gaussian processes for future prediction of Alzheimer’s disease progression. Preprint at http://arxiv.org/abs/1712.00181 (2018).
  • 55.Zhao, Y., Chitnis, T., Healy, B. C., Dy, J. G. & Brodley, C. E. Domain induced Dirichlet mixture of Gaussian processes: an application to predicting disease progression in multiple sclerosis patients. In 2015 IEEE International Conference on Data Mining (Eds. Aggarwal, C. et al.) 1129–1134 (IEEE, 2015).
  • 56.Cheng L, et al. An additive Gaussian process regression model for interpretable non-parametric analysis of longitudinal data. Nat. Commun. 2019;10:1798. doi: 10.1038/s41467-019-09785-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Weiner MW, et al. Impact of the Alzheimer’s Disease Neuroimaging Initiative, 2004 to 2014. Alzheimer's Dement. 2015;11:865–884. doi: 10.1016/j.jalz.2015.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Parkinson Progression Marker Initiative The Parkinson Progression Marker Initiative (PPMI) Prog. Neurobiol. 2011;95:629–635. doi: 10.1016/j.pneurobio.2011.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.MoGP: Mixture of Gaussian Processes Model. Zenodo10.5281/zenodo.6744399 (2022).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information (2.9MB, pdf)

Supplementary Tables 1–10, Figs. 1–8, Algorithm 1, notes and Acknowledgements.

Reporting Summary (76.8KB, pdf)
Peer Review File (3.5MB, pdf)
Source Data Fig. 2 (462.7KB, xlsx)

Statistical source data.

Source Data Fig. 3 (2.5MB, xlsx)

Statistical source data.

Source Data Fig. 4 (12.6KB, xlsx)

Statistical source data.

Extended Data Fig. 7 (16.3KB, xlsx)

Statistical source data.

Data Availability Statement

A pretrained reference model for this study can be downloaded here: http://fraenkel.mit.edu/mogp

Source Data for Figs. 24 and Extended Data Fig. 7 are available with this manuscript. Source Data for Fig. 1 and Extended Data Fig. 2 are available as a Python object from http://fraenkel.mit.edu/mogp. Other source data are unavailable at this time because they contain patient-level clinical data; however, all figures can be generated using the code provided, after downloading the datasets listed below.

Clinical data for this study can be obtained from the following sources.

AALS (ClinicalTrials.gov identifier NCT02574390) is available for download in the Answer ALS data portal (data.answerals.org). PRO-ACT can be downloaded from the PRO-ACT database (https://nctu.partners.org/ProACT). CEFT (ClinicalTrials.gov identifier NCT00349622) can be downloaded from National Institute of Neurological Disorders and Stroke (NINDS) (https://www.ninds.nih.gov/Current-Research/Research-Funded-NINDS/Clinical-Research/Archived-Clinical-Research-Datasets). EMORY is restricted access at this time due to containing information that could compromise patient privacy, but available with permission from Dr. Jonathan Glass (jglas03@emory.edu) for legitimate research. Response to requests will be provided within two weeks, all data provided will be fully de-identified, a DUA will need to be established and the source data will need to be acknowledged in any publications. NATHIST is available from the ALS/MND Natural History Consortium (https://www.data4cures.org/requestingdata) with a summary of proposed data use, data elements requested and publication intent. The Parkinson’s Progression Markers Initiative can be downloaded, with a data use agreement, online application and compliance with publication policy (https://www.ppmi-info.org/access-data-specimens/download-data). Applications for data access are reviewed by the Data and Publications Committee within one week of receipt. ADNI can be downloaded through the LONI Image and Data Archive (https://adni.loni.usc.edu/data-samples/access-data/#access_data). Access is contingent on adherence to the ADNI Data Use Agreement and its publication policies. The application process includes the acceptance of a data use agreement and submission of an online application form. The application must include the investigator’s institutional affiliation and the proposed uses of the ADNI data. ADNI data may not be used for commercial products or redistributed in any way.

We provide the Python code for the MoGP framework as well as a pretrained reference model that researchers can use to generate predictions of cluster membership and trajectory function from input patient data. We also provide a pip-installable Python package associated with this work (mogp). All code used for data processing, modeling and figure generation can be found at https://github.com/fraenkel-lab/mogp. Code is also deposited on Zenodo (license BSD 3-Clause; 10.5281/zenodo.6744399)59.


Articles from Nature Computational Science are provided here courtesy of Nature Publishing Group

RESOURCES