Although machine‐learning (ML) approaches have been extensively utilized in neurodegenerative conditions, they can be challenging to implement in motor neuron diseases (MNDs) due to disease‐specific characteristics. The potential of ML algorithms has been explored by academic amyotrophic lateral sclerosis (ALS) studies, but they have not been developed into viable clinical applications to date. ALS studies traditionally conduct "group‐level" analyses to describe phenotype‐ or genotype‐associated clinical traits, survival characteristics, progression rates, biomarker profiles, and imaging signatures [1, 2, 3, 4]. These, although academically interesting, have limited utility for the interpretation of data from single individuals. The appeal of ML frameworks in a condition with considerable clinical heterogeneity, such as ALS, is the opportunity to categorize individual patients into clinically relevant subgroups. The long‐awaited transition from "group‐level" descriptive analyses to precision, "individual‐subject" data interpretation has been fueled by the emergence of large training datasets, in the form of purpose‐designed data repositories, national registries, or leftover data from clinical trials. Harnessing the availability of such data sources, a multitude of promising ML studies have been published demonstrating the prospect of accurately classifying a single individual into relevant diagnostic or prognostic subgroups [5].
There are important lessons to consider from early ML initiatives in ALS. Irrespective of the specific ML model implemented, cohort size for model training is crucial, which is one of the biggest challenges in ALS in contrast to more common neurodegenerative conditions. A considerable shortcoming of single‐centre ML studies is the lack of external validation, which coupled with small training datasets, increases the risk of model overfitting to local data. Binary classification studies merely categorizing individuals into "ALS" versus "healthy control" groups have limited practical appeal, as the diagnostic dilemma in the clinical setting is not whether an individual is healthy, but rather whether the constellation of findings represents incipient ALS or an alternative neurodegenerative or neuromuscular condition. Multiclass classification studies mirror real‐life clinical scenarios better, especially if multiple MND phenotypes are represented. The accurate categorization of an early stage upper motor neuron‐predominant case into "ALS" versus "probable PLS," for example, is hugely important due to the survival ramifications of the correct diagnosis [6, 7]. Another stereotyped caveat of ML studies in ALS is model validation on cohorts with long symptom duration. The categorization of patients with long symptom duration with considerable disability and marked biomarker changes is not ideal to test model accuracy. A more compelling validation of a model is whether early stage patients or asymptomatic gene‐carriers are accurately categorized into prospective diagnostic and prognostic groups based on peridiagnostic or presymptomatic biomarker profiles [8].
The critical appraisal of published ML studies in MND helps to outline desirable future study designs. Models should ideally be validated on external datasets; the choice of ML model should be determined by inherent data characteristics (missing data, number of features, etc.); multiclass classification models should be implemented preferably with disease‐mimics, disease‐controls, and several MND phenotypes; categorization beyond diagnostic groups into prognostic categories has additional clinical utility; and the implementation of several ML models on the same dataset may help to juxtapose the comparative efficiency of proposed models. The interrogation of quantitative biomarker panels (serum, cerebrospinal fluid [CSF], imaging) may support clinical decision‐making independently [9, 10]. An alternative strategy is the interpretation of demographic and clinical variables in ML models [11], which has a number of practical advantages compared to relying on instrumental metrics (magnetic resonance, positron emission tomography, CSF); data collection is easily harmonized across multiple sites, data acquisition is relatively cheap, data transfer is logistically simple, et cetera. Core clinical variables are typically already recorded, so with the appropriate approvals in place, extant data may potentially be used retrospectively for model training.
In this issue of European Journal of Neurology, Gromicho et al. from the University of Lisbon, Portugal present a particularly innovative ML study [12]. The authors implement dynamic Bayesian networks (DBNs) to evaluate the influence of the most commonly recorded clinical variables on disease progression in ALS. DBNs model variable dependencies that evolve over time and are trained upon multi‐time point observations. The five key determinants of disease progression according to the authors are symptom duration at first consultation, body mass index at diagnosis, subscores 1 (speech) and 9 (stairs) of the revised Amyotrophic Lateral Sclerosis Functional Rating Scale, and maximum expiratory pressure. The pragmatic relevance of identifying key determinants of progression rate is that patients entering clinical trials may be informedly stratified so that ensuing "slow progression" is not intuitively attributed to a putative drug effect, and that "fast progression" is not automatically regarded as failure to respond to therapy.
Despite its practical pitfalls, ML is one of the most exciting frontiers of ALS research, and it is gaining considerable momentum thanks to the increased availability of large datasets, multicentre data harmonization efforts, and dedicated international consortia. These developments offer unparalleled opportunities for model optimization and validation, paving the way for viable clinical applications.
AUTHOR CONTRIBUTIONS
Peter Bede: Conceptualization (equal); writing – original draft (equal). Kai Ming Chang: Conceptualization (equal); writing – original draft (equal). Ee Ling Tan: Conceptualization (equal); writing – original draft (equal).
CONFLICT OF INTEREST
None of the authors has any conflict of interest to disclose.
ACKNOWLEDGMENTS
P.B. is supported by the Health Research Board (HRB EIA‐2017‐019 & JPND‐Cofund‐2‐2019‐1), the Irish Institute of Clinical Neuroscience, the Spastic Paraplegia Foundation, the EU Joint Programme–Neurodegenerative Disease Research, the Andrew Lydon scholarship, and the Iris O'Brien Foundation. Open access funding provided by IReL.
DATA AVAILABILITY STATEMENT
This is an editorial commentary and not based on a specific dataset.
REFERENCES
- 1. Burke T, Elamin M, Bede P, et al. Discordant performance on the 'Reading the mind in the Eyes' test, based on disease onset in amyotrophic lateral sclerosis. Amyotroph Lateral Scler Frontotemporal Degener. 2016;17(7–8):467‐472. [DOI] [PubMed] [Google Scholar]
- 2. Christidi F, Karavasilis E, Rentzos M, et al. Hippocampal pathology in amyotrophic lateral sclerosis: selective vulnerability of subfields and their associated projections. Neurobiol Aging. 2019;84:178‐188. [DOI] [PubMed] [Google Scholar]
- 3. Abidi M, de Marco G, Couillandre A, et al. Adaptive functional reorganization in amyotrophic lateral sclerosis: coexisting degenerative and compensatory changes. Eur J Neurol. 2020;27(1):121‐128. [DOI] [PubMed] [Google Scholar]
- 4. El Mendili MM, Querin G, Bede P, Pradat PF. Spinal cord imaging in amyotrophic lateral sclerosis: historical concepts‐novel techniques. Front Neurol. 2019;10:350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Grollemund V, Chat GL, Secchi‐Buhour MS, et al. Development and validation of a 1‐year survival prognosis estimation model for amyotrophic lateral sclerosis using manifold learning algorithm UMAP. Sci Rep. 2020;10(1):13378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Finegan E, Chipika RH, Li Hi Shing S, et al. The clinical and radiological profile of primary lateral sclerosis: a population‐based study. J Neurol. 2019;266(11):2718‐2733. [DOI] [PubMed] [Google Scholar]
- 7. Bede P, Chipika RH, Finegan E, et al. Brainstem pathology in amyotrophic lateral sclerosis and primary lateral sclerosis: a longitudinal neuroimaging study. NeuroImage Clin. 2019;24:102054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Querin G, Bede P, El Mendili MM, et al. Presymptomatic spinal cord pathology in c9orf72 mutation carriers: a longitudinal neuroimaging study. Ann Neurol. 2019;86(2):158‐167. [DOI] [PubMed] [Google Scholar]
- 9. Blasco H, Patin F, Descat A, et al. A pharmaco‐metabolomics approach in a clinical trial of ALS: identification of predictive markers of progression. PLoS One. 2018;13(6):e0198116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Bede P, Murad A, Hardiman O. Pathological neural networks and artificial neural networks in ALS: diagnostic classification based on pathognomonic neuroimaging features. J Neurol. 2021;269:2440‐2452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Grollemund V, Le Chat G, Secchi‐Buhour MS, et al. Manifold learning for amyotrophic lateral sclerosis functional loss assessment: development and validation of a prognosis model. J Neurol. 2021;268(3):825‐850. [DOI] [PubMed] [Google Scholar]
- 12. Gromicho M, Leão T, Oliveira Santos M, et al. Dynamic Bayesian networks for stratification of disease progression in amyotrophic lateral sclerosis. Eur J Neurol. 2022;29(8):2201‐2210. doi: 10.1111/ene.15357. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
This is an editorial commentary and not based on a specific dataset.