Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2025 Nov 19;15:40769. doi: 10.1038/s41598-025-24664-7

Analytical and clinical validation of step counting method in people living with amyotrophic lateral sclerosis

Marcin Straczkiewicz 1,, Katherine M Burke 2,3, Kendall T Carney 2, Narghes Calcagno 4, Sravan Mandepudi 2, Alan Premasiri 5, Fernando G Vieira 5, James D Berry 2
PMCID: PMC12630758  PMID: 41258169

Abstract

Accelerometer-based digital measures offer a scalable and low-burden means of quantifying physical function, but existing processing algorithms may not quantify pathological gait correctly. In people living with amyotrophic lateral sclerosis (ALS), where gait patterns are slow, variable, and asymmetric, validated tools to quantify mobility are urgently needed. We proposed a step-counting algorithm designed for ankle-worn accelerometers that leverage wavelet-based decomposition to quantify heel strikes under heterogeneous gait patterns. We validated this method using five datasets comprising healthy individuals and those with ALS in controlled and semi-controlled activities, and we performed clinical validation in a free-living cohort of 305 people with ALS. We tested our method for accuracy in detecting steps and recognizing walking activity. Reference labels used for analytical validation were obtained from annotated studies or video-based ground truth. Step counting accuracy was assessed using Bland–Altman analysis while clinical validity was evaluated by comparing step counts to gross motor functioning on the ALS Functional Rating Scale-Revised (ALSFRS-R). Walking recognition was robust across walking conditions and body types; sensitivity ranged from 0.94 to 0.98, and specificity exceeded 0.95 across all evaluated datasets. The mean step counting bias was minimal (e.g., 0.44 steps), and the 95% limits of agreement were narrow (LoA = [−5.90, 5.40]) relative to reference standards, including video-annotated ground truth. Clinical validation indicated substantial differences between groups with various levels of gait impairment, e.g., participants who reported “walks with assist” on the ALSFRS-R accumulated a mean of 1283 (95% CI 1063, 1503) steps/day, while those reporting “normal” walking covered 3984 (95% CI 3537, 4432) steps/day. Our study covered analytical and clinical validation of a step-counting method developed for ankle-worn accelerometers and demonstrated its applicability to pathological gait. The method provides accurate quantification of walking activity in controlled and free-living environments, supporting its use as a digital endpoint in ALS research.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-025-24664-7.

Keywords: Gait, Assistive device, ALSFRS-R, Digital Health Technologies, Motor neuron disease

Subject terms: Biomarkers, Engineering, Health care, Medical research, Neurology, Neuroscience

Introduction

Accelerometer-based step counts have emerged as a valuable indicator of physical functioning in both clinical and epidemiological research, offering a scalable, low-burden approach to monitoring mobility across diverse populations. Gait speed has been referred to as the “sixth vital sign” due to its strong association with functional status, fall risk, hospitalization, and mortality13. In people living with amyotrophic lateral sclerosis (ALS), a progressive neurodegenerative condition characterized by motor neuron loss, functional decline, and ultimately respiratory failure, capturing ambulatory function is particularly critical. Gait impairment is often among the earliest manifestations of ALS46, and progressively worsens, leading to decreased mobility, increased fall risk, and compounding health complications79.

Despite its clinical relevance, mobility assessment in ALS has traditionally relied on infrequent and coarse-grained outcome measures such as the ALS Functional Rating Scale-Revised (ALSFRS-R), which may fail to detect subtle yet meaningful changes in walking behavior1013. As therapeutics evolve, there is an urgent need for sensitive, responsive, and ecologically valid outcome measures capable of detecting functional improvements and disease progression1416.

Digital health technologies (DHTs), including wearable accelerometers and smartphones, enable continuous, high-frequency assessment of motor function in both clinical and free-living environments1720. Growing evidence supports their utility in ALS and other neurological cohorts. Recent studies demonstrated that wearable-derived gait measures are associated with cognitive impairment and fall risk21, patient-reported gross motor functioning5, and overall disease progression22. However, these studies utilized proprietary methodologies that hinder reproducibility in subsequent investigations. Furthermore, some analyses leveraged solutions developed for healthy populations and wrist-worn devices, which may perform poorly with slower, asymmetric, or variable gait patterns typical to ALS20,2325. An improved solution may be offered by placing the sensor on a lower limb as it appears less sensitive to interruptions in typical limb swings, which can lead to substantial errors when participants begin to use assistive devices26.

In this work, we propose a simple, robust, and open-source step counting algorithm that leverages ankle-worn accelerometer data. The algorithm builds on fundamental features of walking, i.e., periodicity, intensity, and cadence, and is designed to accommodate the heterogeneity, speed fluctuations, and asymmetries characteristic to ALS gait. Because our purpose was to develop an algorithm that quantifies gait in individuals that may use assistive devices, we relied solely on ankle-worn sensors.

We validate the method using several publicly available datasets of healthy adults as well as clinical and free-living data from individuals with ALS, capturing a range of walking speeds, assistive device use, and disease severity. In addition to quantifying steps, we test the accuracy of the method to distinguish walking from other common daily activities that might inflate step counts in free-living observations27. This multi-context validation framework enables analytical and clinical evaluation of algorithmic performance and supports the feasibility of ankle-worn step counting as a sensitive, interpretable, and scalable digital endpoint for mobility in ALS.

Methods

Population, devices, data

The proposed method was validated analytically and clinically using six independent datasets comprising accelerometer data collected from healthy individuals and individuals diagnosed with ALS. Analytic validation was performed on five datasets, Human Physical Activity (SPADES)28, Daily Life Activities (DaLiAc)29, Identification of Walking, Stair Climbing, and Driving using Wearable Accelerometers (IWSCD)30, Pedometer Evaluation Project (PedEval)31, and Endurance Tasks to Measure Performance Fatigue in ALS (PFALS). Three of these datasets (SPADES, DaLiAc, and IWSCD) were used to study walking recognition, while two (PedEval, PFALS) were used to study step counting. The SPADES, DaLiAc, IWSCD, and PedEval datasets were identified in a public domain, while PFALS was collected by a research team at the Neurological Clinical Research Institute and Sean M. Healey & AMG Center for ALS at Massachusetts General Hospital (Boston, MA, USA). Clinical validation was performed on a dataset collected within the ALS Research Collaborative (ARC) led by the ALS Therapy Development Institute (Watertown, MA, USA). The PFALS and ARC datasets are not available in public domain due to privacy constraints, but data access can be requested by investigators (see Data Availability).

The following sections describe available information on the investigated datasets; their summary is provided in Table 1.

Table 1.

Demographics, body measures, and health status of participants involved in datasets included in this study. Age, height, weight, and BMI are provided as range (mean ± SD), when available.

SPADES DaLiAc IWSCD PedEval PFALS ARC
Health status Healthy Healthy Healthy Healthy Healthy (15) / ALS (12) ALS
N 42 19 32 30 27 305
Sex (females) 15 8 19 15 8 / 2 103
Age (years) 18–30 (23.5 ± 3.1) 18–55 (26.5 ± 7.7) 23–52 (39.0 ± 9.0) 19–27 (21.9 ± 5.4) 44–68 (60.3 ± 7.7) / 45–70 (59.9 ± 7.4) 20–78 (55.1 ± 10.6)
Height (cm) 151–180 (174.2 ± 8.5) 158–196 (177.0 ± 11.1) 147–193 (173.5 ± 11.1) 152–193 (171.0 ± 10.8) 158–183 (169.6 ± 6.3) / 152–190 (175.3 ± 11.9) 152–203 (173.6 ± 9.9)
Weight (kg) 51–112 (73.8 ± 15.0) 54–108 (75.2 ± 14.2) 45–140 (77.0 ± 22.9) 43–136 (70.5 ± 17.6) 54–99 (78.2 ± 13.2) / 57–103.9 (85.5 ± 14.9) 29.9–147 (79.1 ± 15.8)
BMI (kg/m2) 18–35 (24.7 ± 4.1) 17–34 (23.9 ± 3.7) 18–40 (25.2 ± 5.6) 17–37 (23.8 ± 3.7) 22–34 (27.7 ± 4.3) / 22–40 (27.9 ± 5.2) 10–41 (26.2 ± 4.5)
Data collection
 Environment Controlled Controlled Semi-controlled Controlled & Semi-controlled Controlled Free-living
 Device ActiGraph GT9X SHIMMER ActiGraph GT3X +  SHIMMER3 APDM Opal ActiGraph GT3X + 
 Body location Left ankle Left ankle Left ankle Left ankle Left foot Left ankle
 Sampling rate 80 204.8 100 15 128 30
Disease characteristics
 ALSFRS-R Q1—Q12 27–42 (34.9 ± 4.1) 12–48 (40.21 ± 5.58)
 ALSFRS-R Q7 2–4 (3.2 ± 0.7) 0–4 (3.31 ± 0.80)
 ALSFRS-R Q8 2–4 (2.5 ± 0.67) 1–4 (2.96 ± 0.88)
 ALSFRS-R Q9 0–4 (1.5 ± 1.4) 0–4 (2.71 ± 1.20)

Accuracy in walking recognition

SPADES

Forty-two participants (age: 23.5 ± 3.1 years; 15 females) performed a variety of activities of daily living, including normal walking, ascending stairs, descending stairs, treadmill walk (1, 2, 3, and 3.5 mph), lying, sitting, standing, reclining, handwriting, computer typing, filling shelves, sweeping, running, cycling, and jumping jacks in supervised (laboratory) conditions. Raw accelerometer data were collected from participants’ ankles using ActiGraph GT9X (Pensacola, Florida) with sampling rate of 80 Hz and a dynamic range of ± 8 g. Data were labelled by a research team member.

DaLiAc

Nineteen participants (age: 26.5 ± 7.7 years; 8 females) performed thirteen activities including different postures (sitting, lying, standing), household activities (washing dishes, vacuuming, sweeping), walking behaviors (normal walking, running, stairs climbing), and sports activities (cycling, rope jumping). Raw accelerometer data were collected from participants’ ankles using the SHIMMER (Dublin, Ireland) with a sampling rate of 204.8 Hz and dynamic range of ± 6 g. A dedicated smartphone app was used to label the beginning and ending time of each activity.

IWSCD

Twenty-four adults (age: 39.0 ± 9.0 years; 19 females) participated in a study to identify walking, stair walking and driving from raw accelerometry data. The study included a walking trial (approximately 1 km) followed by a driving trial (approximately 21 km). The walking trial included walking on a level ground, up and down stairs, and up and down inclined paths. Immediately after the walking period, participants were accompanied to their vehicle, which they drove on a predefined route that included both highway and city driving. The walking trial lasted between 10 and 14 min while the driving trial lasted between 18 and 30 min, depending on traffic. Participants wore ActiGraph GT3X + accelerometer on their left ankle that collected raw accelerometer data with sampling rate of 100 Hz and a dynamic range of ± 6 g.

Accuracy in step counting

PedEval

Thirty participants (age: 21.9 ± 5.4 years, 15 females) performed three prescribed walking tasks: (1) a two-lap stroll along a designated path (continuous walk), (2) a scavenger hunt across four rooms (semi-continuous walk), and (3) a toy-assembling assignment using pieces distributed across a dozen bins located around a room (discontinuous walk). Participants were asked to wear the SHIMMER3 sensor on their left ankle. The sensor collected data with sampling frequency of 100 Hz and a dynamic range of ± 4 g. Step counts were visually assessed and manually annotated by a research team member.

PFALS

The investigated dataset was collected as part of a pilot study of endurance tasks to measure performance fatigue in ALS. We analyzed data from 12 participants (age: 59.9 ± 7.4 years, 2 females) with ALS and 15 healthy controls (age: 60.3 ± 7.7 years, 8 females). In this task, participants were asked to wear the APDM Opal devices (Opal, APDM, Oregon, United States) at their ankles, and walk at about 75% of their maximum walking speed back and forth for up to 20 min along the 10-m course, completing 180 degree turns at the cones positioned at each end. Each trip between the cones counted as one lap. In this study, data from the initial six laps of the trial was evaluated for all participants. Devices recorded continuous triaxial accelerometer measures at a sampling frequency of 128 Hz and a dynamic range of ± 2 g. We also used the APDM-dedicated software, Mobility Lab (version 2.0.0.201903301644), to extract step counts corresponding to each trial. This study was approved by the Institutional Review Board of Massachusetts General Brigham (IRB# 2020P000546).

Accuracy in distinguishing disease status

ARC

We analyzed data from 305 participants (age: 55.1 ± 10.6 years, 103 females) diagnosed with ALS wearing ActiGraph GT3X + accelerometers on their left ankle. Devices recorded continuous triaxial accelerometer measurements at a sampling frequency of 30 Hz with a dynamic range of ± 6 g. In this free-living observation, participants were instructed to wear the devices as much as possible for 7 days excluding bathing.

This study adhered to the guidelines outlined in the Declaration of Helsinki – Ethical Principles for Medical Research Involving Human Subjects. The study protocol was approved by the Institutional Review Board (ADVARRA Center for IRB Intelligence). All participants provided informed consent prior to any study procedures. There was no compensation for participation in the study.

Algorithm description

The proposed step counting method leverages the observation that walking consists of repetitive, oscillatory movements characterized by distinct periodicity and amplitude, particularly when captured by worn accelerometers. These periodic features arise from biomechanical events such as heel strikes, which produce transient, high-amplitude changes in acceleration along the axis aligned with the shin (Fig. 1A). To discriminate walking from other daily activities and to detect individual steps in heterogeneous free-living conditions, we apply a continuous wavelet transform (CWT) to the raw accelerometer signal. This transform decomposes the signal into a time–frequency representation, allowing for the detection of localized periodic events while being robust to signal non-stationarity (Fig. 1B). By focusing on the signal’s vertical component, i.e., where heel strikes are reflected as sharp spikes, our method provides a robust framework for detecting gait in individuals with varying movement patterns and gait abnormalities.

Fig. 1.

Fig. 1

Framework for detecting heel strikes using ankle-worn accelerometers. Signal from vertical axis (A) is decomposed using a continuous wavelet transform which uncovers sharp spikes corresponding to individual heel strikes (B). Local peaks in frequency-wise summation of wavelet coefficients help identify heel strikes locations within the signal (C). Identified locations correspond to heel strikes from one foot (D); assuming another foot making the same number of steps, total step counts are calculated as number of heel strikes multiplied by 2.

In our method, the CWT is implemented using the generalized Morse wavelet, defined as Inline graphic parameterized with a symmetry constant γ = 3, ensuring zero skewness, and a time-bandwidth product P2 = 10, which balances time and frequency localization (Supplementary Fig. 1). This configuration allows the wavelet coefficients to be particularly sensitive to abrupt changes in signal amplitude, such as those produced during heel strikes.

In our approach, wavelet coefficients (product of CWT) are computed across a physiologically relevant frequency range (up to 5Hz), and their values are summed at each time point to enhance the signal-to-noise ratio of periodic gait-related events (Fig. 1C). We identify local peaks in the resulting time series of summed coefficients and classify them as heel strikes if they exceed a fixed threshold set at 0.1 (Fig. 1D).

To prevent spurious peaks (e.g., from isolated stomps or noise) from being misclassified as steps, we apply additional temporal time constraints. Specifically, our algorithm enforces that timestamps Inline graphic of consecutive heel strikes from the same leg must be separated by time intervals Inline graphic with physiologically plausible Inline graphic seconds and Inline graphic seconds, corresponding to a very fast (about 2.35 steps per second) and a very slow walking speed (0.8 steps per second), respectively. Furthermore, the difference between consecutive between-strike intervals Inline graphic should not exceed 0.5 s to further exclude implausible or artifact-driven detections.

In the analytical validation, we focus on two gait-related measures: walking activity time and step counts. Walking activity is identified for all nonsingular heel strikes if Inline graphic seconds. Steps were included in the step count outcome measure if they were classified as walking activity. Assuming another foot making the same number of steps, total step counts were computed as number of detected heel strikes multiplied by 2.

Statistical analysis

All raw accelerometer data were resampled to 10 Hz using linear interpolation prior to analysis to ensure consistent temporal resolution across datasets32. This sampling rate was previously demonstrated as sufficient to capture gait characteristics across various devices and sensor body locations27. Walking periods were identified in physical activity data from three annotated datasets (SPADES, DaLiAc, and IWSCD). To evaluate analytical validity of our method for walking recognition, we compared the algorithm-derived walking segments to reference activity labels provided in the source datasets. Sensitivity and specificity metrics were calculated for each trial, averaged across repeated measures within subject, and then aggregated across participants. Sensitivity was defined as the proportion of correctly identified walking segments among all annotated walking episodes (including level walking, stair climbing, and treadmill walking), while specificity was calculated as the proportion of correctly excluded non-walking segments (e.g., sitting, standing, household tasks). We report sensitivity as mean and 95% confidence intervals (CI).

We assessed the accuracy of step counting using Bland–Altman analyses across two validation settings. Depending on the dataset, ground truth was assessed visually and manually annotated by research team members either by following a walking person (PedEval) or by evaluating video recording (PFALS). In the latter, step counts were assessed as a mean score assessed by two co-authors of this paper (KMB and KTC). We report the accuracy for step counting in terms of mean error (bias) and levels of agreement (LoA). For comparison, a similar analysis was conducted between ground truth step counts and those extracted using proprietary software.

In the clinical validation phase, we utilized data collected over a one-week monitoring period (ARC). Step counts were aggregated on the daily level for days with at least 16 h of wear time, as determined by non-wear detection algorithm proposed by Choi and colleagues33. Our analysis included mean step counts estimated during weeks with at least three days with sufficient wear-time.

These estimates were then compared across ALSFRS-R item response levels (0–4) using a one-way analysis of variance (ANOVA). Our analysis focused on ALSFRS-R gross motor function questions regarding turning in bed and adjusting bed clothes (Q7), walking (Q8), and climbing stairs (Q9). We further performed Tukey–Kramer post-hoc comparisons to examine differences between adjacent response levels, i.e., 4 vs. 3, 3 vs. 2, 2 vs. 1, and 1 vs. 0, where possible.

Statistics on demographics, health descriptions, and performed activities are reported using mean and standard deviations (SD).

Results

The aggregated dataset used to validate the method comprising data from 452 individuals, including 138 healthy individuals and 314 individuals diagnosed with ALS. Data from 93 individuals were used to determine the accuracy of walking recognition, data from 57 (45 healthy, 12 ALS) individuals were used for analytical validation of step counting, while data from 302 individuals were used for clinical validation in ALS.

Altogether we analyzed data from 80,989 h of activity, including over 21 h for analytical validation of walking recognition (8.75 h of walking activities, 12.65 h of non-walking activities), 15.57 h for analytical validation of step counting, and over 80,952 h (about 3373 days) for clinical validation in ALS.

Analytical validation

Walking recognition accuracy

Walking recognition was highly accurate for a wide range of walking activities, such as normal walking (e.g., in SPADES the mean sensitivity was 0.997, CI = [0.995, 0.999]), ascending stairs (0.947 [0.908, 0.987]), descending stairs (0.915 [0.847, 0.983]), or walking on a treadmill at 1 mph (0.983 [0.978, 0.987]) (Fig. 2A). Our method also accurately recognized non-walking activities, including repetitive activities, such as cycling (e.g., in DaLiAc mean specificity was equal 1 [1, 1]) or running (0.981 [0.951, 1]) (Fig. 2B). Our results showed that car driving data collected in IWSCD, often represented as high-frequency vibrations34, was entirely classified as non-walking (1[1, 1]) (Fig. 2C). Decreased specificity was observed for sweeping the floor (in DaLiAc 0.851 [0.815, 0.887]) and vacuuming (0.877 [0.814, 0.940]). However, it is unknown to the authors whether these activities in fact consisted of entirely non-walking movements. Finally, systematically decreased specificity was reported for jumping (in SPADES 0.669 [0.545, 0.792], in DaLiAc 0.852 [0.770, 0.934]).

Fig. 2.

Fig. 2

Sensitivity (blue) and specificity (red) estimated for walking recognition in populations of ambulatory participants investigated in SPADES (A), DaLiAc (B), and IWSCD (C) studies.

Step counting accuracy

Altogether, according to the ground truth, participants included in PedEval performed a mean (SD) of 1045.73 (85.81) steps during continuous, 681.91 (88.37) steps during semi-continuous, and 180.81 (16.65) steps during discontinuous walking activities, respectively. The corresponding estimations calculated using our method were 1048.15 (84.89) steps for continuous, 660.67 (93.45) steps for semi-continuous, and 190.29 (18.29) steps for discontinuous walking activities.

Using Bland–Altman analysis, we determined that the mean bias (LoA) was equal to 0.42 (−11.60, 12.44) or 0.04% for continuous walking (Fig. 3A), −4.33 (−61.81, 53.15) or 0.64% for semi-continuous walking (Fig. 3B), and 1.36 (−44.45, 47.17) or 0.76% for discontinuous walking (Fig. 3C), indicating excellent reliability of the algorithm on a population level.

Fig. 3.

Fig. 3

Bland–Altman plots on step counting accuracy in three walking scenarios investigated in PedEval dataset—continuous walk (A), semi-continuous walk (B), and discontinuous walk (C), and one walking scenario investigated in PFALS dataset (D). In each scenario, ground truth (GT) was assessed by a visual observer. Black dots correspond to observations from healthy individuals, grey dots correspond to observations from individuals diagnosed with ALS.

The mean (SD) ALSFRS-R total score of individuals participating in PFALS dataset was equal to 34.9 (4.1), mean response to Q7 was equal to 3.2 (0.7), Q8 2.5 (0.7), and Q9 1.5 (1.4).

In PFALS, ALS participants and healthy controls performed a similar number of steps, i.e., 1269 and 1249, respectively. In this controlled experiment, our method accurately counted steps regardless of participants’ health conditions. In healthy participants, mean bias (LoA) was equal to -0.75 (-5.06, 3.56) or -1.75%, while in ALS participants mean bias was equal to 1.40 (-4.28, 7.08) or 0.4%. Cumulatively, mean bias (LoA) was equal to 0.44 (54.01, 5.90) or 0.56%, suggesting accurate step counting performance across the two populations (Fig. 3D).

In contrast, an additional analysis indicated a substantial underestimation of step counts extracted using proprietary software. Here the mean bias in healthy participants was equal to 35.60 (13.10, 58.10) or 34.60%, in ALS participants 29.92 (24.16, 35.67) or 28.92%, and cumulatively 33.07 (15.23, 50.92) or 32.07% (Supplementary Fig. 2). These results suggest that step counts produced by a proprietary software need to be studied with caution.

Clinical validation

Clinical validation data were collected between September 2014 and January 2023 on 305 participants diagnosed with ALS. There were 251 participants who met the data inclusions criteria (at least 3 days with a minimum wear-time of 16 h). The mean (SD) data collection period was equal to 13.44 (8.44) days, while step counts were estimated for a mean (SD) of 5.13 (1.89) days. At baseline, study participants reported a mean (SD) ALSFRS-R total score equal to 40.21 (5.58), while their mean (SD) responses to Q7, Q8, and Q9 were equal to 3.31 (0.80), 2.96 (0.88), and 2.76 (1.20), respectively.

Using step counts, we observed statistically significant differences between participants reporting normal gross motor functioning and those reporting impairments in this domain (Fig. 4). For example, participants reporting “normal” gait (Q8) performed a mean of 3984 [3537, 4432] step counts per day, significantly more compared to those reporting “early ambulation difficulties” (3197 [2837, 3557], p = 0.013), and “walking with assistance” (1283 [1063, 1503], p =  < 0.001). Importantly, the algorithm estimated almost no steps for those reporting “nonambulatory functional movement” (81 [12, 150], p =  < 0.001), indicating that it is robust to non-walking activities of daily living.

Fig. 4.

Fig. 4

Mean and 95% CI of step counts stratified by responses to ALSFRS-R questions on gross motor functioning subscale (Q7—Q9) along with significant differences between adjacent groups. Asterisk ‘*’ indicates p-value < 0.05, ‘***’ p-value < 0.001. Values displayed next to the mean step counts indicate group size reporting a given functional status.

Discussion

This study demonstrates the validity and applicability of a step-counting method for quantifying ambulatory activity in individuals with ALS. By leveraging a continuous wavelet transform to detect periodic peaks corresponding to heel strikes, our algorithm offers a robust and interpretable approach for step detection using accelerometers placed on a foot or ankle, body location that appears favorable to capture gait events in individuals using assistive devices (Supplementary Fig. 2 panels B—D). We determined the utility of the method following principles of the V3 framework, which underscores the need for analytical and clinical validation of sensor-based digital health technologies in populations with changing health status35,36.

Our analytical validation was conducted in two ways, (1) focusing on the accuracy of the method to identify periods of walking activity and (2) counting steps within these periods. The walking recognition accuracy was verified using three independent, publicly available datasets. It was determined to be highly sensitive to walking activities and highly specific to most non-walking activities defined by the respective dataset authors. Notably, a marginally decreased specificity was observed for selected household activities, such as vacuuming or sweeping the floor. These activities, however, might include a small number of steps and a certain level of misclassification is warranted. Importantly, the method remained insensitive to activities that might present in the data as repetitive spikes, such as cycling or driving a car.

The proposed approach also demonstrated accurate estimation of step counts both in controlled and semi-controlled assessments. In each investigated scenario, the mean bias was close to zero, while wider levels of agreement were calculated for semi-controlled tasks. This variation may be attributed to complex movement patterns, such as hesitations, pauses, or weight-bearing steps that can produce signal artifacts resembling step-like events. Additionally, the absence of a standardized protocol for specific transitions, such as turning around, navigating obstacles, or climbing stairs, may have introduced variability in recorded signals that contributed to reported misclassification. These factors highlight the inherent challenges of step counting in naturalistic settings where movements often deviate from steady, linear gait patterns.

For clinical validation, we evaluated whether the method provides clinically meaningful insights into mobility patterns in people living with ALS, a population characterized by progressive motor decline and high interindividual variability in gait. Our results show that the method can differentiate step counts between individuals with normal gait, early-stage ambulatory difficulties, and those requiring assistance. This distinction aligns with known clinical trajectories in ALS and demonstrates the method’s ability to detect clinically relevant changes in walking behavior, which is a critical feature for a potential digital outcome measure used in a therapeutic evaluation or clinical trial. According to our knowledge, this is the first study to assess step counts in a large-scale free-living cohort of individuals with ALS. In our previous study (5), we enrolled twenty people living with ALS whose baseline daily step counts were estimated as 1871 [1137, 2605] steps, substantially lower than the step counts reported in this study. The lower step counts in the earlier cohort may reflect participants in a more advanced disease stage, as indicated by a substantially lower average ALSFRS-R total score (mean 31.40) compared to the group reported here (mean 40.21).

Our findings align with prior work which emphasizes that ankle-worn devices are particularly suited for step detection in populations with altered gait patterns, including those with neurodegenerative diseases and requiring assistance. It is worth highlighting that our method detects heel strikes using data collected with a low sampling frequency (10 Hz), in contrast to many commercial algorithms that require higher sampling frequencies to achieve comparable accuracy24,37. This property is especially important in free-living observations of individuals with ALS, where slower gait speeds, the use of assistive devices23, or variable walking surfaces27 may introduce artifacts that hinder approaches relying on time-domain signal analyses, and where employing more sophisticated devices may not be feasible. The clear distinction in step counts observed between participants with early ambulatory impairment and those requiring assistance (3197 [2837, 3557] vs. 1283 [1063, 1503]) underscores the method’s sensitivity to clinically relevant differences in mobility status.

These findings also highlight the potential of accelerometer-derived step counts to serve as a candidate digital endpoint in ALS clinical trials. Unlike the ALSFRS-R, which provides a coarse-grained summary of global function, step counts capture ambulatory activity continuously and at high resolution, enabling greater responsiveness to short-term changes. Establishing interpretability thresholds such as the Minimal Detectable Change (MDC) or Minimally Important Difference (MID) will be crucial to distinguish measurement variability from clinically meaningful change. Defining such thresholds whether through anchor-based analyses linking step count change to patient-reported functional decline, or distribution-based approaches will support the use of step counts in evaluating treatment effects38.

Several limitations should also be acknowledged. First, the method assumes a basic level of periodicity and intensity in the accelerometer signal, which may lead to errors in contexts involving sporadic or irregular stepping, such as turning, standing, or shuffling. This challenge is compounded by the lack of consensus on how to define “walking” in real-world settings, as it remains unclear whether a single step, two steps, or a continuous sequence of steps should be classified as walking27. Second, due to the secondary nature of this analysis and the absence of a standardized turning protocol in the original datasets (e.g., in PFALS), we were unable to assess the algorithm’s performance specifically during transitions such as straight walks versus turning. Third, while the algorithm provided meaningful indications across the included ALS datasets, the free-living observations did not include ground truth step counts, limiting our ability to assess its accuracy under naturalistic conditions. Fourth, cognitive and executive functions, which are increasingly recognized as important determinants of gait performance, were not systematically assessed in our cohorts. Prior studies have shown that cognitive impairment and executive dysfunction influence gait variability in ALS21 as well as in older adults39, and even subtle deficits may impact gait-related endpoints. Because our analyses did not incorporate formal neuropsychological evaluation, we cannot exclude the possibility that variability in cognitive status contributed to differences in step counts or walking recognition accuracy. Future research should explore these limitations but also consider the integration of additional sensor modalities (e.g., gyroscopes, magnetometers) to capture a broader range of gait-related metrics, including stride length, cadence variability, and measures of balance.

Conclusions

The reliance of our method on the fundamental properties of the gait signal, recorded from the ankle, rather than the wrist, supports its applicability across diverse measurement conditions. By validating the algorithm across multiple independent datasets, including those collected in controlled, semi-controlled, and free-living settings, we demonstrated its reproducibility and potential for use in real-world studies. Importantly, the algorithm accurately counts steps and quantifies walking activity in healthy participants and those with ALS who have gait disturbances – even those requiring assistance. This overcomes a major existing shortcoming of gait quantification in ALS trials: the floor effect driven by the inability to accurately count steps as gait deteriorates. With this method, researchers can more assuredly employ accelerometry in clinical trials. We have made our algorithm open-source to enhance transparency and support adoption by researchers in ALS and other populations. The method can also be adapted for other devices, or research questions, as needed. Together, these attributes position the method as a promising foundation for future development and application of digital outcome measures on mobility in ALS and other diseases characterized by deterioration of gait.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (1.6MB, docx)

Acknowledgements

The authors are grateful to participants and their caregivers. They also appreciate researchers who made their datasets publicly available.

Abbreviations

ALSFRS-R

ALS Functional Rating Scale-Revised

ARC

ALS Research Collaborative dataset

BMI

Body mass index

CI

Confidence intervals

CWT

Continuous wavelet transform

DaLiAc

Daily Life Activities dataset

DHT

Digital Health Technology

GT

Ground truth

IWSCD

Identification of Walking, Stair Climbing, and Driving using Wearable Accelerometers dataset

LoA

Levels of agreement

PedEval

Pedometer Evaluation Project dataset

PFALS

Endurance Tasks to Measure Performance Fatigue in ALS dataset

SD

Standard deviation

SPADES

Human Physical Activity dataset

Author contributions

MS proposed the method, identified open-source datasets, processed the data, performed statistical analysis, created figures, prepared the first draft. KMB designed the *PFALS* project, collected data in the *PFALS* project, provided clinical oversight, reviewed the analyses and manuscript drafts. KTC collected data in the PFALS project, provided clinical oversight, reviewed the analyses and manuscript drafts. NC designed the *PFALS* project, provided clinical oversight, reviewed the analyses and manuscript drafts. SM provided clinical oversight, reviewed the analysis and manuscript drafts. AP designed the *ARC* project, reviewed the analyses and manuscript drafts. FV designed the *ARC* project, reviewed the analyses and manuscript drafts. JDB designed the *PFALS* project, provided clinical oversight, reviewed the analyses and manuscript drafts.

Funding

The PFALS study was an investigator-initiated study conducted with funding from Biogen.

Data availability

The PFALS and ARC datasets may be shared upon request and after review and approval by the owners of the data. Shared data will consider deidentified summary statistics used for analytical analysis. Related correspondence should be sent to [jdberry@mgh.harvard.edu](mailto:JDBERRY@mgh.harvard.edu) (PFALS) and [fvieira@als.net](mailto:fvieira@als.net) (ARC).

Code availability

Methods used in this study are publicly available under the following link: (link to be provided upon manuscript acceptance).

Declarations

Ethics approval and consent to participate

This study adhered to the guidelines outlined in the Declaration of Helsinki – Ethical Principles for Medical Research Involving Human Subjects. The study protocol for ARC was approved by the Institutional Review Board (ADVARRA Center for IRB Intelligence). The study protocol for PFALS was approved by the Institutional Review Board of Massachusetts General Brigham (IRB# 2020P000546). In ARC and PFALS all participants provided informed consent prior to any study procedures.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Fritz, S. & Lusardi, M. White paper: “Walking speed: The sixth vital sign”. J. Geriatr. Phys. Ther.32(2), 46–49 (2009). [PubMed] [Google Scholar]
  • 2.Studenski, S. et al. Gait speed and survival in older adults. JAMA305(1), 50–58. 10.1001/jama.2010.1923 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Middleton, A., Fritz, S. L. & Lusardi, M. Walking speed: The functional vital sign. J. Aging Phys. Act.23(2), 314–322 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Radovanović, S. et al. Gait in amyotrophic lateral sclerosis: Is gait pattern differently affected in spinal and bulbar onset of the disease during dual task walking?. Amyotroph Lateral Scler Frontotemporal Degener15(7–8), 488–493 (2014). [DOI] [PubMed] [Google Scholar]
  • 5.Johnson, S. A. et al. Wearable device and smartphone data quantify ALS progression and may provide novel outcome measures. Npj Digit. Med.6(1), 34. 10.1038/s41746-023-00778-y (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Karas, M. et al. Tracking amyotrophic lateral sclerosis disease progression using passively collected smartphone sensor data. Ann. Clin. Transl. Neurol.11, 1380–1392. 10.1002/acn3.52050 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Colón-Emeric, C. S., McDermott, C. L., Lee, D. S. & Berry, S. D. Risk assessment and prevention of falls in older community-dwelling adults: A review. JAMA331(16), 1397–1406 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Verghese, J., Holtzer, R., Lipton, R. B. & Wang, C. Quantitative gait markers and incident fall risk in older adults. J. Gerontol. Ser. A64A(8), 896–901. 10.1093/gerona/glp033 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Schniepp, R. et al. Fall prediction in neurological gait disorders: differential contributions from clinical assessment, gait analysis, and daily-life mobility monitoring. J. Neurol.268(9), 3421–3434. 10.1007/s00415-021-10504-x (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Beswick, E. et al. A systematic review of digital technology to evaluate motor function and disease progression in motor neuron disease. J. Neurol.269(12), 6254–6268 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Chipika, R. H. et al. Tracking a fast-moving disease: Longitudinal markers, monitoring, and clinical trial endpoints in ALS. Front. Neurol.10.3389/fneur.2019.00229 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Proudfoot, M., Jones, A., Talbot, K., Al-Chalabi, A. & Turner, M. R. The ALSFRS as an outcome measure in therapeutic trials and its relationship to symptom onset. Amyotroph Lateral Scler Front Degener17(5–6), 414–425. 10.3109/21678421.2016.1140786 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Cedarbaum, J. M. et al. The ALSFRS-R: A revised ALS functional rating scale that incorporates assessments of respiratory function. BDNF ALS Study Group (Phase III). J. Neurol. Sci.169(1–2), 13–21 (1999). [DOI] [PubMed] [Google Scholar]
  • 14.Blair, H. A. Tofersen: First approval. Drugs83(11), 1039–1043 (2023). [DOI] [PubMed] [Google Scholar]
  • 15.Smith, S. E. et al. Tofersen treatment leads to sustained stabilization of disease in SOD1 ALS in a “real-world” setting. Ann. Clin. Transl. Neurol.12(2), 311–319 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Ruffo, P., Traynor, B. J. & Conforti, F. L. Advancements in genetic research and RNA therapy strategies for amyotrophic lateral sclerosis (ALS): Current progress and future prospects. J. Neurol.272(3), 233 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Straczkiewicz, M. et al. Free-living monitoring of ALS progression in upper limbs using wearable accelerometers. J. Neuroeng. Rehabil.21(1), 223. 10.1186/s12984-024-01514-7 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gupta, A. S., Patel, S., Premasiri, A. & Vieira, F. At-home wearables and machine learning sensitively capture disease progression in amyotrophic lateral sclerosis. Nat. Commun.14(1), 5080. 10.1038/s41467-023-40917-3 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Vieira, F. G. et al. A machine-learning based objective measure for ALS disease severity. Npj Digit. Med.5(1), 45. 10.1038/s41746-022-00588-8 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Musson, L. S. et al. The use of digital devices to monitor physical behavior in motor neuron disease: Systematic review. J. Med. Internet Res.27, e68479. 10.2196/68479 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Dubbioso, R. et al. Cognitive impairment is associated with gait variability and fall risk in amyotrophic lateral sclerosis. Eur. J. Neurol.30(10), 3056–3067 (2023). [DOI] [PubMed] [Google Scholar]
  • 22.Rutkove, S. B. et al. Improved ALS clinical trials through frequent at-home self-assessment: A proof of concept study. Ann. Clin. Transl. Neurol.7(7), 1148–1157 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Tsamassiotis, S. et al. Accelerometers can correctly count orthopaedic patients’ early post-operative steps while using walking aids. J. Exp. Orthop.12(1), e70134. 10.1002/jeo2.70134 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Pan, J. & Wei, S. Accuracy and reliability of accelerometer-based pedometers in step counts during walking, running, and stair climbing in different locations of attachment. Sci. Rep.14(1), 27761. 10.1038/s41598-024-78684-w (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Holdom, C. J. et al. Identification of passive wrist-worn accelerometry outcomes for improved disease monitoring and trial design in motor neuron disease. EBioMedicine10.1016/j.ebiom.2025.105779 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Servais, L. et al. First regulatory qualification of a digital primary endpoint to measure treatment efficacy in DMD. Nat. Med. U. S.29, 2391–2392 (2023). [DOI] [PubMed] [Google Scholar]
  • 27.Straczkiewicz, M., Huang, E. J. & Onnela, J. P. A “one-size-fits-most” walking recognition method for smartphones, smartwatches, and wearable accelerometers. Npj Digit. Med.6(1), 29. 10.1038/s41746-022-00745-z (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.John, D., Tang, Q., Albinali, F. & Intille, S. An open-source monitor-independent movement summary for accelerometer data processing. J. Meas. Phys. Behav.2(4), 268–281 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Leutheuser, H., Schuldhaus, D. & Eskofier, B. M. Hierarchical, multi-sensor based classification of daily life activities: Comparison with state-of-the-art algorithms using a benchmark dataset. PLoS ONE8(10), 1–11. 10.1371/journal.pone.0075196 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Karas, M. et al. Adaptive empirical pattern transformation (ADEPT) with application to walking stride segmentation. Biostatistics22(2), 331–347. 10.1093/biostatistics/kxz033 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mattfeld, R., Jesch, E. & Hoover, A. A new dataset for evaluating pedometer performance. In: 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 865–869 (2017).
  • 32.Straczkiewicz, M. et al. Open-source, step-counting algorithm for smartphone data collected in clinical and nonclinical settings: Algorithm development and validation study. JMIR Cancer9, e47646 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Choi, L., Liu, Z., Matthews, C. E. & Buchowski, M. S. Validation of accelerometer wear and nonwear time classification algorithm. Med. Sci. Sports Exerc.43(2), 357–364 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Straczkiewicz, M., Urbanek, J. K., Fadel, W. F., Crainiceanu, C. M. & Harezlak, J. Automatic car driving detection using raw accelerometry data. Physiol. Meas.37(10), 64 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Goldsack, J. C. et al. Verification, analytical validation, and clinical validation (V3): The foundation of determining fit-for-purpose for Biometric Monitoring Technologies (BioMeTs). Npj Digit. Med.3(1), 55. 10.1038/s41746-020-0260-4 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Bakker, J. P. et al. V3+ extends the V3 framework to ensure user-centricity and scalability of sensor-based digital health technologies. Npj Digit. Med.8(1), 51. 10.1038/s41746-024-01322-2 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.McCullagh, R., Dillon, C., O’Connell, A. M., Horgan, N. F. & Timmons, S. Step-count accuracy of 3 motion sensors for older and frail medical inpatients. Arch. Phys. Med. Rehabil.98(2), 295–302. 10.1016/j.apmr.2016.08.476 (2017). [DOI] [PubMed] [Google Scholar]
  • 38.Stipancic, K. L., Yunusova, Y., Berry, J. D. & Green, J. R. Minimally detectable change and minimal clinically important difference of a decline in sentence intelligibility and speaking rate for individuals with amyotrophic lateral sclerosis. J. Speech Lang. Hear Res.61(11), 2757–2771. 10.1044/2018_JSLHR-S-17-0366 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Montero-Odasso, M., Verghese, J., Beauchet, O. & Hausdorff, J. M. Gait and cognition: A complementary approach to understanding brain function and the risk of falling. J. Am. Geriatr. Soc.60(11), 2127–2136. 10.1111/j.1532-5415.2012.04209.x (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1 (1.6MB, docx)

Data Availability Statement

The PFALS and ARC datasets may be shared upon request and after review and approval by the owners of the data. Shared data will consider deidentified summary statistics used for analytical analysis. Related correspondence should be sent to [jdberry@mgh.harvard.edu](mailto:JDBERRY@mgh.harvard.edu) (PFALS) and [fvieira@als.net](mailto:fvieira@als.net) (ARC).

Methods used in this study are publicly available under the following link: (link to be provided upon manuscript acceptance).


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES