Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2023 Oct 27;19(10):e1011556. doi: 10.1371/journal.pcbi.1011556

Discovering individual-specific gait signatures from data-driven models of neuromechanical dynamics

Taniel S Winner 1,*, Michael C Rosenberg 1, Kanishk Jain 2, Trisha M Kesar 3, Lena H Ting 1,3, Gordon J Berman 4,*
Editor: Lyle J Graham5
PMCID: PMC10610102  PMID: 37889927

Abstract

Locomotion results from the interactions of highly nonlinear neural and biomechanical dynamics. Accordingly, understanding gait dynamics across behavioral conditions and individuals based on detailed modeling of the underlying neuromechanical system has proven difficult. Here, we develop a data-driven and generative modeling approach that recapitulates the dynamical features of gait behaviors to enable more holistic and interpretable characterizations and comparisons of gait dynamics. Specifically, gait dynamics of multiple individuals are predicted by a dynamical model that defines a common, low-dimensional, latent space to compare group and individual differences. We find that highly individualized dynamics–i.e., gait signatures–for healthy older adults and stroke survivors during treadmill walking are conserved across gait speed. Gait signatures further reveal individual differences in gait dynamics, even in individuals with similar functional deficits. Moreover, components of gait signatures can be biomechanically interpreted and manipulated to reveal their relationships to observed spatiotemporal joint coordination patterns. Lastly, the gait dynamics model can predict the time evolution of joint coordination based on an initial static posture. Our gait signatures framework thus provides a generalizable, holistic method for characterizing and predicting cyclic, dynamical motor behavior that may generalize across species, pathologies, and gait perturbations.

Author summary

In this manuscript, we introduce a novel, machine learning-based framework for quantifying, characterizing, and modifying the underlying neuromechanical dynamics that drive unique gait patterns. Standard methods for evaluating movement typically focus on extracting discrete gait variables ignoring the complex inter-limb and inter-joint spatiotemporal dependencies that occur during gait. Popular physiologically realistic modeling approaches encode these spatiotemporal dependencies but are too complex to characterize individual differences in the factors driving unique gait patterns or disorders. To circumvent these modeling complications, we develop a phenomenological model of gait that enables more holistic and interpretable characterizations of gait, encoding these complex spatiotemporal dependencies between humans’ joint angles arising from joint neural and biomechanical constraints. Our coined ‘gait signature’ framework provides a path towards understanding the neuromechanics of locomotion. This framework has potential utility for clinical researchers prescribing individualized therapies for pathologies or biomechanists interested in animal locomotion or other periodic movements assessed across different pathologies, neural perturbations, and or conditions.

Introduction

Locomotion is a ubiquitous, complex, and dynamic behavior that is essential for survival. Using cyclic patterns of joint angles, inter-limb and inter-joint coordination, animals effectively move through their environments: walking, running, trotting, swimming, flying, and crawling. Even within species and types of locomotion, variations in locomotor patterns often occur across behavioral contexts, groups, and individuals. Thus, although locomotor patterns can appear highly stereotyped, considerable inter- and intra-individual variability exists. Studies of locomotor behaviors have shown systematic differences in movement patterns based on a wide range of neural [14] and biomechanical perturbations [58] environmental challenges [9,10], psychological state [11,12], social status [13,14], injury [1517], and disease [4,1823]. Furthermore, locomotor impairments can arise from a wide range of physiological and neurological changes, from the subtle changes that may be indicators of progressive disorders (e.g., aging, cognitive impairments) to profound impairments with brain injury (e.g., stroke, spinal cord injury) that can severely limit locomotor function. Although locomotor deficits are often subjectively visible to a human observer, objectively characterizing and understanding sometimes subtle yet important differences in locomotion from a scientific and mechanistic standpoint has been challenging [2426]. For example, kinematic movement patterns (the continuous motion of joint angles over time) have been collected across a wide range of locomotor modes and species but revealing individual-specific differences in kinematics remains difficult. One barrier to progress is that interpreting individual differences in kinematics without an underlying dynamical model is challenging, as kinematics are the result of the complex neuromechanical dynamics that drive the spatiotemporal dependencies of joint kinematics over time. Thus, capturing these underlying gait dynamics is likely essential for interpreting differences in gait and movement across conditions and individuals.

Traditionally, gait dynamics are modeled using physiologically detailed neuromechanical equations, however making predictive models using this approach has often proved challenging [2729]. Partially, this difficulty arises because in order to understand the dynamics underlying gait, we also need to understand how neural feedback and control shape these dynamics. While many models (e.g., musculoskeletal models) that use principles like optimal control can generate simulations of unimpaired gait, as well as changes in gait due to altered biomechanical or neural constraints, they often fail to predict changes in gait kinematics following neurological injury [28] or more subtle perturbations [30,31]. Progress in the physiological modeling of locomotor circuitry in the spinal cord and brainstem demonstrates the role of neural circuits in gait dynamics. However, these models typically rely on simplified [32,33] biomechanical properties and cannot yet predict the deficits in gait specific to an individual [25,3436]. More importantly, if a hyper-realistic model of the neural and biomechanical system did exist, the relationships between the high-dimensional parameters and actual movement patterns would not likely be unique, as many parameters would not be identifiable, even given massive amounts of data, as many different parameter choices could lead to the same biomechanical output [37,38]. This non-identifiability limits the predictive power and generalizability of these models to other interventions and conditions outside of limited contexts, suggesting a need for a more holistic approach.

Despite these challenges, rich individual-specific information exists in gait data. For instance, through observation of movement, the human brain can perceive many socially salient features of an individual’s gait, suggesting that it should be possible to infer aspects of gait dynamics from kinematic data. As an example, humans can derive a host of information about individuals from movement patterns, including gender [39], body size [40], sexual orientation [41], emotion [42], individual differences in dancing [43], perceived affective states [44] and underlying intention [45]. Furthermore, judgements based on how individuals move can drive decisions such as partner desirability or attractiveness [46] diagnosis [47,48], and treatment planning [49,50].

Despite the rapid advent of technologies providing kinematic measurements through a wide range of techniques, from videos to wearable sensors, we are still limited in how kinematic data can help interpret individual differences in gait [51,52]. Current approaches to comparing biomechanical features or kinematic trajectories quantify between-group differences or inter-individual similarity but lack sufficient sensitivity to reveal interpretable differences in individuals’ gaits [5355]. Inter-joint coordination differs across individuals, as muscular coordination patterns vary across a variety of motor skills and deficits in individual-specific ways. Indeed, metrics of muscle coordination in children with cerebral palsy are consistent with clinician judgements of motor control complexity that predict intervention outcomes [56]. Recently, supervised machine learning methods have been used to classify differences in a large sets of gait kinematics that were labeled by groups or individuals [55,57]. However, these approaches have not modeled the underlying gait dynamics, nor can they discover subtle differences in gait that are not labeled a priori.

Here we develop a data-driven framework for modeling gait dynamics that represents multiple individuals in the same latent space. This latent space reveals individual- and group-level differences in the neuromechanical dynamics of gait. We used kinematic data from multiple healthy and neurologically impaired individuals, each walking at six different speeds, to train a recurrent neural network (RNN) that learns gait dynamics. This phenomenological approach infers complex spatiotemporal dynamics and enables future kinematic predictions to be made based on current and prior kinematic postures. Once trained, differences in gait dynamics across groups, individuals, and walking speed were projected onto a common, low-dimensional latent space of the model parameters. The stride-averaged representation of gait dynamics in the latent space constitutes a “gait signature” that we use to characterize differences across individuals, groups, gait speed, and impairment severity. To demonstrate the generalizability of gait dynamics, we show that interpolating gait signatures to predict gait kinematics at new walking speeds is more accurate than interpolating the kinematics themselves in healthy individuals. Further, we show that the low-dimensional basis functions we discovered have biomechanical interpretability in terms of the inter- and intra-limb coordination patterns that they generate. The dynamical projections onto each basis function for each trial can be independently driven through the trained gait dynamics model to reconstruct the kinematics associated with that specific basis function. We generated illustrations of the reconstructed joint angle kinematics to visualize and infer what aspects of gait coordination each subcomponent influences. These subcomponents of gait coordination can be manipulated independently (i.e., gait sculpting) to infer the relationships between specific underlying dynamical components and their corresponding kinematic phenotypes and to identify what specific gait rehabilitation strategies are likely required for individuals. Finally, our gait dynamics model is generative; it can predict individual-specific time evolution of kinematics from an initial arbitrary posture (self-driving) once the network is primed with several gait cycles of the individual’s kinematic data. This study establishes a new data-driven framework to quantitatively interpret individual-specific differences in gait dynamics with the potential to enable discovery in a wide range of gait coordination deficits, contexts and interventions in humans and other animals.

Results

Gait signatures: A low-dimensional representation of gait dynamics

We used motion capture to collect sagittal-plane kinematic data that consisted of 15 seconds of continuous gait kinematics from bilateral, hip, knee, and ankle joints from 5 able-bodied (AB) participants and 7 stroke survivors (> 6 months post-stroke, gait speeds 0.1 to 0.8 m/s) walking on a treadmill at a range of six different speeds each. Taking inspiration from neural network models that capture neural dynamics [5860] and biological systems, we implemented a recurrent neural network (RNN) model to capture the dynamical properties of gait. Our model input parameters only include kinematic data and do not include anthropometric information or clinical characteristics and do not account for differences in joint kinematics due to neural versus biomechanical constraints.

Developing the recurrent neural network (RNN) architecture and training the model

The gait dynamics model was developed in Python using common Python libraries, including TensorFlow, Keras, Pandas, and NumPy. We developed our code in Google Colab to facilitate open-source sharing of our dynamic framework, which can be found here: https://github.com/bermanlabemory/gait_signatures. The model architecture was selected based on two criteria: 1) minimizing model training and validation loss during model fitting, and 2) maximizing the similarity of short-time (single stride) and long-time (multiple strides) self-driven model predictions (termed: gait signature alignment) post model training (S1 Fig). By implementing these two model selection criteria we ensure 1) a high goodness-of-fit (model that best represents the underlying dynamics across all participants and gait speeds) and 2) the model is capable of predicting the time-evolution of gait (encode gait dynamics). We evaluated these criteria against alternative models by varying 2 hyperparameters (number of LSTM units and the lookback time, see Methods). The selected model architecture is a sequence-to-sequence RNN [61] consisting of an input layer, a hidden layer of 512 LSTM units, and an output layer. The RNN learns a map from time-series kinematic input data (0 to T-1) to kinematics one time-step in the future (1 to T) for all training trials (Fig 1A). The model was trained using the ‘mean squared error’ (mse) loss function until training and validation error converged and stabilized around the same point (< 0.03 degrees2). Thus, the model successfully learns the underlying dynamics of gait (S2 Fig). The model’s internal states capture trial-specific dynamics predicting the time evolution of joint kinematics; activation coefficients (H) and memory cell states (C) and are tuned based on kinematic inputs. Kinematic data was input in multivariate format, not concatenated [62,63]. In brief, our RNN model was designed to capture short and long-term gait dependencies in time [64,65] as well as inter-and intra-limb coordination over time, uncovering features of gait that were not previously targeted or used in gait analysis. To verify whether our model was generalizable, we conducted leave-one-out cross validation, where 12 different models were trained leaving a single individual’s 6 trials on each model run (S3 Fig). Stroke-survivors are known for having neurological impairments that result in heterogeneous gait dysfunction that are not fully understood; thus, we anticipate that our gait dynamics model will capture and shed light on these individual-specific deficits in gait coordination, identify similar coordination strategies or deficits amongst our stroke cohort, and allow us to compare these different gait dysfunctions to the able-bodied ‘normative’ gait (controls).

Fig 1. Pipeline figure outlining the steps to generating individual-specific gait signatures.

Fig 1

Continuous, multi-joint kinematics from multiple individuals are fed into the RNN model as input data and the model is trained sequence-to-sequence to predict one-step time shifted output kinematics. High dimensional internal parameter (H and C) time traces per individual are extracted and principal component analysis was applied to reduce the dimensionality of the data to form individual gait signatures (A). 3D time trace visualizations of 3 representative individuals (able-bodied (blue), high-functioning (red), low-functioning stroke (orange)) of the 1st 3 dominant principal component contributions (B, left). 3D projections of the 6-D gait signatures using multi-dimensional scaling (MDS) reveal different gait dynamics amongst the three gait groups: able-bodied (blue), high-functioning (red) and low-functioning (orange) stroke survivors (B, right). The size of the circles represents the individual’s trial speed (i.e., the smallest circles represent an individual’s slowest gait speed, and the size of the circles increase with gait speed).

Generating gait signatures

To generate gait signatures, kinematic trajectories from each walking speed trial across participants were fed as input into the trained neural network and the corresponding internal states (H and C parameters, see above) were extracted (Fig 1A). The internal activations prescribe the spatial and temporal dependencies generating the input kinematics. The resulting time-series of 1024 internal states (512 H, 512 C parameters) were reduced in dimension using Principal Components Analysis (PCA) and phase averaged [66]. Phase averaging is applicable here, as the underlying gait dynamics are periodic, and the translation from time to a phase between 0 and 2π allows us to describe all internal state dynamics in a speed-independent manner.

The first 6 Principal Components (PCs) explain ~72% of the variance in gait dynamics (S4 Fig), allowing us to focus on these modes for our visualization and analysis. The time-varying contributions of the first 3 dominant PCs were plotted in 3D for 3 representative individuals—able-bodied adults, high-functioning stroke (self-selected (SS) walking speed > 0.4m/s) and low-functioning stroke (SS speed < 0.4m/s)—highlighting that the gait dynamics between all 3 individuals are different (Fig 1B, left). The gait dynamics of the high-functioning stroke survivor (red), while spatially closer to the able-bodied individual (blue) than the low-functioning stroke survivor (orange), show observable differences in its dynamical trajectory between to the two individuals. To determine whether some structure exists amongst the three different subject groups, all the 6-dimensional gait signatures were projected onto a 3D map using Multidimensional scaling (MDS) [67] to visualize relative distances between all gait signatures (Fig 1B, right). The locations of the 3 MDS projections of the 3 representative individuals are not arbitrary, as they belong to clusters of gait signatures of the same gait group. Thus, gait signatures preserve key clinically relevant features of the underlying gait dynamics, independent of the individual or speed.

Gait signatures reveal that individual-specific differences in dynamics are favored in the gait representation, over differences in gait speed

Gait signatures of individuals’ 6 speed trials within both cohorts (healthy and stroke) are tightly grouped together. Gait signatures represent individual-specific dynamics; the unimpaired cohort exhibit a stereotyped low-dimensional structure across individuals in the able-bodied cohort (Fig 2Ai, left) vs. the impaired cohort, which display much more variable (i.e., highly individualized) low-dimensional representations (Fig 2Ai, right). Because the data are phase averaged over the gait cycle, we demonstrate that gait signature trajectories are well-aligned with the four gait phases (leg 1 swing, leg 1 stance, leg 2 swing, leg 2 stance), enabling phase-specific comparisons of differences in gait dynamics. The unimpaired group showed similar structure across the four gait events (Fig 2Aii, left), whereas there was much more variability within the impaired group (Fig 2Aii, right), revealing individual-specific differences within and across distinct parts of the gait cycle. The similarity between gait signatures was computed and visualized in a dimensionally reduced gait map space using MDS and colored according to the different individuals in the dataset (Fig 2Bi). The unimpaired group form a cluster in the gait map, showing that individuals in the unimpaired group are distinct from the impaired group. Stroke-survivors occupy distinct positions from other impaired individuals’ sub-clusters in the gait space that highlight the well-established but poorly understood heterogeneity in gait deficits in the stroke cohort. Furthermore, individual-specific gait signatures change slightly as individuals walk faster than their self-selected pace (Fig 2Bi). However, these within-subject speed-induced changes are much smaller than between-individual difference in gait signatures. The gait signatures of the individuals belonging to the able-bodied, high-functioning, and low functioning stroke survivor cohorts show 3x, 4x and 7x larger distances between individuals in the group versus within each individual 6 speed trials, respectively. We calculated the Euclidean distance between individuals’ self-selected speed trial gait signature and the calculated able-bodied centroid (Fig 2Bi, black square) and the results shown on the plot to the right reveal that low-functioning stroke survivors (characterized based on the clinical definition of having a self-selected walking speed of <0.4 m/s) are further away from the able-bodied cluster than the high-functioning stroke survivors. Note that no information from a clustering algorithm was used to characterize low- vs high-functioning individuals. The encircled low-functioning stroke survivors (Fig 2Bi, orange enclosure) were labeled post-hoc to demonstrate the lack of a single cluster characterizing low- vs high-functioning stroke survivors. Showing the validity of our approach, low-functioning stroke survivors are less dynamically similar to AB than higher functioning stroke survivors.

Fig 2. Gait signatures reveal highly individualized low dimensional representations of gait dynamics irrespective of absolute gait speed.

Fig 2

A) 3D unimpaired (left) and impaired (right) gait signatures colored by i) individual and ii) gait phase. Gait signatures are grouped together according to individuals within both cohorts (same hues of blue cluster together for unimpaired (i, left) and similarly the same hues of red cluster in the impaired cohort (i, right)). In our convention the right leg of all unimpaired individuals was assigned to be the paretic leg and left leg the non-paretic leg. Impaired individuals can have either left or right leg paresis. Unimpaired gait signatures reveal a similar looped structure across the four gait phases that occur during a gait cycle (leg 1 swing, leg 1 stance, leg 2 swing, leg 2 stance) (ii, left) whereas impaired signatures showed individual-specific differences across the four phases and were more variable (ii, right). B) 3D multidimensional scaling applied to all gait signatures shows the pronounced separation between unimpaired (blue hues in left section of map) and impaired (red hues in right section of map) gait dynamics (i). Impaired signatures (red hues) are located further away from the centroid of all unimpaired gait signatures (black square), indicating that they are less dynamically similar to the unimpaired individuals. The smallest circles represent an individual’s self-selected walking speed trial and larger circles correspond to the faster speed trials. Low-functioning stroke survivors (encapsulated in orange; based on self-selected gait speed < 0.4m/s) are located furthest away (largest Euclidean distances) from the unimpaired centroid (i). Gait speed does not appear to strongly influence the differences in dynamics between individuals as similar speed gait signatures are in different regions of the gait map (ii). Particularly, gait speed does not explain the heterogeneity in low-functioning stroke survivors’ gait dynamics.

Gait speed does not appear to strongly influence the differences in dynamics between individuals’ gaits (although the range of gait speeds for each participant may not have been wide enough to elicit major differences in their overall dynamics). It is worth noting that the walking speeds in our post-stroke cohort spanned the full speed range of each participant’s safe walking capacity, whereas this was not the case for our able-bodied cohort. Overall, as expected, the unimpaired group walked at faster speeds than the impaired group (Fig 2Bii). Individuals in the able-bodied cluster walk at a range of different speeds, but individual gait signatures still cluster tightly together. Despite the able-bodied dynamics being similar, there still exists inherent variability in their gait dynamics that may be explained by factors such as prior exercise and sports-training history, injury, disease, etc. Post-stroke individuals who walk at similar slower speeds, however, maintain their own distinct individualized groupings. Thus, individuals’ characteristic gait signatures were preserved across their range of walking speeds and were not grouped based on absolute walking speed. For example, several clinically similar post-stroke individuals (similar overground walking speed and Fugl-Meyer score [68]) have very different gait signatures that remain recognizable across a range of gait speeds (Fig 2). Although the low-functioning individuals in our sample are more dispersed than high-functioning individuals, we expect that the spaces between individuals represent a continuum of gait dynamics that would be filled given a larger sample size.

Furthermore, when used to distinguish between gait groups and identify individuals, gait signatures perform similarly to using a set of 26 commonly used discrete variables (S5 Fig). Discrete variables are already sufficient to classify between able-bodied and stroke gaits, with numerous studies identifying key variables that map to function/impairment [6971]. With Gait signatures, we achieve the same level of classification without needing to hand-pick discrete variables or to use force plates or inverse-dynamics analyses that would require more equipment, computation, and subject-specific anthropometry for each observation. Gait signatures also perform better than continuous kinematics and joint velocities at these same discrimination tasks (S5 Fig). These results serve as a positive control, as researchers previously could distinguish gait groups by building a classifier based on important subjectively selected discrete variables. Here, we have created a dynamical representation that can distinguish groups with similar accuracy. It is not surprising that the continuous kinematics performed worse than the RNN gait signatures (which were developed from these very same data), as the RNN model used the data to encode important time-varying changes in the kinematics, allowing for more information to be extracted. Thus, parameterizing the evolution of individuals’ walking patterns into a common subspace allows for a more holistic, less biased, and straightforward analysis of primarily their overall differences in gait dynamics, inter- and intra-limb coordination over any differences attributed to absolute gait speed. Gait signatures can allow gait researchers to study or analyze the dynamical differences underlying impairment independently from gait speed, facilitating analysis of dynamics between individuals who may not be capable of walking at the same speeds and allowing investigation of changes in the underlying mechanism of gait changes under different conditions (walking speed, gait rehabilitation intervention, age etc.)

Low-functioning stroke-survivors are less dynamically analogous to able-bodied and more dynamically variable compared to high-functioning stroke-survivors

Clinically, gait rehabilitation researchers use gait speed as a primary quantitative indicator of gait dysfunction [19,72,73]. While this coarse metric gives an overall value or number to one’s overall gait function, it does not identify the specific impairments underlying the individuals’ gait. To derive more precise measures or indicators of gait impairment, we anticipated that utilizing this gait signatures framework, we would be able to capture both subtle and obvious differences in kinematic patterns underlying impaired gait. In the clinic, stroke survivors are typically segmented into subgroups according to their self-selected walking speeds: high-functioning stroke survivors with a self-selected (SS) walking speed above 0.4m/s and low-functioning stroke survivors who adopt SS walking speeds less than 0.4m/s [74]. It is assumed that low-functioning stroke survivors are more impaired and thus adopt slower walking speeds to be able to navigate the environment safely. However, gait deficits of stroke survivors within either sub-group are heterogeneous across individuals and include different impairments such as foot drop, reduced paretic push-off during late stance, limited initial heel contact during early stance, as well as compensatory gait strategies such as hip circumduction and hip hiking. We expected that higher functioning individuals would have less severe impairments and would be more dynamically analogous to able-bodied individuals, whereas low-functioning stroke survivors would exhibit highly variable impairments from each other and be even less dynamically analogous to able-bodied dynamics compared to higher functioning stroke survivors.

To better visualize all developed individuals’ gait signatures across their 6 different speed trials in our dataset, we again used MDS to project the 6D gait signatures to 3D. This mapping allows us to visualize the relative locations of individuals in comparison to all the other gait signatures to gain insights on how dynamically similar they are from one another. A 3D MDS gait map of all gait signatures reveals that able-bodied and high-functioning stroke survivors are located near each other, whereas low-functioning stroke survivors are farther and more dispersed and form distinct clusters in different regions of the map (Fig 3A). Sub-group level analysis reveals significant differences in the Euclidean distance metric (distance between each gait signature and the able-bodied centroid) between the able-bodied group and the low- and high-functioning stroke survivor groups, respectively (Fig 3B). Able-bodied gait signatures are located closest to the centroid, followed by high-functioning and low-functioning stroke survivors (Fig 3B). The within-group dispersion of gait dynamics for the low- and high- functioning stroke survivors was calculated based on the radius of a hypersphere enclosing 95% of the groups’ gait signatures. Using a leave-one-out sample with replacement method, multiple within-group dispersion calculations were conducted for each group and the average within-group dispersion was expressed alongside the standard error in Fig 3C. The 95th percent radius was significantly higher in the low-functioning stroke-survivors gait signatures compared to the high-functioning, highlighting that low-functioning gait signatures were more dispersed from each other (higher inter-individual variability) and the RNN model can capture these individual-specific gait deficits in individuals with more severe gait impairment.

Fig 3. Comparison of gait signatures across three gait subgroups: able-bodied (AB), high functioning (HF) and low functioning (LF).

Fig 3

A) 3D gait map using multidimensional scaling highlights the relative distances between AB (blue), HF (red) and LF (orange) stroke survivors. LF stroke survivors are less clustered and occupy distinct regions of the map away from the able-bodied centroid (black square). B) Gait dynamics similarity based on Euclidean distance between AB centroid and each participant, showing larger distances within the low-versus high-functioning groups. C) Within-group dispersion of gait signatures based on the radius of a hypersphere enclosing 95% of each group’s gait signature reveals more dispersed gait signatures in low- versus high-functioning stroke survivors, highlighting the potential of gait signatures to capture individual differences in more severe gait impairments.

Gait signatures are biomechanically interpretable

While Principal Component trajectories and low-dimensional maps provide one way to compare the overall dynamics between individuals and groups, it remains to be seen what information the independent components of the 6D gait signature represent biomechanically. The contributions of each principal component (PC) to a gait signature fluctuates over the gait cycle, shown for an exemplar able-bodied, one high-functioning stroke survivor, and one low-functioning stroke survivor in Fig 4A. Superimposed individual stride-averaged PC projections from these 3 individuals (Fig 4B) highlight the specific differences in each PC. For PC1, both able-bodied and high-functioning stroke survivor traces are within the able-bodied 95% confidence interval, whereas the low-functioning stroke survivor is outside of these bounds around the middle of the gait cycle. For PC2, some regions of the low and high-functioning stroke survivor can be found outside of the confidence interval, however the entirety of the PC3 projection of the low-functioning stroke survivor is found outside of interval (vertically shifted). Given the generative nature of our RNN-based model, a specified number of the loadings on the PCs can be driven through the trained RNN model to reconstruct the corresponding kinematics. Thus, to interpret the individual PC components, the internal parameters corresponding to each isolated PC were driven through the gait dynamics model, generating gait predictions, i.e., a multi-joint coordination pattern and their temporal evolution over the gait cycle that can be visualized in an animation or gait movie. Stick figure snapshots (7 equally spaced samples of 100 frames) show that PC1 encodes dynamics driving hip flexion and extension, PC2 encodes dynamics driving knee flexion and extension and PC3 encodes dynamics driving primarily postural coordination (trunk location relative to joints) (S1S4 Videos). This framework can potentially allow for the identification and targeting of individual-specific gait deficits, informing the tailoring of precision rehabilitation strategies.

Fig 4. Biomechanical interpretation of gait signatures.

Fig 4

A) Gait signatures reveal different gait dynamics between exemplar AB, low-and high-functioning stroke survivors. B) The loadings on each principal component (PC), e.g., the contributions of each PC vary over the gait cycle and can be compared to the AB 95% confidence interval (gray). C) Each PC generates specific multi-joint gait coordination patterns when used to drive the gait model, enabling biomechanical interpretation of gait deficits and effects of treatment.

The gait dynamics model generalizes to unmeasured speeds

Our gait signature model can capture and predict nonlinear changes in dynamics in response to speed in cases where interpolation of kinematics may fail. We trained a different gait dynamics model using only 15s of data of the 2 fastest and 2 slowest walking speeds of each subject. Weighted averages of gait signatures from an individual walking at these four different gait speeds can be used to generate multi-joint kinematic trajectories that predict data from a gait speed that was not used to train the model (Fig 5). Predicted kinematics from interpolation of gait signatures across the four speeds resemble the measured kinematic reference more accurately than do the kinematics generated from interpolating gait kinematics, shown for an exemplary AB individual (Fig 5A) and low-functioning stroke survivor (Fig 5B). Kinematic prediction from interpolation of dynamics did considerably better than interpolating kinematics directly for the exemplary low-functioning stroke survivor shown in Fig 5B, indicating that interpolating gait signatures capture nonlinear (non-monotonic) changes in kinematics between speeds. The kinematic output of the interpolated kinematics follows that of the fast speed in the paretic hip closely but does not resemble the measured kinematic reference waveforms for the paretic knee or ankle angles. In some cases where interpolation of kinematics fails, the averaged dynamics do a better job at predicting kinematic trajectories at unseen speeds. Group level analyses show that the R2 values between the measured and predicted kinematics from interpolated gait dynamics are significantly higher (Wilcoxon paired signed rank test) than interpolating kinematics within the able-bodied cohort (Fig 5C), but not for stroke (Fig 5D). In general, averaging gait dynamics produced less variable R2 values and less R2 outliers than averaging kinematics in both the able-bodied (Fig 5C) and stroke survivors (Fig 5D). The range of R2 values in the able-bodied cohort for averaged dynamics was -0.20 to 1.00 compared to –1.30 to 0.98 in averaged kinematics whereas the range of R2 values in the stroke cohort for averaged dynamics was 0.46 to 1.00 compared to -0.50 to 1.00 in averaged kinematics. Two low-functioning stroke survivors show higher R2 values of their hip, knee and ankle kinematic traces when interpolating kinematics vs. dynamics. Post hoc analysis revealed that these two stroke survivors (ST4 and ST2) were furthest away from the able-bodied centroid (least dynamically similar to able-bodied) as shown in Fig 2Bi. These results suggest that the RNN largely captures more stereotyped able-bodied dynamics and has a harder time learning the dynamics from more variable stroke individuals, especially those that deviate furthest from able-bodied. We acknowledge that our model likely is not capable of generalizing to speeds beyond the ranges of the input data (extrapolating), as RNNs are highly dependent on the training data that it sees to learn patterns in the dataset. One benefit of this capability, however, is that any data that deviates from the walking speeds in the training set can still be analyzed, reducing the number of speeds required in the training set to achieve a model that is valid across a range of speeds. Additionally, our small sample size limits the amount of data the RNN sees for each diverse type of stroke dynamics, thus, with a larger sample size of stroke survivors and longer trials, the RNN may be able to make better kinematic predictions of lower-functioning stroke survivors. Moreover, this result highlights the utility in predicting kinematics in unseen conditions which in contrast cannot be made using discrete biomechanical or clinical metrics, nor with current biophysical models.

Fig 5. Data-driven gait dynamics model predicts non-linear changes in joint kinematics with gait speed.

Fig 5

Gait predictions of joint kinematics (green) at intermediate gait speeds not used in model training were generated by interpolating gait signatures between slow (dashed grey) and fast speeds (dashed black) lines and using them to drive the gait model. Interpolated kinematics from gait dynamics (green) and interpolated directly from kinematics (blue) were compared to the measured reference kinematics (black solid). A) Predictions in an exemplar AB participant are more accurate when interpolating gait signatures compared to interpolating gait kinematics across speeds. B) In an exemplar low-functioning stroke survivor, interpolated gait signatures predict nonlinear changes in kinematics better at intermediate speeds than interpolated gait kinematics. Averaging the kinematics fail in this case where there are larger differences between the slow and fast speed paretic kinematics; the averaged kinematics (blue) follow the fast speed paretic hip kinematics whereas the other angles do not reflect waveforms that resemble either the fast or slow speed. The gait model can therefore predict movement reasonably well when interpolating between tested speeds. There is a statistically significant difference between group level R2 comparisons (kinematics generated from interpolated dynamics vs interpolated kinematics) in the able-bodied (C) but not in stroke (D) cohorts. However, the range of R2 values are larger in both able-bodied and stroke kinematic predictions resulting from interpolated kinematics (-1.30–0.98, -0.50–1.00 respectively) vs. predicted from interpolated gait dynamics (-0.20–1.00,0.46–1.00 respectively). Thus, while the R2 values may not improve on average for the stroke survivors, the model’s performance is more robust overall.

Gait sculpting: Manipulating the PC components of an individual’s gait signature identifies specific coordination deficits in stroke survivors

Previously, we showed that we can leverage our model to reconstruct the kinematics of healthy PC projections of the gait signature to gain insight into their independent biomechanical interpretations. However, identifying and interpreting the biomechanics related to impaired PC dynamics of stroke-survivors’ gait would prove to be even more beneficial, as these dynamics can potentially serve as rehabilitation targets when designing tailored gait intervention/strategies for individuals. Here we present an example of how we use gait signatures to identify specific biomechanical or coordination targets in specific stroke survivors. Specifically, we utilize our finding that the phase-varying contributions of the 6 principal projections of the gait signature differ in individual-specific manners (Fig 6A). For example, AB2’s 6 PC contributions all lie within the 95% confidence interval of all able-bodied individuals. ST4 primarily shows major deviation from AB in PC 3 (located entirely above the AB confidence interval), impaired dynamics during paretic swing in PC 4 and overall irregular shapes in PC 5 and 6. ST2’s PC3 is largely within the AB confidence interval, however PC 4’s paretic swing shows major deviation, their PC 5 contribution is shifted below the AB confidence interval and PC 6 shows an irregular shape. ST3’s PC5 projection lies below the AB confidence interval and PC 6 projection is irregularly shaped compared to AB. To validate our finding that suggested that PC 3 primarily influences hip flexion or extension, we exchanged AB2’s healthy PC1 with that of ST4 (Fig 6A, orange boxes and arrow) and we observed if and how AB2’s original hip joint kinematics (Fig 6B, orange box, black trace) deviated (Fig 6B, orange box, red dashed trace and S5 Video). To gain further insight into how the other PC deviations manifest in movement, we manipulated the PC3 projection of AB2 by replacing it with that of ST4 (Fig 6A, brown boxes and arrow). The kinematic reconstruction from this manipulation (Fig 6B, brown box, red dashed trace and S6 Video) shows a vertical shift downwards for bilateral hip angles and the non-paretic knee. The vertical shifts in the hip flexion/extension angles suggest a major difference in this individual’s posture (perhaps stroke individual leaned forward more during gait) compared to able-bodied. We manipulated AB2’s PC4 projection by replacing it with that of ST2 (Fig 6A, green boxes and arrow). This manipulation affected specifically the paretic and non-paretic ankle angles and both knee joints primarily during the period between non-paretic stance and paretic swing (Fig 6B, green box, red dashed trace and S7 Video). This result highlights a coordination deficit between these specific joint angles and, if targeted accurately, may allow for corrected gait patterns of this stroke survivor. Conversely, we tested the effects of replacing an impaired PC projection with a healthy one to observe how gait impairments can potentially be improved. We replaced ST3’s PC5 with that of AB2 (Fig 6A, purple boxes and arrow) and observed a substantial change in the magnitude and shape of bilateral ankle angle trajectories and slight increase in non-paretic knee magnitude (Fig 6B, purple box, red dashed trace and S8 Video). We can infer that to make improvements to ST3’s PC5 towards able-bodied or normative kinematics, rehabilitation focusing on these specific knee and ankle strategies may prove useful.

Fig 6. Gait sculpting: interpolating between components of able-bodied and stroke gait dynamics to visualize anticipated gait improvement.

Fig 6

The components of individuals’ gait signatures can be manipulated (gait sculpting) to understand the relationship between specific underlying dynamics and their corresponding kinematic phenotype. A) The projection on each of the 1st 6 principal components (PCs) can be observed for a representative able-bodied (AB2), two low functioning stroke-survivors each having similar self-selected (SS) speeds and Fugl-Meyer (FM) scores (ST2 & ST4, as denoted in Fig 2) and another low functioning stroke survivor (ST3) who has a higher FM score and faster SS walking speed. The PC projections are colored according to the 4 gait phases (non-paretic swing, non-paretic stance, paretic swing, paretic stance). The right leg of the unimpaired individuals was arbitrarily assigned to be paretic and the left leg, non-paretic for consistency. Colored boxes and arrows (orange, brown, green, purple) show specific, single PC manipulations, for example, the orange boxes and arrow illustrate that the PC 1 projection of AB2 was replaced with the impaired PC 1 projection from ST4. B) The AB2:ST4 manipulation (orange) shows how AB2’s original phase averaged kinematics (black trace) was manipulated by ST4’s impaired PC 1 projection (red dashed traced). ST4’s impaired PC 1 manifests in AB2’s healthy kinematics showing deviation primarily in the hip kinematics (as suggested in Fig 4 where healthy PC 1 encodes a kinematic subcomponent corresponding to hip flexion/extension) and some deviation in the ankle angles, especially the paretic ankle. The AB2:ST4 manipulation (brown) shows how ST4’s impaired PC3 manifests in AB2’s healthy kinematics; we observe a vertical shift downwards (red trace) of the bilateral hip angles as well as the non-paretic knee. This change in hip flexion highlights that this impaired PC3 encodes a reduction in the hip flexion angles; pointing to a more crouched gait (trunk is leaning forward more). The AB2:ST2 manipulation (green) shows replacing AB2’s PC4 projection with ST2’s impaired PC4 dynamics shows deviation in the knee joints especially during paretic swing, a vertical shift upwards in the paretic ankle angle kinematics and deviations around the middle of the gait cycle (transition between non-paretic stance and paretic swing) in the non-paretic ankle kinematics. Alternatively, the AB2:ST3 manipulation (purple) the impaired PC5 in ST3 is replaced with the healthy PC5 projection from AB2 resulting in slight increase in non-paretic knee magnitude and reduced amplitude of paretic and non-paretic ankle flexion. The result of this manipulation points to potential predicted improvements (or deviations) that can occur when aiming to mimic PC5 healthy dynamics in this stroke survivor allowing offline in-silico testing of potential avenues for gait rehabilitation for this stroke survivor.

Self-driven signatures: Our gait dynamics model revealed robustness of gait predictions establishing the utility of gait signatures in precision medicine

The ability to predict future kinematics based on measured data is key to rapid, virtual design of personalized interventions. We demonstrate that the recurrent neural network model of gait dynamics, once primed with several gait cycles of data from either able-bodied or stroke participants, can predict future joint angle trajectories (Fig 7). Once the network is primed, an initial posture is presented (initial condition, denoted by blue vertical bar) after which the model self-drives i.e., predicts the general shape of future kinematics in a feedforward manner (without referencing previous measured data points) in an able-bodied (Fig 7Ai, left) and stroke individual (Fig 7Bii, left). A smooth transition is seen between the previously measured gait cycle (green) and the self-driven cycle (red trace) for both AB and stroke (Fig 7Ai, right and 7Aii, right respectively).To verify that the model was not generating a gait cycle prediction entirely by chance, we calculated the Euclidean distance between the kinematics of the predicted (self-driven) gait cycle and the kinematics from each of the measured gait cycles. We computed the distribution of distances between each predicted gait cycle with all other gait cycles from the same individual (Fig 7B, purple bars). We then compared the predicted gait cycle to the target gait cycle (Fig 7B, red bars). In the able-bodied individual, the predicted gait cycles are more similar to the target gait cycle (Fig 7Bi, red bar) than 79% of all gait cycles. However, in the stroke survivor, 60% of other gait cycles were more similar to the predicted gait cycle than the target gait cycle (Fig 7Bii, red bar). This suggests that the model is less able to accurately predict future kinematics in stroke gait. Note, to calculate the Euclidean distances between the gait cycles, we need to normalize the period of each gait cycle to the period of the self-driven cycle. To avoid the potential of bias due to this normalization, we also performed a comparison using a metric that was not manipulated in time–gait cycle duration. After priming the model, we presented the model with the first posture of the trial and ran the network forward in self-driving mode for the remainder of the trial length (15 seconds). Able-bodied self-driven predicted kinematics resembled the reference kinematics closely (Fig 7Ci, top plot) whereas stroke self-driven predicted kinematics matched the first gait cycle closely but soon converged to patterns reflecting able-bodied kinematics (Fig 7Cii, top plot). The gait cycle duration of the first few cycles of the self-driven kinematics match those of the measured kinematics (blue dots located close to the y = x line) in both the exemplary able-bodied (Fig 7Ci, bottom plot) and stroke individual shown (Fig 7Cii, bottom plot), however kinematics soon diverged to shorter and relatively consistent gait cycle durations (blue dots appearing almost horizontal in the plots) result in both cases. The model may preferentially predict able-bodied kinematics, which were less variable between individuals than were post-stroke kinematics. These results highlight that the model encodes gait dynamics that can predict kinematics over short timescales, but the variability and amount of training data may influence predictive power over long timescales. Our model provides a foundation for a sample-specific gait dynamics model to predict the effects of environmental perturbations, assistive devices, and treatments without extensive experimental sessions. In-silico predictions using the gait dynamics model may thus reduce the experimental time, cost, and participant burden required for personalized gait characterization, treatment personalization, and device design.

Fig 7. Our trained RNN model can predict the time evolution of kinematics from an initial posture.

Fig 7

The trained gait dynamics model can predict individual-specific time-evolution of gait kinematics from an arbitrary initial posture (self-driving) in able-bodied (A, i) and stroke (A, ii) once the network is primed with several gait cycles of an individual’s data (gait cyclex-n(measured), black solid). This predictive ability shows that the model encodes the gait dynamics underlying movement. Despite inter-cycle kinematic variability, the gait dynamics model can predict the general shape of the next gait cycle of kinematics (gait cyclex(predicted), red) in an able-bodied individual (A, i) and stroke survivor (A, ii), however, predicted kinematics (red) show larger deviation from the measured reference gait cycle (gait cyclex(measured), black dashed) in the stroke survivor. A smooth transition exists between the measured kinematics from the gait cycle preceding (gait cyclex-1(measured), green) the self-driven predicted cycle (red). For the representative able-bodied individual (B, i), the Euclidean distance (deviation) between the predicted gait cycle of kinematics and its respective measured kinematics (reference) is ~79% lower than the distance between the other gait cycles in the trial; ruling out that the kinematic predictions are attributed to chance. The deviation (Euclidean distance) of the predicted gait cycle of stroke (B, ii) kinematics to its reference gait cycle is ~40% lower than the distance between the other gait cycles in the trial. This suggests that the dynamical model is less able to accurately predict stroke kinematics better than chance. The dynamical model was first initialized with all the trial’s kinematics data (15 seconds) (black trace) after which the trial’s initial posture was presented to the model to self-drive kinematics (red trace) in feedforward mode for 15 seconds (C, i, top plot). The duration of each gait cycle from the measured kinematics is not well encoded by the dynamical model; gait cycle durations of the predicted kinematics are typically underestimated in both able-bodied (C, i, bottom plot) and stroke (C, ii, bottom plot) (to a larger degree) in self-driving mode and as such deviate from the y = x reference line (black).

Discussion

Summary

Here we establish a data-driven framework for comparing and predicting individual-specific locomotor patterns without needing to construct physiologically based mechanistic models. As an initial proof of concept, complex neuromechanical gait dynamics were modeled using a relatively simple recurrent neural network that captures the rules by which joint kinematics during gait transition from one time point to the next. Because the network was trained on multiple healthy and impaired individuals walking at several speeds, its internal parameters provide a basis for comparing, interpreting, and predicting gait dynamics. Gait signatures further capture coordination between joints and limbs without the need for pre-selecting gait features that may introduce bias and ignore the continuous nature of gait. We show that individuals have little variance in gait dynamics across speeds, leading to the individual-specific “gait signature” concept and enabling comparisons between individuals moving at different speeds. Across stroke survivors, we found greater heterogeneity in low-functioning individuals who exhibited disparate gait dynamics despite similar clinical metrics, highlighting the potential utility of gait signatures in providing more sensitive diagnoses to personalize therapies. Gait signatures provide a predictive simulation framework for sculpting gait dynamics to understand coordination deficits and predict kinematics, potentially forecasting the effects of rehabilitative devices or treatments. Finally, the gait signatures methodology can be readily applied to other periodic motions across species and across conditions that alter movement and may be a powerful adjunct to modern experimental methods aimed at understanding the neural mechanisms underlying movement.

Computational framework captures the neuromechanical dynamics of walking

Using a data driven modeling approach enabled us to learn the underlying gait dynamics based on data rather than constructing a neuromechanical gait model based on first principles. Data-driven approaches in gait have not focused on gait dynamics but have solved tasks based on unique features in multi-dimensional gait data such as classifying gait based on pathologies [75] or conditions such as fatigue and non-fatigue [76], identifying gait events (e.g., initial contact, loading response [7779]), and discriminating individuals [62,80]. Gait dynamics have typically been described though neuromusculoskeletal models based on physical principles focusing on musculoskeletal mechanics, [30,81] but they lack adequate representations of the neural systems that contribute to the resulting movement patterns, particularly in neurological impairments such as stroke [31]. Machine learning methods to capture dynamics have been used across physics, engineering, and neuroscience to learn the dynamics underlying complex systems when the governing equations are unknown [58,82,83]. Recently, machine learning models have been used in human gait to predict continuous kinetic variables such as ground-reaction forces [84] or joint torque [85,86] based on kinematic data. Dynamical machine learning models have also been used to encode gait dynamics, including responses to perturbations or assistive devices, but their model structure did not enable comparisons between individuals [8791]. Here, our RNN-based gait dynamics model provides a means to capture the rules underlying continuous, multi-joint coordination between bilateral lower limb joints, and how they evolve over time. Accordingly, we do not explicitly capture mechanical dynamics (i.e., the relationship motion and force), but the effects of force interactions within the body and environment and implicitly represented in how multi-joint kinematics evolve over time, with the network parameters and the internal states at each time point determining the output kinematics.

As gait arises from complex interactions between the nervous system and the musculoskeletal system that are not easily modeled from first principles, a data-driven approach provides a powerful framework for capturing and comparing neuromechanical constraints on gait dynamics. While biomechanical dynamics clearly play a role in movement, the activation of muscles by the nervous system enables the body to perform a variety of motor behaviors. However, the governing spatiotemporal dynamics of neuromuscular signals are poorly understood, especially in neuro-pathologies such as stroke. During behaviors such as locomotion, motor patterns can be characterized by the number and structure of motor modules, or muscle synergies, defining groups of co-activated muscles producing a biomechanical function for gait [92]. Similar motor modules are used within individuals across different task conditions [9395], and are shaped by learning and disease [96,97]. Particularly in post-stroke gait, motor modules appear to constrain motor function. Fewer motor modules are observed post-stroke with the number of modules correlated to reduced walking speed [98,99]. Further, different patterns of motor module merging are seen in slower walking stroke survivors, differentially affecting gait biomechanics in a manner that may necessitate individualized rehabilitation approaches [100]. Adding neural constraints such as motor modules on muscle activations in musculoskeletal simulations improve predictions of key physiological variables such as joint loading in osteoarthritis [101]. However, relating motor modules to kinematic gait patterns post-stroke and in other neurological disorders has been challenging, likely because the neural constraints are underspecified [27,102105]. Corroborating results from motor module analysis, there were greater differences in gait dynamics amongst the slowest walking stroke survivors. Since the gait signatures capture spatiotemporal constraints underlying gait dynamics, they provide a complementary approach to musculoskeletal simulations. Ultimately, gait signatures may play a complementary role to biophysical simulations, enabling the relationships between biomechanical principles, neural constraints, and the emergent gait dynamics to be revealed.

Gait signatures enable holistic comparison of gait dynamics across individuals, speeds, and groups

In contrast to other applications of dynamical machine learning models for gait, we capture multiple individuals within a single network, enabling comparisons of gait dynamics across groups, individuals, and gait conditions. Rather than using the network as a black box solely to generate predictions, we explicitly compare and interpret the model’s internal parameters to identify low-dimensional latent variables representing gait dynamics. To encourage a generalizable data-driven gait dynamics model, we omitted subject and trial condition (gait speed) labels as inputs to the neural network. Adding input labels might force the RNN to create separable gait models, whereas our goal was to have the network learn a structure that could be modified parametrically to represent individual differences in the neuromechanics of walking. Similarly, neuromusculoskeletal models assume common dynamic principles across individuals, using parameter variations to represent individual differences [27,28,103,104,106]. We intentionally designed a relatively simple RNN architecture (e.g., single layer, linear input/output) as a starting point to recover as much interpretability as possible, with the awareness that more complexity could be added to the model architecture (number of hidden layers, number of neurons, etc.) if required to fit a given data set robustly. The representations of gait dynamics that emerge from our model holistically capture the changes underlying measured kinematics, without being attributable to specific neural or biomechanical constraints. The loss of physiological interpretability is counterbalanced by the holistic approach to representing gait dynamics and explaining gait kinematics features.

Analogous to written signatures, we find that features of individual-specific gait signatures are largely preserved across walking speeds. Recognizable qualitative features of handwriting are preserved even as the size of letters changes quantitatively, or if different limbs, or writing instruments are used [107]. Similarly, it is well known that individuals can be recognized based on how they move or walk [28,40,108110], even if joint angle excursions are similar. We show that gait dynamics are more similar within individuals across speeds than between individuals, leading to the concept of the gait signature. In contrast, gait kinetics and kinematics vary characteristically across speeds, such that they cannot be directly compared across speeds [111]. The relatively small changes in gait signatures across speeds suggest that the signatures reflect changes in the spatiotemporal relationships between joint kinematics, rather than quantitative changes in their magnitude. As such, gait signatures appear to encode individual-specific constraints of walking, making it possible to compare gait either within or between individuals walking at different speeds.

Gait signatures characterize the high inter-individual variability in gait impairment amongst stroke survivors beyond overall gait function explained by clinical gait metrics. This heterogeneity is a direct reflection of the wide range of impairments in stroke survivors, including muscle weakness, impaired coordination, spasticity, abnormal synergistic activation (muscles not independently coordinated), and compensatory motion [19,53,112]. We found that higher-functioning stroke survivors were more dynamically similar to each other, whereas lower functioning stroke survivors were more dispersed. In fact, two low-functioning stroke survivors with similar clinical metrics (Fugl-Meyer score and gait speed) had quite different gait signatures. As such, gait signatures have the potential to provide insights into individual differences in gait dynamics that are simply not captured by clinical metric such as gait speed. Moreover, in contrast to higher-functioning stroke survivors who share similar gait dynamics, lower-functioning stroke survivors may require more individuals individualized rehabilitation approaches targeting specific aspects of gait dysfunction. Further gait signatures do not require a priori selection of which gait variables to compare [113116]. As such gait signatures provide a powerful, holistic approach to enhance the specificity and precision of gait diagnosis and treatment. Our study inclusion criteria exclude severe contractures or deformities that interfere with normal ambulation and in future work the gait signatures would need to be interpreted and correlated with clinical evaluation of strength, range of motion, sensorimotor impairment, and/or limb deformities. The demographic and clinical information of the stroke participants in our study are available in Supplementary Materials S1 Table. Gait signatures could be part of a set of multi-modal data to account for the diverse causal factors underlying each individual’s gait pattern (e.g., lesion neuroanatomy, medical confounding variables, musculoskeletal conditions, psychosocial variables, physiological contributors to gait and environmental factors). This framework can potentially extend to other diseases, disorders, injury, etc. to gain further insight into individuals’ specific impairments and uncover specific targets towards developing targeted therapies for individuals.

Gait signatures enable biomechanical interpretation and manipulation

Our gait dynamics model enables biomechanical interpretation of gait signatures and exploring “what if” scenarios to sculpt desirable gait dynamics. Gait signatures are based on principal components (PCs) of the gait model internal states, where the weightings on each PC vary over the gait cycle. The model parameters can be prescribed over the gait cycle, resulting in the predicted kinematic outputs (i.e., joint angles). The gait signature PCs and their time-varying weightings can be individually prescribed in the network as a method to reveal the specific inter- and intra-limb coordination patterns governed by each PC. Further, any combination of PC’s can be combined and reweighted to generate new kinematic output patterns. For example, we interchanged healthy and impaired PCs to gain deeper insight into how specific impaired PCs alter healthy gait and vice versa. Further, interpolating gait dynamics can predict gait kinematics at walking speeds that were not used in the training data. Especially when there was a nonlinear response in gait kinematics across speeds, interpolation of gait dynamics to predict gait kinematics performed better than interpolating gait kinematics directly. As such our data driven gait dynamics model can be used to show how changing select components of the gait signature alters gait kinematics, providing a potential framework to identify personalized therapeutic targets for gait rehabilitation.

Gait signatures have potential to predict future kinematics

Another powerful aspect of our gait signatures framework is its ability to generate future gait kinematics in the absence of new data. The model is self-driving for able-bodied individuals, predicting multiple cycles of gait kinematics in the future. However, the ability of the model to predict future stroke kinematics is limited to approximately one gait cycle in the future; rendering it promising in applications that provide control signals to rehabilitation devices (e.g., exoskeletons). Furthermore, our model is likely only capable of generalizing to speeds within the speed ranges of the input data. There was a moderate association between participants’ similarity to able-bodied gait signatures (distance to the able-bodied centroid) and the RNN’s ability to predict gait kinematics over one gait cycle (S6 Fig). This association is likely due to the RNN favoring able-bodied dynamics during model fitting, which were more homogeneous than those of high- or low-functioning stroke survivors. A larger post-stroke sample may improve the RNN’s ability to encode and predict pathological gait dynamics. Further, the reduced predictive power for the stroke participants can be attributed to our model architecture’s relative simplicity and short time-series (15 seconds/ 1500 sample points per trial). These factors should be addressed to improve the predictive capacity of the model for impaired gait in the future. Additionally, including more variables besides sagittal plane kinematics (e.g., frontal plane and coronal plane kinematics and joint forces, may improve learning of the underlying dynamics of gait and increase predictive capability of our model.

Generalization to other species and rhythmic movements

Because the input to this model are periodic sequences of behaviors, our gait dynamics framework should generalize to other species that display similar behavioral motions (e.g., flight, crawling, and walking). Physicists, computational biologists, and other scientists can benefit from this method by studying the dynamical behavior of species whose neuromechanical models and physics of complex terrains are difficult to model. This is the first study to our knowledge that uses a neural network to study the dynamics of gait in an interpretable manner. While much work is left to be done, we have provided a simplistic, unsupervised framework to discover individual-specific differences in walking in health and disease in humans. Despite being limited by a small dataset, we have shown that our model is generalizable to characterizing and predicting kinematics of one held-out subject using leave one out cross validation (S3 Fig). Here we focus on demonstrating the innovation, feasibility, and potential advantages of our RNN gait signature approach, justifying the need and potential for further development by scaling to larger-sample studies.

Importantly, this methodology relies on having a periodic or quasi-periodic pattern, as non-periodic patterns would not be able to generate a phase and subsequent signature. We also limited our inputs to gait kinematics, anticipating applications to the proliferation of new measurement modalities for movement in humans and animals such as wearable sensors and markerless video-based motion capture [117119]. However, the gait signatures framework could easily be extended to include other data types (e.g., force, muscle activity, joint loadings, center of mass dynamics) and experimental conditions (overground walking, biomechanical constraints, gait interventions, such as exoskeletons, functional electrical stimulation, or treatment e.g., drugs, optogenetics). In practice, a more comprehensive data set would be needed within each gait group to train a model capable of capturing the full range of variability in gait dynamics. Short of having a massive data set, it may also be possible to leverage synthetic gait data from simulations to span the full range of feasible gait dynamics variations.

Overall, by modeling the dynamics of individual’s gait based on measured data, we uncovered individual-specific representations of individuals’ neuromechanical constraints that allows direct comparisons between individuals who do not walk at the same speed. The gait signatures framework has implications for the diagnosis of disease, development of future tailored gait therapies or interventions and tracking meaningful changes in the fundamental neuromechanical mechanism of walking.

Materials and methods

Ethics statement

All participants provided written informed consent prior to study participation, and the study protocol was approved by the Emory University Institutional Review Board.

Human subject participants

To develop dynamical signatures of human gait, we collected data in seven post-stroke individuals (age = 56 ± 12 years; 2 females; 48 ± 25 months post-stroke; Lower Extremity Fugl-Meyer = 20 ± 4) and five able-bodied (AB) controls (age = 24 ± 4 years; 4 female). All post-stroke participants experienced a cortical or subcortical ischemic stroke, were able to walk on a treadmill for one minute without an orthotic device, and exhibited no signs of hemi-neglect, orthopedic conditions limiting walking, or cerebellar dysfunction.

Experimental design

Participants completed 15-second walking trials at six different speeds, distributed evenly between and ranging from each participant’s self-selected (SS) speed to the fastest safe and comfortable speed. Across stroke participants, gait speeds ranged from 0.3–1.6 m/s. Each participant’s fastest walking speed was determined by progressively increasing the treadmill speed from the SS speed until the participant could no longer comfortably or safely maintain the speed for 30 seconds. Participants rested for 1–2 minutes between consecutive gait trials. During data collection, speed increased from the participant’s SS to their fastest speed (i.e., not randomized).

Data acquisition

Reflective markers were attached to the trunk, pelvis, and bilateral shank, thigh, and foot segments [120]. We collected marker position data while participants walked on a split-belt instrumented treadmill (Bertec Corp., Ohio, USA) using a -7-camera motion analysis system (Vicon Motion Systems, Ltd., UK). Participants held onto a front handrail and wore an overhead safety harness that did not support body weight. Marker data were collected at 100 Hz, and synchronous ground reaction forces were recorded at 2000 Hz and were down sampled to 100Hz using previously established techniques [121123].

Data processing

Raw marker position data were labeled, gap-filled, and low-pass filtered in Vicon Nexus. Labeled marker trajectories and ground reaction force raw analog data were low-pass filtered In Visual 3D. Gait events (bilateral heel contact and toe-off) were determined using a 20-N vertical GRF cutoff, and sagittal-plane hip, knee, and ankle joint kinematics were calculated in Visual 3D (C-Motion Inc., Maryland, USA).

RNN model development

Our goal was to start with a simple RNN to reduce overfitting with too many parameters and deep layers. We wanted the simplest model capable of learning the dynamics underlying gait which also preserved interpretability. The simplest recurrent neural network (RNN) model architecture consisted of one input layer, one hidden layer and one output layer. The hidden layer was composed of long short-term memory (LSTM) units with a lookback parameter that spanned at least one gait cycle. Model hyperparameter selection is described in a later paragraph.

RNN model training

Model fitting on our selected dataset and architecture was executed on the order of minutes to tens of minutes, using Keras 3.7.13 and TensorFlow 2.8.2 on Google Colab’s standard GPU with high-RAM runtime (54.8 gigabytes). The RNN model was trained using bilateral, sagittal-plane, lower-limb joint angles from 5 able-bodied (AB) participants and 7 stroke survivors each walking on a treadmill at 6 steady speeds, ranging from each participant’s preferred speed to the fastest safe speed. Our training dataset was input to the RNN in multivariate format (not concatenated) [62,63]. We trained a sequence-to-sequence RNN with 512 long-short-term memory (LSTM) activations units in the single hidden layer, capable of using 15 seconds (sample rate of 100 Hz) time-series kinematic input data (0 to T-1) to predict kinematics one time-step in the future (1 to T) for all training data across individuals and speeds. Our data was batched according to the number of total trials (N = 72); thus, the LSTM maintains its internal state while a batch is being processed, after which the internal state can be maintained or cleared. Because our network retains its internal state from one time step to the next (i.e., the RNN is stateful), we have fine-grained control over when the internal state of the LSTM network is reset. The input data from all trials was ‘mini-batched’ into 2 training batches and 1 validation batch (499 samples each) that were used to update model weights on each model run (epoch). To format our data into equal length input and output mini batches for training and account for the output data being a one-time step shifted version of our input data, our lookback parameter must be one value less than a divisor of the trial length. For example, in our dataset (1500 sample length trials), a lookback parameter of 499 would result in the first mini batch input of samples [0:499] which will predict our reference output samples [1:500], our 2nd mini batch input data would include samples [501:999] and corresponding output [502:1000] and the last mini-batch input of samples [1001: 1499] predicts samples [1002: 1500]. This lookback parameter of 499 allows us to construct 3 mini-batches of shorter input and output data lengths which would be used to train and validate the RNN model (2:1 training: validation mini batch split). Similarly with the lookback parameter of 499 (2:1 training: validation mini-batch split) and 749 (1:1 training: validation split). Mean squared error was used as the LSTM loss function and ADAM as the optimization algorithm because it is fast, has a small memory footprint and is well suited for large-parameter deep learning models [124]. The model was trained for at least 5000 iterations or until training and validation error converged (< 0.75°). The training resulted in a sample-specific dynamical model structure defined by a single set of LSTM network weights (W). The model’s internal states capture trial-specific dynamics predicting the time evolution of joint kinematics; activation coefficients (H) and memory cell states (C) and are tuned based on kinematic input.

Model hyperparameter selection

We selected the hyperparameter values of 512 nodes in the LSTM layer and a 499-sample LSTM lookback length (number of samples preceding the current time point that is used to train the LSTM) were selected based on training and validation loss, as well as the ability to encode dynamics over short and long timescales. In two steps, we evaluated all pairs of the following hyperparameter values: 1024, 512, 256, and 128 LSTM nodes and 749, 499, and 249-sample lookback parameters. Because RNN performance can change with the parameters used to initialize the RNN, we fit an RNN gait dynamics model 20 times using random initial parameters, for each hyperparameter pair. First, we compared model training and validation loss for each hyperparameter pair: the ‘best’ hyperparameter pair would have low training and validation loss. The following [node-lookback] pairs were considered the best hyperparameter pairs: 512–499 (MSEtrain = 0.010 ± 0.001 deg2; MSEval = 0.018 ± 0.000 deg2), 256–749 (MSEtrain = 0.010 ± 0.002 deg2; MSEval = 0.015 ± 0.001 deg2), 256–499 (MSEtrain = 0.010 ± 0.001 deg2; MSEval = 0.017 ± 0.000 deg2). (S1A Fig). The training loss was not different between hyperparameter pairs (p > 0.235). The validation loss differed between all three models (p < 0.001), with the 256–749 model having the lowest validation loss. However, if the differences in validation loss of less than 0.003 deg2 corresponded to meaningful differences in performance was unclear.

Our second analysis was, therefore, used to compare the three hyperparameter pairs deemed best in the prior analysis. Here, we evaluated the models’ abilities to encode the average dynamical behavior over long timescales (long-time) and the stride-to-stride behavior (short-time). We defined the best model as the one with the highest long- and short-time performance. The following analysis was performed for 10 of the 20 random initializations. For long- and short-time analyses, we created a single set of reference dynamics as done in the manuscript: we performed one time-step predictions over the full (1500-sample) time-series. This step provided best-case predictions of the gait dynamics (S1B Fig).

Long-time performance

We generated long-time predictions of each trial’s gait signatures (RNN latent states) by simulating each participant’s gait dynamics forward in time, 1500 samples into the future. Each simulation was initialized by setting the RNNs’ latent states to those of the last sample of the trial’s reference dynamics and using the last sample of the trial’s kinematics. We then phase-averaged both the reference dynamics and the long-time predictions using the same technique as described in the main manuscript. Long-time performance was defined as the similarity of the phase-averaged latent states (i.e., the gait signatures) between the reference and the long-time predictions and was quantified using R2. Note that using R2 as a similarity metric, rather than the Euclidean distance metric used in the main manuscript, was needed to compare models with different numbers of nodes. Unlike R2, Euclidean distances are sensitive to the number of samples used to compare models, which would bias short- and long-time performance towards models with fewer nodes. Low R2 values between predictions indicates that the learned dynamics are sufficiently complex to capture instantaneous gait dynamics but can also accurately generate the time evolution of dynamics over the gait cycle—a major challenge in data-driven models of locomotion [8990].

The 512-node model captured gait dynamics over long time scales significantly better (i.e., more accurate predictions of the time-varying dynamics) than the 249-node models (S1B Fig). For long-time predictions, the 512-node model predictions (R2 = 0.50 ± 0.46) were better than the 249-node 499-sample lookback model (ΔR2 = 0.27 ± 0.06; p < 0.001; independent-samples t-tests) and the 249-node 499-sample lookback model (ΔR2 = 0.31 ± 0.07; p < 0.001).

Short-time performance

We generated short-time predictions by simulating single strides in each trial’s time-series, initialized from the first sample of each stride. Initialization used the latent RNN states and kinematics of the reference dynamics at the onset of a new stride (phase = 0 rad). For each initial condition, we integrated the dynamics forward in time, up to the onset of the next stride. For each stride, we then compare the similarity of the reference dynamics to the dynamics of the corresponding short-time prediction using R2. Short-time performance was quantified as the average R2 value across trials for a single model and initialization.

The 512-node model captured gait dynamics over short time scales significantly better (i.e., more accurate predictions of the time-varying dynamics) than the 249-node models (S1B Fig). For short-time predictions, the 512-node model predictions (R2 = 0.11 ± 0.51) were more accurate than the 249-node 499-sample lookback model (ΔR2 = 0.51 ± 0.13; p = 0.055; independent-samples t-tests) and the 249-node 499-sample lookback model (ΔR2 = 0.34 ± 0.09; p < 0.001). Based on difference in short- and long-time prediction performance, we selected the 512-node, 499-sample lookback hyperparameters for the RNN model.

Leave-one-out subject model evaluation for generalizability

Using the selected hyperparameters, 12 different models were trained where one different subject (all 6 speed trials per subject) was held out for evaluation on each model run. The same model architecture, training and validation setup was used as the original model trained using the full dataset (12 subjects). The minimum training loss, validation loss, and overall evaluated test loss for each model were extracted and box plots of each generated. The Wilcoxon Rank-Sum Test statistic was used to compare the means. Each model was evaluated on the 6 held-out speed trials from training and an average loss was calculated for each model. The reference kinematics, externally driven and self-driven predictions of each of the 6 held-out trials per model were phase averaged and R2 between the phase averaged externally driven and long-time self-driven predictions (see Long-Time Performance section, above) were calculated. Box plots for each metric across the held-out trials were generated and the Wilcoxon Rank-Sum Test statistic used to compare the means.

Computing gait signatures from RNN internal states

To develop the gait signatures, we extract the activation and cell states from the LSTM (denoted “H” and “C” respectively) which evolve over time (the course of the gait cycle) as the kinematics of each trial are fed through the trained RNN. These H and C parameters represent how the model’s internal parameters change as it encodes the prediction of future kinematic trajectories. The selected 512-node LSTM layer had 512 H parameters and 512 C parameters. Time-varying gait signatures were computed by identifying dominant modes of variation in the internal states using principal components analysis (PCA). A single PCA operation was used to transform the internal states for all participants into a common basis. Consequently, inter-trial differences in the time-varying activations of the principal components (modes) reflect differences in the underlying dynamics of the individual(s). These activations constituted the time-varying (1500 sample) gait signatures, which had the same dimension as the RNN’s hidden layer (1024 units). However, the first six principal components accounted for ~72% of the variance in the internal states.

To compare gait signatures within and between individuals, we phase averaged each trial’s signatures across strides. Rather than linearly interpolating the data between foot contact events before averaging, as is common in gait analysis [122,125127] we computed a continuous phase using the first 3 gait signature modes for each trial using Von Mises interpolation [128]. Compared to averaging across linearly interpolated strides, phase averaging is expected to reduce the variance in the data at any point in the stride [66,90]. As the domain for interpolation, we estimated the time-varying phase for each trial separately using the Phaser algorithm, using the first 3 principal components as phase variables [66]. To align phase estimates across trials, we defined zero-phase as the maximum of the first principal component.

Gait event estimation of phase averaged signatures

The force plates embedded in the treadmill captured precise gait event timing information (left heel strike, right toe off, right heel strike, left toe off) across individuals’ trials which we represented as a vector of 1’s, 2’s, 3’s and 4’s, respectively (ground truth markings for the 4 gait events). We leveraged the Phaser algorithm again [66] to develop a phase estimator to transform these 4 gait events over time into gait events over phase. For each trial, we determined the mode phase that corresponded to each of the 4 gait events to gain a representation of where the 4 gait events occurred during phase averaged dynamics (0–2π) for each trial.

Interpolation of unseen speed gait signatures to reconstruct kinematics

To demonstrate the generalizability of gait dynamics, we show that linearly interpolating gait signatures to predict gait kinematics at new walking speeds is more accurate than linearly interpolating the kinematics themselves. We trained another RNN model with the same architecture and hyperparameters to the first model, however using only the 2 slowest and 2 fastest speeds from each participant (i.e., we held out the 2 middle speed trials from each participant). We then linearly interpolated the 2-middle speeds’ internal states and ran the data through the trained RNN to reconstruct or predict kinematics. We compared the original phase averaged kinematics to the predicted kinematics resulting from each of the two linear interpolations (dynamics and kinematics) using the coefficient of determination. Furthermore, even when we reduced the dimensionality of the model’s internal states from the full 1024 to the first 6 principal components (the selected dimension on the gait signatures), it still performed better than interpolating kinematics (also rank = 6).

Biomechanical interpretation of principal components of the gait signature

To reconstruct kinematics from the corresponding underlying dynamics (internal state representations), we restored our trained model’s weights to a new model using the ‘model.set’ and ‘model.get_weights’ Keras built in functions. The function ‘model.predict’ takes in the hidden state values (Hs) only (first 512 of the 1024 internal-state time trajectories) and predicts the corresponding kinematics for the provided internal states. Using this framework, we provided this new model with independent principal component representations of individuals’ hidden states and visualized the corresponding kinematics through stick figure movie representations of the resulting kinematics over the walking trials.

Predicting time evolution of kinematics from an initial posture (self-driving)

Our trained generative gait model can take in a single initial posture of size (6,1) corresponding to a single time point representation of each of the 6 joint angles to predict the next time step posture/kinematics using command ‘model.predict’. To make further predictions, the predicted value is used as the new initial condition (posture) and predictions are made on a one-time step basis in a similar fashion for a pre-specified prediction length (self-driving). However, it is important to note that even though our current framework only predicts one time-step in the future, the LSTM layer remains stateful through each gait cycle, allowing the model to learn much longer timescales as seen when self-driving the network (S7 Fig).

Supporting information

S1 Video. Stick figure movie demonstrating the RNN’s kinematic reconstruction of all the PCs of dynamics.

(MP4)

S2 Video. Stick figure movie demonstrating that PC1 encodes dynamics driving hip flexion and extension.

(MP4)

S3 Video. Stick figure movie demonstrating that PC2 encodes dynamics driving knee flexion and extension.

(MP4)

S4 Video. Stick figure movie demonstrating that PC3 encodes dynamics driving postural coordination.

(MP4)

S5 Video. Stick figure movie demonstrating the kinematic reconstruction when AB2’s PC1 projection is replaced with ST4’s impaired PC1 projection.

(MP4)

S6 Video. Stick figure movie demonstrating the kinematic reconstruction when AB2’s PC3 projection is replaced with ST4’s impaired PC3 projection.

(MP4)

S7 Video. Stick figure movie demonstrating the kinematic reconstruction when AB2’s PC4 projection is replaced with ST2’s impaired PC4 projection.

(MP4)

S8 Video. Stick figure movie demonstrating the kinematic reconstruction when ST3’s impaired PC5 projection is replaced with AB2’s PC5 projection.

(MP4)

S1 Fig. Comparison of model performance on training and validation loss (left), and long- and short-time prediction performance (right).

In both plots, small dots represent the average values across trials for each random initialization of each model. Large dots and bars denote the average and standard deviation of model performance metrics across initializations. Left: Training and validation loss (RMSE) for all 12 hyperparameter pairs. Models in the lower-left consider are considered better. Right: Long- and short-time prediction performance (R2) for the 3 hyperparameter pairs with the lowest training and validation loss. Models in the upper-right corner are considered better.

(TIF)

S2 Fig. RNN model training (green) and validation (blue) loss curves.

(TIF)

S3 Fig. RNN dynamic learning generalizes across 12 leave-one-individual-out models.

The minimum train loss (blue) and validation loss (green) was low (< 0.02 degrees2) for 5 models that were trained each with a single able-bodied individual held out of the training data (A, i) and 7 models each with a stroke individual held out of training data (A, ii). The magnitude and range of the test loss (evaluation of model on the held-out data) (orange) was higher than the respective minimum training and validation losses for both held-out able-bodied (A, i) and stroke (A, ii) models. The magnitude and range of test losses evaluated on held-out able-bodied individuals, however, were lower than models evaluated on held-out stroke data. The models generate external predictions (blue) of held-out test trials with higher R2 values than that of self-driven predictions (red) in models evaluated on both able-bodied (B, i) and stroke trials (B, ii). The models can generate external kinematic predictions (blue) of held-out able-bodied (B, i) trials better than that of stroke (B, ii). Self-driven predictions of stroke kinematics were generally very low (R2 values below 0.5). Models were incapable of generating self-driven predictions for 5 of 30 able-bodied trials and 16 of 42 stroke trials. (C) shows reference (black), externally driven (blue) and self-driven (red) phase averaged kinematic predictions for an exemplary able-bodied trial (C, i) and exemplary stroke trial (C, ii). Models can predict kinematics of held-out able-bodied trials better (higher R2) than that of stroke.

(TIF)

S4 Fig. Cumulative proportion of variance explained by the first 100 principal components of gait dynamics.

Six (6) principal components (PCs) explained 77% of the variance in the gait dynamics. The top 6 dominant PCs were used to develop the gait signature.

(TIF)

S5 Fig. Support vector machine cross-validation classification accuracy of four different gait descriptors (discrete variables, gait signatures, kinematics, and a combination of kinematics & joint velocity) for discrimination between: a) gait group (able-bodied vs. stroke) and B) individuals.

Using k = 25 folds, RNN gait signatures distinguished between impaired and unimpaired gait with 100% accuracy, along with the 26 discrete variables (100%, p = 1), whereas kinematic (92.67 ± 0.15%, p < 0.05) and kinematics & velocity (88.67 ± 0.17%, p <0.05) discrimination were significantly lower. Using k = 6 folds, SVM classification of individuals was most accurate using RNN gait signatures and discrete variables (100%), lower using kinematics (88.9 ± 0.13%, p = 0.061) and significantly lower using a kinematics and velocity (68.10 ± 0.16%, p < 0.05).

(TIF)

S6 Fig. Relationship between participants’ similarity to able-bodied gait signatures and the RNN’s ability to predict gait kinematics over one gait cycle.

There exists a negative correlation between self-driven R2 and Euclidean distance to the AB centroid that is statistically significant at the 0.05 level (S6 Fig, dots). Lower functioning stroke survivors are located further away from the able-bodied centroid, however remarkably all but one (outlier) of the R2 values are above 0.73 in both high and lower-functioning individuals. Even though the model can better predict able-bodied future kinematics better than stroke (R2 values above 0.8), the ability of our model to predict at least a single gait cycle of future stroke kinematics with R2 above 0.73 is promising. To provide a control for this analysis, we found the average R2 values between the self-driven prediction of the last full gait cycle of each trial and 5 randomly selected gait cycles from other randomly selected individuals in the same gait group (able-bodied vs. stroke) (S6 Fig, stars). This control would effectively reveal whether our model learned each individual’s specific gait pattern or was just producing arbitrary (averaged) gait patterns. We expect that if the model learned individuals specific gait patterns, then the distribution of the R2 values of the self-driven kinematic predictions would be significantly different to that of the averaged R2 values corresponding to arbitrary gait patterns. In fact, the averaged R2 of the generic gait cycle comparisons ranged from 0.56 to 0.95 (compared to 0.61–0.99 in self-driven predictions). Further, the distributions of the R2 outputs for both able-bodied and stroke individuals were significantly different (at the 0.05 level) to self-driven R2 outputs with p-values 5.5x10-6 and 5.4x10-3 respectively using the Wilcoxon Rank-Sum test. This reveals that even for stroke survivors the model has learned their dynamics in some capacity and the model isn’t simply predicting arbitrary gait patterns.

(TIF)

S7 Fig. Graphical summary and pseudo code of our gait signatures framework and algorithm.

A) Training dataset: Using the 6-dimensional gait trajectory, the inputs (green) were concurrent segments from the gait trajectory, each one 499-time steps long. The outputs (blue) were 1-shifted (in time) segments of inputs. B) Model architecture: Our model consisted of an input layer, a hidden layer composed of 512 LSTM units, and a 6-unit Dense output layer. C) Stateful training: The hidden state of an RNN at time t is a function of the input at time t and the hidden state at time t-1. The model starts with processing the first mini-batch, calculating a new hidden state at each t and predicting gait kinematics at time t+1 given kinematics data at t. At the end of the mini-batch processing, the model calculates MSE over the entire mini-batch to calculate error for backpropagation and to update model weights. The final hidden state h(t+L) is used as the initial hidden state for generating predictions for the next mini batch (L is the temporal length of each mini-batch). The hidden state is initialized as zero before processing the first mini-batch.

(TIF)

S1 Table. Stroke participant’s demographics and clinical scores.

(TIF)

Data Availability

To ensure rigor, reproducibility, and promote open science, all software is shared under a GNU GPL 3.0 license and on GitHub. Links to Google Colab notebooks enable our software to be run on the cloud for users without computational resources. The gait signature code developed and used in this paper has been deposited at GitHub: https://github.com/bermanlabemory/gait_signatures. The RNN model and code was developed in Python programming language using built-in Python-based libraries such as Keras, Pandas, and NumPy. We revised the Phaser algorithm (https://github.com/sheim/phaser) to estimate phase for our kinematic time trajectories in the development of our phase averaged dynamics per trial. Shareable Jupyter Notebooks were developed on the Google Colab platform. The data analysis of the generated gait signatures was conducted in MATLAB 2022a (MathWorks).

Funding Statement

This work was supported by the Alfred P. Sloan Foundation’s Minority Ph.D. (MPHD) program: G-2019-11435, NSF GRFP 1937971 and NICHD F31HD107968 for TSW, NICHD F32HD108927 for MCR, HFSP RGY0076/2018 and NSF Physics of Living Systems Student Research Network (PHY-1806833) for KJ, NICHD R01HD095975 for TMK, the McCamish Foundation for TSW, MCR and LHT, NSF CMMI 1762211 / 1761679 for LHT, the Simons-Emory International Consortium on Motor Control (Simons Foundation, 707102) for TSW, KJ and GJB and an Emory University Nexus/Synergy II Grant for all authors. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Frisk RF, Lorentzen J, Nielsen JB. Contribution of corticospinal drive to ankle plantar flexor muscle activation during gait in adults with cerebral palsy. Exp Brain Res. 2019. Jun;237(6):1457–67. doi: 10.1007/s00221-019-05520-3 [DOI] [PubMed] [Google Scholar]
  • 2.Ivanenko YP, Grasso R, Zago M, Molinari M, Scivoletto G, Castellano V, et al. Temporal components of the motor patterns expressed by the human spinal cord reflect foot kinematics. J Neurophysiol. 2003. Nov;90(5):3555–65. doi: 10.1152/jn.00223.2003 [DOI] [PubMed] [Google Scholar]
  • 3.Krogh S, Aagaard P, Jønsson AB, Figlewski K, Kasch H. Effects of repetitive transcranial magnetic stimulation on recovery in lower limb muscle strength and gait function following spinal cord injury: a randomized controlled trial. Spinal Cord. 2022. Feb;60(2):135–41. doi: 10.1038/s41393-021-00703-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Young DR, Banks CL, McGuirk TE, Patten C. Evidence for shared neural information between muscle synergies and corticospinal efficacy. Sci Rep. 2022. May 27;12(1):8953. doi: 10.1038/s41598-022-12225-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Genthe K, Schenck C, Eicholtz S, Zajac-Cox L, Wolf S, Kesar TM. Effects of real-time gait biofeedback on paretic propulsion and gait biomechanics in individuals post-stroke. Top Stroke Rehabil. 2018. Apr;25(3):186–93. doi: 10.1080/10749357.2018.1436384 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Pan YT, Kang I, Joh J, Kim P, Herrin KR, Kesar TM, et al. Effects of Bilateral Assistance for Hemiparetic Gait Post-Stroke Using a Powered Hip Exoskeleton. Ann Biomed Eng. 2022. Aug 13; doi: 10.1007/s10439-022-03041-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Reisman DS, Bastian AJ, Morton SM. Neurophysiologic and rehabilitation insights from the split-belt and other locomotor adaptation paradigms. Phys Ther. 2010. Feb;90(2):187–95. doi: 10.2522/ptj.20090073 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Wang S, Pai YC, Bhatt T. Neuromuscular mechanisms of motor adaptation to repeated gait-slip perturbations in older adults. Sci Rep. 2022. Nov 18;12(1):19851. doi: 10.1038/s41598-022-23051-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.D’souza J, Natarajan DM, Kumaran D DS. Does the Environment Cause Changes in Hemiparetic Lower Limb Muscle Activity and Gait Velocity During Walking in Stroke Survivors? J Stroke Cerebrovasc Dis. 2020. Oct;29(10):105174. doi: 10.1016/j.jstrokecerebrovasdis.2020.105174 [DOI] [PubMed] [Google Scholar]
  • 10.Larsen RJ, Queen RM, Schmitt D. Adaptive locomotion: Foot strike pattern and limb mechanical stiffness while running over an obstacle. J Biomech. 2022. Oct;143:111283. doi: 10.1016/j.jbiomech.2022.111283 [DOI] [PubMed] [Google Scholar]
  • 11.Attwood AS, Ludwig CJH, Penton-Voak IS, Poh J, Kwong ASF, Munafò MR. Effects of state anxiety on gait: a 7.5% carbon dioxide challenge study. Psychol Res. 2021. Sep;85(6):2444–52. doi: 10.1007/s00426-020-01393-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Elkjær E, Mikkelsen MB, Michalak J, Mennin DS, O’Toole MS. Motor alterations in depression and anxiety disorders: A systematic review and meta-analysis. J Affect Disord. 2022. Nov 15;317:373–87. doi: 10.1016/j.jad.2022.08.060 [DOI] [PubMed] [Google Scholar]
  • 13.Steptoe A, Zaninotto P. Lower socioeconomic status and the acceleration of aging: An outcome-wide analysis. Proceedings of the National Academy of Sciences. 2020. Jun 30;117(26):14911–7. doi: 10.1073/pnas.1915741117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zaninotto P, Sacker A, Head J. Relationship between wealth and age trajectories of walking speed among older adults: evidence from the English Longitudinal Study of Ageing. J Gerontol A Biol Sci Med Sci. 2013. Dec;68(12):1525–31. doi: 10.1093/gerona/glt058 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Garcia SA, Johnson AK, Brown SR, Washabaugh EP, Krishnan C, Palmieri-Smith RM. Dynamic knee stiffness during walking is increased in individuals with anterior cruciate ligament reconstruction. J Biomech. 2022. Nov 25;146:111400. doi: 10.1016/j.jbiomech.2022.111400 [DOI] [PubMed] [Google Scholar]
  • 16.Jang J, Wikstrom EA. Ankle joint contact force profiles differ between those with and without chronic ankle instability during walking. Gait Posture. 2022. Nov 25;100:1–7. doi: 10.1016/j.gaitpost.2022.11.012 [DOI] [PubMed] [Google Scholar]
  • 17.Milner CE, Foch E, Gonzales JM, Petersen D. Biomechanics associated with tibial stress fracture in runners: A systematic review and meta-analysis. J Sport Health Sci. 2022. Dec 5;S2095-2546(22)00116-8. doi: 10.1016/j.jshs.2022.12.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ijmker T, Lamoth CJC. Gait and cognition: the relationship between gait stability and variability with executive function in persons with and without dementia. Gait Posture. 2012. Jan;35(1):126–30. doi: 10.1016/j.gaitpost.2011.08.022 [DOI] [PubMed] [Google Scholar]
  • 19.Jonkers I, Delp S, Patten C. Capacity to increase walking speed is limited by impaired hip and ankle power generation in lower functioning persons post-stroke. Gait Posture. 2009. Jan;29(1):129–37. doi: 10.1016/j.gaitpost.2008.07.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Moura Coelho R, Hirai H, Martins J, Krebs HI. Biomarkers for rhythmic and discrete dynamic primitives in locomotion. Sci Rep. 2022. Nov 23;12(1):20165. doi: 10.1038/s41598-022-24565-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Prosser LA, Atkinson HL, Alfano JM, Kessler SK, Ichord RB. Walking speed influences spatiotemporal but not symmetry measures of gait in children and adolescents with hemiplegia. Gait Posture. 2022. Oct;98:233–6. doi: 10.1016/j.gaitpost.2022.09.088 [DOI] [PubMed] [Google Scholar]
  • 22.Russo Y, Stuart S, Silva-Batista C, Brumbach B, Vannozzi G, Mancini M. Does visual cueing improve gait initiation in people with Parkinson’s disease? Hum Mov Sci. 2022. Aug;84:102970. doi: 10.1016/j.humov.2022.102970 [DOI] [PubMed] [Google Scholar]
  • 23.Troisi Lopez E, Sorrentino P, Liparoti M, Minino R, Polverino A, Romano A, et al. The kinectome: A comprehensive kinematic map of human motion in health and disease. Ann N Y Acad Sci. 2022. Oct;1516(1):247–61. doi: 10.1111/nyas.14860 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Correa TA, Baker R, Graham HK, Pandy MG. Accuracy of generic musculoskeletal models in predicting the functional roles of muscles in human gait. J Biomech. 2011. Jul 28;44(11):2096–105. doi: 10.1016/j.jbiomech.2011.05.023 [DOI] [PubMed] [Google Scholar]
  • 25.Geyer H, Herr H. A Muscle-Reflex Model That Encodes Principles of Legged Mechanics Produces Human Walking Dynamics and Muscle Activities. IEEE Trans Neural Syst Rehabil Eng. 2010. Jun;18(3):263–73. doi: 10.1109/TNSRE.2010.2047592 [DOI] [PubMed] [Google Scholar]
  • 26.Kuo AD, Donelan JM. Dynamic Principles of Gait and Their Clinical Implications. Physical Therapy. 2010. Feb 1;90(2):157–74. doi: 10.2522/ptj.20090125 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Falisse A, Serrancolí G, Dembia CL, Gillis J, Jonkers I, De Groote F. Rapid predictive simulations with complex musculoskeletal models suggest that diverse healthy and pathological human gaits can emerge from similar control strategies. Journal of The Royal Society Interface. 2019. Aug 30;16(157):20190402. doi: 10.1098/rsif.2019.0402 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Meyer AJ, Eskinazi I, Jackson JN, Rao AV, Patten C, Fregly BJ. Muscle Synergies Facilitate Computational Prediction of Subject-Specific Walking Motions. Front Bioeng Biotechnol. 2016;4:77. doi: 10.3389/fbioe.2016.00077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Pitto L, Kainz H, Falisse A, Wesseling M, Van Rossom S, Hoang H, et al. SimCP: A Simulation Platform to Predict Gait Performance Following Orthopedic Intervention in Children With Cerebral Palsy. Front Neurorobot. 2019;13:54. doi: 10.3389/fnbot.2019.00054 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.De Groote F, Falisse A. Perspective on musculoskeletal modelling and predictive simulations of human movement to assess the neuromechanics of gait. Proceedings of the Royal Society B: Biological Sciences. 2021. Mar 10;288(1946):20202432. doi: 10.1098/rspb.2020.2432 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Pitto L, Kainz H, Falisse A, Wesseling M, Van Rossom S, Hoang H, et al. SimCP: A Simulation Platform to Predict Gait Performance Following Orthopedic Intervention in Children With Cerebral Palsy. Frontiers in Neurorobotics [Internet]. 2019. [cited 2022 Aug 17];13. Available from: https://www.frontiersin.org/articles/10.3389/fnbot.2019.00054 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Dzeladini F, van den Kieboom J, Ijspeert A. The contribution of a central pattern generator in a reflex-based neuromuscular model. Front Hum Neurosci. 2014;8:371. doi: 10.3389/fnhum.2014.00371 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Taga G, Yamaguchi Y, Shimizu H. Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment. Biol Cybern. 1991;65(3):147–59. doi: 10.1007/BF00198086 [DOI] [PubMed] [Google Scholar]
  • 34.Angelidis E, Buchholz E, Arreguit J, Rougé A, Stewart T, von Arnim A, et al. A spiking central pattern generator for the control of a simulated lamprey robot running on SpiNNaker and Loihi neuromorphic boards. Neuromorph Comput Eng. 2021. Sep 1;1(1):014005. [Google Scholar]
  • 35.Kuo AD. The Relative Roles of Feedforward and Feedback in the Control of Rhythmic Movements. Motor Control. 2002. Apr 1;6(2):129–45. doi: 10.1123/mcj.6.2.129 [DOI] [PubMed] [Google Scholar]
  • 36.McCrea DA, Rybak IA. Organization of mammalian locomotor rhythm and pattern generation. Brain Research Reviews. 2008. Jan 1;57(1):134–46. doi: 10.1016/j.brainresrev.2007.08.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Holmes P, Full RJ, Koditschek D, Guckenheimer J. The Dynamics of Legged Locomotion: Models, Analyses, and Challenges. SIAM Review. 2006;48(2):207–304. [Google Scholar]
  • 38.Prinz AA, Bucher D, Marder E. Similar network activity from disparate circuit parameters. Nat Neurosci. 2004. Dec;7(12):1345–52. doi: 10.1038/nn1352 [DOI] [PubMed] [Google Scholar]
  • 39.Pollick FE, Kay JW, Heim K, Stringer R. Gender recognition from point-light walkers. Journal of Experimental Psychology: Human Perception and Performance. 2005;31(6):1247. doi: 10.1037/0096-1523.31.6.1247 [DOI] [PubMed] [Google Scholar]
  • 40.Troje NF. Decomposing biological motion: A framework for analysis and synthesis of human gait patterns. Journal of Vision. 2002. Sep 10;2(5):2. [DOI] [PubMed] [Google Scholar]
  • 41.Johnson KL, Gill S, Reichman V, Tassinary LG. Swagger, sway, and sexuality: Judging sexual orientation from body motion and morphology. Journal of Personality and Social Psychology. 2007;93(3):321. doi: 10.1037/0022-3514.93.3.321 [DOI] [PubMed] [Google Scholar]
  • 42.Gross MM, Crane EA, Fredrickson BL. Effort-Shape and kinematic assessment of bodily expression of emotion during gait. Human Movement Science. 2012. Feb 1;31(1):202–21. doi: 10.1016/j.humov.2011.05.001 [DOI] [PubMed] [Google Scholar]
  • 43.Brown DD, Wijffels G, Meulenbroek RGJ. Individual Differences in Sequential Movement Coordination in Hip-Hop Dance: Capturing Joint Articulation in Practicing the Wave. Front Psychol. 2021;12:731901. doi: 10.3389/fpsyg.2021.731901 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Edey R, Yon D, Cook J, Dumontheil I, Press C. Our own action kinematics predict the perceived affective states of others. J Exp Psychol Hum Percept Perform. 2017. Jul;43(7):1263–8. doi: 10.1037/xhp0000423 [DOI] [PubMed] [Google Scholar]
  • 45.Becchio C, Manera V, Sartori L, Cavallo A, Castiello U. Grasping intentions: from thought experiments to empirical evidence. Front Hum Neurosci. 2012;6:117. doi: 10.3389/fnhum.2012.00117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Neave N, McCarty K, Freynik J, Caplan N, Hönekopp J, Fink B. Male dance moves that catch a woman’s eye. Biol Lett. 2011. Apr 23;7(2):221–4. doi: 10.1098/rsbl.2010.0619 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Habersack A, Fischerauer SF, Kraus T, Holzer HP, Svehlik M. Kinematic and Kinetic Gait Parameters Can Distinguish between Idiopathic and Neurologic Toe-Walking. Int J Environ Res Public Health. 2022. Jan 12;19(2):804. doi: 10.3390/ijerph19020804 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Heinik J, Werner P, Dekel T, Gurevitz I, Rosenblum S. Computerized kinematic analysis of the clock drawing task in elderly people with mild major depressive disorder: an exploratory study. Int Psychogeriatr. 2010. May;22(3):479–88. doi: 10.1017/S1041610209991360 [DOI] [PubMed] [Google Scholar]
  • 49.Ferrante S, Chia Bejarano N, Ambrosini E, Nardone A, Turcato AM, Monticone M, et al. A Personalized Multi-Channel FES Controller Based on Muscle Synergies to Support Gait Rehabilitation after Stroke. Frontiers in Neuroscience [Internet]. 2016. [cited 2022 Dec 16];10. Available from: https://www.frontiersin.org/articles/10.3389/fnins.2016.00425 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Mikolajczyk T, Ciobanu I, Badea DI, Iliescu A, Pizzamiglio S, Schauer T, et al. Advanced technology for gait rehabilitation: An overview. Advances in Mechanical Engineering. 2018. Jul 1;10(7):1687814018783627. [Google Scholar]
  • 51.Porciuncula F, Roto AV, Kumar D, Davis I, Roy S, Walsh CJ, et al. Wearable Movement Sensors for Rehabilitation: A Focused Review of Technological and Clinical Advances. PM R. 2018. Sep;10(9 Suppl 2):S220–32. doi: 10.1016/j.pmrj.2018.06.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Stenum J, Cherry-Allen KM, Pyles CO, Reetzke RD, Vignos MF, Roemmich RT. Applications of Pose Estimation in Human Health and Performance across the Lifespan. Sensors (Basel). 2021. Nov 3;21(21):7315. doi: 10.3390/s21217315 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Chen G, Patten C, Kothari DH, Zajac FE. Gait differences between individuals with post-stroke hemiparesis and non-disabled controls at matched speeds. Gait & Posture. 2005. Aug 1;22(1):51–6. doi: 10.1016/j.gaitpost.2004.06.009 [DOI] [PubMed] [Google Scholar]
  • 54.Kettlety SA, Finley JM, Reisman DS, Schweighofer N, Leech KA. Speed-dependent biomechanical changes vary across individual gait metrics post-stroke relative to neurotypical adults [Internet]. bioRxiv; 2022. [cited 2022 Aug 15]. p. 2022.04.01.486769. Available from: https://www.biorxiv.org/content/10.1101/2022.04.01.486769v1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Ries AJ, Novacheck TF, Schwartz MH. A data driven model for optimal orthosis selection in children with cerebral palsy. Gait Posture. 2014. Sep;40(4):539–44. doi: 10.1016/j.gaitpost.2014.06.011 [DOI] [PubMed] [Google Scholar]
  • 56.Steele KM, Rozumalski A, Schwartz MH. Muscle synergies and complexity of neuromuscular control during gait in cerebral palsy. Dev Med Child Neurol. 2015. Dec;57(12):1176–82. doi: 10.1111/dmcn.12826 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Halilaj E, Rajagopal A, Fiterau M, Hicks JL, Hastie TJ, Delp SL. Machine learning in human movement biomechanics: Best practices, common pitfalls, and new opportunities. J Biomech. 2018. Nov 16;81:1–11. doi: 10.1016/j.jbiomech.2018.09.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Pandarinath C, O’Shea DJ, Collins J, Jozefowicz R, Stavisky SD, Kao JC, et al. Inferring single-trial neural population dynamics using sequential auto-encoders. Nat Methods. 2018. Oct;15(10):805–15. doi: 10.1038/s41592-018-0109-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Sussillo D, Churchland MM, Kaufman MT, Shenoy KV. A neural network that finds a naturalistic solution for the production of muscle activity. Nat Neurosci. 2015. Jul;18(7):1025–33. doi: 10.1038/nn.4042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Vyas S, Golub MD, Sussillo D, Shenoy KV. Computation Through Neural Population Dynamics. Annu Rev Neurosci. 2020. Jul 8;43:249–75. doi: 10.1146/annurev-neuro-092619-094115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Sutskever I, Vinyals O, Le QV. Sequence to Sequence Learning with Neural Networks. arXiv:14093215 [cs] [Internet]. 2014. Dec 14 [cited 2021 Jun 23]; Available from: http://arxiv.org/abs/1409.3215 [Google Scholar]
  • 62.Horst F, Lapuschkin S, Samek W, Müller KR, Schöllhorn WI. Explaining the unique nature of individual gait patterns with deep learning. Scientific Reports. 2019. Feb 20;9(1):2391. doi: 10.1038/s41598-019-38748-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Santhiranayagam BK, Lai DTH, Sparrow WA, Begg RK. A machine learning approach to estimate Minimum Toe Clearance using Inertial Measurement Units. J Biomech. 2015. Dec 16;48(16):4309–16. doi: 10.1016/j.jbiomech.2015.10.040 [DOI] [PubMed] [Google Scholar]
  • 64.Ahamed T, Costa AC, Stephens GJ. Capturing the continuous complexity of behaviour in Caenorhabditis elegans. Nat Phys. 2021. Feb;17(2):275–83. [Google Scholar]
  • 65.Hausdorff JM, Purdon PL, Peng CK, Ladin Z, Wei JY, Goldberger AL. Fractal dynamics of human gait: stability of long-range correlations in stride interval fluctuations. J Appl Physiol (1985). 1996. May;80(5):1448–57. doi: 10.1152/jappl.1996.80.5.1448 [DOI] [PubMed] [Google Scholar]
  • 66.Revzen S, Guckenheimer JM. Estimating the phase of synchronized oscillators. Phys Rev E Stat Nonlin Soft Matter Phys. 2008. Nov;78(5 Pt 1):051907. doi: 10.1103/PhysRevE.78.051907 [DOI] [PubMed] [Google Scholar]
  • 67.Cox MAA, Cox TF. Multidimensional Scaling. In: Chen C houh, Härdle W, Unwin A, editors. Handbook of Data Visualization [Internet]. Berlin, Heidelberg: Springer; 2008. [cited 2022 Dec 15]. p. 315–47. (Springer Handbooks Comp.Statistics). Available from: 10.1007/978-3-540-33037-0_14 [DOI] [Google Scholar]
  • 68.Fugl-Meyer AR, Jääskö L, Leyman I, Olsson S, Steglind S. The post-stroke hemiplegic patient. 1. a method for evaluation of physical performance. Scand J Rehabil Med. 1975;7(1):13–31. [PubMed] [Google Scholar]
  • 69.Kim H, Kim YH, Kim SJ, Choi MT. Pathological gait clustering in post-stroke patients using motion capture data. Gait & Posture. 2022. May 1;94:210–6. doi: 10.1016/j.gaitpost.2022.03.007 [DOI] [PubMed] [Google Scholar]
  • 70.Kaczmarczyk K, Wit A, Krawczyk M, Zaborski J. Gait classification in post-stroke patients using artificial neural networks. Gait & Posture. 2009. Aug 1;30(2):207–10. doi: 10.1016/j.gaitpost.2009.04.010 [DOI] [PubMed] [Google Scholar]
  • 71.Wikström J, Georgoulas G, Moutsopoulos T, Seferiadis A. Intelligent data analysis of instrumented gait data in stroke patients—A systematic review. Computers in Biology and Medicine. 2014. Aug 1;51:61–72. doi: 10.1016/j.compbiomed.2014.04.004 [DOI] [PubMed] [Google Scholar]
  • 72.Awad LN, Palmer JA, Pohlig RT, Binder-Macleod SA, Reisman DS. Walking speed and step length asymmetry modify the energy cost of walking after stroke. Neurorehabil Neural Repair. 2015. Jun;29(5):416–23. doi: 10.1177/1545968314552528 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Hornby TG, Reisman DS, Ward IG, Scheets PL, Miller A, Haddad D, et al. Clinical Practice Guideline to Improve Locomotor Function Following Chronic Stroke, Incomplete Spinal Cord Injury, and Brain Injury. J Neurol Phys Ther. 2020. Jan;44(1):49–100. doi: 10.1097/NPT.0000000000000303 [DOI] [PubMed] [Google Scholar]
  • 74.Bowden MG, Balasubramanian CK, Behrman AL, Kautz SA. Validation of a speed-based classification system using quantitative measures of walking performance poststroke. Neurorehabil Neural Repair. 2008;22(6):672–5. doi: 10.1177/1545968308318837 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Mannini A, Trojaniello D, Cereatti A, Sabatini AM. A Machine Learning Framework for Gait Classification Using Inertial Sensors: Application to Elderly, Post-Stroke and Huntington’s Disease Patients. Sensors (Basel) [Internet]. 2016. Jan 21 [cited 2019 Sep 2];16(1). Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4732167/ doi: 10.3390/s16010134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Zhang J, Lockhart TE, Soangra R. Classifying lower extremity muscle fatigue during walking using machine learning and inertial sensors. Ann Biomed Eng. 2014. Mar;42(3):600–12. doi: 10.1007/s10439-013-0917-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Aung MSH, Thies SB, Kenney LPJ, Howard D, Selles RW, Findlow AH, et al. Automated detection of instantaneous gait events using time frequency analysis and manifold embedding. IEEE Trans Neural Syst Rehabil Eng. 2013. Nov;21(6):908–16. doi: 10.1109/TNSRE.2013.2239313 [DOI] [PubMed] [Google Scholar]
  • 78.Castano-Pino YJ, Gonzalez MC, Quintana-Pena V, Valderrama J, Munoz B, Orozco J, et al. Automatic Gait Phases Detection in Parkinson Disease: A Comparative Study. Annu Int Conf IEEE Eng Med Biol Soc. 2020. Jul;2020:798–802. doi: 10.1109/EMBC44109.2020.9175268 [DOI] [PubMed] [Google Scholar]
  • 79.Chia Bejarano N, Ambrosini E, Pedrocchi A, Ferrigno G, Monticone M, Ferrante S. A novel adaptive, real-time algorithm to detect gait events from wearable sensors. IEEE Trans Neural Syst Rehabil Eng. 2015. May;23(3):413–22. doi: 10.1109/TNSRE.2014.2337914 [DOI] [PubMed] [Google Scholar]
  • 80.Barton GJ, Hawken MB, Scott MA, Schwartz MH. Movement deviation profile: a measure of distance from normality using a self-organizing neural network. Hum Mov Sci. 2012. Apr;31(2):284–94. doi: 10.1016/j.humov.2010.06.003 [DOI] [PubMed] [Google Scholar]
  • 81.Hainisch R, Kranzl A, Lin YC, Pandy MG, Gfoehler M. A generic musculoskeletal model of the juvenile lower limb for biomechanical analyses of gait. Computer Methods in Biomechanics and Biomedical Engineering. 2021. Mar 12;24(4):349–57. doi: 10.1080/10255842.2020.1817405 [DOI] [PubMed] [Google Scholar]
  • 82.Bongard J, Lipson H. Automated reverse engineering of nonlinear dynamical systems. Proc Natl Acad Sci U S A. 2007. Jun 12;104(24):9943–8. doi: 10.1073/pnas.0609476104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Sanchez-Gonzalez A, Godwin J, Pfaff T, Ying R, Leskovec J, Battaglia PW. Learning to Simulate Complex Physics with Graph Networks [Internet]. arXiv; 2020. [cited 2022 Dec 7]. Available from: http://arxiv.org/abs/2002.09405 [Google Scholar]
  • 84.Alcantara RS, Edwards WB, Millet GY, Grabowski AM. Predicting continuous ground reaction forces from accelerometers during uphill and downhill running: a recurrent neural network solution. PeerJ. 2022. Jan 4;10:e12752. doi: 10.7717/peerj.12752 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Camargo J, Molinaro D, Young A. Predicting biological joint moment during multiple ambulation tasks. J Biomech. 2022. Mar;134:111020. doi: 10.1016/j.jbiomech.2022.111020 [DOI] [PubMed] [Google Scholar]
  • 86.Giarmatzis G, Zacharaki EI, Moustakas K. Real-Time Prediction of Joint Forces by Motion Capture and Machine Learning. Sensors (Basel). 2020. Dec 4;20(23):6933. doi: 10.3390/s20236933 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Berrueta TA, Pervan A, Fitzsimons K, Murphey TD. Dynamical System Segmentation for Information Measures in Motion. IEEE Robotics and Automation Letters. 2019. Jan;4(1):169–76. [Google Scholar]
  • 88.Drnach L, Allen JL, Essa I, Ting LH. A Data-Driven Predictive Model of Individual-Specific Effects of FES on Human Gait Dynamics. In: 2019 International Conference on Robotics and Automation (ICRA). 2019. p. 5090–6.
  • 89.Maus HM, Revzen S, Guckenheimer J, Ludwig C, Reger J, Seyfarth A. Constructing predictive models of human running. J R Soc Interface. 2015. Feb 6;12(103):20140899. doi: 10.1098/rsif.2014.0899 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Rosenberg MC, Banjanin BS, Burden SA, Steele KM. Predicting walking response to ankle exoskeletons using data-driven models. J R Soc Interface. 2020. Oct;17(171):20200487. doi: 10.1098/rsif.2020.0487 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Wang Y, Srinivasan M. System Identification and Stability Analysis of Steady Human Walking and the Swing Leg Dynamics. In American Society of Mechanical Engineers Digital Collection; 2013. [cited 2022 Dec 15]. p. 19–23. Available from: https://asmedigitalcollection.asme.org/DSCC/proceedings-abstract/DSCC2012-MOVIC2012/45301/19/229110 [Google Scholar]
  • 92.Ting LH, Chiel HJ, Trumbower RD, Allen JL, McKay JL, Hackney ME, et al. Neuromechanical principles underlying movement modularity and their implications for rehabilitation. Neuron. 2015. Apr 8;86(1):38–54. doi: 10.1016/j.neuron.2015.02.042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Chvatal SA, Ting LH. Common muscle synergies for balance and walking. Front Comput Neurosci. 2013;7:48. doi: 10.3389/fncom.2013.00048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Torres-Oviedo G, Macpherson JM, Ting LH. Muscle synergy organization is robust across a variety of postural perturbations. J Neurophysiol. 2006. Sep;96(3):1530–46. doi: 10.1152/jn.00810.2005 [DOI] [PubMed] [Google Scholar]
  • 95.Torres-Oviedo G, Ting LH. Subject-specific muscle synergies in human balance control are consistent across different biomechanical contexts. J Neurophysiol. 2010. Jun;103(6):3084–98. doi: 10.1152/jn.00960.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Payne AM, Sawers A, Allen JL, Stapley PJ, Macpherson JM, Ting LH. Reorganization of motor modules for standing reactive balance recovery following pyridoxine-induced large-fiber peripheral sensory neuropathy in cats. J Neurophysiol. 2020. Sep 1;124(3):868–82. doi: 10.1152/jn.00739.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Sawers A, Bhattacharjee T, McKay JL, Hackney ME, Kemp CC, Ting LH. Small forces that differ with prior motor experience can communicate movement goals during human-human physical interaction. J Neuroeng Rehabil. 2017. 31;14(1):8. doi: 10.1186/s12984-017-0217-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Clark DJ, Ting LH, Zajac FE, Neptune RR, Kautz SA. Merging of healthy motor modules predicts reduced locomotor performance and muscle coordination complexity post-stroke. J Neurophysiol. 2010. Feb;103(2):844–57. doi: 10.1152/jn.00825.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Shin SY, Kim Y, Jayaraman A, Park HS. Relationship between gait quality measures and modular neuromuscular control parameters in chronic post-stroke individuals. J Neuroeng Rehabil. 2021. Apr 7;18(1):58. doi: 10.1186/s12984-021-00860-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Allen JL, Ting LH, Kesar TM. Gait Rehabilitation Using Functional Electrical Stimulation Induces Changes in Ankle Muscle Coordination in Stroke Survivors: A Preliminary Study. Front Neurol. 2018;9:1127. doi: 10.3389/fneur.2018.01127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Walter JP, Kinney AL, Banks SA, D’Lima DD, Besier TF, Lloyd DG, et al. Muscle synergies may improve optimization prediction of knee contact forces during walking. J Biomech Eng. 2014. Feb;136(2):021031. doi: 10.1115/1.4026428 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Allen JL, Kesar TM, Ting LH. Motor module generalization across balance and walking is impaired after stroke. J Neurophysiol. 2019. Jul 1;122(1):277–89. doi: 10.1152/jn.00561.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.De Groote F, Jonkers I, Duysens J. Task constraints and minimization of muscle effort result in a small number of muscle synergies during gait. Frontiers in Computational Neuroscience [Internet]. 2014. [cited 2022 Nov 9];8. Available from: https://www.frontiersin.org/articles/10.3389/fncom.2014.00115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Falisse A, Pitto L, Kainz H, Hoang H, Wesseling M, Van Rossom S, et al. Physics-Based Simulations to Predict the Differential Effects of Motor Control and Musculoskeletal Deficits on Gait Dysfunction in Cerebral Palsy: A Retrospective Case Study. Front Hum Neurosci. 2020. Feb 18;14:40. doi: 10.3389/fnhum.2020.00040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Steele KM, Munger ME, Peters KM, Shuman BR, Schwartz MH. Repeatability of electromyography recordings and muscle synergies during gait among children with cerebral palsy. Gait Posture. 2019. Jan;67:290–5. doi: 10.1016/j.gaitpost.2018.10.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Fregly BJ, Reinbolt JA, Rooney KL, Mitchell KH, Chmielewski TL. Design of patient-specific gait modifications for knee osteoarthritis rehabilitation. IEEE Trans Biomed Eng. 2007. Sep;54(9):1687–95. doi: 10.1109/tbme.2007.891934 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Bernshteĭn NA. The co-ordination and regulation of movements. 1st English ed. Oxford, New York: Pergamon Press; 1967. 196 p. [Google Scholar]
  • 108.Beer RF, Dewald JP, Rymer WZ. Deficits in the coordination of multijoint arm movements in patients with hemiparesis: evidence for disturbed control of limb dynamics. Exp Brain Res. 2000. Apr;131(3):305–19. doi: 10.1007/s002219900275 [DOI] [PubMed] [Google Scholar]
  • 109.Sánchez N, Acosta AM, López-Rosado R, Dewald JPA. Neural Constraints Affect the Ability to Generate Hip Abduction Torques When Combined With Hip Extension or Ankle Plantarflexion in Chronic Hemiparetic Stroke. Front Neurol [Internet]. 2018. Jul 11 [cited 2019 Aug 8];9. Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6050392/ doi: 10.3389/fneur.2018.00564 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Zajac FE, Neptune RR, Kautz SA. Biomechanics and muscle coordination of human walking. Part I: introduction to concepts, power transfer, dynamics and simulations. Gait Posture. 2002. Dec;16(3):215–32. doi: 10.1016/s0966-6362(02)00068-1 [DOI] [PubMed] [Google Scholar]
  • 111.van Hedel HJA, Tomatis L, Müller R. Modulation of leg muscle activity and gait kinematics by walking speed and bodyweight unloading. Gait Posture. 2006. Aug;24(1):35–45. doi: 10.1016/j.gaitpost.2005.06.015 [DOI] [PubMed] [Google Scholar]
  • 112.Little VL, McGuirk TE, Perry LA, Patten C. Pelvic excursion during walking post-stroke: A novel classification system. Gait Posture. 2018. May;62:395–404. doi: 10.1016/j.gaitpost.2018.03.052 [DOI] [PubMed] [Google Scholar]
  • 113.Patterson KK, Parafianowicz I, Danells CJ, Closson V, Verrier MC, Staines WR, et al. Gait Asymmetry in Community-Ambulating Stroke Survivors. Archives of Physical Medicine and Rehabilitation. 2008. Feb 1;89(2):304–10. doi: 10.1016/j.apmr.2007.08.142 [DOI] [PubMed] [Google Scholar]
  • 114.Rinaldi LA, Monaco V. Spatio-temporal parameters and intralimb coordination patterns describing hemiparetic locomotion at controlled speed. J NeuroEngineering Rehabil. 2013;10(1):53. doi: 10.1186/1743-0003-10-53 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Schutte LM, Narayanan U, Stout JL, Selber P, Gage JR, Schwartz MH. An index for quantifying deviations from normal gait. Gait and Posture. 2000;7. [DOI] [PubMed] [Google Scholar]
  • 116.Wonsetler EC, Bowden MG. A systematic review of mechanisms of gait speed change post-stroke. Part 2: Exercise capacity, muscle activation, kinetics, and kinematics. Top Stroke Rehabil. 2017. Jul;24(5):394–403. doi: 10.1080/10749357.2017.1282413 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Cao Z, Hidalgo G, Simon T, Wei SE, Sheikh Y. OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [Internet]. arXiv; 2019. [cited 2022 Dec 16]. Available from: http://arxiv.org/abs/1812.08008 [DOI] [PubMed] [Google Scholar]
  • 118.Mathis A, Mamidanna P, Cury KM, Abe T, Murthy VN, Mathis MW, et al. DeepLabCut: markerless pose estimation of user-defined body parts with deep learning. Nat Neurosci. 2018. Sep;21(9):1281–9. doi: 10.1038/s41593-018-0209-y [DOI] [PubMed] [Google Scholar]
  • 119.Uhlrich SD, Falisse A, Kidziński Ł, Muccini J, Ko M, Chaudhari AS, et al. OpenCap: 3D human movement dynamics from smartphone videos [Internet]. Bioengineering; 2022. Jul [cited 2022 Aug 15]. Available from: http://biorxiv.org/lookup/doi/10.1101/2022.07.07.499061 [Google Scholar]
  • 120.Kesar TM, Binder-Macleod SA, Hicks GE, Reisman DS. Minimal detectable change for gait variables collected during treadmill walking in individuals post-stroke. Gait Posture. 2011. Feb;33(2):314–7. doi: 10.1016/j.gaitpost.2010.11.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Kesar TM, Reisman DS, Perumal R, Jancosko AM, Higginson JS, Rudolph KS, et al. Combined effects of fast treadmill walking and functional electrical stimulation on post-stroke gait. Gait Posture. 2011. Feb;33(2):309–13. doi: 10.1016/j.gaitpost.2010.11.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Kesar TM, Perumal R, Jancosko A, Reisman DS, Rudolph KS, Higginson JS, et al. Novel patterns of functional electrical stimulation have an immediate effect on dorsiflexor muscle function during gait for people poststroke. Phys Ther. 2010. Jan;90(1):55–66. doi: 10.2522/ptj.20090140 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Kesar TM, Perumal R, Reisman DS, Jancosko A, Rudolph KS, Higginson JS, et al. Functional electrical stimulation of ankle plantarflexor and dorsiflexor muscles: effects on poststroke gait. Stroke. 2009. Dec;40(12):3821–7. doi: 10.1161/STROKEAHA.109.560375 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. arXiv; 2017. [Google Scholar]
  • 125.Han K, Bae K, Levine N, Yang J, Lee JS. Biomechanical Effect of Foot Orthoses on Rearfoot Motions and Joint Moment Parameters in Patients with Flexible Flatfoot. Med Sci Monit. 2019. Aug 8;25:5920–8. doi: 10.12659/MSM.918782 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Caudron M, Detrembleur C, Van Cauter M. Is simultaneous bilateral total hip arthroplasty deleterious in a biomechanical point of view? A comparative gait analysis study. BMC Musculoskelet Disord. 2022. Oct 10;23:903. doi: 10.1186/s12891-022-05856-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Livolsi C, Conti R, Guanziroli E, Friðriksson Þ, Alexandersson Á, Kristjánsson K, et al. An impairment-specific hip exoskeleton assistance for gait training in subjects with acquired brain injury: a feasibility study. Sci Rep. 2022. Nov 11;12:19343. doi: 10.1038/s41598-022-23283-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Berman GJ, Choi DM, Bialek W, Shaevitz JW. Mapping the stereotyped behaviour of freely moving fruit flies. Journal of The Royal Society Interface. 2014. Oct 6;11(99):20140672. doi: 10.1098/rsif.2014.0672 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1011556.r001

Decision Letter 0

Lyle J Graham

14 Jul 2023

Dear Mrs. Winner,

Thank you very much for submitting your manuscript "Discovering individual-specific gait signatures from data- driven models of neuromechanical dynamics" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Lyle Graham

Section Editor

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This is an exciting work on investigating and interpreting the neuromechanical dynamics accountable for distinct gait patterns using a machine learning framework, particularly recurrent neural networks (RNNs). The authors collected joint kinematics using motion capture from 12 adults (5 able-bodies and 7 stroke survivors) walking at different speeds (0.3-1.8m/s). The study found that the RNN model could detect individual gait signatures that were consistent across various walking speeds, with stroke survivors showing greater gait variations at slower speeds. Although the contributions of the work are significant and relevant to the clinical researchers in examining animal locomotion patterns, there are a few concerns, as follows, that question the technical soundness of the work.

1. Referring Figure 1(b) and 2(b), the authors claim that the cluster of low functioning stroke survivors at self selected speed<0.4m/s is at farthest distance from the able-bodied cluster. However, it is quite evident from the results that encapsulated cluster of low functioning stroke has several outliers, i.e., self selected speed>0.4 m/s (bigger circles). The authors are suggested to justify the reasoning behind ignoring the effect of such outliers.

2. Given Figure 5, the generalisation capability of the trained RNN model is checked by interpolating gait kinematics across speeds, taking 1.32 m/s and 0.43 m/s. Although both speeds are unseen,; however, within the slow and fast speed limit. The generalisation capability will be more evident if the unseen speed will be outside the training range.

3. It is quite obvious that the RNN model would able to generalise able body kinematics much better than stroke survivors due to the significant difference in impaired gait kinematics among the stroke survivors. In that case, given S3 Figure, two questions must be investigated, :

(a) How well the RNN model generalises for the high functioning stroke survivors? Is the self driven R^2 increasing due to closer cluster distance to centroid of able-bodied individuals?

(b) How one can establish a relation between the differences in joint kinematics (varying from high to low functioning stroke survivors) and impaired anthropometric parameters such as flexion/extension deformities.

4. Moreover, the authors are suggested to provide the medical details for lower-limb joints of the stroke patients. Have the authors considered existing discrepancies in lower-limb joints such as fixed flexion deformities (FFD)? Such discrepancies might affect the gait signatures significantly.

5. There should be a algorithm (pesudo code) to highlight the computational approach involved in the development and implementation of RNN model. Moreover, the authors are suggested to provide a schematic representation of the proposed architecture with hyperparameters. The authors should justify the reason behind the selection of ADAM optimizer. Moreover, the details of the optimiser should be highlighted to improve the readability of the paper.

6. (Line 212) The authors claim that the training and validation error converged for less than 0.75 deg. They are suggested to provide the error convergence plot in addition to S2 Figure.

7. (Line 246) The authors claim that six principal components (PCs) explained 77% of the variance in the gait dynamics. However, in view of S4 Figure, six PCs are able to explain ~71% of the gait variance. The authors are suggested to justify the reason of this discrepancy in the text.

8. This study have focused on a specific gait pathologies (cortical or subcortical ischemic stroke) and only for seven patients, which may limit the applicability of the findings to other gait pathologies. This limitation can not be ignored by just mentioning in the discussion. Additionally, the study have not accounted for confounding variables, such as medical conditions or environmental factors, which could affect gait patterns.

Reviewer #2: The paper proposes a data-driven approach to analyze individual gait dynamics of able bodied individuals and of stroke survivors. The model is based on a simple one-layer LSTM network, used to predict the evolution of the pose of the subjects during locomotion at different speeds. The hidden state of the LSTM is used to create a reduced base on which phase-averaged LSTM activations can be projected. Such projection, defined "locomotion signature", is analyzed in the attempt to identify differences in the gait across subjects. The main findings are:

- Able bodied, high-functioning and low-functioning stroke survivors more or less cluster together

- Able bodied gaits form a more packed cluster

- The variability across gaits of different individuals exceeds that within an individual (even at different speeds), justifying the definition of "locomotion signature"

- Individual components of the locomotion signature are interpretable. Parts of the signature from different individuals can be combined, to investigate specific locomotion deficits.

- The LSTM can be used as a generative model. The able bodied locomotion can be generated accurately over many cycles, while that of stroke survivors is only accurate for one cycle, and then converges to that of able bodied individuals.

In my opinion the paper is very well written and motivated. The exposition is clear and precise. The methods are adequate and the authors have good knowledge of the tools they use.

I have some questions and comments:

- I am surprised that the differences in the gait at different locomotion speeds are much smaller than the differences between the gait of different individuals. I speculate that this is caused by 2 facts: 1. the sample size is small. With 13 individuals I can imagine that each gait has some unique, distinguishable features. If they had considered, e.g., 1000 individuals, I am quite sure the differences would have not been visible. Therefore, does it make sense to speak about a signature, when we do not know whether individual gaits would emerge in a larger dataset? 2. I think the speed range is also small. The authors hint to this fact (lines 289-290). This is a strong limitation, as one of the main results is that the variability across individuals exceeds the variability across locomotion speeds. The full range of locomotion speeds should be considered to prove something like this.

- The gait signature alignment is part of the model selection. Maybe it does not have a strong impact, but I am not sure it is correct to include the point you want to prove (that the hidden state of the LSTM defines a locomotion signature) it the selection of the model to use?

- It is briefly mentioned that gait signatures perform similarly to 26 commonly used discrete variables (fig. S5). The authors should better clarify why the features derived from the LSTM state should be preferred to these variables. They perform identically in classifying gait groups and individuals.

- There are some further clusters in the plots: there seem to be 2 able bodied clusters, almost as far from each other as the stroke survivors. What are these differences due to?

- Fig. 3: not all low-functioning subclusters really look further from the able-bodied than the high-functioning ones. Overall, I doubt that any clustering algorithm would find the able-bodied, high-funcitoning and low-functioning clusters

- Retraining the LSTM on any different dataset (or even just with different initialization) would lead a different basis vector, meaning that the analysis of the PCs has to be done for every model. Do they propose that the basis they found in this paper is used also in future studies? Or how do we do in practice?

- The analysis named "gait sculpting" is very original and interesting in my opinion

- the section 547-557 is not fully clear to me. Have the authors tried to set as an objective of the LSTM a longer trajectory, instead of the next pose? This might improve a lot the generative power of the network.

Minor comments:

104 missing comma?

297 missing )

299 "similarly to a when"?

572-573 strange sentence

660-662 not clear that this result actually follows from the study

737 strong title given the quality of the future predictions

752 extra full stop

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1011556.r003

Decision Letter 1

Lyle J Graham

30 Sep 2023

Dear Mrs. Winner,

We are pleased to inform you that your manuscript 'Discovering individual-specific gait signatures from data- driven models of neuromechanical dynamics' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

I also suggest that you take into account the suggestion from Reviewer 2 about the Discussion, which seems quite reasonable.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Lyle Graham

Section Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have addressed all the concerns raised by the reviewer. The manuscript is now in better shape and organization with improved quality. In future, it would be interesting to see the extended work from the authors' group on using diverse dataset and learning algorithm with better generalization (extrapolating) capabilities.

Reviewer #2: We thank the authors for their detailed response. Overall, we think the paper (was already good) and improved further. As is clear from many responses, this framework would shine most, and perhaps provide interesting insights with a larger cohort of people. E.g., from your answer, I gather that you only test walking speeds from 0 to 1.6 m/s, which is far away from running speed. They do cover the whole range of safe speeds for stroke survivors, but definitely not for able bodied patients. Thus, I do think this limitation should be clarified in the discussion. That said, the dataset is not too small to draw the conclusions.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Jyotindra Narayan

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1011556.r004

Acceptance letter

Lyle J Graham

13 Oct 2023

PCOMPBIOL-D-23-00152R1

Discovering individual-specific gait signatures from data-driven models of neuromechanical dynamics

Dear Dr Winner,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Anita Estes

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Video. Stick figure movie demonstrating the RNN’s kinematic reconstruction of all the PCs of dynamics.

    (MP4)

    S2 Video. Stick figure movie demonstrating that PC1 encodes dynamics driving hip flexion and extension.

    (MP4)

    S3 Video. Stick figure movie demonstrating that PC2 encodes dynamics driving knee flexion and extension.

    (MP4)

    S4 Video. Stick figure movie demonstrating that PC3 encodes dynamics driving postural coordination.

    (MP4)

    S5 Video. Stick figure movie demonstrating the kinematic reconstruction when AB2’s PC1 projection is replaced with ST4’s impaired PC1 projection.

    (MP4)

    S6 Video. Stick figure movie demonstrating the kinematic reconstruction when AB2’s PC3 projection is replaced with ST4’s impaired PC3 projection.

    (MP4)

    S7 Video. Stick figure movie demonstrating the kinematic reconstruction when AB2’s PC4 projection is replaced with ST2’s impaired PC4 projection.

    (MP4)

    S8 Video. Stick figure movie demonstrating the kinematic reconstruction when ST3’s impaired PC5 projection is replaced with AB2’s PC5 projection.

    (MP4)

    S1 Fig. Comparison of model performance on training and validation loss (left), and long- and short-time prediction performance (right).

    In both plots, small dots represent the average values across trials for each random initialization of each model. Large dots and bars denote the average and standard deviation of model performance metrics across initializations. Left: Training and validation loss (RMSE) for all 12 hyperparameter pairs. Models in the lower-left consider are considered better. Right: Long- and short-time prediction performance (R2) for the 3 hyperparameter pairs with the lowest training and validation loss. Models in the upper-right corner are considered better.

    (TIF)

    S2 Fig. RNN model training (green) and validation (blue) loss curves.

    (TIF)

    S3 Fig. RNN dynamic learning generalizes across 12 leave-one-individual-out models.

    The minimum train loss (blue) and validation loss (green) was low (< 0.02 degrees2) for 5 models that were trained each with a single able-bodied individual held out of the training data (A, i) and 7 models each with a stroke individual held out of training data (A, ii). The magnitude and range of the test loss (evaluation of model on the held-out data) (orange) was higher than the respective minimum training and validation losses for both held-out able-bodied (A, i) and stroke (A, ii) models. The magnitude and range of test losses evaluated on held-out able-bodied individuals, however, were lower than models evaluated on held-out stroke data. The models generate external predictions (blue) of held-out test trials with higher R2 values than that of self-driven predictions (red) in models evaluated on both able-bodied (B, i) and stroke trials (B, ii). The models can generate external kinematic predictions (blue) of held-out able-bodied (B, i) trials better than that of stroke (B, ii). Self-driven predictions of stroke kinematics were generally very low (R2 values below 0.5). Models were incapable of generating self-driven predictions for 5 of 30 able-bodied trials and 16 of 42 stroke trials. (C) shows reference (black), externally driven (blue) and self-driven (red) phase averaged kinematic predictions for an exemplary able-bodied trial (C, i) and exemplary stroke trial (C, ii). Models can predict kinematics of held-out able-bodied trials better (higher R2) than that of stroke.

    (TIF)

    S4 Fig. Cumulative proportion of variance explained by the first 100 principal components of gait dynamics.

    Six (6) principal components (PCs) explained 77% of the variance in the gait dynamics. The top 6 dominant PCs were used to develop the gait signature.

    (TIF)

    S5 Fig. Support vector machine cross-validation classification accuracy of four different gait descriptors (discrete variables, gait signatures, kinematics, and a combination of kinematics & joint velocity) for discrimination between: a) gait group (able-bodied vs. stroke) and B) individuals.

    Using k = 25 folds, RNN gait signatures distinguished between impaired and unimpaired gait with 100% accuracy, along with the 26 discrete variables (100%, p = 1), whereas kinematic (92.67 ± 0.15%, p < 0.05) and kinematics & velocity (88.67 ± 0.17%, p <0.05) discrimination were significantly lower. Using k = 6 folds, SVM classification of individuals was most accurate using RNN gait signatures and discrete variables (100%), lower using kinematics (88.9 ± 0.13%, p = 0.061) and significantly lower using a kinematics and velocity (68.10 ± 0.16%, p < 0.05).

    (TIF)

    S6 Fig. Relationship between participants’ similarity to able-bodied gait signatures and the RNN’s ability to predict gait kinematics over one gait cycle.

    There exists a negative correlation between self-driven R2 and Euclidean distance to the AB centroid that is statistically significant at the 0.05 level (S6 Fig, dots). Lower functioning stroke survivors are located further away from the able-bodied centroid, however remarkably all but one (outlier) of the R2 values are above 0.73 in both high and lower-functioning individuals. Even though the model can better predict able-bodied future kinematics better than stroke (R2 values above 0.8), the ability of our model to predict at least a single gait cycle of future stroke kinematics with R2 above 0.73 is promising. To provide a control for this analysis, we found the average R2 values between the self-driven prediction of the last full gait cycle of each trial and 5 randomly selected gait cycles from other randomly selected individuals in the same gait group (able-bodied vs. stroke) (S6 Fig, stars). This control would effectively reveal whether our model learned each individual’s specific gait pattern or was just producing arbitrary (averaged) gait patterns. We expect that if the model learned individuals specific gait patterns, then the distribution of the R2 values of the self-driven kinematic predictions would be significantly different to that of the averaged R2 values corresponding to arbitrary gait patterns. In fact, the averaged R2 of the generic gait cycle comparisons ranged from 0.56 to 0.95 (compared to 0.61–0.99 in self-driven predictions). Further, the distributions of the R2 outputs for both able-bodied and stroke individuals were significantly different (at the 0.05 level) to self-driven R2 outputs with p-values 5.5x10-6 and 5.4x10-3 respectively using the Wilcoxon Rank-Sum test. This reveals that even for stroke survivors the model has learned their dynamics in some capacity and the model isn’t simply predicting arbitrary gait patterns.

    (TIF)

    S7 Fig. Graphical summary and pseudo code of our gait signatures framework and algorithm.

    A) Training dataset: Using the 6-dimensional gait trajectory, the inputs (green) were concurrent segments from the gait trajectory, each one 499-time steps long. The outputs (blue) were 1-shifted (in time) segments of inputs. B) Model architecture: Our model consisted of an input layer, a hidden layer composed of 512 LSTM units, and a 6-unit Dense output layer. C) Stateful training: The hidden state of an RNN at time t is a function of the input at time t and the hidden state at time t-1. The model starts with processing the first mini-batch, calculating a new hidden state at each t and predicting gait kinematics at time t+1 given kinematics data at t. At the end of the mini-batch processing, the model calculates MSE over the entire mini-batch to calculate error for backpropagation and to update model weights. The final hidden state h(t+L) is used as the initial hidden state for generating predictions for the next mini batch (L is the temporal length of each mini-batch). The hidden state is initialized as zero before processing the first mini-batch.

    (TIF)

    S1 Table. Stroke participant’s demographics and clinical scores.

    (TIF)

    Attachment

    Submitted filename: Response to reviewers2.docx

    Data Availability Statement

    To ensure rigor, reproducibility, and promote open science, all software is shared under a GNU GPL 3.0 license and on GitHub. Links to Google Colab notebooks enable our software to be run on the cloud for users without computational resources. The gait signature code developed and used in this paper has been deposited at GitHub: https://github.com/bermanlabemory/gait_signatures. The RNN model and code was developed in Python programming language using built-in Python-based libraries such as Keras, Pandas, and NumPy. We revised the Phaser algorithm (https://github.com/sheim/phaser) to estimate phase for our kinematic time trajectories in the development of our phase averaged dynamics per trial. Shareable Jupyter Notebooks were developed on the Google Colab platform. The data analysis of the generated gait signatures was conducted in MATLAB 2022a (MathWorks).


    Articles from PLOS Computational Biology are provided here courtesy of PLOS

    RESOURCES