Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Mar 1.
Published in final edited form as: Osteoarthritis Cartilage. 2021 Jan 7;29(3):346–356. doi: 10.1016/j.joca.2020.12.017

A neural network to predict the knee adduction moment in patients with osteoarthritis using anatomical landmarks obtainable from 2D video analysis

Melissa A Boswell 1,*, Scott D Uhlrich 2,6,*, Łukasz Kidziński 1, Kevin Thomas 3, Julie A Kolesar 1,6, Garry E Gold 4, Gary S Beaupre 1,6, Scott L Delp 1,2,5
PMCID: PMC7925428  NIHMSID: NIHMS1669628  PMID: 33422707

Abstract

Objective:

The knee adduction moment (KAM) can inform treatment of medial knee osteoarthritis; however, measuring the KAM requires an expensive gait analysis laboratory. We evaluated the feasibility of predicting the peak KAM during natural and modified walking patterns using the positions of anatomical landmarks that could be identified from video analysis.

Method:

Using inverse dynamics, we calculated the KAM for 86 individuals (64 with knee osteoarthritis, 22 without) walking naturally and with foot progression angle modifications. We trained a neural network to predict the peak KAM using the 3-dimensional positions of 13 anatomical landmarks measured with motion capture (3D neural network). We also trained models to predict the peak KAM using 2-dimensional subsets of the dataset to simulate 2-dimensional video analysis (frontal and sagittal plane neural networks). Model performance was evaluated on a held-out, 8-person test set that included steps from all trials.

Results:

The 3D neural network predicted the peak KAM for all test steps with r2=0.78. This model predicted individuals’ average peak KAM during natural walking with r2=0.86 and classified which 15° foot progression angle modifications reduced the peak KAM with accuracy=0.85. The frontal plane neural network predicted peak KAM with similar accuracy (r2=0.85) to the 3D neural network, but the sagittal plane neural network did not (r2=0.14).

Conclusion:

Using the positions of anatomical landmarks from motion capture, a neural network accurately predicted the peak KAM during natural and modified walking. This study demonstrates the feasibility of measuring the peak KAM using positions obtainable from 2D video analysis.

Keywords: Knee Adduction Moment, Osteoarthritis, Gait, Machine Learning, Neural Network

Introduction

Knee osteoarthritis (OA) affects approximately 14 million people in the United States1 and is a leading cause of disability worldwide2. The medial compartment of the knee is most commonly affected3, likely because it bears a greater proportion of total knee contact force compared to the lateral compartment4. Joint loading cannot be directly measured in vivo in an osteoarthritic knee, so the peak knee adduction moment (KAM) is a common surrogate measure of medial knee loading due to its relationship to the medio-lateral distribution of force in the knee5. The peak KAM, which typically occurs during the first half of the stance phase, is correlated with the presence6, severity7,8, and progression9 of medial compartment knee osteoarthritis.

Measurements of the peak KAM enhance clinical decision making. Measuring the peak KAM during natural walking aids in diagnosing OA1012, evaluating a patient’s risk of OA progression9, and predicting surgical outcomes13. Gait modifications such as increasing trunk lean14,15, avoiding contralateral pelvic drop16, medializing the knee (medial thrust)17,18, and changing the foot progression angle19,20 can reduce the peak KAM and pain. However, personalizing interventions based on how they affect each individual’s peak KAM is critical for maximizing the achievable reduction in loading and avoiding interventions that cause a harmful increase in loading18,2123. Personalized foot progression angle modifications (toe-in or toe-out), in particular, are an effective and subtle way to reduce the peak KAM23.

The KAM is calculated using inverse dynamics from ground reaction forces and kinematics measured in a gait analysis laboratory equipped with force plates and a motion capture system (Figure 1A). The expensive equipment and technical expertise necessary to operate a gait laboratory are inaccessible to most clinicians, which excludes potentially valuable KAM measurements from routine clinical practice. The peak KAM has been calculated using mobile sensors (inertial measurement units and force-instrumented shoes) and an inverse dynamics approach24; however, this method still requires expensive equipment, limiting its scalability. A simpler, cheaper way to assess the peak KAM is needed.

Figure 1.

Figure 1.

A) The gold standard, laboratory-based workflow for measuring the knee adduction moment (KAM). After motion capture data is collected and semi-manually pre-processed, it is combined with force plate data to compute the KAM using inverse dynamics. B) The workflow for the current study. We use the coordinates of 13 anatomical landmarks from motion capture (to simulate video keypoints) as inputs into a neural network trained to predict the peak KAM. C) Our proposed future workflow for the automated measurement of the KAM. After collecting 2D video of gait, keypoints (e.g., joint positions) could be detected automatically using OpenPose25. A neural network would predict the peak KAM using these keypoints as input.

The ability to predict the peak KAM with 2-dimensional (2D) video analysis would make these measurements available to clinicians, researchers, and patients using only a video camera. Recently-developed automated keypoint detection algorithms such as OpenPose25 extract the positions of anatomical landmarks (e.g., joint locations) from 2D video (Figure 1C). But the small number of keypoints on each body segment and the imperfect anatomical accuracy of keypoints (ranging from 14–30mm)26,27 limit their utility as a replacement for motion capture in an inverse dynamics analysis. Machine learning models are a promising solution for predicting complex biomechanical outputs using low-fidelity inputs like video-based keypoints28,29.

Machine learning models have been trained to predict the KAM, but many of these models rely on input features from equipment that is not readily available in clinical settings, such as force plates, inertial measurement systems, pressure-sensing insoles, and electromyography systems3033. Other models have used kinematic features such as synthesized images of motion capture trajectories34 and Euler angles35 to predict the KAM curve in healthy individuals and individuals with alkaptonuria. These models demonstrate the promise of using kinematics alone to predict the KAM curve; however, the ability to predict the peak KAM in individuals with knee OA using kinematic features identifiable from video analysis remains unknown.

The objective of our study is to evaluate the feasibility of predicting the peak KAM in individuals with and without knee OA using keypoints that could be extracted from video. In the absence of a dataset with synchronized 2D video and joint kinetics, we simulate video-based keypoints using the positions of anatomical landmarks measured with motion capture from a previously-collected dataset. Furthermore, we simulate keypoints from 2D video by projecting 3-dimensional (3D) anatomical landmark positions onto a 2D plane. Anatomical keypoint detection is a rapidly maturing field, so evaluating a machine learning model’s ability to predict the peak KAM using motion-capture-based keypoints will demonstrate the potential for a video-based solution in the future.

We aimed to evaluate the feasibility of predicting the first KAM peak using machine learning models that use only the positions of anatomical landmarks as inputs. We evaluated model performance on two clinical decision-making tasks. The first task was predicting an individual’s first peak KAM during natural walking with a mean absolute error (MAE) less than 0.5% bodyweight*height (BW*H); this threshold was chosen because it is the minimum of the clinically-relevant range of 0.5–2.2%BW*H related to diagnosing OA and evaluating the risk of progression912. The second task was classifying whether a foot progression angle modification increases or decreases the first peak KAM. To evaluate the feasibility of a video-based solution that uses a single camera, we compared the performance of models that use only sagittal or frontal plane anatomical landmark positions to the model that uses 3D positions.

Methods

Data Collection

Eighty-six individuals (64 with medial knee OA and 22 without OA23) participated in this study after providing written consent in compliance with the Stanford University Institutional Review Board (Table 1). We included individuals with and without knee OA to establish a larger dataset and to train a model that is generalizable to a broad spectrum of OA severities. While assessing the KAM in individuals without OA is not currently a part of clinical practice, an inexpensive screening tool, like the pipeline proposed here (Figure 1C), could help identify asymptomatic individuals with a large KAM who may be at risk of developing medial knee OA12. Inclusion criteria for the medial knee OA group were (i) Kellgren-Lawrence grade 1–3 and smaller medial compared to lateral joint space width assessed from an anterior-posterior weight-bearing radiograph by a radiologist (GEG); (ii) medial pain of 3 or higher out of 10 on the numeric rating scale and medial pain greater than patellofemoral or lateral pain; (iii) the ability to walk safely for 25 minutes on a treadmill without ambulatory aids; and (iv) body mass index (BMI) less than 35. Individuals without osteoarthritis were included if they had no lower-extremity pain or history of lower-extremity injury in the past year.

Table 1.

Population information for participants with and without osteoarthritis (OA). Data are presented as mean ± standard deviation.

Characteristic Without OA With OA
Number of subjects 22 64
Gender 9F, 13M 43F / 21M
Age (years) 24.7 ± 3.2 64.6 ± 8.81
Height (m) 1.75 ± 0.13 1.68 ± 0.10
Weight (kg) 69.4 ± 15.3 77.2 ± 15.0
BMI (kg/m2) 22.3 ± 2.2 27.2 ± 3.7
Preferred Walking Speed (m/s) 1.15 ± 0.10 1.16 ± 0.13
Kellgren Lawrence grade N/A I: 12, II: 32, III: 20

Participants walked on an instrumented treadmill (Bertec Corporation, Columbus, OH, USA) at their self-selected speed in an 11-camera motion capture volume (Motion Analysis Corporation, Santa Rosa, CA, USA) with a retro-reflective marker set sufficient to compute the 3D kinematics of the lower limbs and trunk23. Participants first completed a static trial to identify ankle and knee joint axes and joint centers and then performed bilateral hip circumduction trials to determine the hip joint centers36. After 5 minutes of treadmill familiarization, participants completed a 2-minute baseline trial of natural walking. Participants then received vibrotactile biofeedback (described in Uhlrich et al.23) that instructed them to practice toeing-in and toeing-out by 5° and 10° relative to their natural foot progression angle for a minimum of 1 minute per angle. They subsequently performed four 2-minute modification trials where they targeted each of the foot progression angle modifications. The baseline and modification trials were used for analysis.

Ground reaction forces and marker positions were low-pass filtered at 8 Hz using a fourth-order, zero lag, Butterworth filter. The KAM was calculated using inverse dynamics in MATLAB (Mathworks Corporation, Natick, MA, USA) as the frontal plane component of the 3D knee moment expressed in the proximal tibial reference frame23 and normalized by bodyweight and height. The peak KAM was defined as the maximum value of the KAM curve during the first half of the stance phase; the stance phase was defined as the time during which the vertical ground reaction force was greater than 30N.

Predictive Statistical Models

The inputs to our machine learning model were the 3D positions of 13 anatomical landmarks from motion capture and a binary value identifying the stance leg (Figure 1B). From the motion capture marker set, we selected a subset of anatomical landmarks that are similar to the keypoints tracked using 2D video analysis algorithms like OpenPose25. The anatomical landmarks chosen included four from each lower extremity (2nd metatarsal head, posterior calcaneus, lateral malleolus, lateral femoral epicondyle), four pelvic landmarks (right and left anterior superior iliac spine and posterior superior iliac spine), and the C7 vertebrae. We defined the midpoint of the posterior superior iliac spine markers as the origin of the anatomical landmark positions to make these input positions independent of the participant’s location in the laboratory frame. Positions were then normalized by participant height. Additionally, the medio-lateral positions of left steps were reflected across the body midline, so that all data appeared to be from the right leg, making the input more consistent37. Since our goal was to predict the first peak KAM, which occurs during the first half of the stance phase, we sampled the input positions 8 times during the first half of the stance phase. This sample rate was sufficient to capture the frequency content of the 8Hz-lowpass-filtered inputs during the stance phase, which lasts less than 1 second. The final input matrix for each step was of size 40 × 8 ((13 markers · 3 dimensions + 1 leg binary) · 8 timesteps).

Participants were randomly divided 80%−10%−10% into training, development, and test sets, such that all of a participant’s data resided in only one set. The training set consisted of 91,245 steps from 70 individuals (18 without OA, 52 with OA), the development set consisted of 9,810 steps from 8 individuals (2 without OA, 6 with OA), and the test set consisted of 11,675 steps from 8 individuals (2 without OA, 6 with OA). The performance of the model on the development set was used to select the model architecture and hyperparameters. The test set was used only to evaluate the accuracy of the final model.

Models were trained in Python (version 3.6.8) using Keras (version 2.2.4) with 1 GPU (NVIDIA GeForce GTX 960). We evaluated the following architectures: linear regression, a convolutional neural network (three 1-dimensional convolutional layers, 3 fully connected layers)38, a long short-term memory network (2 long short-term memory layers, 1 fully connected layer)39, and a fully connected neural network (10 hidden layers with 100 neurons each). Neural network weights were initialized with Xavier initialization40, and Adam gradient descent41 was used to minimize the root-mean-squared-error. The final layer of each model had a single neuron with linear activation to output the single peak KAM value. We selected a fully connected neural network architecture due to its superior performance on the development set (Table S1, Supplemental Information). We refer to this model as the 3D neural network, as its inputs are 3D anatomical landmark positions.

We used lasso regression42 to reduce redundant information in the flattened input vector before training the 3D neural network (glmnet package43 in R44). To select the lambda value for lasso regression, we performed 10 runs of 10-fold cross-validation using the training set. We used the one-standard-error rule for selecting lambda: for each run, we selected the lambda value that yielded a prediction error that was one standard error greater than the minimum prediction error, thereby removing more features than the lambda that minimized prediction error. We then removed all but one binary leg feature from the flattened input vector. The input vector size was reduced from 320 to 299 features.

After a random search, we selected the following 3D neural network hyperparameters based on model performance on the development set: 1 hidden layer (tested 1–20), 800 neurons in the hidden layer (tested 0–1000), and 0.01 probability of dropout (tested 0–0.25). Neurons in the hidden layer used a rectified linear unit activation function45, while the output neuron had a linear activation function. To reduce overfitting, we used early stopping with 5 epochs of patience and used model weights from the epoch with the best development-set performance.

To identify which anatomical landmark positions were most influential on the peak KAM predictions, we computed the saliency of the input features. In general, saliency analysis identifies how changes in each input feature relate to changes in the output by computing the partial derivative of the output with respect to each input feature46. For this analysis, we trained a model with the same architecture and input features as the 3D neural network but without lasso regression in order to account for every feature at each timestep. We averaged saliency (Python Keras-vis package47) over 2000 examples randomly selected from the test set, then averaged over all timesteps for each of the 39 anatomical landmark positions (Equation 1):

Saliencyxi=meank(meanj(|KAM(j)xi(j,k)|)) (Equation 1)

where xi=1:39 input positions, j=1:2000 examples, and k=1:8 timesteps. Finally, we normalized the saliency value for each feature by the saliency value of the most salient feature (Equation 2).

Saliencyxi,norm=Saliencyximaxi(Saliencyxi) (Equation 2)

We then evaluated the performance of our models on the clinical decision-making tasks. To test a model’s ability to predict an individual’s baseline peak KAM during natural walking, we predicted the peak KAM for all steps during the baseline trial, computed the average value for an individual leg, then compared this value to the average of the peak KAM values from inverse dynamics. We analyzed both legs for the 8 individuals in the test set, yielding 16 conditions. To test the model’s ability to identify changes in the peak KAM from varying degrees of foot progression angle modifications, we identified steps for which individuals toed-in or toed-out by 5±2.5°, 10±2.5°, or 15±2.5°. For each modification, there were 32 possible conditions (8 subjects, 2 legs, 2 angles), but not all legs achieved the target angles, leaving between 20 and 31 conditions for analysis. Changes in the peak KAM were computed by subtracting a leg’s average baseline peak KAM from the peak KAM from a modified step, using KAM values from either the neural network or inverse dynamics.

For our final aim, we simulated 2D video input by using only sagittal or frontal plane anatomical landmark positions as model inputs. We removed the anterior-posterior landmark positions (in the laboratory frame) for the frontal plane neural network and the medio-lateral components for the sagittal plane neural network. There were 209 inputs for these planar models (13 markers · 2 dimensions · 8 timesteps + 1 leg binary). We used the same model architecture and training procedures for the planar models as the 3D neural network, excluding lasso regression.

Analytical Statistics

All statistical analyses were performed in MATLAB. We evaluated model performance using r2 (Pearson’s correlation coefficient) and the mean absolute error (MAE; Equation 3):

MAE=1mi=1m|yiy^i| (Equation 3)

where m=number of examples; yi=inverse dynamics peak KAM; and y^i=neural network-predicted peak KAM. As a supplemental analysis, we added three types of virtual noise to the dataset to simulate various sources of error in video-based keypoint identification, with error magnitudes (14–30mm) based on the previously-reported accuracy of pose recognition algorithms26,27 (details in Table S2).

To estimate the uncertainty in our performance estimates based on test-set subject selection, we used percentile bootstrapping48 to compute 95% confidence intervals (CI) for r2 and MAE. For all bootstrapped distributions, we trained one model and resampled (10,000 times) the subjects in the test set with replacement from the 8 subjects allocated to the test set, then calculated the test statistics for each resampled set. When evaluating model performance on all steps in the test set (Figures 2 and 6), we downsampled the test set to have an equal number of steps (400, randomly selected) for each leg, which gave each individual an equal effect on the test statistic, regardless of walking speed or step frequency. We computed the sample r2 and MAE from this downsampled test set. To evaluate if our model predicted the baseline peak KAM with MAE less than 0.5%BW*H (Figure 4), we computed the average baseline peak KAM for each leg in the test set and computed the 95% CI from a bootstrapped distribution with per-subject resampling.

Figure 2.

Figure 2.

The predicted peak knee adduction moment (KAM) from the neural network (NN) using 3D anatomical landmark positions as input (3D neural network) vs. the peak KAM calculated from inverse dynamics (ID) plotted against the y=x line. Presented data are for test subjects from the baseline and foot progression angle modification trials. Each point represents a single step, and a single color represents the steps from both legs of a subject.

Figure 6.

Figure 6.

The performance of neural networks that use planar projections of anatomical landmark positions as inputs. A) The frontal plane neural network predicts the peak KAM with similar accuracy to the 3D neural network (r2=0.85). B) The sagittal plane neural network is less accurate (r2=0.14) than the 3D or frontal plane neural networks. Presented data are for test subjects from the baseline and foot progression angle modification trials. A point represents a single step, and a single color represents the steps from both legs of a subject.

Figure 4.

Figure 4.

The peak knee adduction moment (KAM) estimated by the 3D neural network (NN) and inverse dynamics (ID) from the baseline (natural walking) trial. Data are averaged over all baseline steps for each leg of each subject (represented by a color) in the test set plotted against the y=x line.

Results

The 3D neural network predicted the first peak KAM for all steps in the test set (Figure 2) with r2=0.78 (95% CI=[0.44, 0.89]) and MAE=0.53%BW*H (95% CI=[0.39, 0.67]). The training set performed with r2=0.88 and the development set with r2=0.72. The saliency analysis (Figure 3) showed that for the 3D neural network, the peak KAM prediction was most sensitive to changes in positions of the swing and stance-leg anterior superior iliac spines (anterior-posterior), the stance-leg knee (medio-lateral), the C7 vertebrae (medio-lateral), and the swing-leg anterior superior iliac spine (anterior-posterior). From the supplemental virtual noise simulation (Table S2), the 3D neural network performance did not degrade substantially with constant error added to a coordinate across all subjects or with 14mm of error that changed on a subject-by-subject basis. Model performance did degrade with 30mm of subject-by-subject noise and any magnitude of noise that changed randomly at each timestep.

Figure 3.

Figure 3.

The top five most salient features (features that, when changed, have the greatest effect on the predicted peak KAM) normalized by the most salient feature for the 3D, frontal plane, and sagittal plane neural networks (left). The positions and Cartesian coordinate directions of the most salient features (right) where x corresponds to the anterior-posterior direction, y to medio-lateral, and z to superior-inferior.

The 3D neural network predicted the average peak KAM during baseline (natural) walking with r2=0.86 (95% CI=[0.62, 0.94]) and MAE=0.37%BW*H (95% CI=[0.23, 0.51]) (Figure 4). The MAE confidence interval minimally overlapped with our most stringent clinically-meaningful accuracy threshold of 0.5%BW*H. Predictions of the baseline peak KAM were, on average, 0.31%BW*H (95% CI=[0.10, 0.51]) greater than the inverse dynamics values (Bland Altman analysis in Figure S1) with an absolute error range of 0.02–1.09%BW*H. The 3D neural network classified if 5°, 10°, and 15° toe-in or toe-out gait modifications increased or reduced the peak KAM with accuracies of 0.65, 0.71, and 0.85, respectively (Figure 5, Table 2).

Figure 5.

Figure 5.

The average change in the peak knee adduction moment (KAM) estimated by the 3D neural network (NN) vs. inverse dynamics (ID) for 5°, 10°, and 15° foot progression angle modifications for each leg of each subject in the test set. The accuracy (acc.) of classification is increases with increasing degrees of foot progression angle modification.

Table 2.

The performance of the 3D and frontal plane neural networks in classifying whether changes in foot progression angle (FPA) increase or reduce the peak knee adduction moment (Figure 5). A reduction in peak KAM was considered a positive result. For example, sensitivity represents the number of cases that the neural network predicted as a reduction in the peak KAM divided by the number of cases that ID identified as a reduction in the peak KAM. Both models predict the effects of larger FPA modifications more accurately than small modifications.

Accuracy Sensitivity Specificity Precision
3D Neural Network
 5° FPA modification 0.65 0.59 0.78 0.87
 10° FPA modification 0.71 0.70 0.75 0.87
 15° FPA modification 0.85 0.73 1.0 1.0
Frontal Plane Neural Network
 5° FPA modification 0.58 0.68 0.33 0.71
 10° FPA modification 0.79 0.90 0.50 0.80
 15° FPA modification 0.80 1.0 0.73 0.73

The frontal plane neural network predicted the peak KAM for all steps in the test set (Figure 6A) with r2=0.85 (95% CI=[0.56, 0.91]) and MAE=0.49%BW*H (95% CI=[0.39, 0.59]), which was not statistically different from the performance of the 3D neural network. The frontal plane neural network predicted the average peak KAM during baseline walking with r2=0.86 (95% CI=[0.49, 0.96]) and MAE=0.40%BW*H (95% CI=[0.23, 0.59]), and predicted the change in KAM resulting from 5°, 10°, and 15° foot progression angle modifications with accuracy of 0.58, 0.79, and 0.80, respectively (Table 2).

The sagittal plane neural network predicted the peak KAM for all steps in the test set (Figure 6B) with r2=0.14 (95% CI=[0.02, 0.45]) and MAE=0.85%BW*H (95% CI=[0.63, 1.06]), which was less accurate than the frontal plane neural network and had minimal confidence-interval overlap with the 3D neural network.

Discussion

The purpose of this study was to evaluate the feasibility of using video to predict the peak KAM in individuals with and without knee OA during walking. To do this, we trained machine learning models to predict the peak KAM using the positions of anatomical landmarks from motion capture that are similar to those that could be measured with video analysis in the future. We found that a neural network that uses 3D positions accurately predicted the peak KAM for people with and without knee OA (r2=0.78). This model accurately predicted individuals’ peak KAM during natural walking as well as changes in peak KAM that resulted from large (15°) foot progression angle modifications, suggesting that it performs well enough to inform clinical decision making in many cases. A neural network that only uses frontal plane positions also predicted the peak KAM accurately, suggesting that it might be feasible to use a front-facing camera and video analysis tools to estimate the peak KAM in the future.

The 3D and frontal plane neural networks predicted the peak KAM with r2=0.78 and r2=0.85, which is similar to previous techniques that predicted the peak KAM with a reduced set of inputs compared to a full gait analysis laboratory. The 95% confidence intervals around the performance of our models overlap with the performance of machine learning approaches that used inertial measurement units (r2=0.71)32 or anthropometric measures and force plate data (r2=0.59)30 as well as an inverse dynamics approach that used instrumented force shoes and inertial measurement units (r2=0.80)24. Other studies have predicted the KAM curve during walking using a reduced set of inputs34,35,49, but it is worth noting that predicted curves that are highly correlated with the reference curve do not always yield accurate peak predictions. For example, Favre et al. predicted the KAM curve with r2=0.94, but the peak KAM extracted from the curve was less accurate (r2=0.59)30, demonstrating the importance of evaluating model performance on clinically-meaningful outcome metrics, like the peak KAM. Although our models do not directly enable mobile measurements, the frontal plane neural network demonstrates the feasibility of predicting the peak KAM from a limited set of anatomical landmark positions that are identifiable from 2D video. Secondarily, the model could serve as the basis for a transfer learning approach that uses keypoints from front-facing video as inputs.

The 3D input positions that the saliency analysis identified as most influential for KAM predictions correspond with kinematic changes that have been shown to affect the KAM. First, the superior-inferior positions of the posterior superior iliac spines relates to contralateral pelvic drop, which increases the KAM16. Second, the medial position of the stance-limb knee relates to the frontal-plane knee angle. Medializing the knee is a suggested mechanism for KAM-reducing interventions such as medial-thrust gait17, toe-in gait50, and variable stiffness shoes51. Third, the medio-lateral position of the C7 vertebrae relates to the trunk-sway angle. Trunk-sway influences the KAM by altering the medio-lateral position of the center of mass15. Finally, the anterior-posterior position of the swing-leg knee could relate to stride length. A slower walking speed or decreased stride length can reduce the peak KAM10,17. Some of the salient features in the frontal and sagittal plane models were different from the 3D model’s salient features in these planes, which may be explained by the redundancy of input features, complex nonlinear relationships between features, and the stochastic nature of model training. Even small perturbations to input data can lead to dramatically different saliency results, and thus, different interpretations of the model52, so the relationships between gait modifications and the salient features remains speculative.

Both the 3D and frontal plane neural networks performed with sufficient accuracy to inform clinical decision making in most cases. The models predicted the peak KAM during natural walking with MAE=0.37–0.49%BW*H, which is accurate enough to evaluate the risk of OA progression (differences between the peak KAM of progressors and non-progressors is 2.1%BW*H)9. Our models also predicted the baseline peak KAM with sufficient accuracy to classify between patients who are likely or unlikely to benefit from a high tibial osteotomy (individuals with a 2.2%BW*H lower pre-surgical peak KAM had better post-surgical outcomes)13,53. Additionally, our models classified whether 15° foot progression angle modifications increased or reduced the peak KAM with an accuracy of 0.80–0.85. However, they were less accurate in predicting smaller 5° and 10° modifications, potentially due to subtler changes to both input kinematics and the output peak KAM. Notably, toeing-in typically reduces the first peak KAM50, but it is not effective for all individuals23; the 3D neural network correctly identified the five of the six legs that reduced the first peak KAM by toeing-in by 15° and the four of the five that did not. If a future video-based model performs with similar accuracy, any clinician with a smartphone would be able to make clinically-actionable biomechanical measurements without purchasing expensive equipment.

It is important to identify the limitations of this study. First, machine learning models may not generalize well to conditions not represented in the training data. Our models were trained on individuals with and without knee OA, with a BMI below 35, who were walking with varying foot progression angles. Thus, our models will likely lose accuracy when predicting the peak KAM for new populations performing different activities. Fortunately, we trained accurate models with a reasonably-sized dataset (n=86), suggesting that new models predicting different parameters can likely be trained using pre-existing datasets. Second, the inputs to our models were motion capture marker positions that were very similar, but not identical to the positions commonly used in video pose-recognition algorithms (e.g., lateral femoral epicondyle versus knee joint center). However, model performance did not change dramatically when a constant offset was added to the motion capture positions (Table S2), indicating that the models may not be sensitive to the small differences in the definitions of anatomical landmarks between motion capture and video-based keypoints. Finally, our models do not directly facilitate mobile measurement of the peak KAM since they rely on anatomical landmark positions from motion capture rather than keypoints from video. The robustness of our pipeline to several sources of error that may arise from video keypoints (Table S2) adds confidence that a video-based approach may be feasible. Nonetheless, the performance of our models trained on motion capture positions will likely exceed the accuracy of a similar model trained with a similarly-sized dataset of video keypoints. Even with a slight decrease in performance, an inexpensive, video-based solution would provide clinical value compared to other mobile solutions that require equipment that costs thousands of dollars24,30,54.

In summary, we developed a model to predict the first peak KAM in individuals with and without knee OA using the positions of anatomical landmarks. Our model accurately predicts the peak KAM during natural walking as well as changes in the KAM that result from gait modifications. Since anatomical landmark positions projected onto a plane are similar to the output from 2D video keypoint detection algorithms, our results support the feasibility of predicting the peak KAM from a 2D frontal plane video as a promising next step. These results support the utility of computer vision and machine learning as tools that can bring biomechanical measurements into the clinic or home. We envision that models like the ones presented here will soon enable scientists to easily monitor joint loading in large cohorts and clinicians to prescribe personalized treatments for musculoskeletal pathologies.

Supplementary Material

1

Acknowledgements

The authors would like to thank Patrick Cho and Andrew Ng for their advice on the machine learning models. We would also like to thank Amy Silder and Brittany Presten for their assistance in data collection. This work was supported by National Science Foundation Graduate Research Fellowships under Grant No. DGE-114747 and Grant No. DGE-1656518. This work was also supported by Merit Review Award Number I01 RX001811 from the United States (U.S.) Department of Veterans Affairs Rehabilitation R&D (Rehab RD) Service, and Grants U54EB020405 and P41EB027060 from the National Institutes of Health.

Role of the funding source

The funding sources did not play a role in study design, collection, analysis and interpretation of data; writing of the manuscript; nor the decision to submit the manuscript for publication.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Competing Interests

The authors have no conflicts of interest to declare.

Data and Code Availability

The models, code, and a portion of the dataset are available on GitHub (https://github.com/stanfordnmbl/predictKAM).

References

  • 1.Deshpande BR, Katz JN, Solomon DH, Yelin EH, Hunter DJ, Messier SP, et al. The number of persons with symptomatic knee osteoarthritis in the United States: Impact of race/ethnicity, age, sex, and obesity. Arthritis Care Res. 2016;68(12):1743–1750. doi: 10.1002/acr.22897 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Murray CJL, Vos T, Lozano R, Naghavi M, Flaxman AD, Michaud C, et al. Disability-adjusted life years (DALYs) for 291 diseases and injuries in 21 regions, 1990–2010: A systematic analysis for the Global Burden of Disease Study 2010. Lancet. 2012;380(9859):2197–2223. doi: 10.1016/S0140-6736(12)61689-4 [DOI] [PubMed] [Google Scholar]
  • 3.Wise BL, Niu J, Yang M, Lane NE, Harvey W, Felson DT, et al. Patterns of compartment involvement in tibiofemoral osteoarthritis in men and women and in whites and African Americans. Arthritis Care Res. 2012;64(6):847–852. doi: 10.1002/acr.21606 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Mündermann A, Dyrby CO, D’Lima DD, Colwell CW, Andriacchi TP. In vivo knee loading characteristics during activities of daily living as measured by an instrumented total knee replacement. J Orthop Res. 2008;26(9):1167–1172. doi: 10.1002/jor.20655 [DOI] [PubMed] [Google Scholar]
  • 5.Kutzner I, Trepczynski A, Heller MO, Bergmann G. Knee adduction moment and medial contact force-facts about their correlation during gait. PLoS One. 2013;8(12):8–15. doi: 10.1371/journal.pone.0081036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Hurwitz DE, Ryals AB, Case JP, Block JA, Andriacchi TP. The knee adduction moment during gait in subjects with knee osteoarthritis is more closely correlated with static alignment than radiographic disease severity, toe out angle and pain. J Orthop Res. 2002;20(1):101–107. doi: 10.1016/S0736-0266(01)00081-X [DOI] [PubMed] [Google Scholar]
  • 7.Sharma L, Hurwitz DE, Thonar EJ-MA, Sum JA, Lenz ME, Dunlop DD, et al. Knee adduction moment, serum hyaluronan level, and disease severity in medial tibiofemoral osteoarthritis. Arthritis Rheum. 1998;41(7):1233–1240. doi: [DOI] [PubMed] [Google Scholar]
  • 8.Foroughi N, Smith R, Vanwanseele B. The association of external knee adduction moment with biomechanical variables in osteoarthritis: A systematic review. Knee. 2009;16(5):303–309. doi: 10.1016/j.knee.2008.12.007 [DOI] [PubMed] [Google Scholar]
  • 9.Miyazaki T, Wada M, Kawahara H, Sato M, Baba H, Shimada S. Dynamic load at baseline can predict radiographic disease progression in medial compartment knee osteoarthritis. Ann Rheum Dis. 2002;61(7):617–622. doi: 10.1136/ard.61.7.617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Mündermann A, Dyrby CO, Hurwitz DE, Sharma L, Andriacchi TP. Potential strategies to reduce medial compartment loading in patients with knee osteoarthritis of varying severity: reduced walking speed. Arthritis Rheum. 2004;50(4):1172–1178. doi: 10.1002/art.20132 [DOI] [PubMed] [Google Scholar]
  • 11.Mündermann A, Dyrby CO, Andriacchi TP. Secondary gait changes in patients with medial compartment knee osteoarthritis: Increased load at the ankle, knee, and hip during walking. Arthritis Rheum. 2005;52(9):2835–2844. doi: 10.1002/art.21262 [DOI] [PubMed] [Google Scholar]
  • 12.Amin S, Luepongsak N, McGibbon CA, LaValley MP, Krebs DE, Felson DT. Knee adduction moment and development of chronic knee pain in elders. Arthritis Care Res. 2004;51(3):371–376. doi: 10.1002/art.20396 [DOI] [PubMed] [Google Scholar]
  • 13.Prodromos CC, Andriacchi TP, Galante JO. A relationship between gait and clinical changes following high tibial osteotomy. J Bone Jt Surg. 1985;67(8):1188–1194. doi: 10.2106/00004623-198567080-00007 [DOI] [PubMed] [Google Scholar]
  • 14.Simic M, Hunt MA, Bennell KL, Hinman RS, Wrigley TV. Trunk lean gait modification and knee joint load in people with medial knee osteoarthritis: The effect of varying trunk lean angles. Arthritis Care Res. 2012;64(10):1545–1553. doi: 10.1002/acr.21724 [DOI] [PubMed] [Google Scholar]
  • 15.Mündermann A, Asay JL, Mündermann L, Andriacchi TP. Implications of increased medio-lateral trunk sway for ambulatory mechanics. J Biomech. 2008;41(1):165–170. doi: 10.1016/j.jbiomech.2007.07.001 [DOI] [PubMed] [Google Scholar]
  • 16.Dunphy C, Casey S, Lomond A, Rutherford D. Contralateral pelvic drop during gait increases knee adduction moments of asymptomatic individuals. Hum Mov Sci. 2016;49:27–35. doi: 10.1016/j.humov.2016.05.008 [DOI] [PubMed] [Google Scholar]
  • 17.Fregly BJ, Reinbolt JA, Rooney KL, Mitchell KH, Chmielewski TL. Design of patient-specific gait modifications for knee osteoarthritis rhabilitation. IEEE Trans Biomed Eng. 2007;54(9):1687–1695. doi: 10.1109/TBME.2007.891934 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gerbrands TA, Pisters MF, Theeven PJR, Verschueren S, Vanwanseele B. Lateral trunk lean and medializing the knee as gait strategies for knee osteoarthritis. Gait Posture. 2017;51:247–253. doi: 10.1016/j.gaitpost.2016.11.014 [DOI] [PubMed] [Google Scholar]
  • 19.Shull PB, Silder A, Shultz R, Dragoo JL, Besier TF, Delp SL, et al. Six-week gait retraining program reduces knee adduction moment, reduces pain, and improves function for individuals with medial compartment knee osteoarthritis. J Orthop Res. 2013;31(7):1020–1025. doi: 10.1002/jor.22340 [DOI] [PubMed] [Google Scholar]
  • 20.Hunt MA, Takacs J. Effects of a 10-week toe-out gait modification intervention in people with medial knee osteoarthritis: A pilot, feasibility study. Osteoarthr Cartil. 2014;22(7):904–911. doi: 10.1016/j.joca.2014.04.007 [DOI] [PubMed] [Google Scholar]
  • 21.Gerbrands TA, Pisters MF, Vanwanseele B. Individual selection of gait retraining strategies is essential to optimally reduce medial knee load during gait. Clin Biomech. 2014;29(7):828–834. doi: 10.1016/j.clinbiomech.2014.05.005 [DOI] [PubMed] [Google Scholar]
  • 22.Shull PB, Huang Y, Schlotman T, Reinbolt JA. Muscle force modification strategies are not consistent for gait retraining to reduce the knee adduction moment in individuals with knee osteoarthritis. J Biomech. 2015;48(12):3163–3169. doi: 10.1016/j.jbiomech.2015.07.006 [DOI] [PubMed] [Google Scholar]
  • 23.Uhlrich SD, Silder A, Beaupre GS, Shull PB, Delp SL. Subject-specific toe-in or toe-out gait modifications reduce the larger knee adduction moment peak more than a non-personalized approach. J Biomech. 2018;66:103–110. doi: 10.1016/j.jbiomech.2017.11.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.van den Noort J, van der Esch M, Steultjens MPM, Dekker J, Schepers M, Veltink PH, et al. Ambulatory measurement of the knee adduction moment in patients with osteoarthritis of the knee. J Biomech. 2013;46(1):43–49. doi: 10.1016/j.jbiomech.2012.09.030 [DOI] [PubMed] [Google Scholar]
  • 25.Cao Z, Hidalgo G, Simon T, Wei S, Sheikh Y. OpenPose : Realtime multi-person 2D pose estimation using part affinity fields. IEEE Trans Pattern Anal Mach Intell. Published online 2019:1–14. doi: 10.1109/tpami.2019.2929257 [DOI] [PubMed]
  • 26.Iskakov K, Burkov E, Lempitsky V, Malkov Y. Learnable triangulation of human pose. Proc IEEE Int Conf Comput Vis. 2019;2019-Octob:7717–7726. doi: 10.1109/ICCV.2019.00781 [DOI]
  • 27.Nakano N, Sakura T, Ueda K, Omura L, Kimura A, Iino Y, et al. Evaluation of 3D Markerless Motion Capture Accuracy Using OpenPose With Multiple Video Cameras. Front Sport Act Living. 2020;2(May):1–9. doi: 10.3389/fspor.2020.00050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Halilaj E, Rajagopal A, Fiterau M, Hicks JL, Hastie TJ, Delp SL. Machine learning in human movement biomechanics: Best practices, common pitfalls, and new opportunities. J Biomech. 2018;81:1–11. doi: 10.1016/j.jbiomech.2018.09.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kidziński Ł, Yang B, Hicks JL, Rajagopal A, Delp SL, Schwartz MH. Deep neural networks enable quantitative movement analysis using single-camera videos. Nat Commun. 2020;11(4054). doi: 10.1038/s41467-020-17807-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Favre J, Hayoz M, Erhart-Hledik JC, Andriacchi TP. A neural network model to predict knee adduction moment during walking based on ground reaction force and anthropometric measurements. J Biomech. 2012;45(4):692–698. doi: 10.1007/s11103-011-9767-z.Plastid [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Ardestani MM, Zhang X, Wang L, Lian Q, Liu Y, He J, et al. Human lower extremity joint moment prediction: A wavelet neural network approach. Expert Syst Appl. 2014;41(9):4422–4433. doi: 10.1016/j.eswa.2013.11.003 [DOI] [Google Scholar]
  • 32.Stetter BJ, Krafft FC, Ringhof S, Stein T, Sell S. A Machine Learning and Wearable Sensor Based Approach to Estimate External Knee Flexion and Adduction Moments During Various Locomotion Tasks. Front Bioeng Biotechnol. 2020;8(January). doi: 10.3389/fbioe.2020.00009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wang C, Chan PPK, Lam BMF, Wang S, Zhang JH, Chan ZYS, et al. Real-Time Estimation of Knee Adduction Moment for Gait Retraining in Patients with Knee Osteoarthritis. IEEE Trans Neural Syst Rehabil Eng. 2020;28(4):888–894. doi: 10.1109/TNSRE.2020.2978537 [DOI] [PubMed] [Google Scholar]
  • 34.Johnson WR, Mian A, Lloyd DG, Alderson JA. On-field player workload exposure and knee injury risk monitoring via deep learning. J Biomech. 2019;93:185–193. doi: 10.1016/j.jbiomech.2019.07.002 [DOI] [PubMed] [Google Scholar]
  • 35.Aljaaf AJ, Hussain AJ, Fergus P, Przybyla A, Barton GJ. Evaluation of machine learning methods to predict knee loading from the movement of body segments. Proc Int Jt Conf Neural Networks. 2016;2016-Octob:5168–5173. doi: 10.1109/IJCNN.2016.7727882 [DOI]
  • 36.Piazza SJ, Erdemir A, Okita N, Cavanagh PR. Assessment of the functional method of hip joint center location subject to reduced range of hip motion. J Biomech. 2004;37(3):349–356. doi: 10.1016/S0021-9290(03)00288-4 [DOI] [PubMed] [Google Scholar]
  • 37.Tiulpin A, Thevenot J, Rahtu E, Lehenkari P, Saarakkala S. Automatic knee osteoarthritis diagnosis from plain radiographs: A deep learning-based approach. Sci Rep. 2018;8:1–15. doi: 10.1038/s41598-018-20132-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wang Z, Yan W, Oates T. Time series classification from scratch with deep neural networks: A strong baseline. arXiv. 2016;(1611.06455). doi: 10.1109/IJCNN.2017.7966039 [DOI]
  • 39.Kidziński L event-detector-train. Published 2018. https://github.com/kidzik/event-detector-train
  • 40.Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks Xavier. Proc 13th Int Conf Artif Intell Stat. Published online 2010:249–256. doi: 10.1109/ijcnn.1993.716981 [DOI]
  • 41.Kingma DP, Ba J. Adam: A method for stochastic optimization. Int Conf Learn Represent. Published online 2015:1–15. http://arxiv.org/abs/1412.6980
  • 42.Tibshirani R Regression shrinkage and selection via the lasso. J R Stat Soc Ser B. 1996;58(1):267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x [DOI] [Google Scholar]
  • 43.Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33(1):1–22. doi: 10.18637/jss.v033.i01 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.R Core Team. A language and environment for statistical computing. Published online 2019. https://www.r-project.org/
  • 45.Nair V, Hinton GE. Rectified linear units improve Restricted Boltzmann machines. In: ICML 2010 - Proceedings, 27th International Conference on Machine Learning. ; 2010. [Google Scholar]
  • 46.Itti L, Koch C. A saliency-based search mechanism for overt and covert shifts of visual attention. Vision Res. 2000;40(10–12):1489–1506. doi: 10.1016/S0042-6989(99)00163-7 [DOI] [PubMed] [Google Scholar]
  • 47.Kotikalapudi R, and Contributors. keras-vis. Published 2017. https://github.com/raghakot/keras-vis.
  • 48.Efron B, Tibshirani R. An Introduction to the Bootstrap. Chapman & Hall; 1993. doi: 10.1007/978-1-4899-4541-9 [DOI] [Google Scholar]
  • 49.Karatsidis A, Jung M, Schepers HM, Bellusci G, de Zee M, Veltink PH, et al. Predicting kinetics using musculoskeletal modeling and inertial motion capture. arXiv. Published online 2018. doi:http://arxiv.org/abs/1801.01668
  • 50.Shull PB, Shultz R, Silder A, Dragoo JL, Besier TF, Cutkosky MR, et al. Toe-in gait reduces the first peak knee adduction moment in patients with medial compartment knee osteoarthritis. J Biomech. 2013;46(1):122–128. doi: 10.1016/j.jbiomech.2012.10.019 [DOI] [PubMed] [Google Scholar]
  • 51.Jenkyn TR, Erhart JC, Andriacchi TP. An analysis of the mechanisms for reducing the knee adduction moment during walking using a variable stiffness shoe in subjects with knee osteoarthritis. J Biomech. 2011;44(7):1271–1276. doi: 10.1016/j.jbiomech.2011.02.013 [DOI] [PubMed] [Google Scholar]
  • 52.Ghorbani A, Abid A, Zou J. Interpretation of Neural Networks Is Fragile. Proc Thirty-Third AAAI Conf Artif Intell. 2019;33(01):3681–3688. doi: 10.1609/aaai.v33i01.33013681 [DOI] [Google Scholar]
  • 53.Wang JW, Kuo KN, Andriacchi TP, Galante JO. The influence of walking mechanics and time on the results of proximal tibial osteotomy. J Bone Joint Surg Am. 1990;72(6):905–909. doi: 10.2106/00004623-199072060-00017 [DOI] [PubMed] [Google Scholar]
  • 54.Konrath JM, Karatsidis A, Martin Schepers H, Bellusci G, de Zee M, Andersen MS. Estimation of the knee adduction moment and joint contact force during daily living activities using inertial motion capture. Sensors. 2019;19(7). doi: 10.3390/s19071681 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES