Types of anomalies in two-dimensional video-based gait analysis in uncontrolled environments

Yuki Sugiyama; Kohei Uno; Yusuke Matsui

doi:10.1371/journal.pcbi.1009989

. 2023 Jan 19;19(1):e1009989. doi: 10.1371/journal.pcbi.1009989

Types of anomalies in two-dimensional video-based gait analysis in uncontrolled environments

Yuki Sugiyama ^1,^#, Kohei Uno ^2,^#, Yusuke Matsui ^2,^3,^*,^#

Editor: Thurmon Lockhart⁴

PMCID: PMC9851542 PMID: 36656820

Abstract

Two-dimensional video-based pose estimation is a technique that can be used to estimate human skeletal coordinates from video data alone. It is also being applied to gait analysis and in particularly, due to its simplicity of measurement, it has the potential to be applied to gait analysis of large populations. However, it is considered difficult to completely homogenize the environment and settings during the measurement of large populations. Therefore, it is necessary to appropriately deal with technical errors that are not related to the biological factors of interest. In this study, by analyzing a large cohort database, we have identified four major types of anomalies that occur during gait analysis using OpenPose in uncontrolled environments: anatomical, biomechanical, and physical anomalies and errors due to estimation. We have also developed a workflow for identifying and correcting these anomalies and confirmed that this workflow is reproducible through simulation experiments. Our results will help obtain a comprehensive understanding of the anomalies to be addressed during pre-processing for 2D video-based gait analysis of large populations.

Author summary

Gait is one of the important biomarkers of numerous health conditions. With developing mobile health technologies, it is becoming easier to measure our health. However, establishing evidence is a critical issue to providing preventive medicine, we need to be able to collect data from a large population. Two-dimensional video-based pose estimation can be a solution for the gait analysis of such a population. However, the technical accuracy and limitations of this analysis method have not yet been sufficiently discussed. In this study, by analyzing the largest database currently available, we systematically identified four types of technical anomalies that occur during gait measurement: anatomical, biomechanical, and physical anomalies and errors dues to estimation. We have also shown how to deal with these issues and made solutions available as software so that researchers can reproduce them. In the future, increasing numbers of studies will use 2D video-based pose estimation to research health-related gait among large populations. We believe that our work will provide a guideline for researchers and clinicians involved in these studies to discuss design and algorithms.

This is a PLOS Computational Biology Methods paper.

Introduction

Gait is a simple biomarker of the human condition [1], and its effectiveness as a clinical or preclinical marker for diseases such as nervous system abnormalities and skeletal muscle abnormalities has been revealed in various fields [2–6]. In recent years, with the advancement of artificial intelligence applications, several gait analysis methods based on computer vision have been proposed [7–10]. These methods are characterized by the extraction of parameters using images or videos of walking as the input. Two such approaches have been proposed to date: one approach is to extract features based on appearance, such as walking silhouette [11, 12], and the other is to extract gait parameters, such as a series of joint positions and joint angles, by fitting human joint models to the images using estimation [13].

One algorithm using the latter approach, OpenPose, can estimate joint coordinates at up to 135 key points, such as “body”, “feet”, “hands” and “face” for multiple subjects in an image by learning a vector space called Part Affinity Fields (PAF) for associations between anatomical joints based on a deep learning model. Previous research has suggested that this joint estimation capability is sufficient to some extent even in videos with many dynamic factors [14]. Compared with conventional optical motion capture, approximately 80% of the estimated joint coordinates are less than 30 mm with good accuracy [15].

These computer vision-based gait analysis methods can automatically analyze a large number of joint coordinates with only digital video as input and can be used in any environment, including homes and clinics, requiring little time, cost, and effort compared with conventional optical motion capture. Large-scale human gait analysis can be conducted more easily than ever before. However, there are some issues to be solved in gait analysis applications, such as a certain amount of unexpected noise and the false detection of multiple persons even though only one person is walking [16]. In addition, a reproducible and standardized analysis workflow is still lacking. Stenum et al. proposed a comprehensive analysis for obtaining gait parameters based on OpenPose during gait in a controlled environment [16]. This workflow uses video as input, preprocessing of joint coordinates obtained from OpenPose, and extraction of gait parameters such as step length.

However, workflows for gait analysis in uncontrolled environments have not been studied sufficiently. To capture gait futures in large populations in heterogenous environments, a robust approach is needed. Various factors are assumed to potentially affect the accuracy of joint estimation using OpenPose, including camera performance, the distance between the camera and the subject, walking speed, clothing, and walking environment. Several existing studies seem to provide a solution, although they assume a somewhat controlled data acquisition environment. Seethapathi et al. reviewed six categories of pose estimation problems that can hinder the estimation of kinematic parameters in the application of OpenPose in motion science, and suggested several possible solutions; for example, post-processing and elaboration during data acquisition, e.g., size estimation incorporating reference objects [17]. The workflow reported by Stenum et al. [16] pointed out false person detection and left-right swapping of lower limbs, but it is manual, with detection, correction, and exclusion being based on visual inspection. In an environment that can be controlled to some extent or that includes few subjects, it may be possible to devise data measurement methods or to deal with measurement errors through manual labor. However, both approaches may be limited when measuring large populations of thousands to tens of thousands of people in several different environments, such as in hospitals and other facilities.

When considering the efficient post-processing approach in such an environment, it is useful to perform a statistical examination of the error structure based on large-scale data. Fortunately, in recent years, a gait database consisting of 10,307 individuals has been made public using OpenPose technology [18]. This database contains pose estimations for an unspecified number of visitors to a certain facility during walking at 25 frames per second (fps) using a multi-view camera in an uncontrolled manner, and 18 joints are estimated for one gait cycle [18]. Although this database was originally intended for the biometrics field, we thought technical anomalous errors in uncontrolled environments could be investigated using this data.

The main purpose of this study was to classify the types of anomalies in pose estimation using OpenPose during the gait cycle in an uncontrolled environment to obtain a roadmap for analyzing large-scale gait data with OpenPose. Through our analysis, we identified four main types of anomalies: anatomical, biomechanical, and physical anomalies and errors due to estimation. In addition, we present a data processing workflow for dealing with the errors that we have categorized and demonstrate its reproducibility through simulation experiments. The code used in this study is online (URL: https://github.com/matsui-lab/PoseFixeR).

Results

Overview of anomaly types

This section provides an overview of the types of anomaly errors in OpenPose measurements during gait that are presented in this paper. Individual anomaly types are discussed in detail in the section below. We conducted a comprehensive analysis of the database (see Materials and Methods), while partially referring to the existing literature [16, 17], and identified four main types of anomalous errors that should be preprocessed when estimating joint coordinates during gait using OpenPose in an uncontrolled environment (Fig 1, Table 1). For convenience, the four types are categorized as anatomical, biomechanical, and physical, as well as errors due to the inherent estimation accuracy of OpenPose. Note that the categories in this study were labeled based on the patterns observed in a data-driven manner, and are used for the convenience of interpretation and to facilitate discussion. Therefore, they are not based on strict anatomical, biomechanical, or physical definitions and these types are not completely independent, as they overlap with each other and, in some cases, are composite. We further subdivided these four categories in terms of detection and correction methods and finally classified them into ten types that we believe should be considered during analysis (Fig 1).

Fig 1 — The left panel shows the OpenPose skeletal model and the name of each part. The right panel shows the anomaly types corresponding to those in Table 1. Estimation accuracy in Table 1 is excluded from the figure for convenience of illustration. ROM, range of motion; COG, center of gravity.

Table 1. Percentage of each anomaly type.

	ID	Anomaly types	Whole body (18 points)	Only lower limbs (6 points)
Anatomical constraints	a	Undetected parts	97.7	20.5
	b	Leg length	14.8
	c	Shoulder joint distance	16.1	-
Biomechanical constraints	d	Ankle joint distance	11.6
	e	Range of motion	69.4
	f	Center of gravity	6.3
Physical constraints	g	Side of legs	29.0
	h	Time transition	99.7	93.4
	i	Grounding	2.1
Estimation accuracy	j	Reliability	100	43.2

Open in a new tab

Anomalies for the whole body are reported as a percentage of the 18 whole body parts, and anomalies for the lower limbs are reported as a percentage of the six lower limbs parts. IDs correspond to the right panel in Fig 1.

Anatomical constraints

Anatomical constraints refer to a series of anomalous errors that could be considered deviations with respect to standard human anatomical constraints (Fig 1A). The most common case observed here was extreme lengthening at the joint located on the opposite side to the camera direction (Table 1). This is thought to be caused by forcibly making predictions on unobserved joints. We were unable to find any existing studies explicitly addressing and discussing this anomaly. However, in approximately 15% of the subjects, we also observed cases where the shoulder width suddenly increased even though the video was taken from the side (Figs 1C and S3) and cases where the skeletal length of the lower limbs became extremely short or long before or after a certain point in time, regardless of the camera direction (Fig 1B).

Ideally, we would like to be able to compare the estimated values with the baseline skeletal structure of each person; however, if this is difficult, it may be possible to estimate them by assuming a standard human anatomical skeletal structure and detecting deviations from it. We believe that the relative proportions of standard human skeletal length estimated by a cohort study [19] could be used to predict the joint coordinate values in advance. Specifically, if we consider a normalized coordinate with Neck (Ne) as the origin and apply the standard skeletal length (see Materials and Methods), we can predict the range within which the joint coordinate values should lie. We used this method to identify joint coordinates that were far outside the normal range (see Materials and Methods).

Biomechanical constraints

Anomalous errors deviating from the biomechanical constraints were also observed, mainly in key parameters in the gait analysis, such as the range of motion of the joints (ROM), which is the external angle of the axis connecting Ne and Right Hip (Rh) (or Left Hip [Lh]) and Right Knee (Rk) (or Left Knee [Lk]) (Fig 1D); center of gravity of the trunk (COG) representing the inclination of the trunk (Fig 1E); and stride length with ankle distances (Fig 1F). In particular, the anomalies related to ROM were the highest, accounting for nearly 70% of all subjects (Table 1), demonstrating the difficulty of biomechanical analysis in uncontrolled environments. It should also be noted that the skeletal model in OpenPose does not exactly match actual anatomical skeletal structures (see Discussion).

Furthermore, there are biases in the estimates depending on the distance between the camera and the subject, as well as errors in the pose estimation itself. In fact, when compared with the ROM estimated using a gyro sensor [20], a shift of approximately 10° to 20° was observed, and the variance tended to be large (Fig 2, Table 2). Therefore, instead of directly applying criteria based on other measurement methods, such as gyroscope sensors, OpenPose’s baseline should be estimated to separate the signal from the noise. We detected unnatural errors biomechanically, by calculating thresholds based on statistical confidence intervals derived from the database (see Materials and Methods).

Fig 2 — The ranges in red and blue shading are 95% confidence intervals. The black shaded area indicates the maximum and minimum ROM during gait using inertial sensors as reported by Park et al. ROM, range of motion [20].

Table 2. Mean shift and variability of ROM at the hip and knee joints, comparing published gyro-sensor-based statistics for ROM with those obtained in the present database analysis.

(Unit:deg)	Variable	System	Mean±SD (before anomaly exclusion)	Mean±SD (after anomaly exclusion)
Hip-joint angle (+ flexion/—extension)	Max	OpenPose (R)	27.00±12.40	23.88±8.08
		OpenPose (L)	22.28±9.81	17.27±6.12
		MocapNET	25.70±3.85	25.70±3.85
	Min	OpenPose (R)	-33.07±10.50	-28.35±6.08
		OpenPose (L)	-36.37±11.17	-32.14±5.94
		MocapNET	-14.41±2.23	-14.41±2.23
	ROM	Open Pose(R)	60.06±14.86	52.23±8.96
		Open Pose(L)	58.65±14.60	49.41±7.74
		MocapNET	39.88±3.22	39.88±3.22
Knee-joint angle (+ flexion/—extension)	Max	Open Pose(R)	64.20±21.83	58.17±9.63
		Open Pose(L)	71.29±20.88	64.43±10.87
		MocapNET	64.58 ± 5.21	64.58 ± 5.21
	Min	OpenPose (R)	-4.88±8.29	-1.54±6.24
		OpenPose (L)	-1.10±7.05	0.28±6.26
		MocapNET	-3.18±3.11	-3.18±3.11
	ROM	OpenPose (R)	68.34±19.74	59.71±11.08
		OpenPose (L)	72.39±18.99	64.15±11.98
		MocapNET	67.20±4.66	67.20±4.66

Open in a new tab

ROM, range of motion; SD, standard deviation

Although COG and stride length anomalies were relatively infrequent, they were distinctly different from the COG and stride length of natural gait and, thus, had to be detected and corrected. In a healthy person, COG is unlikely to fluctuate significantly throughout the gait cycle. We used clustering to identify the positions of the hip (Rh or Lh), knee (Rk or Lk), and ankle (Right Ankle [Ra] or Left Ankle [La]), which were considered to be off-center (see Materials and Methods, S1 Fig).

For stride length, we observed cases where the distance of the ankle joint (Ra and La) was underestimated or overestimated for a particular frame or for the entire frame. For the other cases, we focused on the maximum stride length in the gait cycle and derived a threshold based on statistical confidence intervals to identify the error (S2 Fig).

Physical constraints

Gait is a continuous motion in time that depends on the frame rate of the video recording in OpenPose, but it is difficult to imagine instantaneous motion beyond the physical constraints in the normal range. Therefore, motion with extremely discontinuous changes is considered to be due to errors. We considered two types of errors: reversals of the left and right legs (Fig 1G) and discontinuous frame transitions (Fig 1H). In the latter case, it was sufficient to detect the change point in the time series. However, it was not sufficient to detect the point at which the legs switched; thus, the deviated state was detected using the direction vector in periodic motion (see Materials and Methods). Another physically unnatural case was floating above ground surface (Fig 1I). In this case, the normal range from the head to the ankle was estimated in advance based on the standard human skeletal model [19], and deviations from this range were detected (see Materials and Methods).

Estimation accuracy

The reliability score in OpenPose is calculated based on the distance from the correct location to each pixel in the image [21]. The pixels that are the shortest distance from the correct location are considered to have the highest reliability, whereas a low reliability score suggests that the estimated joint may not exist in the image or that it cannot be estimated. The variability of the actual reliability score tended to be lower for certain joints (Fig 3B). It was not easy to identify the cause of the error based on the reliability score, which is an inherent problem in the deep learning algorithm of OpenPose. However, it is possible to determine the reliability score from a statistical perspective. The overall distribution of the reliability scores was bimodal (Fig 3A), suggesting the existence of two potential groups of low and high reliability. We estimated these two groups based on k-means clustering and detected the group with low accuracy.

Fig 3 — (a) Distribution of reliability scores for all subjects. Two groups, low confidence (blue) and high confidence (red), were assumed and classified by clustering. (b) Reliability score per part. The upper panel shows the left direction, and the lower panel shows the right direction.

Accuracy of pose estimation during gait in an uncontrolled environment

We used a largescale database of OpenPose gait data to examine the reliability of OpenPose estimates of joint coordinates. However, the database used in this study only contained OpenPose data, so it was not possible to evaluate the data using external criteria. Instead, using our workflow, we calculated percentages by defining joint coordinates that did not contain the anomalies shown in Table 1 as negative examples of the anomalies. The percentages of each joint determined to be normal and the percentage after correction using our workflow are summarized in Fig 4. The workflow will be described in the next section.

Fig 4 — The accuracy based on the anomaly types (i.e., the percentage not containing any of the anomalies listed in Table 1) is shown. The numbers in parentheses represent the accuracy after correcting the proposed workflow.

The accuracy for the joints on the opposite side to the camera direction was low, but on the same side, the accuracy varied, ranging from 53.8% to 93.5%. In particular, the knee (Rk or Lk) and ankle (Ra or La) contained some anomalous errors in nearly half of the subjects, strongly suggesting that they need to be addressed for downstream analysis to be performed properly.

Workflow for anomaly detection and correction

To perform gait analysis using OpenPose in an uncontrolled environment, many anomalous errors must be addressed during preprocessing. However, it is unclear which strategies should be used for detection and correction. Here, we present a workflow for detecting and correcting 10 types of anomalies (Fig 5).

Normalization step

The first normalization step transforms the coordinate system and skeletal length into a form that is comparable for all samples. Ne was set as the origin and transformed into joint coordinates corresponding to the ratio of the neck to the trunk length (see Materials and Methods). This allows for a general discussion of statistical properties and the setting of thresholds to deal with anomalous errors, which allows for efficient preprocessing.

Anomaly detection step

Following normalization, the anomalous error of switching the left and right legs was first detected and corrected. This is because leg swapping is a serious error in gait analysis and may affect the detection of other anomaly types. Subsequently, the other nine types were detected.

Correction of anomalous error step

The detected parts with anomalous errors could be considered missing values because they cannot be used in the downstream analysis. To some extent, they can be imputed using information from the previous and subsequent frames via averaging. However, if the overall number of missing values is extremely high in one gait cycle or if there are many continuous missing values, the reliability of the missing-value imputation is itself questionable, and such subjects should be excluded. Two filtering criteria were used: (1) the error percentage of each part of the total number of frames was greater than 40%, and (2) the percentage of consecutive missing frames within a gait cycle was more than 20% of the total number of frames (see Materials and Methods). Finally, we were able to impute the missing values of 66.8% of the participants.

Other adjustments

In addition, because it has been reported that video-based pose estimation causes distortions in the estimated coordinate values depending on the distance from the camera (Fig 6), and because the skeletal length within the same subject is not constant [16], we also corrected the skeletal length (see Materials and Methods).

Fig 6 — The left and right panels show the skeletal lengths of the thigh and lower leg, respectively, which depend on the distance between the camera (from the right side) and the subjects. The red and blue represent the right and left sides of the body, respectively.

Reproducibility of workflow

Simulation model

In order to validate the reliability of the proposed workflow, it is common to compare the results of it with ground-truth data such as BICON. However, since there is no ground-truth data used in result section; thus, we conducted a simulation experiment based on actual data (S1 Text) to validate the reproducibility of the workflow in this study. First, we extracted some samples from a real dataset. Second, we generated a pseudo dataset by adding various errors to these samples. The probability of occurrence of each anomalous error was calculated using relative frequencies (Table 1). We generated 10,000 subjects with 25 frames per gait cycle and evaluated the detection accuracy for each anomaly type and the reproducibility of the true joint coordinates.

Simulation results

First, we confirmed the reproducibility of the detection accuracy for individual anomalous errors each parts; the sensitivity and specificity were 82.6% and 95.1%, respectively. However, the accuracy for each type of anomaly varied from 71.1% to 95.4%, which indicates that the difficulty of anomaly detection varies from joint to joint (Table 3). Pearson’s correlation coefficients of the xy-coordinate values before and after the correction were calculated to determine the accuracy for missing imputation, suggesting that there was a significant reproducibility for the joints overall (0.770 for the x-coordinate; p-value <2.2e-16, 0.961 for the y-coordinate; p-value <2.2e-16). In addition, the reproducibility of the individual joints was evaluated. Regarding the accuracy at the level of individual joints, there was a tendency for the accuracy to be relatively low for undetected joints, leg length, COG, and ankle distance, which is thought to represent the variability due to the relatively low accuracy of anomaly detection (Table 4).

Table 3. Sensitivity of detection of each type of anomaly by workflow.

Anomaly types	Sensitivity (%)
Undetected parts	73.0
Leg length	71.1
Shoulder joint distance	85.2
ROM	82.4
COG	79.3
Ankle joint distance	93.0
Time transition	95.4
Side of legs	88.1
Grounding	89.4

Open in a new tab

ROM, range of motion; COG, center of gravity

Table 4. Reproducibility of values by workflow.

	Correlation (x-coordinate)	Correlation (y-coordinate)
Undetected parts	0.691	0.935
Leg length	0.767	0.929
Shoulder joint distance	0.768	0.946
Ankle joint distance	0.734	0.922
ROM	0.946	0.943
COG	0.814	0.988
Ankle joint distance	0.734	0.922
Time transition	0.888	0.952
Side of legs	0.657	0.967
Grounding	0.937	0.871

Open in a new tab

Pearson’s correlation coefficient between the true value and the estimated value with correction after performing all the steps of the workflow. ROM, range of motion; COG, center of gravity

Implementation

The workflow presented in this paper has been deposited in R code (Github URL: https://github.com/matsui-lab/PoseFixeR). A series of preprocessing steps were performed using the coordinate values obtained from the posture estimation using OpenPose as the input data. Detailed parameter settings are described in a vignette.

Discussion

We used a large database analysis to comprehensively classify technical anomalies using OpenPose gait pose estimation and identified four main types: anatomical, biomechanical, and physical anomalies and errors dues to estimation. We have also presented a method of detecting these anomalies and suggested a workflow for their correction. According to our criteria, all of the 18 parts estimated by OpenPose contained some anomalies, suggesting that proper pre-processing is required before extracting gait features. Moreover, simulation experiments showed that the accuracy of anomaly detection and correction varied depending on anomaly type, implying the need to develop appropriate preprocessing methods for each type.

In particular, the nature of the two-dimensional video-based representations makes skeletal length distortions dependent on the distance between the camera and the subject, and produces anomalous joint coordinate estimates owing to unobserved joints on the opposite side to the camera. The latter anomalous measurement error could be rescued with up to 73% accuracy by our workflow, as shown in the numerical experiments, and it was difficult to capture the complete characteristics of one gait cycle by video recording only on one side. Therefore, it may be effective to develop an experimental design that focuses on specific parameters, such as the motions of specific joints on the video recording side.

The anatomical skeletal models should also be evaluated. In the comparison of inertial sensors and OpenPose in terms of ROMs, we observed a shift of approximately 10°–20°. One of the main reasons for this is the difference in the skeletal model. Taking the hip joint as an example, in the field of orthopedics and rehabilitation, ROM is generally measured by measuring the external angle composed of the axis between the trunk and femur. However, because the hip joint area is simplified in OpenPose, the ROM is calculated from the external angle composed of the axis directly connecting Ne and Rh/Lh, Rh/Lh, and Rk/Lk, which results in different criteria and generates a bias. Therefore, when analyzing gait using OpenPose, especially when interpreting biomechanical features, we should not simply compare the results with standard sensors, such as gyroscopes and goniometers, but should make comparisons based on OpenPose’s baseline.

We showed the anomalies in an uncontrolled environment as comprehensively as possible along with the workflow. However, there are several limitations. First, we did not necessarily cover all anomalies since the results are based on the analysis of a single database. Second, we didn’t compare our results to ground-truth data (e.g., motion capture) because only the OpenPose data was available in the public database. Our proposed workflow should be evaluated in further studies. Third, we determined threshold values (e.g., using in Eq(4)) based on a standard skeletal model [19], so gender and age groups were not considered. For more sophisticated algorithms, appropriate skeletal models should be used. Fourth, there is a problem with OpenPose itself. As Seethapathi et al. [17] pointed out, current pose tracking algorithms do not prioritize measurement of the quantities that are important in movement science, such as three-dimensional position, velocity, and acceleration. Therefore, a new algorithm should be developed that considers important factors in kinematics, so that a more suitable system can be constructed.

In order to increase the clinical applications of video-based 2D pose estimation technology, it will be necessary to find situations where it can be used most effectively from a clinical point of view, and to develop an appropriate analysis algorithm in the future [22]. From a practical standpoint, a reproducible and robust analysis method will be crucial. For example, the development of an algorithm that effectively exploits the latent time-series structure of tracking errors for skeletal coordinates is an important issue for future research. On the other hand, this study also employed some threshold-based methods, such as standard skeletal length ratios derived from existing anatomical knowledge and confidence score thresholds based on statistical distributions. Since the OpenPose skeletal estimates do not take into account anatomical or clinical knowledge, methods that combine existing knowledge may be useful. For example, in the category of “side of legs” (g in Fig 1), which occurs when the left and right legs cross each other, self-evaluation of OpenPose such as frame by frame analysis or coordinate distance between frames cannot detect this anomaly, although it can affect joint angle estimation.

It is also important to study analysis methods for disease signals through clinical research designs, such as comparisons between healthy and diseased groups. For example, there may be an affinity between developing research designs that focus on specific parts of the body and researching algorithms dedicated to the early detection of disease-related signatures. Additionally, the development of open-source software and public databases is also considered to be an important research gap that must be filled to allow further clinical applications and the development of our method as a reproducible research method.

The OpenPose in uncontrolled gait analysis revealed various measurement anomalies in all samples due to technical limitations. However, preprocessing using a combination of anatomical, physical, and biomechanical knowledge and statistical algorithms suggested that nearly 70% of the samples could be rescued, although the accuracy varied for each anomaly type. With the development of appropriate study designs and more sophisticated analysis algorithms in the future, it is expected that accuracy can be improved, even in uncontrolled environments. Since our suggested category (Fig 1) will likely include anomalous errors in the controlled environment, it is considered to be widely applicable, and not limited to pose estimation in unconstrained environments. We hope that our study will be helpful when further studies on large populations have been conducted to accumulate evidence.

Materials and methods

Dataset

In this study, we used the Osaka University-Institute of Scientific and Industrial Research (OU-ISIR) Gait Database [18], a multi-view large population dataset with pose sequences [18] deposited in the OU-ISIR Biometric Database. By capturing the subjects walking approximately 10 m back and forth, we can observe the gait cycle at a normal speed for each sample. The images had a frame rate of 25 fps and an image size of 1280 × 980 pixels. OpenPose shows the x and y coordinates and confidence levels for 18 joints in each frame: nose, neck, right shoulder, right elbow, right wrist, left shoulder, left elbow, left wrist, right groin, right knee, right foot, left groin, left knee, left foot, right eye, left eye, right ear, and left ear.

Workflow details

The details of the workflow are described below. First, we define the mathematical notation. Let (X_it[S], Y_it[S]) be the two-dimensional coordinate value of the joint S of subject i: i = 1,2,…N, at time t: t = 1,2,…,T. To represent an arbitrary joint, the name of the joint is written with a dot symbol, as in (X_it[·], Y_it[·]).

Normalization

In video-based pose estimation, normalization is necessary because the skeletal length varies depending on the height of the subject, the distance from the subject, and the position of the camera. In this study, we followed the method described by An W et al. [18] and normalized the skeletal coordinate values in three steps: (1) centering with the Ne coordinates as the origin, (2) estimating the scale factor of the skeletal length, and (3) normalizing the skeletal coordinates. Specifically, centering was performed using Eq (1).

(X_{i t}^{*} [∙], Y_{i t}^{*} [∙]) = (X_{i t} [∙], Y_{i t} [∙]) - (X_{i t} [N e], Y_{i t} [N e])

(1)

Next, to calculate the “relative scale” for each individual frame, the distance from the midpoint of the post-transformed coordinates of the left hip $(X_{i t}^{*} [L H], Y_{i t}^{*} [L H])$ and the right hip $(X_{i t}^{*} [R H], Y_{i t}^{*} [R H])$ to the neck (0, 0) was calculated as scale_it.

s c a l e_{i t} = \frac{1}{2} \sqrt{{(X_{i t}^{*} [L H] + X_{i t}^{*} [R H])}^{2} + {(Y_{i t}^{*} [L H] + Y_{i t}^{*} [R H])}^{2}}

(2)

Finally, we normalized all the joint coordinates such that the relative scale_it became 1. That is, each joint coordinate was divided by the relative scale to obtain the normalized coordinates $(X_{i t}^{†} [∙], Y_{i t}^{†} [∙])$ .

(X_{i t}^{†} [∙], Y_{i t}^{†} [∙]) = (X_{i t}^{*} [∙], Y_{i t}^{*} [∙]) \times (\frac{1}{s c a l e_{i t}})

(3)

Henceforth, we will use $(X_{i t}^{†} [∙], Y_{i t}^{†} [∙])$ in the following description.

Anatomical constraints

According to the skeletal length in the standard skeletal model, if the length of the trunk, which is the distance from the neck to the hip joint, is 1,then the distance from the neck to the top of the head is 0.362, and the distance from the neck to the ankle joint is 2.283 [19]. If the x-coordinates of the top of the head and ankle joint, which are the two ends on the y-axis, are the same as the x-coordinates of the neck, then the neck coordinates are (0, 0), the head coordinates are (0, -0.362), and the ankle coordinates are (0, 2.283). Therefore, the y-coordinates of all joints were considered to be in the range [-0.362, 2.283]. If we introduce an error ratio (ER) to account for individual differences, we can consider that the coordinates of any joint $Y_{i t}^{†} [∙]$ lie within the following range:

- 0.362 \times E R \leq Y_{i t}^{†} [∙] \leq 2.283 \times E R

(4)

Coordinates that do not satisfy this condition deviate from the expected range.

Biomechanical feature

Here we describe anomaly detection for range of motion (ROM), which is the external angle of the axis connecting Ne and Rh (or Lh) and Rk (or Lk). To identify the joint coordinates deviating from the standard ROM, 95% confidence intervals were constructed based on the empirical distributions derived from the maximum flexion angles of each joint of the lower extremities (both hip and knee joints) obtained by the following calculations, and those outside the confidence intervals were considered anomalies. First, the Rh flexion angle (calculated in the same way as the Lh flexion angle) was calculated as follows:

\cos_{it} [R h] = \frac{(X_{i t}^{†} [N e] - X_{i t}^{†} [R h]) (X_{i t}^{†} [R k] - X_{i t}^{†} [R h]) + (Y_{i t}^{†} [N e] - Y_{i t}^{†} [R h]) (Y_{i t}^{†} [R k] - Y_{i t}^{†} [R h])}{\sqrt{{(X_{i t}^{†} [R k] - X_{i t}^{†} [R h])}^{2} + {(X_{i t}^{†} [N e] - X_{i t}^{†} [R h])}^{2}} + \sqrt{{(Y_{i t}^{†} [R k] - Y_{i t}^{†} [R h])}^{2} + {(Y_{i t}^{†} [N e] - Y_{i t}^{†} [R h])}^{2}}}

(5)

The Rk joint flexion angle (calculated in the same way as the Lk joint flexion angle) was obtained as follows:

{c o s}_{i t} [R k] = \frac{(X_{i t}^{†} [R h] - X_{i t}^{†} [R k]) (X_{i t}^{†} [R a] - X_{i t}^{†} [R k]) + {(Y}_{i t}^{†} [R h] - Y_{i t}^{†} [R k]) {(Y}_{i t}^{†} [R a] - Y_{i t}^{†} [R k])}{\sqrt{{(X_{i t}^{†} [R h] - X_{i t}^{†} [R k])}^{2} + {(X_{i t}^{†} [R a] - X_{i t}^{†} [R k])}^{2}} + \sqrt{{(Y_{i t}^{†} [R a] - Y_{i t}^{†} [R k])}^{2} + {(Y_{i t}^{†} [R h] - Y_{i t}^{†} [R k])}^{2}}}

(6)

From these joint angles, the empirical distribution F was constructed. The joint angles were set as R_it, which is defined as arccos(cos_it[·])×180°/π, and

F (r) = \frac{1}{N T} \sum_{i = 1}^{N} \sum_{t = 1}^{T} I (R_{i t} \leq r) .

(7)

The empirical distribution F for each joint was considered to represent the range of motion distribution in one gait cycle at the population level, including all subjects. Based on this, we constructed a 95% confidence interval CI_95% for each joint and obtained

{C I}_{95 %} = [F^{- 1} (0.025), F^{- 1} (0.975)] .

Observed values outside the confidence interval were considered errors.

For the center of gravity (COG), we considered a point in three-dimensional space consisting of the midpoints of the x-coordinates of the hip (Rh or Lh), knee (Rk or Lk), and ankle (Ra or La) with respect to the perpendicular line from Ne to the ground and identified the group whose distance from the origin deviated using the k-means method. The number of clusters was determined using the gap statistic [23]. The distance dist from the origin Ne to a point can be described as follows:

d i s t = \sqrt{{(\frac{X_{i t}^{†} [R h] + X_{i t}^{†} [L h]}{2})}^{2} + {(\frac{X_{i t}^{†} [R k] + X_{i t}^{†} [L k]}{2})}^{2} + {(\frac{X_{i t}^{†} [R a] + X_{i t}^{†} [L a]}{2})}^{2}}

(8)

For anomalous errors related to ankle joint distance, we used

{L e n g t h}_{a n k l e} = \sqrt{{(X_{i t}^{†} [R a] - X_{i t}^{†} [L a])}^{2}}

(9)

to construct empirical distributions, derive 95% confidence intervals, and detect values those outside the intervals as deviating errors. However, because some anomalies in undetected joints may result in extremely large leg lengths, confidence intervals were derived after excluding those errors in advance.

Physical constraints

Regarding the method used to detect when the left and right legs are reversed, the inversion of the leg joint coordinates at frame t was detected by comparing the leg joint coordinates of the two frames before and after. For this purpose, we detected whether inversion occurs at 3≤t≤T−2 frames. As we could use time-series information, we removed the effects of missing values and outliers in advance. We linearly interpolated the knee joint coordinates of the vectors $X_{i ∙}^{†} [L a], X_{i ∙}^{†} [R a]$ at each frame and then applied spline smoothing to obtain Z_i∙[La], Z_i∙[Ra]. If the skeletal coordinates measured in one gait cycle are reversed for the left and right legs, the coordinates should not move like a pendulum but should be biased to either the left or right. Thus, one of the following should be true for the inversion of the left and right legs:

\frac{1}{T} \sum_{t = 1}^{T} I (Z_{i t} [L a] - Z_{i t} [R a] > 0) < 0.3

(10)

\frac{1}{T} \sum_{t = 1}^{T} I (Z_{i t} [L a] - Z_{i t} [R a] > 0) > 0.7

(11)

Here, I(A) is an indicator function that gives 1 if the condition A is satisfied, and 0 if not. In addition, since it was considered that there is a limit to the movement of the legs during gait,

| Z_{i t} [L a] | < 0.4

(12)

was assumed to be satisfied. After satisfying these conditions, the direction of the leg joint movement changes after frame t, that is,

sign (Z_{i t - 2} [L a] - Z_{i t - 1} [L a]) = sign (Z_{i t - 1} [L a] - Z_{i t} [L a]) = sign (Z_{i t - 2} [L a] - Z_{i t} [L a])

(13)

and

s i g n (Z_{i t} [L a] - Z_{i t - 1} [L a])

= s i g n (Z_{i t + 1} [L a] - Z_{i t + 2} [L a])

= s i g n (Z_{i t} [L a] - Z_{i t + 2} [L a])

\neq sign (Z_{i t - 2} [L a] - Z_{i t - 1} [L a])

(14)

are satisfied, and the joint coordinates of the left and right legs are considered to be reversed, where sign(∙) is a sign function.

To detect errors in ground contact, the reference value of the y-axis coordinates of the legs was set to 2.283. When the y-axis coordinates of both legs deviated sufficiently from the reference value, either upward or downward, it was determined that the person was not grounded. That is

Y_{i t} [L h] > 1.2 \times 2.283 a n d Y_{i t} [R h] > 1.2 \times 2.283

(15)

Y_{i t} [L h] < 0.8 \times 2.283 a n d Y_{i t} [R h] < 0.8 \times 2.283 .

(16)

The error of the frame transition was defined a value as more than a certain distance from the coordinate of frame t-1 or frame t+1. In other words,

\sqrt{{(X_{i t}^{†} [∙] - X_{i t - 1}^{†} [∙])}^{2} + {(Y_{i t}^{†} [∙] - Y_{i t - 1}^{†} [∙])}^{2}} \geq J U M P

\sqrt{{(X_{i t}^{†} [∙] - X_{i t + 1}^{†} [∙])}^{2} + {(Y_{i t}^{†} [∙] - Y_{i t + 1}^{†} [∙])}^{2}} \geq J U M P

(17)

When one or more of the following conditions were satisfied, the joint coordinate was treated as an anomalous error. In this case, JUMP = 0.5 (1/25 s comparison) and 0.7 (2/25 s comparison).

Selecting subjects for imputation

We excluded subjects with many anomalous error frames that we defined because it would be difficult to extract gait features in the downstream analysis. Exclusion criteria were as follows: (1) the error rate of each region was more than 40% of the total number of frames and (2) the missing values were greater than 20% of the total number of frames in one gait cycle (S5 Fig). The first criterion was set considering that errors could be detected in at least 20% of samples, even in controlled environments. In addition, technical errors caused by other factors may occur in uncontrolled environments. For the second criterion, we considered that the maximum percentage of each phase per gait cycle was approximately 20% [24].

Supporting information

S1 Text. Details of the simulation.

(DOCX)

Click here for additional data file.^{(40.8KB, docx)}

S1 Fig. Detection of a COG anomaly.

The left panel shows the coordinate values of (X,Y,Z) = (Ankle, Knee, Hip). The best cluster based on the k-means method using the gap static (right panel) is shown by color coding. Clusters very close to the origin were used as normal measurement samples.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

S2 Fig. Maximum ankle joint distance.

Maximum ankle joint distance within one gait cycle for each subject. To illustrate the distribution clearly, skeletal length errors due to undetected sites are excluded.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

S3 Fig. Distribution of shoulder joint distance.

Based on clustering, the four groups were further subdivided into four group each. The group with slightly larger shoulder joint distance (blue, the third group from the left in the histogram) and the group with extremely large shoulder joint distance (red, the fourth group from the left in the histogram) were considered to have abnormal errors.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

S4 Fig. Accuracy of anomaly correction.

The rate of recovery before and after anomaly correction for each part, with workflow. The top and bottom rows show the accuracy for all joints during walking in the right and left directions, respectively, and the left and right rows show the accuracy before and after correction, respectively.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

S5 Fig. Number of consecutive anomalous frames.

Histogram of the number of consecutive anomaly frames for all samples is shown. The samples with a number of consecutive anomalous frames over 20% of the total number of frames were excluded.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

Acknowledgments

We would like to thank Editage [https://www.editage.com] for editing and reviewing this manuscript for English language.

Data Availability

The relevant data and code used in this study are available on github(https://github.com/matsui-lab/PoseFixeR).

Funding Statement

This work was supported by JSPS KAKENHI Grant Number JP20K20657(YM). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1.Baker R. The history of gait analysis before the advent of modern computers. Gait Posture. 2007;26 3: 331–342. doi: 10.1016/j.gaitpost.2006.10.014 [DOI] [PubMed] [Google Scholar]
2.Amboni M, Ricciardi C, Picillo M, De Santis C, Ricciardelli G, Abate F, et al. Gait analysis may distinguish progressive supranuclear palsy and Parkinson disease since the earliest stages. Sci Rep. 2021;11: 9297. doi: 10.1038/s41598-021-88877-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Czech M, Demanuele C, Erb MK, Ramos V, Zhang H, Ho B, et al. The impact of reducing the number of wearable devices on measuring gait in parkinson disease: noninterventional exploratory study. JMIR Rehabil Assist Technol. 2020;7: e17986. doi: 10.2196/17986 [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Mirelman A, Bonato P, Camicioli R, Ellis TD, Giladi N, Hamilton JL, et al. Gait impairments in Parkinson’s disease. Lancet Neurol. 2019;18: 697–708. doi: 10.1016/S1474-4422(19)30044-4 [DOI] [PubMed] [Google Scholar]
5.Rucco R, Agosti V, Jacini F, Sorrentino P, Varriale P, De Stefano M, et al. Spatio-temporal and kinematic gait analysis in patients with Frontotemporal dementia and Alzheimer’s disease through 3D motion capture. Gait Posture. 2017;52: 312–317. doi: 10.1016/j.gaitpost.2016.12.021 [DOI] [PubMed] [Google Scholar]
6.Yogev G, Plotnik M, Peretz C, Giladi N, Hausdorff JM. Gait asymmetry in patients with Parkinson’s disease and elderly fallers: when does the bilateral coordination of gait require attention? Exp Brain Res. 2007;177: 336–346. doi: 10.1007/s00221-006-0676-3 [DOI] [PubMed] [Google Scholar]
7.Bouchrika I, Nixon MS, editors. Model-based feature extraction for gait analysis and recognition. International conference on computer vision/computer graphics collaboration techniques and applications Berlin, Heildelberg: Springer-Verlag. 2007: 150–160. [Google Scholar]
8.Gupta A, Jadhav A, Jadhav S, Thengade A. Human gait analysis based on decision tree, random forest and KNN algorithms. In: Iyer B RA, Gudivada V, editors. Image Vis Comput 2020: 283–289. [Google Scholar]
9.Khan MA, Kadry S, Parwekar P, Damaševičius R, Mehmood A, Khan JA, et al. Human gait analysis for osteoarthritis prediction: a framework of deep learning and kernel extreme learning machine. Complex Intell Syst. 2021. doi: 10.1007/s40747-020-00244-2 [DOI] [Google Scholar]
10.Wang Y, Xia Y, Zhang Y. Beyond view transformation: feature distribution consistent GANs for cross-view gait recognition. Vis Comput 2021;38 1915–1928. [Google Scholar]
11.Iwama H, Okumura M, Makihara Y, Yagi Y. The ou-isir gait database comprising the large population dataset and performance evaluation of gait recognition. IEEE Trans Inf Forensics Secur 2012;7: 1511–1521. [Google Scholar]
12.Takemura N, Makihara Y, Muramatsu D, Echigo T, Yagi Y. Multi-view large population gait dataset and its performance evaluation for cross-view gait recognition. IPSJ Trans Comput Vis Appl 2018;10: 4. [Google Scholar]
13.Tao W, Liu T, Zheng R, Feng H. Gait analysis using wearable sensors. Sensors (Basel). 2012;12: 2255–2283. doi: 10.3390/s120202255 [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Viswakumar A, Rajagopalan V, Ray T, Gottipati P, Parimi C. Development of a robust, simple, and affordable human gait analysis system using bottom-up pose estimation with a smartphone camera. Front Physiol. 2022;12. doi: 10.3389/fphys.2021.784865 [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Nakano N, Sakura T, Ueda K, Omura L, Kimura A, Iino Y, et al. Evaluation of 3D markerless motion capture accuracy using OpenPose with multiple video cameras. Front Sport Active Living. 2020;2:50. doi: 10.3389/fspor.2020.00050 [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Stenum J, Rossi C, Roemmich RT. Two-dimensional video-based analysis of human gait using pose estimation. PLoS Comput Biol. 2021;17: e1008935. doi: 10.1371/journal.pcbi.1008935 [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Seethapathi N., Wang S., Saluja R., Blohm G., & Kording K. P. Movement science needs different pose tracking algorithms. arXiv preprint 2019. arXiv:1907.10226. [Google Scholar]
18.An W, Yu S, Makihara Y, Wu X, Xu C, Yu Y, et al. Performance evaluation of model-based gait on multi-view very large population database with pose sequences. IEEE Trans Biom Behav Identity Sci. 2020;2: 421–430. [Google Scholar]
19.Park S. J., Park S. C., Kim J. H., & Kim C. B. Biomechanical parameters on body segments of Korean adults. International Journal of Industrial Ergonomics. 1999; 23(12): 23–31. [Google Scholar]
20.Park S, Yoon S. Validity evaluation of an inertial measurement unit (IMU) in gait analysis using statistical parametric mapping (SPM). Sensors (Basel). 2021;21. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Cao Z, Hidalgo G, Simon T, Wei S-E, Sheikh Y. OpenPose: realtime multi-person 2D pose estimation using Part Affinity Fields. IEEE Trans Pattern Anal Mach Intell. 2019;43: 172–186. [DOI] [PubMed] [Google Scholar]
22.Hellsten T, Karlsson J, Shamsuzzaman M, Pulkkis G. The potential of computer vision-based marker-less human motion analysis for rehabilitation. Rehabili Process Outcome. 2021;10:11795727211022330. doi: 10.1177/11795727211022330 [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Tibshirani R, Walther G, Hastie T. Estimating the number of clusters in a data set via the gap statistic. J R Stat Soc Series B. 2001;63: 411–423. [Google Scholar]
24.Neumann DA. Kinesiology of the hip: a focus on muscular actions. J Orthop Sports Phys Ther. 2010;40: 82–94. doi: 10.2519/jospt.2010.3025 [DOI] [PubMed] [Google Scholar]

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009989.r001

Decision Letter 0

Feilim Mac Gabhann, Thurmon Lockhart

21 Jul 2022

Dear Matsui,

Thank you very much for submitting your manuscript "Types of anomalies in two-dimensional video-based gait analysis in uncontrolled environments" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Thurmon Lockhart

Guest Editor

PLOS Computational Biology

Feilim Mac Gabhann

Editor-in-Chief

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this manuscript, the authors report on four major types of anomalies that occur during video-based gait analysis using pose estimation and provide a new analysis workflow for correcting these anomalies. Interest in video-based movement analyses using pose estimation have grown rapidly in the past few years, so this is a timely article that will be of interest to the field. My comments and suggestions are included below.

ABSTRACT

No comments.

INTRODUCTION

In the paragraph spanning lines 64-72, the authors introduce some issues with using pose estimation for gait analysis (e.g., false detection of multiple people in the image). However, previous studies (including the cited study by Stenum et al) have offered solutions for some of these issues. Given that the authors’ primary goal in this study is to focus on unconstrained environments, I suggest that they identify specifically the issues that remain with respect to unconstrained environments in this paragraph.

It also seems to me that the issues that this study aims to address are not specific to unconstrained environments. These issues may be present in unconstrained environments, of course, but they may also be present in some well-controlled environments as well. I suggest that the authors could emphasize that the analysis workflow that they present could be helpful for pose estimation-based gait analysis more broadly and not specifically in unconstrained environments.

Line 85 – I believe that the authors mean “frames per second”

There is a previous article that describes many of the issues with using currently available pose estimation algorithms for movement science (https://arxiv.org/pdf/1907.10226.pdf). It would be helpful if the authors could indicate in the introduction how their proposed study differs from this previous work or helps to resolve some of the issues proposed.

RESULTS

It is clear that the four types of anomalies identified are indeed problems for pose estimation-based gait analysis, but it is not clear how these four types of anomalies were selected. Presumably, there are other remaining issues – why/how were these four chosen in particular?

If I am understanding the results correctly, the “fix” that the authors propose for the anomalous errors is to essentially remove the data and perform gap filling, which is a rather crude solution. The authors provide some simulation work to address the reproducibility of the workflow, but this does not provide information about how accurately the gap-filled keypoints approximate ground-truth data. Comparison to some kind of ground-truth data (e.g., motion capture) would be helpful if available.

DISCUSSION

Similar to my comment about the introduction, it would be helpful if the authors could contextualize their approach and findings against many of the issues proposed in the Seethapathi et al paper. It should also be mentioned as a significant limitation that we do not yet understand the accuracy of this proposed approach.

Reviewer #2: The study evaluates the OpenPose tracker in terms of four types of anomalies to investigate whether the tracker errors are due to biological factors or technical errors. Removing identified technical errors, the tracker outputs can be used to perform gait analysis. The paper identifies anomalies and examines the tracker output on a large-scale gait dataset. The core idea of the evaluation looks valuable to understand if motion-capture or similar systems can be replaced with real-time pose trackers with less afford.

However I have two main concerns. 1) The paper categorizes anomaly types, but I am not sure if the proposed categorization truly identifies the actual types. 2) The authors presents a good simulation setup. However, the robustness of their post-processing technique can be better evaluated in terms of accuracy improvement using some ground truth data coming from a motion capture system. In the current version of the text, experimental results presented in Fig.4. looks like such an evaluation, where the ground truth comparison was used. But the experiment and its purpose are not clear with lack of details. The authors should improve the text and explain this part better.

Some comments:

The authors identify 10 different anomalies. They can improve their discussion on why and how these anomalies identified with related works from the literature. Otherwise proposed categorization could be subjective. For instance, undetected-parts (Fig1) or extreme lengthening are identified under anatomical constraints. However, it is not clear why it is not categorized as one category of the estimation error. Similarly, samples identified under COG category can be also identified under anatomical constraints. Therefore, authors should better present how they define these categories and support with related studies. Lines 116-196 presents the constraint but it is not clear what is the base for identifying biomechanics or physical constraints. This should be supported with reference works. Otherwise, it is not clear why we need the proposed categorization but not the estimation error.

The Result section can be revised. The relations of subsection are getting confused. For instance, in the current version of the text, it is not explicitly written that the sections in Line 116-168 expresses different categories of constraints to detect anomalies. Otherwise, it looks like anomalies are categorize as anatomical/biomechanical etc.. More text can help: Paragraph in lines 98-103 can be improved with few sentences or a new paragraph can be added before line 97 to express the purpose and relation of the following sections.

Some anomalies due to tracker error can be pruned based on self-evaluation of the video data, e.g. analysis on frames or consecutive frames to detect abnormal detections. Proposing such a model for post processing can be more robust than identifying various constraints with various threshold values. Therefore, the authors can add a discussion to present the advantages of their threshold-based model.

The accuracies in Fig4 can be given shortly in the text with more discussion on improvements. Assuming multiple anomaly types, the authors can explain in more detail how they compute the accuracies and how they correct the values. Moreover, what is the ground truth used in this experiment?

Line 374 - Line 419: In these sections, authors use some constant values. Can they give some reference study on these values? Are these values coming from some standards? Otherwise, the dataset contains various genders and age groups.

Line 389: The text can be revised to clearly identify parts related to ROM, COG and ankle distance. What part is related to ROM within the text?

Line 393-398: Please check the abbreviations. Some includes Rh, some others include RH (similarly RK and Rk).

Eq5: Can you please check the equation. This looks inconsistent with Eq.6

Eq7: Can you explain in the text what R_it is?

Eq8: Please check the equation. Does the third component need square?

The qualities of some figures are not good to view, higher resolution would be better.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Comput Biol. 2023 Jan 19;19(1):e1009989. doi: 10.1371/journal.pcbi.1009989.r002

Author response to Decision Letter 0

9 Nov 2022

Attachment

Submitted filename: review_comment_add_response.docx

Click here for additional data file.^{(102.1KB, docx)}

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009989.r003

Decision Letter 1

Feilim Mac Gabhann, Thurmon Lockhart

21 Dec 2022

Dear Matsui,

We are pleased to inform you that your manuscript 'Types of anomalies in two-dimensional video-based gait analysis in uncontrolled environments' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology.

Best regards,

Thurmon Lockhart

Guest Editor

PLOS Computational Biology

Feilim Mac Gabhann

Editor-in-Chief

PLOS Computational Biology

***********************************************************

Many thanks for your nice paper.

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have adequately addressed my prior comments and suggestions. I thank them for sharing an interesting paper.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

Reviewer #1: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009989.r004

Acceptance letter

Feilim Mac Gabhann, Thurmon Lockhart

3 Jan 2023

PCOMPBIOL-D-22-00335R1

Types of anomalies in two-dimensional video-based gait analysis in uncontrolled environments

Dear Dr Matsui,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Zsofia Freund

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Text. Details of the simulation.

(DOCX)

Click here for additional data file.^{(40.8KB, docx)}

S1 Fig. Detection of a COG anomaly.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

S2 Fig. Maximum ankle joint distance.

Maximum ankle joint distance within one gait cycle for each subject. To illustrate the distribution clearly, skeletal length errors due to undetected sites are excluded.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

S3 Fig. Distribution of shoulder joint distance.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

S4 Fig. Accuracy of anomaly correction.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

S5 Fig. Number of consecutive anomalous frames.

Histogram of the number of consecutive anomaly frames for all samples is shown. The samples with a number of consecutive anomalous frames over 20% of the total number of frames were excluded.

(TIFF)

Click here for additional data file.^{(4.1MB, tiff)}

Attachment

Submitted filename: review_comment_add_response.docx

Click here for additional data file.^{(102.1KB, docx)}

Data Availability Statement

The relevant data and code used in this study are available on github(https://github.com/matsui-lab/PoseFixeR).

[pcbi.1009989.ref001] 1.Baker R. The history of gait analysis before the advent of modern computers. Gait Posture. 2007;26 3: 331–342. doi: 10.1016/j.gaitpost.2006.10.014 [DOI] [PubMed] [Google Scholar]

[pcbi.1009989.ref002] 2.Amboni M, Ricciardi C, Picillo M, De Santis C, Ricciardelli G, Abate F, et al. Gait analysis may distinguish progressive supranuclear palsy and Parkinson disease since the earliest stages. Sci Rep. 2021;11: 9297. doi: 10.1038/s41598-021-88877-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009989.ref003] 3.Czech M, Demanuele C, Erb MK, Ramos V, Zhang H, Ho B, et al. The impact of reducing the number of wearable devices on measuring gait in parkinson disease: noninterventional exploratory study. JMIR Rehabil Assist Technol. 2020;7: e17986. doi: 10.2196/17986 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009989.ref004] 4.Mirelman A, Bonato P, Camicioli R, Ellis TD, Giladi N, Hamilton JL, et al. Gait impairments in Parkinson’s disease. Lancet Neurol. 2019;18: 697–708. doi: 10.1016/S1474-4422(19)30044-4 [DOI] [PubMed] [Google Scholar]

[pcbi.1009989.ref005] 5.Rucco R, Agosti V, Jacini F, Sorrentino P, Varriale P, De Stefano M, et al. Spatio-temporal and kinematic gait analysis in patients with Frontotemporal dementia and Alzheimer’s disease through 3D motion capture. Gait Posture. 2017;52: 312–317. doi: 10.1016/j.gaitpost.2016.12.021 [DOI] [PubMed] [Google Scholar]

[pcbi.1009989.ref006] 6.Yogev G, Plotnik M, Peretz C, Giladi N, Hausdorff JM. Gait asymmetry in patients with Parkinson’s disease and elderly fallers: when does the bilateral coordination of gait require attention? Exp Brain Res. 2007;177: 336–346. doi: 10.1007/s00221-006-0676-3 [DOI] [PubMed] [Google Scholar]

[pcbi.1009989.ref007] 7.Bouchrika I, Nixon MS, editors. Model-based feature extraction for gait analysis and recognition. International conference on computer vision/computer graphics collaboration techniques and applications Berlin, Heildelberg: Springer-Verlag. 2007: 150–160. [Google Scholar]

[pcbi.1009989.ref008] 8.Gupta A, Jadhav A, Jadhav S, Thengade A. Human gait analysis based on decision tree, random forest and KNN algorithms. In: Iyer B RA, Gudivada V, editors. Image Vis Comput 2020: 283–289. [Google Scholar]

[pcbi.1009989.ref009] 9.Khan MA, Kadry S, Parwekar P, Damaševičius R, Mehmood A, Khan JA, et al. Human gait analysis for osteoarthritis prediction: a framework of deep learning and kernel extreme learning machine. Complex Intell Syst. 2021. doi: 10.1007/s40747-020-00244-2 [DOI] [Google Scholar]

[pcbi.1009989.ref010] 10.Wang Y, Xia Y, Zhang Y. Beyond view transformation: feature distribution consistent GANs for cross-view gait recognition. Vis Comput 2021;38 1915–1928. [Google Scholar]

[pcbi.1009989.ref011] 11.Iwama H, Okumura M, Makihara Y, Yagi Y. The ou-isir gait database comprising the large population dataset and performance evaluation of gait recognition. IEEE Trans Inf Forensics Secur 2012;7: 1511–1521. [Google Scholar]

[pcbi.1009989.ref012] 12.Takemura N, Makihara Y, Muramatsu D, Echigo T, Yagi Y. Multi-view large population gait dataset and its performance evaluation for cross-view gait recognition. IPSJ Trans Comput Vis Appl 2018;10: 4. [Google Scholar]

[pcbi.1009989.ref013] 13.Tao W, Liu T, Zheng R, Feng H. Gait analysis using wearable sensors. Sensors (Basel). 2012;12: 2255–2283. doi: 10.3390/s120202255 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009989.ref014] 14.Viswakumar A, Rajagopalan V, Ray T, Gottipati P, Parimi C. Development of a robust, simple, and affordable human gait analysis system using bottom-up pose estimation with a smartphone camera. Front Physiol. 2022;12. doi: 10.3389/fphys.2021.784865 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009989.ref015] 15.Nakano N, Sakura T, Ueda K, Omura L, Kimura A, Iino Y, et al. Evaluation of 3D markerless motion capture accuracy using OpenPose with multiple video cameras. Front Sport Active Living. 2020;2:50. doi: 10.3389/fspor.2020.00050 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009989.ref016] 16.Stenum J, Rossi C, Roemmich RT. Two-dimensional video-based analysis of human gait using pose estimation. PLoS Comput Biol. 2021;17: e1008935. doi: 10.1371/journal.pcbi.1008935 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009989.ref017] 17.Seethapathi N., Wang S., Saluja R., Blohm G., & Kording K. P. Movement science needs different pose tracking algorithms. arXiv preprint 2019. arXiv:1907.10226. [Google Scholar]

[pcbi.1009989.ref018] 18.An W, Yu S, Makihara Y, Wu X, Xu C, Yu Y, et al. Performance evaluation of model-based gait on multi-view very large population database with pose sequences. IEEE Trans Biom Behav Identity Sci. 2020;2: 421–430. [Google Scholar]

[pcbi.1009989.ref019] 19.Park S. J., Park S. C., Kim J. H., & Kim C. B. Biomechanical parameters on body segments of Korean adults. International Journal of Industrial Ergonomics. 1999; 23(12): 23–31. [Google Scholar]

[pcbi.1009989.ref020] 20.Park S, Yoon S. Validity evaluation of an inertial measurement unit (IMU) in gait analysis using statistical parametric mapping (SPM). Sensors (Basel). 2021;21. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009989.ref021] 21.Cao Z, Hidalgo G, Simon T, Wei S-E, Sheikh Y. OpenPose: realtime multi-person 2D pose estimation using Part Affinity Fields. IEEE Trans Pattern Anal Mach Intell. 2019;43: 172–186. [DOI] [PubMed] [Google Scholar]

[pcbi.1009989.ref022] 22.Hellsten T, Karlsson J, Shamsuzzaman M, Pulkkis G. The potential of computer vision-based marker-less human motion analysis for rehabilitation. Rehabili Process Outcome. 2021;10:11795727211022330. doi: 10.1177/11795727211022330 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009989.ref023] 23.Tibshirani R, Walther G, Hastie T. Estimating the number of clusters in a data set via the gap statistic. J R Stat Soc Series B. 2001;63: 411–423. [Google Scholar]

[pcbi.1009989.ref024] 24.Neumann DA. Kinesiology of the hip: a focus on muscular actions. J Orthop Sports Phys Ther. 2010;40: 82–94. doi: 10.2519/jospt.2010.3025 [DOI] [PubMed] [Google Scholar]

PERMALINK

Types of anomalies in two-dimensional video-based gait analysis in uncontrolled environments

Yuki Sugiyama

Kohei Uno

Yusuke Matsui

Roles

Abstract

Author summary

Introduction

Results

Overview of anomaly types

Fig 1. Types of anomalies during gait using OpenPose.

Table 1. Percentage of each anomaly type.

Anatomical constraints

Biomechanical constraints

Fig 2. Histograms of ROM of hip and knee joints (left direction: red, right direction: blue).

Table 2. Mean shift and variability of ROM at the hip and knee joints, comparing published gyro-sensor-based statistics for ROM with those obtained in the present database analysis.

Physical constraints

Estimation accuracy

Fig 3. Reliability score.

Accuracy of pose estimation during gait in an uncontrolled environment

Fig 4. Estimated accuracy for each part with OpenPose.

Workflow for anomaly detection and correction

Fig 5. Proposed workflow.

Normalization step

Anomaly detection step

Correction of anomalous error step

Other adjustments

Fig 6. Skeletal length of legs.

Reproducibility of workflow

Simulation model

Simulation results

Table 3. Sensitivity of detection of each type of anomaly by workflow.

Table 4. Reproducibility of values by workflow.

Implementation

Discussion

Materials and methods

Dataset

Workflow details

Normalization

Anatomical constraints

Biomechanical feature

Physical constraints

Selecting subjects for imputation

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Feilim Mac Gabhann

Thurmon Lockhart

Roles

Author response to Decision Letter 0

Decision Letter 1

Feilim Mac Gabhann

Thurmon Lockhart

Roles

Acceptance letter

Feilim Mac Gabhann

Thurmon Lockhart

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases