Abstract
The accurate measurement and analysis of human movement are essential in fields ranging from rehabilitation and neuroscience to sports science and ergonomics. Traditional methods, though precise, are often constrained by cost, accessibility, and controlled environments. The advent of machine learning (ML) pose estimation models (PEMs) offers an alternative solution, enabling detailed motion analysis using low-cost imaging systems in various settings. The aim of this review is to evaluate ML PEMs and their impact on human movement sciences, focusing on recent advancements in machine learning and computer vision for accurate, non-invasive motion analysis using low-cost imaging systems. A narrative review was conducted by searching electronic databases, including PubMed and Google Scholar, using key terms such as "machine learning," "pose estimation models," and "human movement sciences." Thematic analysis identified key advancements, applications, and challenges in ML PEMs across clinical, sports, and ergonomic contexts. The review highlights the development, capabilities, and applications of models such as OpenPose, PoseNet, AlphaPose, DeepLabCut, HRNet, MediaPipe Pose, BlazePose, EfficientPose, and MoveNet, emphasizing their potential for non-invasive, cost-effective assessments. In clinical settings, these models enable objective gait and posture analysis, aiding in diagnosing musculoskeletal disorders and tracking rehabilitation progress. In sports, ML PEMs enhance performance analysis and injury prevention by providing real-time feedback and detailed biomechanical data. In ergonomics, they offer proactive solutions for workplace injury prevention through real-time posture and movement analysis. While promising, the implementation of ML PEMs faces challenges in accuracy, data quality, and integration into existing practices. Establishing standardized protocols and frameworks is crucial for ensuring reliable, interdisciplinary applications. This review can be useful for coaches, healthcare professionals, and researchers in evaluating and implementing ML PEMs, ultimately advancing the field of human movement sciences.
Keywords: Posture, Movement, Sports, Kinematics, Injury prevention, Ergonomics, Physiotherapy, Artificial intelligence
1. Introduction
The accurate measurement and analysis of human movement underlines advancements across diverse scientific and medical disciplines. From motor rehabilitation and neuroscience to sports science and engineering, researchers face the challenge of precise motion quantification, even within laboratory settings [1]. Traditionally, this requires specialized equipment and controlled environments, limiting research scope and real-world applicability [2]. The advent of pose estimation algorithms offers a transformative solution, potentially liberating movement scientists from these constraints [3]. These algorithms enable in-depth motion analysis using low-cost imaging systems, facilitating the non-invasive data collection in different environments like clinics, homes, or outdoor settings. This approach has the power to significantly impact research by enabling larger, more diverse datasets, reducing biases, and ultimately providing a more accurate representation of the complex characteristics of human movement.
The existing methods to analyze human motion are undergoing a deep transformation driven by advancements in machine learning (ML) and computer vision methods. Specifically, the development of sophisticated pose estimation models (PEMs) changed the approaches commonly used for the anatomical landmarks extraction from images and videos. To date, Kinovea has been one of the most used software for the motion analysis from a RGB video which showed a good validity and reliability over time [4]. However, the introduction of ML models, reached many different fields ranging from healthcare and sports science to workplace ergonomics and entertainment [5,6]. The basic approach behind these models is the application of Convolutional Neural Networks (CNNs), a powerful image analysis tools that excel at tasks like image recognition [7]. CNNs use convolutional layers to break images into feature maps, detecting patterns such as edges and textures. Pooling layers then reduce dimensionality, emphasizing the most important features. Fully connected layers at the end use these extracted features to classify the image [8]. To keep a high accuracy of detection, CNNs are trained using a process called backpropagation. During training, the network continuously adjusts its internal parameters based on the errors observed in the predictions. This iterative adjustment helps the network improve its ability to accurately examine and classify images [9]. Using this method, PEMs can automatically locate key anatomical landmarks on the body during the prediction phase (Fig. 1). Once obtained, they can be used for calculating joint angles, and quantify movement trajectories entirely through standard video recordings.
Fig. 1.
Illustration of the skeletal landmarks detection conducted by the different ML pose estimation. a) MediaPipe detects a structure of 33 landmarks on the human body. b) BlazePose detects 33 landmarks on the human body. c) MoveNet detects 17 landmarks, focusing on the main body joints. d) OpenPose detects 18 landmarks for full-body pose estimation. e) HRNet detects 17 landmarks, emphasizing high-resolution representations for more accurate keypoint localization. f) AlphaPose detects 17 landmarks, prioritizing precision in pose estimation.
Traditionally, the study of human motion and posture often relied on manual assessments [10], specialized motion capture laboratories, or the use of wearable sensors [11]. These methods can be accurate but may face constraints relating to cost, accessibility, intrusiveness, and limitations in scalability [2]. ML PEMs, offer distinct advantages and tradeoffs compared to other motion analysis methods. Unlike marker-based systems like Vicon optoelectronic systems, they eliminate the need for physical markers and specialized lab settings, significantly enhancing accessibility and usability [2]. Compared to RGB-D sensors like Microsoft Kinect V2 [12], PEMs using standard cameras are less restricted by the environment. While the comparison with smartphones and Inertial Measurement Units (IMUs) may appear unnecessary, PEMs operating on video data potentially provide richer information, including full skeletal tracking. In the clinical field, ML PEMs hold potential for objective gait analysis [13] to diagnose musculoskeletal disorders [5] and track rehabilitation progress [14]. Their ability to facilitate non-invasive posture assessments empowers interventions for reducing pain and improving functional movement patterns. Sports scientists are using this technology for the analysis of athletic gestures, identifying subtle deviations for injury prevention and so enhancing the performance [15]. Additionally, ML PEMs are being applied in ergonomics, allowing the identification of dangerous movement patterns, guiding the design of preventive approaches for workplace-related musculoskeletal disorders [16]. While the main key features of these human PEMs, e.g., 2D vs 3D or single-view vs. multi-view, have been thoroughly discussed recently [17], to date, a comprehensive analysis of the well-known models and the various applications in the human movement sciences field has been not discussed yet.
This review will focus on comparing ML PEMs to traditional motion analysis methods, such as marker-based systems, RGB-D sensors, and wearable devices, highlighting their advantages and limitations. Additionally, it will explore the applications of ML PEMs in real-world settings, including clinics, homes, and outdoor environments, where they have the potential to enhance the analysis of gait, posture, athletic performance, and workplace ergonomics. The goal is to provide valuable insights for coaches, healthcare professionals, and researchers in evaluating and implementing ML PEMs to advance human movement sciences.
The scope of this narrative review is to offer a comprehensive evaluation of ML PEMs and their impact on human movement sciences. Specifically, it will address recent advancements in ML and computer vision techniques that enable accurate, non-invasive motion analysis using low-cost imaging systems.
1.1. The most frequently used key pose estimation models
Due to the speed with which the ML field is evolving, many models for pose estimation are being presented and discussed in the scientific scenario but some of these, are private or not accessible to all. Here we discuss those models that are highly prevalent in the ML scenario applied to motion analysis to highlight specific features and guide researchers toward the optimal model choice for their needs, as summarized in Table 1.
Table 1.
general overview of the characteristics of each model.
| Model | Year | Developer | Landmarks | Coordinates | 2D/3D | Single/Multi | Pre-trained | Mobile |
|---|---|---|---|---|---|---|---|---|
| OpenPose [18] | 2017 | Carnegie Mellon Univ. | 135 | x, y | 2D | Multi | COCO, MPII | Limited |
| PoseNet [19] | 2017 | Google AI | 17 | x, y | 2D | Single | COCO, internal | Yes |
| AlphaPose [20] | 2017 | SenseTime | 17 | x, y | 2D | Multi | COCO, MPII | Limited |
| DeepLabCut [21] | 2018 | Mackenzie Mathis Lab | User-defined | x, y | 2D | Both | Internal | Limited |
| HRNet [22] | 2019 | Microsoft | 17 | x, y | 2D | Both | COCO, MPII, others | Limited |
| MediaPipe Pose [23] | 2019 | Google AI | 33 | x, y, z | 3D | Both | Internal | Yes |
| BlazePose [24] | 2020 | Google AI | 33+ | x, y, z | 3D | Single | Internal | Yes |
| EfficientPose [25] | 2020 | Megvii | 17 | x, y | 2D | Both | COCO, MPII | Yes |
| MoveNet [26] | 2021 | Google AI | 17 | x, y | 2D | Both | COCO, internal | Yes |
OpenPose, developed with C++ language by Carnegie Mellon University, had its initial major release around 2017, revolutionizing real-time human pose estimation [18], Fig. 1d. This computer vision library enables the detection of body keypoints in images and videos, even in scenarios involving multiple people [27]. The applications of OpenPose are extensive, including movement analysis in sports, fitness, and gesture recognition for human-computer interaction [28]. PoseNet, released by Google AI around 2017, excels in real-time human pose estimation in web environments [19] Fig. 1c. It has been designed to run directly within web browsers to estimate the poses of individuals in images and videos. Its primary applications include interactive experiences, fitness tracking, motion-based games, and telemedicine [29]. AlphaPose, released in 2017 by the SenseTime team, aimed to enhance the field of multi-person pose estimation like OpenPose, Fig. 1f. It introduced a novel Region Proposal Network based approach to first locate individuals within an image and subsequently refine their pose estimations [20]. AlphaPose finds its primary applications in scenarios requiring accurate tracking of multiple people, such as analyzing group dynamics in video footage, sports analytics, and surveillance systems [30]. DeepLabCut was introduced in 2018 by the Mackenzie Mathis Lab, initially focusing on markerless animal pose estimation. Although its primary application centered on analyzing animal behavior and movement using standard cameras and videos [21], DeepLabCut has proven highly adaptable for human motion analysis. Eliminating the need for physical markers, DeepLabCut is used extensively in sports science to evaluate athletic performance, such as running kinematics [31] and open environments sport analyses [32]. It is also employed in clinical settings to assess and monitor movement disorders of neurologic patients, aiding in the development of rehabilitation protocols [33]. HRNet developed by Microsoft in 2019, introduced a novel architecture for human pose estimation that maintains high-resolution representations throughout the estimation process [22], Fig. 1e. The name stems from the full name “High-Resolution Network” as its strength lies in achieving highly accurate human pose estimation, especially in challenging scenarios. Today, HRNet is a key component in systems demanding superior precision, such as applications involving detailed motion analysis or augmented reality experiences where accuracy is mandatory [34].
MediaPipe Pose was made available around 2019 as part of the larger MediaPipe framework of Google AI, a collection of tools for building ML pipelines [23], Fig. 1a. This pose estimation solution focuses on real-time, high-fidelity body tracking. It is a framework designed to enable data inference from sensory inputs such as video streams or photos, making it perfect for quickly prototyping perception pipelines. Due to its versatility, MediaPipe Pose is accessible for use within web environments, mobile applications, and across various platforms. Shortly thereafter, Google AI announced BlazePose in early 2020 as a fast and accurate pose estimation solution specifically tailored for mobile and embedded devices [24], Fig. 1b. Its main applications involve real-time body tracking for fitness tracking and gesture control interfaces. BlazePose is primary designed to deliver reliable results even on lower-power devices, making it widely adopted in mobile applications where speed and efficiency are crucial [35]. EfficientPose, published by Megvii in 2020, aimed to balance accuracy and computational efficiency in human pose estimation [25]. Its primary applications involve scenarios demanding both precision and speed, such as real-time pose tracking or use on less powerful devices. Due to its scalable architecture, it allows for flexible trade-offs between performance and accuracy, resulting in a valid foundation for the next-generation markerless movement analysis [25]. MoveNet, representing one of the latest models introduced by Google AI in 2021, pushed the boundaries of real-time pose estimation with its exceptional speed and accuracy [26]. This model was released in two variations to address specific needs: the 'Lightning' version for latency-critical applications and the 'Thunder' version for applications requiring high accuracy. Due to its versatility, MoveNet is currently being employed in interactive fitness experiences and gesture recognition systems [36].
The development of PEMs is highly collaborative, with innovations building on each other. For instance, OpenPose has influenced subsequent models like tf-pose-estimation in TensorFlow. Despite its popularity, it has limitations in precision and high computational cost, making it less suitable for applications requiring high accuracy, such as elite sports and medical assessments. Different models serve specific needs: BlazePose is optimized for mobile use in resource-limited settings, while HRNet offers high accuracy and is influenced by PoseNet. MoveNet builds on PoseNet and BlazePose, aiming for lightweight, real-time performance. This collaborative development is currently pushing the progress in the field of human pose estimation through ML.
2. Methods
This narrative review synthesizes relevant literature on ML PEMs and their applications in human movement sciences. The literature search was conducted using electronic databases, including PubMed and Google Scholar. Key terms used in the search included "machine learning," "pose estimation models," "motion analysis," "human movement sciences," "sports performance," "gait analysis," and "ergonomics."
Articles were selected based on their relevance to advancements in ML and computer vision techniques, as well as the development of ML PEMs for applications in clinical, sports, and ergonomic contexts. The inclusion criteria focused on peer-reviewed studies, conference papers, and reviews published between 2010 and 2024. Additionally, we reviewed the references of key articles to retrieve valuable publications from earlier years. Given the narrative nature of this review, a formal quality assessment of the studies was not conducted. Instead, emphasis was placed on identifying major trends, key advancements, and the comparative benefits and limitations of ML PEMs across various fields. The selected literature was organized thematically to highlight the significant advancements, applications, and challenges in implementing ML PEMs in human movement sciences.
3. Results
We gathered and synthesized the key information from the studies that utilized PEMs, including the application domains, model types, and main findings, and summarized them in Table 2. Each application domain was then discussed individually, providing a deeper analysis of how different models have been applied, their effectiveness, and the insights gained from their use in various contexts.
Table 2.
Main findings of the studies included in the narrative review for each domain of human movement analysis.
| Field | Author (year) | ML PEM used | Key findings |
|---|---|---|---|
| Posture analysis | Samkari E. et al. (2023) [37] | – | Convolutional and Recurrent Neural Networks are the most used for Human Pose Estimation, but challenges like occlusion and crowded scenes hinder performance. |
| Moreira R. et al. (2022) [38] | PoseNet | The NLMeasurer proved to be a valid tool for assessing postural measurements from a frontal view. Surface markers on anatomical landmarks can enhance accuracy and reliability. | |
| Roggio F. et al. (2024) [39] | MediaPipe | Significant sex differences were detected in shoulder and hip adduction angles, with no variations in horizontal inclinations. | |
| Gait analysis | Hii C.T.S. et al. (2023) [13] | MediaPipe | MediaPipe Pose shows good to excellent agreement with the Vicon system, with moderate agreement in a few parameters. It offers reliable results for monitoring gait changes and evaluating interventions. |
| Khera P. et al. (2020) [40] | – | This review found that support vector machines were the most effective classifiers, while reinforcement learning, and neural networks excelled in controlling rehabilitation devices and capturing participant variability. | |
| Toshev A. et al. (2014) [41] | Own Algorithm | The human pose estimation method using deep neural networks (DNNs) achieved high accuracy results with a cascade of DNN regressors. | |
| Stenum J. et al. (2021) [42] | OpenPose | This study compared OpenPose with 3D motion capture for gait analysis, showing small errors in temporal parameters and step lengths. The workflow is accessible, accurate, and suitable for broader gait analysis applications. | |
| Kim W. et al. (2021) [43] | Own Algorithm | The proposed method successfully detected gait abnormalities using Kinect data, with k-NN and SVM classifiers showing potential for accurate classification without the need for complex setups or wearable sensors. | |
| Li Y. et al. (2020) [44] | Own Algorithm | A smartphone-based lower body rehabilitation system was developed, achieving a 70.8 AP score on COCO val2017 with only 4.7M parameters. The system allows patients to perform exercises at home, with encrypted data processed locally and reports shared with doctors. | |
| Lin C.B. et al. (2020) [45] | OpenPose | The fall detection system using OpenPose is based on tracking human joint movements, reducing the impact of environmental factors and improving training time. | |
| Moro M. et al. (2022) [46] | Pose ResNet | The study compared a new RGB video-based markerless system with a marker-based motion capture system for 3D gait analysis. Similar spatio-temporal parameters were observed, with slight underestimation in ankle and knee flexion. | |
| Menychtas D. et al. (2023) [47] | OpenPose and MediaPipe | The study proposes an artificial intelligence-powered annotation tool that allows users to correct errors, especially those of small joint movements, by combining accuracy with the efficiency of automatic systems. | |
| Sport analysis | Afrouzian R. et al. (2016) [48] | Own Algorithm | The proposed algorithm accurately estimates 3D human poses from multiple uncalibrated cameras using previous frame viewpoints, achieving high accuracy on a soccer player dataset. |
| Badiola-Bengoa A. et al. (2021) [49] | – | The estimation of human poses in sport is based on generic systems such as OpenPose but requires more public data to improve performance in all contexts. | |
| Pagnon D. et al. (2022) [50] | OpenPose | Pose2Sim demonstrated high accuracy for lower-limb kinematics, with errors averaging 3.0°–4.1° across walking, running, and cycling tasks, and a coefficient of multiple correlation above 0.9 in most cases. | |
| Citraro L. et al. (2020) [51] | OpenPose | The proposed framework accurately estimates camera parameters in real time from sports field images using deep learning, outperforming current methods on soccer, basketball, and volleyball datasets. | |
| Ota M. et al. (2020) [52] | OpenPose | OpenPose showed high reliability and validity for joint angle estimation, though biases were present in certain joints. It offers a low-cost, user-friendly alternative, but requires bias correction for clinical use. | |
| Luvizon D.C. et al. (2022) [53] | ResNet-U | The proposed 3D human pose estimation method in camera coordinates reduced prediction error by 32 % and achieved 80 mm error for monocular and 51 mm for multi-view estimations on key datasets. | |
| Injury Prevention | Zhou G. et al. (2022) [54] | OpenPose | The proposed method accurately classified lifting risk and predicted perceived exertion using body movements, posture, and facial expressions. |
| Van Eetvelde H. et al. (2021) [55] | – | Machine learning methods show potential in predicting sports injuries, with performance ranging from poor to strong, but improvements in model interpretation and methodological quality are needed. | |
| Tack C. et al. (2019) [56] | – | Machine learning can enhance musculoskeletal medicine by automating diagnostics, data analysis, and clinical decision support, often matching or surpassing human accuracy. | |
| Remote Physical Activity Tracking | Raza A. et al. (2023) [57] | MoveNet | The proposed LogRF method for feature selection improved human pose estimation. This enhances exercise correction, injury prevention, and remote physiotherapy monitoring. |
| Yang L. et al. (2021) [58] | OpenPose | The software uses OpenPose to analyze exercise posture and provides corrective feedback for squats and push-ups, improving form and preventing injury. | |
| Cai Z. et al. (2022) [59] | MoveNet | The study proposes PoseBuddy, a mobile app that uses computer vision and the MoveNet model to provide personalized workouts with real-time audio feedback on exercise form. | |
| Rosique F. et al. (2021) [60] | OpenPose | ExerCam, a low-cost augmented reality telerehabilitation tool, uses a webcam for pose detection, offering remote patient management, ROM calculation, and gamified therapy, with positive results. | |
| Ergonomics | Mehrizi R. et al. (2018) [61] | Own Algorithm | The proposed predicts 3D pose during lifting tasks, achieving an average error of 14.72 mm, comparable to marker-based methods, and outperforms state-of-the-art techniques on the HumanEva-I dataset. |
| Chen C. et al. (2020) [62] | Own Algorithm | The study achieved 92.8 % accuracy in recognizing assembly actions using YOLOv3 and 82.1 % accuracy in estimating repetitive assembly operating times through joint coordinate extraction. | |
| Kim W. et al. (2021) [63] | OpenPose | OpenPose outperformed Kinect in measuring joint angles and conducting ergonomic assessments, especially in non-ideal conditions with body occlusions or non-frontal views, showing potential for real workspace applications. | |
| Ying W. et al. (2021) [64] | OpenPose | The proposed method using 2D video and machine learning classifiers effectively automates scaffolding workface assessment, proving to be a feasible and practical solution for activity classification. |
3.1. Machine learning pose estimation in posture analysis
Posture can be regarded as the calling card of each human being drawing elements from the human psyche, daily life, work and sports. While it plays a crucial role in our overall health and well-being, poor posture has been linked to numerous musculoskeletal issues, including back pain, headaches, and increased risk of injuries [65]. Traditionally, posture analysis has relied on subjective visual assessments by clinicians or the use of specialized tool like goniometers or plumb lines. Postural assessment is an important part of medical examinations, enabling health professionals to build tailored therapeutic programs [66]. While physical examination is a cost-free method that does not require the use of expensive equipment, it may be biased by the ability and accuracy of the clinician [67]. Further, exposing the patient to x-rays imaging when not required, increases the negative impact of radiation exposure [68].
To overcome these limitations, recent advancements in ML PEMs are providing new approaches for the posture analysis thanks to the automatic detection of anatomical landmarks from photos or videos [37]. By identifying key points such as the shoulders, hips, knees, and head, these models can calculate angles, distances, vertical and horizontal alignments that quantify various aspects of posture alterations. This technology eliminates the need for manual measurements and subjective interpretations, enabling a more standardized and data-driven approach to posture analysis [38]. By integrating ML PEMs, posture assessment becomes non-invasive and low-cost, significantly increasing its feasibility in various settings, such as schools and workplaces. To date, research into the application of ML models for posture analysis remains limited [38]. In a recent study, we applied a human PEM to assess biomechanical posture in a cohort of 200 healthy adults, aiming to enhance musculoskeletal disorder prevention through objective analysis [39]. Utilizing photographs from multiple angles, the study identified significant gender-specific differences in shoulder and hip adduction angles, with a Intraclass Correlation Coefficient peaking at 0.95. We also employed ML analysis methods such as principal component and cluster analyses, uncovering new patterns in postural analysis, like differences in shoulder–hip distance. Since this approach is providing valuable results compared with gold standard methods [69] but at the same time it may seem simplistic, it has the potential to generate large-scale datasets of human posture. Such datasets could facilitate the identification of subtle postural deviations, serving as early indicators of developing musculoskeletal disorders. This would enable the implementation of timely preventive interventions [5]. While the technology itself is accessible, its successful integration into clinical practice requires a collaborative approach between healthcare professionals, engineers, and computer scientists.
3.2. Machine learning pose estimation in gait analysis
Traditionally, gait analysis often relied on specialized motion capture laboratories equipped with expensive marker-based systems and trained clinicians [2]. However, advancements in ML have guided a new era of landmark detection models, revolutionizing the accessibility and potential applications in gait analysis [40]. ML PEMs are typically trained on large datasets and, through deep learning techniques, they learn to identify complex patterns and relationships within images, enabling the prediction of landmarks locations on previously unseen images or videos [41]. This ability eliminates the need for physical markers and time-consuming manual labeling which may bring into collection bias. The advantages of ML pose landmark detection models in gait analysis are various. They facilitate cost-effective and portable analysis setups, making them ideal for use outside traditional laboratory environments. The flexibility of ML models results in lightness in the field of analysis both from the point of view of organization, preparation, reproducibility and versatility [40]. These models are able to collect basic spatiotemporal parameters, i.e., stride length, cadence, gait speed, to complex kinematic patterns, i.e., joint angles, allowing for faster and reliable biomechanics analyses [42]. These data can be used for diagnosing gait abnormalities [43], quantifying the effectiveness of rehabilitation interventions [44], and potentially predicting fall risk through the identification of subtle movement deviations [45]. Researchers investigated the reliability of MediaPipe by comparing it with an optoelectronic system [13] and demonstrated good to excellent agreement across all spatiotemporal parameters, with an intraclass correlation coefficient >0.75. They further justified their use of MediaPipe, highlighting its top-down approach that allows for more precise landmark detection compared to other methods.
While ML PEMs show promising results, it is important to be aware of the limitations behind these methods. Currently, the accuracy of landmarks detection is influenced by factors such as training of the ML model, source quality, or video occlusions [70]. Although this approach is showing promising results as the ML results are comparable to marker-based methods [46], understanding the limitations and potential sources of error is crucial for responsible implementation [47].
3.3. Machine learning pose estimation in sports motion analysis
In the Sports field, the ability to precisely analyze movement patterns is crucial for enhancing the performance, optimizing the technique, and minimizing injury risk [71,72]. Traditional motion analysis methods in sports often fall into two categories: observation by coaches, with subsequent subjective assessment, and biomechanics analyses performed in specialized laboratories with highly sophisticated tools. ML PEMs, with their ability to automatically track anatomical landmarks from videos, are enabling two types of analysis: in-field analysis during training sessions and post-match analysis [48]. Unlike traditional motion capture labs, ML PEMs eliminate the need for specialized settings. Analysis takes place directly on the field, court, or gyms, providing insights into real-world sports performance [49]. They also offer a more affordable and accessible alternative to the expensive equipment and controlled environments of motion capture [50]. While motion capture labs often excel at precise 3D movement reconstruction, ML PEMs can prioritize the extraction of specific, sport-relevant kinematic data (joint angles, velocities, etc.), providing targeted analysis tailored to the needs of coaches and athletes. A research compared different ML models for the match analysis in basketball, volleyball ad soccer matches [51]. The results highlighted a valid accuracy in localizing and identifying 2D player movements as well as field lines, offering valuable insights for sports analysis. Further, another study employed OpenPose for the evaluation of the squat exercise, a common but potentially injury-prone movement [73]. Researchers compared OpenPose to a marker-based gold standard method and found high intraclass correlation coefficients (0.92–0.96), indicating good validity for this specific application [52]. However, more studies are needed to fully confirm the validity of these methods across a wider range of workout exercises.
Despite the undeniable advantages of ML PEMs in sports analysis, accuracy remains a challenge. Unlike traditional marker-based motion capture systems, ML PEMs can be affected by factors such as camera angles and lighting conditions. These variations can influence the model's interpretation of video data, introducing potential errors. Recent research [53], has proposed a consensus-based optimization algorithm to address this issue. This algorithm enables multi-view predictions from uncalibrated images using a single monocular training process. While this represents a sophisticated approach, coaches and sport scientists must still be aware of the potential for inaccuracies when using RGB cameras for sports analysis with ML PEMs.
3.4. Machine learning pose estimation for injury prevention
Injuries, particularly those resulting from musculoskeletal overuse or improper movement patterns, represent a significant challenge in sports [74]. They have both short- and long-term consequences. In the short term, injuries can have strong psychological repercussions as it impairs return to play, increases the risk of re-injury and even leads to the development of mental disorders [75]. Further, untreated injuries can lead to incomplete recovery, abandonment of sports activity, and, in the long term, can cause degeneration of the joint such as osteoarthritis [76,77]. Preventing injuries is crucial for athlete well-being, performance optimization, healthcare cost reduction, and preventing long-term health issues. ML PEMs applied in sports analysis offers a powerful new tool in injury prevention, enabling a shift from reactive treatment to proactive risk assessment. Traditional injury prevention often focuses on analyzing risk factors after an injury occurs. In contrast, ML PEMs can identify subtle deviations, asymmetries, or compensatory mechanisms in movement patterns, indicating the predisposition of an athlete to injury [78]. For example, a recent study validated ML as a prevention tool by combining different techniques [54]. The authors used OpenPose to analyze the movement biomechanics, SelFlow to anticipate the movements and OpenFace to understand the facial expression. This mixed approach helped them discover specific behavioral patterns that predict changes in load lifting and perceived exertion during the task.
While common methods rely on specialized laboratories, using deep learning methods to process vast amounts of movement data can enhance our understanding of sports movements and reveal common patterns linked to injury risk [55]. This fast and non-invasive method would ideally involve collecting the baseline data of the athlete for successive comparisons. Coaches could use this informed analysis to tailor training programs, implement corrective exercises, modify techniques to reduce risk, and even guide physical therapists in providing optimal support for the athlete [56].
However, while this optimistic framework would be appealing to many sports organizations, it is essential to consider the current limitations. The predictive power of models depends on the quality and relevance of training datasets [79]. These methods are still under development in the sports field, where gold-standard techniques are often preferred. This preference is understandable since high-level athletes require the highest accuracy. However, a major limitation of ML models is their need for large volumes of data; without this, the framework described above cannot be fully realized. Additionally, coaches need proper guidance in interpreting the information provided by ML PEMs to fully unlock their potential.
3.5. Machine learning pose estimation in remote physical activity tracking
The COVID-19 pandemic accelerated a significant shift in the perception of remote tracking. While the concept of remote tracking is not new, and the use of wearables for monitoring physical activity has been widely explored [77], these trackers have a notable limitation: they primarily collect data such as heart rate or burned calories during a session rather than providing real-time feedback. Alongside the evolution of fitness tracking, the telehealth field from the physical therapy [80] to cardiac rehabilitation [81], also experienced a digital transformation spurred by the rise of digital approaches. Wearables undoubtedly collect valuable physiological data, but they lack the ability to assess biomechanical parameters. While useful for measuring spatiotemporal aspects of movement, their applicability in analyzing the movement itself is limited [82].
ML PEMs for posture, gait, and movement analysis offer a promising approach for analyzing biomechanics during remote fitness or rehabilitation sessions delivered through online platforms. The key application of these models lies in their ability to provide objective, automated analysis of exercise form and movement, eliminating the need for in-person supervision or specialized sensors.
During live video sessions, trainers can support the customer with real-time feedback on exercise form, repetition counts, range of motion, and other essential aspects of exercise execution [57]. This mitigates the risk of incorrect form and potential re-injury. OpenPose has been successfully employed for this purpose, and researchers have demonstrated that the system can detect human body posture, obtain key point coordinates, analyze exercise posture, and provide relevant suggestions based on correct exercise form definitions [58]. Similarly, a smartphone app has been developed using MoveNet to support customers during their workouts [59]. Utilizing the camera of the phone for real-time video, the app feeds data into a high-accuracy PEM. This model then provides real-time audio feedback during exercise, supported by an additional ML algorithm.
Furthermore, PEMs can facilitate in-depth biomechanical analysis by extracting quantitative data like joint angles, range of motion, and movement velocity. This empowers trainers and therapists with a precise, data-driven understanding of the functional limitations, movement patterns, and potential asymmetries of their clients [60]. For example, a PoseNet-based system has been developed for in-home rehabilitation. This system included statistical analysis tools, empowering clinicians to analyze the recovery status of the patients. Researchers observed that it accurately measured elbow and knee joint angles during movements, calculating range of motion with a simple laptop webcam [29]. Additionally, clinicians could compare movement execution across multiple days to evaluate quality and determine recovery rate.
This approach has the potential to expand access to expert guidance, bridging geographical gaps and making professional trainers and therapists accessible to individuals who might not otherwise have access to a gym or in-person support. This makes it easier to track progress objectively and empowers trainers and therapists to provide more informed suggestions during training sessions.
3.6. Machine learning pose estimation in ergonomics
Work-related musculoskeletal disorders (MSDs) pose a major public health challenge, causing pain, disability, lost productivity, and significant economic burdens [83,84]. Ergonomics is crucial in preventing MSDs, traditionally utilizing observational tools like the Rapid Upper Limb Assessment (RULA) survey [85]. Tools like inertial measurement units and motion capture systems provide high accuracy but come with limitations such as cost, technical expertise requirements, and potential movement interference [86,87]. Although accurate, they may not fully meet the needs of ergonomists, who are called upon to assess the worker in their real working context, where movement is natural, tasks vary, and long-term impact must be considered.
In response to this need, ML PEMs emerge as a useful tool for ergonomic assessments, empowering proactive interventions to safeguard worker health and well-being. Compared to traditional ergonomic assessments, ML PEMs offer a more scalable and objective approach by automatically analyzing postures and movements from readily available video recordings of workers performing their tasks [61]. These models can identify awkward postures, sustained exertions, repetitive motions, and other biomechanical risk factors associated with MSDs, such as low back pain or carpal tunnel syndrome [62,88]. A key advantage of ML pose estimation for ergonomics is its potential for real-time or near real-time analysis.
A recent study employed OpenPose for the postural assessment across various common work tasks, i.e., trunk flexion and rotation, sitting, holding and lifting a box, leg crossed and desk tasks [63]. The authors validated the ML model against a marker-based gold standard and integrated the RULA tool for injury risk calculation. The results demonstrated that the proposed ML method showed good performance at all conditions with also a good agreement of RULA action level, suggesting that OpenPose could be a promising technology to measure joint angles for ergonomic postural assessments in the real workspace. Another study successfully employed OpenPose to classify scaffolding activities, further highlighting the versatility of this model [64]. These findings collectively suggest that OpenPose offers a robust and practical solution for ergonomic assessment within dynamic, real-world workspaces.
This approach has the potential to impact workplace safety by identifying common risk patterns, enabling targeted interventions and broader workplace design improvements. ML PEMs simplifies data collection, facilitating longitudinal tracking of worker postures and movements. This rich dataset could train ML models for preventing specific work-related injuries. By monitoring the effectiveness of ergonomic interventions over time, this approach can promote a safer, more productive workforce while potentially reducing healthcare costs. However, it is essential to acknowledge the importance of domain expertise and responsible implementation. Ergonomists and occupational health specialists play a crucial role in interpreting the data generated by PEMs, designing interventions, and ensuring that they truly address the root causes of MSD risk.
4. Discussion
The application of ML PEMs in human movement sciences brings both opportunities and challenges, particularly concerning model reliability and the selection of the most appropriate model for specific needs. Choosing the right model involves balancing accuracy, generalizability, and computational demands, making reliability a key consideration for researchers and practitioners. It is essential to evaluate the performance of these models using standardized datasets and metrics to ensure consistent results across diverse applications. Additionally, understanding the specific requirements of the application domain, such as real-time analysis or precision in detecting fine movements, can help guide the selection of the most suitable model.
4.1. The question of reliability, how to choose the best model
Selecting the optimal model means balancing the unique requirements of the application against the trade-offs inherent in each model design. Consider desired accuracy, speed, computational limitations, portability needs, and whether 2D or 3D analysis is essential. A fundamental issue exists between the precision of a model in locating keypoints and its computational speed. Models excelling in accuracy often demand significant processing power, hindering real-time scenarios. Conversely, speed-optimized models may compromise some accuracy [89]. Analyzing complex athletic movements to refine technique may necessitate prioritizing a slower, highly accurate model, while building a responsive mobile fitness app might lean towards a less precise but real-time capable option (Table 3, Fig. 2). As many specific features need to be addressed when discussing the best-performing ML model, a recent article thoroughly discussed the main aspects of many of the renowned models [17]. A complex model might be ideal in theory but is useless if it works poorly on a phone. Conversely, a lighter model could be ineffective if used on a powerful system. Understanding the target environment (mobile, web, or workstation) is essential. This awareness allows tailoring the model choice and investigating optimizations specific to those platforms [17].
Table 3.
Applicability of each model in the human movement sciences field.
| Model | Gait Analysis (Fig. 2a) | Posture Analysis (Fig. 2b) | Injury Prevention (Fig. 2c) | Motion Analysis (Fig. 2d) | Remote Fitness Tracking (Fig. 2e) | Ergonomics (Fig. 2f) |
|---|---|---|---|---|---|---|
| OpenPose [18] | High | High | Medium | High | Medium | High |
| PoseNet [19] | Medium | Medium | Low | Medium | High | Moderate |
| AlphaPose [20] | Medium | Medium | Medium | High | Low | Moderate |
| DeepLabCut [21] | Medium | Medium | Medium | Medium | Low | High |
| HRNet [22] | High | High | Medium | Medium | Low | High |
| MediaPipe Pose [23] | High | Medium | Medium | Medium | High | Moderate |
| BlazePose [24] | Medium | Medium | Medium | High | High | Moderate |
| EfficientPose [25] | Medium | Medium | Medium | Medium | Medium | Moderate |
| MoveNet [26] | Medium | Medium | Medium | Medium | High | Moderate |
Fig. 2.
Main application fields of the considered machine learning pose estimation models in the fields of gait analysis (a), posture analysis (b), injury prevention (c), motion analysis (d), remote fitness tracking (e), ergonomics (f).
Concerning the accuracy, several studies investigated the validity of these ML PEMs by comparing them with the marker-based gold standard, but the results were not promising [90,91], most likely to be related to the absence of correct synchronization between the systems. On the contrary, another study attempted to evaluate the accuracy of OpenPose, AlphaPose and DeepLabCut performing walking, running and counter-movement jumps tasks [92]. The performance of OpenPose and AlphaPose across the tasks was comparable with a systematic difference of ∼ 1–5 mm and random errors of ∼ 1–3 mm, while larger systematic and random errors were observed for DeepLabCut. However, the authors stated that the difference between the ML PEMs and the gold standard could have occurred due to a mislabeling of ground truth data in the training datasets, concluding that these methods do not yet match the performance of marker-based motion capture at all joint centers [92].
Also, the validity of the MediaPipe joint inference tracking technique has been rigorously investigated. Lafayette et al. [93] conducted a quantitative and statistical assessment of MediaPipe's angular estimation capabilities, comparing it the gold-standard Qualisys motion capture system. The authors demonstrated that MediaPipe offers excellent absolute error relative to clinical error and strong data correlation with Qualisys with mean Pearson's correlation coefficients of 0.80 ± 0.1 for lower limb movements and 0.91 ± 0.08 for upper limb movements. Additionally, its validity is supported by its 100 % accuracy in classifying exercise postures (25) and detecting human movements in non-standard videos with over 90 % accuracy (26). Similarly, BlazePose and MoveNet have been compared with gold standard methods. BlazePose provided in the 70.9 % of trials a mean absolute errors below 30 mm, and for measuring joint angles (elbows, hips, and knees) 43 % were below 5° in the push-up movement [94]. MoveNet obtained a root mean squared error of 3.24 ± 1.19° and mean absolute error of 2.66 ± 1.00° from tenfold cross validation when assessing the knee joint angle during waking tasks [36]. On the contrary, to date, not valid articles are present that clearly stated the accuracy of PoseNet, HRNet and EfficientPose to marker-based gold standard. Although it is not mandatory for developing a ML PEM, for the researchers working within the biomechanics field, being unsure about the real reliability of new approaches can generate little confidence and disinterest in implementing these approaches in current practice.
4.2. Future perspectives on the application of machine learning in human movement sciences
The future of ML in the human movement sciences field holds immense potential, particularly for biomechanics, sports, and healthcare. The integration of ML in various contexts is not merely a distant future. The rapid evolution of this sector exposes us to different solutions, whose limits and potential are important to consider so as not to introduce an approach into the sector that is distant from the real needs of those who work in this field every day.
Augmented Reality (AR) represents a field where computer vision and ML could enhance the understanding of human movements. Integrating ML with AR through smart lenses and wearable devices has the potential to revolutionize biomechanics and posture analysis. These AR devices can provide real-time feedback on posture and movement by overlaying digital information directly onto the field of vision of the user. AR can be considered in two ways: as an aiding tool for the coach or therapist, and as a guiding tool for the athlete or patient. For example, coaches could wear smart glasses during training sessions, allowing them to see real-time data on athletes’ joint angles, stride length, or posture corrections without needing to stop and review footage, thereby enhancing training in real-time [95]. This approach also shows great promise in the rehabilitation field. In clinical settings, AR combined with ML could support clinicians during patient rehabilitation by providing information on exercises or guiding patients with instant feedback and corrections. The ability to monitor progress and adjust exercises in real-time would personalize treatment plans and potentially speed up recovery times [96].
Sports decision-making represents a field where ML could revolutionize the decisions of sports teams, from player performance and injury prevention to game strategy. For performance analysis, ML can process vast amounts of data from games, training sessions, and wearable sensors. These models can identify patterns and correlations that might not be meaningful to human analysts. For example, ML algorithms can track player movements, assess their efficiency, and provide insights into optimal positioning and movement patterns. By processing real-time data on player performance and opponent strategies, data scientists can suggest optimal tactics and substitutions. For instance, in soccer, an ML model could analyze the formation of the opposing team and suggest counter-strategies that exploit their weaknesses. This real-time decision support can be a game-changer, providing teams with a competitive edge. However, ethical considerations must also be addressed. Beyond the use of private data and obtaining athlete consent, it is important to consider the boundaries of what is allowed to protect human interests. There is a risk of excessive use of these methods, driven by the inherent human desire to find the best advantage. Therefore, governments should establish valid rules and guidelines to ensure that ethics requirements will be met. To date, the European Commission is creating an AI Office to oversee general-purpose models, advised by independent experts, to develop ways to evaluate the capabilities of these models and monitor related risks [97]. With these measures in place, the integration of ML in sports can be both innovative and ethical, ensuring fair competition and the well-being of athletes.
5. Conclusions
The integration of ML PEMs into the analysis of human movement marked a significant advancement in fields such as clinical practice, sports science, and ergonomics. These models provide an accessible, cost-effective, and non-invasive approach to detailed motion analysis, avoiding the limitations of traditional methods that require specialized equipment and controlled environments. This comprehensive review highlighted the capabilities and applications of prominent ML PEMs, showing their potential to revolutionize gait analysis, posture assessment, sports motion analysis, and injury prevention. This approach fosters an objective and automated assessment of human movement, enhancing the accuracy of biomechanical analyses. In clinical settings, they can facilitate the diagnosis and monitoring of musculoskeletal disorders, offering valuable insights for rehabilitation. In sports, these models can provide real-time feedback and detailed performance data, aiding in technique optimization and injury prevention. Finally, in ergonomics, they can identify risky movement patterns, guiding interventions to prevent workplace-related musculoskeletal disorders. However, before their complete integration, ethical considerations, particularly regarding data privacy and their responsible use in sports and healthcare, must be addressed to ensure fair and beneficial applications.
CRediT authorship contribution statement
Federico Roggio: Writing – review & editing, Writing – original draft, Visualization, Methodology, Conceptualization. Bruno Trovato: Writing – original draft, Visualization, Investigation, Conceptualization. Martina Sortino: Writing – original draft, Visualization, Conceptualization. Giuseppe Musumeci: Writing – review & editing, Writing – original draft, Validation, Supervision, Resources, Project administration, Funding acquisition.
Data and code availability
No data was used for the research described in the article.
Fundings
This study was funded by Italian Ministry of University and Research (MUR) - (grant number PRIN 2022PZH8SX).
Declaration of competing interest
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests:Giuseppe Musumeci reports financial support was provided by This study was funded by Italian Ministry of University and Research (MUR) - (grant number PRIN 2022PZH8SX). Giuseppe Musumeci is the Section Editor of Sport and Exercise Medicine. Federico Roggio is an Associate Editor of Sport and Exercise Medicine. If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
The work was carried out as part of the PRIN 2022 project entitled “Predictive model of osteoarthritis through the use of inertial knee brace and thermography applied to movement analysis. Get back to move!” announcement D.D. n. 104, February 2, 2022, financed by the European Union – NextGenerationEU”.
References
- 1.Mündermann L., Corazza S., Andriacchi T.P. The evolution of methods for the capture of human movement leading to markerless motion capture for biomechanical applications. J. NeuroEng. Rehabil. 2006;3:1–11. doi: 10.1186/1743-0003-3-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Roggio F., et al. Technological advancements in the analysis of human motion and posture management through digital devices. World J. Orthoped. 2021;12(7):467–484. doi: 10.5312/wjo.v12.i7.467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Seethapathi N., et al. Movement science needs different pose tracking algorithms. arXiv preprint arXiv:1907.10226. 2019 [Google Scholar]
- 4.Puig-Diví A., et al. Validity and reliability of the Kinovea program in obtaining angles and distances using coordinates in 4 perspectives. PLoS One. 2019;14(6) doi: 10.1371/journal.pone.0216448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chan V.C.H., et al. The role of machine learning in the primary prevention of work-related musculoskeletal disorders: a scoping review. Appl. Ergon. 2022;98 doi: 10.1016/j.apergo.2021.103574. [DOI] [PubMed] [Google Scholar]
- 6.Rossi A., Pappalardo L., Cintia P. A narrative review for a machine learning application in sports: an example based on injury forecasting in soccer. Sports (Basel) 2021;10(1) doi: 10.3390/sports10010005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Guo T., et al. 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA) IEEE; 2017. Simple convolutional neural network on image classification. [Google Scholar]
- 8.Akhtar N., Ragavendran U. Interpretation of intelligence in CNN-pooling processes: a methodological survey. Neural Comput. Appl. 2020;32(3):879–898. [Google Scholar]
- 9.O'Shea K., Nash R. ArXiv e-prints; 2015. An Introduction to Convolutional Neural Networks. [Google Scholar]
- 10.Cohen L., et al. Non-radiographic methods of measuring global sagittal balance: a systematic review. Scoliosis Spinal Disord. 2017;12:30. doi: 10.1186/s13013-017-0135-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Camp C.L., et al. Are wearable sensors valid and reliable for studying the baseball pitching motion? An independent comparison with marker-based motion capture. Am. J. Sports Med. 2021;49(11):3094–3101. doi: 10.1177/03635465211029017. [DOI] [PubMed] [Google Scholar]
- 12.Kurillo G., et al. Evaluating the accuracy of the azure Kinect and Kinect v2. Sensors. 2022;22(7) doi: 10.3390/s22072469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hii C.S.T., et al. Automated gait analysis based on a marker-free pose estimation model. Sensors. 2023;23(14):6489. doi: 10.3390/s23146489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tack C. Artificial intelligence and machine learning | applications in musculoskeletal physiotherapy. Musculoskelet Sci Pract. 2019;39:164–169. doi: 10.1016/j.msksp.2018.11.012. [DOI] [PubMed] [Google Scholar]
- 15.Emery C.A. Considering cluster analysis in sport medicine and injury prevention research. Clin. J. Sport Med. 2007;17(3):211–214. doi: 10.1097/JSM.0b013e3180592a58. [DOI] [PubMed] [Google Scholar]
- 16.Lee Y.-C., Lee C.-H. SEE: a proactive strategy-centric and deep learning-based ergonomic risk assessment system for risky posture recognition. Adv. Eng. Inf. 2022;53 [Google Scholar]
- 17.Setti, F., et al., Markerless human pose estimation for biomedical applications: a survey. Front. Comput. Sci. 5: p. 1153160.
- 18.Cao Z., et al. Openpose: realtime multi-person 2d pose estimation using part affinity fields. IEEE Trans. Pattern Anal. Mach. Intell. 2021;43(1):172–186. doi: 10.1109/TPAMI.2019.2929257. [DOI] [PubMed] [Google Scholar]
- 19.Kendall A., Grimes M., Posenet R. Cipolla. Proceedings of the IEEE International Conference on Computer Vision. 2015. A convolutional network for real-time 6-dof camera relocalization. [Google Scholar]
- 20.Fang H.-S., et al. Alphapose: whole-body regional multi-person pose estimation and tracking in real-time. IEEE Trans. Pattern Anal. Mach. Intell. 2022 doi: 10.1109/TPAMI.2022.3222784. [DOI] [PubMed] [Google Scholar]
- 21.Mathis A., et al. DeepLabCut: markerless pose estimation of user-defined body parts with deep learning. Nat. Neurosci. 2018;21(9):1281–1289. doi: 10.1038/s41593-018-0209-y. [DOI] [PubMed] [Google Scholar]
- 22.Sun K., et al. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019. Deep high-resolution representation learning for human pose estimation. [Google Scholar]
- 23.Lugaresi C., et al. Mediapipe: a framework for building perception pipelines. arXiv preprint arXiv:1906.08172. 2019 [Google Scholar]
- 24.Bazarevsky V., et al. 2020. Blazepose: On-Device Real-Time Body Pose Tracking. arXiv preprint arXiv:2006.10204. [Google Scholar]
- 25.Groos D., Ramampiaro H., Ihlen E.A.F. EfficientPose: scalable single-person pose estimation. Appl. Intell. 2020;51(4):2518–2533. [Google Scholar]
- 26.MoveNet: Ultra fast and accurate pose detection model. [cited 2024 March]; Available from: https://www.tensorflow.org/hub/tutorials/movenet.
- 27.Osokin D. Real-time 2d multi-person pose estimation on cpu: lightweight openpose. arXiv preprint arXiv:1811.12004. 2018 [Google Scholar]
- 28.Cronin N.J., et al. Feasibility of OpenPose markerless motion analysis in a real athletics competition. Front Sports Act Living. 2023;5 doi: 10.3389/fspor.2023.1298003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Chua J., Ong L.-Y., Leow M.-C. Telehealth using PoseNet-based system for in-home rehabilitation. Future Internet. 2021;13(7):173. [Google Scholar]
- 30.Zhao X., et al. International Conference on Adaptive and Intelligent Systems. Springer; 2022. An alphapose-based pedestrian fall detection algorithm. [Google Scholar]
- 31.Van Hooren B., et al. The accuracy of markerless motion capture combined with computer vision techniques for measuring running kinematics. Scand. J. Med. Sci. Sports. 2023;33(6):966–978. doi: 10.1111/sms.14319. [DOI] [PubMed] [Google Scholar]
- 32.Drazan J.F., et al. Moving outside the lab: markerless motion capture accurately quantifies sagittal plane kinematics during the vertical jump. J. Biomech. 2021;125 doi: 10.1016/j.jbiomech.2021.110547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Milone D., et al. MocapMe: DeepLabCut-enhanced neural network for enhanced markerless stability in sit-to-stand motion capture. Sensors. 2024;24 doi: 10.3390/s24103022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Li R., et al. Human pose estimation based on efficient and lightweight high-resolution network (EL-HRNet) Sensors. 2024;24(2) doi: 10.3390/s24020396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Mroz S., et al. 2021 4th International Conference on Bio-Engineering for Smart Technologies (BioSMART) IEEE; 2021. Comparing the quality of human pose estimation with blazepose or openpose. [Google Scholar]
- 36.Bajpai R., Joshi D. Movenet: a deep neural network for joint profile prediction across variable walking speeds and slopes. IEEE Trans. Instrum. Meas. 2021;70:1–11. [Google Scholar]
- 37.Samkari E., et al. Human pose estimation using deep learning: a systematic literature review. Machine Learning and Knowledge Extraction. 2023;5(4):1612–1659. [Google Scholar]
- 38.Moreira R., et al. A computer vision-based mobile tool for assessing human posture: a validation study. Comput. Methods Progr. Biomed. 2022;214 doi: 10.1016/j.cmpb.2021.106565. [DOI] [PubMed] [Google Scholar]
- 39.Roggio F., et al. Biomechanical posture analysis in healthy adults with machine learning: applicability and reliability. Sensors. 2024;24(9):2929. doi: 10.3390/s24092929. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Khera P., Kumar N. Role of machine learning in gait analysis: a review. J. Med. Eng. Technol. 2020;44(8):441–467. doi: 10.1080/03091902.2020.1822940. [DOI] [PubMed] [Google Scholar]
- 41.Toshev A., Deeppose C. Szegedy. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014. Human pose estimation via deep neural networks. [Google Scholar]
- 42.Stenum J., Rossi C., Roemmich R.T. Two-dimensional video-based analysis of human gait using pose estimation. PLoS Comput. Biol. 2021;17(4) doi: 10.1371/journal.pcbi.1008935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Kim W., Kim Y. 2021 IEEE 45th Annual Computers, Software, and Applications Conference (COMPSAC) IEEE; 2021. Abnormal gait recognition based on integrated gait features in machine learning. [Google Scholar]
- 44.Li Y., et al. 2020 International Joint Conference on Neural Networks (IJCNN) Ieee; 2020. Human pose estimation based in-home lower body rehabilitation system. [Google Scholar]
- 45.Lin C.-B., et al. A framework for fall detection based on OpenPose skeleton and LSTM/GRU models. Appl. Sci. 2020;11(1):329. [Google Scholar]
- 46.Moro M., et al. Markerless vs. Marker-based gait analysis: a proof of concept study. Sensors. 2022;22(5):2011. doi: 10.3390/s22052011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Menychtas D., et al. Gait analysis comparison between manual marking, 2D pose estimation algorithms, and 3D marker-based system. Front Rehabil Sci. 2023;4 doi: 10.3389/fresc.2023.1238134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Afrouzian R., Seyedarabi H., Kasaei S. Pose estimation of soccer players using multiple uncalibrated cameras. Multimed. Tool. Appl. 2016;75:6809–6827. [Google Scholar]
- 49.Badiola-Bengoa A., Mendez-Zorrilla A. A systematic review of the application of camera-based human pose estimation in the field of sport and physical exercise. Sensors. 2021;21(18):5996. doi: 10.3390/s21185996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Pagnon D., Domalain M., Reveret L. Pose2Sim: an end-to-end workflow for 3D markerless sports kinematics—Part 2: accuracy. Sensors. 2022;22(7):2712. doi: 10.3390/s22072712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Citraro L., et al. Real-time camera pose estimation for sports fields. Mach. Vis. Appl. 2020;31(3):16. [Google Scholar]
- 52.Ota M., et al. Verification of reliability and validity of motion analysis systems during bilateral squat using human pose tracking algorithm. Gait Posture. 2020;80:62–67. doi: 10.1016/j.gaitpost.2020.05.027. [DOI] [PubMed] [Google Scholar]
- 53.Luvizon D.C., Picard D., Tabia H. Consensus-based optimization for 3D human pose estimation in camera coordinates. Int. J. Comput. Vis. 2022;130(3):869–882. [Google Scholar]
- 54.Zhou G., et al. A computer vision approach for estimating lifting load contributors to injury risk. IEEE Transactions on Human-Machine Systems. 2022;52(2):207–219. [Google Scholar]
- 55.Van Eetvelde H., et al. Machine learning methods in sport injury prediction and prevention: a systematic review. Journal of experimental orthopaedics. 2021;8:1–15. doi: 10.1186/s40634-021-00346-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Tack C. Artificial intelligence and machine learning| applications in musculoskeletal physiotherapy. Musculoskeletal Science and Practice. 2019;39:164–169. doi: 10.1016/j.msksp.2018.11.012. [DOI] [PubMed] [Google Scholar]
- 57.Raza A., et al. LogRF: an approach to human pose estimation using skeleton landmarks for physiotherapy fitness exercise correction. IEEE Access. 2023;11:107930–107939. [Google Scholar]
- 58.Yang L., et al. 2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) 2021. Human exercise posture analysis based on pose estimation. [Google Scholar]
- 59.Cai Z., Fernando O.N.N., Ong J.Y. 2022 International Conference on Cyberworlds (CW) 2022. PoseBuddy : pose estimation workout mobile application. [Google Scholar]
- 60.Rosique F., Losilla F., Navarro P.J. Applying vision-based pose estimation in a telerehabilitation application. Appl. Sci. 2021;11(19):9132. [Google Scholar]
- 61.Mehrizi R., et al. 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018) IEEE; 2018. Toward marker-free 3D pose estimation in lifting: a deep multi-view solution. [Google Scholar]
- 62.Chen C., et al. Repetitive assembly action recognition based on object detection and pose estimation. J. Manuf. Syst. 2020;55:325–333. [Google Scholar]
- 63.Kim W., et al. Ergonomic postural assessment using a new open-source human pose estimation technology (OpenPose) Int. J. Ind. Ergon. 2021;84 [Google Scholar]
- 64.Ying W., et al. Automatic scaffolding workface assessment for activity analysis through machine learning. Appl. Sci. 2021;11(9):4143. [Google Scholar]
- 65.Global, regional, and national burden of low back pain, 1990-2020, its attributable risk factors, and projections to 2050: a systematic analysis of the Global Burden of Disease Study 2021. Lancet Rheumatol. 2023;5(6):e316–e329. doi: 10.1016/S2665-9913(23)00098-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Musumeci G. The kinesiologist like a tailor: the art of making a tailor-made physical activity plan. J Funct Morphol Kinesiol. 2021;6(3) doi: 10.3390/jfmk6030076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Wade L., et al. Applications and limitations of current markerless motion capture methods for clinical gait biomechanics. PeerJ. 2022;10 doi: 10.7717/peerj.12995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Chou R., et al. Imaging strategies for low-back pain: systematic review and meta-analysis. Lancet. 2009;373(9662):463–472. doi: 10.1016/S0140-6736(09)60172-0. [DOI] [PubMed] [Google Scholar]
- 69.Slowik J.S., et al. Comparison of a single-view image-based system to a multi-camera marker-based system for human static pose estimation. J. Biomech. 2023;159 doi: 10.1016/j.jbiomech.2023.111746. [DOI] [PubMed] [Google Scholar]
- 70.Bai X., et al. CONet: crowd and occlusion-aware network for occluded human pose estimation. Neural Netw. 2024;172 doi: 10.1016/j.neunet.2024.106109. [DOI] [PubMed] [Google Scholar]
- 71.Lees A. Technique analysis in sports: a critical review. J. Sports Sci. 2002;20(10):813–828. doi: 10.1080/026404102320675657. [DOI] [PubMed] [Google Scholar]
- 72.Kellmann M. Preventing overtraining in athletes in high-intensity sports and stress/recovery monitoring. Scand. J. Med. Sci. Sports. 2010;20(Suppl 2):95–102. doi: 10.1111/j.1600-0838.2010.01192.x. [DOI] [PubMed] [Google Scholar]
- 73.Bengtsson V., Berglund L., Aasa U. Narrative review of injuries in powerlifting with special reference to their association to the squat, bench press and deadlift. BMJ open sport & exercise medicine. 2018;4(1) doi: 10.1136/bmjsem-2018-000382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Zernicke R., Whiting W. Biomechanics in Sport: Performance Enhancement and Injury Prevention. 2000. Mechanisms of musculoskeletal injury; pp. 507–522. [Google Scholar]
- 75.Haraldsdottir K., Watson A.M. Psychosocial impacts of sports-related injuries in adolescent athletes. Curr. Sports Med. Rep. 2021;20(2):104–108. doi: 10.1249/JSR.0000000000000809. [DOI] [PubMed] [Google Scholar]
- 76.Carbone A., Rodeo S. Review of current understanding of post‐traumatic osteoarthritis resulting from sports injuries. J. Orthop. Res. 2017;35(3):397–405. doi: 10.1002/jor.23341. [DOI] [PubMed] [Google Scholar]
- 77.Ravalli S., et al. Exploiting real-world data to monitor physical activity in patients with osteoarthritis: the opportunity of digital epidemiology. Heliyon. 2022;8(2) doi: 10.1016/j.heliyon.2022.e08991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Amendolara A., et al. An overview of machine learning applications in sports injury prediction. Cureus. 2023;15(9) doi: 10.7759/cureus.46170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Picard S., et al. 2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW) IEEE; 2020. Ensuring dataset quality for machine learning certification. [Google Scholar]
- 80.Tenforde A.S., et al. Telehealth in physical medicine and rehabilitation: a narrative review. PM&R. 2017;9(5):S51–S58. doi: 10.1016/j.pmrj.2017.02.013. [DOI] [PubMed] [Google Scholar]
- 81.Rawstorn J.C., et al. Telehealth exercise-based cardiac rehabilitation: a systematic review and meta-analysis. Heart. 2016;102(15):1183–1192. doi: 10.1136/heartjnl-2015-308966. [DOI] [PubMed] [Google Scholar]
- 82.Carrier B., et al. Validation of garmin fenix 3 HR fitness tracker biomechanics and metabolics (VO2max) Journal for the Measurement of Physical Behaviour. 2020;3(4):331–337. [Google Scholar]
- 83.Bevan S. Economic impact of musculoskeletal disorders (MSDs) on work in Europe. Best Pract. Res. Clin. Rheumatol. 2015;29(3):356–373. doi: 10.1016/j.berh.2015.08.002. [DOI] [PubMed] [Google Scholar]
- 84.Lurati A.R. Health issues and injury risks associated with prolonged sitting and sedentary lifestyles. Workplace health & safety. 2018;66(6):285–290. doi: 10.1177/2165079917737558. [DOI] [PubMed] [Google Scholar]
- 85.McAtamney L., Nigel Corlett E. RULA: a survey method for the investigation of work-related upper limb disorders. Appl. Ergon. 1993;24(2):91–99. doi: 10.1016/0003-6870(93)90080-s. [DOI] [PubMed] [Google Scholar]
- 86.Valero E., et al. Musculoskeletal disorders in construction: a review and a novel system for activity tracking with body area network. Appl. Ergon. 2016;54:120–130. doi: 10.1016/j.apergo.2015.11.020. [DOI] [PubMed] [Google Scholar]
- 87.Huang C., et al. Development and validation of a wearable inertial sensors-based automated system for assessing work-related musculoskeletal disorders in the workspace. Int. J. Environ. Res. Publ. Health. 2020;17(17):6050. doi: 10.3390/ijerph17176050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Hassan A., et al. Work-relatedness of carpal tunnel syndrome: systematic review including meta-analysis and GRADE. Health Sci Rep. 2022;5(6):e888. doi: 10.1002/hsr2.888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Washabaugh E.P., et al. Comparing the accuracy of open-source pose estimation methods for measuring gait kinematics. Gait Posture. 2022;97:188–195. doi: 10.1016/j.gaitpost.2022.08.008. [DOI] [PubMed] [Google Scholar]
- 90.Nakano N., et al. Evaluation of 3D markerless motion capture accuracy using OpenPose with multiple video cameras. Front Sports Act Living. 2020;2:50. doi: 10.3389/fspor.2020.00050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Zago M., et al. 3D tracking of human motion using visual skeletonization and stereoscopic vision. Front. Bioeng. Biotechnol. 2020;8:181. doi: 10.3389/fbioe.2020.00181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Needham L., et al. The accuracy of several pose estimation methods for 3D joint centre localisation. Sci. Rep. 2021;11(1) doi: 10.1038/s41598-021-00212-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Lafayette T.B.d.G., et al. Validation of angle estimation based on body tracking data from RGB-D and RGB cameras for biomechanical assessment. Sensors. 2023;23(1):3. doi: 10.3390/s23010003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Hakim I.M., et al. 2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC) 2023. 3D human pose estimation using blazepose and direct linear transform (DLT) for joint angle measurement. [Google Scholar]
- 95.Cordeiro M.C., Catháin C.Ó., Rodrigues T.B. Proceedings of the 14th International Workshop on Immersive Mixed and Virtual Environment Systems. Association for Computing Machinery; 2022. The development of a machine learning/augmented reality immersive training system for performance monitoring in athletes; pp. 19–22. Athlone, Ireland. [Google Scholar]
- 96.Postolache O., et al. In: Advanced Systems for Biomedical Applications. Kanoun O., Derbel N., editors. Springer International Publishing; Cham: 2021. Virtual reality and augmented reality technologies for smart physical rehabilitation; pp. 155–180. [Google Scholar]
- 97.Gibney E. What the EU's tough AI law means for research and ChatGPT. 2024. [cited 2024 May] [DOI] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
No data was used for the research described in the article.


