Skip to main content
PLOS One logoLink to PLOS One
. 2026 Apr 8;21(4):e0330097. doi: 10.1371/journal.pone.0330097

Prediction of on-field rugby scrummaging contact forces from videos using artificial neural networks

Juan Cordero-Sánchez 1, Zak Sheehy 2, Gil Serrancolí 3,4,*, Ezio Preatoni 2, Grant Trewartha 5, Dario Cazzola 2
Editor: Christopher Kirk6
PMCID: PMC13061245  PMID: 41950178

Abstract

Rugby is a contact sport with a high risk of injury. Scrums are a key component of the game and represent a collision event associated with serious injuries, especially to the cervical spine. Traditional methods for measuring scrum forces rely on both pressure sensors and scrum machines, but these have limitations for on-field monitoring. Thus, this study investigates the feasibility of using a machine learning approach to predict contact forces in rugby scrummaging during real-game situations, based on only anatomical landmarks of players extracted from video footage. A recurrent neural network (RNN) model was trained to predict contact forces from velocity signals. A dataset of eleven rugby teams (22 forward packs, n = 176 players) during on-field scrum engagements was used. Data augmentation techniques using a generative adversarial network and the Mixup technique were applied to increase the size of the training dataset. The RNN was trained using time-dependent data, with 2D trajectories from video landmarks as inputs and force signals from pressure sensors as outputs. The RNN´s prediction results showed good to excellent agreement between the predicted and measured time-dependent normal contact force signals, with a mean correlation of 0.95 ± 0.05. The mean normalized RMSE was 9.3 ± 4.3% and the mean normalized absolute difference in peak force was 6.7 ± 5.5%. This study demonstrated the feasibility of using RNNs to quantify contact forces in rugby scrummaging using only single-camera video, without players wearing sensors. In combination with a 2D human pose estimation model, the ANN trained in this study could support performance analysis, coaching, technique correction, and injury prediction using in-game data.

Introduction

Rugby is a contact sport played worldwide, where players frequently engage in multiple collisions throughout a match [1]. As a result, compared to other team sports, rugby carries a higher risk of injury [2]. The potential for injury increases with high impact forces [3], with more than half of all rugby-associated injuries affecting soft tissues [36]. This occurs when a player collides against an opponent as it is the case for tackling and scrum events. Scrummaging involves a dynamic engagement phase and a period of sustained pushing [7,8]. Misdirected or repetitive impact scrum engagements are suggested to be related to the largest incidence rates of cervical spine injuries in rugby union [9,10] as the result of extreme neck flexion, with or without rotation or hyperextension of the C4, C5, and C6 vertebrae [3]. Although severe cervical spine injuries are rare [11], their consequences induce substantial life alterations [12].

Several studies focusing on the measurement of scrum forces have been conducted using a scrum machine [8] or with pressure sensors attached to the players´ shoulders [13,14]. However, pressure sensors cannot be used for in-game performance monitoring due to the difficulty of having players wear them during the game. In addition, the development and wider adoption of wearable technology (WT) in sports, such as global positioning systems (GPS) and inertial measurement units (IMU), allow the quantification of both position-specific demands [15,16] and collision frequency [17,18] supporting rugby performance and rehabilitation [1921] under match-like conditions [22]. However, depending on how easily the WT can be attached and how well it fits the body, its comfort features can significantly impact player´s performance. In addition, movements of the WT due to collisions and muscle artifacts can lead to measurement inaccuracies [23]. Therefore, a method to quantify rugby player´s positions and velocities as well as the contact forces between them, without interfering with their game, would enhance athletic performance, provide coaching support, aid in technique correction and improve injury prediction [24] from in-game data analysis. This method could involve the use of video-based motion analysis to allow the remote quantification of player kinematics without physical sensors. This enables a fully non-invasive estimation of body positions and velocities during real-game situations. Therefore, implementing video-derived kinematic features as input to artificial neural networks (ANNs) represents a powerful alternative to wearable systems for estimating contact forces in rugby scrummaging.

ANNs have the capacity of dealing with complex, high-dimensional and noisy datasets to identify subtle patterns or hidden structures. In addition, in recent years, there has been a rapid advancement in ANNs, driven by increased computational power, what has promoted their use as a common tool in biomechanics [25]. Consequently, their application in contact force prediction in rugby biomechanics, where the understanding of player movement and contact forces is critical for performance optimization and injury prevention [26], would be valuable.

Therefore, ANNs offer a powerful approach to improve decision-making [27], player performance [28,29] and reduce injury risk [30,31] in rugby. Furthermore, ANNs have been used to predict ground reaction forces (GRF) based on kinematics during walking [32,33] and running [3437]. This enables GRF estimation outside of laboratory settings, without the need for experimental techniques typically required to record dynamics data (e.g., force plates). Similarly, predicting shoulder contact forces during rugby scrummaging could become feasible using video-based motion kinematics, just as GRF can be predicted from wearable sensor data during running.

State-of-the-art techniques suggest that collision contact forces in rugby can also be predicted from the kinematics of players. Stitt et al. [38] developed a machine learning model to classify rugby player head impact, while Pomarat et al. [39] trained an ANN to estimate single-player GRF in rugby scrummaging from inertial and force data recorded with instrumented insoles. However, these models were implemented in laboratory settings, so their applicability is limited for real game environments. Thus, this study aims to develop an ANN-based tool to predict shoulder contact forces between individual players from single-camera videos of in-game rugby scrummaging, offering a solution for real-time biomechanical analyses.

Methods

Dataset

The dataset used in this study was previously collected and published by Cazzola et al. [13] within the project Biomechanics of the Rugby Scrum funded by the International Rugby Board. It contains data of 176 elite rugby players from eleven professional teams (22 forward packs of 8 players: 853.9 ± 28.0 kg per pack). Data collection from these participants was carried out following the institutional ethics committee at the University of Bath, School for Health Research Ethics Approval Panel, and after individual written informed consent was provided by each participant. This dataset consisted of top-view videos and raw kinetic data collected during on-field contested scrum engagements between professional rugby teams. To boost the generalization of the ANN´s model, three different engagement techniques were included in the dataset. Each team (two forward packs, referred to as ‘Team A’ and ‘Team B’) performed between 12 and 16 scrums (resulting in a total of 330 scrummaging trials) using the engagement techniques of crouch-touch-pause-engage (CTPE), crouch-touch-set (CTS), and PreBind, in which props prebind with the opposition prior to the ‘Set’ command. Players’ movements were recorded from a top-view camera (HVR-Z5, Sony, Japan) at 50 Hz. Scrummaging contact forces were estimated from pre-calibrated [40] pressure sensors (F-Scan, Tekscan Inc, USA, 500 Hz) attached to the left and right shoulders of the front-row players from ‘Team A’ (Fig 1).

Fig 1. Camera top view of one scrumming trial showing the vertebrae C7 and lumbar point locations for each ‘Team A’ and ‘Team B’ front row player.

Fig 1

Data analysis

The two-dimensional (2D) coordinates of the C7 vertebra and lumbar locations of the front row players were manually labelled from the provided top-view videos. Reliability of these labelled data was assessed and was high, as reported by Preatoni et al. [41]. In addition, in this study, the 2D coordinates and the raw contact forces previously obtained from pressure sensors were low-pass filtered with a second-order Butterworth filter with a cut-off frequency of 20 and 100 Hz, respectively. The components of the midpoint velocity between the C7 and lumbar top-plane locations were calculated as the first derivative of their position vector, then the module vector of the velocity was computed. Velocity-force profiles were computed for each pair in the A1L-B3, A1R-B3, A2L-B3, A2R-B2, A3L-B2, A3R-B1 player engagements, where ‘R’ and ‘L’ indicate the contact forces for the right and left shoulders of the ‘Team A’ players (Fig 1). The velocity curve of each velocity-force profile was calculated as the mean velocity between each pair of player´s velocities. A total of 42 trials were finally selected from the original dataset after data cleaning, which involved removing empty files and scrum trials in which velocity and contact force peak values fell outside the ranges of 2.5–4.5 m·s-1 and 1–4 kN, respectively. Based on the literature [13], values beyond these thresholds were considered physiologically unrealistic or potentially indicative of technical execution errors during the trials, such as sensor misalignment. S1 Fig (Supporting information) displays the engagement technique and player´s shoulder for each scrum trial in the final dataset. The time period analyzed was between the first frame when the contact force was higher than 10 N, and the frame with the maximum contact force. Both velocity and contact force signals were time-normalized to 101 data points. Fig 2 shows the distribution of the velocity-force time-dependent curves according to the engagement technique.

Fig 2. Velocity-force distribution of all measured time-dependent curves for each engagement technique (CTS, PreBind, CTPE) among all teams and front row players.

Fig 2

Each line represents one player´s shoulder contact trial. This distribution displays the relationship between the linear velocity of the rugby players and the contact force generated during each scrum collision after data cleaning.

Data augmentation

A generative adversarial network (GAN) was trained to generate one thousand new pairs of velocity-force signals. GANs are generative models that create new samples from scratch. These samples have realistic patterns from the distribution of the original dataset. Therefore, the GAN was trained to synthesize additional realistic velocity–force pairs following the same statistical distribution as the measured data. This data augmentation was implemented to balance the dataset and try to reduce overfitting due to the limited number of real scrum trials, not to modify or substitute the original dataset. The generator was a sequential model with a latent dimension of 50 and composed of one dense layer (26x128 neurons) followed by two convolutional layers (64 and 2 neurons, SELU and tangential activations) ending with one dense layer (202 neurons, tangential activation). Batch normalization layers were implemented in the generator model. The discriminator was a sequential model composed of two convolutional networks (64 and 128 neurons, LeakyReLU activation) and one dense layer (one neuron, sigmoid activation). In addition, layers of dropout were added to regularize the discriminator with a dropout rate of 40%. The convolutional layers had a kernel size of 5 and a stride of 2 with zero padding. The selection of the GAN´s architecture and its hyperparameters relied on manual tuning, according to the correlation and root mean squared error (RMSE) between its generated curves and the measured ones. The resulting final dataset generated by the GAN showed mean correlation and RMSE values of 0.99 and 0.05 m·s-1 for the velocity curves, and 0.99 and 128.7 N for the force curves. In both models, the adaptive moment estimation (Adam) algorithm [42] was used as optimizer (with a learning rate of 10−3) due to its good convergence quality and speed. In addition, using the Mixup technique [43], data were augmented to enlarge the input dataset of the recurrent neural network (RNN). Mixup is a non-generative data augmentation approach that was used to create new velocity–force signal pairs by linearly combining pairs of temporally normalized velocity and force time-series. To construct physically meaningful intermediate time-series while preserving temporal structure a constant mixup coefficient of 0.5 was used. This improves the generalization of the RNN as it provides synthetic data that decreases overfitting.

Data prediction

A RNN model was trained with the dataset generated by the GAN and enlarged by the Mixup technique. This dataset was split into the training dataset (80%) and the validation dataset (20%). The prediction performance of the RNN was assessed with the test dataset that contained the measured velocity and force data (see the training and test processes in Fig 3). The RNN model was chosen given the sequential and dynamic nature of contact forces, which involves complex temporal dependencies with the velocity signals across time steps. The RNN´s input was the velocity curve of each velocity-force profile, while its output was the predicted force curve of the collision between single player´s shoulders. The RNN was a sequential model composed of two layers with 128 and 64 long short-term memory (LSTM) cells, followed by a dense layer with 1 neuron to produce a single output per time step. The rectified linear unit (ReLU) activation function was used in the last dense layer due to its fast computation and because it does not saturate for positive values. The Adam algorithm, with a learning rate of 0.002 was used as optimizer, while early stopping was implemented to regularize the learning process. The RNN’s structure and the learning rate were selected according to the GridSearch algorithm [44] using cross-validation to evaluate all the possible combinations among the number of layers (1, 2 or 3), LSTM cells (32, 64 or 128) and the learning rate (0.001, 0.002 or 0.01). Velocity and force signals were normalized by the maximum absolute value from the entire training dataset to ensure a consistent range of values, thereby facilitating the learning process of the RNN. In the prediction phase, the input data are normalized using the same maximum absolute values determined during training. Pearson´s correlation and RMSE were calculated to evaluate the agreement between the measured and the predicted contact force curves. Correlation was ranked as poor (0–0.5), moderate (0.5–0.75), good (0.75–0.90) and excellent (> 0.9) [45]. Data analysis and neural networks training were carried out in Python (v3.11.5) with TensorFlow and Keras using an Intel i5-14600KF CPU computer with 32GB RAM and a Nvidia GeForce RTX 4060 16GB VRAM GPU. The trained RNN and the data that were used to train and evaluate it, along with a brief tutorial, can be found in this repository: https://doi.org/10.5281/zenodo.18048888.

Fig 3. Diagram of the process to train the RNN to predict contact forces from velocity signals.

Fig 3

Results

The agreement between the predicted and measured contact forces ranged from good (0.75–0.90) to excellent (> 0.90) with a mean correlation of 0.95 ± 0.05. The overall mean RMSE and mean absolute peak differences were 327.5 ± 150.8 N (9.3 ± 4.3%) and 236.1 ± 194.2 N (6.7 ± 5.5%), respectively. Comparing the measured and predicted force signals from the start of contact between the front-row players of both teams to the instant the force peak is reached, the highest RMSE values correspond to scrums 12 and 14, which have very similar measured signal curves (Fig 4). In addition, more than half of the force predictions tended to underestimate the measured curves during most of the collision phase. The same occurred with the peak force values, which were more often underestimated by the ANN (see Fig 5 showing the Bland-Altman plot for all peak force values). The mean difference is 54.3 N and the limits are + 1.96SD: 640 N and −1.96SD: −530 N. This means that the bias was low, but the large limits indicated high prediction variability. Bland-Altman plot values for each engagement technique were 114.6 N (+1.96SD: 660 N/ −1.96SD: −430 N), −36.9 N (+1.96SD: 510 N/ −1.96SD: −580 N) and 117.6 N (+1.96SD: 730 N/ −1.96SD: −490 N) for the CTPE, CTS and PreBind techniques, respectively.

Fig 4. Predicted (blue) and measured (red) time-normalized contact force curves.

Fig 4

Bottom right values stand for correlation, RMSE (Newtons) and peak difference (Newtons). The error percentage is the RMSE and peak difference normalized to the maximum of the corresponding measured force curve.

Fig 5. Bland–Altman plot for all the peak contact force values (Newtons) according to each engagement technique.

Fig 5

The mean difference is 54.3 N and the 95% limits of agreement are + 1.96SD: 640 N and −1.96SD: −530 N.

The major error percentage was found in the PreBind technique for both RMSE (13.5%) and peak difference (10.4%). In the case of player position, the right shoulder of the A3 player reached the highest RMSE (14.5%), while the left shoulder of the A1 player displayed the highest force peak difference (11.4%). Table 1 and Table 2 show the correlation and RMSE values between the measured and predicted contact force curves as well as the peak difference values according to each engagement technique (Table 1) and front row player´s shoulder (Table 2).

Table 1. Mean and standard deviation of the correlation, RMSE (Newtons) and absolute force peak difference (Newtons) between measured and predicted contact forces for each engagement technique. The error percentage is the RMSE and peak difference normalized to the maximum of the underlying engagement force data.

Engagement Correlation RMSE Peak difference
CTPE 0.95 ± 0.07 371.4 ± 189.4 N (10.5 ± 5.4%) 228.4 ± 204.3 N

(6.5 ± 5.8%)
CTS 0.96 ± 0.03 285.5 ± 101.1 N (10.2 ± 3.6%) 216.5 ± 183.7 N

(7.8 ± 6.6%)
PreBind 0.95 ± 0.04 344.0 ± 165.4 N (13.5 ± 6.5%) 265.9 ± 209.4 N
(10.4 ± 8.2%)

Table 2. Mean and standard deviation of the correlation, RMSE (in Newtons) and absolute force peak difference (in Newtons) between measured and predicted contact forces for each front row player´s shoulder. The error percentage is the RMSE and peak difference normalized to the maximum of the underlying player´s shoulder force data. L and R stand for left and right shoulder, respectively.

Player Correlation RMSE Peak difference
A1-L 0.94 ± 0.05 343.5 ± 60.7 N

(13.5 ± 2.4%)
291.4 ± 212.1 N

(11.4 ± 8.3%)
A1-R 0.93 ± 0.07 458.8 ± 205.0 N

(13.0 ± 5.8%)
260.9 ± 128.6 N

(7.4 ± 3.7%)
A2-L 0.96 ± 0.04 335.1 ± 141.7 N

(11.3 ± 4.8%)
316.5 ± 248.4 N

(10.7 ± 8.4%)
A2-R 0.97 ± 0.03 292.8 ± 175.6 N

(11.7 ± 7.0%)
254.5 ± 261.4 N

(10.2 ± 10.6%)
A3-L 0.96 ± 0.02 241.5 ± 90.0 N

(8.6 ± 3.2%)
176.2 ± 131.7 N

(6.3 ± 4.7%)
A3-R 0.95 ± 0.05 303.2 ± 130.7 N

(14.5 ± 6.3%)
102.7 ± 97.3 N

(4.9 ± 4.7%)

Discussion

In this study, we explored the possibility of estimating engagement forces in a rugby union scrum using only motion (video) data as input to the RNN. Our approach included the use of data augmentation techniques showing promising results in the estimation of engagement forces from velocity data of the front row players in the scrum. These results showed the feasibility of predicting contact forces during scrummaging as well as their respective force impact peaks with mean errors lower than 10% (9.3% and 6.7%, respectively) of the maximum measured contact force.

The proposed work on scrum force estimation enables in-game impact force quantification, unlocking the ability to measure the mechanical load experienced by players during real-time match analyses. This advancement could significantly improve the calculation of rugby performance indicators [29,46,47], facilitate tactical decisions [27,28,48], and ultimately reduce injuries [24,49,50]. In addition, our findings complement other machine learning works focused on both detecting impact events [20,31,5153] and estimating poses in rugby [30,54]. These machine learning models employed in prior research can provide the time of impact and the player’s posture, which are the inputs required for applying our scrum engagement force prediction model. However, as the used dataset to train this model consisted exclusively of elite male rugby players, its generalizability to other populations (e.g., female, youth, or amateur players) remains untested. These groups differ in anthropometry, strength, and scrummaging technique. Nevertheless, as the inputs of our model are solely linear velocities, its predictions should not be strongly affected by players’ physical differences. Even so, future work should include additional data from diverse cohorts to enable transfer learning and ensure model applicability across playing levels.

The RNN model developed in this study was trained to estimate the contact force signal during the engagement phase of the force production in scrummaging. This impulsive engagement phase is highly influenced by the mass and the engagement speed of the players [55,56]. Although our RNN did not consider the mass of the players, this would not be a weakness of the RNN since Quarrie and Wilson [56] found that, even though the mean correlation between players pack mass and the scrum force was higher than 0.8, the 95% confidence interval of this correlation included zero making this relationship variable. In addition, it should be noted that heavier forward packs generate greater impact force only in the case of the full scrum technique [57]. Moreover, the prediction accuracy of the RNN developed in this study tended to be similar among the different engagement techniques analysed. Only the PreBind percentage errors of the RNN predictions were slightly higher. Scrummaging collisions result in lower impact forces with the PreBind technique, as demonstrated experimentally by [13]. This could indicate that the lower the scrum impact force, the worse the RNN’s accuracy. The mean maximum RMSE and peak absolute differences among players were 217.4 N and 213.8 N, respectively; and among engagement techniques, the corresponding values were 59.0 N and 49.4 N. This shows that the error observed using different engagement techniques and player positions in the front row was lower than the intrinsic error (~250/300N) of the pressure sensors [40].

The novelty of our work is the prediction of contact forces during scrummaging collisions using a RNN combined with video motion capture. Previous research in literature predicting dynamic data from kinematics had primarily focussed on GRF in tasks like gait and running [33,35,58], and to a lesser extent, pedal forces in cycling [59]. Focusing on rugby, Stitt et al. [38] recently developed a head impact classification machine learning model to study the rugby head impact in a laboratory setting. In addition, Pomarat et al. [39] trained a LSTM neural network to predict three-dimensional GRF from pressure insole signals during scrums in laboratory isolated conditions using a rugby scrum machine. Although the sports task was scrum collision, the application and purpose of these studies were different. Therefore, a straightforward comparison with our results was not possible. Additionally, the data used to train our RNN model were collected during match-realistic scrum situations while the output data were the shoulder contact forces. However, these studies could be used in combination with the method proposed here to get a deeper insight into both the risk of head impacts and the dynamics of scrummaging, by performing inverse dynamics analyses knowing the external forces (GRF and shoulder contact forces) and their points of application along with the kinematics of the front-row players.

The force prediction results of our study were consistent with the error magnitudes reported in other studies for different sports tasks ranging between 12.0%−15.0% [33] and 4.0%−14.0% [35] for GRF, and between 9.9%−13.0% [59] for pedal forces. Nevertheless, caution should be taken when comparing our results with other studies due to differences in both the movement tasks and the nature of the external forces being predicted. Most studies predicted GRF during cyclic movements and, as it was highlighted by [60], model performance is strongly dependent on the task that it was trained on. In addition, although scrum shoulder contact forces and GRF are both external forces, they arise from interactions between different types of bodies. Contact forces, generated by collisions between two (or more) bodies, depend on the mechanical properties of those bodies [61]. Shoulder contact forces come from the collision between two soft tissues in movement, which could add more variability into the force signal. This could be a reason for the results observed in the Bland–Altman plot, whose limits of agreement (±600 N) indicate a notable variability. However, this represents approximately 15% of the maximum measured peak force. Therefore, the model could be better suited for group-level monitoring and trend analysis (e.g., detecting systematic increases in impact load or comparing engagement techniques) rather than for making clinical or safety judgments on single events and individual predictions. Thus, despite scrummaging being a challenging task involving multiple players performing high-impact movements in real-world situations, the RNN trained in this study achieved prediction accuracies, in terms of mean error percentages, comparable to those reported in other studies that predicted external forces during more regular movements under highly controlled conditions.

The main limitation of this study was that the training of the RNN was conducted solely based on 2D body locations obtained from a top-view perspective. This approach resulted in a loss of information regarding the player’s poses during the scrum, which could be highly relevant to evaluate the correlation between player kinematics and scrum contact force generation. However, in rugby scrummaging, players are trained and conditioned to maximize forward (horizontal) force production while maintaining stable body positions, with vertical force and movement components playing a secondary role in the overall movement strategy [62]. Nevertheless, there is considerable room for improvement in terms of both the accuracy and robustness of force predictions. Thus, to improve the performance of the RNN developed in this study, future work should focus on estimating the full-body pose of players during the scrum to combine it with scrum collision physics models and ANNs. By integrating 2D (or even 3D) positions of multiple body joints using pose estimation neural network models, it may be possible to extract a much richer set of anatomical landmarks from video data. This extended set of anatomical features would enable the computation of additional kinematic descriptors such as 3D joint angles, velocities and accelerations, which are critical for understanding force transmission through the kinetic chain during scrum engagement.

However, it must be considered that the transitioning from manually labelled landmarks to automated pose estimation introduces additional sources of error related to landmark misidentification, depth ambiguity, and occlusion. Menychtas et al. [63] have shown that 2D pose estimation algorithms yield accurate estimates for large joint movements but struggle to capture small, high-frequency motions or those occurring out of plane. Similarly, automated vision-based tracking systems may exhibit greater variability when the visibility of landmarks is compromised [64]. On the other hand, manual labelling is more consistent, but it could introduce a static offset due to a user´s bias [63]. To mitigate these limitations, future work should employ confidence-weighted filtering to disregard low-certainty keypoints, apply temporal smoothing to correct transient errors, and train pose-estimation models using hybrid datasets combining automatically estimated landmarks with manual corrections. These strategies can improve robustness and help bridge the accuracy gap between manual and automated tracking systems. Nevertheless, there is still a significant time-saving benefit of the automated algorithms over manual annotation [63]. This efficiency facilitates the annotation of a larger number of body landmarks and, consequently, incorporating this richer information as input to the neural network would provide a more comprehensive representation of player´s postures and motions, ultimately enabling more accurate predictions of contact forces. In addition, embedding physical laws such as Newtonian mechanics or even simplified musculoskeletal models directly into the RNNs training process could improve the physical consistency of the contact force predictions. This could ensure that predicted contact forces do not violate biomechanical limits and that they are consistent with observed accelerations.

To conclude, this study demonstrates the feasibility of using RNN models to predict engagement forces in rugby union scrums. This opens the door to real-time, in-game analysis of scrum collisions, which could be applied directly on the field via embedded sensors or computer vision. Such systems could provide instant feedback on force magnitudes, highlighting dangerously high engagements, and detect performance inconsistencies. As a result, coaches and medical staff could use this information during the game to assess injury risk, identify players showing signs of fatigue or poor form, or make tactical adjustments in real time. This level of live analysis would revolutionize player monitoring, adding a data-driven layer of decision-making unavailable during match play, what represents a meaningful step forward in biomechanical analysis of rugby performance.

Supporting information

S1 Fig. Matrix of measured scrum trials showing what engagement technique (CTS: blue, PreBind: orange, CTPE: green) and player´s shoulder each trial comes from.

It shows both the engagement technique (CTS: blue, PreBind: orange, CTPE: green) and the players’ shoulders (A1, A2, A3; L: left, R: right. See Fig 1) for each measured trial that makes up the RNN test dataset. Trial numbers (upper left) correspond to the same trials shown in Fig 4.

(EPS)

pone.0330097.s001.eps (5.3MB, eps)

Data Availability

Data will be available at https://doi.org/10.5281/zenodo.18048888.

Funding Statement

This study was partly supported by the Spanish Research Agency through a grant awarded to J.C. and G.S. (PID2023-150189NA-I00) and Universitat Politècnica de Catalunya. The specific roles of these authors are articulated in the ‘author contributions’ section. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Worsfold P, Twist C. The Science of Rugby. 2nd ed. Routledge. 2023. 9780367492113. [Google Scholar]
  • 2.West SW, Williams S, Cazzola D, Kemp S, Cross MJ, Stokes KA. Training load and injury risk in elite rugby union: the largest investigation to date. Int J Sports Med. 2021;42(8):731–9. doi: 10.1055/a-1300-2703 [DOI] [PubMed] [Google Scholar]
  • 3.Kaplan KM, Goodwillie A, Strauss EJ, Rosen JE. Rugby injuries: a review of concepts and current literature. Bull NYU Hosp Jt Dis. 2008;66(2):86–93. [PubMed] [Google Scholar]
  • 4.Targett SG. Injuries in professional Rugby Union. Clin J Sport Med. 1998;8(4):280–5. doi: 10.1097/00042752-199810000-00005 [DOI] [PubMed] [Google Scholar]
  • 5.Bathgate A, Best JP, Craig G, Jamieson M. A prospective study of injuries to elite Australian rugby union players. Br J Sports Med. 2002;36(4):265–9; discussion 269. doi: 10.1136/bjsm.36.4.265 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Barbosa NL, Fratti Neves L. Functional assessment in athletes with and without knee injury of a rugby team. Int Phys Med Rehab J. 2020;5(1):46–51. doi: 10.15406/ipmrj.2020.05.00227 [DOI] [Google Scholar]
  • 7.Milburn PD. The kinetics of rugby union scrummaging. J Sports Sci. 1990;8(1):47–60. doi: 10.1080/02640419008732130 [DOI] [PubMed] [Google Scholar]
  • 8.Preatoni E, Stokes KA, England ME, Trewartha G. The influence of playing level on the biomechanical demands experienced by rugby union forwards during machine scrummaging. Scand J Med Sci Sports. 2013;23(3):e178-84. doi: 10.1111/sms.12048 [DOI] [PubMed] [Google Scholar]
  • 9.Silvestros P, Cazzola D. Cervical spine injuries in rugby scrummaging: is buckling the most likely injury mechanism?. In: Book of abstracts for the XVI International Symposium on Computer Simulation in Biomechanics. Coolangatta, Australia; 2017. [Google Scholar]
  • 10.Quarrie KL, Cantu RC, Chalmers DJ. Rugby union injuries to the cervical spine and spinal cord. Sports Med. 2002;32(10):633–53. doi: 10.2165/00007256-200232100-00003 [DOI] [PubMed] [Google Scholar]
  • 11.Hutton MJ, McGuire RA, Dunn R, Williams R, Robertson P, Twaddle B. Catastrophic cervical spine injuries in contact sports. Global Spine J. 2016;6(7):721–34. doi: 10.1055/s-0036-1586744 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Bickenbach J, Officer A, Shakespeare T, von Groote P. International perspectives on spinal cord injury. World Health Organization, editor. Geneva: World Health Organization. 2013. [Google Scholar]
  • 13.Cazzola D, Preatoni E, Stokes KA, England ME, Trewartha G. A modified prebind engagement process reduces biomechanical loading on front row players during scrummaging: a cross-sectional study of 11 elite teams. Br J Sports Med. 2015;49(8):541–6. doi: 10.1136/bjsports-2013-092904 [DOI] [PubMed] [Google Scholar]
  • 14.Preatoni E, Cazzola D, Stokes KA, England M, Trewartha G. Pre-binding prior to full engagement improves loading conditions for front-row players in contested rugby union scrums. Scand J Med Sci Sports. 2015;26(12):1398–407. doi: 10.1111/sms.12592 [DOI] [PubMed] [Google Scholar]
  • 15.Cahill N, Lamb K, Worsfold P, Headey R, Murray S. The movement characteristics of English Premiership rugby union players. J Sports Sci. 2013;31(3):229–37. doi: 10.1080/02640414.2012.727456 [DOI] [PubMed] [Google Scholar]
  • 16.Waldron M, Twist C, Highton J, Worsfold P, Daniels M. Movement and physiological match demands of elite rugby league using portable global positioning systems. J Sports Sci. 2011;29(11):1223–30. doi: 10.1080/02640414.2011.587445 [DOI] [PubMed] [Google Scholar]
  • 17.Hulin BT, Gabbett TJ. Activity Profiles of Successful and Less-successful Semi-elite Rugby League Teams. Int J Sports Med. 2015;36(6):485–9. doi: 10.1055/s-0034-1398532 [DOI] [PubMed] [Google Scholar]
  • 18.Hulin BT, Gabbett TJ, Johnston RD, Jenkins DG. Wearable microtechnology can accurately identify collision events during professional rugby league match-play. J Sci Med Sport. 2017;20(7):638–42. doi: 10.1016/j.jsams.2016.11.006 [DOI] [PubMed] [Google Scholar]
  • 19.Alderson A. SportsTech wearable sensors: making rugby safer. Engineering and Technology. 2016;11(6):76–7. doi: 10.1049/et.2016.0607 [DOI] [Google Scholar]
  • 20.Kelly D, Coughlan GF, Green BS, Caulfield B. Automatic detection of collisions in elite level rugby union using a wearable sensing device. Sports Eng. 2012;15(2):81–92. doi: 10.1007/s12283-012-0088-5 [DOI] [Google Scholar]
  • 21.Powell D, Stuart S, Godfrey A. Wearables in rugby union: A protocol for multimodal digital sports-related concussion assessment. PLoS One. 2021;16(12):e0261616. doi: 10.1371/journal.pone.0261616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lehra C, Omalekb O, Osborne S, Warren Z, Saucier D, F. Burch V R, et al. Wearable Applications in Rugby for Performance Quantification and Player Health Assessment: A Brief Review. IJKSS. 2022;10(2):1–10. doi: 10.7575/aiac.ijkss.v.10n.2p.1 [DOI] [Google Scholar]
  • 23.Such O, Muehlsteff J. The Challenge of Motion Artifact Suppression in Wearable Monitoring Solutions. In: Proceedings of the 3rd IEEE-EMBS. Boston; 2006. doi: 10.1109/ISSMDBS.2006.360094 [DOI] [Google Scholar]
  • 24.Tedesco S, Crowe C, Ryan A, Sica M, Scheurer S, Clifford AM, et al. Motion sensors-based machine learning approach for the identification of anterior cruciate ligament gait patterns in on-the-field activities in rugby players. Sensors (Basel). 2020;20(11):3029. doi: 10.3390/s20113029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Halilaj E, Rajagopal A, Fiterau M, Hicks JL, Hastie TJ, Delp SL. Machine learning in human movement biomechanics: Best practices, common pitfalls, and new opportunities. J Biomech. 2018;81:1–11. doi: 10.1016/j.jbiomech.2018.09.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Edwards S, Lee R, Fuller G, Buchanan M, Tahu T, Tucker R, et al. 3D Biomechanics of rugby tackle techniques to inform future rugby research practice: a systematic review. Sports Med Open. 2021;7(1):39. doi: 10.1186/s40798-021-00322-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Watson N, Hendricks S, Stewart T, Durbach I. Integrating machine learning and decision support in tactical decision-making in rugby union. J Operational Res Society. 2020;72(10):2274–85. doi: 10.1080/01605682.2020.1779624 [DOI] [Google Scholar]
  • 28.Reed D, O’Donoghue P. Development and application of computer-based prediction methods. Int J Performance Analysis in Sport. 2005;5(3):12–28. doi: 10.1080/24748668.2005.11868334 [DOI] [Google Scholar]
  • 29.Parmar N, James N, Hearne G, Jones B. Using principal component analysis to develop performance indicators in professional rugby league. Int J Performance Analysis in Sport. 2018;18(6):938–49. doi: 10.1080/24748668.2018.1528525 [DOI] [Google Scholar]
  • 30.Nishio M, Nonaka N, Fujihira R, Murakami H, Tajima T, Yamada M. Objective detection of high-risk tackle in rugby by combination of pose estimation and machine learning. In: New Frontiers in Artificial Intelligence: JSAI-isAI 2022 Workshop, JURISIN 2022, and JSAI 2022 International Session. Kyoto, Japan; 2022. doi: 10.1007/978-3-031-29168-5_15 [DOI] [Google Scholar]
  • 31.Nonaka N, Fujihira R, Nishio M, Murakami H, Tajima T, Yamada M, et al. End-to-End High-Risk Tackle Detection System for Rugby. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). 2022;3549–58. doi: 10.1109/cvprw56347.2022.00399 [DOI] [Google Scholar]
  • 32.Oh SE, Choi A, Mun JH. Prediction of ground reaction forces during gait based on kinematics and a neural network model. J Biomech. 2013;46(14):2372–80. doi: 10.1016/j.jbiomech.2013.07.036 [DOI] [PubMed] [Google Scholar]
  • 33.Mundt M, Koeppe A, David S, Bamer F, Potthast W, Markert B. Prediction of ground reaction force and joint moments based on optical motion capture data during gait. Med Eng Phys. 2020;86:29–34. doi: 10.1016/j.medengphy.2020.10.001 [DOI] [PubMed] [Google Scholar]
  • 34.Komaris D-S, Perez-Valero E, Jordan L, Barton J, Hennessy L, O’Flynn B, et al. Predicting Three-Dimensional Ground Reaction Forces in Running by Using Artificial Neural Networks and Lower Body Kinematics. IEEE Access. 2019;7:156779–86. doi: 10.1109/access.2019.2949699 [DOI] [Google Scholar]
  • 35.Alcantara RS, Edwards WB, Millet GY, Grabowski AM. Predicting continuous ground reaction forces from accelerometers during uphill and downhill running: a recurrent neural network solution. PeerJ. 2022;10:e12752. doi: 10.7717/peerj.12752 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Cordero-Sánchez J, Bazuelo-Ruiz B, Pérez-Soriano P, Serrancolí G. Comparison of Ground Reaction Forces and Net Joint Moment Predictions: Skeletal Model Versus Artificial Neural Network-Based Approach. J Appl Biomech. 2025;41(4):290–303. doi: 10.1123/jab.2024-0113 [DOI] [PubMed] [Google Scholar]
  • 37.Carter J, Chen X, Cazzola D, Trewartha G, Preatoni E. Consumer-priced wearable sensors combined with deep learning can be used to accurately predict ground reaction forces during various treadmill running conditions. PeerJ. 2024;12:e17896. doi: 10.7717/peerj.17896 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Stitt D, Kabaliuk N, Spriggs N, Henley S, Alexander K, Draper N. Machine learning model to study the rugby head impact in a laboratory setting. PLoS One. 2025;20(1):e0305986. doi: 10.1371/journal.pone.0305986 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Pomarat Z, Passieux J-C, Dufour J-E, Watier B. Machine learning techniques for estimating the individual three-dimensional ground reaction forces during rugby scrummaging. J Biomech. 2025;193:113015. doi: 10.1016/j.jbiomech.2025.113015 [DOI] [PubMed] [Google Scholar]
  • 40.Cazzola D, Trewartha G, Preatoni E. Time-based calibrations of pressure sensors improve the estimation of force signals containing impulsive events. Proc Inst Mech Eng P J Sport Eng Technol. 2014;228(2):147–51. doi: 10.1177/1754337113504397 [DOI] [Google Scholar]
  • 41.Preatoni E, Wallbaum A, Gathercole N, Coombes S, Stokes KA, Trewartha G. An integrated measurement system for analysing impact biomechanics in the rugby scrum. Proceedings of the Institution of Mechanical Engineers, Part P: Journal of Sports Engineering and Technology. 2012;226(3–4):266–73. doi: 10.1177/1754337111435200 [DOI] [Google Scholar]
  • 42.Kingma DP, Ba JL. Adam: A method for stochastic optimization. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings. 2015. 1–15. doi: arXiv:1412.6980ss [Google Scholar]
  • 43.Zhang H, Cisse M, Dauphin YN, Lopez-Paz D. mixup: Beyond Empirical Risk Minimization. 2017. https://arxiv.org/abs/1710.09412 [Google Scholar]
  • 44.Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O. Scikit-learn: Machine learning in Python. J Machine Learning Res. 2011;12:2825–30. [Google Scholar]
  • 45.Dancey C, Reidy J. Statistics without maths for psychology with psychology dictionary. London: Pearson Education, Limited. 2004. [Google Scholar]
  • 46.Croft H, Lamb P, Middlemas S. The application of self-organising maps to performance analysis data in rugby union. International Journal of Performance Analysis in Sport. 2015;15(3):1037–46. doi: 10.1080/24748668.2015.11868849 [DOI] [Google Scholar]
  • 47.Kvasnytsya O, Tyshchenko V, Latyshev M, Kvasnytsya I, Kirsanov M, Plakhotniuk O, et al. Team Performance Indicators That Predict Match Outcome in Rugby Union. Pamukkale J Sport Sci. 2024;15(1):203–16. doi: 10.54141/psbd.1342340 [DOI] [Google Scholar]
  • 48.Passos P, Araújo D, Davids K, Gouveia L, Milho J, Serpa S. Information-governing dynamics of attacker-defender interactions in youth rugby union. J Sports Sci. 2008;26(13):1421–9. doi: 10.1080/02640410802208986 [DOI] [PubMed] [Google Scholar]
  • 49.Thornton HR, Delaney JA, Duthie GM, Dascombe BJ. Importance of Various Training-Load Measures in Injury Incidence of Professional Rugby League Athletes. Int J Sports Physiol Perform. 2017;12(6):819–24. doi: 10.1123/ijspp.2016-0326 [DOI] [PubMed] [Google Scholar]
  • 50.Welch M, Cummins C, Thornton H, King D, Murphy A. Training load prior to injury in professional rugby league players: analysing injury risk with machine learning. In: ISBS Proceedings Archive, 2018. https://commons.nmu.edu/isbs/vol36/iss1/59 [Google Scholar]
  • 51.Chambers RM, Gabbett TJ, Cole MH. Validity of a Microsensor-Based Algorithm for Detecting Scrum Events in Rugby Union. Int J Sports Physiol Perform. 2019;14(2):176–82. doi: 10.1123/ijspp.2018-0222 [DOI] [PubMed] [Google Scholar]
  • 52.Martin Z, Patel A, Hendricks S. Automated tackle injury risk assessment in contact-based sports -- a rugby union example. 2021. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Nashville, TN, USA; 2021. 4589–98. doi: 10.1109/CVPRW53098.2021.00517 [DOI] [Google Scholar]
  • 53.Zhang Z, Gao L, Xiang Y. Application of Optimized BP Neural Network Based on Genetic Algorithm in Rugby Tackle Action Recognition. In: 2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA). Dalian, China; 2020. 95–9. doi: 10.1109/icaica50127.2020.9182597 [DOI] [Google Scholar]
  • 54.Blythman R, Saxena M, Tierney GJ, Richter C, Smolic A, Simms C. Assessment of deep learning pose estimates for sports collision tracking. J Sports Sci. 2022;40(17):1885–900. doi: 10.1080/02640414.2022.2117474 [DOI] [PubMed] [Google Scholar]
  • 55.Lallemand B, Clanet C, Blanchard S, Noriega P, Piscione J, Chaplain O. Peak compression force physics in rugby union scrum. Proceedings. 2020;49(1):151. doi: 10.3390/proceedings2020049151 [DOI] [Google Scholar]
  • 56.Quarrie KL, Wilson BD. Force production in the rugby union scrum. J Sports Sci. 2000;18(4):237–46. doi: 10.1080/026404100364974 [DOI] [PubMed] [Google Scholar]
  • 57.Du Toit DE, Venter DJ, Buys FJ, Olivier PE. Kinetics of Rugby Union scrumming in under 19 schoolboy rugby forwards. SA J Res Sp. 2004;26(2). doi: 10.4314/sajrs.v26i2.25894 [DOI] [Google Scholar]
  • 58.Stetter BJ, Stein T. Machine Learning in Biomechanics: Enhancing Human Movement Analysis. In: Dindorf C, Bartaguiz E, Gassmann F, Fröhlich M, editors. Artificial Intelligence in Sports, Movement, and Health. Springer, Cham. 2024. p. 139–60. doi: 10.1007/978-3-031-67256-9_9 [DOI] [Google Scholar]
  • 59.Cordero-Sánchez J, Bini R, Serrancolí G. Validity of recurrent neural networks to predict pedal forces and lower limb kinetics in cycling. J Biomech. 2025;183:112619. doi: 10.1016/j.jbiomech.2025.112619 [DOI] [PubMed] [Google Scholar]
  • 60.Kammoun A, Ravier P, Buttelli O. Comparison of the accuracy of ground reaction force component estimation between supervised machine learning and deep learning methods using pressure insoles. Sensors. 2024;24(16). doi: 10.3390/s24165318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Zatsiorsky VM. Kinetics of Human Motion. Champaign (IL): Human Kinetics. 2002. 0736037780. [Google Scholar]
  • 62.Green A, Coopoo Y, Tee J, McKinon W. A review of the biomechanical determinants of rugby scrummaging performance. S Afr J Sports Med. 2019;31(1):v31i1a7521. doi: 10.17159/2078-516X/2019/v31i1a7521 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Menychtas D, Petrou N, Kansizoglou I, Giannakou E, Grekidis A, Gasteratos A, et al. Gait analysis comparison between manual marking, 2D pose estimation algorithms, and 3D marker-based system. Front Rehabil Sci. 2023;4:1238134. doi: 10.3389/fresc.2023.1238134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Johnson S, Everingham M. Learning effective human pose estimation from inaccurate annotation. In: CVPR 2011. Colorado Springs, CO, USA; 2011. 1465–72. doi: 10.1109/cvpr.2011.5995318 [DOI] [Google Scholar]

Decision Letter 0

Christopher Kirk

9 Nov 2025

-->PONE-D-25-40825-->-->Prediction of on-field rugby scrummaging contact forces from videos using Artificial Neural Networks.-->-->PLOS ONE

Dear Dr. Serrancolí,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.-->--> -->-->This status of major revision is based on the following requirements being met as a minimum:-->--> -->-->A. Ethical and Editorial Non-Compliance (Fatal Flaws)

1. Severe Violation of the Open Data Policy (PLOS ONE): The authors explicitly state: “The data underlying the results presented in the study are not available. Data are partially owned by a third party and authors do not have permission to share the data.” This violates the fundamental requirement of PLOS ONE that all underlying data be made available without restriction upon publication. The justification of "third-party ownership" is unacceptable for a study seeking to contribute to open science. The reproducibility of the work is, therefore, impossible.

2. Contradiction and Failure in the Ethics Statement: The manuscript marks "N/A" in the Ethics Statement section, implicitly claiming not to involve human participants. However, the Methods section confirms the use of data from 176 elite players with written informed consent and approval from the University of Bath Ethics Committee. The omission or deliberately incorrect statement of ethical approval is a serious breach of editorial integrity.

-->-->If the above requirements are not met in the revised and resubmitted work, then this paper will be rejected without further review.-->--> -->-->Please also address the other methodological and reporting comments provided by all reviewers.-->-->

Please submit your revised manuscript by Dec 24 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:-->

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Christopher Kirk, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf   and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS One has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. We note that you have indicated that there are restrictions to data sharing for this study. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

Before we proceed with your manuscript, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., a Research Ethics Committee or Institutional Review Board, etc.). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of recommended repositories, please see

https://journals.plos.org/plosone/s/recommended-repositories. You also have the option of uploading the data as Supporting Information files, but we would recommend depositing data directly to a data repository if possible.

We will update your Data Availability statement on your behalf to reflect the information you provide.

4. If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

-->Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. -->

Reviewer #1: Partly

Reviewer #2: Yes

Reviewer #3: Yes

**********

-->2. Has the statistical analysis been performed appropriately and rigorously? -->

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

-->3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.-->

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

**********

-->4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.-->

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

-->5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)-->

Reviewer #1: This research implementing machine learning technique to predict the injury risk in the high-risk sports, rugby, to provide further information for team staff to avoid injury risk. I have several comments as below:

1. From line 55-71, authors describe the traditional data collection shall be improve by implementing wearable technology (WT) include GPS and IMU. However, it seems that this article tried to improve the prediction performance by integrating video-analysis techniques. I suggest to revise this paragraph to enhance the importance of implementing video-based analysis instead of highlights the WT importance.

2. From line 80, authors raise the predict target GRF. However, it shall be clarified that GRF is stand for “Ground Contact Forces” or “Ground Reaction Forces.”

3. On line 56-59, authors illustrate that the traditional methods “miss forces transferred through neck and head…..” However, from my point of view, this research also used the sensor installed on player’s shoulder. How does this work is different from other research?

4. On line 97-111, dataset shall be describe as “how many total dataset had been detected.” That is, how many scrummaging had been performance by the 176 elite rugby players to produce the numbers of data collected.

5. Continued with No.4, if the dataset had been modified, what is the reason for implementing GAN to increase the data size?

6. The variables used in this research shall be identified. This research aims to predict the injury risk. However, using of variables represents “injury risk” is not clear enough.

Reviewer #2: Strengths

The study effectively addresses a genuine gap by employing machine learning to quantify rugby scrummaging forces in real-world settings. The dataset is both large and of high quality, encompassing 11 elite teams, thereby lending substantial weight to the analysis and results. The application of GAN and Mixup augmentation, coupled with meticulous hyperparameter tuning, reflects a thoughtful and contemporary approach to model development. The reported results are robust, exhibiting high correlation and relatively low error margins. The breakdown by technique and position is particularly valuable and enhances the study's credibility. The practical implications for non-invasive player monitoring and injury prevention are well-articulated and potentially impactful.

Weaknesses and Suggestions for Improvement

Methods detail and replicability

The description of manual labeling of C7 and lumbar landmarks is insufficient. It is essential to specify the number of raters involved, whether inter- or intra-rater reliability was assessed, and the software utilized, as manual labeling is susceptible to error. Data cleaning is mentioned, but the number of trials excluded and the justification for the chosen thresholds are not reported. Without this information, it is challenging to assess whether exclusions biased the dataset. The GAN and Mixup approaches are introduced but not described in sufficient detail for replication. A clearer specification of the GAN architecture and training procedure, as well as the parameters used in Mixup, is necessary.

Limitations of 2D data

Although the limitation of relying on a single top-view camera is acknowledged, this point warrants more explicit discussion. Important postural information is inevitably lost in 2D, which could directly account for some of the observed prediction variability.

Prediction variability

The Bland–Altman plots demonstrate wide limits of agreement (approximately ±600 N). This is not trivial, particularly when considering injury risk thresholds. While the average error is low, a clearer discussion of the practical implications of such variability would benefit readers—for instance, whether the tool is more suited for monitoring trends across groups rather than making judgments on individual events.

Automated pose estimation

The manuscript suggests eventual automation via pose estimation, yet the challenges of this transition are not discussed. It is important to comment on the likely decrease in accuracy when transitioning from carefully labeled manual landmarks to automatically detected ones, and how this issue might be addressed.

Generalizability

The dataset utilized in this study is derived from elite male rugby teams. It is pertinent to briefly consider the extent to which the model may be applicable to other cohorts, such as youth, amateur, or female players, where variations in anthropometry and technique are evident.

Presentation Issues

The tables and figures are generally well-presented; however, it is advisable to include units and sample sizes in the captions where relevant. Additionally, Table 2 appears to have a formatting issue in the A3-R row, which requires correction. It may also be beneficial to incorporate a small set of example traces (best, average, worst prediction) to demonstrate the model's performance under varying conditions.

Recommendation

Overall, this manuscript is robust and offers a significant contribution to the field of applied sports biomechanics. The novelty, quality of the dataset, and clarity of presentation are commendable.

Nonetheless, prior to publication, I recommend minor revisions to enhance the reproducibility of the methods and to provide a more nuanced discussion of the study's limitations and implications.

Reviewer #3: Critical Review Letter – Manuscript PONE-D-25-40825

Title: “Prediction of on-field rugby scrummaging contact forces from videos using Artificial Neural Networks”.

Dear Researchers,

Thank you for the invitation to review this manuscript. The study presents an innovative proposal of high clinical and sporting relevance: predicting contact forces in the rugby scrum using only video kinematics and Recurrent Neural Networks (RNNs). The potential for cervical injury prevention is promising. However, after a detailed analysis of the submission, I have identified critical issues and severe ethical and editorial failures that fundamentally compromise the validity, reproducibility, and compliance of the work.

Final Recommendation: Rejection with the possibility of resubmission only after substantial revision and proof of compliance with editorial policies.

The severity of the flaws is categorized below:

A. Ethical and Editorial Non-Compliance (Fatal Flaws)

1. Severe Violation of the Open Data Policy (PLOS ONE): The authors explicitly state: “The data underlying the results presented in the study are not available. Data are partially owned by a third party and authors do not have permission to share the data.” This violates the fundamental requirement of PLOS ONE that all underlying data be made available without restriction upon publication. The justification of "third-party ownership" is unacceptable for a study seeking to contribute to open science. The reproducibility of the work is, therefore, impossible.

2. Contradiction and Failure in the Ethics Statement: The manuscript marks "N/A" in the Ethics Statement section, implicitly claiming not to involve human participants. However, the Methods section confirms the use of data from 176 elite players with written informed consent and approval from the University of Bath Ethics Committee. The omission or deliberately incorrect statement of ethical approval is a serious breach of editorial integrity.

B. Methodological and Technical Flaws (ML and Biomechanics)

3. Questionable Data Augmentation Methodology (GAN/Mixup): The use of a Generative Adversarial Network (GAN) to create 1,000 synthetic force signals in a safety context has not been adequately justified in terms of physical validity. The high statistical correlation between generated and measured data (R=0.99) does not guarantee that the synthetic forces respect the biomechanical laws of a human collision. The risk of the trained model learning synthetic artifacts from the GAN, instead of real patterns, is high and dangerous for an application aimed at preventing injuries.

4. Non-Scalable Methodology: The model relies on manual 2D labeling of the C7 and Lumbar landmarks. This makes real-time, on-field (in-game) application unfeasible and burdensome. The authors do not provide: a) A labeling protocol, b) Inter-rater reliability metrics (essential for validating the kinematic input), or c) A validated automated pose estimation model to replace manual labeling.

5. Clinically Unacceptable Variability in Results: Despite the low normalized RMSE (9.3%), the Bland-Altman analysis reveals extreme Limits of Agreement (–530 N to +640 N). In a high-energy collision scenario, this margin of error in peak force prediction is clinically unacceptable and can mask the difference between a safe scrum and one with a high risk of cervical injury.

6. Omission of Critical Biomechanical Covariates: The model ignores the individual body mass of the players as an input variable. Contact forces in a collision are a direct function of mass and acceleration (F=ma). The exclusion of this fundamental biomechanical variable compromises the model's robustness and is a likely cause of the high variability observed in the Bland-Altman analysis (Point 5).

C. Presentation and References

7. Exaggerated Conclusion: The claim that the method "could revolutionize player monitoring" is highly speculative. The unacceptable clinical variability (± 640 N), the failure of scalability (manual labeling), and the dependence on non-sharable data prevent the work from being considered a transformative tool in its current state.

8. References (Requires Review): Excessive self-citation (20% of the total) and the use of conference proceedings (e.g., Pomarat et al., [39]) to substantiate methodological arguments were identified, which weakens the theoretical basis of the submission.

Summary: The study has significant intellectual potential but is irremediably compromised by ethical failures (Data and Consent) and methodological flaws (Scalability and Clinical Validity of Error). Rejection is the only possible recommendation.

**********

-->6. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .-->

Reviewer #1: No

Reviewer #2: Yes: Dr. SUKUMARAN, C

Reviewer #3: Yes: João Carlos Alves Bueno

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

To ensure your figures meet our technical requirements, please review our figure guidelines: https://journals.plos.org/plosone/s/figures

You may also use PLOS’s free figure tool, NAAS, to help you prepare publication quality figures: https://journals.plos.org/plosone/s/figures#loc-tools-for-figure-preparation.

NAAS will assess whether your figures meet our technical requirements by comparing each figure against our figure specifications.

Attachment

Submitted filename: Letter.pdf

pone.0330097.s002.pdf (111.4KB, pdf)
PLoS One. 2026 Apr 8;21(4):e0330097. doi: 10.1371/journal.pone.0330097.r002

Author response to Decision Letter 1


13 Jan 2026

We thank the reviewers for providing feedback on our paper. We trust that changes resulting from your comments have improved the quality of this manuscript. All comments have been addressed within the file Response to Reviewers. Changes to the manuscript have been highlighted in red in the Manuscript with Track Changes.

Attachment

Submitted filename: Reviewer_comments.docx

pone.0330097.s004.docx (47.5KB, docx)

Decision Letter 1

Christopher Kirk

24 Feb 2026

Prediction of on-field rugby scrummaging contact forces from videos using Artificial Neural Networks.

PONE-D-25-40825R1

Dear Dr. Serrancolí,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager®  and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support .

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards and congratulations,

Christopher Kirk, PhD

Academic Editor

PLOS One

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

-->Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.-->

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

-->2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. -->

Reviewer #1: Yes

Reviewer #2: Yes

**********

-->3. Has the statistical analysis been performed appropriately and rigorously? -->

Reviewer #1: Yes

Reviewer #2: Yes

**********

-->4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.-->

Reviewer #1: Yes

Reviewer #2: Yes

**********

-->5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.-->

Reviewer #1: Yes

Reviewer #2: Yes

**********

-->6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)-->

Reviewer #1: The authors had appropriately response my previous comments. I have no further comments and leave the decision back to editorial office.

Reviewer #2: The authors have adequately addressed all comments raised in the previous review. The revised manuscript provides sufficient methodological detail to ensure reproducibility, including clear descriptions of manual labeling, data cleaning, and data augmentation procedures. The discussion of limitations related to 2D data, prediction variability, automated pose estimation, and generalizability has been appropriately expanded and balanced.

The previously noted formatting issue in Table 2 has been corrected, and the overall presentation is clear. No substantive concerns remain, and the manuscript is suitable for publication.

**********

-->7. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .-->

Reviewer #1: No

Reviewer #2: No

**********

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Matrix of measured scrum trials showing what engagement technique (CTS: blue, PreBind: orange, CTPE: green) and player´s shoulder each trial comes from.

    It shows both the engagement technique (CTS: blue, PreBind: orange, CTPE: green) and the players’ shoulders (A1, A2, A3; L: left, R: right. See Fig 1) for each measured trial that makes up the RNN test dataset. Trial numbers (upper left) correspond to the same trials shown in Fig 4.

    (EPS)

    pone.0330097.s001.eps (5.3MB, eps)
    Attachment

    Submitted filename: Letter.pdf

    pone.0330097.s002.pdf (111.4KB, pdf)
    Attachment

    Submitted filename: Reviewer_comments.docx

    pone.0330097.s004.docx (47.5KB, docx)

    Data Availability Statement

    Data will be available at https://doi.org/10.5281/zenodo.18048888.


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES