Abstract
This study aimed to verify and interpret a model for predicting the number of home runs per year using sensor data from professional baseball players during batting practice. A machine learning model was constructed using Random Forest from the bat kinematics and bat mass data of 41 professional baseball players collected by a bat-mounted sensor. Partial Dependence analysis and Feature Importance analysis by SHAP (SHapley Additive exPlanations) were used to explain the model’s predictions. The predictive model showed that the bat speed, bat mass, and rotational acceleration are particularly important. The results indicated that a bat speed of 33.3 m/s and rotational acceleration exceeding 157 m/s2 exhibited a trend toward a rapid increase in the number of predicted home runs per year. The mass of the bat suggests that an optimum value exists at 0.91 kg. These results suggest that batters who are expected to hit a large number of home runs each year increase the acceleration at the beginning of their swing to produce high bat speed in a short period of time and achieve bat speeds of 33.3 m/s or more with a bat that is somewhat heavier.
Keywords: Biomechanics, Bat speed, Bat mass, Random forest
Subject terms: Engineering, Applied physics
Introduction
In baseball, the key to victory is to score as many runs as possible. Home runs are expected to contribute greatly to victory, as multiple runs can be scored at once. In addition, home runs covering substantial distances, which are hit in a large arc, are one of the most exciting aspects of baseball and attract spectators. A home run is a longer distance hit compared to a non-home run hit, and the dynamic factors that determine this distance are determined by three parameters: the initial velocity, launch angle, and angular velocity of the batted ball1,2. In addition, these parameters are defined by the following four factors3: (1) pitching characteristics that can be quantified by the translational and angular velocities of the pitched ball just before impact, (2) impact parameters that represent the positional relationship between the ball and bat at impact, (3) translational and angular velocities of the bat just before ball impact, and (4) dynamic characteristics of the bat, which are expressed by mass, moment of inertia, material characteristics, etc. Among these, data related to the mass of the bat used for measurement by individual players and the swing characteristics of individual players can now be easily measured in large quantities at the field level, as advances in sensor technology have made sensors smaller and less expensive. Wireless inertial measurement unit (IMU) sensors that can be attached to the knob of a bat have been developed, and several manufacturers have commercialized IMUs that measure the speed and trajectory (swing angle) of the bat4. They can record the length and mass of the bat at the time of measurement. However, it is not clear how these parameters relate to the actual hitting performance in the game, such as the number of home runs per year and the contribution of each parameter. By clarifying the relationship between biomechanical parameters in the practice environment and the number of home runs per year, the parameters that players should focus on will become clear. This information will be very useful as players aim to strengthen their skills. For coaches, the advanced prediction of the number of home runs per year can lead to rational tactical construction and coaching. If an advance prediction model of the number of home runs per year could be created, it would be possible to suggest effective training programs, such as encouraging strength training for players predicted to hit more home runs and those who are predicted to hit fewer home runs to focus more on ball contact and spend more time on base running and defense. In particular, it could be used in terms of early assessment of player development policies for young players and newcomers with no past performance.
Traditionally, sports science research has been dominated by studies on sports biomechanics and sports physiology; however, recently, efforts have also been made to incorporate machine learning, which has a high predictive performance for large amounts of data. Tree ensemble methods, such as random forests (RF), have been successfully used in biomechanical research owing to their robustness5,6. Therefore, it would be useful to use machine learning methods to predict the number of home runs per year based on biomechanical parameters in a practice environment.
Nevertheless, such tree ensemble algorithms can be more difficult in interpretation, despite their potentially satisfactory prediction performance. This drawback can be mitigated using model interpretation techniques, such as explainable artificial intelligence (XAI), thus providing insights into algorithm predictions7. However, it has not been applied to kinematic data pertaining to the hitting motion in baseball. Thus, this study aimed to verify a model for predicting the annual number of home runs using sensor data from professional baseball players and interpret it by applying XAI.
In this study, a hypothesis for the research purpose is designed based on the findings of several previous studies. In a study on swing analysis using a high-speed camera for batting practice by college players3, a multiple regression equation was presented in which the distance of the batted ball was the objective variable and two variables, bat head speed and the angle between the velocity vector of the bat head and the horizontal line, were the explanatory variables. The contribution ratio was 52%, and the standardized partial regression coefficients were 0.64 (bat head speed) and 0.31 (angle of the bat swing), respectively. In addition, to increase the velocity of the batted ball, it is necessary to increase not only the bat speed but also the equivalent mass at the hitting position8. The equivalent mass at the hit position was large because of the larger mass and moment of inertia of the bat. Thus, it was hypothesized that the home-run prediction model based on data acquired by the IMU sensors made significant contributions to the bat speed and bat mass.
Results
Comparison of measured and predicted values
The coefficient of determination (
and root mean square error (RMSE) values of the trained RF, multiple regression analysis, and baseline models (a mean value model that predicts the mean value of the objective variable in the training data) were calculated (Table 1).
of the training data in RF is 0.65 and
of the test data in RF is 0.56
.
of the trained RF was the highest, and the RMSE was the lowest (RMSE = 5.30
).
Table 1.
And RMSE of the trained RF, gradient boosting (GB), multiple regression analysis, And baseline models.
| Model |
|
RMSE |
|---|---|---|
| RF | 0.56 ± 0.08 | 5.30 ± 0.51 |
| GB | 0.49 ± 0.04 | 5.68 ± 0.26 |
| Multiple Linear Regression | 0.52 ± 0.05 | 5.54 ± 0.30 |
| Base-line model | - | 8.27 |
Interpretation by XAI
The mean importance of SHAP for the model is shown in Fig. 1. The feature of bat speed appears to be the most important parameter for prediction. The mass of the bat was the second most important factor, and the rotational acceleration was the third most important. From this point, the focus is on these three parameters. Similar to the SHAP, Permutation Feature Importance showed similar results (Fig. 2). Figures 3, 4 and 5 show the mean relationship between the explanatory variables and the predicted values when replacing the explanatory variables (partial dependence: PD) and the relationship between the explanatory variables for each instance and the predicted values when replacing the explanatory variables (Individual Conditional Expectation: ICE). PD showed that the predicted number of home runs per year increased rapidly at bat speeds above 33.3 m/s (Fig. 3). The PD results for bat mass (Fig. 4) showed that the optimum value is 0.91 kg. The PD results for the rotational acceleration (Fig. 5) show that a rotational acceleration of over 157 m/s2 leads to a sharp increase in the predicted number of home runs per year. Figure 6 shows the results of the SHapley Additive exPlanations (SHAP) value, which is an index indicating the effect of features on the prediction model.
Fig. 1.
Mean importance of SHAP for the model. The horizontal axis indicates the mean importance of SHAP, and the vertical axis indicates each explanatory variable.
Fig. 2.
Permutation Feature Importance. The horizontal axis indicates difference in prediction accuracy due to Permutation Feature Importance, and the vertical axis indicates each explanatory variable.
Fig. 3.
Result of the PD and ICE for bat speed. The horizontal axis represents the bat speed, and the vertical axis represents the predicted number of home runs per year. Dashed lines in the graph indicate the expected values. The thick blue line shows the mean relationship between the explanatory variables and the predicted values when the explanatory variables are replaced, and the blue line shows the relationship between the explanatory variables for each instance and the predicted values when the explanatory variables are replaced. The numbers on blue lines mean each instance.
Fig. 4.
Result of the PD and ICE for the mass of the bat. The horizontal axis shows the mass of the bat and the vertical axis shows the predicted number of home runs per year. Dashed lines in the graph indicate the expected values. The thick blue line shows the mean relationship between the explanatory variables and the predicted values when the explanatory variables are replaced, and the blue line shows the relationship between the explanatory variables for each instance and the predicted values when the explanatory variables are replaced. The numbers on blue lines mean each instance.
Fig. 5.
Result of the PD and ICE for rotational acceleration. The horizontal axis represents rotational acceleration, and the vertical axis represents the predicted number of home runs per year. Dashed lines in the graph indicate the expected values. The thick blue line shows the mean relationship between the explanatory variables and the predicted values when the explanatory variables are replaced, and the blue line shows the relationship between the explanatory variables for each instance and the predicted values when the explanatory variables are replaced. The numbers on blue lines mean each instance.
Fig. 6.
SHAP beeswarm plots. The vertical axis represents each explanatory variable and the horizontal axis represents the SHAP value, which is an index showing the effect of features on the prediction model. Each point represents an individual dataset.
Discussion
The purpose of this study is to verify a model for predicting the number of home runs per year using sensor data of professional baseball players and interpret the model by applying XAI to batting practice, which is a practice environment for baseball. Comparing the measured and predicted number of home runs per year in the training and test data obtained from the trained RF, the
of the training data was 0.65 and that of the test data was 0.56
, indicating that the model was a good fit despite some overfitting. The
and RMSE of the trained RF, multiple regression analysis, and baseline models were calculated. The
of the trained RF was the highest, and the RMSE was the lowest (RMSE = 5.30
), indicating that it can predict the number of home runs per year more accurately than conventional methods. There were two factors that made it possible to predict the game result with high accuracy from the practice data: First, the data from the practice environment were the flying ball hitting condition by batting practice pitchers. Koenig et al. (2004)9 compared the batting motion of various bats under tee-hitting conditions with that of hitting a pitched ball. The results showed that most participants had less bat speed when hitting a pitching ball than when tee-hitting condition, and the degree of speed loss differed between participants. If the practice data were obtained under the tee-hitting condition, the swing would be overestimated and it would be difficult to reproduce the model with high accuracy. The second is the number of data. In this study, all swing observations were treated as independent. On the other hand, when the mean of ten swings per player was used in the analysis, the accuracy of the model was greatly reduced (The
of the trained RF was 0.22). The results suggest that all observed swings were considered independent, and that a significant increase in the amount of data contributed to the improved accuracy. In the model used for predicting the number of home runs per year, bat speed was the most important feature, followed by bat mass and rotational acceleration. This result supports the hypothesis that bat speed and mass are significant contributors to home-run prediction using the model.
The PD results showed a sharp increase in the number of predicted home runs per year starting at bat speeds above 33.3 m/s. To validate this result, we estimated the conditions necessary to hit a home run at 100 m, according to the laws of physics. Once a ball becomes ballistic, that is, once it is released by its launching mechanism, only gravitational and aerodynamic forces act on the ball10. The vector equation of motion that models the center-of-mass trajectory is
![]() |
1 |
where m is the mass of the ball (0.14 kg),
is the gravitational acceleration vector,
is the lift force vector,
is the drag force vector,
is the side force vector, and
is the position vector of the center of mass of the ball. L was assumed to be perpendicular to both the angular and translational velocity vectors and was characterized in terms of a dimensionless lift coefficient
:
![]() |
2 |
where
is the mass density of air,
is the cross-sectional area of the ball,
is the angular velocity vector of the ball. Similarly, drag
is characterized by the drag coefficient
, and acts in the direction opposite to that of the translational velocity vector:
![]() |
3 |
The side force
, is characterized by a third dimensionless side force coefficient,
:
![]() |
4 |
The cross-product term in Eq. (4) ensures that
is perpendicular to
and
. The lift, drag and side-force coefficients were calculated as a function of the spin parameter
depending on the azimuth angle of the spin axis. Using Eqs. (1)–(4), the ball position during flight was calculated by integration.
Here, as an initial condition for estimating the flight distance, the kinematic data of a mean flight distance of 88 m hit by a university baseball player to left field were used11; the initial launch angle was 33.2°, and the initial angular velocity was 225 rad/s. The spin axis of the batted ball had an elevation angle of 13.3° and azimuth angle of 15.4°. The initial height of the batted ball was 1.0 m. Under these conditions, the calculations began with a batted ball speed of 37.5 m/s, and by increasing it by 0.28 m/s each time, the batted ball speed that resulted in a distance of over 100 m for the first time was calculated. As a result, it was calculated that a batted ball velocity of 40.8 m/s (
) or more would be required to hit a home run with a distance of 100 m (Fig. 7). The lift, drag, and side force coefficients refer to values from previous studies12. In addition, a previous study13 investigated the relationship between batted ball speed and bat head speed, and the regression equation (y = 0.865x + 6.425; y: batted ball speed, x: bat head speed) shows that to achieve a batted ball speed of 40.8 m/s, a bat head speed of about 39.7 m/s is required. The IMU (BLAST BASEBALL) fixed to the bat in this study calculates the speed at a position 0.15 m from the head (the distal end of the bat) toward the grip (the proximal end)4. Therefore, assuming that 0.28 m/s occurs for every 0.01 m, BLAST BASEBALL calculates a value that is approximately 4.2 m/s less than the bat head speed. As a result, if the bat head speed is 39.7 m/s, the BLAST bat speed required to hit a home run at a distance of 100 m is expected to be approximately 35.5 m/s. This result is similar to the finding in this study that a bat speed of over 33.3 m/s leads to a sharp increase in the predicted number of home runs per year (the difference is approximately 6.1%), and indicates that a reasonable relationship can be derived from the results obtained using machine learning.
Fig. 7.

Ballistic simulation of different batted ball velocity. The x-axis represents the direction of travel of the ball, the y-axis represents the left-right direction, and the z-axis represents the ball height. The red circle is the result of a homerun with a distance of 100 m at a batted ball velocity of 40.8 m/s (
). The black circle is the result of a distance of 89 m at a batted ball velocity of 37.5 m/s (
). All other conditions were identical.
The PD results for the bat mass showed that the optimal value is 0.91 kg. A previous study using bats with large and small masses14 reported that the bat speed decreased with a large-mass bat. By contrast, long-distance hitters who hit many home runs per year may expect to improve their flying distance by increasing the apparent mass of the bat at the ball impact point, which is the mass of the bat at the ball impact position, even if the increase in mass leads to a decrease in operability and bat speed immediately before impact. The optimal value for these factors is 0.91 kg.
The PD results for rotational acceleration showed that a rotational acceleration of over 157 m/s2 leads to a sharp increase in the predicted number of home runs per year. Koike and Mimura15 performed a dynamic analysis of the bat speed movement and showed that the contribution of the joint torque term was largest in the first half of the swing. In other words, rotational acceleration in the first half of the movement may be related to the output of the joint torque generated by muscle tension, and a long-distance hitter is expected to have a large rotational acceleration value. In baseball batting, it is necessary to predict the type of ball and its trajectory, instantly decide whether to swing, and swing accurately to the desired location within as short a swing time as possible. Even if the two batters have the same bat speed at impact, if the bat accelerates faster at the beginning of the swing, the batter will be able to achieve impact with the ball in a shorter time, allowing more time to predict the ball trajectory and make swing decisions, which is an advantage in batting motion. For these reasons, the importance of rotational acceleration is considered to be high in the home-run number prediction model.
Limitation
In this study, data from different players in different years were used as training and test data, which showed a certain generalization performance. However, the objective variable in this study is the number of home runs per year, an indicator that includes not only a player’s kinematic swing characteristics but also the effects of other factors such as ball-and-bat contact skills and ballpark size. In addition, home-run totals are heavily influenced by both the number of at-bats and the quality of opposing pitchers. For example, skill differences in opposing pitchers in different leagues or bias in the types of batters (e.g. power vs. contact hitters) sampled and trained could lead to overfitting. Therefore, it is necessary to verify whether a similar generalization performance is demonstrated for data on players from different leagues, age groups, competitive levels and non-Japanese professional baseball players. Also, future study would need to conduct longitudinal studies and incorporate game-contextual variables (pitch type, opponent, game situation).
Methods
Experimental design
Forty-one male Japanese professional baseball players (age: 26 ± 4 years; weight: 85 ± 10 kg; height: 1.81 ± 0.06 m) participated in the experiment. All the participants provided informed consent before participating in the study. The experimental procedure was approved by the Ethics Committee of Mizuno Corporation (Approval ID: 20241203-312). The players gave consent for the publication of anonymized results. All methods were performed in accordance with the relevant guidelines and regulations. For practical data collection in the field, rather than in a laboratory environment, data were collected using batting practices thrown by batting practice pitchers during professional baseball spring camp practice. Batting practice pitchers threw a four-seam fastball. Thirty-one professional baseball players were assessed in the spring camp of 2023, and 10 players were assessed in the spring camp of 2024. After warming up according to their normal routine, each batter performed a batting motion using a bat fitted with an IMU. We analyzed the swing data for each player over ten trials.
Recording movement
An IMU (BLAST BASEBALL, BLAST Motion, USA) was attached to the bat knob. The version of the application software was 5.14 in Japan. The participants used wooden bats (mass:0.88
kg length: 0.85
m) that were normally used by players. The IMU was fixed to the bat using a special attachment provided by the manufacturer. The bat mass information was input into a dedicated application software downloaded to the tablet, and the bat swing was measured using the software. The accuracy of the bat speed and attack angle was confirmed through a comparison with the results obtained using an optical motion capture system from a previous study4. A strong correlation existed between the bat speed obtained using BLAST BASEBALL and that obtained using the motion capture system (r = 0.91, ICC = 0.67). A moderate correlation existed between the attack angle obtained by BLAST BASEBALL and that obtained using the motion capture system (r = 0.59, ICC = 0.45). 40% of the participants in the previous study were professional baseball players, an elite group, and that the data included bat speeds in the high velocity range of nearly 35 m/s. Based on this result, it was concluded that BLAST BASEBALL is applicable to the data in this study of professional baseball players.
Pre-processing
Ball impact is determined when the sensor “trigger” measured on the accelerometer and gyroscope exceeds specific thresholds for impact. The “impact” index is the sample before (2 ms) the sensor registers this shock. Five features were extracted from the IMU data.
The bat speed in BLAST BASEBALL was calculated from the displacement of the coordinates of a position 0.15 m away from the bat head in the direction of the knob at impact.
In addition, speed was calculated as the magnitude of the velocity perpendicular to the direction of the bat. 2. Mass of bats used by each player. 3. The attack angle was calculated as the angle between the velocity vector 0.15 m away from the bat head in the direction of the knob and the horizontal plane. 4. The on-plane efficiency is the ratio between the angular velocity in the direction of contact and the total angular velocity of the bat (excluding the bat roll), which is expressed as a percentage. 5. Rotational acceleration is the acceleration of the bat measured from the point at which the knob first begins to accelerate downwards over a predetermined window. In other words, rotational acceleration is acceleration, not angular acceleration. This metric measures how quickly a player initiates rotation of the swing. This representation (rotational acceleration) is the notation on the IMU application (BLAST BASEBALL) used in this study. All five features were used simultaneously as input in the RF model.
Of these five features, the three mechanical factors that directly affect flight distance are bat speed, which has a significant impact on batted ball speed; bat mass, which is the inertial characteristics of the bat; and attack angle, which is related to the launch angle. On the other hand, to increase the number of home runs per year, which is the objective variable, it was considered that features reflecting not only the mechanical factor that increases flight distance but also the skill to handle all types of pitches multiple times would be necessary. On-plane efficiency, which represents the percentage of the trajectory in motion on a single plane during the swing, is assumed to be related to the rate of contact between the ball and the bat at all points during the swing and its stability. In addition, a larger value of rotational acceleration may be advantageous in dealing with all types of pitches, as the same bat speed may result in a shorter time to impact. Based on the above, on-plane efficiency and rotational acceleration were added to the features as they are expected to contribute to improving the prediction performance of the machine learning model.
The mean bat speed was calculated for each player, and data that deviated from the mean by more than 2.78 m/s were excluded as outliers. The number of home runs in the first team’s seasonal performance was used as the objective variable. The annual results for each player were extracted from the Nippon Professional Baseball Organization website16. Since RF is a method that does not require standardization of features, the value of the number of home runs per year was used as is. The average number of home runs was 3.0
6.1.
Machine learning with random forest
RF were used to regress the number of home runs per year. This algorithm combines many decision trees grown in random subsets selected by bootstrap aggregation from the feature space7,17. Human swing motion is expected to include complex interactions between features (such as the relationship between bat speed and mass of bat) that are difficult to capture with linear regression, and a nonlinear analysis method such as RF is expected to be appropriate. In addition, the RF has been shown to provide a higher accuracy than other existing Machine Learning with a faster computation speed5. Also, RF have been successfully used in biomechanical research because of their robustness5,6. From these reasons, the nonlinear method RF was applied in this study. Model results validation was checked by splitting the data into training (90%) and testing (10%) datasets. A total of 295 swings from 31 players measured at the spring training camp in 2023 were used as the training data, and 30 swings from another 10 players measured at the spring training camp in 2024 were used as the test data. In the test data, three trials were randomly selected for each player, and the model was constructed. This was done a total of 10 times, and the mean and standard deviation of
and RMSE of the model was calculated. Hyperparameter tuning was performed, and there were ten decision trees and the maximum depth was five. To compare the performances of the models, we performed a gradient boosting (GB) and a multiple regression analysis using the same dataset. In GB, hyperparameter tuning was performed, and there were fifty decision trees and the maximum depth was five. In multiple regression analysis, explanatory variables were standardized. Similarly, we calculated the RMSE of the mean-value model that predicted the mean value of the objective variable in the training data as a baseline model. All models were built in Python 3.8 with scikit-learn 1.3.0.
Interpretation by XAI
To interpret the predictions of the adopted RF model, we performed the following three analyses18. 1.PD: PD is a method for visualizing the mean predicted value of each instance by varying only the level of a certain feature while fixing the other features. 2. ICE: Unlike PD, which looks at the mean relationship between features and objective variables, ICE is a method for checking the relationship between features and predicted values by varying the level of a certain feature for each instance while fixing the other features19. 3.SHAP: SHAP is an application of the Shapley Value, an evaluation method in cooperative game theory for fairly distributing the gains obtained through cooperation among multiple players. SHAP replaces the players in the Shapley Value of cooperative game theory with features and decomposes the difference between the “prediction value for a certain instance” and the “mean prediction value” into the contribution of each feature. The mean importance of each feature was visualized using SHAP. In addition, SHAP beeswarm plots are more complex and information-rich displays of SHAP values that reveal not only the relative importance of features but also their actual relationships with the predicted outcome20. Thus, SHAP beeswarm plots were created. As with SHAP, the importance of features by Permutation Feature Importance18 was calculated to assess the robustness of the results.
Author contributions
S.S. participated in the study design and contributed to data analysis, interpretation of results, and writing of the manuscript. Y.K. contributed to data analysis and interpretation of results. Y.Y. contributed to data analysis and interpretation of results. K.I. contributed to data analysis and interpretation of results. T.N. participated in the study design and contributed to data collection; T.I. participated in the study design and contributed to data collection; Y.T. participated in the study design and contributed to data collection; and Y.Y. participated in the study design and contributed to data collection. All the authors have read and approved the final version of the manuscript.
Data availability
The anonymized datasets generated and/or analyzed in the current study are available from the corresponding author upon reasonable request.
Declarations
Competing interests
The first, second, third, and fourth authors are affiliated with companies that sell products, creating conflicts of interest between the public and private researchers. The rest authors have no competing interest.
Footnotes
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Watts, R. G. & Baroni, S. Baseball-bat collisions and the resulting trajectories of spinning balls. Am. J. Phys.57, 40–45 (1989). [Google Scholar]
- 2.Adair, R. K. The Physics of Baseball, 3rd edition 16–22, (2002).
- 3.Kidokoro, S., Wakahara, T. & Yanai, T. Characteristics of swing affecting flight distance and kinetic energy of the ball immediately after impact in baseball batting (in Japanese). Jpn J. Biomech. Sports Exerc.15, 78–86 (2011). [Google Scholar]
- 4.Morishita, Y. & Jinji, T. Accuracy and error trends of commercially available Bat swing sensors in baseball. Sports (Basel). 10, 21 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ahamed, N. U. et al. Using wearable sensors to classify subject-specific running Biomechanical gait patterns based on changes in environmental weather conditions. PLOS One. 13, e0203839 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ahamed, N. U. et al. Subject-specific and group-based running pattern classification using a single wearable sensor. J. Biomech.84, 227–233 (2019). [DOI] [PubMed] [Google Scholar]
- 7.Chalitsios, C., Nikodelis, T., Mavrommatis, G. & Kollias, I. Subject-specific sensitivity of several Biomechanical features to fatigue during an exhaustive treadmill run. Sci. Rep.14, 1004 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Koike, S., Tabuchi, N., Yoshida, Y. & Nagao, H. Development of a Bat select system (in Japanese). Symp. Sports Engineering: Symp. Hum. Dynamics. 2009, 218–223 (2009). [Google Scholar]
- 9.Koenig, K., Mitchell, N. D., Hannigan, T. E. & Clutter, J. K. The influence of moment of inertia on baseball/softball Bat swing speed. Sports Eng. 7, 105–117 (2004).
- 10.Alaways, L. W. & Hubbard, M. Experimental determination of baseball spin and lift. J. Sports Sci.19, 349–358 (2001). [DOI] [PubMed] [Google Scholar]
- 11.Nakashima, H., Horiuchi, G. & Sakurai, S. Three-dimensional batted ball in baseball: effect of ball spin on the flight distance. Sports Eng.21, 493–499 (2018). [Google Scholar]
- 12.Sakamoto, S., Hasegawa, J., Tada, T., Naruo, T. & Mizota, T. Wind tunnel testing of new ball, U.S and Japanese baseball balls (in Japanese). Symposium on sports engineering: symposium on human dynamics C-20,1–4 (2011).
- 13.Kidokoro, S. & Yanai, T. The characteristics of Mishit in baseball hitting (in Japanese). Jpn J. Biomech. Sports Exerc.21, 52–64 (2017). [Google Scholar]
- 14.Urata, T., Nakai, A., Nakamura, C. & Ito, A. Effect of the practice swing with different Bats in mass on Bat swing speed (in Japanese). Osaka Res. J. Phys. Educ.50, 47–52 (2012). [Google Scholar]
- 15.Koike, S. & Mimura, K. Dynamic contribution analysis of baseball batting motion considering joint torque generating type (in Japanese). Symposium on sports engineering: symposium on human dynamics 2015 1–10 (2015).
- 16.The Nippon Professional Baseball Organization & KojinNendobetsuSeiseki (2025). https://npb.jp/bis/players/
- 17.Breiman, L. Random forests. Mach. Learn.45, 5–32 (2001). [Google Scholar]
- 18.Morishita, K. Kikaigakusyu Wo Kaisyaku Suru Gizyutsu Yosokuryoku to Setsumeiryoku Wo Ryouristu Suru Zissen technique (in Japanese). Gizyutsuhyouronsya2, 11 (2021). [Google Scholar]
- 19.Alex, G., Adam, K., Justin, B. & Emil, P. Peeking inside the black box: visualizing statistical learning with plots of individual conditional expectation. J. Comput. Graph Stat.24 (1), 44–65 (2015). [Google Scholar]
- 20.Wolf, D., Farrag, G., Flügge, T. & Timm, L. H. Predicting outcome in clear aligner treatment: A machine learning analysis. J. Clin. Med.13 (13), 1–11 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The anonymized datasets generated and/or analyzed in the current study are available from the corresponding author upon reasonable request.











