Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

Dohyun Chun; Hae Woon Jung; Jongho Kang; Woo Young Jang; Jihun Kim

doi:10.1177/20552076251395975

. 2025 Nov 24;11:20552076251395975. doi: 10.1177/20552076251395975

Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

Dohyun Chun ^1,^2,^†, Hae Woon Jung ^3,^†, Jongho Kang ^2,⁴, Woo Young Jang ^5,^6,^*,^✉, Jihun Kim ^2,^7,^8,^*,^✉

PMCID: PMC12644449 PMID: 41306296

Abstract

Objective

We developed a precise, reliable artificial intelligence (AI) model for predicting the future height of children and adolescents based on anthropometric and body composition data.

Materials and Methods

We used an extensive longitudinal dataset from a large-scale Korean cohort study, which included 588,546 measurements from 96,485 children and adolescents aged 7–18. We developed a prediction model using the light gradient boosting method and integrated anthropometric and body composition metrics along with their standard deviation scores (SDSs) and velocity parameters. Model performance was assessed through root mean square error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE). We employed Shapley additive explanations (SHAP) for model interpretability.

Results

The model accurately predicted future heights. For males, the average RMSE, MAE, and MAPE were 2.51 cm, 1.74 cm, and 1.14%, respectively, with female prediction results showing comparable accuracy (2.28 cm, 1.68 cm, and 1.13%, respectively). Shapley additive explanations analysis revealed that the SDS of height, height velocity, and soft lean mass velocity were key predictors of future height. The model created personalized growth curves through estimation of individual-specific height trajectories, comparison with actual measurements, and identification of key variables using local SHAP values.

Conclusion

Our model produces accurate and personalized growth curves, incorporating explainable AI techniques for enhanced clinical understanding. This method advances pediatric growth assessment and provides robust clinical decision support. Despite limitations including the absence of handwrist radiography comparison and Korean population specificity, our approach demonstrates significant potential for early identification of growth disorders and optimization of growth outcomes.

Keywords: Height prediction, body composition, growth velocities, explainable artificial intelligence, personalized growth curves

Introduction

Evaluating growth and predicting height in children and adolescents are essential parts of pediatric care. Linear growth serves as a crucial health indicator that reflects the cumulative impact of genetic, environmental, and socioeconomic factors.^1–3 Tracking linear growth allows for the early identification of growth disorders such as growth hormone deficiency, celiac disease, and chronic conditions affecting development.^4–7 Predicting future height accurately is crucial for diagnosing growth disorders, starting hormone therapy, and assessing treatment effectiveness.^8–10 Traditional height prediction methods rely on skeletal maturity assessment using handwrist radiographs. These include the Bayley–Pinneau,¹¹ Tanner–Whitehouse,¹² and Roche–Wainer–Thissen¹³ methods. Nonetheless, these methods have drawbacks, including radiation exposure, a requirement for specialized expertise, and high interobserver variability.^14–16 Additionally, their accuracy diminishes in children with growth disorders or those receiving growth hormone therapy.^17–19

Considering the limitations of conventional growth assessment techniques, body composition measurements have become vital complementary indicators for tracking and forecasting pediatric growth and development.^20–25 Recent advancements in body composition assessment methods, such as multifrequency bioelectrical impedance analysis (BIA), have facilitated the collection of large-scale and extended pediatric data.^26–30 Machine learning (ML) techniques are suitable for analyzing these large-scale longitudinal datasets in growth modeling and height prediction. In growth-related fields, ML has been primarily applied to bone age assessment through image processing.¹⁵^31–35 Several studies have directly predicted height using ML; however, they relied only on basic anthropometric data and did not consider body composition measurements.^36,37 The recent emergence of extensive body composition databases has fueled innovative research, including models for height estimation using ML³⁸ and biological maturity evaluation frameworks that complement traditional skeletal maturity methods.³⁹

This study addressed the existing gap in predicting pediatric growth by utilizing ML algorithms on a large-scale, longitudinal dataset that integrated anthropometric and body composition measures. The gradient boosting framework effectively captured complex relationships between growth-related factors and height trajectories through iterative pattern learning. Such methodology facilitated the development of precise predictive models that identified critical growth determinants. The framework accommodated predictions from multiple baseline ages with varying prediction horizons, enabling growth trajectory modeling throughout pediatric development.

Our method presented several distinctive benefits. First, we integrated the standard deviation scores (SDSs) for body composition measures, which were obtained from a comprehensive body composition database.⁴⁰ These SDSs measured growth status in age- and sex-matched groups, providing standardized comparisons that went beyond absolute anthropometric and body composition values. Our longitudinal data additionally allowed us to calculate growth velocities, which are often overlooked in cross-sectional studies. Tracking growth velocities in height and body composition is vital for detecting abnormalities in growth patterns.^41–44 Incorporating body composition velocity provided valuable insights into the dynamic physical development of children, enabling a more comprehensive evaluation of growth trajectories. Additionally, we improved model interpretability by implementing explainable artificial intelligence (AI) techniques, such as the use of Shapley additive explanations (SHAP), to examine the key factors influencing growth predictions.^45–50 Our model integrated personalized growth curve estimation with local interpretability. This combination facilitated the early identification of at-risk children and enabled targeted interventions in conjunction with conventional growth monitoring techniques.

The structure of this paper is as follows. Section “Data and methods” outlines the data and methods used. Section “Results” highlights the results from our height prediction model and the analyses related to explainable AI. Section “Discussion” presents our findings and their implications, while Section “Conclusion” concludes the paper and proposes directions for future research.

Data and methods

Study population

The AI model was developed using data from the GP Co., Ltd Cohort Study (hereafter, GP Cohort Study), a long-term research project initially based in Gwangmyeong, South Korea. Gwangmyeong is an urban area adjacent to Seoul in Gyeonggi Province, which encompasses 44.8% of Korea's total population. Since 2015, data collection has expanded to include additional urban and suburban areas, including Osan, Siheung, and Ansan in Gyeonggi Province, as well as selected regions in other provinces. This study encompassed students from elementary, middle, and high schools aged 7–18, with an average participation of 35 schools each year. The cohort was not restricted to healthy children but included the general school population. Data collection began on 1 January 2013, and accumulated 588,546 measurements from 96,485 children and adolescents (50,480 males and 46,005 females) born between 1998 and 2016. Inclusion criteria comprised: (1) ability to maintain standard posture for anthropometric measurements and (2) written informed consent from guardians with assent from participants aged 12 years or older. Exclusion criteria included: (1) inability to undergo standard measurement procedures due to physical limitations, (2) absence of informed consent, and (3) noncompliance with premeasurement protocols such as the 2-h fasting requirement for body composition analysis. The Institutional Review Board of Korea University Anam Hospital approved the study protocol (2025AN0110). Due to the retrospective design with de-identified data, the IRB waived the informed consent requirement for data analysis.

Data collection and measurements

Measurements were conducted by dedicated staff, from GP Co., Ltd, who have performed standardized assessments since 2013. All measurements followed written protocols based on CDC anthropometric guidelines and manufacturer specifications. Heights were measured using a stadiometer, while body composition was assessed using octopolar multifrequency BIA (InBody models J10 and J30, InBody Inc., Seoul, Korea). The measurement staff were not involved in the subsequent data analysis or interpretation.

The predictor variables of interest comprised height, weight, protein mass, soft lean mass (SLM), body fat mass (BFM), skeletal muscle mass (SMM), bone mineral mass, total body water (TBW), basal metabolic rate (BMR), as well as waist and hip circumferences. The following indices were calculated: body mass index (BMI), BFM index, SMM index, and waist-to-hip ratio (WHR):

BMI (kg / m^{2}) = \frac{Weight (kg)}{Height {(m)}^{2}},

BFMI (kg / m^{2}) = \frac{BFM (kg)}{Height {(m)}^{2}},

SMMI (kg / m^{2}) = \frac{SMM (kg)}{Height {(m)}^{2}},

WHR = \frac{Waist circumference (cm)}{Hip circumference (cm)} .

The dataset preprocessing involved the following procedures. Observations containing missing values were removed at the observation level. Age was calculated in months from birth to measurement dates.^a Weight measurements were validated against the sum of body composition components (SLM, bone mineral mass, and BFM), with excessive discrepancies excluded to ensure data quality and measurement reliability. Statistical outlier detection was performed separately by sex and age group using the interquartile range (IQR) method. The anthropometric measurements of children and adolescents were standardized using the 2017 Korean National Growth Charts. This included height, weight, and BMI. Body composition parameters not available in these charts were standardized using the GP growth chart developed by Chun et al.^40,^b The dataset included both original measurements and their corresponding age- and sex-specific SDS. The preprocessed dataset contained 27 attributes, which included fundamental variables such as sex and age, as well as metrics related to anthropometry and body composition measures and associated SDS.

Data preparation

The target of the prediction was set as the future growth rate in height, determined through the following method. Constructing the prediction dataset involved selecting paired observations from participants with multiple measurements. For each participant, we selected two observations: the earlier observation (baseline) and the later observation (target). The height from the later observation (Height_T) was used to calculate the prediction target, with the age at the target time point (Months_T) included as a predictor to account for the prediction horizon. The growth rate, calculated as

Growth_Rate = \frac{Height_T - Height_baseline}{Height_baseline} \times 100 (%),

served as the target variable for baseline observation.^c Figure 1 illustrates this data structure. This method required participants to have at least two measurements for longitudinal analysis.

Prediction horizons between paired observations varied across participants (median: 19 months, IQR: 9–34 months). After applying preprocessing procedures to remove observations with missing values, measurement errors, and statistical outliers, then restricting the analysis to ages 7–16 years and excluding participants with single measurements, the final dataset comprised 44,683 participants (23,402 males and 21,281 females) with sufficient longitudinal data, generating 206,112 training samples (106,461 for males and 99,651 for females).

Additional predictor variables were included to effectively capture growth dynamics and trajectories. Growth_Rate_BM represents the expected growth rate if the participant maintains their current height SDS and serves as a benchmark for comparison. Growth velocity variables were calculated from historical measurements taken 3–6 months before baseline. These included height velocity (Height_V) and body composition velocities (weight velocity [Weight_V], Protein_V, SLM velocity [SLM_V], BFM_V, SMM_V). The velocities represent the rate of change per month and provide information about recent growth patterns that predict future trajectories.

Growth velocity calculations required measurements from 3 to 6 months before baseline. Of the 44,683 participants in the prediction dataset, 17,948 were excluded due to insufficient prior measurements. The remaining 26,735 participants (13,907 males and 12,828 females) generated 105,774 measurements (54,463 from males and 51,311 from females). The model's final input dataset was created by combining the target variable (Growth_Rate) with the prediction target information (Months_T, Growth_Rate_BM, and Height_BM) for each subject. This final dataset comprises 37 columns, encompassing the initial 27 attributes, 6 growth velocity metrics, 3 prediction target variables, and the target variable. Of these, 35 columns served as predictor variables, excluding the target variable and sex, as separate models were trained for males and females. Supplementary Table S1 provides details on the dataset's variables, including their descriptions and data types.

1. Prediction Dataset Creation	206,112 measurements (44,683 subjects) 106,461 males, 99,651 females
Preprocessing the original measurements Identify multiple observations per subject
Select two observations per subject (baseline and target)
Calculate the target variables (Growth_Rate)
Construct a dataset with the target variables
2. Feature Engineering	105,774 measurements (26,735 subjects) 54,463 males, 51,311 females Thirty-seven columns (35 predictors)
Incorporation of benchmark features (Height_BM, Growth_Rate_BM)
Computation of the growth velocity variables
3. Data Integration and Transformation
Merge target variables and prediction information
Normalize the target variable (Yeo-Johnson method)
Scale Predictor Variables (Robust Scaler)

Open in a new tab

Prediction model

The height prediction model was trained using the Light Gradient Boosting Method, a framework that utilizes tree-based learning algorithms for gradient boosting. This framework is well-known for its efficiency and accuracy when handling large datasets.^d The dataset was divided using a stratified sampling method based on individual identification numbers. A random 20% of individuals formed the test set, while the remaining 80% constituted the training set. This approach ensured that data from the same individual did not overlap between the training and test sets, thereby avoiding data leakage and reducing the risk of overestimating model accuracy. Distinct models were created for males and females to account for possible sex-specific variations in growth patterns. The representativeness and comparability of the training and test datasets were evaluated by comparing the distributions of key variables and calculating the median and IQR for each variable.

Hyperparameter optimization was conducted through grid search, accompanied by 5-fold cross-validation on the training dataset. The training set was divided into five equal-sized folds, with the model trained on four of these and validated on the remaining fold. This cycle was repeated five times, ensuring each fold acted as the validation set once. The optimized hyperparameters comprised the maximum depth of the trees, the maximum number of leaves per tree, the number of boosting rounds, and the learning rate. These structural parameters serve as regularization mechanisms that constrain model complexity to prevent overfitting. The best hyperparameter configuration from the cross-validation was applied to train the final model on the entire training set. The trained model was then evaluated on an independent test set.

We assessed the performance of the trained models using the test set, calculating three common performance metrics: root mean squared error (RMSE), mean absolute percentage error (MAPE), and mean absolute error (MAE). To evaluate the model's stability and reliability, we employed a bootstrapping method, randomly dividing the test set into 50 subsets. We computed the performance metrics (RMSE, MAE, and MAPE) for each subgroup and estimated the standard deviation and 95% confidence intervals (CIs) for each metric. Heatmaps illustrating MAPE values across different age groups and prediction horizons were created to examine the model's performance. Figure 2 illustrates the flowchart of the model development pipeline, from data preparation to model evaluation.

Figure 2. — Model development pipeline overview.

Explainable AI

The model for predicting height was analyzed using three explainable AI techniques: feature importance, accumulated local effects (ALEs), and SHAP. These methods highlighted the key factors impacting the predicted heights of children and adolescents. Feature importance measures the relative contribution of each feature to the model's predictions. The importance values were determined based on the average gain of each feature, with higher values indicating a greater influence. To facilitate interpretation and comparison, these values were normalized to a scale of 100. Accumulated local effect plots display the average impact of individual features on model predictions across their respective value ranges, considering interactions between features. These plots illustrate the cumulative local effect on predicted growth in height as the value of a specific feature changes while keeping other features constant. The analysis focused on the key features identified by their importance scores. Shapley additive explanations provided a further method to elucidate the impact of features on model predictions. Using game-theoretic Shapley values, this method assesses the marginal impact of each feature on all projections, enabling an analysis of both overall importance and individual cases. Shapley additive explanations values were derived for every instance and visualized through summary plots.

Personalized growth curve estimation

Estimating personalized growth curves is a crucial aspect of our methodology, enabling individualized assessments of growth. This method produces a tailored growth trajectory by forecasting heights at designated ages. The predicted curves are displayed alongside actual height measurements and population-level benchmarks. We utilized local SHAP to enhance model interpretability by pinpointing and emphasizing the most notable features influencing individual predictions. To illustrate these concepts, we present case studies of randomly chosen male and female subjects in the results section.

Results

Baseline characteristics

Table 1 summarizes the baseline characteristics of the study population, categorized by sex and dataset (training vs. test). In the case of male participants, the training set comprised 261,567 data points from 11,125 unique individuals, while the test set contained 66,840 data points from 2782 unique males. For female participants, the training set included 247,580 data points from 10,262 unique individuals, and the test set had 65,104 data points from 2566 unique females. The table shows the median and IQR for essential variables, including age (in years), height, weight, BMI, protein mass, SMM, BMR, TBW, and the target height.

Table 1.

Baseline characteristics.

	Male		Female
	Training set (n = 261,567)	Test set (n = 66,840)	Training set (n = 247,580)	Test set (n = 65,104)
Number of Entities	11,125	2782	10,262	2566
Age (months)	9.08 [8.08–10.50]	9.00 [8.17–10.42]	9.00 [8.08–10.33]	8.92 [8.08–10.33]
Height	134.9 [128.6–142.8]	135.0 [128.9–142.8]	133.6 [127.3–142.5]	133.2 [127.2–142.0]
Weight	32.3 [27.2–40.3]	32.3 [27.1–39.9]	30.6 [25.9–37.5]	29.8 [25.5–36.6]
BMI	17.6 [16.0–20.2]	17.5 [15.9–20.0]	17.0 [15.5–19.1]	16.7 [15.3–18.7]
Protein	5.0 [4.4–5.8]	5.0 [4.4–5.8]	4.6 [4.1–5.5]	4.6 [4.1–5.4]
SLM	23.9 [21.1–27.9]	23.9 [21.2–27.9]	22.2 [19.7–26.2]	21.9 [19.5–25.7]
BFM	6.3 [4.0–11.0]	6.0 [4.0–10.0]	7.0 [5.0–10.0]	6.0 [4.0–10.0]
SMM	13.0 [11.3–15.6]	13.0 [11.3–15.5]	12.0 [10.4–14.4]	11.8 [10.3–14.2]
BMR	917 [853–1009]	917 [855–1009]	879 [821–971]	873.0 [817–960]
TBW	18.6 [16.4–21.7]	18.6 [16.5–21.7]	17.3 [15.3–20.4]	17.1 [15.2–20.0]
Target Height	147.8 [138.8–159.5]	147.9 [138.8–158.9]	148.3 [138.3–156.3]	148.0 [138.1–156.0]

Open in a new tab

The table presents the median and interquartile range [1st quartile–3rd quartile] for variables in the training and test sets. The number of data points (n) and unique individuals (Number of Entities) are also reported for each set.

BFM: body fat mass; BMI: body mass index; BMR: basal metabolic rate; SLM: soft lean mass; SMM: skeletal muscle mass; TBW: total body water.

In the training set, males had a median age of 9.08 years, a height of 134.9 cm, weight of 32.3 kg, BMI of 17.6 kg/m², protein mass of 5.0 kg, SLM of 23.9 kg, BFM of 6.3 kg, SMM of 13.0 kg, BMR of 917 kcal, TBW of 18.6 kg, and a target height of 147.8 cm. For females in the training set, the median age was 9.00 years, with a height of 133.6 cm, weight of 30.6 kg, BMI of 17.0 kg/m², protein mass of 4.6 kg, SLM of 22.2 kg, BFM of 7.0 kg, SMM of 12.0 kg, BMR of 879 kcal, TBW of 17.3 kg, and a target height of 148.3 cm. The test set displayed comparable values for both sexes, demonstrating a suitable balance between the training and test sets.

Prediction accuracy

Table 2 displays the prediction accuracy metrics, indicating high accuracy for both male and female models. For males, the models yielded a mean RMSE of 2.51 ± 0.07 cm [95% CI 2.37–2.66 cm], MAE of 1.74 ± 0.05 cm [95% CI 1.65–1.83 cm], and MAPE of 1.14% ± 0.03% [95% CI 1.08%–1.20%]. In contrast, females displayed a mean RMSE of 2.28 ± 0.07 [95% CI 2.15–2.41] cm, MAE of 1.68 ± 0.05 [95% CI 1.58–1.78] cm, and MAPE of 1.13 ± 0.03 [95% CI 1.06–1.19].

Table 2.

Overall prediction accuracy.

Sex	RMSE	MAE	MAPE
Male	2.51 ± 0.07	1.74 ± 0.05	1.14 ± 0.03
	(2.37–2.66)	(1.65–1.83)	(1.08–1.20)
Female	2.28 ± 0.07	1.68 ± 0.05	1.13 ± 0.03
	(2.15–2.41)	(1.58–1.78)	(1.06–1.19)

Open in a new tab

The table presents the performance metrics for height prediction models, including RMSE, MAE, and MAPE. Data are presented as the mean ± SD (95% CI) for each metric, allowing for an assessment of the model's stability and reliability across 50 different test data subsets.

CI: confidence interval; MAE: mean absolute error; MAPE: mean absolute percentage error; RMSE: root mean square error.

To evaluate the performance of the prediction model, we created heatmaps that illustrate the MAPE across various age groups and prediction horizons for both males and females (Figure 3). These heatmaps indicated that prediction accuracy fluctuated based on the current age and length of the prediction horizon. Overall, the model displayed lower MAPE for shorter prediction horizons and older ages in both sexes. Mean absolute percentage error values rose with extended prediction horizons and younger ages, as shown by the lighter shades in the upper-left section of the heatmaps. Despite these fluctuations, the model exhibited strong performance, with the highest MAPE remaining under 2.42% for males (predicting 5 years ahead from age 7) and 2.00% for females (predicting 5.5 years ahead from age 10).

Figure 3. — Mean absolute percentage error (MAPE) heatmaps across different age groups and prediction horizons. *Notes:* Heatmaps depicting MAPE across different age groups and prediction horizons for (a) males and (b) females. The MAPE is calculated for each combination of age (in years) on the x-axis and prediction horizon (in years) on the y-axis. The color scale indicates the MAPE values, with darker shades of red representing lower errors.

Feature importance

We identified the most significant height predictors by calculating feature importance values, which represent the relative contribution of each feature to the prediction model based on its average gain (Figure 4). The top 20 features for each sex were visualized using horizontal bar plots, with importance values normalized to a scale of 100. We note that prediction target information, such as Height_BM, Months_T, Months, and Growth_Rate_BM, was excluded from this plot to focus on the anthropometric and body composition variables. The results showed that the SDS of current height (SDS_Height) had the greatest influence on both males and females, followed by Height_V and SLM_V after considering the prediction target. Beyond these variables, additional top predictors of males included height, WHR, and SMM. For females, other key predictors were WHR, Weight_V, and height.

Figure 4. — Feature importance plots.

Note: Importance of features within the height prediction models for (a) males and (b) females. The feature importance values are normalized to a total sum of 100, with higher values indicating greater importance. To focus on the importance of the anthropometric and body composition variables, we excluded prediction target information variables (Height_BM, Months_T, Months, and Growth_Rate_BM) from plots. Among these excluded variables, Growth_Rate_BM had an overwhelmingly high importance of 96.33% for males and 96.11% for females.

Accumulated local effect

The ALE plots illustrate the average influence of individual features on model predictions, considering feature interactions (Figure 5). We observe a strong negative and nonlinear association between the SDS_Height and the estimated growth in height. A distinct negative correlation was observed when the scaled SDS_Height was below 0, whereas no significant correlation was found for values above 0. Likewise, a negative association was noted between the Height_V and the predicted growth in height. Conversely, the SLM_V showed a positive correlation with the anticipated growth in height.

Shapley additive explanations

The SHAP summary plots demonstrate the impact of features on model predictions by calculating Shapley values (Figure 6). Red represents features that increase predictions, whereas blue denotes those that decrease them. Important variables for both sexes include fundamental anthropometric metrics such as benchmark growth rate (Growth_Rate_BM) and SDS_Height. Growth velocities, including SLM_V and Height_V, had a significant effect on model results. Notably, the height SDS and velocity exhibited inverse relations with the predicted growth rate, while the SLM_V showed a positive correlation, consistent with previous findings.

Individual height prediction curves

A major advantage of our height prediction model is its ability to create personalized growth curves for children and adolescents.^e To highlight this feature and enhance local interpretability, we present two illustrative examples (Figure 7). These examples were randomly chosen from the test set, featuring one boy and one girl, referred to as Boy #1 and Girl #1, respectively.

Figure 7 presents the predicted and actual growth curves along with the associated SHAP decision plots for Boy #1 and Girl #1. These growth curves were created using the trained model and the available longitudinal data for each child. For Boy #1, the Growth_Rate_BM (2.556) had the most significant positive impact on predicted growth in height. Additionally, the WHR (1.181) and Height_V (0.769) also contributed positively to the predicted growth in height. In contrast, for Girl #1, the Growth_Rate_BM (2.028) was the most notable positive contributor. The TBW percentage (TBWP; 0.864), the SDS of BFM (SDS_BFM; −0.909), and the SDS of SMM (SDS_SMM; −0.517) further illustrate positive influences on the predicted height.

Discussion

In this study, we developed an accurate and reliable AI model to predict the future heights of children and adolescents by analyzing extensive anthropometric and body composition data from a large-scale longitudinal research project. The prediction model yielded mean RMSEs of 2.51 cm for males and 2.28 cm for females, MAEs of 1.74 cm for males and 1.68 cm for females, along with MAPEs of 1.14% for males and 1.13% for females. Performance varied by age group and prediction horizon, showing improved accuracy for shorter prediction intervals and older ages. Importantly, the model maintained high accuracy across all scenarios, with MAPE values consistently below 2.42% for males and 2.00% for females in every age group and prediction horizon.

Previous height prediction studies demonstrate varied accuracies across methodologies. Direct ML prediction from anthropometric data yielded MAE of 3.35–3.66 cm³⁶ and 2.46–3.00 cm.³⁷ Deep learning methods using radiographic images showed MAE of 4.62 cm in Korean children.³⁴ Traditional bone age-based methods produced RMSE of 2.7–3.3 cm with automated BoneXpert³² and SD of 2.5–2.8 cm with Tanner–Whitehouse in athletic populations.⁵¹ Bayley–Pinneau demonstrated SD of 3.9–4.2 cm in Korean short stature children.⁵² In ISS cohorts, multiple linear regression models achieved SD of 3.70–4.19 cm while traditional Bayley–Pinneau showed SD of 5.81 cm.⁵³ Previous investigations primarily analyzed limited sample sizes from restricted cohorts or populations with specific clinical characteristics. Our analysis examined 206,112 training samples from 44,683 Korean children aged 7–16 years from general school populations. Despite analyzing this heterogeneous population spanning normal growth patterns and diverse clinical conditions, our model achieved superior accuracy to established methods.

Our findings emphasize the crucial roles of body composition and growth velocity in predicting future height. Notably, the analysis of feature importance, along with the SHAP results, pinpointed height and SLM_V as the key variables. The ALE plots unveiled interesting connections between these growth velocities and anticipated height increases. A negative correlation between Height_V and expected growth in height supports the idea of catch-up growth, in which children who initially experience growth delays may have accelerated growth in later periods.^54–56 Conversely, the SLM_V exhibited a positive correlation with predicted growth in height, suggesting that children with greater muscle accumulation are likely to experience increased height. Nonetheless, the connection between lean mass increase and future height in children and adolescents is still a matter of debate. Although these results highlight the intricate relation between lean mass development and height achievement, the observational nature of the study limits our conclusions to possible correlations rather than establishing causal relations.

The generation of personalized growth curves is a notable feature of this model. The integration of explainable AI techniques, particularly SHAP decision plots, provides interpretable insights into the factors driving individual growth predictions. For example, the SHAP decision plot for Boy #1 shows that his predicted height was primarily influenced by his SDS of height, WHR, and Height_V. In contrast, Girl #1's predicted growth in height was mainly affected by her SDS of height, TBWP, and SDS_BFM. These personalized insights demonstrate the model's potential for tailored growth assessments.

Our study enhances pediatric growth assessment by applying ML algorithms to a comprehensive set of anthropometric and body composition measures. Conventional pediatric height predictions usually rely on skeletal maturity assessment using handwrist radiographs,^11–13 which have limitations such as the need for specialized expertise, high interobserver variability, and reduced accuracy in assessing children with growth disorders.^14–19 Our study provides complementary information by utilizing body composition data from BIA, which is noninvasive, cost-effective, and convenient, enabling large-scale and long-term data collection.^27–30 This method offers additional insights into biological maturity beyond skeletal maturity.^20–25

To effectively leverage longitudinal body composition data, our study employed SDS for measuring body composition and growth velocities. These SDS, derived from a comprehensive database, provide a standardized method to quantify relative growth status within age- and sex-matched cohorts.⁴⁰ This standardization facilitates the identification of growth patterns across various populations and improves the comparability of growth data in longitudinal studies. Additionally, our longitudinal data enabled the calculation of growth velocities, which are crucial for detecting deviations from the typical growth pattern.^41–44 The inclusion of body composition velocity offers unique insights into children's physical development dynamics, deepening our understanding of growth trajectories.

This study employs ML techniques to analyze large biometric datasets, aiming to directly predict height in children and adolescents. It overcomes the constraints of past research, which primarily focused on utilizing radiographic data for automated skeletal maturity assessments.¹⁵^31–35 Previous efforts to predict height directly were limited by smaller datasets and insufficient body composition measures.^36,37 This study resolves these issues by incorporating comprehensive body composition data from the extensive GP Cohort Study. By applying specialized big data methods, the model gathers insights from various variables, enabling the creation of an unbiased, objective, and reliable AI model that delivers consistent results.

The integration of SHAP analysis provides model transparency by revealing how body composition parameters influence height predictions. This interpretability identifies population-level growth determinants and offers clinicians’ evidence-based reasoning for individual predictions. Furthermore, SHAP analysis revealed distinct predictive patterns between sexes. Males relied primarily on SDS of height, SLM_V, Height_V, and SMM. Females utilized SDS of height with different body composition markers including SDS_SMM, WHR, and TBWP. Weight velocity emerged as a key predictor exclusively in female models. The identification of these distinct predictive patterns validates our sex-stratified modeling approach, given that males and females follow different growth trajectories during adolescence.⁵⁷

For clinical implementation, our AI model enhances pediatric growth assessment through height trajectory predictions. The model enables clinicians to distinguish children requiring intervention from those with adequate growth potential despite current short stature. Clinicians can use these predicted trajectories as objective references for growth hormone therapy decisions and treatment monitoring.^8,10 When observed growth deviates from predicted patterns during follow-up, they can identify needs for reassessment or therapeutic adjustments.^17–19 Shapley additive explanations analysis enhances clinical interpretation by quantifying the relative contributions of body composition parameters to individual predictions. This integration of predictions with transparent reasoning supports data-driven clinical decisions while maintaining interpretive flexibility for complex cases. Moreover, the noninvasive nature of BIA enables frequent monitoring without radiation exposure and population-level growth surveillance through school-based screening programs. This scalability facilitates early identification of at-risk populations and provides epidemiological data for public health interventions.

One major limitation of our study is the absence of handwrist radiography data. This hindered our ability to directly compare our ML-based approach with conventional methods for evaluating skeletal maturity. Had we had access to this data, we could have assessed the added value of our model compared to traditional approaches and created a more comprehensive framework for predicting growth. Similarly, the unavailability of comparable longitudinal datasets with comprehensive body composition measurements prevented external validation of our model. Additionally, another limitation is the observational nature of the study, as we were unable to control for variables that could affect growth outcomes. As a result, the links identified between body composition and upcoming height through explainable AI methods should be interpreted with caution as they may not directly reflect causal relations.

Our model was developed exclusively using data from South Korean children and adolescents. Population-specific growth patterns represent a fundamental consideration for model generalizability. The onset of puberty, body composition trajectories, and adult height potential exhibit substantial variations across racial and ethnic groups.^58,59 These biological differences indicate that predictive relationships between body composition and height trajectory identified in Korean children may not directly translate to other populations. Future research should prioritize acquiring diverse ethnic datasets to conduct comprehensive external validation. More importantly, integrating multiethnic growth data with existing Korean cohort data could enable the development of population-adjusted prediction frameworks.

In addition, our research concentrated on children and adolescents aged 7–16 years due to limited data availability, which may restrict the applicability of our model for younger children or in predicting adult height. The reliability of our model for these populations could be compromised due to insufficient data on early growth trends, development after age 16, and the possible impact of various factors on adult height. Moreover, although our model exhibits strong accuracy and interpretability, integrating ML models into clinical workflows poses significant challenges. To enhance adoption, it is crucial to provide user-friendly interfaces and ensure smooth integration with electronic health record systems. Additionally, healthcare professionals will need training and support to accurately interpret and utilize the model's insights in their clinical decision-making. Finally, while our model encompasses a wide range of anthropometric and body composition metrics, it does not account for other genetic, environmental, or socioeconomic factors that may also play a role.

Despite these challenges, our research offers significant insights into the potential application of ML for assessing pediatric growth. By examining a large longitudinal dataset and considering various factors, our model generates tailored growth curves that provide interpretable insights, enhancing clinical decision-making. Future studies should focus on gathering comprehensive data, such as handwrist radiography, and utilize rigorous experimental methodologies to validate and expand our results. This study lays the essential foundation for further investigation into the application of ML in pediatric growth assessment, highlighting the need for continued interdisciplinary collaboration to refine these techniques and integrate them into clinical practice settings.

Conclusion

This study created an ML model to predict future height in children and adolescents by using anthropometric and body composition parameters. The model was trained on a large, longitudinal dataset, enabling it to forecast height and generate personalized growth curves accurately. Explainable AI methods were implemented to improve the interpretability of the model. While there are some limitations to this approach, it makes a significant contribution to pediatric growth assessment. It could serve as a valuable tool in clinical settings, helping to identify growth disorders and optimize outcomes.

Supplemental Material

sj-docx-1-dhj-10.1177_20552076251395975 - Supplemental material for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

sj-docx-1-dhj-10.1177_20552076251395975.docx^{(373.7KB, docx)}

Supplemental material, sj-docx-1-dhj-10.1177_20552076251395975 for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data by Dohyun Chun, Hae Woon Jung, Jongho Kang, Woo Young Jang and Jihun Kim in DIGITAL HEALTH

sj-pdf-2-dhj-10.1177_20552076251395975 - Supplemental material for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

sj-pdf-2-dhj-10.1177_20552076251395975.pdf^{(546.7KB, pdf)}

Supplemental material, sj-pdf-2-dhj-10.1177_20552076251395975 for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data by Dohyun Chun, Hae Woon Jung, Jongho Kang, Woo Young Jang and Jihun Kim in DIGITAL HEALTH

sj-docx-3-dhj-10.1177_20552076251395975 - Supplemental material for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

sj-docx-3-dhj-10.1177_20552076251395975.docx^{(34.5KB, docx)}

Supplemental material, sj-docx-3-dhj-10.1177_20552076251395975 for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data by Dohyun Chun, Hae Woon Jung, Jongho Kang, Woo Young Jang and Jihun Kim in DIGITAL HEALTH

Acknowledgements

The authors thank Enago for English language editing assistance.

^a.

The age variable was utilized in the model as months and labeled as “Months”; however, for the convenience of readers, it was converted to years and displayed as “age (years).”

^b.

The Korean National Growth Charts do not provide reference values for body composition parameters. The GP growth chart [40] fills this gap by providing monthly reference percentiles for fat-free mass, body fat mass, and related indices in Korean children aged 7–16 years, based on 88,069 measurements from 22,515 children.

^c.

We evaluated various target formulations including absolute height change, height velocity, and SDS change. The percentage growth rate, which normalizes for baseline height, demonstrated optimal predictive performance.

^d.

While deep learning models showed computational inefficiency and reduced stability for tabular data, LightGBM and XGBoost demonstrated comparable performance, with LightGBM chosen for slightly better efficiency.

^e.

Supplementary Figure S1 depicts the actual and predicted growth curves for 1000 males and females sampled from the test dataset. The actual height measurements for each child were plotted against age in years, as shown in panels (a) and (c). Predicted height curves were generated using trained sex-specific models up to age 16, as presented in panels (b) and (d).

Footnotes

ORCID iDs: Dohyun Chun https://orcid.org/0000-0003-3031-4011

Hae Woon Jung https://orcid.org/0000-0003-0494-4626

Jongho Kang https://orcid.org/0000-0002-4273-2440

Woo Young Jang https://orcid.org/0000-0003-1775-7715

Jihun Kim https://orcid.org/0000-0002-2957-8776

Ethical considerations: The Institutional Review Board of Korea University Anam Hospital approved this study (2025AN0110). Written informed consent was obtained from guardians during the original data collection, with assent from participants aged 12 years or older. For this retrospective analysis using de-identified data from the existing cohort, the IRB waived the requirement for additional informed consent.

Ethics of using AI: Our pediatric AI model development followed Korea's Personal Information Protection Act and healthcare data guidelines with IRB approval (2025AN0110). We implemented pseudonymization, access controls, and explainable AI methods (SHAP) for transparency. We acknowledge population-specific limitations and emphasize that AI predictions should supplement clinical judgment. AI-assisted tools were used solely for language improvement in manuscript preparation. All scientific content represents the authors’ original work.

Consent to participate: Written informed consent was obtained from guardians during original data collection. The IRB waived additional consent for this retrospective analysis using de-identified data.

Contributorship: JK and WYJ performed conceptualization, funding acquisition, methodology, project administration, and supervision. DC, HWJ, and JK contributed to data curation, formal analysis, and investigation. DC and HWJ contributed to software, validation, visualization, and writing—original draft. All authors contributed to writing—review & editing.

Funding: The author disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by a National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) [NRF-2022R1A2C2092726], Yonsei University MIRAE Campus [grant number 2024-62-0059], and the Regional Innovation System & Education (RISE) program through the Gangwon RISE Center, funded by the Ministry of Education (MOE) and the Gangwon State (G.S.), Republic of Korea [2025-RISE-10-006]. This research was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (grant number: 2460002156).

D.C., J. Kang, and J. Kim are employees and shareholders of GP Co., Ltd. H.W.J. and W.Y.J. declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Data availability: The data that support the findings of this study are available from GP Co., Ltd. but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of GP Co., Ltd.

Supplemental material: Supplemental material for this article is available online.

References

1.Norris SA, Frongillo EA, Black MM, et al. Nutrition in adolescent growth and development. Lancet 2022; 399: 172–184. [DOI] [PubMed] [Google Scholar]
2.Baxter-Jones AD, Faulkner RA, Forwood MR, et al. Bone mineral accrual from 8 to 30 years of age: an estimation of peak bone mass. J Bone Miner Res 2011; 26: 1729–1739. [DOI] [PubMed] [Google Scholar]
3.Hargreaves D, Mates E, Menon P, et al. Strategies and interventions for healthy adolescent growth, nutrition, and development. Lancet 2022; 399: 198–210. [DOI] [PubMed] [Google Scholar]
4.Saari A, Harju S, Mäkitie O, et al. Systematic growth monitoring for the early detection of celiac disease in children. JAMA Pediatr 2015; 169: e1525. [DOI] [PubMed] [Google Scholar]
5.Craig D, Fayter D, Stirk L, et al. Growth monitoring for short stature: update of a systematic review and economic model. Health Technol Assess 2011; 15: 1–162. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Grote FK, Van Dommelen P, Oostdijk W, et al. Developing evidence-based guidelines for referral for short stature. Arch Dis Child 2008; 93: 212–217. [DOI] [PubMed] [Google Scholar]
7.Zhang Z, Lindstrom MJ, Farrell PM, et al. Pubertal height growth and adult height in cystic fibrosis after newborn screening. Pediatrics 2016; 137: e20152907. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Collett-Solberg PF, Jorge AA, Boguszewski MC, et al. Growth hormone therapy in children; research and practice—a review. Growth Horm IGF Res 2019; 44: 20–32. [DOI] [PubMed] [Google Scholar]
9.Ostojic SM. Prediction of adult height by Tanner-Whitehouse method in young Caucasian male athletes. QJM 2013; 106: 341–345. [DOI] [PubMed] [Google Scholar]
10.Cuttler L, Silvers JB. Growth hormone treatment for idiopathic short stature: implications for practice and policy. Arch Pediatr Adolesc Med 2004; 158: 108–110. [DOI] [PubMed] [Google Scholar]
11.Bayley N, Pinneau SR. Tables for predicting adult height from skeletal age: revised for use with the Greulich-Pyle hand standards. J Pediatr 1952; 40: 423–441. [DOI] [PubMed] [Google Scholar]
12.Tanner JM, Whitehouse RH, Marshall WA, et al. Prediction of adult height from height, bone age, and occurrence of menarche, at ages 4 to 16 with allowance for midparent height. Arch Dis Child 1975; 50: 14–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Roche AF, Wainer H, Thissen D. The RWT method for the prediction of adult stature. Pediatrics 1975; 56: 1026–1033. [PubMed] [Google Scholar]
14.Bull RK, Edwards PD, Kemp PM, et al. Bone age assessment: a large scale comparison of the Greulich and Pyle, and Tanner and Whitehouse (TW2) methods. Arch Dis Child 1999; 81: 172–173. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Chávez-Vázquez AG, Klünder-Klünder M, Garibay-Nieto GN, et al. Evaluation of height prediction models: from traditional methods to artificial intelligence. Pediatr Res 2024; 95: 308–315. [DOI] [PubMed] [Google Scholar]
16.Prokop-Piotrkowska M, Marszałek-Dziuba K, Moszczyńska E, et al. Traditional and new methods of bone age assessment—an overview. J Clin Res Pediatr Endocrinol 2021; 13: 251–262. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Topor LS, Feldman HA, Bauchner H, et al. Variation in methods of predicting adult height for children with idiopathic short stature. Pediatrics 2010; 126: 938–944. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Bramswig JH, Fasse M, Holthoff ML, et al. Adult height in boys and girls with untreated short stature and constitutional delay of growth and puberty: accuracy of five different methods of height prediction. J Pediatr 1990; 117: 886–891. [DOI] [PubMed] [Google Scholar]
19.Bonfig W, Schwarz HP. Overestimation of final height prediction in patients with classical congenital adrenal hyperplasia using the Bayley and Pinneau method. J Pediatr Endocrinol Metab 2012; 25: 645–649. [DOI] [PubMed] [Google Scholar]
20.Johnson W, Stovitz SD, Choh AC, et al. Patterns of linear growth and skeletal maturation from birth to 18 years of age in overweight young adults. Int J Obes (Lond 2012; 36: 535–541. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Dalskov S, Müller M, Ritz C, et al. Effects of dietary protein and glycaemic index on biomarkers of bone turnover in children. Br J Nutr 2013; 109: 1301–1312. [DOI] [PubMed] [Google Scholar]
22.Jamison DT, Breman JG, Measham AR, et al. Disease control priorities in developing countries. 2nd ed. Washington, DC: World Bank, 2006. [PubMed] [Google Scholar]
23.Alimujiang A, Colditz GA, Gardner JD, et al. Childhood diet and growth in boys in relation to timing of puberty and adult height: the Longitudinal Studies of Child Health and Development. Cancer Causes Control 2018; 29: 915–926. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.He Q, Karlberg J. BMI In childhood and its association with height gain, timing of puberty, and final height. Pediatr Res 2001; 49: 244–251. [DOI] [PubMed] [Google Scholar]
25.Brener A, Bello R, Lebenthal Y, et al. The impact of adolescent obesity on adult height. Horm Res Paediatr 2017; 88: 237–243. [DOI] [PubMed] [Google Scholar]
26.Chun D, Kim SJ, Suh J, et al. Timing, velocity, and magnitude of pubertal changes in body composition: a longitudinal study. Pediatr Res 2025; 97: 293–300. [DOI] [PubMed] [Google Scholar]
27.Jansen EC, Dunietz GL, Chervin RD, et al. Adiposity in adolescents: the interplay of sleep duration and sleep variability. J Pediatr 2018; 203: 309–316. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Hanem LGE, Salvesen Ø, Juliusson PB, et al. Intrauterine metformin exposure and offspring cardiometabolic risk factors (PedMet study): a 5-10 year follow-up of the PregMet randomised controlled trial. Lancet Child Adolesc Health 2019; 3: 166–174. [DOI] [PubMed] [Google Scholar]
29.Rodríguez-Carmona Y, Meijer JL, Zhou Y, et al. Metabolomics reveals sex-specific pathways associated with changes in adiposity and muscle mass in a cohort of Mexican adolescents. Pediatr Obes 2022; 17: e12887. [DOI] [PubMed] [Google Scholar]
30.Atazadegan MA, Heidari-Beni M, Entezari MH, et al. Effects of synbiotic supplementation on anthropometric indices and body composition in overweight or obese children and adolescents: a randomized, double-blind, placebo-controlled clinical trial. World J Pediatr 2023; 19: 356–365. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Martin DD, Schittenhelm J, Thodberg HH. Validation of adult height prediction based on automated bone age determination in the Paris Longitudinal Study of healthy children. Pediatr Radiol 2016; 46: 263–269. [DOI] [PubMed] [Google Scholar]
32.Thodberg HH, Jenni OG, Caflisch J, et al. Prediction of adult height based on automated determination of bone age. J Clin Endocrinol Metab 2009; 94: 4868–4874. [DOI] [PubMed] [Google Scholar]
33.Thodberg HH, Juul A, Lomholt J, et al. Adult height prediction models. In: Preedy VR. (ed) Handbook of growth and growth monitoring in health and disease. New York: Springer, 2012, pp.27–57. [Google Scholar]
34.Suh J, Heo J, Kim SJ, et al. Bone age estimation and prediction of final adult height using deep learning. Yonsei Med J 2023; 64: 679–686. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Satoh M. Bone age: assessment methods and clinical applications. Clin Pediatr Endocrinol 2015; 24: 143–152. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Shmoish M, German A, Devir N, et al. Prediction of adult height by machine learning technique. J Clin Endocrinol Metab 2021; 106: e2700–e2710. [DOI] [PubMed] [Google Scholar]
37.Mlakar M, Gradišek A, Luštrek M, et al. Adult height prediction using the growth curve comparison method. PLoS One 2023; 18: e0281960. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Chun D, Chung T, Kang J, et al. Height estimation in children and adolescents using body composition big data: machine-learning and explainable artificial intelligence approach. Digit Health 2025; 11: 20552076251331879. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Jung HW, Chun D, Choi JH, et al. Comparison of adult height prediction using bone age and body composition for growth assessment in Korean children. Sci Rep 2025; 15: 10581. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Chun D, Kim SJ, Suh J, et al. Big data-based reference centiles for body composition in Korean children and adolescents: a cross-sectional study. BMC Pediatr 2024; 24: 23. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.World Health Organization. WHO child growth standards: growth velocity based on weight, length and head circumference: methods and development. Geneva: WHO, 2009. [Google Scholar]
42.Voss LD, Wilkin TJ, Bailey BJ, et al. The reliability of height and height velocity in the assessment of growth (the Wessex Growth Study). Arch Dis Child 1991; 66: 833–837. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.McCarthy A, Hughes R, Tilling K, et al. Birth weight; postnatal, infant, and childhood growth; and obesity in young adulthood: evidence from the Barry Caerphilly growth study. Am J Clin Nutr 2007; 86: 907–913. [DOI] [PubMed] [Google Scholar]
44.Ekelund U, Ong K, Linné Y, et al. Upward weight percentile crossing in infancy and early childhood independently predicts fat mass in young adults: the Stockholm Weight Development Study (SWEDES). Am J Clin Nutr 2006; 83: 324–330. [DOI] [PubMed] [Google Scholar]
45.Karim MR, Islam T, Shajalal M, et al. Explainable AI for bioinformatics: methods, tools and applications. Brief Bioinform 2023; 24: bbad236. [DOI] [PubMed] [Google Scholar]
46.Gu D, Su K, Zhao H. A case-based ensemble learning system for explainable breast cancer recurrence prediction. Artif Intell Med 2020; 107: 101858. [DOI] [PubMed] [Google Scholar]
47.Liu Y, Herrin J, Huang C, et al. Nonexercise machine learning models for maximal oxygen uptake prediction in national population surveys. J Am Med Inform Assoc 2023; 30: 943–952. [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Javidi H, Mariam A, Alkhaled L, et al. An interpretable predictive deep learning platform for pediatric metabolic diseases. J Am Med Inform Assoc 2024; 31: 1227–1238. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Moncada-Torres A, van Maaren MC, Hendriks MP, et al. Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival. Sci Rep 2021; 11: 6968. [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Seethi VDR, LaCasse Z, Chivte P, et al. An explainable AI approach for diagnosis of COVID-19 using MALDI-ToF mass spectrometry. Expert Syst Appl 2024; 236: 121226. [Google Scholar]
51.Jeong KH, Jung JH, Kim JM, et al. Near final height in Korean children referred for evaluation of short stature: a single tertiary center experience. Ann Pediatr Endocrinol Metab 2018; 23: 28–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Lolli L, Batterham AM, Weston KL, et al. Tanner-Whitehouse and Modified Bayley-Pinneau adult height predictions in elite youth soccer players from the Middle East. Med Sci Sports Exerc 2021; 53: 2635–2642. [DOI] [PubMed] [Google Scholar]
53.Blum WF, Cao D, Hesse V, et al. A novel method for adult height prediction in children with idiopathic short stature. J Clin Endocrinol Metab 2022; 107: e2865–e2873. [Google Scholar]
54.Boersma B, Wit JM. Catch-up growth. Endocr Rev 1997; 18: 646–661. [DOI] [PubMed] [Google Scholar]
55.Wit JM, Boersma B. Catch-up growth: definition, mechanisms, and models. J Pediatr Endocrinol Metab 2002; 15: 1229–1241. [PubMed] [Google Scholar]
56.Albertsson-Wikland K, Karlberg J. Natural growth in children born small for gestational age with and without catch-up growth. Acta Paediatr Suppl 1994; 399: 64–70. [DOI] [PubMed] [Google Scholar]
57.Chun D, Kim SJ, Kim YH, et al. The estimation of pubertal growth spurt parameters using the superimposition by translation and rotation model in Korean children and adolescents: a longitudinal cohort study. Front Pediatr 2024; 12: 1372013. [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Ramnitz MS, Lodish MB. Racial disparities in pubertal development. Semin Reprod Med 2013; 31: 333–339. [DOI] [PubMed] [Google Scholar]
59.Meyer KA, Friend S, Hannan PJ, et al. Ethnic variation in body composition assessment in a sample of adolescent girls. Int J Pediatr Obes 2011; 6: 481–490. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sj-docx-1-dhj-10.1177_20552076251395975 - Supplemental material for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

sj-docx-1-dhj-10.1177_20552076251395975.docx^{(373.7KB, docx)}

sj-pdf-2-dhj-10.1177_20552076251395975 - Supplemental material for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

sj-pdf-2-dhj-10.1177_20552076251395975.pdf^{(546.7KB, pdf)}

sj-docx-3-dhj-10.1177_20552076251395975 - Supplemental material for Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

sj-docx-3-dhj-10.1177_20552076251395975.docx^{(34.5KB, docx)}

[bibr1-20552076251395975] 1.Norris SA, Frongillo EA, Black MM, et al. Nutrition in adolescent growth and development. Lancet 2022; 399: 172–184. [DOI] [PubMed] [Google Scholar]

[bibr2-20552076251395975] 2.Baxter-Jones AD, Faulkner RA, Forwood MR, et al. Bone mineral accrual from 8 to 30 years of age: an estimation of peak bone mass. J Bone Miner Res 2011; 26: 1729–1739. [DOI] [PubMed] [Google Scholar]

[bibr3-20552076251395975] 3.Hargreaves D, Mates E, Menon P, et al. Strategies and interventions for healthy adolescent growth, nutrition, and development. Lancet 2022; 399: 198–210. [DOI] [PubMed] [Google Scholar]

[bibr4-20552076251395975] 4.Saari A, Harju S, Mäkitie O, et al. Systematic growth monitoring for the early detection of celiac disease in children. JAMA Pediatr 2015; 169: e1525. [DOI] [PubMed] [Google Scholar]

[bibr5-20552076251395975] 5.Craig D, Fayter D, Stirk L, et al. Growth monitoring for short stature: update of a systematic review and economic model. Health Technol Assess 2011; 15: 1–162. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr6-20552076251395975] 6.Grote FK, Van Dommelen P, Oostdijk W, et al. Developing evidence-based guidelines for referral for short stature. Arch Dis Child 2008; 93: 212–217. [DOI] [PubMed] [Google Scholar]

[bibr7-20552076251395975] 7.Zhang Z, Lindstrom MJ, Farrell PM, et al. Pubertal height growth and adult height in cystic fibrosis after newborn screening. Pediatrics 2016; 137: e20152907. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr8-20552076251395975] 8.Collett-Solberg PF, Jorge AA, Boguszewski MC, et al. Growth hormone therapy in children; research and practice—a review. Growth Horm IGF Res 2019; 44: 20–32. [DOI] [PubMed] [Google Scholar]

[bibr9-20552076251395975] 9.Ostojic SM. Prediction of adult height by Tanner-Whitehouse method in young Caucasian male athletes. QJM 2013; 106: 341–345. [DOI] [PubMed] [Google Scholar]

[bibr10-20552076251395975] 10.Cuttler L, Silvers JB. Growth hormone treatment for idiopathic short stature: implications for practice and policy. Arch Pediatr Adolesc Med 2004; 158: 108–110. [DOI] [PubMed] [Google Scholar]

[bibr11-20552076251395975] 11.Bayley N, Pinneau SR. Tables for predicting adult height from skeletal age: revised for use with the Greulich-Pyle hand standards. J Pediatr 1952; 40: 423–441. [DOI] [PubMed] [Google Scholar]

[bibr12-20552076251395975] 12.Tanner JM, Whitehouse RH, Marshall WA, et al. Prediction of adult height from height, bone age, and occurrence of menarche, at ages 4 to 16 with allowance for midparent height. Arch Dis Child 1975; 50: 14–26. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr13-20552076251395975] 13.Roche AF, Wainer H, Thissen D. The RWT method for the prediction of adult stature. Pediatrics 1975; 56: 1026–1033. [PubMed] [Google Scholar]

[bibr14-20552076251395975] 14.Bull RK, Edwards PD, Kemp PM, et al. Bone age assessment: a large scale comparison of the Greulich and Pyle, and Tanner and Whitehouse (TW2) methods. Arch Dis Child 1999; 81: 172–173. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr15-20552076251395975] 15.Chávez-Vázquez AG, Klünder-Klünder M, Garibay-Nieto GN, et al. Evaluation of height prediction models: from traditional methods to artificial intelligence. Pediatr Res 2024; 95: 308–315. [DOI] [PubMed] [Google Scholar]

[bibr16-20552076251395975] 16.Prokop-Piotrkowska M, Marszałek-Dziuba K, Moszczyńska E, et al. Traditional and new methods of bone age assessment—an overview. J Clin Res Pediatr Endocrinol 2021; 13: 251–262. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr17-20552076251395975] 17.Topor LS, Feldman HA, Bauchner H, et al. Variation in methods of predicting adult height for children with idiopathic short stature. Pediatrics 2010; 126: 938–944. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr18-20552076251395975] 18.Bramswig JH, Fasse M, Holthoff ML, et al. Adult height in boys and girls with untreated short stature and constitutional delay of growth and puberty: accuracy of five different methods of height prediction. J Pediatr 1990; 117: 886–891. [DOI] [PubMed] [Google Scholar]

[bibr19-20552076251395975] 19.Bonfig W, Schwarz HP. Overestimation of final height prediction in patients with classical congenital adrenal hyperplasia using the Bayley and Pinneau method. J Pediatr Endocrinol Metab 2012; 25: 645–649. [DOI] [PubMed] [Google Scholar]

[bibr20-20552076251395975] 20.Johnson W, Stovitz SD, Choh AC, et al. Patterns of linear growth and skeletal maturation from birth to 18 years of age in overweight young adults. Int J Obes (Lond 2012; 36: 535–541. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr21-20552076251395975] 21.Dalskov S, Müller M, Ritz C, et al. Effects of dietary protein and glycaemic index on biomarkers of bone turnover in children. Br J Nutr 2013; 109: 1301–1312. [DOI] [PubMed] [Google Scholar]

[bibr22-20552076251395975] 22.Jamison DT, Breman JG, Measham AR, et al. Disease control priorities in developing countries. 2nd ed. Washington, DC: World Bank, 2006. [PubMed] [Google Scholar]

[bibr23-20552076251395975] 23.Alimujiang A, Colditz GA, Gardner JD, et al. Childhood diet and growth in boys in relation to timing of puberty and adult height: the Longitudinal Studies of Child Health and Development. Cancer Causes Control 2018; 29: 915–926. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr24-20552076251395975] 24.He Q, Karlberg J. BMI In childhood and its association with height gain, timing of puberty, and final height. Pediatr Res 2001; 49: 244–251. [DOI] [PubMed] [Google Scholar]

[bibr25-20552076251395975] 25.Brener A, Bello R, Lebenthal Y, et al. The impact of adolescent obesity on adult height. Horm Res Paediatr 2017; 88: 237–243. [DOI] [PubMed] [Google Scholar]

[bibr26-20552076251395975] 26.Chun D, Kim SJ, Suh J, et al. Timing, velocity, and magnitude of pubertal changes in body composition: a longitudinal study. Pediatr Res 2025; 97: 293–300. [DOI] [PubMed] [Google Scholar]

[bibr27-20552076251395975] 27.Jansen EC, Dunietz GL, Chervin RD, et al. Adiposity in adolescents: the interplay of sleep duration and sleep variability. J Pediatr 2018; 203: 309–316. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr28-20552076251395975] 28.Hanem LGE, Salvesen Ø, Juliusson PB, et al. Intrauterine metformin exposure and offspring cardiometabolic risk factors (PedMet study): a 5-10 year follow-up of the PregMet randomised controlled trial. Lancet Child Adolesc Health 2019; 3: 166–174. [DOI] [PubMed] [Google Scholar]

[bibr29-20552076251395975] 29.Rodríguez-Carmona Y, Meijer JL, Zhou Y, et al. Metabolomics reveals sex-specific pathways associated with changes in adiposity and muscle mass in a cohort of Mexican adolescents. Pediatr Obes 2022; 17: e12887. [DOI] [PubMed] [Google Scholar]

[bibr30-20552076251395975] 30.Atazadegan MA, Heidari-Beni M, Entezari MH, et al. Effects of synbiotic supplementation on anthropometric indices and body composition in overweight or obese children and adolescents: a randomized, double-blind, placebo-controlled clinical trial. World J Pediatr 2023; 19: 356–365. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr31-20552076251395975] 31.Martin DD, Schittenhelm J, Thodberg HH. Validation of adult height prediction based on automated bone age determination in the Paris Longitudinal Study of healthy children. Pediatr Radiol 2016; 46: 263–269. [DOI] [PubMed] [Google Scholar]

[bibr32-20552076251395975] 32.Thodberg HH, Jenni OG, Caflisch J, et al. Prediction of adult height based on automated determination of bone age. J Clin Endocrinol Metab 2009; 94: 4868–4874. [DOI] [PubMed] [Google Scholar]

[bibr33-20552076251395975] 33.Thodberg HH, Juul A, Lomholt J, et al. Adult height prediction models. In: Preedy VR. (ed) Handbook of growth and growth monitoring in health and disease. New York: Springer, 2012, pp.27–57. [Google Scholar]

[bibr34-20552076251395975] 34.Suh J, Heo J, Kim SJ, et al. Bone age estimation and prediction of final adult height using deep learning. Yonsei Med J 2023; 64: 679–686. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr35-20552076251395975] 35.Satoh M. Bone age: assessment methods and clinical applications. Clin Pediatr Endocrinol 2015; 24: 143–152. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr36-20552076251395975] 36.Shmoish M, German A, Devir N, et al. Prediction of adult height by machine learning technique. J Clin Endocrinol Metab 2021; 106: e2700–e2710. [DOI] [PubMed] [Google Scholar]

[bibr37-20552076251395975] 37.Mlakar M, Gradišek A, Luštrek M, et al. Adult height prediction using the growth curve comparison method. PLoS One 2023; 18: e0281960. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr38-20552076251395975] 38.Chun D, Chung T, Kang J, et al. Height estimation in children and adolescents using body composition big data: machine-learning and explainable artificial intelligence approach. Digit Health 2025; 11: 20552076251331879. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr39-20552076251395975] 39.Jung HW, Chun D, Choi JH, et al. Comparison of adult height prediction using bone age and body composition for growth assessment in Korean children. Sci Rep 2025; 15: 10581. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr40-20552076251395975] 40.Chun D, Kim SJ, Suh J, et al. Big data-based reference centiles for body composition in Korean children and adolescents: a cross-sectional study. BMC Pediatr 2024; 24: 23. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr41-20552076251395975] 41.World Health Organization. WHO child growth standards: growth velocity based on weight, length and head circumference: methods and development. Geneva: WHO, 2009. [Google Scholar]

[bibr42-20552076251395975] 42.Voss LD, Wilkin TJ, Bailey BJ, et al. The reliability of height and height velocity in the assessment of growth (the Wessex Growth Study). Arch Dis Child 1991; 66: 833–837. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr43-20552076251395975] 43.McCarthy A, Hughes R, Tilling K, et al. Birth weight; postnatal, infant, and childhood growth; and obesity in young adulthood: evidence from the Barry Caerphilly growth study. Am J Clin Nutr 2007; 86: 907–913. [DOI] [PubMed] [Google Scholar]

[bibr44-20552076251395975] 44.Ekelund U, Ong K, Linné Y, et al. Upward weight percentile crossing in infancy and early childhood independently predicts fat mass in young adults: the Stockholm Weight Development Study (SWEDES). Am J Clin Nutr 2006; 83: 324–330. [DOI] [PubMed] [Google Scholar]

[bibr45-20552076251395975] 45.Karim MR, Islam T, Shajalal M, et al. Explainable AI for bioinformatics: methods, tools and applications. Brief Bioinform 2023; 24: bbad236. [DOI] [PubMed] [Google Scholar]

[bibr46-20552076251395975] 46.Gu D, Su K, Zhao H. A case-based ensemble learning system for explainable breast cancer recurrence prediction. Artif Intell Med 2020; 107: 101858. [DOI] [PubMed] [Google Scholar]

[bibr47-20552076251395975] 47.Liu Y, Herrin J, Huang C, et al. Nonexercise machine learning models for maximal oxygen uptake prediction in national population surveys. J Am Med Inform Assoc 2023; 30: 943–952. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr48-20552076251395975] 48.Javidi H, Mariam A, Alkhaled L, et al. An interpretable predictive deep learning platform for pediatric metabolic diseases. J Am Med Inform Assoc 2024; 31: 1227–1238. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr49-20552076251395975] 49.Moncada-Torres A, van Maaren MC, Hendriks MP, et al. Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival. Sci Rep 2021; 11: 6968. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr50-20552076251395975] 50.Seethi VDR, LaCasse Z, Chivte P, et al. An explainable AI approach for diagnosis of COVID-19 using MALDI-ToF mass spectrometry. Expert Syst Appl 2024; 236: 121226. [Google Scholar]

[bibr51-20552076251395975] 51.Jeong KH, Jung JH, Kim JM, et al. Near final height in Korean children referred for evaluation of short stature: a single tertiary center experience. Ann Pediatr Endocrinol Metab 2018; 23: 28–32. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr52-20552076251395975] 52.Lolli L, Batterham AM, Weston KL, et al. Tanner-Whitehouse and Modified Bayley-Pinneau adult height predictions in elite youth soccer players from the Middle East. Med Sci Sports Exerc 2021; 53: 2635–2642. [DOI] [PubMed] [Google Scholar]

[bibr53-20552076251395975] 53.Blum WF, Cao D, Hesse V, et al. A novel method for adult height prediction in children with idiopathic short stature. J Clin Endocrinol Metab 2022; 107: e2865–e2873. [Google Scholar]

[bibr54-20552076251395975] 54.Boersma B, Wit JM. Catch-up growth. Endocr Rev 1997; 18: 646–661. [DOI] [PubMed] [Google Scholar]

[bibr55-20552076251395975] 55.Wit JM, Boersma B. Catch-up growth: definition, mechanisms, and models. J Pediatr Endocrinol Metab 2002; 15: 1229–1241. [PubMed] [Google Scholar]

[bibr56-20552076251395975] 56.Albertsson-Wikland K, Karlberg J. Natural growth in children born small for gestational age with and without catch-up growth. Acta Paediatr Suppl 1994; 399: 64–70. [DOI] [PubMed] [Google Scholar]

[bibr57-20552076251395975] 57.Chun D, Kim SJ, Kim YH, et al. The estimation of pubertal growth spurt parameters using the superimposition by translation and rotation model in Korean children and adolescents: a longitudinal cohort study. Front Pediatr 2024; 12: 1372013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bibr58-20552076251395975] 58.Ramnitz MS, Lodish MB. Racial disparities in pubertal development. Semin Reprod Med 2013; 31: 333–339. [DOI] [PubMed] [Google Scholar]

[bibr59-20552076251395975] 59.Meyer KA, Friend S, Hannan PJ, et al. Ethnic variation in body composition assessment in a sample of adolescent girls. Int J Pediatr Obes 2011; 6: 481–490. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Artificial intelligence for pediatric height prediction using large-scale longitudinal body composition data

Dohyun Chun

Hae Woon Jung

Jongho Kang

Woo Young Jang

Jihun Kim

Abstract

Objective

Materials and Methods

Results

Conclusion

Introduction

Data and methods

Study population

Data collection and measurements

Data preparation

Figure 1.

Prediction model

Figure 2.

Explainable AI

Personalized growth curve estimation

Results

Baseline characteristics

Table 1.

Prediction accuracy

Table 2.

Figure 3.

Feature importance

Figure 4.

Accumulated local effect

Figure 5.

Shapley additive explanations

Figure 6.

Individual height prediction curves

Figure 7.

Discussion

Conclusion

Supplemental Material

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases