Abstract
BACKGROUND/OBJECTIVES
Although numerous equations to predict percent body fat have been published, few have broad generalizability. The objective of this study was to develop sets of equations that are generalizable to the American population 8 years of age and older.
SUBJECTS/METHODS
Dual-emission X-ray absorptiometry (DXA) assessed percent body fat from the 1999–2006 National Health and Nutrition Examination Survey (NHANES) was used as the response variable for development of 14 equations for each gender that included between 2 and 10 anthropometrics. Other candidate variables included demographics and menses. Models were developed using the Least Absolute Shrinkage and Selection Operator (LAASO) and validated in a ¼ withheld sample randomly selected from 11 884 males or 9215 females.
RESULTS
In the final models, R2 ranged from 0.664 to 0.845 in males and from 0.748 to 0.809 in females. R2 was not notably improved by development of equations within, rather than across, age and ethnic groups. Systematic over or under estimation of percent body fat by age and ethnic groups was within 1 percentage point. Seven of the fourteen gender-specific models had R2 values above 0.80 in males and 0.795 in females and exhibited low bias by age, race/ethnicity and body mass index (BMI).
CONCLUSIONS
To our knowledge, these are the first equations that have been shown to be valid and unbiased in both youth and adults in estimating DXA assessed body fat. The equations developed here are appropriate for use in multiple ethnic groups, are generalizable to the US population and provide a useful method for assessment of percent body fat in settings where methods such as DXA are not feasible.
INTRODUCTION
Although classic definitions of obesity emphasize adiposity, in practice a body mass index (BMI: weight in kg/height in m2) of ≥30 kg m−2 is currently the measure most often used to diagnose obesity. As BMI does not distinguish fat from lean tissue, some misclassification of obesity (defined as excess adiposity) is inevitable. Adiposity can be accurately assessed in humans in many research and clinical settings, but the most accurate techniques are often not feasible outside these setting because they require relatively expensive equipment, trained technicians and a high level of subject cooperation. Numerous equations have been developed to predict percent body fat that use anthropometric measurements that are feasible to collect in home, school and other community settings.1–8 Most of these equations were developed in small or moderately sized samples that were recruited by convenience and usually limited to a specific and narrowly defined group. It is well known that associations between anthropometric measurements and percent body fat can differ importantly by gender, age and race/ethnicity; and therefore, it is necessary to match these characteristics between the sample in which an equation was developed and the individuals to which it is applied.
In the last 7 years, five groups of investigators have developed equations to predict percent body fat using data from the 1999–2004 National Health and Nutrition Examination Survey (NHANES) in youth4,5 and in adults.6–8 All used percent body fat measured by dual-emission X-ray absorptiometry (DXA) as the criterion measure and included demographic and anthropometric measures in prediction equations. Four of these groups studied at most only 3 of the 10 anthropometric measurements available in NHANES. Zanovec et al.6 examined equations that included either BMI or waist as the only anthropometric variables, whereas Li et al.7 used BMI and triceps skinfold. Dugas et al.4 examined BMI (with exponents of ½, −1 and −2) and body weight in selected combinations. Zanovec et al.6 and Li et al.7 studied only linear, main effects. All of these studies combined data across gender in their analyses. Heo et al.8 stratified by gender, age group and race/ethnicity to create 18 equations using only BMI −1. None of the four papers mentioned above performed internal or external validation of the equations developed or examined potential bias in the estimates across key subgroups.
The fifth set of equations for prediction of percent body fat developed from the 1999–2004 NHANES were developed by Stevens et al. in youth 8–17 years of age, and were intended for use by investigators in the Childhood Obesity Prevention and Treatment (COPTR) Consortium.5,9 Explanatory variables in the gender-specific equations were limited to those collected by the COPTR investigators, which included demographics plus four anthropometric measurements (height, weight, waist circumference and triceps skinfold). Forward and backward selection was used to develop equations in 2/3 of the sample, and the remaining 1/3 of the sample provided internal validation. Bias across race/ethnic groups and BMI categories was examined and influenced the selection of the final equations.
None of the five studies fully tapped the potential of the NHANES data. The purpose of this study was to systematically construct equations to predict percent body fat studying all 10 of the anthropometric measurements in NHANES, as well as subsets, using strategies that thoroughly search for and accommodate much more complex relationships than equations previously developed. We included candidate variables in non-linear forms and interactions and performed term selection using the Least Absolute Shrinkage and Selection Operator (LASSO) technique.10 We strived to develop multiple gender-specific equations, each of which used a different set of variables and each of which is appropriate for use in individuals 8 years of age and older. In addition, we used data from the 1999–2006 NHANES, adding 2 years of information to that included in previous work.
MATERIALS AND METHODS
Data for this study were from the 1999–2006 NHANES. The NHANES used a complex, multistage, probability, sampling design to provide a representative sample of US non-institutionalized children and adults.11 Race and ethnicity were self-reported and categorized as non-Hispanic Whites, non-Hispanic Blacks, Mexican Americans, other Hispanics and other race/ethnicities. We followed the NCHS (National Center for Health Statistics) recommendation to not separately analyze the other Hispanic or the other race/ethnicities groups due to small sample sizes.12
Girls over 12 years of age were asked the age when their first menstrual period occurred. Using this information, we created a dichotomous variable indicating the presence or absence of menarche. Age was used as a continuous variable and as a dichotomous variable indicating youth (8–19) or adult (≥20 years). Ten anthropometrics were measured using standardized procedures:11 height, weight, triceps and subscapular skinfolds, waist, maximal calf, arm and thigh circumferences, and upper arm and upper leg lengths.
DXA measurements were obtained on participants 8 years of age or older using a Hologic QDR-4500 A fan-beam densitometer (Hologic, Inc., Bedford, MA, USA). Data were adjusted as described by Schoeller et al.13 Participants were excluded from DXA measurement if pregnant, had amputations other than fingers and toes, had self-reported history of radiographic contrast material use in past 7 days or participation in nuclear medicine studies in the past 3 days, weighed over 300 pounds or had a height over 6′5″. The imputation of missing DXA measurements is described in technical documents.12 In the text that follows we call both imputed and measured DXA assessed percent body fat ‘observed’, for the purpose of differentiating observed values from the values predicted using the equations developed here. Unresolved IRB issues concerning the reporting of pregnancy test results to minors resulted in no DXA data in females 8–17 years of age in 1999. Since NHANES data were weighted by 2-year increments, there are no public use DXA data available for girls 8–17 years from the 1999–2000 survey. In addition, DXA data were available for individuals ≥70 years of age only in the 1999–2004 surveys.
Analytic sample
There were 31 194 men and women 8 years and older in the 1999–2006 NHANES data with a positive survey weight. After exclusions, the analysis sample included 21 099 participants (exclusion details in Supplementary information 1). Here we use the term ‘eligible sample’ to indicate the sample from which we generalize to the population of Americans who are ≥8 years of age; females who were not pregnant, did not give birth in the last year and were not currently breastfeeding; amputees if they had lost no more than fingers and toes; and those who were ≤300 pounds and less than 6′5″. Because 22% of the eligible sample was excluded in our analysis, we adjusted the sampling weights as recommended by NCHS12 when more than 10% of the eligible sample is excluded and missing is not completely at random (details of method in Supplementary information 2).
Analysis plan
In this work, we distinguish variables (for example, race/ethnicity and weight) from terms (for example, squared terms and interaction terms). The variables used were age, race/ethnicity, menarche status (females only), the 10 NHANES anthropometric variables and BMI (called a variable here). We selected terms to study based on our review of terms used in published equations and our own exploratory analyses (terms in Supplementary information 3). The maximum number of terms tested was 1335 for males and 1402 for females. We conducted model selection with 14 different subsets of candidate variables (models A–N), chosen based on the combinations of variables we judged most likely to be generally available in other studies and with the specific variables measured in selected large cohort studies (The Atherosclerosis Risk in Communities study, the Coronary Artery Risk Development in Young Adults study and the Fels Longitudinal study). For comparison, we examined BMI alone in the linear form (model O). All analyses took into account survey design and multiple imputation.
The following steps outline our approach:
Step 1. Create development and validation data sets and adjusted sampling weights
We used the PROC SURVEYSELECT procedure in SAS (SAS/STAT 9.2 User’s Guide, 2011) to create the development or fitting data set containing a random sample of ¾ of the sample. The remaining ¼ of participants constituted the validation data set. All analyses were stratified by gender.
Step 2. Generate models in development data set
We used the LASSO technique to select models for this project because it can incorporate multiple imputation, accommodates sampling weights, handles large numbers of terms and is computationally efficient.10 Precautions were taken to prevent overfitting. We compared the adjusted R2 in the model selected by LASSO with the minimal cross-validation error (CVmin) to that of the model with cross-validation error that was up to, but not more than 1 standard error (s.e.) larger than the minimum (CVmin+1 s.e.).14 If the difference between the adjusted R2 was at least 0.01, then we chose (CVmin+1 s.e.) as the final model. If not, we examined additional models that further increased the cross-validation error in increments of 0.25 s.e. and selected the model with the largest SE that had an adjusted R2 that was reduced by up to 0.01 compared with the CVmin model. In the rare instance when the R2 was the same to the third decimal place between two such candidate models, we chose the model with the larger s.e.
Step 3. Evaluate equations in the validation data sets
The estimates for the intercept and coefficients for the terms in models calculated in the fitting data set were used to calculate the predicted percent body fat in the validation data set. Then, gender-specific univariate regression models were run using the predicted percent body fat as the only independent variable and DXA as the dependent variable. We compared models created in the full, gender-specific fitting data with those created in subsets of the fitting data formed by age and race/ethnicity groups, and the more generalizable model was preferred if the R2 was reduced by 0.02 or less compared with the model developed in a subset. Models with root mean square error (RMSE) estimates <3 percentage points of body fat were considered as excellent, whereas those with RMSE between 3 and 4 were considered as good. Mean signed differences (MSD) were calculated as the percent body fat from an equation minus percent body fat by DXA, overall and by age group, race/ethnicity and BMI category. We also estimated differential bias within categories of age, ethnicity and BMI by calculating the differences in MSD values (for example, the MSD in youth minus the MSD in adults). An MSD calculated within a subgroup or category that was outside the bounds of ± 1 body fat percentage point was considered biased.1
Step 4. Obtain and examine final equations in a data set that included both the fitting and validation data sets
To estimate the coefficients with greater precision, we ran the models by gender (and over age and race/ethnicity subgroups) in the combined fitting and validation data. Performance statistics were calculated using models in the full data.
RESULTS
The sample was predominantly White and over half were either overweight or obese (Table 1). DXA-measured (or imputed) body fat averaged 27.3% in males and 38.4% in females. As expected, skinfold thickness tended to be greater in females, whereas height, weight and circumferences tended to be greater in males.
Table 1.
Males (n = 11 884a)
|
Females (n = 9215a)
|
|||
---|---|---|---|---|
Mean or % | SE | Mean or % | SE | |
Age (years) | 38.7 | 0.29 | 42.1 | 0.38 |
Youth (%) | 20.9 | 0.47 | 19.3 | 0.58 |
Menses (%) | 62.7 | 1.07 | ||
Race/Ethnicity (%) | ||||
White | 69.6 | 1.41 | 69.7 | 1.69 |
Black | 10.9 | 0.85 | 12.0 | 1.07 |
Mexican | 9.2 | 0.81 | 7.1 | 0.82 |
Multi-ethnic | 10.2 | 0.97 | 11.1 | 1.12 |
Height (cm) | 172.6 | 0.14 | 160.6 | 0.12 |
Weight (kg) | 78.6 | 0.28 | 67.6 | 0.40 |
BMI (kg m−2) | 26.0 | 0.08 | 26.1 | 0.14 |
BMI category (%) | ||||
Underweight | 1.8 | 0.15 | 2.6 | 0.19 |
Normal weight | 37.3 | 0.68 | 43.3 | 1.09 |
Overweight | 38.9 | 0.65 | 29.0 | 0.74 |
Obese | 22.1 | 0.72 | 25.1 | 0.97 |
DXA % body fatb | 27.3 | 0.10 | 38.4 | 0.16 |
White | 27.7 | 0.12 | 38.4 | 0.22 |
Black | 24.6 | 0.19 | 38.4 | 0.33 |
Mexican | 27.8 | 0.19 | 38.9 | 0.39 |
Youth | 25.3 | 0.18 | 32.9 | 0.18 |
White | 25.3 | 0.29 | 32.7 | 0.30 |
Black | 23.0 | 0.24 | 32.3 | 0.27 |
Mexican | 27.9 | 0.21 | 34.7 | 0.33 |
Adults | 27.8 | 0.11 | 39.7 | 0.17 |
White | 28.2 | 0.12 | 39.6 | 0.22 |
Black | 25.2 | 0.21 | 40.3 | 0.37 |
Mexican | 27.8 | 0.26 | 40.7 | 0.40 |
Triceps skinfold (mm) | 14.1 | 0.10 | 22.9 | 0.17 |
Subscapular skinfold (mm) | 17.9 | 0.13 | 20.1 | 0.20 |
Waist (cm) | 93.4 | 0.25 | 88.1 | 0.35 |
Maximal calf circumference (cm) | 37.8 | 0.06 | 36.9 | 0.10 |
Arm circumference (cm) | 32.0 | 0.07 | 30.1 | 0.12 |
Thigh circumference (cm) | 51.8 | 0.09 | 51.1 | 0.17 |
Upper arm length (cm) | 38.1 | 0.05 | 35.3 | 0.05 |
Upper leg length (cm) | 41.8 | 0.08 | 38.4 | 0.09 |
Number in sample without application of sampling weights.
Average of five imputed values.
Supplementary information 4 shows results from 14 models (A–N) developed in the fitting sample and applied to the validation sample with R2 calculated overall and within age, race/ethnicity subgroups in males and females. An example of the application of the rules used to select among the models is given in Supplementary information 5. In males, the BMI only comparison model (model O) had the lowest R2 (0.436) overall and in the age- and race/ethnicity-specific results. Several models produced R2 values >0.8 both overall (models A, B, D, F, G, H and I) and within subgroups of males. Performance tended to be superior in boys compared with adult males and in Whites compared with Blacks and Mexican Americans. When applied to the data stratified by both race/ethnicity and age category, the R2 values tended to be lowest in Mexican American men and highest in Mexican American boys with the median difference in the R2 in those two groups across the 14 models being 0.1245 (model O was not included in these estimates).
In the overall estimates, only one model in females produced an R2 over 0.8, however, 7 of the 14 chosen models produced estimates over 0.79 and 11 of the 14 models had R2 values over 0.75. Similar to males, performance of the equations was generally stronger in younger than in older females (exception was models K and N). Different from males, R2 values tended to be higher in Blacks than in Mexican Americans, with results in Whites varied. Performance tended to be less strong in Black and Mexican American women compared with the other subgroups. In both males and females, models that included a skinfold measurement tended to perform better than those that did not.
We compared the R2 estimates of equations developed in the full gender-specific fitting sample (over age and ethnic groups) with that of equations developed using data only from the age or race/ethnic group to which they were applied in the validation step. In males (Figure 1), the age and race/ethnic-specific R2 values were within ± 0.02. In girls (Figure 2), the age-specific analyses for model E in White girls, K and N in Black girls and A in Mexican American girls estimate produced R2 that was slightly better than the overall female equation. In contrast, the R2 was over 0.02 larger in the equation developed in all females than the race/ethnic-specific equations in Mexican American girls for models B, D, K, M and N.
We explored systematic differences in the prediction of percent body fat in subgroups categorized by age, ethnicity and BMI by examining MSD. As an illustration, the results of this analysis for BMI (model O) are shown in Figure 3. For both genders, the point estimate of the bias by age was outside the limit of ± 1 percentage point of body fat and statistically different from zero (P<0.05). Examination of MSD by ethnic groups showed that percent body fat was overestimated by BMI in Blacks in both males and females. Within BMI categories, BMI overestimated percent body fat in normal weight males and underestimated percent body fat in obese males by an amount only slightly exceeding 1 percentage point (−1.02 and 1.07 percentage points, respectively). BMI overestimated percent body fat in underweight women and underestimated percent body fat in overweight women by more than 1 percentage point. Similar analyses done examining equations A–N showed no systematic differences by age group or ethnicity that was as large as 1 body fat percentage point. There were also no systematic differences that large by BMI categories in women. However, in men four final models underestimated percent body fat in the underweight groups (K, L, M and N) deviating by −1.33 (CI: −2.38, −0.29), −1.09 (CI: −2.12, −0.05), −1.26 (CI: −2.06, −0.46) and −1.66 (CI: −2.58, −0.75) percentage points, respectively.
We also examined differential bias by categories within age, ethnicity and BMI. We found no evidence of differential error as large as 1 percentage point by age (young versus old) or ethnicity (White versus Black and White versus Mexican American). There were also no differential error estimates as large as 1 percentage point between normal weight and overweight participants or between normal weight and obese participants.
We combined the fitting and validation samples and recalculated the coefficients using the models that had been selected in the fitting data. Tables 2 and 3 show the R2 and RMSE estimates for the full data. As expected, the R2 and RMSE estimates were generally intermediate between those found in the fitting data and the validation data (fitting results not shown). Most of the RMSE estimates for developed equations were between 3.5 and 2.5 percent body fat, indicating good to excellent performance. The terms and coefficients for selected, better performing equations are presented in Supplementary information 6.
Table 2.
Modela | Overall
|
Age
|
Race/Ethnicity
|
Age × Race/Ethnicity
|
||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
8–19 years
|
≥20 years
|
White
|
Black
|
Mexican
|
White-Boys
|
White-Men
|
Black-Boys
|
Black-Men
|
Mexican-Boys
|
Mexican-Men
|
||||||||||||||
R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | |
A | 0.845 | 2.52 | 0.886 | 2.63 | 0.821 | 2.49 | 0.839 | 2.52 | 0.864 | 2.54 | 0.835 | 2.46 | 0.881 | 2.59 | 0.816 | 2.50 | 0.889 | 2.67 | 0.845 | 2.49 | 0.889 | 2.65 | 0.787 | 2.38 |
B | 0.842 | 2.55 | 0.882 | 2.67 | 0.818 | 2.51 | 0.836 | 2.54 | 0.863 | 2.56 | 0.832 | 2.48 | 0.876 | 2.65 | 0.814 | 2.52 | 0.884 | 2.72 | 0.845 | 2.49 | 0.889 | 2.65 | 0.783 | 2.41 |
C | 0.793 | 2.91 | 0.839 | 3.12 | 0.765 | 2.86 | 0.786 | 2.90 | 0.822 | 2.91 | 0.789 | 2.77 | 0.832 | 3.07 | 0.760 | 2.86 | 0.844 | 3.16 | 0.804 | 2.80 | 0.845 | 3.14 | 0.741 | 2.63 |
D | 0.840 | 2.56 | 0.879 | 2.70 | 0.817 | 2.52 | 0.834 | 2.56 | 0.860 | 2.58 | 0.832 | 2.48 | 0.872 | 2.69 | 0.813 | 2.53 | 0.883 | 2.73 | 0.841 | 2.52 | 0.887 | 2.67 | 0.783 | 2.40 |
E | 0.769 | 3.08 | 0.807 | 3.42 | 0.743 | 2.99 | 0.764 | 3.05 | 0.790 | 3.16 | 0.764 | 2.94 | 0.801 | 3.35 | 0.740 | 2.97 | 0.808 | 3.50 | 0.772 | 3.02 | 0.817 | 3.41 | 0.719 | 2.74 |
F | 0.840 | 2.57 | 0.882 | 2.67 | 0.814 | 2.54 | 0.833 | 2.57 | 0.859 | 2.59 | 0.832 | 2.48 | 0.876 | 2.64 | 0.809 | 2.55 | 0.884 | 2.73 | 0.839 | 2.53 | 0.890 | 2.64 | 0.782 | 2.41 |
G | 0.842 | 2.55 | 0.882 | 2.67 | 0.817 | 2.52 | 0.835 | 2.55 | 0.862 | 2.56 | 0.831 | 2.48 | 0.875 | 2.65 | 0.813 | 2.53 | 0.885 | 2.71 | 0.843 | 2.50 | 0.889 | 2.65 | 0.781 | 2.41 |
H | 0.805 | 2.83 | 0.856 | 2.95 | 0.775 | 2.79 | 0.796 | 2.83 | 0.827 | 2.87 | 0.797 | 2.72 | 0.849 | 2.92 | 0.768 | 2.81 | 0.862 | 2.97 | 0.800 | 2.83 | 0.855 | 3.03 | 0.747 | 2.59 |
I | 0.836 | 2.59 | 0.878 | 2.72 | 0.811 | 2.56 | 0.829 | 2.59 | 0.856 | 2.62 | 0.829 | 2.50 | 0.872 | 2.68 | 0.806 | 2.57 | 0.880 | 2.77 | 0.836 | 2.56 | 0.887 | 2.68 | 0.778 | 2.43 |
J | 0.766 | 3.10 | 0.804 | 3.44 | 0.741 | 3.00 | 0.761 | 3.07 | 0.789 | 3.17 | 0.763 | 2.94 | 0.797 | 3.39 | 0.738 | 2.99 | 0.806 | 3.51 | 0.772 | 3.02 | 0.816 | 3.41 | 0.717 | 2.75 |
K | 0.682 | 3.61 | 0.735 | 4.00 | 0.647 | 3.50 | 0.675 | 3.58 | 0.699 | 3.78 | 0.695 | 3.34 | 0.724 | 3.95 | 0.643 | 3.49 | 0.737 | 4.10 | 0.666 | 3.65 | 0.747 | 4.00 | 0.649 | 3.06 |
L | 0.764 | 3.11 | 0.801 | 3.47 | 0.738 | 3.01 | 0.759 | 3.08 | 0.784 | 3.20 | 0.762 | 2.95 | 0.793 | 3.42 | 0.736 | 3.00 | 0.802 | 3.55 | 0.767 | 3.05 | 0.815 | 3.42 | 0.716 | 2.75 |
M | 0.789 | 2.94 | 0.854 | 2.97 | 0.751 | 2.94 | 0.778 | 2.96 | 0.810 | 3.00 | 0.792 | 2.76 | 0.846 | 2.95 | 0.743 | 2.96 | 0.863 | 2.96 | 0.771 | 3.02 | 0.862 | 2.96 | 0.732 | 2.67 |
N | 0.664 | 3.71 | 0.728 | 4.06 | 0.623 | 3.62 | 0.658 | 3.67 | 0.675 | 3.93 | 0.682 | 3.41 | 0.716 | 4.00 | 0.621 | 3.59 | 0.730 | 4.15 | 0.631 | 3.84 | 0.738 | 4.07 | 0.632 | 3.13 |
O | 0.430 | 4.84 | 0.285 | 6.57 | 0.521 | 4.07 | 0.447 | 4.66 | 0.444 | 5.14 | 0.371 | 4.79 | 0.261 | 6.46 | 0.530 | 4.00 | 0.332 | 6.52 | 0.549 | 4.24 | 0.343 | 6.45 | 0.592 | 3.30 |
Abbreviations: BMI, body mass index; NHANES, National Health and Nutrition Examination Survey; RMSE, root mean square error. Model A: age, race, height, weight, BMI, triceps, subscapular, waist, calf circumference, arm circumference, thigh circumference, arm length and leg length; Model B: age, race, height, weight, BMI, triceps, subscapular, waist, calf circumference, arm circumference and thigh circumference; Model C: age, race, height, weight, BMI, subscapular, waist, calf circumference, arm circumference and thigh circumference; Model D: age, race, height, weight, BMI, triceps, waist, calf circumference, arm circumference and thigh circumference; Model E: age, race, height, weight, BMI, waist, calf circumference, arm circumference and thigh circumference; Model F: age, race, height, weight, BMI, triceps, subscapular, waist and calf circumference; Model G: age, race, height, weight, BMI, triceps, subscapular, waist and arm circumference; Model H: age, race, height, weight, BMI, triceps, subscapular and arm circumference; Model I: age, race, height, weight, BMI, triceps and waist; Model J: age, race, height, weight, BMI, waist and arm circumference; Model K: age, race, height, BMI and arm circumference; Model L: age, race, height, weight, BMI and waist; Model M: age, race, height, weight, BMI and triceps; Model N: age, race, height, weight and BMI; Model O: BMI.
All candidate variables were retained by the selection process for all models.
Table 3.
Modela | Overall
|
Age
|
Race/Ethnicity
|
Age × Race/Ethnicity
|
||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
8–19 years
|
≥20 years
|
White
|
Black
|
Mexican
|
White-Girls
|
White-Women
|
Black-Girls
|
Black-Women
|
Mexican-Girls
|
Mexican-Women
|
||||||||||||||
R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | R2 | RMSE | |
A | 0.809 | 3.16 | 0.819 | 2.91 | 0.766 | 3.22 | 0.816 | 3.12 | 0.819 | 3.31 | 0.792 | 3.10 | 0.824 | 2.80 | 0.781 | 3.19 | 0.827 | 3.14 | 0.747 | 3.36 | 0.809 | 2.92 | 0.706 | 3.17 |
B | 0.807 | 3.17 | 0.823 | 2.88 | 0.765 | 3.23 | 0.816 | 3.12 | 0.813 | 3.31 | 0.784 | 3.13 | 0.827 | 2.78 | 0.782 | 3.19 | 0.833 | 3.08 | 0.741 | 3.38 | 0.812 | 2.90 | 0.697 | 3.23 |
C | 0.783 | 3.36 | 0.783 | 3.19 | 0.739 | 3.40 | 0.791 | 3.32 | 0.788 | 3.53 | 0.760 | 3.30 | 0.788 | 3.07 | 0.756 | 3.37 | 0.790 | 3.46 | 0.714 | 3.55 | 0.768 | 3.23 | 0.676 | 3.33 |
D | 0.806 | 3.18 | 0.817 | 2.93 | 0.763 | 3.24 | 0.814 | 3.14 | 0.812 | 3.32 | 0.781 | 3.16 | 0.822 | 2.82 | 0.780 | 3.20 | 0.828 | 3.12 | 0.741 | 3.38 | 0.805 | 2.95 | 0.695 | 3.24 |
E | 0.769 | 3.47 | 0.751 | 3.42 | 0.727 | 3.48 | 0.779 | 3.42 | 0.771 | 3.67 | 0.742 | 3.43 | 0.755 | 3.31 | 0.746 | 3.44 | 0.759 | 3.70 | 0.696 | 3.66 | 0.742 | 3.40 | 0.656 | 3.43 |
F | 0.800 | 3.23 | 0.817 | 2.93 | 0.755 | 3.30 | 0.808 | 3.19 | 0.804 | 3.40 | 0.779 | 3.17 | 0.822 | 2.82 | 0.772 | 3.26 | 0.829 | 3.12 | 0.726 | 3.48 | 0.806 | 2.94 | 0.691 | 3.26 |
G | 0.799 | 3.23 | 0.820 | 2.90 | 0.754 | 3.31 | 0.807 | 3.19 | 0.807 | 3.36 | 0.775 | 3.20 | 0.824 | 2.80 | 0.771 | 3.27 | 0.835 | 3.06 | 0.730 | 3.45 | 0.808 | 2.93 | 0.682 | 3.30 |
H | 0.795 | 3.26 | 0.810 | 2.98 | 0.751 | 3.33 | 0.803 | 3.23 | 0.806 | 3.38 | 0.771 | 3.23 | 0.815 | 2.88 | 0.767 | 3.29 | 0.825 | 3.15 | 0.731 | 3.44 | 0.793 | 3.04 | 0.683 | 3.30 |
I | 0.797 | 3.25 | 0.813 | 2.97 | 0.752 | 3.32 | 0.805 | 3.21 | 0.805 | 3.39 | 0.769 | 3.24 | 0.817 | 2.86 | 0.769 | 3.28 | 0.828 | 3.12 | 0.728 | 3.46 | 0.799 | 3.00 | 0.676 | 3.33 |
J | 0.761 | 3.53 | 0.748 | 3.44 | 0.716 | 3.55 | 0.770 | 3.48 | 0.764 | 3.73 | 0.732 | 3.49 | 0.752 | 3.33 | 0.735 | 3.52 | 0.761 | 3.69 | 0.683 | 3.74 | 0.735 | 3.44 | 0.643 | 3.50 |
K | 0.752 | 3.59 | 0.722 | 3.61 | 0.710 | 3.59 | 0.761 | 3.55 | 0.758 | 3.77 | 0.721 | 3.56 | 0.725 | 3.50 | 0.727 | 3.56 | 0.733 | 3.89 | 0.685 | 3.73 | 0.704 | 3.64 | 0.639 | 3.52 |
L | 0.757 | 3.56 | 0.741 | 3.49 | 0.712 | 3.58 | 0.766 | 3.52 | 0.759 | 3.77 | 0.731 | 3.49 | 0.745 | 3.37 | 0.729 | 3.55 | 0.745 | 3.81 | 0.680 | 3.75 | 0.734 | 3.45 | 0.642 | 3.50 |
M | 0.791 | 3.30 | 0.798 | 3.07 | 0.747 | 3.35 | 0.799 | 3.26 | 0.802 | 3.41 | 0.763 | 3.28 | 0.803 | 2.97 | 0.764 | 3.31 | 0.815 | 3.24 | 0.728 | 3.46 | 0.778 | 3.15 | 0.676 | 3.33 |
N | 0.748 | 3.62 | 0.716 | 3.65 | 0.706 | 3.61 | 0.757 | 3.59 | 0.753 | 3.80 | 0.721 | 3.56 | 0.719 | 3.54 | 0.723 | 3.59 | 0.723 | 3.97 | 0.682 | 3.74 | 0.702 | 3.65 | 0.640 | 3.51 |
O | 0.656 | 4.24 | 0.637 | 4.13 | 0.606 | 4.18 | 0.667 | 4.20 | 0.679 | 4.34 | 0.650 | 3.98 | 0.664 | 3.87 | 0.622 | 4.19 | 0.676 | 4.29 | 0.595 | 4.23 | 0.617 | 4.14 | 0.566 | 3.86 |
Abbreviations: BMI, body mass index; NHANES, National Health and Nutrition Examination Survey; RMSE, root mean square error. Model A: age, race, height, (weight), BMI, triceps, subscapular, waist, calf circumference, (arm circumference), thigh circumference, arm length, leg length and menses; Model B: age, race, height, (weight), BMI, triceps, subscapular, waist, calf circumference, arm circumference and thigh circumference; Model C: age, race, height, weight, BMI, subscapular, waist, calf circumference, arm circumference and thigh circumference; Model D: age, race, height, weight, BMI, triceps, waist, calf circumference, arm circumference and thigh circumference; Model E: age, race, height, weight, BMI, waist, calf circumference, arm circumference and thigh circumference; Model F: age, race, height, weight, BMI, triceps, subscapular, waist and calf circumference; Model G: age, race, height, weight, BMI, triceps, subscapular, waist and arm circumference; Model H: age, race, height, weight, BMI, triceps, subscapular, arm circumference Model I: age, race, height, weight, BMI, triceps and waist; Model J: age, race, height, weight, BMI, waist, arm circumference Model K: age, race, height, (weight) BMI and arm circumference; Model L: age, race, height, weight, BMI and waist; Model M: age, race, height, weight, BMI and triceps; Model N: age, race, height, weight and BMI; Model O: BMI.
Candidate variables not retained by the selection process are shown in parentheses.
DISCUSSION
In this work, we developed 28 equations for the prediction of percent body fat in children and adults. For the final gender-specific models, the adjusted R2 ranged from 0.664 to 0.845 in males and from 0.748 to 0.809 in females. BMI alone produced an R2 of 0.430 in males and 0.656 in females. The addition of triceps and subscapular skinfolds to the candidate variables of demographics, height weight and BMI improved performance more than the addition of up to four circumference measurements. R2 values for each set of variables were higher in males than in females, and in youth than in adults. Our examination of the performance of equations within age and race/ethnicity subgroups provided evidence that the equations can be applied with relatively good validity across a wide range of age, race/ethnic and BMI groups and within youth, adults, BMI categories and three race/ethnicities. Equations A, B, D, F, G, H and I performed strongly in both males (adjusted R2 ≥0.805) and females (adjusted R2 ≥0.795). Four models (K, L, M and N) underestimated percent body fat in men by more than 1 percentage point in the underweight group, indicating that that these models should be used with caution if estimates in underweight men are of special interest. Otherwise, the bias by subgroups was within acceptable limits and equations developed within age and race/ethnicity groups did not notably outperform equations developed in the entire gender group in terms of the amount of variance explained.
Our model selection used cutoffs for decision making that were directed by expert judgment. We generally avoided decision making based on P-values.15 The cut point of 1 s.e. for LASSO was taken from Hastie et al.14 In addition, we used the limit of a reduction in R2 of 0.01 for dropping terms. This was an arbitrary, a priori decision based on our judgment that a reduction of this size was small and therefore tolerated in order to have a more parsimonious model. We used a larger bound of a reduction of 0.02 in the R2 to select a more generalizable model over a model fit in a subgroup, and therefore with reduced generalizability. In the former instance, the penalty of retaining more terms was considered to be less of a sacrifice compared with the penalty inflicted by having to apply different equations in different subgroups. Here, the four equations in females that exceeded this bound in a subgroup were at the maximum over 0.02 by only 0.002, an amount we consider trivial. For that reason, we feel comfortable recommending the use of the more general equation in the identified subgroups. For the evaluation of systematic bias, we called biased values outside the range of ± 1 percentage body fat from the DXA-measured mean and 2 percentage points between model-predicted percentage body fat by categories. To put this into perspective, the span in average percentage body fat in Non-Hispanic Whites, 18–29 years of age at a BMI of 18.5 versus 40.0 kg m−2 was approximately 21 percentage points in the 1999–2004 NHANES.8
Truesdale (unpublished, 2015) and Cui et al.1 used MSD to detect differential trends in the underestimation or overestimation of percent body fat by published prediction equations within key subgroups when they were applied to the NHANES data matched to the development data on the criteria of gender, age and ethnicity. Truesdale et al. examined seven equations developed in children and found that six had differential systematic errors in the estimation of percent body fat that varied between the normal weight and obese by more than 2 percentage points. Cui et al.1 found that more than 2/3’s of 26 sets of equation developed in adults, when applied to NHANES, showed systematic bias between normal weight and obese men and women that were larger than 2 percentage points. Both studies also found instances of systematic differences of this magnitude by gender, age and race/ethnic categories. The bias in these estimates could be adequately large to produce misleading results across groups studied. Nevertheless, weaknesses in the assessments of differential errors by Cui and Truesdale include inability to control for subtle differences in the protocols for and operationalization of measurement of anthropometrics as well as potential differential error in the NHANES DXA measures. In the equations developed here in NHANES data, none showed a systematic difference in the estimation of percentage body fat in normal weight compared with obese participants that was as large as 0.5.
The R2 values of the equations developed here are not the largest currently published in the literature, even though these equations may well be the most complex published to date. There are several reasons why other investigators may have reported larger R2 values with more simple equations. Several reports, including those by Dugas et al.,4 Zanovec et al.,6 Heo et al.8 and Li et al.7 that used the NHANES data to produce percent body fat prediction equations, reported R2 values with males and females combined and included gender in their models, and therefore the very large differences in percent body fat between men and women were captured as part of the variance explained by the model, and the range of percent body fat was extended. Both of these attributes would increase the R2. Freedman et al. showed that the R2 for BMI alone as a predictor of percent body fat was 0.55 in an analysis across genders, and the estimate increased to 0.79 with the addition of gender to the model. Most likely, the gender-specific R2 for BMI would have been even lower than 0.55. As currently advocated in the literature,16 we produced separate equations by gender, although this tended to lower the R2. Another reason that other published R2 values may be higher is that they included alternative anthropometric measures. For instance, hip circumference was not available in the NHANES, and it might have explained additional variance. Finally, the estimate of R2 attained could vary importantly by the sample examined. Equations developed for youth by Stevens et al.5 using the 1999–2004 NHANES data had a higher R2 than shown here for equations developed for youth in the 1999–2006 NHANES. However, when the current equations are applied to the 1999–2004 data, they produce higher R2 estimates than shown in this work, and higher estimates than previously published by Stevens et al.5
A limitation of this as well as other work in this field is potential error in the criterion method. Different DXA systems and software produce somewhat different percent body fat measurements. The correction applied to the DXA data here13 was developed in adults, and may not be applicable to participants of all size, fatness and age. Finally, a four compartment model may have provided a more accurate criterion.
Our analytic approach is a strength of this work, and we know of no other study that has used LASSO to identify models for the prediction of percent body fat. LASSO has the advantage of being able to handle a large number of terms, and in fact, the number of terms can exceed the number of observations.10 Given current technology, the complexity of the equations developed here does not limit their feasibility. We credit this complexity as a key to the prediction of percent body fat estimates that (using our limits) are unbiased over age, race/ethnicity and BMI subgroups and explain variance in criterion percent body fat measures equally well as estimates developed exclusively in those specific subgroups. We see these attributes, along with the use of a large sample of superb generalizability, as major advantages of our approach. To assist investigator and personal use of our equations, we developed the American Body Composition Calculator (ABCC) (http://ABCC.sph.unc.edu), which facilitates use of SAS to calculate percent body fat for multiple subjects in an existing data set and use of hand-entered data to perform calculations for a single individual. It is our hope that the equations for prediction of percent body fat produced in this work will leverage research that will improve understanding of the role of obesity in health and disease and promote its prevention and treatment.
Supplementary Material
Acknowledgments
This work was supported by the National Institute of Diabetes and Digestive and Kidney Diseases (R01-DK097046).
Footnotes
CONFLICT OF INTEREST
The authors declare no conflict of interest.
Supplementary Information accompanies this paper on International Journal of Obesity website (http://www.nature.com/ijo)
References
- 1.Cui Z, Truesdale KP, Cai J, Stevens J. Evaluation of anthropometric equations to assess body fat in adults: NHANES 1999–2004. Med Sci Sports Exerc. 2014;46:1147–1158. doi: 10.1249/MSS.0000000000000213. [DOI] [PubMed] [Google Scholar]
- 2.Chambers AJ, Parise E, McCrory JL, Cham R. A comparison of prediction equations for the estimation of body fat percentage in non-obese and obese older Caucasian adults in the United States. J Nutr Health Aging. 2014;18:586–590. doi: 10.1007/s12603-014-0017-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Norgan NG. Laboratory and field measurements of body composition. Public Health Nutr. 2005;8:1108–1122. doi: 10.1079/phn2005799. [DOI] [PubMed] [Google Scholar]
- 4.Dugas LR, Cao G, Luke AH, Durazo-Arvizu RA. Adiposity is not equal in a multi-race/ethnic adolescent population: NHANES 1999–2004. Obesity (Silver Spring) 2011;19:2099–2101. doi: 10.1038/oby.2011.52. [DOI] [PubMed] [Google Scholar]
- 5.Stevens J, Cai J, Truesdale KP, Cuttler L, Robinson TN, Roberts AL. Percent body fat prediction equations for 8- to 17-year-old American children. Pediatr Obes. 2013;9:260–271. doi: 10.1111/j.2047-6310.2013.00175.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zanovec M, Wang J, O’Neil CE. Development and comparison of two field-based body fat prediction equations: NHANES 1999–2004. Int J Exerc Sci. 2012;5:5. [PMC free article] [PubMed] [Google Scholar]
- 7.Li C, Ford ES, Zhao G, Balluz LS, Giles WH. Estimates of body composition with dual-energy X-ray absorptiometry in adults. Am J Clin Nutr. 2009;90:1457–1465. doi: 10.3945/ajcn.2009.28141. [DOI] [PubMed] [Google Scholar]
- 8.Heo M, Faith MS, Pietrobelli A, Heymsfield SB. Percentage of body fat cutoffs by sex, age, and race-ethnicity in the US adult populaton from NHANES 1999–2004. Am J Clin Nutr. 2012;95:594–602. 60. doi: 10.3945/ajcn.111.025171. [DOI] [PubMed] [Google Scholar]
- 9.Pratt CA, Boyington J, Esposito L, Pemberton VL, Bonds D, Kelley M, et al. Childhood Obesity Prevention and Treatment Research (COPTR): interventions addressing multiple influences in childhood and adolescent obesity. Contemp Clin Trials. 2013;36:406–413. doi: 10.1016/j.cct.2013.08.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tibshirani RJ. Regression Shrinkage and Selection via the LASS. J R Stat Soc. 1996;58:267–288. Ser. B. [Google Scholar]
- 11.Centers for Disease Control and Prevention (CDC). National Center for Health Statistics (NCHS) National Health and Nutrition Examination Survey Examination Protocol. Hyattsville, MD, USA: US Department of Health and Human Services, Centers for Disease Control and Prevention; 2011. http://www.cdc.gov/nchs/nhanes.htm. [Google Scholar]
- 12.Johnson CL, Paulose-Ram R, Ogden CL, Carroll MD, Kruszon-Moran D, Dohrmann SM, et al. National health and nutrition examination survey: analytic guidelines, 1999–2010. Vital Health Stat 2. 2013;161:1–24. [PubMed] [Google Scholar]
- 13.Schoeller DA, Tylavsky FA, Baer DJ, Chumlea WC, Earthman CP, Fuerst T, et al. QDR 4500A dual-energy X-ray absorptiometer underestimates fat mass in comparison with criterion methods in adults. Am J Clin Nutr. 2005;81:1018–1025. doi: 10.1093/ajcn/81.5.1018. [DOI] [PubMed] [Google Scholar]
- 14.Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning. Springer New York Inc; New York, NY, USA: 2001. p. 244. [Google Scholar]
- 15.Rothman KJ, Greenland S, Lash TL. Modern Epidemiology. 3. Wolters Kluwer Health/Lippincott Williams & Wilkins; Philadelphia, PA, USA: 2008. [Google Scholar]
- 16.Burton RF. The body adiposity index is not the best hip-height index of adiposity. Br J Nutr. 2012;108:2100–2101. doi: 10.1017/S0007114512004266. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.