Skip to main content
BMJ Open Access logoLink to BMJ Open Access
. 2019 Oct 17;26(1):12–13. doi: 10.1136/bmjebm-2019-111232

Children’s fitness and health: an epic scandal of poor methodology, inappropriate statistics, questionable editorial practices and a generation of misinformation

Jo Welsman 1,, Neil Armstrong 1
PMCID: PMC7848063  PMID: 31624077

A global explosion of research into children and adolescents’ health and cardiorespiratory or aerobic fitness has resulted in a flurry of papers and subsequently systematic reviews revealing apparently worrying but fallacious assumptions such as: (1) aerobic fitness is declining1; (2) aerobic fitness expressed in ratio with body mass reflects present2 and predicts future3 cardiovascular and metabolic health risk; (3) a single sex-specific ‘cut-point’ of aerobic fitness expressed in ratio with body mass identifies children and adolescents who ‘may benefit from primary and secondary cardiovascular prevention programming’, (Ruiz et al p1451)4- the so-called ‘clinical red flags’.

Our serious concerns with these conclusions, despite their basis in large data sets and publication in internationally respected journals, is that they are not founded on rigorous science but on flawed methodology, namely predicting aerobic fitness from the 20 metre shuttle run test (20mSRT)5 and interpreting paediatric fitness data expressed in ratio with body mass.

Problem 1: the 20mSRT is not a valid measure of children’s aerobic fitness

Over 30 years ago6 we demonstrated the poor criterion validity of the 20mSRT or ‘bleep’ test.5 We discounted the test as a research tool not only because of poor statistical validity but because of its dependence on participant motivation and body size, particularly fatness. The 20mSRT was never originally validated against laboratory-determined peak oxygen uptake (V˙O2) (the internationally recognised gold-standard measure of paediatric aerobic fitness). Subsequent validation studies with children are sparse and statistically inadequate being based in correlation and regression not agreement. A recent review, although not specifying the underlying statistics, reported that peak V˙O2 can be estimated within ±10 mL kg-1 min-1 from the 20mSRT,7 but as this represents around 20%–25% of typical values this is hardly a test we would want to see underpinning recommendations for international public health policy.8

Problem 2: the expression of aerobic fitness in simple ratio with body mass (ie as V̇O2 in mL kg-1 min-1) is not a valid method for controlling for body size differences

Over 30 years ago, our attention was drawn to a paper published by Tanner9 which detailed the fallacy of simple division by body mass to control for body size in describing physiological functions. As an assumed, rather than fitted mathematical relationship, per-body-mass ratios typically overestimate values of fitness for light individuals, and artefactually penalise heavier people. Thus, in subsequent correlation analyses, or through subdivision into high vs low fitness groups, for example, to examine relationships with cardiometabolic risk factors,4 spurious conclusions are inevitable and reflect levels of fatness rather than levels of fitness.

Aware of the significance of this paper for our own research, we comprehensively searched the literature but failed to find a published scientific or statistical justification for ‘per-body-mass scaling’ for youth aerobic fitness.10 It has become absorbed into accepted practice simply because it is ‘traditional’,11 ‘convenient’11 and ‘feasible’12 and so evades challenge by peer reviewers and editors.

Discussion

The speed at which research studies based on this combination of two fundamentally flawed methodologies have come to dominate the international literature on paediatric aerobic fitness has been alarming. In the decade to 2000, on average two papers reporting 20mSRT data per year were published in journals summarised by PubMed. In the past 9 years, 379 papers have been published. In response to this we have refocussed our efforts to raise awareness of the methodological inaccuracies inherent in this body of research and published, with comprehensive commentary and reanalyses, 20 of our published cross-sectional studies10 and new longitudinal multilevel modelling analyses13 14 of ~1400 rigorous determinations of 10–18 years old’s aerobic fitness. In all cases the data did not meet the statistical assumptions underpinning ratio scaling of peak V˙O2 with body mass.

Our recent longitudinal studies confirm evidence we first published over 30 years ago: when determined in a laboratory using rigorous assessment procedures, appropriately size-adjusted aerobic fitness increases with age and maturity in both girls and boys (eg, 13), that is, does not decline or level off as suggested by per-body-mass international norms.15 Thus recommendations for single sex-specific ‘cut-off’ points for ‘healthy’ fitness from childhood through adolescence which do not accommodate age or maturational effects4 are meaningless.

Rigorously determined laboratory data16 do not show the declines over time in children’s fitness indicated from 20mSRT data. The latter is an artefact due to increased fatness constituting ‘dead weight’ which increases the work done per shuttle and adversely affects 20mSRT predictions but does not affect true aerobic fitness. This is further confounded by body fat being included in the denominator when simple per body mass ratios are computed. In fact, when body size and fatness differences are appropriately accounted for using allometric multilevel modelling, there are minimal differences in the fitness of overweight versus healthy weight children and adolescents.13

But how do we shift an entire discipline rooted in poor methodology? Not surprisingly young researchers and those in resource poor countries are quick to join the international 20mSRT bandwagon which enables the collection of large volumes of data quickly, cheaply and supports publication in internationally respected journals. Publishing appropriately analysed papers,13 14 writing tutorial17 and commentary style pieces is not enough. We are dismayed by apparent editorial resistance to challenges to the status quo. In the face of demonstrably weak methodology and inappropriate statistics we urgently need those with editorial power, including peer reviewers, to challenge authors to defend their work and for that defence to be based in appropriate statistics. We need better mechanisms and mentoring to support researchers in developing economies to discourage ‘quick wins’ and guide them towards better quality research. We need to ensure that the next generation of researchers are grounded in appropriate methodologies and have the critical ability and confidence to challenge traditional, but unjustified, practices.

We have an ethical and moral duty with minors to ensure that our research methodologies are rigorous and defensible. Only then will we accurately understand the role of fitness in children’s current and future health enabling public health recommendations to be meaningful and evidence-based.

Footnotes

Twitter: @jowelsman

Contributors: JW and NA jointly conceived the paper. JW drafted the paper and both authors revised the manuscript and approved submission for review.

Competing interests: None declared.

Patient consent for publication: Not required.

Provenance and peer review: Not commissioned; externally peer reviewed.

References

  • 1. Tomkinson GR, Lang JJ, Tremblay MS. Temporal trends in the cardiorespiratory fitness of children and adolescents representing 19 high-income and upper middle-income countries between 1981 and 2014. Br J Sports Med 2019;53:478–86. 10.1136/bjsports-2017-097982 [DOI] [PubMed] [Google Scholar]
  • 2. Lang JJ, Belanger K, Poitras V, et al. Systematic review of the relationship between 20m shuttle run performance and health indicators among children and youth. J Sci Med Sport 2018;21:383–97. 10.1016/j.jsams.2017.08.002 [DOI] [PubMed] [Google Scholar]
  • 3. Ruiz JR, Castro-Piñero J, Artero EG, et al. Predictive validity of health-related fitness in youth: a systematic review. Br J Sports Med 2009;43:909–23. 10.1136/bjsm.2008.056499 [DOI] [PubMed] [Google Scholar]
  • 4. Ruiz JR, Cavero-Redondo I, Ortega FB, et al. Cardiorespiratory fitness cut points to avoid cardiovascular disease risk in children and adolescents; what level of fitness should raise a red flag? A systematic review and meta-analysis. Br J Sports Med 2016;50:1451–8. 10.1136/bjsports-2015-095903 [DOI] [PubMed] [Google Scholar]
  • 5. Léger LA, Mercier D, Gadoury C, et al. The multistage 20 metre shuttle run test for aerobic fitness. J Sports Sci 1988;6:93–101. 10.1080/02640418808729800 [DOI] [PubMed] [Google Scholar]
  • 6. Armstrong N, Ringham D, Welsman J. Peak oxygen uptake and progressive shuttle run performance in boys aged 11-14 years. Br J Phys Educ 1988;19:10–11. [Google Scholar]
  • 7. Tomkinson GR, Lang JJ, Blanchard J, et al. The 20-m shuttle run: assessment and interpretation of data in relation to youth aerobic fitness and health. Pediatr Exerc Sci 2019;31:152–63. 10.1123/pes.2018-0179 [DOI] [PubMed] [Google Scholar]
  • 8. Lang JJ, Wolfe Phillips E, Orpana HM, et al. Field-based measurement of cardiorespiratory fitness to evaluate physical activity interventions. Bull World Health Organ 2018;96:794–6. 10.2471/BLT.18.213728 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Tanner JM. Fallacy of per-weight and per-surface area standards, and their relation to spurious correlation. J Appl Physiol 1949;2:1–15. 10.1152/jappl.1949.2.1.1 [DOI] [PubMed] [Google Scholar]
  • 10. Welsman J, Armstrong N. Interpreting aerobic fitness in youth: the fallacy of ratio scaling. Pediatr Exerc Sci 2019:31. [DOI] [PubMed] [Google Scholar]
  • 11. Bar-Or O, Rowland TW. Pediatric exercise medicine. Champaign, IL: Human Kinetics, 2004. [Google Scholar]
  • 12. Mintjens S, Menting MD, Daams JG, et al. Cardiorespiratory fitness in childhood and adolescence affects future cardiovascular risk factors: a systematic review of longitudinal studies. Sports Med 2018;48:2577–605. 10.1007/s40279-018-0974-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Armstrong N, Welsman J. Sex-specific longitudinal modeling of youth peak oxygen uptake. Pediatr Exerc Sci 2019;31:204–12. 10.1123/pes.2018-0175 [DOI] [PubMed] [Google Scholar]
  • 14. Armstrong N, Welsman J. Development of peak oxygen uptake from 11–16 years determined using both treadmill and cycle ergometry. Eur J Appl Physiol 2019;119:801–12. 10.1007/s00421-019-04071-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Tomkinson GR, Lang JJ, Tremblay MS, et al. International normative 20 m shuttle run values from 1 142 026 children and youth representing 50 countries. Br J Sports Med 2017;51:1545–54. 10.1136/bjsports-2016-095987 [DOI] [PubMed] [Google Scholar]
  • 16. Mountjoy M, Andersen LB, Armstrong N, et al. International Olympic Committee consensus statement on the health and fitness of young people through physical activity and sport. Br J Sports Med 2011;45:839–48. 10.1136/bjsports-2011-090228 [DOI] [PubMed] [Google Scholar]
  • 17. Welsman JR, Armstrong N. Interpreting exercise performance data in relation to body size : Armstrong N, van Mechelen W, Paediatric exercise science and medicine. 2nd ed Oxford: Oxford University Press, 2008: 13–21. [Google Scholar]

Articles from BMJ Evidence-Based Medicine are provided here courtesy of BMJ Publishing Group

RESOURCES