Teras et al. (1) report on weight loss and risk of breast cancer using data from 10 prospective studies, eight of which used self-reported weight data. Teras et al. note a high correlation coefficient between self-reported and measured weight. However, a high correlation coefficient is not an indicator of good agreement between two methods of ascertainment. In Bland and Altman’s 1986 article (2) about assessing agreement between two methods, they note that “data which seem to be in poor agreement can produce quite high correlations” and provide an example with a correlation of 0.94 and another with a correlation of 0.99. Bland and Altman raised the question “Why has a totally inappropriate method, the correlation coefficient, become almost universally used for this purpose?” Correlation is not an appropriate approach for assessing the comparability between methods (3). Teras et al. cite an article by Lawlor et al. (4) regarding comparisons of self-reported and measured weight for women 60 years and older; despite a correlation of 0.98, the Lawlor et al. article concluded that “self-report of weight should not be relied upon in prospective epidemiological studies or clinical practice when accuracy at the level of the individual is required.”
Most of the data used by Teras et al. (95% of the cases of incident breast cancer) come from US studies. The Global BMI Mortality Collaboration analysis for North America (5) compared 20 studies with self-reported data against 20 studies with measured data and found highly statistically significant differences in results by method of weight and height ascertainment between self-reported and measured data. For example, the hazard ratio for overweight was 1.00 (95% confidence interval = 0.97 to 1.04) when measured data were used and 1.14 (95% confidence interval = 1.12 to 1.16) when self-reported data were used, with a P value of .004 for heterogeneity. The P values for heterogeneity for other categories were .002 for underweight, less than than .001 for Grade 1 obesity, .037 for Grade 2 obesity, and .057 for Grade 3 obesity. The study by Teras et al. may have failed to detect this difference because they had only two studies with measured data, perhaps too small a sample.
The study by Teras et al. is based on estimates of weight change calculated by subtracting weight at one time from weight at another time. The validity of weight change calculated from self-reported weights at two different times for a given individual has not been established. The direction and magnitude of reporting error are not necessarily the same over time for a given individual. In addition, even if the measurement error in weight change estimates is not associated with the future outcome, nevertheless categorizing individuals into weight-change categories can lead to differential misclassification. Even in prospective studies, categorization itself can induce differential misclassification (6,7) when the risk for the outcome varies within the categories.
Given these limitations and methodological weaknesses, the results reported by Teras et al. should be viewed cautiously. The weight loss estimates are of questionable validity. Studies with estimates of weight change based on measured weights would be useful.
Notes
The author has no conflicts of interest to disclose.
References
- 1. Teras LR, Patel AV, Wang M, et al. Sustained weight loss and risk of breast cancer in women >/=50 years: a pooled analysis of prospective data. J Natl Cancer Inst. 2020;112(9):djz226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Bland JM, Altman DG.. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–310. [PubMed] [Google Scholar]
- 3. Giavarina D. Understanding Bland Altman analysis. Biochem Med. 2015;25(2):141–151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Lawlor DA, Bedford C, Taylor M, Ebrahim S.. Agreement between measured and self-reported weight in older women. Results from the British Women's Heart and Health Study. Age Ageing. 2002;31(3):169–174. [DOI] [PubMed] [Google Scholar]
- 5.Global BMI Mortality Collaboration. Body-mass index and all-cause mortality: individual-participant-data meta-analysis of 239 prospective studies in four continents. Lancet. 2016;388(10046):776–786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Flegal KM, Keyl PM, Nieto FJ.. Differential misclassification arising from nondifferential errors in exposure measurement. Am J Epidemiol. 1991;134(10):1233–1244. [DOI] [PubMed] [Google Scholar]
- 7. Brenner H, Loomis D.. Varied forms of bias due to nondifferential error in measuring exposure. Epidemiology. 1994;5(5):510–517. [PubMed] [Google Scholar]
