Abstract
Purpose
Accurate and reliable measurement of physical activity plays an important role in assessing effective lifestyle interventions for obesity. This study examined reliability of accelerometer-based estimates of physical activity levels of overweight and obese adults before and after a lifestyle weight loss program.
Methods
Participants were overweight and obese (BMI 25–45 kg/m2) members (n=1592; 67% female, 42% African American) of the multi-center Weight Loss Maintenance trial. They wore RT3 accelerometers during waking hours for 7 days at baseline and after a 6-month weight loss intervention that included diet and physical activity recommendations. Moderate to vigorous physical activity (MVPA) and MVPA occurring in bouts ≥ 10 minutes (bout MVPA) were assessed.
Results
At baseline, wear time minimums of 10 and 6 hours/day resulted in similar average minutes·day−1 of MVPA (18.3 and 18.0 minutes) and MVPA bout minutes·day−1 (6.9 and 6.7 minutes). Similar wear times occurred after the weight loss intervention for MVPA (27.0 and 26.8 minutes) and bout MVPA (15.1 and 15.0 minutes). Reliability measurements by Intra Class Correlation (ICC) were larger for 4 versus 2 days·week−1 minimum wear time for both MVPA and bout MVPA (4 day ICCs .27–.44 and 2 day ICCs .19–.38), but there was little increase in ICC comparing 4 (ICCs.27–.44) and 7 days·week−1 (ICCs .30–.46).
Conclusions
Longer wear time requirements did not result in significant increases in reliability. Using 4 days of data with ≥ 6 hours·day−1 of wear time optimized the balance between ICC and participant burden in overweight and obese adults before and after a weight loss intervention. Future investigations using accelerometers to estimate MVPA in overweight and obese samples can consider requiring less monitor wear time.
Keywords: Physical Activity, Exercise, Bouts, Ambulatory Monitoring, Measurement
Introduction
Accurate and reliable measurement of physical activity is needed to assess risk status and compliance with physical activity recommendations, monitor participant progress during behavioral physical activity and weight loss interventions, and determine the effectiveness of such programs. Accelerometers provide an objective measure of physical activity and are increasingly being used in research and population surveillance (12). However, despite the widespread use of accelerometers, the impact of varying amounts of wear time on physical activity estimates has not been well studied.
An important methodological concern is the amount of wear time necessary to reliably estimate physical activity levels. A handful of studies have examined the number of days of data collection needed for reliable estimates, but many of these efforts have focused on children (13). Matthews reported that in adults, fewer days are needed to reliably estimate moderate to vigorous physical activity (MVPA) than to measure inactivity (7).
It is not known whether the required wear time for a reliable estimate varies with the activity level of the sample. Limited evidence suggests that low-active samples have lower reliabilities than more-active samples. For example, a sample of 55 adults wore accelerometers for 2 days on 14 different occasions across a calendar year. As a group, those with lower activity levels had lower reliability, as measured by Intra Class Correlations (ICCs), than those with higher activity levels (5). Because this finding could have implications for population surveillance as well as evaluations of lifestyle interventions, further study is needed.
The purpose of this study was to examine reliability of wear time in a large sample of overweight and obese adults. It is one of the first papers to examine the reliability of accelerometer estimates of both minutes·day−1 of MVPA and estimates of MVPA performed in bouts of 10 minutes or more (bout MVPA). In addition, this paper examined the impact of days·week−1and hours·day−1 wear time criteria on accelerometer-based estimates of MVPA both before and after a weight loss intervention. Finally, we estimated the number of days of wear time needed to reach specific reliability levels.
Methods
Sample
We report on all participants with accelerometer data who underwent screening in the multi-site Weight Loss Maintenance trial (pre-Phase 1) (11). Participants were at least 25 years old and had body mass index (BMI) 25 – 45 kg/m2, classifying them as either overweight (BMI 25–29.9 kg/m2) or obese (BMI ≥ 30 kg/m2). They were also at high risk for developing cardiovascular disease, evidenced by taking antihypertensive or cholesterol-lowering medication. Those who lost at least 4 kg by the end of the initial intervention, Phase 1, were invited to participate in Phase 2, a 30-month weight maintenance program that compared two intervention conditions and a self-directed control condition on success in sustaining weight loss.
Phase 1 Weight Loss Intervention
Weight Loss Maintenance - Phase 1 was a 6-month intensive weight-reduction intervention that incorporated recommendations for diet and physical activity. The intervention included weekly group meetings and supplemental individual sessions. Interventionists encouraged participants to gradually increase to 180 minutes·week−1 of moderate or greater intensity physical activity. It was suggested that participants accumulate physical activity in bouts.
The trial was reviewed and approved by an independent Protocol Review Committee appointed by the NHLBI and by each site’s Institutional Review Board. All participants provided written informed consent.
Assessment of Physical Activity
Physical activity was assessed at both time points using the RT3 (Stayhealthy, Inc.), a triaxial accelerometer that provides an objective measure of physical activity. Accelerations are detected from vertical, horizontal, and anterior-posterior planes and converted to counts. Greater acceleration over a given period of time results in higher counts. The instrument has adequate reliability and validity against treadmill walking at different speeds and non-regulated physical activity (8,9).
During a screening visit prior to the start of Phase 1, participants were given an RT3 accelerometer programmed to capture data in one-minute increments. Participants were shown how to wear the monitor above the left hip and asked to wear it during all waking hours for 7 consecutive days. They returned the monitor to the clinic, where it was downloaded and output was checked for wear time. Whenever possible, a full 7 days of data were collected. The minimum requirement was ≥ 10 hours·day−1 for 4 days, including at least one weekend day. When possible, those who did not meet this requirement were asked to wear the accelerometer again. This procedure was repeated at the end of the 6-month Phase 1 weight loss program for those participants who lost 4 kg and were eligible for the Phase 2 weight maintenance program. MVPA was defined as ≥ 3 metabolic equivalents (1). Previously published RT3 cutpoints (> 1316.5 counts/min) were used to identify total minutes of MVPA and minutes of bout MVPA (9).
Statistical Analyses
We limited this analysis to individuals with at least 4 days of data (including at least one weekend day), with at least 6 hours of wear time each day. Descriptive statistics were used to describe these samples at screening (Pre Phase 1) and after the 6-month intervention (Post Phase 1). Average daily MVPA minutes (both total and 10-minute bout minutes) were calculated by averaging daily data within individuals and then computing an unweighted average of these individual participant means. Similarly, we report between-person standard deviations of individual averages. The reliability of average daily MVPA estimates was estimated using Intra Class Correlations (ICC), which represents the between-person variance/total variance. The ICC was derived from repeated measures analyses of variance (SAS PROC MIXED) using daily activity counts for each individual and controlling for the class variables of clinical center/site.
We examined ICC for the first 4 days (including at least 1 weekend day) using different wear time minimums (≥ 10 hours ·day−1 and ≥ 6 hours ·day−1) both Pre and Post Phase 1. Next we examined ICC for those participants with 7 days of data with a minimum of 10 hours·day−1. These analyses were repeated examining only the first 4 days (including at least 1 weekend day) and again for the first 2 days of monitor wear. The latter ICC was used for the Spearman-Brown prophecy formula (3, 10) to determine number of days to achieve specific reliabilities. The Spearman-Brown prophecy formula was used to determine the number of days need for desired reliabilities of .70, .80, and .90, where [days of monitoring needed for desired reliability] = [ICCdesired/(1-ICCdesired)][(1-ICCestimated)/ICCestimated] (2, 5, 14).
Results
Demographics
Characteristics of the participants in the Pre Phase 1 (n = 1592) and Post Phase 1 (n=1070) samples are presented in Table 1. Both Pre and Post Phase 1 samples included at least 47% over the age of 55, 60% females, and 33% African Americans.
Table 1.
Total | Pre Phase 1 N=1592 (%) |
Post Phase 1 N=1070 (%) |
---|---|---|
Age (years) | ||
≤ 45 | 13.8 | 10.9 |
46–55 | 38.8 | 38.1 |
56–64 | 32.5 | 35.0 |
≥ 65 | 14.8 | 16.1 |
Gender | ||
Male | 33.3 | 35.9 |
Female | 66.7 | 64.1 |
Race | ||
African American | 42.3 | 36.6 |
Non- African American | 57.7 | 63.4 |
BMI | ||
Normal | 0.0 | 6.9 |
Overweight | 21.2 | 39.4 |
Obese | 78.8 | 53.7 |
Note: Pre Phase 1 was screening data. Post Phase 1 was after 6 months of weight loss intervention.
Wear Time: Hours·day−1
Physical activity estimates and the reliability of these estimates were similar for both the 10-hour and 6-hour minimum wear times. All calculations were performed using the first 4 days (ensuring at least 1 weekend day) that met the daily wear time criteria. In Pre Phase 1, mean MVPA estimates using a 10-hour minimum were similar to those using a 6-hour minimum (18.3 minutes·day−1 and 18.0 minutes·day−1), and both methods resulted in an ICC of 0.40. Table 2 shows that estimates of bout MVPA also were similar across wear time minimums (6.9 minutes·day−1 and 6.7 minutes·day−1), and both methods resulted in an ICC of 0.33.
Table 2.
4 days ≥ 10 hours·day−1 | 4 days ≥ 6 hours·day−1 | |||||
---|---|---|---|---|---|---|
N | M (SD) | ICC | N | M (SD) | ICC | |
Pre-Phase 1 | ||||||
MVPA | 1351 | 18.3 (25.5) | .40 | 1592 | 18.0 (25.3) | .40 |
MVPA in bouts | 1351 | 6.9 (18.7) | .33 | 1592 | 6.7 (18.6) | .33 |
Post-Phase 1 | ||||||
MVPA | 964 | 27.0 (32.0) | .46 | 1070 | 26.8 (32.8) | .47 |
MVPA in bouts | 964 | 15.1 (26.4) | .43 | 1070 | 15.0 (27.1) | .42 |
Note: Models were adjusted for site. ICC = Intra Class Correlation. MVPA = Moderate to Vigorous Physical Activity. Pre Phase 1 was screening data. Post Phase 1 was after 6 months of weight loss intervention.
Post Phase 1, the 10-hour and 6-hour minimums resulted in similar MVPA estimates and reliabilities (27.0 and 26.8 minutes·day−1; 0.46 and 0.47 ICC). Similar results were found for bout MVPA (15.1 and 15.0 minutes·day−1; 0.43 and 0.42 ICC). We used the 6-hour minimum wear time for the final analyses.
Wear Time: Days needed
We conducted this set of analyses as a subset of the original sample: participants who had 7 days with ≥ 6 hours·day−1 of wear time. Table 3 shows that there were 721 participants meeting this requirement from Pre Phase 1 and 637 from Post Phase 1. ICCs were calculated using 2, 4 (must include 1 weekend day), and all 7 days of wear time. The ICC for the first 2 days of wear time was lower across time points and outcomes (MVPA and bout MVPA). In the Pre Phase 1 sample, the MVPA ICCs were .36 and .37 for 4 days and 7 days, and the bout MVPA ICCs were .29 and .30. In the Post Phase 1 sample, the MVPA ICCs were .44 and .46 for 4 and 7 days, and the bout MVPA ICCs were .39 and .41. A sensitivity analysis was conducted using the 304 participants who had 7 days of accelerometry data at both Pre and Post Phase 1. Results from this sensitivity analysis indicated similar ICCs Pre Phase 1; however, compared to the larger sample, the subsample of 304 individuals had a lower ICC Post Phase 1.
Table 3.
Moderate to Vigorous Physical Activity | Moderate to Vigorous Physical Activity in bouts | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
ICC | Days of monitoring required to achieve reliability of |
ICC | Days of monitoring required to achieve reliability of |
|||||||||
2 days | 4 days | 7 days | .70 | .80 | .90 | 2 days | 4 days | 7 days | .70 | .80 | .90 | |
All Participants | ||||||||||||
Pre Phase 1* (N=721) | .26 | .36 | .37 | 13 | 23 | 51 | .25 | .29 | .30 | 11 | 20 | 44 |
Post Phase 1 (N=637) | .35 | .44 | .46 | 9 | 16 | 35 | .34 | .39 | .41 | 9 | 16 | 35 |
Participants with both pre and post values | ||||||||||||
Pre Phase1* (N=304) | .32 | .40 | .40 | 10 | 17 | 37 | .38 | .29 | .32 | 8 | 13 | 30 |
Post Phase1 (N=304) | .19 | .30 | .36 | 20 | 35 | 78 | .19 | .27 | .32 | 20 | 35 | 78 |
Note: Models were adjusted for site. Pre Phase 1 was screening data. Post Phase 1 was after 6 months of weight loss intervention
Days of Monitoring Needed to Achieve a Given Level of Reliability
Table 3 also presents estimates, based on the Spearman-Brown prophecy formula, of the number of days needed to achieve reliabilities of .70, .80, and .90 based on the ICC corresponding with 2 days of wear time. The results for MVPA indicate that between 9 and 13 days of data collection are needed for a reliability of .70 and between 16 and 23 days for a reliability of .80 in both the Pre and Post Phase 1 samples. In the Pre Phase 1 sample, bout MVPA reliabilities of .70 and .80 required 11 and 20 days of data collection. In the Post Phase 1 sample, bout MVPA reliabilities of .70 and .80 required 9 and 16 days.
Discussion
This is one of the largest, most diverse samples used to examine reliability in accelerometry data. We found little support for using the more-stringent wear time minimum of ≥ 10 hours·day−1 when estimating MVPA. The 6-hour wear time afforded a larger sample size than the more stringent minimum and resulted in similar physical activity estimates and corresponding reliability. We also found similar results using a 4-day versus a 7-day criterion for wear time. Thus, shorter wear time minimums may provide useful MVPA estimations in populations of sedentary overweight and obese adults.
Studies designed to estimate either total energy expenditure or daily physical activity often require stringent wear time criteria in order to capture all pertinent behavior. (4) Although some studies have required up to 12 hours of wear time (15), there have been few published examinations of the impact of wear time on reliability in estimating minutes·day−1 of MVPA or bout MVPA. It has been suggested that participants are not likely to be engaged in MVPA and even less likely to be engaged in bout MVPA during periods when accelerometers are removed (6). Consequently, wearing the accelerometers during all waking hours may not be necessary when making MVPA estimates. Additionally, requiring long hours (e.g., 10 or 12 hours) of minimum wear time increases participant burden and is not always feasible in large-scale studies. We did not examine minimum wear time needed for reliability estimates of sedentary activities, which is likely to be more stringent than wear time required for reliable MVPA estimates.
It was not surprising to find low levels of activity in this sample of overweight and obese adults in Pre Phase 1. As a result of low activity levels, the between-person variability in activity levels expressed as a percentage of total variance was low, resulting in low ICCs for the Pre Phase 1 sample. In contrast, the between-person variability would account for a larger portion of the overall variability in a sample including both sedentary and active individuals, resulting in a larger ICC. The higher ICCs Pre compared to Post Phase 1 could be due to an increase in between-person variance rather than an indicator of more-reliable MVPA and bout MVPA measurement. The results from the sensitivity analyses do not support this hypothesis, however, as the Post Phase 1 ICCs are lower than the Pre Phase 1 ICCs for this group.
Currently there is no gold standard for capturing intensity, duration, and frequency of daily MVPA. Accelerometry is one of the most promising methods for objective recording of daily physical activity. As with other physical activity measurements, there is a degree of measurement error in accelerometer estimates. Although ICC is widely used to describe reliability of measurement, it may not be the best indicator. To date there have been few attempts to develop standardized methods or identify a battery of tests for assessing measurement reliability. Such a collection might include ICC, inter-monitor reliability, and criterion references.
The Spearman-Brown prophecy formula (3, 10) is one of the most widely used techniques for estimating days of measurement needed to achieve a specific reliability. Different studies have operationalized the procedure slightly differently, but the general methods are similar. A common length of time to construct a repeated measure is used to determine an ICC, we used 2 days of daily MVPA or bout MVPA. The Spearman-Brown prophecy formula uses this ICC to determine how many “lengths of time” are required to achieve a specific reliability. Previously reported Spearman-Brown estimates of the number of days needed to achieve reliability of .80 range from 3 to 10 days (5, 13). We found that between 16 and 23 days of monitoring would be needed to achieve reliability of .80. Reliability increased when comparing 2 to 4 days of monitoring, yet there was negligible increase in ICC when wear time was increased to 7 days. As such, 4 days of monitoring appeared to be an optimal tradeoff between reliability and participant burden for wearing the monitor on multiple days. Because the Spearman-Brown prophecy formula relies on between-person variance, there is concern when using it to estimate the number of days needed for reliable activity estimates (5). Additionally, the formula assumes that reliability increases with increased “length of time”. In this study we found that the reliability did not increase in the same proportion as predicted by the formula, thus highlighting another limitation of using Spearman-Brown prophecy formula for determining reliability of accelerometer data. Generalizing Spearman-Brown estimates across populations should be done with caution and should be secondary to finding actual reliability estimates to determine needed wear time for specific reliabilities.
There are known limitations to accelerometer-based activity estimates. In general, accelerometers capture walking and jogging better than other activities (4), and the nature of the activity may affect reliabilities. Accelerometers need to be taken off before swimming, and they do not accurately estimate bicycling because of lack of hip movement while cycling. However, based on self report there were low rates of these activities in this sample (data not shown). Reliability of estimates could also be influenced by the accuracy of the monitor (17) and completeness of staff training, both of which received substantial attention in this trial. While compliance rates were high across all data collection sites, some of our analyses were conducted on a subsample with the highest compliance. This subsample was larger (n=637) than most samples previously used to examine reliability with adults and is one of the largest reported samples of overweight and obese adults with accelerometer data. While this paper did not test the impact of different wear time instructions, we did examine how data processing decisions affect reliability and sample size. As such, we believe these results are applicable to future study designs.
This paper examined the relationship between wear time requirements and optimal use of data collected when developing reliable physical activity estimates. More stringent (longer) wear time requirements resulted in lower sample sizes without a significant increase in reliability. It has been suggested that researchers seek a balance between feasibility and what is needed to answer the research question at hand (16). Using 4 days of data with ≥ 6 hours·day−1 of wear time optimized both ICC and sample size. Hence, we believe it resulted in the most reasonable estimation of MVPA and bout MVPA in both the Pre and Post Phase 1 samples. These results should help guide future efforts to use accelerometers in estimating MVPA in diverse overweight and obese samples.
Acknowledgements
This study was supported by grants 5-U01 HL68734, 5-U01 HL68676, 5-U01 HL68790, 5-U01 HL68920, and 5-HL68955 from the National Heart, Lung, Blood Institute. The results of the present study do not constitute endorsement by ACSM.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Centers for Disease Control and Prevention. U.S. department of health and human services: Office of disease prevention and health promotion--healthy people 2010. Nasnewsletter. 2000 May;15(3):3. [PubMed] [Google Scholar]
- 2.Gretebeck RJ, Montoye HJ. Variability of some objective measures of physical activity. Med Sci Sports Exerc. 1992 Oct;24(10):1167–1172. [PubMed] [Google Scholar]
- 3.Guilford JP. Psychometric Methods. New York: McGraw-Hill Book Co., Inc.; 1936. p. 419. [Google Scholar]
- 4.Jakicic JM, Winters C, Lang W, Wing RR. Effects of intermittent exercise and use of home exercise equipment on adherence, weight loss, and fitness in overweight women: A randomized trial. JAMA. 1999 Oct 27;282(16):1554–1560. doi: 10.1001/jama.282.16.1554. [DOI] [PubMed] [Google Scholar]
- 5.Levin S, Jacobs DR, Jr, Ainsworth BE, Richardson MT, Leon AS. Intra-individual variation and estimates of usual physical activity. Ann Epidemiol. 1999 Nov;9(8):481–488. doi: 10.1016/s1047-2797(99)00022-8. [DOI] [PubMed] [Google Scholar]
- 6.Masse LC, Fuemmeler BF, Anderson CB, Matthews CE, Trost SG, Catellier DJ, Treuth M. Accelerometer data reduction: A comparison of four reduction algorithms on select outcome variables. Med Sci Sports Exerc. 2005 Nov;37(11 Suppl):S544–S554. doi: 10.1249/01.mss.0000185674.09066.8a. [DOI] [PubMed] [Google Scholar]
- 7.Matthews CE, Ainsworth BE, Thompson RW, Bassett DR., Jr Sources of variance in daily physical activity levels as measured by an accelerometer. Med Sci Sports Exerc. 2002 Aug;34(8):1376–1381. doi: 10.1097/00005768-200208000-00021. [DOI] [PubMed] [Google Scholar]
- 8.Powell SM, Jones DI, Rowlands AV. Technical variability of the RT3 accelerometer. Med Sci Sports Exerc. 2003 Oct;35(10):1773–1778. doi: 10.1249/01.MSS.0000089341.68754.BA. [DOI] [PubMed] [Google Scholar]
- 9.Rowlands AV, Thomas PW, Eston RG, Topping R. Validation of the RT3 triaxial accelerometer for the assessment of physical activity. Med Sci Sports Exerc. 2004 Mar;36(3):518–524. doi: 10.1249/01.mss.0000117158.14542.e7. [DOI] [PubMed] [Google Scholar]
- 10.Stanley JC. Reliability. In: Thorndike RL, editor. Educational Measurement. Washington, DC: American Council of Education; 1971. p. 395. [Google Scholar]
- 11.Svetkey LP, Stevens VJ, Brantley PJ, Appel LJ, Hollis JF, Loria CM, Vollmer WM, Gullion CM, Funk K, Smith P, Samuel-Hodge C, Myers V, Lien LF, Laferriere D, Kennedy B, Jerome GJ, Heinith F, Harsha DW, Evans P, Erlinger TP, Dalcin AT, Coughlin J, Charleston J, Champagne CM, Bauck A, Ard JD, Aicher K Weight Loss Maintenance Collaborative Research Group. Comparison of strategies for sustaining weight loss: The weight loss maintenance randomized controlled trial. JAMA. 2008 Mar 12;299(10):1139–1148. doi: 10.1001/jama.299.10.1139. [DOI] [PubMed] [Google Scholar]
- 12.Troiano RP. A timely meeting Objective measurement of physical activity. Med Sci Sports Exerc. 2005 Nov;37(11 Suppl):S487–S489. doi: 10.1249/01.mss.0000185473.32846.c3. [DOI] [PubMed] [Google Scholar]
- 13.Trost SG, McIver KL, Pate RR. Conducting accelerometer-based activity assessments in field-based research. Med Sci Sports Exerc. 2005 Nov;37(11 Suppl):S531–S543. doi: 10.1249/01.mss.0000185657.86065.98. [DOI] [PubMed] [Google Scholar]
- 14.Trost SG, Pate RR, Freedson PS, Sallis JF, Taylor WC. Using objective physical activity measures with youth: How many days of monitoring are needed? Med Sci Sports Exerc. 2000 Feb;32(2):426–431. doi: 10.1097/00005768-200002000-00025. [DOI] [PubMed] [Google Scholar]
- 15.Tudor-Locke C, Ainsworth BE, Thompson RW, Matthews CE. Comparison of pedometer and accelerometer measures of free-living physical activity. Med Sci Sports Exerc. 2002 Dec;34(12):2045–2051. doi: 10.1097/00005768-200212000-00027. [DOI] [PubMed] [Google Scholar]
- 16.Welk GJ. Principles of design and analyses for the calibration of accelerometry-based activity monitors. Med Sci Sports Exerc. 2005 Nov;37(11 Suppl):S501–S511. doi: 10.1249/01.mss.0000185660.38335.de. [DOI] [PubMed] [Google Scholar]
- 17.Welk GJ, Schaben JA, Morrow JR., Jr Reliability of accelerometry-based activity monitors: A generalizability study. Med Sci Sports Exerc. 2004 Sep;36(9):1637–1645. [PubMed] [Google Scholar]