Skip to main content
American Journal of Epidemiology logoLink to American Journal of Epidemiology
. 2014 Jul 19;180(4):424–435. doi: 10.1093/aje/kwu150

The Sedentary Time and Activity Reporting Questionnaire (STAR-Q): Reliability and Validity Against Doubly Labeled Water and 7-Day Activity Diaries

Ilona Csizmadi *, Heather K Neilson, Karen A Kopciuk, Farah Khandwala, Andrew Liu, Christine M Friedenreich, Yutaka Yasui, Rémi Rabasa-Lhoret, Heather E Bryant, David C W Lau, Paula J Robson
PMCID: PMC4128771  PMID: 25038920

Abstract

We determined measurement properties of the Sedentary Time and Activity Reporting Questionnaire (STAR-Q), which was designed to estimate past-month activity energy expenditure (AEE). STAR-Q validity and reliability were assessed in 102 adults in Alberta, Canada (2009–2011), who completed 14-day doubly labeled water (DLW) protocols, 7-day activity diaries on day 15, and the STAR-Q on day 14 and again at 3 and 6 months. Three-month reliability was substantial for total energy expenditure (TEE) and AEE (intraclass correlation coefficients of 0.84 and 0.73, respectively), while 6-month reliability was moderate. STAR-Q-derived TEE and AEE were moderately correlated with DLW estimates (Spearman's ρs of 0.53 and 0.40, respectively; P < 0.001), and on average, the STAR-Q overestimated TEE and AEE (median differences were 367 kcal/day and 293 kcal/day, respectively). Body mass index-, age-, sex-, and season-adjusted concordance correlation coefficients (CCCs) were 0.24 (95% confidence interval (CI): 0.07, 0.36) and 0.21 (95% CI: 0.11, 0.32) for STAR-Q-derived versus DLW-derived TEE and AEE, respectively. Agreement between the diaries and STAR-Q (metabolic equivalent-hours/day) was strongest for occupational sedentary time (adjusted CCC = 0.76, 95% CI: 0.64, 0.85) and overall strenuous activity (adjusted CCC = 0.64, 95% CI: 0.49, 0.76). The STAR-Q demonstrated substantial validity for estimating occupational sedentary time and strenuous activity and fair validity for ranking individuals by AEE.

Keywords: motor activity, physical activity, reproducibility of results, sedentary lifestyle, questionnaires, validation studies


As the need for comprehensive assessments of daily physical activity has grown, the limitations of existing methods have become apparent (1, 2). Many physical activity questionnaires have been developed and validated, but most are not well suited for the study of new behavioral parameters of interest, such as activity energy expenditure (AEE). In a review of physical activity questionnaires that were validated against doubly labeled water (DLW)—the gold standard for estimating total energy expenditure (TEE) in free-living individuals—we reported that overall, questionnaire performance in estimating AEE was poor when compared with DLW (3). However, none of the questionnaires were originally designed to estimate AEE.

While objective measures continue to evolve in precision and accuracy, they are limited as “stand-alone” measures when contextual information pertaining to habitual behavior is desired (4). It is generally agreed that, for at least the foreseeable future, there will continue to be a need for logistically feasible, cost-effective tools with known validity that can assess a broad range of habitual behaviors in large populations (4, 5).

We designed the Sedentary Time and Activity Reporting Questionnaire (STAR-Q), a past-month activity questionnaire, to address the need for a self-administered questionnaire that could assess all types and intensities of physical activity, including sedentary behavior, across all activity domains for the purpose of ascertaining AEE among adults in large-scale epidemiologic studies. Following several phases of development and cognitive testing (6), the next step was to determine the STAR-Q's validity and reliability. We assessed validity by comparing STAR-Q-derived AEE and TEE estimates with those derived using DLW. We also compared activities reported in the STAR-Q with intensity- and domain-specific activities reported prospectively in a 7-day activity diary. In addition, we assessed the reliability of the STAR-Q for estimating AEE by administering it on 3 occasions.

METHODS

The STAR-Q

The STAR-Q (see Web Appendix 1, available at http://aje.oxfordjournals.org/) is a quantitative, self-administered past-month questionnaire designed to capture information on all types of activities and sedentary behaviors across all domains: eating, personal/medical care, occupation, transportation, household, yard work, caregiving, exercise, light leisure (e.g., television-watching, personal computer time, reading, hobbies, etc.), stair-climbing, and “other” activities. Development and cognitive testing of the STAR-Q have been previously described (6). Briefly, detailed physical activity data from 18,838 participants enrolled in the Alberta Tomorrow Project, a Canadian cohort study (7), were used to identify and group common activities with similar energy expenditures to create activity-related questionnaire items. Open-ended reporting of some activities was also allowed on the STAR-Q.

Study participants

Study participants comprised 1) urban residents of the Calgary, Alberta, Canada, area who were recruited through posters and worksite e-mail distribution lists and 2) participants in the Alberta Tomorrow Project. The Alberta Tomorrow Project is a study of a large, geographically dispersed cohort established in 2001 to examine the relationship between lifestyle and chronic diseases in Alberta (7). Invitations to participate in the current study were mailed to Project participants who met eligibility criteria for the current study, based on characteristics reported upon enrollment in the Project. Eligibility criteria were as follows: body mass index (weight (kg)/height (m)2) ≤35, weight stability (±2.5 kg for at least 3 months) with no intention to gain/lose weight in the near future, not being pregnant or breastfeeding, absence of metabolic disorders (e.g., diabetes, thyroid), and no use of medications that modified water balance. In addition, we set the upper age limit for eligibility at 60 years to avoid incomplete collections of urine due to bladder urine retention. The minimum age of eligibility for participants recruited from the community (i.e., the urban Calgary sample) was set at 30 years, which is comparable to the Alberta Tomorrow Project (age 35 years). All participants provided written informed consent, and ethical approval was obtained from the Alberta Cancer Research Ethics Committee of Alberta Health Services and the Conjoint Health Research Ethics Board of the University of Calgary. Participants received Can$100 upon study completion.

Reliability study design

To assess the consistency of the STAR-Q for estimating energy expenditure over a longer duration, participants completed the STAR-Q 3 times over a 6-month period (Figure 1). The first STAR-Q (STAR-Q1) was completed 14 days after DLW dosing (day 0). The second and third STAR-Qs (STAR-Q2 and STAR-Q3) were completed approximately 3 and 6 months after day 14, respectively. STAR-Q1 was self-administered at the test facility, and following completion, responses were reviewed for completeness and clarity of the text. STAR-Q2 and STAR-Q3 were also self-administered but were sent to participants and returned by mail.

Figure 1.

Figure 1.

Overview of the study design for a study conducted to validate the Sedentary Time and Activity Reporting Questionnaire (STAR-Q), Alberta, Canada, 2009–2011. Numbers along the horizontal axis indicate the number of months relative to study baseline (indicated as “0”). The reliability study occurred across months 0, 3, and 6, comparing responses between the first, second, and third administrations of the STAR-Q (STAR-Q1, STAR-Q2, and STAR-Q3). AEE, activity energy expenditure; DLW, doubly labeled water; TEE, total energy expenditure.

AEE estimation from the STAR-Q

Responses on the STAR-Q were used to estimate AEESTAR-Q, TEESTAR-Q, energy expenditure in specific domains, and daily numbers of hours spent in intensity-specific activities and in sedentary behavior. All self-reported activities were assigned activity codes and metabolic equivalent of task (MET) values from the Compendium of Physical Activities (8, 9). Where total reported time spent in physical activity, sedentary behavior, and sleep was less than 24 hours/day, a MET value of 1.2 was assigned to the unaccounted-for time. Reported frequency, duration, and intensity of activities were combined to obtain a single estimate of energy expenditure in MET-hours/day for each activity. Activity-specific estimates of MET-hours/day were then summed across all activities for each subject to obtain the total average energy cost in MET-hours/day. Those estimates were subsequently applied to our estimations of average past-month TEESTAR-Q and AEESTAR-Q as follows:

TEESTAR-Q=[(0.9METs×hoursofsleep)+(MET-hours/day×weight(kg))]×1.1

and

AEESTAR-Q=[MET-hours/day×weight(kg)][1.0METs×weight(kg)×totalhoursofactivity/day],

where TEESTAR-Q was estimated from reported nightly hours of sleep from the STAR-Q multiplied by 0.9 METs (average energy expenditure during sleep (8, 9)) and added to total daily MET-hours during time spent awake multiplied by weight (in kilograms) measured at baseline. An additional 10% was added for energy expenditure attributed to the thermic effect of food. AEESTAR-Q was estimated from total daily MET-hours accumulated during waking time multiplied by weight (in kilograms) measured at baseline, from which energy expenditure attributed to resting metabolism (1.0 METs × weight × activity hours/day) was subtracted.

Finally, estimates of amounts of time spent in each activity domain and at various intensities—sedentary (≤1.5 METs), light (>1.5–≤3.0 METs), moderate (>3.0–≤6.0 METs), and vigorous (>6.0 METs)—were derived. We also determined AEE per kilogram of body weight (kcal/kg·day) for DLW- and STAR-Q-derived estimates in an effort to account for the known tendency of body mass or lean mass to inflate the association between questionnaire-derived AEE and DLW-derived AEE (10, 11).

Validation study design

An overview of the 21-day validation study protocol is provided in Figure 1. The DLW protocol was completed over a period of 14 days, with the STAR-Q1 being completed on day 14. Participants were instructed to begin prospectively recording daily activities in the 7-day diary on day 15. The relative timing of the STAR-Q1 and diary administrations was intended to minimize learning effects and altered awareness of behavior that might have occurred had the diaries been completed first.

7-Day activity diary

A 7-day activity diary was employed to compare domain- and intensity-specific AEE and duration estimates, respectively, with those of STAR-Q1. The 7-day diary was adapted from physical activity records described by Conway et al. (12) and was designed to ascertain sleep time as well as all activities engaged in (of ≥10 minutes duration and those requiring a change in physical effort) and posture while awake. Respondents recorded the time at which each activity began, described the activity, and selected one of 10 listed domains and subdomains (corresponding closely to the STAR-Q), posture (reclining, sitting, standing, walking, or “in motion” (e.g., swimming, cycling, dancing)), and the perceived intensity of the activity (effort level 1, 2, 3, or 4; definitions were provided). Participants also reported stair-climbing (number of flights up and down). Completed diaries were returned to the test center by mail. Summary estimates of energy expenditure were derived for the 7-day diaries using reduction procedures as described for the STAR-Q.

Anthropometric measures and resting metabolic rate

Height and weight were measured at the test facility to the nearest 0.1 cm and 0.1 kg, respectively, on days 0 and 14 of the DLW protocol by a certified exercise physiologist using standard procedures (13). The TBF-310 Tanita Body Composition Analyzer and Scale (Tanita Corporation of America, Inc., Preston, Washington (TheCompetitiveEdge.com)) was used to measure body weight and to estimate percentage of body fat. Body mass index was determined as body weight (kg) divided by height (m) squared. Resting metabolic rate was estimated by means of the Schofield equation, which takes into account age, height, and weight (14).

DLW protocol

A 14-day DLW study was used as the primary criterion method against which AEESTAR-Q and TEESTAR-Q estimates were validated (Figure 1). Following an overnight fast, participants provided baseline urine and saliva samples for determination of background isotope levels (day 0). DLW doses were administered orally to provide each subject with 2.5 g 10 atom % of oxygen-18 (18O) per kg of total body water and 0.18 g 99 atom % of deuterium (2H) per kg of estimated total body water (Rotem Inc., Topsfield, Massachusetts). Each participant was asked to abstain from food and fluid intake for 4 hours after dosing. Postdose saliva samples were collected at 3 and 4 hours for the measurement of total body water from deuterium-isotope dilution. The Canadian Diet History Questionnaire (15), a food frequency questionnaire that ascertains past-year dietary intake, was completed on day 0 for food quotient estimation (proxy for respiratory quotient). Second-void urine samples were collected on days 1 and 8, with sampling time recorded on a time sheet and stored in a refrigerator in a 125-mL container until pick-up by study staff within the next day. On day 14, following an overnight fast, participants collected the final urine sample and returned to the test center for dosing with 0.18 g 99 atom % deuterium per kg of total body water, followed by postdose saliva sample collections as described for day 0. To measure the decline in isotope enrichment, samples were batch-analyzed in duplicate using the Isoprime Stable Isotope Ratio Mass Spectrometer (Isoprime Ltd., Cheadle Hulme, United Kingdom). A MultiFlow-Bio module for Isoprime (Isoprime Ltd.) equipped with a Gilson 222XL Autosampler (GV Instruments Ltd., Manchester, United Kingdom) was used for daily energy expenditure measurements. Data processing was performed using IonVantage software (2012) for Isoprime (Isoprime Ltd.). Stability tests were performed each day before testing, yielding standard deviations of 0.026% for deuterium and 0.004% for oxygen-18.

TEE was calculated according to the method of Racette et al. (16), using a modified Weir equation and an assumed respiratory quotient of 0.85. In addition, a food quotient (proxy for respiratory quotient) was estimated from the Diet History Questionnaire (15). AEE was subsequently derived from TEE as follows:

AEEDLW=[TEEDLW×0.90]{[(RMR/24)×(24hoursofsleep)]+[0.9(RMR/24)×hoursofsleep]},

where RMR represents resting metabolic rate. As with the self-reported data, resting metabolic rate was estimated using the Schofield equation (14). The thermic effect of food (10%) was subtracted from TEE, and resting metabolic rate was subtracted after accounting for a 10% reduction in energy expenditure during the self-reported sleep hours from STAR-Q1.

Statistical analysis

Baseline characteristics were examined for participants who did not return a STAR-Q or had implausibly high STAR-Q-derived AEE levels. Their characteristics were compared with those of the rest of the group.

Reliability study

Descriptive statistics were calculated for all derived variables. Agreement across STAR-Q administrations was assessed using intraclass correlation coefficients (ICCs) based on 2-way mixed models without interaction (17), treating subjects as random effects and the activity assessment methods as fixed effects. Concordance correlation coefficients (CCCs) were estimated using variance components of a 2-way mixed-effects model (18). CCCs are practically and asymptotically equivalent to ICCs but do not require analysis-of-variance model assumptions, can be adjusted for potential covariates, and can measure 3-way concordance (e.g., STAR-Q1 vs. STAR-Q2 vs. STAR-Q3). Adjustment was made for baseline age, measured body mass index, and sex, since measurement error can vary with these characteristics (1921), and also for season (dichotomized: May–October, November–April), which can influence physical activity behavior (22, 23). Logarithmic transformations were carried out for all variables demonstrating marked departures from normality (indicated in the Results section), and all zero values were converted to 0.1. For assessment of the internal validity of the data, 1,000 bootstrap samples were drawn, providing robust 95% confidence intervals around estimated CCCs.

Validity study

STAR-Q-derived TEESTAR-Q and AEESTAR-Q estimates were compared with those from DLW and 7-day diaries using post hoc Wilcoxon signed-rank tests. Relative agreement was assessed using ICCs based on 2-way mixed models without interaction (17) and CCCs using variance components, reflecting accuracy and precision, in a 2-way mixed-effects model with adjustment for age, sex, and season (18). Bland-Altman plots (24) were used to appraise group-level agreement and to identify trends in under- and overestimation of AEE and TEE by the STAR-Q relative to DLW. The 95% limits of agreement (25) around the mean difference showed with 95% certainty the range in which individual differences would occur for a similar population. Sample size was estimated for 2 mixed-effects analysis-of-variance models, assuming an ICC of at least 0.65 with a 95% confidence interval of ±0.10 (26). We did not experience the possible 20% loss to follow-up or unusable data that we anticipated, so our sample size exceeded our goal of 83 participants. All analyses were conducted with SAS/STAT (version 9.2; SAS Institute, Inc., Cary, North Carolina), PC-SAS (27), and R (version 2.12; R Foundation for Statistical Computing, Vienna, Austria (http://www.R-project.org)) (28). Statistical significance tests were 2-sided.

RESULTS

Participation

Between July 2009 and July 2010, 106 participants were recruited; data collection was completed by January 2011. One participant was excluded based on illegible STAR-Q responses; 5 did not return STAR-Q2s, and 5 did not return STAR-Q3s. Others were excluded based on a priori criteria for implausible values (AEE >3,600 kcal/day). Final sample sizes were 102 (STAR-Q1), 96 (STAR-Q2), and 97 (STAR-Q3). Estimated CCCs comparing all 3 STAR-Qs for reliability included only those participants with data for all 3 STAR-Qs (n = 91). Nine participants who did not return at least 1 STAR-Q were similar to the remaining participants in terms of age, body mass index, sex, and percentage of body fat. Persons excluded because of implausibly high AEEs (n = 7) were mostly men (86% men, as opposed to 40% in the analyzed group).

In the validation study, of 102 participants with valid STAR-Q1 data, 1 did not return a diary, 1 submitted a diary with insufficient detail for activity estimation, and 1 was excluded based on a diary-estimated AEE greater than 3,600 kcal/day. Two DLW measures were excluded because of incomplete equilibration and high baseline urine isotope levels. Thus, 99 and 100 subjects were included in the diary and DLW comparisons, respectively.

On average, study participants were middle-aged, Caucasian, employed, and well-educated (Table 1). Although participants represented a range of physical activity levels using Institute of Medicine (29) criteria and DLW estimates, TEEDLW showed that the majority were “active” (physical activity level 1.60–<1.90) or “very active” (physical activity level ≥1.90).

Table 1.

Characteristics of Participants in the STAR-Q Validation Study Who Completed STAR-Q1,a Alberta, Canada, 2009–2011

Males (n = 41)b
Females (n = 61)b
Mean (SD) Range No. % Mean (SD) Range No. %
Age, years 50.6 (6.9) 33–60 46.0 (8.6) 30–59
Body mass indexc 25.9 (3.1) 19.9–33.4 23.5 (2.9) 16.8–30.3
% body fat (BIA) 21.8 (6.2) 10.9–34.0 30.7 (6.5) 14.3–44.9
Predicted RMR, kcal/dayd 1,787 (132) 1,533–2,035 1,380 (84) 1,184–1,705
Physical activity levele 1.8 (0.4) 1.2–2.9 1.9 (0.4) 1.1–3.1
Education
 Less than bachelor's degree 19 46.3 20 32.8
 Bachelor's degree or higher 18 43.9 31 50.8
 Unknown/missing data 4 9.8 10 16.4
Ethnicity
 Caucasian 35 85.4 44 72.1
 Asian 2 4.9 2 3.3
 Other 0 0.0 2 3.3
 Unknown/missing data 4 9.8 13 21.3
Paid or volunteer work
 STAR-Q1 (n = 102) 38 92.7 57 93.4
 STAR-Q2 (n = 96) 35 89.7 53 93.0
 STAR-Q3 (n = 97) 36 90.0 51 89.5
Physical activity level (n = 101)e
 <1.40 6 14.6 5 8.3
 1.40–<1.60 3 7.3 6 10.0
 1.60–<1.90 17 41.5 23 38.3
 ≥1.90 15 36.6 26 43.3

Abbreviations: BIA, bioelectrical impedance analysis; DLW, doubly labeled water; RMR, resting metabolic rate; STAR-Q, Sedentary Time and Activity Reporting Questionnaire.

a The first STAR-Q (STAR-Q1) was completed 14 days after DLW dosing (day 0).

b Sample size at STAR-Q1.

c Weight (kg)/height (m)2.

d Estimated using the Schofield equation (14).

e Physical activity level = total energy expenditure derived from DLW divided by predicted RMR (n = 101 based on available DLW results).

The median self-reported time required for STAR-Q1 completion was 34 minutes. The proportions of STAR-Qs completed during the warmer months (May–October) were 45.7% (STAR-Q1), 53.0% (STAR-Q2), and 43.0% (STAR-Q3).

Estimated energy expenditure

Numbers of hours per day of self-reported activity (Table 2 and Web Table 1) were reasonable, on average. However, median physical activity levels derived from the STAR-Qs and diary were relatively high (approximately 2.0, or “very active” based on Institute of Medicine criteria, in comparison with the DLW-derived average physical activity level of 1.8 (“active” using Institute of Medicine criteria)). TEEDLW estimation using a respiratory quotient of 0.85 or a Diet History Questionnaire-estimated food quotient were similar, with median TEEs differing by less than 20 kcal/day (data not shown); hence, we assumed a respiratory quotient of 0.85 in all subsequent TEEDLW analyses.

Table 2.

Hours of Activity and Energy Expenditure (Median and Interquartile Range) Derived From STAR-Q Responses, a 7-Day Diary, and Doubly Labeled Water in the STAR-Q Validation Study, Alberta, Canada, 2009–2011

Assessment Methoda
STAR-Q1 (n = 102) STAR-Q2 (n = 96) STAR-Q3 (n = 97) 7-Day Diary (n = 103) DLW (n = 103)
Total hours/day, including sleep 21.2 (4.5) 21.9 (4.7) 21.5 (5.4) 24.0 (0.2)b
Total activity, MET-hours/dayc 30.0 (12.7) 31.5 (15.0) 31.7 (14.2) 33.6 (7.3)b
AEE, kcal/dayc 1,202 (628) 1,242 (697) 1,281 (643) 1,236 (575) 1,016 (661)b
AEE, kcal/kg·dayc 16.9 (7.2) 18.0 (10.5) 18.6 (10.1) 18.0 (7.4) 13.7 (7.1)b
TEE, kcal/day 3,238 (1,273) 3,235 (1,078) 3,261 (1,105) 3,231 (1,104) 2,808 (1,038)b
Physical activity leveld 2.0 (0.5) 2.0 (0.5) 2.1 (0.6) 2.1 (0.4) 1.8 (0.3)b

Abbreviations: AEE, activity energy expenditure; DLW, doubly labeled water; MET, metabolic equivalent of task; RMR, resting metabolic rate; STAR-Q, Sedentary Time and Activity Reporting Questionnaire; TEE, total energy expenditure.

a STAR-Q1, STAR-Q2, and STAR-Q3 represent the first, second, and third administrations of the STAR-Q.

b The median difference between STAR-Q1 and the reference method (i.e., diary or DLW) was significantly different from zero (P < 0.001, 2-sided test) in a Wilcoxon signed-rank test.

c Excludes sleep.

d TEE/predicted RMR.

Reliability

ICCs and CCCs for kcal/day and MET-hours/day are presented in Table 3. Generally the ICCs for STAR-Q1 versus STAR-Q2 (completed 3 months apart) were higher than ICCs for STAR-Q1 versus STAR-Q3 (completed 6 months apart). When comparing STAR-Q1 with STAR-Q2, good-to-excellent ICCs (>0.70) were observed for TEE, AEE, sleep, overall occupational activity, occupational sedentary behavior, and television-watching. Remaining activities were associated with fair-to-moderate ICCs. Generally, adjusted CCCs (CCCadj) were lower than ICCs, but associations remained moderate to good (approximately 0.60–0.70) for TEE, AEE, sleep, overall occupational activity, occupational sedentary behavior, and television-watching. Notable differences between unadjusted and adjusted CCCs were found for TEE and AEE; covariate adjustment otherwise had little impact.

Table 3.

Intraclass and Concordance Correlation Coefficients (MET-hours/day) for Reliability Assessments in the STAR-Q Validation Study, Alberta, Canada, 2009–2011a

STAR-Q1 vs. STAR-Q2 (n = 95)
STAR-Q1 vs. STAR-Q3 (n = 96)
STAR-Q1 vs. STAR-Q2 vs. STAR-Q3 (n = 91)
Unadjusted CCCb 95% CIc Adjusted CCCd 95% CIc
ICC 95% CI ICC 95% CI
TEE, kcal/daye 0.84 0.77, 0.89 0.74 0.64, 0.82 0.75 0.64, 0.82 0.62 0.45, 0.73
AEE, kcal/daye 0.73 0.62, 0.81 0.60 0.46, 0.71 0.61 0.48, 0.71 0.57 0.40, 0.69
Sleepingf 0.79 0.70, 0.85 0.69 0.57, 0.78 0.73 0.61, 0.81 0.72 0.61, 0.80
Stair-climbing, flights/dayg 0.45 0.28, 0.60 0.44 0.27, 0.59 0.47 0.31, 0.61 0.48 0.34, 0.58
Active sittingh 0.45 0.28, 0.60 0.46 0.29, 0.60 0.44 0.30, 0.63 0.45 0.28, 0.58
Overall activityi
 Sedentary 0.53 0.37, 0.66 0.45 0.28, 0.59 0.50 0.40, 0.60 0.50 0.37, 0.59
 Lighte 0.60 0.46, 0.71 0.55 0.40, 0.67 0.60 0.45, 0.71 0.55 0.40, 0.64
 Moderatee 0.45 0.28, 0.60 0.51 0.35, 0.64 0.40 0.29, 0.52 0.41 0.27, 0.55
 Strenuouse 0.65 0.52, 0.75 0.52 0.36, 0.65 0.61 0.48, 0.73 0.60 0.40, 0.69
Exercisee,i 0.63 0.49, 0.74 0.56 0.41, 0.68 0.61 0.48, 0.72 0.61 0.48, 0.70
 Lighte 0.44 0.26, 0.59 0.53 0.37, 0.66 0.47 0.30, 0.61 0.47 0.30, 0.59
 Moderatee 0.42 0.24, 0.57 0.35 0.16, 0.51 0.31 0.19, 0.44 0.33 0.17, 0.43
 Strenuouse 0.59 0.44, 0.71 0.44 0.27, 0.59 0.54 0.43, 0.68 0.55 0.40, 0.66
Occupational activitye,i 0.76 0.66, 0.83 0.62 0.48, 0.73 0.73 0.60, 0.85 0.70 0.54, 0.81
 Sittingj 0.69 0.57, 0.78 0.69 0.57, 0.78 0.70 0.61, 0.79 0.67 0.57, 0.76
 Sedentary 0.71 0.60, 0.80 0.72 0.61, 0.80 0.72 0.61, 0.81 0.69 0.60, 0.77
 Lighte 0.65 0.52, 0.75 0.45 0.28, 0.59 0.57 0.45, 0.69 0.55 0.41, 0.67
 Moderatee 0.44 0.26, 0.59 0.46 0.29, 0.60 0.45 0.24, 0.60 0.46 0.27, 0.59
Light leisure activitye 0.65 0.52, 0.75 0.33 0.14, 0.50 0.54 0.38, 0.67 0.53 0.36, 0.64
 Television-viewing 0.72 0.61, 0.80 0.63 0.49, 0.74 0.69 0.55, 0.79 0.68 0.51, 0.77
 Computer usee 0.60 0.46, 0.71 0.62 0.48, 0.73 0.60 0.49, 0.70 0.61 0.46, 0.71
 Reading 0.56 0.41, 0.68 0.39 0.21, 0.55 0.44 0.26, 0.56 0.41 0.26, 0.52

Abbreviations: AEE, activity energy expenditure; CCC, concordance correlation coefficient; CI, confidence interval; DLW, doubly labeled water; ICC, intraclass correlation coefficient; MET, metabolic equivalent of task; STAR-Q, Sedentary Time and Activity Reporting Questionnaire; TEE, total energy expenditure.

a STAR-Q1, STAR-Q2, and STAR-Q3 represent the first, second, and third administrations of the STAR-Q.

b Not adjusted for any variable.

c 95% CIs were estimated by means of bootstrapping for both adjusted and unadjusted CCCs.

d Adjusted for sex, age, and body mass index at baseline, as well as season of STAR-Q completion.

e Log-transformed data.

f Excludes napping.

g Results are shown for climbing up stairs (self-reported number of flights per day); results for going down stairs were very similar.

h Represents the sum of all activities performed while sitting and expending energy at >1.5–≤2.5 METs.

i Intensity categories were defined as follows: sedentary (excluding sleep), ≤1.5 METs; light, >1.5–≤3 METs; moderate, >3–≤6 METs; strenuous, >6 METs. Strenuous occupational activity was not analyzable because of insufficient data.

j MET level ≤ 2.5.

Validity

STAR-Q1 versus 7-day activity diary

On average, the diary numbers of hours per day closely approximated a 24-hour period, while the STAR-Q underestimated hours per day by just over 2 hours (Table 2). Reporting on sedentary time probably contributed to this difference (Figure 2). Additionally, total occupational activity, occupational light activity, total light leisure activity, and sedentary behavior during light leisure activity accounted for less time using the diary (Figure 3).

Figure 2.

Figure 2.

Self-reported mean duration (hours/day) of active and inactive behavior by intensity in the Sedentary Time and Activity Reporting Questionnaire (STAR-Q) validation study, Alberta, Canada, 2009–2011. STAR-Q1, STAR-Q2, and STAR-Q3 represent the first, second, and third administrations of the STAR-Q.

Figure 3.

Figure 3.

Self-reported mean duration (hours/day) of active and inactive behavior by domain in the Sedentary Time and Activity Reporting Questionnaire (STAR-Q) validation study, Alberta, Canada, 2009–2011. STAR-Q1, STAR-Q2, and STAR-Q3 represent the first, second, and third administrations of the STAR-Q.

Results of the STAR-Q1-versus-diary comparison are presented in Table 4. For TEE and AEE, negligible median differences were observed at the group level; however, larger interquartile ranges indicated high between-subject variability. Agreement and consistency for TEE, AEE, and AEE/kg body weight were moderate to good depending on the test statistic; Spearman correlations were relatively high (ρs = 0.74, ρs = 0.61, and ρs = 0.45, respectively), whereas CCCadj values were lower (0.51, 0.45, and 0.41). Results for overall occupational activity, occupational sitting, and occupational sedentary time were consistently high, with CCCadj values of approximately 0.70, and CCCadj was 0.64 for overall strenuous activity (hours/day). In contrast, consistency between STAR-Q1- and diary-derived estimates for light activities were poor (CCCadj < 0.30) overall and within occupation and exercise domains. Adjusting CCCs for covariates had the strongest effect for TEE and AEE but an otherwise negligible impact.

Table 4.

Agreement and Correlations (MET-hours/day) Between STAR-Q1a and 7-Day Diary Activity, by Intensityb and Domain, in a Validity Assessment of the STAR-Q (n = 99), Alberta, Canada, 2009–2011

Median Difference, STARQ1 – Diary (IQR) Spearman Correlation Coefficient (rs)c ICC 95% CI Unadjusted CCCd 95% CIe Adjusted CCCf 95% CIe
TEE, kcal/dayg −98.2 (615.2) 0.74 0.73 0.62, 0.81 0.73 0.61, 0.81 0.51 0.34, 0.63
AEE, kcal/dayg,h −81.7 (540.5) 0.61 0.55 0.40, 0.67 0.55 0.36, 0.73 0.45 0.17, 0.63
AEE, kcal/kg·dayg,h −1.3 (7.8) 0.45 0.42 0.25, 0.57 0.42 0.24, 0.60 0.41 0.15, 0.59
Sleepingi −0.8 (0.8) 0.62 0.18 −0.02, 0.36 0.37 0.24, 0.49 0.37 0.23, 0.48
Stair-climbing, flights/dayj 0.1 (8.5) 0.41 0.37 0.19, 0.53 0.36 0.16, 0.52 0.36 0.14, 0.52
Active sittingg,k 0.0 (0.4) 0.30l 0.30 0.11, 0.47 0.32 0.14, 0.47 0.33 0.15, 0.49
Overall activityb
 Sedentary −3.9 (4.9) 0.40 0.12 −0.08, 0.31 0.26 0.11, 0.38 0.27 0.12, 0.39
 Lightg 0.6 (7.3) 0.26m 0.26 0.07, 0.43 0.29 0.07, 0.47 0.22 0.00, 0.41
 Moderateg 0.2 (5.0) 0.57 0.49 0.33, 0.63 0.50 0.32, 0.64 0.50 0.29, 0.61
 Strenuousg 0.0 (3.3) 0.68 0.65 0.52, 0.75 0.66 0.54, 0.79 0.64 0.49, 0.76
Exercise, sports, and leisure activityb,g −0.2 (4.2) 0.47 0.45 0.28, 0.59 0.43 0.24, 0.59 0.42 0.24, 0.58
 Lightg 0.0 (0.3) 0.21n 0.27 0.08, 0.44 0.28 0.06, 0.49 0.29 0.05, 0.49
 Moderateg 0.0 (0.8) 0.35 0.36 0.18, 0.52 0.35 0.13, 0.53 0.35 0.13, 0.54
 Strenuousg 0.0 (3.4) 0.49 0.48 0.31, 0.62 0.48 0.33, 0.62 0.48 0.29, 0.63
Occupational activityb,g 1.4 (3.6) 0.71 0.70 0.58, 0.79 0.71 0.53, 0.85 0.69 0.48, 0.84
 Sittingo 0.3 (2.2) 0.75 0.70 0.58, 0.79 0.71 0.59, 0.82 0.69 0.56, 0.81
 Sedentary 0.1 (1.8) 0.76 0.75 0.65, 0.82 0.75 0.63, 0.85 0.76 0.64, 0.85
 Lightg 0.6 (3.0) 0.24p 0.08 −0.12, 0.27 0.19 0.04, 0.36 0.16 0.00, 0.31
 Moderateg 0.0 (0.3) 0.30l 0.50 0.34, 0.63 0.46 0.19, 0.65 0.46 0.21, 0.64

Abbreviations: AEE, activity energy expenditure; CCC, concordance correlation coefficient; CI, confidence interval; ICC, intraclass correlation coefficient; IQR, interquartile range; MET, metabolic equivalents of task; STAR-Q, Sedentary Time and Activity Reporting Questionnaire; TEE, total energy expenditure.

a The first STAR-Q (STAR-Q1) was completed 14 days after doubly labeled water dosing (day 0).

b Intensity categories were defined as follows: sedentary (excluding sleep), ≤1.5 METs; light, >1.5–≤3 METs; moderate, >3–≤6 METs; strenuous, >6 METs. Strenuous occupational activity was not analyzable because of insufficient data.

c P < 0.001 (2-sided t test) for Spearman correlations unless otherwise indicated.

d Not adjusted for any variable.

e 95% CIs were estimated by means of bootstrapping for both adjusted and unadjusted CCCs.

f Adjusted for sex, age, and body mass index at baseline, as well as season of STAR-Q completion.

g Data were log-transformed for ICC and CCC analyses.

h Excludes sleep.

i Excludes napping.

j Results are shown for climbing up stairs (self-reported number of flights per day); results for going down stairs were very similar.

k Represents the sum of all activities performed while sitting and expending energy at >1.5–≤2.5 METs.

l P = 0.003.

m P = 0.009.

n P = 0.04.

o MET level ≤ 2.5.

p P = 0.02.

STAR-Q1 versus DLW

On average, the STAR-Q1 significantly overestimated energy expenditure relative to DLW (P < 0.001; Wilcoxon signed-rank test), with a median difference (STAR-Q1 minus DLW) of 367 kcal/day (interquartile range, 800 kcal/day) for TEE and a median difference of 293 kcal/day (interquartile range, 769 kcal/day) for AEE. Bland-Altman plots (Figure 4) revealed significant inverse trends in which overestimation of energy expenditure occurred for persons with the lowest TEEDLW (Figure 4A) or AEEDLW (Figure 4B), while for persons with the highest TEEDLW or AEEDLW, energy expenditure was underestimated by the STAR-Q. This negative trend was most pronounced for AEE/kg body weight (ρs = −0.61, P < 0.001; Figure 4C).

Figure 4.

Figure 4.

Bland-Altman plots illustrating agreement with doubly labeled water (DLW) in the Sedentary Time and Activity Reporting Questionnaire (STAR-Q) validation study, Alberta, Canada, 2009–2011. Part A (validity comparison for total energy expenditure (TEE) (kcal/day); n = 100) depicts the relationship between the difference between measures and the DLW measure (Spearman correlation coefficient (ρs) = −0.25, P = 0.012; 2-sided t test); the mean difference of 464.48 kcal/day and the 95% limits of agreement (LOA) (−1,116.20, 2,045.16) are depicted by 3 horizontal lines. Part B (validity comparison for activity energy expenditure (AEE) (kcal/day); n = 100) depicts the relationship between the difference between measures and the DLW measure (ρs = −0.50, P < 0.001; 2-sided t test); the mean difference of 310.62 kcal/day and the 95% LOA (−867.37, 1,488.60) are depicted by 3 horizontal lines. Part C (validity comparison for AEE (kcal/kg·day); n = 100) depicts the relationship between the difference between measures and the DLW measure (ρs = −0.61, P < 0.001; 2-sided t test); the mean difference of 4.15 kcal/kg·day and the 95% LOA (−12.86, 21.17) are depicted by 3 horizontal lines.

Compared with DLW estimates, TEESTAR-Q1 was moderately correlated (ρs = 0.53, P < 0.001), as was AEESTAR-Q1s = 0.40, P < 0.001), while the correlation for body-weight-adjusted AEESTAR-Q1 was slightly lower (ρs = 0.34, P = 0.001). ICCs using log-transformed variables were also fair to moderate, with the ICC for TEE being 0.41 (95% CI: 0.23, 0.56), that for AEE being 0.20 (95% CI: 0.01, 0.38), and that for AEE/kg body weight being 0.13 (95% CI: −0.07, 0.32). CCCs with log-transformed TEE, AEE, and AEE/kg body weight and adjusted for covariates were 0.24 (95% CI: 0.07, 0.36), 0.21 (95% CI: 0.11, 0.32), and 0.19 (95% CI: 0.08, 0.27), respectively.

DISCUSSION

The STAR-Q has reasonable reliability for estimating past-month AEE and, on average, fair validity for ranking individuals by AEE based on Spearman correlation comparisons with DLW. However, DLW comparisons also revealed proportional measurement error, with trends of AEE overreporting by the least active persons and AEE underreporting by the most active persons; hence, validity estimates based on adjusted CCCs were lower. Compared with prospectively completed 7-day activity diaries, the STAR-Q performed particularly well in estimating energy expenditure related to overall occupational activity, occupational sedentary and sitting time, and strenuous activity, but agreement was weaker for light activity.

To our knowledge, only 1 other questionnaire was designed with an objective similar to our own. Besson et al. (30) reported results for validation of the Recent Physical Activity Questionnaire against accelerometry and DLW. Similar to the STAR-Q, this questionnaire ascertains past-month activity and demonstrates substantial validity for ranking individuals on vigorous-intensity activity. Relative to the STAR-Q, Besson et al. described somewhat better validity for ranking individuals on TEE (ρs = 0.67 vs. ρs = 0.53) and AEE/kg body weight (ρs = 0.39 vs. ρs = 0.34) but lower validity for ranking them on sedentary time (ρs = 0.27 vs. ρs = 0.40) (30). In contrast to the STAR-Q, the Recent Physical Activity Questionnaire did not ascertain the number of hours slept per night, the duration of the workweek, or details on household activities.

When comparing reported behaviors with 7-day diaries, the STAR-Q performed well, on average, despite nonoverlapping reference periods. True differences in energy expenditure might have occurred between the two assessment periods, perhaps accounting for some discrepancy between the STAR-Q and the diary. However, areas of substantial agreement between the STAR-Q and diary are informative for guiding the optimal use of the STAR-Q in future studies (e.g., occupational sedentary behavior, strenuous activity). These stronger results were not unexpected, since occupational and strenuous activities have been reliably reported in other studies (30, 31). On the other hand, activities with poor agreement highlight problematic domains and intensities—for example, light-intensity activity that may require alternate assessment methods combining objective and self-reported data.

The strengths of our study relative to past DLW studies (3) include a large sample size, appropriate use of DLW as a criterion method, and the use of multiple test statistics, including the novel use of the CCC. The CCC is a rigorous estimate of agreement that allows for the adjustment of covariates that might contribute to between-subject variability. Adjusted coefficients may be lower than unadjusted estimates; however, the remaining explained variance more clearly reflects the measure of interest (18). For example, a CCCadj of 0.21 for AEE indicates that STAR-Q-derived AEE explained 21% of DLW-derived AEE. To our knowledge, only 2 other DLW validation studies have conducted analyses that might be considered analogous to this. Masse et al. (32) used regression analyses to show that 2 different types of questionnaires (checklist and global), when administered to 260 women, explained 6.4% and 5.0% of DLW-derived AEE, respectively. Neuhouser et al. (20) reported that the Women's Health Initiative Personal Habits Questionnaire, the Arizona Activity Frequency Questionnaire, and the 7-Day Physical Activity Recall, administered to 450 women, explained 7.6%, 4.8%, and 3.4% of DLW-derived AEE, respectively.

While our body mass index-adjusted analysis controlled for variance due to misreporting that may be associated with body mass index, it may not have sufficiently accounted for the known tendency for body mass or lean body mass to inflate the association between questionnaire- and DLW-derived AEE (33). We therefore also divided AEE by kilograms of body weight, but we observed only a negligible decrease in the CCCadj (from 0.21 to 0.19), whereas larger decreases occurred with Spearman correlations (from 0.40 to 0.34) and ICCs (from 0.20 to 0.13). The majority of DLW validation studies published to date have not accounted for these effects of body mass (3).

Our results from the reliability study indicate that at the group level, past-month assessments of AEE can provide reasonable estimates of “usual” AEE. Although we adjusted for the effects of season in our CCC analyses, its impact was found to be negligible; yet we do not know whether true AEE was stable across assessment periods. Fluctuations in physical activity behavior have been associated with variations in temperature and weather conditions (22, 23, 34). It is possible that persons who are active in one season may remain active year-round by participating in season-appropriate activities.

Limitations of our study need to be acknowledged. First, we were unable to measure resting metabolic rate directly and instead used a standard equation to estimate it. While we accounted for body weight, age, and sex, other factors that predict resting metabolic rate (e.g., body composition) could have affected the validity of DLW-derived AEE (since AEEDLW = [TEEDLW × 0.90] – {[(RMR/24) × (24 – hours of sleep)] + [0.9(RMR/24) × hours of sleep]}). Second, we used 7-day diaries as a criterion measure for validating domain- and intensity-specific estimates of AEE and activity, but we acknowledge that the diary method is an imperfect gold standard. Third, these results can only be generalized to middle-aged Canadian adults who are generally educated, employed, and physically active. Fourth, since ICCs are lower when between-subject variability is low relative to measurement error (whether random or systematic) (35), it is possible that the validity and reliability of the STAR-Q were underestimated for some domains and activities with restricted variability (e.g., sleeping, sedentary behavior) (Table 4). Fifth, our analyses did not quantify subject-related sources of measurement error, as has been reported by others (19, 20), since we felt it was beyond the scope of the present study.

In summary, using rigorous methods, we showed that the STAR-Q on average had fair validity for ranking individuals by overall AEE and was reliable over a period of several months. Our study also highlighted systematic measurement error that was proportional to AEE, which should not be ignored in the design and analysis of future etiological studies that use the STAR-Q. The comprehensive design of the STAR-Q allows for the assessment of all domains of activity at various intensities, including sedentary behavior, making it a potentially useful tool for investigating emerging hypotheses related to AEE and health. It is widely acknowledged that traditional self-reported measures will continue to be relied upon in epidemiologic studies, ideally combined with objective measures of activity and energy expenditure. The STAR-Q is designed to meet the needs of these evolving, hybrid research methods.

Supplementary Material

Web Material

ACKNOWLEDGMENTS

Author affiliations: Department of Cancer Epidemiology and Prevention Research, CancerControl Alberta, Alberta Health Services, Calgary, Alberta, Canada (Ilona Csizmadi, Heather K. Neilson, Karen A. Kopciuk, Farah Khandwala, Andrew Liu, Christine M. Friedenreich); Department of Oncology, Faculty of Medicine, University of Calgary, Calgary, Alberta, Canada (Ilona Csizmadi, Karen A. Kopciuk, Christine M. Friedenreich); Department of Community Health Sciences, Faculty of Medicine, University of Calgary, Calgary, Alberta, Canada (Ilona Csizmadi, Christine M. Friedenreich, Heather E. Bryant); Department of Mathematics and Statistics, Faculty of Science, University of Calgary, Calgary, Alberta, Canada (Karen A. Kopciuk); School of Public Health, University of Alberta, Edmonton, Alberta, Canada (Yutaka Yasui); Institut de recherches cliniques de Montréal and Département de nutrition, Faculté de médecine, Université de Montréal, Montréal, Québec, Canada (Rémi Rabasa-Lhoret); Division of Cancer Control, Canadian Partnership Against Cancer, Toronto, Ontario, Canada (Heather E. Bryant); Department of Medicine, Faculty of Medicine, University of Calgary, Calgary, Alberta, Canada (David C. W. Lau); Department of Biochemistry and Molecular Biology, Faculty of Medicine, University of Calgary, Calgary, Alberta, Canada (David C. W. Lau); Department of Cancer Measurement, Outcomes, Research and Evaluation, CancerControl Alberta, Alberta Health Services, Edmonton, Alberta, Canada (Paula J. Robson); and Department of Agricultural, Food and Nutritional Sciences, Faculty of Agricultural, Life and Environmental Sciences, University of Alberta, Edmonton, Alberta, Canada (Paula J. Robson).

This work was supported by the Canadian Institutes of Health Research (grant MOP-86632) and the Alberta Cancer Research Institute (grant 24265).

We acknowledge Ruth Gbewonyo (study coordinator) and Jason Ng (research assistant) of CancerControl Alberta (Alberta Health Services) for their contributions in study coordination, participant recruitment, and data collection; Diane Mignault (Institut de recherches cliniques de Montréal) for performing the stable-isotope analysis; Farit Vakhetov (CancerControl Alberta) for designing and managing the interview, questionnaire, and biosample databases; and the staff of the Alberta Tomorrow Project (CancerControl Alberta) for their assistance with participant recruitment.

Preliminary reliability study results were presented by Dr. Ilona Csizmadi at the 8th International Conference on Diet and Activity Methods, Rome, Italy, May 14–17, 2012.

Conflict of interest: none declared.

REFERENCES

  • 1.Warren JM, Ekelund U, Besson H, et al. Assessment of physical activity—a review of methodologies with reference to epidemiological research: a report of the exercise physiology section of the European Association of Cardiovascular Prevention and Rehabilitation. Eur J Cardiovasc Prev Rehabil. 2010;17(2):127–139. doi: 10.1097/HJR.0b013e32832ed875. [DOI] [PubMed] [Google Scholar]
  • 2.Atkin AJ, Gorely T, Clemes SA, et al. Methods of measurement in epidemiology: sedentary behaviour. Int J Epidemiol. 2012;41(5):1460–1471. doi: 10.1093/ije/dys118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Neilson HK, Robson PJ, Friedenreich CM, et al. Estimating activity energy expenditure: how valid are physical activity questionnaires? Am J Clin Nutr. 2008;87(2):279–291. doi: 10.1093/ajcn/87.2.279. [DOI] [PubMed] [Google Scholar]
  • 4.Troiano RP, Pettee Gabriel KK, Welk GJ, et al. Reported physical activity and sedentary behavior: why do you ask? J Phys Act Health. 2012;9(suppl 1):S68–S75. doi: 10.1123/jpah.9.s1.s68. [DOI] [PubMed] [Google Scholar]
  • 5.Sesso HD. Invited commentary: a challenge for physical activity epidemiology. Am J Epidemiol. 2007;165(12):1351–1353. doi: 10.1093/aje/kwm093. [DOI] [PubMed] [Google Scholar]
  • 6.Neilson HK, Ullman R, Robson PJ, et al. Cognitive testing of the STAR-Q: insights in activity and sedentary time reporting. J Phys Act Health. 2013;10(3):379–389. doi: 10.1123/jpah.10.3.379. [DOI] [PubMed] [Google Scholar]
  • 7.Bryant H, Robson PJ, Ullman R, et al. Population-based cohort development in Alberta, Canada: a feasibility study. Chronic Dis Can. 2006;27(2):51–59. [PubMed] [Google Scholar]
  • 8.Ainsworth BE, Haskell WL, Leon AS, et al. Compendium of Physical Activities: classification of energy costs of human physical activities. Med Sci Sports Exerc. 1993;25(1):71–80. doi: 10.1249/00005768-199301000-00011. [DOI] [PubMed] [Google Scholar]
  • 9.Ainsworth BE, Haskell WL, Whitt MC, et al. Compendium of Physical Activities: an update of activity codes and MET intensities. Med Sci Sports Exerc. 2000;32(9 suppl):S498–S504. doi: 10.1097/00005768-200009001-00009. [DOI] [PubMed] [Google Scholar]
  • 10.Westerterp KR. Physical activity as determinant of daily energy expenditure. Physiol Behav. 2008;93(4-5):1039–1043. doi: 10.1016/j.physbeh.2008.01.021. [DOI] [PubMed] [Google Scholar]
  • 11.Ainsworth BE, Haskell WL, Herrmann SD, et al. 2011 Compendium of Physical Activities: a second update of codes and MET values. Med Sci Sports Exerc. 2011;43(8):1575–1581. doi: 10.1249/MSS.0b013e31821ece12. [DOI] [PubMed] [Google Scholar]
  • 12.Conway JM, Seale JL, Jacobs DR, Jr, et al. Comparison of energy expenditure estimates from doubly labeled water, a physical activity questionnaire, and physical activity records. Am J Clin Nutr. 2002;75(3):519–525. doi: 10.1093/ajcn/75.3.519. [DOI] [PubMed] [Google Scholar]
  • 13.Douketis JD. Body weight classification. CMAJ. 2005;172(10):1274–1275. doi: 10.1503/cmaj.1050005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Schofield WN. Predicting basal metabolic rate, new standards and review of previous work. Hum Nutr Clin Nutr. 1985;39(suppl 1):5–41. [PubMed] [Google Scholar]
  • 15.Csizmadi I, Kahle L, Ullman R, et al. Adaptation and evaluation of the National Cancer Institute's Diet History Questionnaire and nutrient database for Canadian populations. Public Health Nutr. 2007;10(1):88–96. doi: 10.1017/S1368980007184287. [DOI] [PubMed] [Google Scholar]
  • 16.Racette SB, Schoeller DA, Luke AH, et al. Relative dilution spaces of 2H- and 18O-labeled water in humans. Am J Physiol. 1994;267(4):E585–E590. doi: 10.1152/ajpendo.1994.267.4.E585. [DOI] [PubMed] [Google Scholar]
  • 17.McGraw KO, Wong SP. Forming inferences about some intraclass correlation coefficients. Psychol Methods. 1996;1(1):30–46. [Google Scholar]
  • 18.Carrasco JL, Jover L. Estimating the generalized concordance correlation coefficient through variance components. Biometrics. 2003;59(4):849–858. doi: 10.1111/j.0006-341x.2003.00099.x. [DOI] [PubMed] [Google Scholar]
  • 19.Ferrari P, Friedenreich C, Matthews CE. The role of measurement error in estimating levels of physical activity. Am J Epidemiol. 2007;166(7):832–840. doi: 10.1093/aje/kwm148. [DOI] [PubMed] [Google Scholar]
  • 20.Neuhouser ML, Di C, Tinker LF, et al. Physical activity assessment: biomarkers and self-report of activity-related energy expenditure in the WHI. Am J Epidemiol. 2013;177(6):576–585. doi: 10.1093/aje/kws269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Tooze JA, Troiano RP, Carroll RJ, et al. A measurement error model for physical activity level as measured by a questionnaire with application to the 1999–2006 NHANES questionnaire. Am J Epidemiol. 2013;177(11):1199–1208. doi: 10.1093/aje/kws379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Newman MA, Pettee KK, Storti KL, et al. Monthly variation in physical activity levels in postmenopausal women. Med Sci Sports Exerc. 2009;41(2):322–327. doi: 10.1249/MSS.0b013e3181864c05. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Matthews CE, Freedson PS, Hebert JR, et al. Seasonal variation in household, occupational, and leisure time physical activity: longitudinal analyses from the Seasonal Variation of Blood Cholesterol Study. Am J Epidemiol. 2001;153(2):172–183. doi: 10.1093/aje/153.2.172. [DOI] [PubMed] [Google Scholar]
  • 24.Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–310. [PubMed] [Google Scholar]
  • 25.Bland JM, Altman DG. Measuring agreement in method comparison studies. Stat Methods Med Res. 1999;8(2):135–160. doi: 10.1177/096228029900800204. [DOI] [PubMed] [Google Scholar]
  • 26.Bonett DG. Sample size requirements for estimating intraclass correlations with desired precision. Stat Med. 2002;21(9):1331–1335. doi: 10.1002/sim.1108. [DOI] [PubMed] [Google Scholar]
  • 27.SAS Institute, Inc. SAS/STAT Software: SAS System for PC, Version 9.2. Cary, NC: SAS Institute, Inc.; 2008. [Google Scholar]
  • 28.R Core Team. R: A Language and Environment for Statistical Computing, Version 2.12. Vienna, Austria: R Foundation for Statistical Computing; 2012. [Google Scholar]
  • 29.Food and Nutrition Board, Institute of Medicine. Dietary Reference Intakes for Energy, Carbohydrate, Fiber, Fat, Fatty Acids, Cholesterol, Protein, and Amino Acids (Macronutrients) Washington, DC: National Academies Press; 2002. [DOI] [PubMed] [Google Scholar]
  • 30.Besson H, Brage S, Jakes RW, et al. Estimating physical activity energy expenditure, sedentary time, and physical activity intensity by self-report in adults. Am J Clin Nutr. 2010;91(1):106–114. doi: 10.3945/ajcn.2009.28432. [DOI] [PubMed] [Google Scholar]
  • 31.Helmerhorst HJ, Brage S, Warren J, et al. A systematic review of reliability and objective criterion-related validity of physical activity questionnaires. Int J Behav Nutr Phys Act. 2012;9:103. doi: 10.1186/1479-5868-9-103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Mâsse LC, Fulton JE, Watson KB, et al. Comparing the validity of 2 physical activity questionnaire formats in African-American and Hispanic women. J Phys Act Health. 2012;9(2):237–248. doi: 10.1123/jpah.9.2.237. [DOI] [PubMed] [Google Scholar]
  • 33.Mâsse LC, Fulton JE, Watson KL, et al. Influence of body composition on physical activity validation studies using doubly labeled water. J Appl Physiol. 2004;96(4):1357–1364. doi: 10.1152/japplphysiol.00901.2003. [DOI] [PubMed] [Google Scholar]
  • 34.Feinglass J, Lee J, Semanik P, et al. The effects of daily weather on accelerometer-measured physical activity. J Phys Act Health. 2011;8(7):934–943. doi: 10.1123/jpah.8.7.934. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.de Vet HC, Terwee CB, Knol DL, et al. When to use agreement versus reliability measures. J Clin Epidemiol. 2006;59(10):1033–1039. doi: 10.1016/j.jclinepi.2005.10.015. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Web Material

Articles from American Journal of Epidemiology are provided here courtesy of Oxford University Press

RESOURCES