Classifiers for Accelerometer-Measured Behaviors in Older Women

Dori Rosenberg; Suneeta Godbole; Katherine Ellis; Chongzhi Di; Andrea Z LaCroix; Loki Natarajan; Jacqueline Kerr

doi:10.1249/MSS.0000000000001121

. Author manuscript; available in PMC: 2018 Mar 1.

Published in final edited form as: Med Sci Sports Exerc. 2017 Mar;49(3):610–616. doi: 10.1249/MSS.0000000000001121

Classifiers for Accelerometer-Measured Behaviors in Older Women

Dori Rosenberg ¹, Suneeta Godbole ², Katherine Ellis ², Chongzhi Di ³, Andrea Z LaCroix ², Loki Natarajan ², Jacqueline Kerr ²

PMCID: PMC5325142 NIHMSID: NIHMS821435 PMID: 28222058

Abstract

Purpose

Machine learning methods could better improve detection of specific types of physical activities and sedentary behaviors from accelerometer data. No studies in older populations have developed and tested algorithms for walking and sedentary time in free-living daily life. Our goal was to rectify this gap by leveraging access to data from two studies in older women.

Methods

In study 1, algorithms were developed and tested in a sample of older women (N = 39; age range = 55–96) in the field. Women wore accelerometers and SenseCam (ground truth annotation) devices for 7 days yielding 3,191 hours and 320 days of data. Images were annotated and time matched to accelerometer data and random forest classifiers labeled behaviors (sitting, riding in a vehicle, standing still, standing moving, walking/running). In study 2, we examined the concurrent validity of the algorithms using accelerometer data from an observed 400 meter walk test (2983 minutes of data available) and 6 days of wearing both accelerometers and global positioning systems (GPS) devices in a sample of 222 women (age range = 67–100; 313,290 minutes of data available). Analyses included sensitivity, specificity balanced accuracy, and precision, as appropriate, averaged over each test participant at the minute level for each behavior.

Results

In study 1, the algorithms had 82.2% balanced accuracy. In study 2, the classifier had 87.9% accuracy for predicting walking. Overall machine learning classifiers and GPS had 88.6% agreement.

Conclusions

Free-living algorithms for walking and sedentary time yielded high levels of accuracy and concurrent validity and can be applied to existing accelerometer data from older women.

Keywords: machine learning, older adults, sitting, walking, physical activity, sedentary time

Introduction

Physical activity promotes emotional, cognitive, functional, and physical health in older adults (21). Current estimates, however, suggest that few older adults meet physical activity guidelines, particularly when assessed by accelerometer cutpoints; estimates are fewer than 5% (30). Rates are higher when using self-reported metrics. For example, in the Women’s Health Study, 67% of women reported meeting physical activity guidelines by questionnaires while 13.4% were classified as meeting guidelines using the most commonly applied accelerometer cutpoint: 1952 counts per minute (27).

Accelerometer cutpoints allow acceleration data to be translated into activity intensity categories (8). This approach mapped well to current physical activity guidelines that state activities must be performed at a moderate or vigorous intensity. However, the most commonly used absolute cutpoints, developed on young adults, have not worked well in older adult samples who engage in activities at a relatively lower level of intensity. Use of absolute cutpoints results in common activities, such as walking, being misclassified as below the threshold for moderate intensity (8).

The exclusive focus on activity intensity can be problematic, however, as the public may not understand this concept and behaviorally specific goals may be easier to communicate than intensity-based ones (9). Understanding how specific patterns of behaviors like walking relate to health outcomes could lead the field to more useful guidelines that older adults can realistically attain.

Computational techniques are now being applied to accelerometer data to develop classifiers that can distinguish time spent in actual behaviors, such as driving, walking, lifting weights, and sitting (10). If valid, these classifiers could be applied to existing longitudinal studies that include accelerometer assessments and well documented health outcomes. For example, several large epidemiologic studies such as the Women’s Health Initiative, Nurses’ Health Study, Reasons for Geographic and Racial Differences in Stroke, and the Adult Changes in Thought studies are gathering substantial amounts of older adult accelerometer data (15, 20, 23).

Most studies using new computational techniques train and test the algorithms on different participants from the same sample, and study behaviors in a laboratory setting or with participants following a fixed protocol in more naturalistic settings (28, 29). Only two algorithms have been developed for older adults based on laboratory protocols (13, 25). More recent studies, however, demonstrate that laboratory-based algorithms do not perform as well when applied to free-living, data (1). Even algorithms from protocolized training data in naturalistic settings do not predict behaviors as accurately as totally free-living participants going about their normal behaviors across multiple days and hours (18). New algorithms trained in such totally free-living settings in adults are promising and can include important free-living behaviors that are difficult to conduct in laboratory settings, such as driving and bicycling (18). However, they have not yet been developed specifically for older adults, and have not yet been validated in a completely independent sample of participants outside of the algorithm testing phase (7). Previous studies suggest up to an 8% difference in accuracy for training data sets that vary by age and gender. For researchers to be confident that they can apply such new algorithms to their free-living older adult cohort data, further validation efforts are required.

The purpose of our study was to develop and test a new computational algorithm to classify walking and sedentary time, including in a vehicle, in older adults. The algorithm was developed on data collected across multiple free-living days and validated in a completely independent cohort of older adults that were not involved in the algorithm development phase. We leveraged a unique opportunity in which older adults, aged 65–100, a quarter using walking aids, in a physical activity intervention trial completed an observed 400 meter walk test while wearing accelerometers, providing a ground truth for comparison. Participants then wore an accelerometer and global positioning systems (GPS) devices for 6 days, providing further opportunity for investigating the algorithm’s concurrent validity against free-living GPS-defined behaviors. The current work focused on older women in order to identify and validate an algorithm that could be applied to a large existing cohort of older women from the Women’s Health Initiative (23).

Methods

Both studies obtained ethics approval from the University of California, San Diego institutional review board. Participants completed written informed consent for both studies.

Study 1: Algorithm Development and Testing

Participants & procedures

A convenience sample of 39 older women were recruited to wear an Actigraph GT3X+ accelerometer (ActiGraph, Pensacola, FL) on a belt over the right hip and a body-worn camera (the SenseCam - Vicon Revue) on a lanyard around their neck during waking hours over 7 days. They were asked to continue their normal activities, but participants were trained in institutional review board–approved procedures to ensure privacy and confidentiality for themselves and others while the camera was being worn, such as turning the camera off or turning it over when needing privacy and only wearing the camera in public setting or with permission from others. The women were recruited to provide a diverse age range (56–94 years), variability in self-reported functioning and physical activity levels, and a range of body mass index (19.74–45.62). All participants were ambulatory, able to provide informed consent, and complete surveys. Participants received and returned the devices in person at UCSD. They received wear time instructions to improve compliance and at the end were given the opportunity to delete any images they did not want included in the dataset.

Ground truth annotation

The SenseCam camera, which captured first-person images approximately every 20 seconds, allowed researchers to capture ground truth information about participant behavior. SenseCam image data were downloaded and imported into the Clarity SenseCam browser, and researchers annotated the SenseCam images with ground truth behavior labels (6). A standardized annotation protocol was developed, and at least 80% agreement for each posture with a standardized day was established. More details on SenseCam image annotation can be found elsewhere (17), and the complete annotation protocol is available from the authors upon request. The SenseCam annotation protocol assigns mutually exclusive posture labels to each image: sitting, riding a vehicle, standing still with no movement), standing moving i.e. walking within a confined space for example walking around in the kitchen, walking/running i.e. making progress to a distant point. Riding in a vehicle is separated out from other sitting because the accelerometer measurements differ in this context due to the vibration of the vehicle and the acceleration from driving. If a minute of data falls within a time window bound by images with identical activity codes, that activity label is applied to the minute. If a minute spans images with changing activity codes, no label is applied to the minute and it is not used for training the classifier.

Behavior Classification Algorithm

We employed a behavior classification system that uses machine learning (ML) algorithms to predict 5 behaviors - sitting, riding a vehicle, standing still, standing moving, walking/running- from raw triaxial accelerometer data. We have developed and tested this system in three other data sets (7, 18). The classifier was re-trained on the current data set of older women. Our system predicts a behavior label for each minute of accelerometer data. A 1-minute window was chosen because we believe it is a sufficiently detailed interval by which to represent public health relevant behaviors on a daily level. The behavior classification process is composed of three steps: feature extraction, minute-level classification, and time smoothing. A detailed description of these three steps can be found in our previous publications (7, 18). A short summary is provided here.

Feature extraction

The raw (unfiltered) triaxial accelerometer data was split into 1-minute windows. For each 1-minute window, 41 descriptive features were calculated. For each sample in a data window, the vector magnitude (VM) of the acceleration signal was calculated, i.e., v = (x² + y² + z²)^1/2. The following basic statistical descriptors of the VM were calculated over the data window: mean; SD (sd); coefficient of variation (coefvariation); minimum (min); maximum (max); and 25th, 50th, and 75th percentile (25thp, median, 75thp). The 1-s lag autocorrelation (autocorr) of the VM and the correlation between each axis were computed (corrxy, corrxz, corryz). For each sample in the window, the roll, pitch, and yaw angles of the direction of acceleration were computed, as roll = tan⁻¹(y, z), pitch = tan⁻¹ (x, z), and yaw = tan⁻¹ (y, x). The average (avgroll, avgpitch, avgyaw) and SD (sdroll, sdpitch, sdyaw) of these angles were computed over the window. A low-pass filter with a cutoff frequency of 0.5 Hz (preliminary experiments tested a few cutoff frequencies and found 0.5 Hz to perform best) was applied to the data window to estimate the average direction of gravity, and the roll, pitch, and yaw angles of this direction were computed (rollg, pitchg, yawg) (14). The fast Fourier transform was applied to the VM to decompose the time domain signal to its frequency components. The resulting power spectrum describes the contribution of a given frequency to the measured acceleration signal. The dominant frequency of the signal (fmax), i.e., the frequency with the highest power, and corresponding maximal power (pmax) were computed from the power spectrum. A similar calculation was done between the frequency bands of 0.3 and 3 Hz (fmaxband, pmaxband). The entropy of the frequency domain signal was computed. Finally, the power in each frequency band between 1 and 15 Hz (fft1–fft15) was computed.

Minute-level classification

Next, each feature vector was input into a random forest classifier. A random forest classifier is a commonly used ML algorithm made up of an ensemble of randomized decision trees, each of which is learned from a random sample of training data and a random sample of features. The decision tree outputs a probability of each behavior label for each feature vector. Test minutes are classified by averaging the output probabilities from each decision tree in the forest.

Time smoothing

After applying the random forest, a minute-by-minute sequence of probabilities of each behavior label results. These probabilities were smoothed over time using a hidden Markov model (HMM). The HMM uses the training data to learn the probability of transitions between behaviors i.e., it can learn that it is more common to transition from sitting to standing than sitting directly to walking. The HMM was used to choose the most likely sequence of behaviors from the sequence of probabilities output by the random forest classifier.

Evaluation

We evaluated the performance of our behavior classification algorithms using leave-one-participant out cross-validation. This means each participant was used as the test subject in turn, using the remaining participants to train the classification algorithm. Sensitivity, specificity and balanced accuracy (the mean of sensitivity and specificity) were averaged over each test participant at the minute level for each behavior (sitting, riding a vehicle, standing still, standing moving, walking/running).

Traditional accelerometer count processing

For comparison with the machine learned outputs, we processed the accelerometer data in Actilife 6. Median counts for each machine learned behavior were also shown to provide an estimate of intensity, although counts were not a feature of the algorithm.

Study 2: Validation in a new cohort

Two types of validation were investigated to establish that the algorithms developed in one cohort could be applied to another without loss of performance, demonstrating generalizability and validity. First, the algorithm performance was tested against a gold standard observation. Participants completed a timed 400 meter walk and the start and end times were recorded. During this time it was known that the participants were walking, though they were allowed to stop and rest as needed during the task and before and after. Stops were noted in the protocol. Second, the behavioral predictions from the algorithm were compared to GPS predictions to provide concurrent validity. The GPS predictions included walking, stationary, and vehicle time.

Participants & procedures

Data were from a sample of 222 older women (aged 67–100 years) living in 11 retirement communities and participating in a randomized control trial comparing a physical activity to a healthy aging comparison group were employed for the validation phase (19). None of the women were included in the algorithm development phase. All participants were ambulatory but not at high-risk for falling and able to provide informed consent. Women wore an Actigraph GT3X+ accelerometer (ActiGraph, Pensacola, FL) on a belt over the right hip and a Qstarz BT1000X Global Positioning System (GPS) data logger during waking hours over 6 days.

Participants completed a timed 400 meter walk test (26) using standard procedures as part of a physical functioning test battery. They were instructed to wear comfortable walking shoes to do the task. The course was set up indoors at each facility. All courses were flat but had various surfaces (some carpeted, some wood flooring). Participants were instructed to walk the course as quickly as possible while remaining safe and were allowed to have standing breaks to rest if needed throughout. The test was ended if the participant needed to sit down or more than 15 minutes were needed to complete the test. Participants wore the accelerometer device during the walk and observers recorded the time the test started and ended. Data from the baseline, 6 month and 12 month measurement tests were combined and included in the current analyses to increase the number of walking minutes to be predicted per participant.

Data processing

Accelerometer data during 400 meter walk

Accelerometer data was truncated to the time within the recorded start and stop of the 400 meter walk test using the sqldf package in R (11). The initial and last minute of the 400 meter walk was removed before analysis to eliminate partial minutes where the walk was initiated and terminated. The behavioral categories of walking to a distant point and standing moving within a confined space were combined.

Accelerometer data in comparison with GPS-defined vehicle travel and walking

GPS and accelerometer data were merged at the minute level in the validated Personal Activity Location Measurements System (PALMS) (3, 4). PALMS employs the 90^th percentile of speed during a trip, percent of time indoors during a trip, and percent of time in a single location during a trip to predict walking, riding in a vehicle, and stationary time. Previous studies have shown this system to have 85% accuracy (3, 4). Stationary time represents any behavior without movement in space, i.e. less than 25 meters distance in a minute. Only outdoor minutes of GPS were used because GPS detection of activities can be hindered by poor signal strength indoors. About 10.7% of outdoor time while wearing the GPS was spent walking, 22.5% riding in a vehicle, and 66.8% stationary. The ML behavior classifier described above was used to categorize each minute of accelerometer data as sitting, riding in a vehicle, standing, standing moving, or walking.

Analyses

The analyses assessed the concurrent validity of the ML classifier using the two sources of data available; observed 400 meter walk test and free-living concurrent GPS data. First, we examined the minute-level sensitivity of the walking algorithm using accelerometer data from the observed timed 400 meter walk test. Non-walking behaviors were not noted so specificity metrics were not available. We then employed generalized estimating equations (GEE) to examine predictors of achieving high (80% or higher) or low (< 80%) sensitivity, using the “geepack” library in R (14). To explore potential reasons for high or low algorithm sensitivity, several predictors were examined based on prior work, which has shown that older adults sometimes have slow gait speed or other abnormalities in their mobility that could impact accelerometer signals (24). We explored the effect of age, which was self-reported at baseline. Furthermore we examined several time-varying predictors measured at each time point: gait speed (calculated from the 400 meter walk test), observer annotated use of a walking aid during the 400 meter walk, short physical performance battery (SPPB) score (12), and fear of falling (Falls Efficacy Scale) (16). We used an exchangeable working correlation structure to account for participant clustering and robust standard errors to provide valid statistical inference even if the working correlation might not hold. The predictors were age, gait speed, number of stops during the 400m walk, use of a walking aid, SPPB and fear of falling.

Second, we examined the concurrent validity of the algorithm for detecting walking, vehicle time, standing moving, standing still, and sitting by time-merging the machine learned activity predictions to the GPS- based travel mode assignments. Since the machine learned classification and GPS travel mode had different classes, we combine the minutes in the classes of standing moving, standing still and sitting and compared it to the stationary GPS class. Two ratios were calculated to assess agreement. First we examined the number of matching class minutes to the total minutes of the class by GPS, which is similar to the recall metric when defined the GPS classes as the standard for comparison and second we examined the number of matching class minutes to the total minutes of the class by machine learning, which is similar to precision. All analyses were conducted using the R statistical package (22).

Results

Phase 1: Algorithm development and testing

Participants providing data included 39 older women (see Table 1). Table 2 shows the confusion matrix for the predicted minutes and known annotated behaviors. The most prevalent behavior was sitting, followed by riding in a vehicle and walking. Sitting behaviors were accurately predicted 89% of the time with misclassification as standing still occurring 7% of the time. Riding in a vehicle was accurately predicted 84% of the time with 6% of minutes being misclassified as sitting and 5% as standing moving. Walking had lower accuracy with 70% accurately being predicted and 24% being misclassified as standing moving. Standing still and standing moving had lower accuracy.

Table 1.

Demographic and health characteristics of study samples

	Study 1 algorithm development & testing	Study 2 observed 400 meter walk sample	Study 2 GPS sample
N	39	195	219
Age, mean, range	69.4, 56–94	83.6, 67–100	83.8, 67–100
White, %	79.5	91.3	91.3
Use of walking aid, %	12.8	25.6	18.3

Open in a new tab

Table 2.

Minute level confusion matrix of predicted and annotated minutes

	Number of minutes of SenseCam Annotated Activity (percent accuracy):
ML Predicted Activity:	Sitting	Riding in vehicle	Walking	Standing still	Standing moving
Sitting	83111 (89)	891 (6)	126 (2)	2072 (20)	671 (6)
Riding in vehicle	1148 (1)	11673 (84)	103 (1)	273 (3)	282 (2)
Walking	224 (0)	323 (2)	4994 (70)	458 (5)	1874 (16)
Standing still	6995 (7)	296 (2)	161 (2)	4104 (40)	1380 (12)
Standing moving	1889 (2)	755 (5)	1711 (24)	3238 (32)	7707 (66)

Open in a new tab

Table 3 demonstrates the sensitivity, specificity and balanced accuracy of the algorithm for the 5 behaviors tested against the annotated SenseCam images, our ground truth. Overall, the algorithm performed with 82.2% average balanced accuracy, using the leave-one-participant-out cross-validation. The median counts provided for comparison indicate that sitting, standing and walking in this population occur at lower intensities than would be detected by existing thresholds of <100 for sedentary behavior and >1951 for moderate to vigorous physical activity. Sitting in a vehicle recorded higher intensities than the sedentary behavior cut off. The accuracy levels achieved by the algorithm were comparable to algorithms developed in laboratory studies (13, 25). Given that this algorithm was developed on free-living data and laboratory studies applied to free-living data lose over 10% accuracy, we believed further validation in an independent cohort (Study 2) was warranted.

Table 3.

Percent Accuracy of Classifiers for sedentary behaviors and physical activity using observed annotations of person worn camera images

	Sensitivity	Specificity	Balanced Accuracy	Median counts (IQR)^*
Machine Learned
Sitting	89	91	90	0 (0–17)
Sitting in vehicle	84	99	91	72 (21 – 177)
Walking	70	98	84	597 (231 – 1210)
Standing moving	66	94	79	268 (97 – 562)
Standing still	40	93	67	56 (3 – 252)

Open in a new tab

counts were not employed as a feature in the algorithm but are provided here as count data are commonly reported in traditional accelerometer studies as a metric of intensity. This demonstrates that behaviors are occurring at lower intensities than would be identified by traditional cut points (<100 for sedentary behavior; 1952 for moderate vigorous activity).

Phase 2: Algorithm validation in new cohort

Validation of ML Walking Algorithm

Participants providing data during the 400 meter walk included 195 women who completed the test (see Table 1). At total of 90% of participants had 1 stop (range 1–4). The minute level sample available for validation of the walking algorithm included 2983 minutes of 400 meter walk test data. Accelerometer counts per minute (CPM) during the 400 meter walk varied from 0 to 5264 CPM with a median value of 1591 CPM. This suggests that the commonly used 1952 cutpoint for moderate to vigorous activity would not have captured a substantial portion of walking that was performed at the older women’s fastest safe pace. Overall, during the 400 meter walk the combined walking and standing moving classifier performed with an overall mean sensitivity of 87.9%. During the 400 meter walk, the algorithm misclassified 9.2% of the test minutes as sitting, 1.2% as vehicle, and 1.2% as standing still. None of the included variables in the GEE analyses significantly predicted the algorithm sensitivity (table 4). This suggests that the algorithm is robust across age, functioning, falls risk and walking speed.

Table 4.

Age and functioning predictors of algorithm performance (<80%) during the 400m walk using generalized estimating equations

	beta-coeffient	Standard Error	p-value
Age	−0.0232	0.0200	0.25
Gait Speed during 400m walk	−0.00437	0.44902	0.99
Number of stops during 400m walk	0.1235	0.2758	0.654
Use of a walking aid	0.288	0.354	0.42
Fear of Falling	0.00381	0.02007	0.8495
SPPB Overall Score	−0.09403	0.05244	0.073

Open in a new tab

Concurrent Validity of ML Algorithms with GPS

A total of 219 women wore the accelerometer and GPS devices for 6 free-living days (Mean age = 83.8, age range = 67 to 100, 91.3% white, 18.3% self-reported using a walking aid, 313,290 minutes of data available). Concurrent validity for behaviors during the 6 days of accelerometer and GPS wear are shown in Table 5. The overall agreement for the two methods was 88.6%. Precision (PPV) and recall (sensitivity) for walking was 68.1% and 85.5%. For all stationary time [sitting, standing moving, standing still], precision and recall were 90.6% and 93.7%; for vehicle time precision and recall were 83.4%, 85.1%, respectively

Table 5.

Minutes in each category and percent agreement between GPS and machine learned accelerometer classifier

	GPS PALMS classification:
Accelerometer ML classification:	Pedestrian	Stationary	Vehicle
Sitting	725 (2.2)	126050 (58.6)	3153 (4.9)
Standing still	249 (0.8)	17446 (8.1)	1304 (2.0)
Standing moving	3049(9.2)	51595 (24.0)	4658 (7.2)
Riding in a vehicle	891 (2.7)	8616 (4.0)	54143 (83.4)
Walking	28194 (85.5)	11553 (5.4)	1664 (2.6)

Open in a new tab

Discussion

We developed a new classifier to predict 5 important health related behaviors in free-living older women and demonstrated high performance of the algorithm (82.2%). While our classifier accuracy is comparable to other algorithms developed in the laboratory with older adults (13, 25), it could have been affected by several factors. Having less available walking data decreases accuracy by reducing the classifier’s ability to generalize to walking patterns it has not seen before. Standing moving can include portions of walking, which can confuse the classifier.

We found excellent levels of sensitivity for our classifier in regards to identification of walking behaviors during a 400 meter walk field test (87.9%). This is the first time that machine learned algorithms for physical activity and sitting, developed in a completely separate training sample, have been applied to a large, independent and truly free-living validation sample. The sensitivity of the algorithm was not dependent on age, walking aid, falls risk, or physical functioning. This means that the algorithm can be applied in populations of women that vary in age, physical function, and gait speed.

In addition, the classifier had excellent concurrent validity with GPS data (88.6%). Our ability to accurately detect time spent in a vehicle is an advancement over the use of accelerometer intensity cutpoints which misclassify time spent in a vehicle as light-intensity about one-third of the time (7, 18). Little is known about the health effects of vehicle time in aging-related health outcomes. Driving or riding in a vehicle could promote increased lifespace and ability to engage in meaningful activities. Driving cessation is associated with depression and poor health outcomes (5). However, it could also substitute for time spent in more active pursuits and could negatively impact health.

The classifiers developed and validated here could now be applied to large samples of existing accelerometer data in older women in which there is rich data on health outcomes. We can then better understand whether intensity is more important or whether total walking could be as associated with health outcomes irrespective of intensity. With the recent Surgeon General’s Call to Action to Promote Walking, we can use accelerometers to determine whether there are improvements in walking behaviors due to public health interventions such as sidewalk installations. Our previous studies have indicated that the training sample and type of training data are important predictors of algorithm performance (18). We, therefore, encourage use of algorithms that are appropriately matched to testing and validation samples at this stage. Future work may allow development of an algorithm that is robust across genders, ages, and body types.

Limitations of our study include that we only had gold standard observational data available for walking and not for other important behaviors for which we have developed algorithms including driving, sitting, standing, running and cycling. Since participants could stop during the walk not all time may have been walking, however, we saw no effect of number of stops on the algorithm performance. In the future we plan to compare machine learned accelerometer algorithms for sitting and standing in older adults to the field gold standard for posture (activPAL) measures. Ongoing work with the activPAL as a ground truth will likely improve our estimates of standing still. The features of roll-pitch and yaw angles of the direction of acceleration were approximations because gyroscope or magnetometer data were not available.

Our study strengths include the first demonstration of how new algorithms can be developed outside of a laboratory or prescriptive free-living setting and applied and externally validated in a new sample. In addition, our focus on older adults is important because they engage in physically active behaviors at a range of intensities that are often far below the most commonly used cutpoints for moderate-intensity physical activity (8). Focusing on identification of behaviors allows us to capture movements that are much more common and could still have impacts on health outcomes. Furthermore, being able to recommend that older adults increase the time they spend walking is much more under individual control than recommending a certain level of activity intensity, something which most of the public is likely unable to clearly understand.

Conclusions

We found excellent sensitivity for identifying walking behaviors using accelerometer data in older adults. Furthermore, we found high levels of concurrent validity with GPS for sedentary, vehicle, and walking time. Our algorithms are available in R [https://cran.r-project.org/web/packages/TLBC/index.html] to researchers who have interests in applications to existing epidemiologic datasets in the validated age range.

Acknowledgments

The results of the present study do not constitute endorsement by ACSM. The results of the study are presented clearly, honestly and without fabrication, falsification or inappropriate data manipulation.

Disclosure of funding: Dr. Rosenberg is supported by K23HL119352

Footnotes

Conflict of interest statement: There are no conflicts of interest

References

1.Bastian T, Maire A, Dugas J, et al. Automatic identification of physical activity types and sedentary behaviors from triaxial accelerometer: laboratory-based calibrations are not enough. Journal of applied physiology. 2015;118(6):716–722. doi: 10.1152/japplphysiol.01189.2013. [DOI] [PubMed] [Google Scholar]
2.Berrigan D, Carroll DD, Fulton JE, Galuska DA, Brown DR, Dorn JM. Vital signs: walking among adults -- United States, 2005 and 2010. MMWR Morb Mortal Wkly Rep. 2012;61:595–603. [PubMed] [Google Scholar]
3.Carey M, Markham C, Gaffney P, Boran C, Maher V. Validation of a point of care lipid analyser using a hospital based reference laboratory. Ir J Med Sci. 2006;175(4):30–35. doi: 10.1007/BF03167964. [DOI] [PubMed] [Google Scholar]
4.Carlson JA, Jankowska MM, Meseck K, et al. Validity of PALMS GPS scoring of active and passive travel compared with SenseCam. Med Sci Sports Exerc. 2015;47(3):662–667. doi: 10.1249/MSS.0000000000000446. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Chihuri S, Mielenz TJ, DiMaggio CJ, et al. Driving Cessation and Health Outcomes in Older Adults. J Am Geriatr Soc. 2016;64(2):332–341. doi: 10.1111/jgs.13931. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Doherty AR, Kelly P, Kerr J, et al. Using wearable cameras to categorise type and context of accelerometer-identified episodes of physical activity. Int J Behav Nutr Phys Act. 2013;10:22. doi: 10.1186/1479-5868-10-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Ellis K, Kerr J, Godbole S, Staudenmayer J, Lanckriet G. Hip and Wrist Accelerometer Algorithms for Free-Living Behavior Classification. Med Sci Sports Exerc. 2016;48(5):933–940. doi: 10.1249/MSS.0000000000000840. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Evenson KR, Wen F, Herring AH, et al. Calibrating physical activity intensity for hip-worn accelerometry in women age 60 to 91 years: The Women's Health Initiative OPACH Calibration Study. Preventive medicine reports. 2015;2:750–756. doi: 10.1016/j.pmedr.2015.08.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Floegel TA, Giacobbi PR, Jr, Dzierzewski JM, et al. Intervention markers of physical activity maintenance in older adults. Am J Health Behav. 2015;39(4):487–499. doi: 10.5993/AJHB.39.4.5. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Freedson PS, Lyden K, Kozey-Keadle S, Staudenmayer J. Evaluation of artificial neural network algorithms for predicting METs and activity type from accelerometer data: validation on an independent sample. J Appl Physiol. 2011;111(6):1804–1812. doi: 10.1152/japplphysiol.00309.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Grothendieck G. Perform SQL Selects on R Data Frames. 2014 Available from: https://cran.r-project.org/web/packages/sqldf/sqldf.pdf.
12.Guralnik JM, Ferrucci L, Pieper CF, et al. Lower extremity function and subsequent disability: consistency across studies, predictive models, and value of gait speed alone compared with the short physical performance battery. J Gerontol A Biol Sci Med Sci. 2000;55(4):M221–M231. doi: 10.1093/gerona/55.4.m221. [DOI] [PubMed] [Google Scholar]
13.He B, Bai J, Zipunnikov VV, et al. Predicting human movement with multiple accelerometers using movelets. Med Sci Sports Exerc. 2014;46(9):1859–1866. doi: 10.1249/MSS.0000000000000285. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Hojsgaard S, Halekoh U, Yan J. Generalized Estimating Equation Package. 2016 Available from: https://cran.r-project.org/web/packages/geepack/geepack.pdf.
15.Howard VJ, Rhodes JD, Mosher A, et al. Obtaining Accelerometer Data in a National Cohort of Black and White Adults. Med Sci Sports Exerc. 2015;47(7):1531–1537. doi: 10.1249/MSS.0000000000000549. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Kempen GI, Yardley L, van Haastregt JC, et al. The Short FES-I: a shortened version of the falls efficacy scale-international to assess fear of falling. Age Ageing. 2008;37(1):45–50. doi: 10.1093/ageing/afm157. [DOI] [PubMed] [Google Scholar]
17.Kerr J, Marshall SJ, Godbole S, et al. Using the SenseCam to improve classifications of sedentary behavior in free-living settings. Am J Prev Med. 2013;44(3):290–296. doi: 10.1016/j.amepre.2012.11.004. [DOI] [PubMed] [Google Scholar]
18.Kerr J, Patterson RE, Ellis K, et al. Objective Assessment of Physical Activity: Classifiers for Public Health. Med Sci Sports Exerc. 2016;48(5):951–957. doi: 10.1249/MSS.0000000000000841. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Kerr J, Rosenberg DE, Nathan A, et al. Applying the ecological model of behavior change to a physical activity trial in retirement communities: description of the study protocol. Contemp Clin Trials. 2012;33(6):1180–1188. doi: 10.1016/j.cct.2012.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Lee IM, Shiroma EJ. Using accelerometers to measure physical activity in large-scale epidemiological studies: issues and challenges. Br J Sports Med. 2014;48(3):197–201. doi: 10.1136/bjsports-2013-093154. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Physical Activity Guidelines Committee. Washington, DC: U.S. Department of Health and Human Services; 2008. Physical activity guidelines advisory committee report, 2008; pp. 1–683. [Google Scholar]
22.R Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2013. [cited 2014]. Available from: www.R-project.org. [Google Scholar]
23.Rillamas-Sun E, Buchner DM, Di C, Evenson KR, LaCroix AZ. Development and application of an automated algorithm to identify a window of consecutive days of accelerometer wear for large-scale studies. BMC Res Notes. 2015;8:270. doi: 10.1186/s13104-015-1229-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Sandroff BM, Riskin BJ, Agiovlasitis S, Motl RW. Accelerometer cut-points derived during over-ground walking in persons with mild, moderate, and severe multiple sclerosis. J Neurol Sci. 2014;340(1–2):50–57. doi: 10.1016/j.jns.2014.02.024. [DOI] [PubMed] [Google Scholar]
25.Sasaki JE, Hickey A, Staudenmayer J, John D, Kent JA, Freedson PS. Performance of Activity Classification Algorithms in Free-living Older Adults. Med Sci Sports Exerc. 2016;48(5):941–950. doi: 10.1249/MSS.0000000000000844. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Sayers SP, Guralnik JM, Newman AB, Brach JS, Fielding RA. Concordance and discordance between two measures of lower extremity function: 400 meter self-paced walk and SPPB. Aging Clin Exp Res. 2006;18(2):100–106. doi: 10.1007/BF03327424. [DOI] [PubMed] [Google Scholar]
27.Shiroma EJ, Cook NR, Manson JE, Buring JE, Rimm EB, Lee IM. Comparison of Self-Reported and Accelerometer-Assessed Physical Activity in Older Women. PLoS One. 2015;10(12):e0145950. doi: 10.1371/journal.pone.0145950. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Staudenmayer J, He S, Hickey A, Sasaki J, Freedson P. Methods to estimate aspects of physical activity and sedentary behavior from high-frequency wrist accelerometer measurements. Journal of applied physiology. 2015;119(4):396–403. doi: 10.1152/japplphysiol.00026.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Staudenmayer J, Pober D, Crouter S, Bassett D, Freedson P. An artificial neural network to estimate physical activity energy expenditure and identify physical activity type from an accelerometer. J Appl Physiol. 2009;107(4):1300–1307. doi: 10.1152/japplphysiol.00465.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Troiano RP, Berrigan D, Dodd KW, Masse LC, Tilert T, McDowell M. Physical activity in the United States measured by accelerometer. Med Sci Sports Exerc. 2008;40(1):181–188. doi: 10.1249/mss.0b013e31815a51b3. [DOI] [PubMed] [Google Scholar]

[R1] 1.Bastian T, Maire A, Dugas J, et al. Automatic identification of physical activity types and sedentary behaviors from triaxial accelerometer: laboratory-based calibrations are not enough. Journal of applied physiology. 2015;118(6):716–722. doi: 10.1152/japplphysiol.01189.2013. [DOI] [PubMed] [Google Scholar]

[R2] 2.Berrigan D, Carroll DD, Fulton JE, Galuska DA, Brown DR, Dorn JM. Vital signs: walking among adults -- United States, 2005 and 2010. MMWR Morb Mortal Wkly Rep. 2012;61:595–603. [PubMed] [Google Scholar]

[R3] 3.Carey M, Markham C, Gaffney P, Boran C, Maher V. Validation of a point of care lipid analyser using a hospital based reference laboratory. Ir J Med Sci. 2006;175(4):30–35. doi: 10.1007/BF03167964. [DOI] [PubMed] [Google Scholar]

[R4] 4.Carlson JA, Jankowska MM, Meseck K, et al. Validity of PALMS GPS scoring of active and passive travel compared with SenseCam. Med Sci Sports Exerc. 2015;47(3):662–667. doi: 10.1249/MSS.0000000000000446. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Chihuri S, Mielenz TJ, DiMaggio CJ, et al. Driving Cessation and Health Outcomes in Older Adults. J Am Geriatr Soc. 2016;64(2):332–341. doi: 10.1111/jgs.13931. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Doherty AR, Kelly P, Kerr J, et al. Using wearable cameras to categorise type and context of accelerometer-identified episodes of physical activity. Int J Behav Nutr Phys Act. 2013;10:22. doi: 10.1186/1479-5868-10-22. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Ellis K, Kerr J, Godbole S, Staudenmayer J, Lanckriet G. Hip and Wrist Accelerometer Algorithms for Free-Living Behavior Classification. Med Sci Sports Exerc. 2016;48(5):933–940. doi: 10.1249/MSS.0000000000000840. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Evenson KR, Wen F, Herring AH, et al. Calibrating physical activity intensity for hip-worn accelerometry in women age 60 to 91 years: The Women's Health Initiative OPACH Calibration Study. Preventive medicine reports. 2015;2:750–756. doi: 10.1016/j.pmedr.2015.08.021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Floegel TA, Giacobbi PR, Jr, Dzierzewski JM, et al. Intervention markers of physical activity maintenance in older adults. Am J Health Behav. 2015;39(4):487–499. doi: 10.5993/AJHB.39.4.5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Freedson PS, Lyden K, Kozey-Keadle S, Staudenmayer J. Evaluation of artificial neural network algorithms for predicting METs and activity type from accelerometer data: validation on an independent sample. J Appl Physiol. 2011;111(6):1804–1812. doi: 10.1152/japplphysiol.00309.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Grothendieck G. Perform SQL Selects on R Data Frames. 2014 Available from: https://cran.r-project.org/web/packages/sqldf/sqldf.pdf.

[R12] 12.Guralnik JM, Ferrucci L, Pieper CF, et al. Lower extremity function and subsequent disability: consistency across studies, predictive models, and value of gait speed alone compared with the short physical performance battery. J Gerontol A Biol Sci Med Sci. 2000;55(4):M221–M231. doi: 10.1093/gerona/55.4.m221. [DOI] [PubMed] [Google Scholar]

[R13] 13.He B, Bai J, Zipunnikov VV, et al. Predicting human movement with multiple accelerometers using movelets. Med Sci Sports Exerc. 2014;46(9):1859–1866. doi: 10.1249/MSS.0000000000000285. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] 14.Hojsgaard S, Halekoh U, Yan J. Generalized Estimating Equation Package. 2016 Available from: https://cran.r-project.org/web/packages/geepack/geepack.pdf.

[R15] 15.Howard VJ, Rhodes JD, Mosher A, et al. Obtaining Accelerometer Data in a National Cohort of Black and White Adults. Med Sci Sports Exerc. 2015;47(7):1531–1537. doi: 10.1249/MSS.0000000000000549. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Kempen GI, Yardley L, van Haastregt JC, et al. The Short FES-I: a shortened version of the falls efficacy scale-international to assess fear of falling. Age Ageing. 2008;37(1):45–50. doi: 10.1093/ageing/afm157. [DOI] [PubMed] [Google Scholar]

[R17] 17.Kerr J, Marshall SJ, Godbole S, et al. Using the SenseCam to improve classifications of sedentary behavior in free-living settings. Am J Prev Med. 2013;44(3):290–296. doi: 10.1016/j.amepre.2012.11.004. [DOI] [PubMed] [Google Scholar]

[R18] 18.Kerr J, Patterson RE, Ellis K, et al. Objective Assessment of Physical Activity: Classifiers for Public Health. Med Sci Sports Exerc. 2016;48(5):951–957. doi: 10.1249/MSS.0000000000000841. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Kerr J, Rosenberg DE, Nathan A, et al. Applying the ecological model of behavior change to a physical activity trial in retirement communities: description of the study protocol. Contemp Clin Trials. 2012;33(6):1180–1188. doi: 10.1016/j.cct.2012.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Lee IM, Shiroma EJ. Using accelerometers to measure physical activity in large-scale epidemiological studies: issues and challenges. Br J Sports Med. 2014;48(3):197–201. doi: 10.1136/bjsports-2013-093154. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Physical Activity Guidelines Committee. Washington, DC: U.S. Department of Health and Human Services; 2008. Physical activity guidelines advisory committee report, 2008; pp. 1–683. [Google Scholar]

[R22] 22.R Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2013. [cited 2014]. Available from: www.R-project.org. [Google Scholar]

[R23] 23.Rillamas-Sun E, Buchner DM, Di C, Evenson KR, LaCroix AZ. Development and application of an automated algorithm to identify a window of consecutive days of accelerometer wear for large-scale studies. BMC Res Notes. 2015;8:270. doi: 10.1186/s13104-015-1229-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Sandroff BM, Riskin BJ, Agiovlasitis S, Motl RW. Accelerometer cut-points derived during over-ground walking in persons with mild, moderate, and severe multiple sclerosis. J Neurol Sci. 2014;340(1–2):50–57. doi: 10.1016/j.jns.2014.02.024. [DOI] [PubMed] [Google Scholar]

[R25] 25.Sasaki JE, Hickey A, Staudenmayer J, John D, Kent JA, Freedson PS. Performance of Activity Classification Algorithms in Free-living Older Adults. Med Sci Sports Exerc. 2016;48(5):941–950. doi: 10.1249/MSS.0000000000000844. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Sayers SP, Guralnik JM, Newman AB, Brach JS, Fielding RA. Concordance and discordance between two measures of lower extremity function: 400 meter self-paced walk and SPPB. Aging Clin Exp Res. 2006;18(2):100–106. doi: 10.1007/BF03327424. [DOI] [PubMed] [Google Scholar]

[R27] 27.Shiroma EJ, Cook NR, Manson JE, Buring JE, Rimm EB, Lee IM. Comparison of Self-Reported and Accelerometer-Assessed Physical Activity in Older Women. PLoS One. 2015;10(12):e0145950. doi: 10.1371/journal.pone.0145950. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Staudenmayer J, He S, Hickey A, Sasaki J, Freedson P. Methods to estimate aspects of physical activity and sedentary behavior from high-frequency wrist accelerometer measurements. Journal of applied physiology. 2015;119(4):396–403. doi: 10.1152/japplphysiol.00026.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] 29.Staudenmayer J, Pober D, Crouter S, Bassett D, Freedson P. An artificial neural network to estimate physical activity energy expenditure and identify physical activity type from an accelerometer. J Appl Physiol. 2009;107(4):1300–1307. doi: 10.1152/japplphysiol.00465.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] 30.Troiano RP, Berrigan D, Dodd KW, Masse LC, Tilert T, McDowell M. Physical activity in the United States measured by accelerometer. Med Sci Sports Exerc. 2008;40(1):181–188. doi: 10.1249/mss.0b013e31815a51b3. [DOI] [PubMed] [Google Scholar]

PERMALINK

Classifiers for Accelerometer-Measured Behaviors in Older Women

Dori Rosenberg

Suneeta Godbole

Katherine Ellis

Chongzhi Di

Andrea Z LaCroix

Loki Natarajan

Jacqueline Kerr

Abstract

Purpose

Methods

Results

Conclusions

Introduction

Methods

Study 1: Algorithm Development and Testing

Participants & procedures

Ground truth annotation

Behavior Classification Algorithm

Feature extraction

Minute-level classification

Time smoothing

Evaluation

Traditional accelerometer count processing

Study 2: Validation in a new cohort

Participants & procedures

Data processing

Accelerometer data during 400 meter walk

Accelerometer data in comparison with GPS-defined vehicle travel and walking

Analyses

Results

Phase 1: Algorithm development and testing

Table 1.

Table 2.

Table 3.

Phase 2: Algorithm validation in new cohort

Validation of ML Walking Algorithm

Table 4.

Concurrent Validity of ML Algorithms with GPS

Table 5.

Discussion

Conclusions

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases