Systematic review of accelerometer-based methods for 24-h physical behavior assessment in young children (0–5 years old)

Annelinde Lettink; Teatske M Altenburg; Jelle Arts; Vincent T van Hees; Mai J M Chinapaw

doi:10.1186/s12966-022-01296-y

. 2022 Sep 8;19:116. doi: 10.1186/s12966-022-01296-y

Systematic review of accelerometer-based methods for 24-h physical behavior assessment in young children (0–5 years old)

Annelinde Lettink ^1,^2,^3,^✉, Teatske M Altenburg ^1,^2,³, Jelle Arts ^1,³, Vincent T van Hees ^1,⁴, Mai J M Chinapaw ^1,^2,³

PMCID: PMC9461103 PMID: 36076221

Abstract

Background

Accurate accelerometer-based methods are required for assessment of 24-h physical behavior in young children. We aimed to summarize evidence on measurement properties of accelerometer-based methods for assessing 24-h physical behavior in young children.

Methods

We searched PubMed (MEDLINE) up to June 2021 for studies evaluating reliability or validity of accelerometer-based methods for assessing physical activity (PA), sedentary behavior (SB), or sleep in 0–5-year-olds. Studies using a subjective comparison measure or an accelerometer-based device that did not directly output time series data were excluded. We developed a Checklist for Assessing the Methodological Quality of studies using Accelerometer-based Methods (CAMQAM) inspired by COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN).

Results

Sixty-two studies were included, examining conventional cut-point-based methods or multi-parameter methods. For infants (0—12 months), several multi-parameter methods proved valid for classifying SB and PA. From three months of age, methods were valid for identifying sleep. In toddlers (1—3 years), cut-points appeared valid for distinguishing SB and light PA (LPA) from moderate-to-vigorous PA (MVPA). One multi-parameter method distinguished toddler specific SB. For sleep, no studies were found in toddlers. In preschoolers (3—5 years), valid hip and wrist cut-points for assessing SB, LPA, MVPA, and wrist cut-points for sleep were identified. Several multi-parameter methods proved valid for identifying SB, LPA, and MVPA, and sleep.

Despite promising results of multi-parameter methods, few models were open-source. While most studies used a single device or axis to measure physical behavior, more promising results were found when combining data derived from different sensor placements or multiple axes.

Conclusions

Up to age three, valid cut-points to assess 24-h physical behavior were lacking, while multi-parameter methods proved valid for distinguishing some waking behaviors. For preschoolers, valid cut-points and algorithms were identified for all physical behaviors. Overall, we recommend more high-quality studies evaluating 24-h accelerometer data from multiple sensor placements and axes for physical behavior assessment. Standardized protocols focusing on including well-defined physical behaviors in different settings representative for children’s developmental stage are required. Using our CAMQAM checklist may further improve methodological study quality.

PROSPERO Registration number

CRD42020184751.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12966-022-01296-y.

Keywords: 24-h physical behavior, Physical activity, Sedentary behavior, Sleep, Infants, Toddlers, Preschoolers, Accelerometer, Cut-points, Machine learning, Measurement properties, Validity, Reliability

Introduction

Accurate assessment of 24-h physical behavior in young children is crucial as it provides the basis for examining the health benefits of these behaviors and thereby evidence for establishing 24-h movement guidelines. Recent studies indicated the importance of an integrated approach to all 24-h physical behaviors for health, encompassing sleep, sedentary behavior (SB), and physical activity (PA) [1–5]. These behaviors are distributed along an intensity continuum ranging from low energy expenditure, such as sleep, to vigorous PA requiring high energy expenditure [6, 7].

Currently, a wide variety of direct measurement instruments are used to assess physical behaviors of children, such as doubly labelled water, (in)direct calorimetry, polysomnography, direct (video) observation, and accelerometry [8]. Polysomnography is considered a “gold standard” for sleep, however, this can only be applied in a laboratory setting. Doubly labelled water is considered a “gold standard” for total energy expenditure, however, it cannot distinguish frequency, type, and intensity of specific physical behaviors [9, 10]. Direct calorimetry accurately measures metabolic rate in confinement, and indirect calorimetry allows for this assessment in free-living situations, however, both methods come with relatively high costs and are also not distinctive for frequency, type, and intensity of specific physical behaviors [11]. While direct (video) observation is considered a suitable comparator measure for assessing different types of physical behaviors, it is less suitable for assessing activity intensity because this can only be derived by assigning a metabolic equivalent to represent energy cost, which is unknown for the youngest age groups (i.e., infants and toddlers) [6, 7]. In addition, direct observation is very time consuming and requires trained observers scoring a specified protocol [9]. Given these limitations, these methods might not be feasible for measuring young children’s 24-h physical behaviors in free-living situations. Accelerometers can capture data on body movement, or lack thereof, continuously over extended periods of time, and are therefore widely considered the most promising method for physical behavior assessment.

Current reviews on reliability and validity of accelerometer-based methods for measuring physical behaviors in young children were limited to evaluation of only one measurement property [12] or one physical behavior [13]. Lynch and colleagues (2019) reviewed studies that evaluated criterion validity of accelerometers against indirect calorimetry, concluding that accelerometers can accurately assess SB and PA in children 3 to 18 years old [12]. De Vries and colleagues (2006) reviewed criterion-, convergent validity, test–retest- and inter-device reliability of accelerometers. They found that accelerometers provide reasonable estimates for assessing PA, however, no evidence on reliability was found in 2- to 4-years-old children [13]. Moreover, evidence on these measurement properties for infants (0—12 months) and toddlers (1—3 years) is lacking [14, 15]. Bruijns and colleagues (2020) reviewed estimates of PA and SB derived from accelerometer data in infants and toddlers and found that PA estimates were inconclusive and largely heterogeneous [14]. Additionally, no studies under three years old were found in a review on the evidence for methodological accelerometer decisions (e.g., epoch length, wear location, data analysis approach) for assessing PA in children aged 0—5 years [15].

While accelerometer-based methods provide reasonable estimates of time spent in SB, PA, and sleep in school-aged children [12, 13, 16–19], this cannot be generalized to young children due to major differences in types and intensity of their physical behaviors [20]. Physical activity types are different for children, depending on their developmental stage, e.g., daytime naps, crawling, and being carried in the youngest age groups [15, 21]. Moreover, the intensity of activities might differ between children depending on the efficiency of motor skills. For instance, toddlers start walking around one year of age, increase locomotor (e.g., running, jumping, hopping), stability (e.g., balancing, climbing), and develop object-control skills (e.g., kicking, catching, rolling) [22]. Preschoolers (3–5 years) further develop these skills and often participate in modified sports [23]. These differences in physical behaviors and motor development require age group specific studies on the validity and reliability of measurement instruments and analysis techniques, adapted to the child’s developmental stage.

For assessment of 24-h physical behavior in young children a complete overview of measurement properties of accelerometer-based methods is unavailable and there is no consensus about the optimal measurement protocol (e.g., epoch length, wear location) and accelerometer processing (e.g., cut-points, algorithms, machine learning methods) decisions for the use of accelerometer-based methods in young children [15]. To be able to select the most appropriate method for the child’s developmental stage, an overview of current accelerometer processing and study designs, and measurement properties of the available accelerometer-based methods is warranted. Therefore, we aimed to comprehensively review all studies examining the measurement properties test–retest, inter-device reliability, criterion- and convergent validity of accelerometer-based methods assessing 24-h physical behavior in young children aged 0–5 years, including an evaluation of the quality of evidence.

Methods

We registered this review on PROSPERO (international prospective register of ongoing systematic reviews; registration number: CRD42020184751) and followed the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines [24].

Search strategy

We systematically searched the electronic database PubMed (MEDLINE) up until 26^th June 2021. The search strategy focused on terms related to young children (e.g., infant, toddler, preschooler), accelerometer-based methods (e.g., accelerometry/methods, actigraphy), and measurement properties (e.g., validity, reliability). These terms were used in AND-combination with terms related to physical behavior: SB (e.g., inactive behavior, stationary behavior, sitting), PA (e.g., motor activity, tummy time, cycling), OR sleep (e.g., nap, bedtime, night rest). Articles related to animals, a variety of disorders (e.g., autism, attention deficit disorder), and diseases were excluded using the NOT-combination. Medical Subject Heading (MeSH), title and abstract (TIAB), and free-text search terms were used, and a variety of publication types were excluded (e.g., book sections, thesis). The full search strategy can be found in Additional File 1.

Eligibility criteria

Studies were eligible for inclusion when the study: 1) used an accelerometer-based method to monitor at least one physical behavior: SB, PA, or sleep; 2) evaluated at least one measurement property of an accelerometer-based method: test–retest or inter-device reliability, criterion- or convergent validity; 3) included a (sub)sample of apparently healthy children, born term (> 37 weeks), with a mean age < 5 years or a wider range with the results for 0–5-year-olds reported separately; 4) was published in English in a peer-reviewed journal; and 5) full-text was available.

Studies were excluded when the study: 1) used a diary, parent- or proxy-report, or relied on parents for direct observation as comparison measure; 2) evaluated measurement properties in a clinical population, e.g., focused on only children with overweight or obesity; or 3) relied on accelerometer-based devices that do not directly output data on acceleration time series data or magnitude of acceleration, e.g., Fitbit.

Selection procedures

We imported articles into reference manager software (EndNote X 9.1), and subsequently removed duplicate articles. Two researchers (AL and TA) independently screened titles and abstracts for eligibility using Rayyan and subsequently screened full-text articles. For four publications, the mean age of the study population was missing. To resolve this missing information, we contacted the authors. In addition, the reference lists of all relevant full-text articles were screened for possible inclusion of additional studies. A third researcher (MC) was consulted to resolve discrepancies.

Data extraction

For all eligible studies, two researchers (AL and JA) extracted data using a structured form. Disagreements were resolved through discussion. Extracted data included the evaluated measurement properties, study and target population, accelerometer specifications (i.e., device and model, placement location and site, epoch length, axis), accelerometer data analysis approach used, outcome(s) and setting, comparison method (in case of validity), time interval (in case of test–retest reliability), and results of the evaluated measurement properties. The variety of accelerometer-based methods was described using code combinations of the following four elements: accelerometer brand, analysis approach, axis, and epoch length (see Additional File 3).

Methodological quality assessment

Two researchers (AL and either TA or MC) rated the methodological quality of the included studies independently using a newly developed checklist to assess risk of bias. Risk of bias refers to whether the results for evaluating a measurement property are trustworthy based on the methodological study quality. In case of disagreement, all three researchers discussed the rating until consensus was reached.

Checklist development

We newly developed a Checklist for Assessing the Methodological Quality of studies using Accelerometer-based Methods (CAMQAM), as there was no standardized checklist available. The CAMQAM was inspired by quality assessment of patient reported outcomes, the standardized COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) risk of bias checklist [25–27], and by a previous review by Terwee and colleagues [28]. To fit accelerometer-based methods, we used the following relevant parts and, made minimal adjustments (e.g., wordings): ‘Box 6 Reliability’ for test–retest- and inter-device reliability,’Box 8 Criterion Validity’ for criterion validity, and ‘Box 9 Hypothesis testing for construct validity’ part ‘9a Comparison with other outcome measurement instruments’ for convergent validity. Moreover, we developed two boxes with additional items to rate methodological quality of studies assessing the criterion- or convergent validity of a specific accelerometer data analysis approach to categorize physical behavior: 1) conventional cut-points based method using a single value, or 2) multi-parameter method using more than one parameter, e.g., sleep algorithm, machine learning method. Two of the authors (TA and MC) independently rated the most diverse included studies for each measurement property. Thereafter, we added examples or explanations to clarify the items and ensure studies were scored using the correct box. The CAMQAM was used as a modular tool; only those boxes were completed for the measurement properties evaluated in the study.

For each examined measurement property, the study design requirements were rated as either very good, adequate, doubtful, or inadequate quality [25]. To rate the final methodological study quality, a worst score method was adopted, i.e., using the lowest rating of any item in a box. Additional File 2 presents the complete checklist and scoring manual.

In the appraisal of methodological study quality, the following study aspects were considered: study design (e.g., sample, epoch length, measurement duration, comparison instrument and their measurement properties in the study population) and the performed statistical analysis to evaluate the measurement property of the accelerometer-based method (see Table 1 in Additional File 2 for a summary of the definitions).

Rating study results

The result of each study on a measurement property was rated against the criteria for good measurement properties proposed in the COSMIN guideline, i.e., sufficient (+), insufficient (-), inconsistent (±) or intermediate (?) [26]. Below is indicated for each measurement property which outcomes were considered sufficient (+). Outcomes were considered insufficient (-) when these criteria were not met, and intermediate (?) when not all necessary information was reported. Due to the great variety of accelerometer-based methods adopted in the studies, quantitative pooling or quantitatively summarizing of the results was not feasible.

Reliability

Reliability results were considered acceptable under the following conditions: 1) Intraclass Correlation Coefficients (ICC) or Kappa values (κ) were ≥ 0.70 [28]; or Pearson (r_p), Spearman rank (r_sp) or unknown (r) correlation coefficients were ≥ 0.80 [25]. Some studies reported multiple correlations per accelerometer-based method for reliability, e.g., separate correlations for different physical intensities. Therefore, we applied a rating per physical behavior (i.e., incorporating correlations separately for PA, SB, and/or sleep), and an overall rating (i.e., incorporating all correlations) to obtain final reliability rating for each study. When ≥ 75% of reliability outcomes were acceptable, a sufficient rating was received, when ≥ 50% and < 75% of reliability outcomes were acceptable an inconsistent rating was received, and an insufficient evidence rating was received when < 50% of reliability outcomes were acceptable.

Validity

Criterion validity was considered acceptable when: 1) correlations or κ with the ‘gold standard’ were ≥ 0.70 (Table 1, e.g., comparison measure was polysomnography for accelerometer-based methods aiming to assess sleep, or indirect calorimetry to score energy expenditure); or diagnostic test results (i.e., area under the receiver operating curve, accuracy, sensitivity, or specificity) were ≥ 0.80.

Table 1.

Physical behavior assessed by accelerometer-based methods evaluating validity, subdivided by level of evidence^a, and criteria for acceptable outcome values

Physical behavior	Level of Evidence
Physical behavior	*Level 1*	*Level 2*	*Level 3*
Physical activity	Indirect calorimetry to score energy expenditure, e.g., DLW, AEE	Direct observation to score activity type	Pedometer (daily) step counts
Physical activity	Direct video observation to score activity type	Accelerometer-based magnitude of acceleration (different brand/type)
Sedentary behavior (primarily defined as an activity type, and secondarily as an intensity)	Direct video observation to score activity type	Direct observation to score activity type	Accelerometer-based magnitude of acceleration (different brand/type)
	Indirect calorimetry to score energy expenditure	Accelerometer-based orientation classification (thigh data), e.g., activPAL
Sleep	Polysomnography	Videosomnography or direct video observation to classify sleep–wake states	Direct observation to classify sleep–wake states
Sleep		Indirect calorimetry, e.g., SMR	Accelerometer-based magnitude of acceleration (different brand/type)
Outcome value	Acceptable results
	*Level 1*	*Level 2*	*Level 3*
r_p, r_sp, r, R²	≥ .70	≥ .60	≥ .60
AUC-ROC, accuracy, Se, Sp	≥ .80	≥ .80	≥ .80
ICC, CCC κ, κ_w, κ_qw	≥ .70	≥ .70	≥ .70

Open in a new tab

Abbreviations: AEE activity energy expenditure, AUC-ROC area under the receiver operating curve, CCC concordance correlation coefficient, DLW doubly labelled water, ICC intraclass correlation coefficient, κ Kappa, κ_w weighted Kappa, κ_qw quadratic weighted Kappa, r_p Pearson correlation coefficient, r_sp Spearman’s rank-order correlation coefficient, r unknown correlation coefficient, R² R-squared value, Se sensitivity, SMR sleeping metabolic rate, Sp specificity

^a Level of evidence: level 1 indicating strong evidence, level 2 indicating moderate evidence, and level 3 indicating weak evidence

To rate the results of studies that evaluated convergent validity, we formulated criteria for acceptable results regarding the confidence in the comparison instrument to accurately measure the relevant physical behavior (i.e., level of evidence) (Table 1). We first assessed the level of evidence using these criteria, where level 1 indicated strong evidence, level 2 indicated moderate evidence, and level 3 indicated weak evidence. Thereafter, subdivided by the level of evidence, we rated study results as acceptable when: 1a) correlations (i.e., r_p, r_sp, r) with the comparison measure were ≥ 0.70 (level 1); or 1b) correlations with the comparison measure were ≥ 0.60 (level 2 or level 3) [13]; 1c) ICC, Concordance Correlation Coefficients (CCC), or κ with the comparison measure were ≥ 0.70; 2) or diagnostic test results were ≥ 0.80. As most studies reported multiple results, we applied a rating per physical behavior (i.e., incorporating results separately for SB, PA, and/or sleep), and an overall rating for each study. When ≥ 75% of the validity outcomes were rated as acceptable, a sufficient rating was received, when ≥ 50% and < 75% of validity outcomes were rated as acceptable an inconsistent rating was received, and an insufficient evidence rating was received when < 50% of validity outcomes were acceptable.

Quality of evidence grading

Quality of evidence was graded using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach as proposed in the COSMIN guideline, i.e., high, moderate, low, or very low [26], to indicate trustworthiness of the measurement property results. To derive the grading, the methodological study quality (i.e., risk of bias) was weighted with relevant risk factors: 1) inconsistency, i.e., unexplained inconsistency of results across studies, 2) imprecision, i.e., total sample size of the available studies, and 3) indirectness, i.e., evidence from different populations than the population of interest in this review [26]. The evidence grading was subsequently downgraded with one, two, or three levels for each risk factor, to moderate, low, or very low quality of evidence. The quality of evidence grading was performed for each measurement property and each accelerometer-based method separately.

Results

The systematic literature search yielded a total of 1,673 unique articles. After title and abstract screening, 82 full-texts were screened. Additionally, 16 articles were found through cross-reference searches. Therefore, a total of 98 full-text articles were assessed for eligibility, of which 62 were included (see Fig. 1 for the full selection process). Thirteen of the included studies evaluated the measurement properties of accelerometer-based methods for assessing SB, PA, and/or sleep in infants [29–41], nine in toddlers [42–50], and forty in preschoolers [51–90].

Fig. 1 — Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) flow diagram of study inclusion

The included studies evaluated measurement properties for time series data or magnitude of acceleration directly [42, 45, 51–60, 62, 63, 70] or applied one of the following data analysis approaches to categorize physical behavior: a conventional cut-points based method using a single value [29–34, 42–47, 51–53, 56, 61, 64–81, 89], or a multi-parameter method (e.g., algorithm, machine learning) [30, 35–41, 48–50, 55, 66, 75, 82–90].

Reliability

Table 2 summarizes the results for reliability, of which two studies examined test–retest reliability [29, 65], and one study examined inter-device reliability [42]. Test–retest reliability of an accelerometer-based method using a cut-points based method was evaluated by measuring SB and PA in preschool aged children [65]. Total PA, SB, light PA (LPA) and moderate-to-vigorous PA (MVPA) were considered reliable across all wear time criteria, except for absolute values of SB. For absolute values of SB, results were sufficient if data was collected for ≥ 5 days/week of ≥ 10 h. Despite adequate methodological study quality, these results received low quality of evidence as they were retrieved in a sample of 91 preschoolers. Inter-device reliability for epoch level activity counts (60 s) was rated as sufficient for activity counts in toddlers wearing two Actical devices side-by-side on the non-dominant ankle [42]. Despite adequate methodological study quality, these results received low quality of evidence as they were retrieved in a limited sample of 24 toddlers.

Table 2.

Reliability of accelerometer-based methods, sorted by methodological study quality, study result rating and quality of evidence

Study

Study population^a

Outcome(s) & Setting

Time interval

Methodological study quality^b

Placement

Device-based method^c

Results

Study result rating^d

Quality of evidence^e

Hager et al. (2016) [42]

Toddlers

n = 24 * 2 Ac

Age = 24.5 ± 5.2 (14.7 to 35.5) months

Sex = 41.7% girls

Activity counts

Laboratory (structured activities)

n.a

Non-dominant ankle; side-by-side

Ac-cts-omni-60

ICC_? = .98

Low

Aadland & Johannessen (2015) [65]

Preschoolers

n = 91

Age = 4.07 ± 0.5 (3 to 5) years

Sex = 51% girls

Total PA, SB, LPA and MVPA in min/day and % of valid wear time

Free-living

14 days

Not reported

AG-Ev-VA-10

ICC⁻_(1,1) ≥ .75 for all outcomes across all wear criteria (≥ 6, 8 and 10 h/day and ≥ 3 and 5 days/week)

Except for SB (min/day): ICC⁻_(1,1) .61 to .81

Higher ICC⁻_(1,1) for percentages than absolute values

ICC⁻_(1,1) ≥ .80 (≥ 6 and ≥ 8 h/day) > 7 days required

ICC⁻_(1,1) ≥ .80 (≥ 10 h/day) within 7 days: 3.9 to 7.1 days

(SB + ;

PA +)

Low

Greenspan, Cunha, & Lobo (2021) [29]

Infants

n = 16

Age = 3.1 ± 1.1 (1.8 to 5) months

Sex = 51% girls

Body position: supine, reclined, upright, inclined, and prone

Free-living (structured play)

0 days

Trunk

GG-Gr-HA/VA/DA-1

κ_w = .89, 95% CI (.87 to .91)

Supine: agreement = 89.2%

Reclined: agreement = 72.6%

Upright: agreement = 90.9%

Inclined: agreement = 84.6%

Prone: agreement = 91.1%

Very low

Open in a new tab

Abbreviations: Ac Actical, AG ActiGraph, cpm counts per minute, cts counts, DA diagonal axis (z-axis), Ev Evenson’s cut-points (2008) [91], GG get around garment with ADXL335, Gr Greenspan’s cut-points (2021) [29], HA horizontal axis (x-axis), ICC_? intraclass correlation coefficient analysis decision was unclear, ICC⁻_(1,1) intraclass correlation coefficient inappropriate analysis decision one-way random effects model, κ_wweighted Kappa, LPA light physical activity, MVPA moderate-to-vigorous physical activity, n.a. not applicable, omni omnidirectional, PA physical activity, r unknown correlation coefficient, SB sedentary behavior, VA vertical axis (y-axis), 1 1 s epoch, 10 10 s epoch, 60 60 s epoch

^aAge presented as mean ± SD (range)

^bMethodological study quality based on newly developed checklist: VG very good, A adequate, D doubtful, I inadequate

^cDevice-based method described using code combinations of four elements resulting in the following format: brand-axis-approach-epoch length

^dStudy result rating based on COSMIN guideline: + sufficient, ± inconsistent, - insufficient, ? intermediate

^eQuality of evidence based on GRADE approach

Validity

The following subsections present the results for validity by age group. Notably, most studies used the vertical axis (VA). In studies among infants the accelerometer was predominantly worn on the ankle, while for studies among toddlers or preschoolers the devices were mainly placed on the hip. Unless otherwise specified, we report study results based on this majority placement and axis.

Infants

Table 3 summarizes the results for validity, of which four studies examined criterion validity [30–32, 35], and nine studies examined convergent validity [29, 33, 34, 36–41] in infants.

Table 3.

Criterion and convergent validity of accelerometer-based methods for infants, sorted by methodological study quality, quality of evidence (level of evidence), and study result rating

	Study	Study population^a	Outcome(s) & Setting	Comparison measure(s)	Methodological study quality^b	Placement	Device-based method^c	Results	Study result rating^d	Quality of evidence^e (level of evidence)
Criterion validity	Galland et al. (2012) [30]	n = 31 Age = 13.0 ± 3.1 (10.0 to 22.3) weeks Sex = 27.3% girls	Sleep quality: sleep latency, TST, sleep efficiency, and WASO (duration and number) Laboratory (daytime nap)	Polysomnography to score sleep stages as sleep (active, quiet, intermediate) and wake	VG	Shin	Ac-CS-omni-15	accuracy = 86.3%, Se = 85.7%, Sp = 84.3%, κ = .66, PABAK = .72; Sleep latency r_p = .79*** (underestimated by 3 min), TST r_p = .83, Sleep efficiency r_p = .87, WASO (duration) r_p = .48* (overestimated by 6.5 min*), WASO (number) r_p = .35 (overestimated by 3*)	+	Low
							Ac-CS-omni-30	accuracy = 86.1%, Se = 84.7%, Sp = 87.4%, κ = .66, PABAK = .72	+
							Ac-CS-omni-60	accuracy = 84.5%, Se = 79.8%, Sp = 90.2%, κ = .63, PABAK = .69	-
							Ac-S0-omni-15	accuracy = 82.3%, Se = 94.8%, Sp = 57.8%, κ = .55, PABAK = .65; Sleep latency r_p = .80* (underestimated by 5 min), TST r_p = .76** (overestimated by 11 min), Sleep efficiency r_p = .76 (overestimated by 16.5%), WASO (duration) r_p = .48* (overestimated by 7 min**), WASO (number) r_p = .38 (overestimated by 3***)	±
							Ac-S0-omni-30	accuracy = 86.3%, Se = 91.0%, Sp = 77.7%, κ = .65, PABAK = .72	±
							Ac-S0-omni-60	accuracy = 84.5%, Se = 80.0%, Sp = 89.9%, κ = .63, PABAK = .69	±
							Ac-PS-omni-15	accuracy = 84.3%, Se = 95.1%, Sp = 63.1%, κ = .59, PABAK = .69; Sleep latency r_p = .93*, TST r_p = .80* (overestimated by 11 min), Sleep efficiency r_p = .81** (overestimated by 13.6%), WASO (duration) r_p = .50* (overestimated by 3.5 min*), WASO (number) r_p = .13 (overestimated by 3*)	±
							Ac-PS-omni-30	accuracy = 86.8%, Se = 90.8%, Sp = 79.0%, κ = .66, PABAK = .74	±
							Ac-PS-omni-60	accuracy = 84.6%, Se = 79.9%, Sp = 90.3%, κ = .63, PABAK = .69	±
							Ac-WSC-omni-15	accuracy = 83.2%, Se = 94.6%, Sp = 60.2%, κ = .57, PABAK = .66; Sleep latency r_p = .78*** (underestimated by 4 min), TST r_p = .78** (overestimated by 10 min), Sleep efficiency r_p = .79** (overestimated by 15.2%), WASO (duration) r_p = .41 (overestimated by 7 min*), WASO (number) r_p = .29 (overestimated by 3*)	±
							Ac-WSC-omni-30	accuracy = 86.5%, Se = 90.9%, Sp = 78.2%, κ = .66, PABAK = .73	±
							Ac-WSC-omni-60	accuracy = 84.5%, Se = 80.0%, Sp = 89.9%, κ = .63, PABAK = .69	±
	Insana, Gozal, & Montgomery-Downs (2010) [31]	n = 22 Age = 14.1 ± 0.6 (13.0 to 15.0) months Sex = 45.5% girls	Sleep quality: TST and WASO (duration) Laboratory (in the hospital)	Polysomnography to score sleep stages as sleep and wake	VG	Ankle	Aw-ACT40-uni-15	accuracy = 89.6%, Se (89.0 to 96.3%), Sp = 58.9% TST: r_p = .83**, ICC* = .80, MD = -72.3 (SD = 61.5) min**, d = .70 WASO: r_p = .52, ICC = .65, MD = 13.9 (SD = 30.9) min*, d = .44	±	Low
	Insana, Gozal, & Montgomery-Downs (2010) [31]			Polysomnography to score sleep stages as sleep and wake	VG	Ankle	Aw-ACT80-uni-15	TST: r_p = .84**, ICC* = .86, MD = 52.1 (SD = 60.4) min*, d = .50 WASO: r_p = .52, ICC = .65, MD = 6.4 (SD = 26.8) min	±	Low
	Rioualen et al. (2015) [32]	n = 24 Age = 2.5 ± 0.7 days Sex = 50.0% girls	Sleep stage: wake, active sleep, and quiet sleep Laboratory (in the hospital)	Polysomnography to score sleep stages as sleep (active, quiet) and wake	VG	Wrist	Aw-ASA40-uni-60	Se = 93%, 95% CI (89 to 96) Sp = 20%, 95% CI (14 to 27)	±	Low
	Rioualen et al. (2015) [32]	n = 24 Age = 2.5 ± 0.7 days Sex = 50.0% girls			VG	Ankle	Aw-ASA40-uni-60	accuracy = 58.5%, Se = 87%, 95% CI (81 to 93), Sp = 31%, 95% CI (24 to 39)	-	Low
	Lewicke, Sazonov, & Schuckers (2004) [35]	n = 25 Age = n.r Sex = n.r	Sleep stage: wake, active sleep, and quiet sleep Laboratory	Polysomnography to score sleep stages as sleep (active, quiet, indeterminate) and wake	I	Left hip	mSCA-LVQ-uni-30	Training: accuracy = 80.7%, Se = 94.6%, Sp = 48.2% Validation: accuracy = 75.3%, Se = 92.3%, Sp = 42.4%	±	Very low
Convergent validity	Camerota et al. (2018) [33]	n = 82 Age = 3.6 ± 0.6 (2.7 to 5.2) months Sex = 43% girls	Sleep quality: sleep onset time, rise time, sleep period, WASO (number and duration), TST, and longest sleep period Free-living (at home)	Videosomnography to score sleep stages as sleep and wake	VG	Left ankle	Aw-ACTdef-uni-15	κ = .47, Se = 52%, Sp = 95% Sleep onset time: r = .79, underestimated by 29 min*, d = .43 Rise time: r = .76 Sleep period: r = .78, overestimated by 23.6 min, d = .28 WASO: (number) r = .47, underestimated by 1.1*, d = .54; (duration): r = .59, overestimated by 58.4 min**, d = 72 TST: r = .54 Longest sleep period: r = .38, overestimated by 35.5 min, d = .28	-	Moderate (level 2)
	Greenspan et al. (2021) [29]	n = 16 Age = 3.1 ± 1.1 (1.8 to 5) months Sex = 51% girls	Body position: supine (parallel to floor stomach facing up), reclined (trunk tilted about 45° posteriorly from upright), upright (90°), inclined (trunk tilted about 45° anteriorly from upright), and prone (parallel to floor stomach facing down) Free-living (structured and free play)	DVO to score body position by two independent raters using outlined definitions	VG	Trunk	GG-Gr-HA/VA/DA-1	κ_w = .84, 95% CI (.83 to .84) Supine: accuracy = 90.7%, r_sp = .97*** Reclined: accuracy = 73.5%, r_sp = .89*** Upright: accuracy = 83.1%, r_sp = .97*** Inclined: accuracy = 41.4%, r_sp = .67** Prone: accuracy = 78.4%, r_sp = .96***	±	Low (level 1)
	Jun & Choi (2020) [40]	n = 9 Age = 144 ± 284.5 (2 to 720) days Sex = 59.1% girls	Activity type: sleeping (without substantial movement), strong movement (struggling with crying or in agony), weak movement (moving in comfy state), and movement by external force (from nurse or caregiver) Laboratory (at the hospital)	DVO to score activity type by two independent raters using a 3-stage classification scheme	A	Chest	ICM-DNN-HA/VA/DA-4	accuracy = .96, F-score = .95, precision = .98, recall = .93, Sp = .98 Sleep: accuracy = .97, F-score = .99, precision = .99, recall = 1.0, Sp = .94 Strong movement: accuracy = .97, F-score = .95, precision = .96, recall = .94, Sp = .99 Weak movement: accuracy = .95, F-score = .93, precision = .97, recall = .90, Sp = .99 External force: accuracy = .95, F-score = .95, precision = 1.0, recall = .89, Sp = 1.0	+	Very low (level 1)
	Airaksinen et al. (2020) [39]	n = 22 Age = 6.7 ± 0.8 (4.5 to 7.7) months Sex = 59.1% girls	Body position: prone (navel on floor), supine (lower back on floor), side left/right, crawl posture (supported by hands, and knees or feet); Movement: macro still (no movement), turn left/right (change of posture along prone-side-supine axis), pivot left/right (change of facing whole body direction, without movement), crawl proto (practice crawl without moving during prone/side posture and move multiple limbs during supine posture), crawl commando (crawl with forward movement) Laboratory (semi-structured)	DVO to score body position and movement by three independent raters using a developed annotation scheme	A	Arms + legs; proximally	SM-CNN-VM-2	Movement track: F-score = 80% Full agreement: posture: accuracy = 99.1%, movement: accuracy = 90.7%; All frames: posture: accuracy = 98.2%, movement: accuracy = 81.7% Most frequent confusions in prone-side-supine axis as well as between crawl posture and prone Individual sensors lower performance (left arm posture: accuracy = 71%, movement: accuracy = 70%, left leg posture: accuracy = 90%, movement: accuracy = 68%) compared to four-sensor setup (posture: accuracy = 95%, movement: accuracy = 80%), and two-sensor setup (right leg + arm posture: accuracy = 94%, movement: accuracy = 78%; left leg + arm posture: accuracy = 95%, movement: accuracy = 79%; left + right arm posture: accuracy = 85%, movement: accuracy = 72%; left + right leg posture: accuracy = 93%, movement: accuracy = 72%)	+ Arm: - Arms: ± Leg: ± Legs: ± Combined 2-sensor: ± ; 4-sensor: +	Very low (level 1)
	Airaksinen et al. (2020) [39]				A	Arms + legs; proximally	SM-SVM-VM-2	Movement track: F-score consistently 5–10% lower performance in prone positions (crawl proto, turn right*; pivot left) Posture classification comparable between CNN and SVM	±	Very low (level 1)
	Smith et al. (2015) [41]	n = 12 Age = 6.8 ± 2.9 (1 to 12) months Sex = 66.7% girls	Activity type: leg movement (change of limb position) Laboratory	DVO to score activity type by one rater	D	Legs	Op-AAV-VM-n.r	accuracy = 92.7%, Se = 92.0%	+	Very low (level 1)
	Hewitt et al. (2019) [34]	n = 32 Age = 15.2 ± 6.4 (4.7 to 24.9) weeks Sex = 40.6% girls	Body position: prone on floor (tummy with both hips touching the floor), prone supported (held in prone, on parent’s chest), and non-prone (supine, left/right side lying, reclined in car seat/pram, upright, supported sitting, cradle hold) Laboratory (structured positions)	DVO to score body position by one rater (one randomly chosen video was scored by four independent raters) using outlined definitions	D	Right hip	AG-He-VA/HA-1	Prone: accuracy = 90%, MD = -18.3 s, LoA (-97.0 to 60.3) Non-prone: accuracy = 99.9%, MD = -0.2 s, LoA (-1.2 to 0.9) Prone supported: accuracy = 63.6%, MD = -127.3 s, LoA (-324.7 to 70.2)	±	Very low (level 1)
						Right hip	GA-Ai-VA/HA-1	Prone: accuracy = 95.4%, MD = -8.4 s, LoA (-78.2—61.3) Non-prone: accuracy = 98%, MD = 31.2 s, LoA (-154.9 to 92.4) Prone supported: accuracy = 52.2%, MD = -166.9 s, LoA (-390.7 to 56.8)	±
						Ankle	AG-He-VA/DA-1	Prone: accuracy = 87.9%, MD = -22.1 s, LoA (-124.0 to 79.7) Non-prone: accuracy = 96.3%, MD = -56.6 s, LoA (-209.5 to 96.3) Prone supported: accuracy = 53.3%, MD = -163.1 s, LoA (-431.7 to 105.6)	±
						Chest	MB-He-VA/HA/DA-1	Prone: accuracy = 79.2%, MD = -38 s, LoA (-194.5 to 118.5) Non-prone: accuracy = 99.9%, MD = 2.0 s, LoA (-14.0 to 10.8) Prone supported: accuracy = 66.1%, MD = -113 s, LoA (-355.6 to 129.6)	-
	Horger et al. (2021) [37]	n = 9 Age_start = 7.96 ± 2.51 (0.89 to 8.19) months Age_end = 10.89 ± 1.14 (9.5 to 13.08) months Sex = 44.0% girls	Sleep quality: sleep onset, morning wake time, WASO, sleep efficiency, and sleep duration Free-living (at home)	Videosomnography to score sleep quality	D	Left ankle	MM-PSinf-uni-60	Sleep onset: r_p= .94**, MD = 0.76 min, SD = 34.5 min Morning wake: r_p= .74, MD = -14.1 min, SD = 40.4 min WASO: r_p= .20, MD = 1.2, SD = 3.9 Sleep efficiency: r_p= .30 Sleep duration: r_p= .36	-	Very low (level 2)
	Horger et al. (2021) [37]			Videosomnography to score sleep quality	D	Left ankle	MM-PS-uni-60	Sleep onset: r_p= .86** Morning wake: r_p= .55 WASO: r_p= .27 Sleep efficiency: r_p= .51 Sleep duration: r_p= .12	-	Very low (level 2)
	Sadeh et al. (1995) [38]	n = 41 (10 newborns; 11 3-month-olds; 10 6-month-olds; 10 12-month-olds) Age = 0 to 12 months Sex = 46.4% girls	Sleep stage: sleep (active and quiet) and wake Free-living (newborns at hospital; others at home)	DO using Thoman’s observation scheme (1975, 1995) to score sleep stage as sleep (active and quiet), wake, sleep–wake transition and uncertain	D	Left ankle	AMA-PS-uni-60	All infants: accuracy = 83.4% (sleep–wake = 95.6%; wake = 93.5%; active sleep = 74.9% and quiet sleep = 78.0%) (excl. 12-month-olds) Newborns: accuracy = 74.9% (wake = 82.8%; active sleep = 74.4.1%, quiet sleep = 54.9%) 3-month-olds: accuracy = 87.3% (wake = 92.5%, active sleep = 78.3%, quiet sleep = 87.2%; sleep–wake = 93.8%); 6-month-olds: accuracy = 83.2% (wake = 97.8%, active sleep = 66.4%, quiet sleep = 76.7%; sleep–wake = 97.9%); 12-month-olds: wake = 99.3%; sleep–wake = 97.2% Wake time: r (.85 to .99), % active sleep time: r (.78 to .98), % quiet sleep time: r (.36 to .85) (lowest correlations in newborns) Wake: MD (0.2 to 3.5%), sleep: MD (0.4 to 11.3%)	± < 3 months: - > 3 months: +	Very low (level 3)
	Gnidovec, Neubauer, & Zidar (2002) [36]	n = 10 Age = 1 to 6 months Sex = 40% girls	Sleep stage: Sleep (active and quiet) and wake (active and quiet) Free-living (at home)	DO using Thoman’s observation scheme (1990) to score sleep stage as sleep (active and quiet), wake (active and quiet), sleep–wake transition and uncertain	I	Left ankle	Gw-ASW-uni-10	1-month: accuracy = 72.0% (sleep = 88.7%, wake = 57.5%); 3-months: accuracy = 90.1% (sleep = 97.0%, wake = 84.5%); 6-months: accuracy = 95.0% (sleep = 88.4%, wake = 98.0%) After 3^rd month 88.9% and 6^th month 94.4% accuracy (calibration sample n = 5) Invalid discrimination between active- and quiet sleep all age groups < 75%, best for 3-month-olds (accuracy = 74.3%; active sleep: accuracy = 49.3%, quiet sleep: accuracy = 88.3%), as compared to 1-month (accuracy = 58.5%; active sleep: accuracy = 35.9%, quiet sleep: accuracy = 90.5%) and 6-months (accuracy = 66.5%; active sleep: accuracy = 39.9%, quiet sleep: accuracy = 87.1%)	± 1 month: - > 3 months: +	Very low (level 3)

Open in a new tab

Abbreviations: AAV acceleration and angular velocity algorithm [41], Ac Actical, ACTdef Actiware software with default wake threshold value of .888 * mean acceleration, ACT40 Actiware software with wake threshold value of 40, ACT80 Actiware software with wake threshold value of 80, ASA40 Actiwatch activity and sleep analysis software with wake threshold value = 40, AG ActiGraph, Ai Activinsights software, AMA AMA-32, ASW automatic sleep wake scoring algorithm, Aw Actiwatch, CI confidence interval, CNN convolutional neural network, CS count-scaled algorithm [30], d Cohen’s d, DA diagonal axis (z-axis), DNN deep neural network, DO direct observation, DVO direct video observation, GA GENEActiv, GG get around garment with ADXL335, Gr Greenspan’s cut-points (2021) [29], Gw Gaewiler, HA horizontal axis (x-axis), He Hewitt’s cut-points (2019) [34], ICC intraclass correlation coefficient, ICM ICM20600 chip, κ Kappa, κ_w weighted Kappa, LoA limits of agreement, LVQ neural network learning vector quantization, MB MonBaby, MD mean difference,MM MicroMini sleep watch, mSCA miniature semiconductor chip accelerometer, n.r. not reported, omni omnidirectional, Op Opal APDM, PABAK prevalence- and bias-adjusted Kappa, PS probability scaled algorithm [38], PSinf probability scaled algorithm for infants [92], r correlation coefficient (unknown), r_p correlation coefficient (Pearson), r_sp correlation coefficient (Spearman rank), SD standard deviation, Se sensitivity, SM Suunto Movesense sensor, Sp specificity, SVM support vector machine, S0 zero-threshold computation [93], TST total sleep time, uni uniaxial (axis was not specified), VA vertical axis (y-axis), VM vector magnitude, WASO wake after sleep onset, WSC weighted sum activity algorithm [30], 1 1 s epoch, 2 2 s epoch, 4 4 s epoch, 10 10 s epoch, 15 15 s epoch, 30 30 s epoch, 60 60 s epoch

^aAge presented as mean ± SD (range)

^bMethodological study quality based on newly developed checklist: VG very good, A adequate, D doubtful, I inadequate

^cDevice-based method described using code combinations of four elements resulting in the following format: brand-axis-approach-epoch length

^dStudy result rating based on COSMIN guideline: + sufficient, ± inconsistent, - insufficient, ? intermediate

^eQuality of evidence based on GRADE approach

^*p < .05

^**p < .01

^***p < .001

No studies assessed validity of cut-points for SB, LPA and MVPA in infants, while for sleep no cut-points (i.e., wake thresholds) were evaluated as valid [30–33]. Quality of evidence was low for studies evaluating criterion validity, as results were retrieved in limited samples of 22 to 31 infants, despite very good methodological quality [30–32]. The results of the study that evaluated convergent validity, were insufficient, despite moderate quality of evidence [33].

In contrast, multi-parameter methods were more suitable for assessing the physical behavior of infants than a conventional cut-points based method. Infant leg movements could sufficiently be distinguished from non-infant produced movement using an algorithm describing velocity and acceleration magnitude for this activity [41]. However, these results received very low quality of evidence as methodological study quality was doubtful and the results were retrieved in a sample of only 12 infants.

For posture and movement classification, using arm and leg data, validity of convolutional neural networks was rated as sufficient [39]. The performance of convolutional neural networks and supported vector machines were comparable for classification of infant specific postures, e.g., tummy time and crawl posture. However, for movement in prone positions (e.g., crawl, turn and pivot) the performance of the convolutional neural networks was consistently 5 to 10% higher than the performance of support vector machines, resulting in a sufficient study result rating for the former and an inconsistent rating for the latter. Despite adequate methodological study quality, these results received low quality of evidence as the results were retrieved in a sample of 22 infants. Another neural network using chest data was rated as sufficient for sleep and movement classification [40]. Despite adequate methodological study quality, the results received very low quality of evidence as the results were retrieved in a sample of only nine infants.

Sleep could be distinguished from wake from 3-months of age using different multi-parameter methods [30, 36, 38]. Convergent validity of the Sadeh algorithm that calculates the probability of sleep, was rated as sufficient in free-living situations for infants [38]. However, it was less suitable to distinguish active and quiet sleep. These results received very low quality of evidence as the results were retrieved in a limited sample of 41 infants and methodological study quality was doubtful. Similarly, convergent validity of the automatic sleep–wake scoring algorithm developed for raw data was rated as sufficient to distinguish sleep from wake, despite low accuracy for distinguishing active from quiet sleep [36]. However, these results received very low quality of evidence as they were retrieved in a sample of only 10 infants and methodological study quality was inadequate. Galland and colleagues (2012) determined the accuracy of three algorithms for distinguishing sleep from wake states using 15-, 30- and 60 s epochs in infants with a mean age around 3-months [30]. In line with previous results, criterion validity of the Sadeh and the Cole-Kripke (computing the weighted sum activity) algorithm was rated as insufficient. However, criterion validity of an algorithm similar to the Cole-Kripke algorithm that uses count-scaled data (leg placement) was rated as sufficient in infants of around 3-months of age using 15 s or 30 s epochs. The best performing algorithm used a sampling epoch of 15 s. Sleep agreement of the other algorithms was highest using the 15- or 30 s epoch, however, at the expense of wake agreement. Notably, correspondence with polysomnography was poorest for the number of wake time after sleep onset using 60 s epochs. Despite very good methodological study quality, these results received low quality of evidence due to the limited sample size of 31 infants.

Toddlers

Table 4 summarizes the results of nine studies in toddlers that examined convergent validity [42–50]. No studies evaluated methods to distinguish sleep from wake.

Table 4.

Convergent validity of accelerometer-based methods for toddlers, sorted by methodological study quality, quality of evidence (level of evidence), and study result rating

	Study	Study population^a	Outcome(s) & Setting	Comparison measure(s)	Method ological study quality^b	Placement	Device-based method^c	Results	Study result rating^d	Quality of evidence^e (level of evidence)
Convergent validity	Kwon et al. (2019) [49]	n = 21 Age = 25 ± 2.5 (13 to 35) months Sex = 50% girls	Activity type: running (forward from one place to another), walking (forward from one place to another), climbing up/down (the stairs/foam climber), crawling (moving forward on two hands and two knees), riding a ride on toy (sitting on toy, moving forward using two feet), standing (without lifting a foot), sitting (on the ground), stroller (sit on stroller/wagon pushed by adult), and being carried (by adult while adult is walking) Free-living (free play in commercial indoor playroom)	DVO using a developed coding scheme to score activity type by two independent raters	VG	Hip	AG-RF-VA/HA/DA/VM-5	Best differentiating features: DA FFT SD (d' = 0.64), DA FFT max (d' = 0.61), HA FFT SD (d' = 0.47). Feature importance all 78 features < 0.1 (highest: SD of VM 0.039). Top 10 ranked features: basic quantiles (e.g., min, median, max) of single axis direct values and FFT values RF: accuracy = 89%, precision = 88%, recall = 89%, F-score = .88 58% of carried labels were correctly (SB) classified, whereas 89% of ambulation was correctly classified	+	Low (level 1)
	Costa et al. (2014) [46]	n = 20 Age = 2.99 ± 0.48 (2 to 3) years Sex = 60% girls	Activity intensity: Total time in SB, LPA, and MVPA Laboratory (semi-structured activity sessions)	DVO using CARS (modified) to score activity type and determine activity intensity (SB: stationary with no movement, and with movement of the limbs; LPA: slow/easy translocation; MVPA: translocation fast, and with moderate effort) by one rater (repeated after one month for random minute of each child)	VG	Right hip	AG-C-VM-5	Total PA: CCC = .80, MD = 48.0 s, LoA* (-217.9 to 121.9) SB: CCC = .74, MD = 48.0 s, LoA* (-121.9 to 217.9) LPA: CCC = .34, MD = -163.5 s**, LoA* (-343.0 to 16.0) MVPA: CCC = .40, MD = 117.8 s**, LoA* (-51.4 to 286.9)	± (SB + ; PA -)	Low (level 1)
							AG-C-VA-5	Total PA: CCC = .84, MD = 5.3 s, LoA (-171.1 to 181.6) SB: CCC = .77, MD = -5.3 s, LoA (-181.6 to 171.1) LPA: CCC = .45**, MD = -112.5 s, LoA* (-320.5 to 95.5) MVPA: CCC = .42, MD = 115.5 s**, LoA* (-49.3 to 280.3)	± (SB + ; PA -)
							AG-Pa-VA-15	Total PA: CCC = .71, MD = 50.5 s, LoA (-183.0 to 284.0) SB: CCC = .75, MD = -22.0 s, LoA (-209.7 to 165.7) LPA: CCC = .38, MD = -115.3 s*, LoA* (-397.27 to 166.8) MVPA: CCC = .30, MD = 165.8 s**, LoA* (-71.2 to 402.73)	± (SB + ; PA -)
							AG-T12-VA-15	Total PA: CCC = .85, MD = 15.3 s, LoA (-152.4 to 182.9) SB: CCC = .79, MD = -15.3 s, LoA (-182.9 to 152.4) LPA: CCC = .36, MD = -150.5 s**, LoA* (-391.1 to 90.1) MVPA: CCC = .30, MD = 165.8 s**, LoA* (-71.2 to 402.7)	± (SB + ; PA -)
	van Cauwenberghe et al. (2011) [45]	n = 31 Age = 20 ± 4 (12 to 30) months Sex = 45.2% girls	Activity intensity: SB, LPA and MVPA Free-living (out- & indoor free play at childcare)	DVO using OSRAC-P to score activity type and derive activity intensity (SB: stationary and motionless, and stationary with limb/trunk movement; LPA: slow, easy movement; MVPA: moderate, and fast movement) by two independent raters	VG	Right hip	AG-cts-VA-15	mean r_sp = .66* epoch-by-epoch intensity level r_sp = .52*	-	Low (level 1)
							AG-Pa-VA-15	accuracy = 58.3% SB: AUC-ROC = .71, Se = 67.0%, Sp = 75.4% LPA: AUC-ROC = .62, Se = 60.0%, Sp = 63.2% MVPA: AUC-ROC = .57, Se = 21.5%, Sp = 91.3%	- (SB -; PA -)
							AG-Si3-VA-15	accuracy = 52.7% SB: AUC-ROC = .58, Se = 91.8%, Sp = 23.9% LPA: AUC-ROC = .52, Se = 14.6%, Sp = 89.0% MVPA: AUC-ROC = .53, Se = 8.9%, Sp = 97.1%	- (SB -; PA -)
							AG-vC-VA-15	accuracy = 52.2% SB: AUC-ROC = .56, Se = 94.4%, Sp = 17.2% LPA: AUC-ROC = .51, Se = 9.0%, Sp = 93.7% MVPA: AUC-ROC = .53, Se = 10.0%, Sp = 96.9%	- (SB -; PA -)
	Trost et al. (2012) [44]	n = 18 Age = 2.3 ± 0.4 years Sex = 55.6% girls	Activity intensity: SB, LPA and MVPA Free-living (regularly scheduled play)	DVO using CARS (modified) to score activity type and determine activity intensity (SB: lying down or sitting; LPA: standing; MVPA: walking and running) by two raters	VG	Right hip	AG-T12-VA-15	SB: MD = -7.6**, LoA* (-17.6to 2.3) LPA: MD = 7.2**, LoA* (-2.0 to 16.3) MVPA: MD = 0.5, LoA (-2.6 to 3.5)	?	Low (level 1)
	Trost et al. (2012) [44]	n = 18 Age = 2.3 ± 0.4 years Sex = 55.6% girls		Pre-school cut-points: AG-Si/vC/Re/Pa/N-VA-15	VG	Right hip	AG-T12-VA-15	Preschool and toddler cut-points overestimated SB* and underestimated LPA; Si* and vC ** underestimated MVPA; Pa (≥ 420) same trend as the toddler cut-point T12: MD = 0.5, 95% LoA (-2.5 to 3.5)	?	Low (level 2)
	Albert et al. (2020) [48]	n = 22 Age = 1.5 ± 0.5 (1.1 to 2) years Sex = 54.6% girls	Activity type: Run/walk, crawl, climb, stand, sit, lie down, carried, and stroller as defined by Kwon et al. (2019)[49] Free-living (guided play)	DVO using a developed coding scheme to score activity type by three independent raters	A	Waist	AG-RF-HA/VA/DA-2	accuracy = 63.8% RF + HMM: accuracy = 64.8% Run/walk: recall = 80.0%, precision = 88.2% Crawl: recall = 81.2%, precision = 68.3% Climb: recall = 56.0%, precision = 29.0% Stand: recall = 45.4%, precision = 49.2% Sit: recall = 66.6%, precision = 66.9% Lie down: recall = 61.7%, precision = 76.0% Carried: recall = 58.8%, precision = 43.5% Stroller: recall = 28.2%, precision = 41.0% Sit, stand and stroller collapsed: accuracy = 79.3%	-	Very low (level 1)
							AG-SVM-HA/VA/DA-2	accuracy = 58.6% SVM + HMM: accuracy = 60.1%	-
							AG-LR-HA/VA/DA-2	accuracy = 57.0% LR + HMM: accuracy = 59.1%	-
							AG-J48-HA/VA/DA-2	accuracy = 57.3% J48 + HMM: accuracy = 57.5%	-
							AG-kNN-HA/VA/DA-2	accuracy = 52.6% kNN + HMM: accuracy = 54.1%	-
	Pulakka et al. (2013) [43]	Validation n = 40 Age = 16.9 ± 5.8 (16.0 to 18.3) months Sex = 60% girls Cross-validation n = 30 Age = 17.0 ± 0.6 (16 to 18.5) months Sex = 60% girls	Activity intensity: SB, LPA, MPA, and VPA Free-living (free play sessions	DVO using CPAF to score activity type and derive activity intensity (SB: stationary, no movement; LPA: stationary, limb movement, MPA: slow trunk movement; VPA: rapid trunk movement) by one rater (and 19.5% of the videos by a second rater)	D	Right hip	AG-Pul-VM-15	Validation SB vs. LPA: AUC-ROC = .73, CI (.67 to .80) VPA vs. MPA: AUC-ROC = .67, CI (.56 to .78) SB vs LPA and MVPA: AUC-ROC = .98, CI (.97 to .99) LPA vs. MPA: AUC-ROC = .94, CI (.91 to .97) Cross-validation SB and LPA vs. MVPA: accuracy = 92%, Se = 94.2%, Sp = 90.9%, κ = .83	+ (SB + ; PA ±)	Very low (level 1)
	Pulakka et al. (2013) [43]				D	Right hip	AG-Pul-VA-15	Validation SB vs. LPA: AUC-ROC = .62, CI (.56 to .67) VPA vs. MPA: AUC-ROC = .59, CI (.47 to .72) SB vs. LPA and MVPA: AUC-ROC = .95, CI (.93 to .96) LPA vs. MPA: AUC-ROC = . 90, CI (.87 to .94) Cross-validation SB and LPA vs. MVPA: accuracy = 84%, Se = 84.1%, Sp = 84.6%, κ = .67	± (SB + ; PA ±)	Very low (level 1)
	Hager et al. (2016) [42]	n = 24 Age = 24.5 ± 5.2 (14.7 to 35.5) months Sex = 41.7% girls	Activity intensity: SB, LPA, and MVPA Laboratory (structured activities)	DO using CARS to score activity type and derive activity intensity (SB: stationary with no movement; LPA: stationary with movement of the limbs, and slow/easy translocation; MVPA: translocation fast, and with moderate effort) by one rater	D	Left ankle	Ac-Ha-omni-30	r_sp = .75 SB: Se = 81.8%, Sp = 77.5%; LPA: Se = 61.7%, Sp = 84.7%; MVPA: Se = 85.7%, Sp = 88.4%	± (SB ± ; PA +)	Very low (level 2)
	Oftedal et al. (2014) [47]	n = 10 Age = 29 ± 6 months Sex = 50% girls	Activity intensity: SB and non-SB Laboratory (semi-structured activity sessions)	DVO using DO software Behavioral Evaluation Strategy and Taxonomy to score activity type and determine activity intensity (SB: lying/sitting with(out) limb movement, and standing still) by one rater	I	Waist	AG–O-VM-5	Se = 82%, Sp = 83% bias = -5.1%, LoA (-27.5 to 16.1%)	SB: +	Very low (level 1)
	Oftedal et al. (2014) [47]	n = 10 Age = 29 ± 6 months Sex = 50% girls			I	Waist	AG–O-VA-5	Se = 76%, Sp = 93%, bias = -17.3%, LoA (-44.3 to 8.3%)*	SB: ±	Very low (level 1)
	Nam & Park (2013) [50]	n = 10 Age = 22.4 ± 3.3 (16 to 29) months Sex = 50% girls	Activity type: Wiggling, rolling, standing still, standing up, sitting down, walking, toddling, crawling, climbing up/down, and stopping (precise definitions not indicated) Laboratory (simulated real home environment)	DVO to score activity type by one rater	I	Hip	mSCA-DT- VA/HA/DA/VM-n.r	accuracy = 74.0% (39.5 to 97.2%)	±	Very low (level 1)
							mSCA-NB- VA/HA/DA/VM-n.r	accuracy = 73.0% (28.0 to 90.9%)	-
							mSCA-BN-VA/HA/DA/VM-n.r	accuracy = 84.8% (57.8 to 98.9%)	+
							mSCA-SVM-VA/HA/DA/VM-n.r	accuracy = 86.2% (39.8 to 99.9%)	+
							mSCA-kNN-VA/HA/DA/VM-n.r	accuracy = 84.1% (67.8 to 94.6%)	+
							mSCA-J48-VA/HA/DA/VM-n.r	accuracy = 88.3% (71.7 to 98.7%)	+
							mSCA-MLP-VA/HA/DA/VM-n.r	accuracy = 84.8% (52.5 to 99.5%)	±
							mSCA-MLR-VA/HA/DA/VM-n.r	accuracy = 86.9% (72.1 to 98.7%)	+

Open in a new tab

Abbreviations: Ac Actical, AUC-ROC area under the receiver operating curve, AG ActiGraph, cts counts, BN Bayes net, C Costa’s cut-points (2014) [46], CARS children’s activity rating system, CCC concordance correlation coefficient, CI confidence interval, CPAF children’s physical activity form, d’ discriminability index, DA diagonal axis (z-axis), DO direct observation, DVO direct video observation, FFT fast Fourier transform, Ha Hager’s cut-points (2014) [42], HMM hidden Markov model, J48 decision tree (pruned), κ Kappa, kNN k-nearest neighbors, LoA limits of agreement, LPA light physical activity, LR logistic regression, MD mean difference, MLP multi-layer perceptron network, MLR multinomial logistic regression, MPA moderate physical activity, mSCA miniature semiconductor chip accelerometer, MVPA moderate-to-vigorous physical activity, N NHANES cut-points [88], NB naïve Bayes, n.r. not reported, O Oftedal’s cut-points (2014) [47], omni omnidirectional, OSRAC-P observational system for recording physical activity in children preschool, Pa Pate’s cut-points (2006) [52], PA physical activity, Pul Pulakka’s cut-points (2013) [43], Re Reilly’s cut-points (2003)[80], RF random forests, r_sp correlation coefficient (Spearman rank), SB sedentary behavior, SD standard deviation, Se sensitivity, Si Sirard’s age-specific cut-points (2005) [81], Si3 Sirard’s cut-points (2005) for 3-year-olds [81], Sp specificity, SVM support vector machine, T12 Trost’s cut-points (2012) [44], VA vertical axis (y-axis), vC van Cauwenberghe’s cut-points (2011) [45], VM vector magnitude, VPA vigorous physical activity, 5 5 s epoch, 15 15 s epoch, 30 30 s epoch

^aAge presented as mean ± SD (range)

^bMethodological study quality based on newly developed checklist: VG very good, A adequate, D doubtful, I inadequate

^cDevice-based method described using code combinations of four elements resulting in the following format: brand-axis-approach-epoch length

^dStudy result rating based on COSMIN guideline: + sufficient, ± inconsistent, - insufficient, ? intermediate

^eQuality of evidence based on GRADE approach

^* p < .05

^** p < .01

^*** p < .001

For the assessment of SB, LPA and MVPA no valid cut-point sets were found. Four studies evaluated the convergent validity of cut-points based methods for accelerometers using direct (video) observation as comparison measure [43, 44, 46, 47]. These studies suggested that cut-points can be used to distinguish SB [46, 47] or low intensity [43, 44] from high intensity PA. Cut-points to distinguish SB and LPA from MVPA were rated as sufficient, with MVPA for the vector magnitude (VM) ≥ 208 counts/15 s (M_age = 1.42 ± 0.05 years) [43], or for the VA ≥ 418 counts/15 s (M_age = 2.30 ± 0.40 years) [44]. In contrast, cut-points to distinguish SB from total PA were rated as sufficient, for the VM with SB < 6 counts/5 s (M_age = 2.99 ± 0.48 years) [46], or < 40 counts/5 s [47]. These results seemed promising as high agreement and low bias were found, but the results of these studies received low [44, 46] to very low [43, 47] quality of evidence due to small sample sizes (10 ≥ n ≤ 40), despite very good methodological quality of two studies [44, 46].

Using a multi-parameter method, SB (e.g., carrying) could be sufficiently distinguished from ambulation PA (e.g., running, crawling, and climbing) using time-domain and frequency acceleration signal features. Convergent validity of this random forests was rated as sufficient [49]. However, these results received low quality of evidence as the results were retrieved in a sample of only 21 toddlers, despite very good methodological study quality. In another study, compared to other multi-parameter methods, random forests provided the best classification of activity type (i.e., running/walking, crawling, climbing, standing, sitting, lying down, carried and stroller) [48]. To improve accuracy, the models were augmented by a hidden Markov model by providing the predictions of the models as observations. Despite small improvements, study results were rated as insufficient.

Preschoolers

Table 5 summarizes the results for validity, of which ten studies examined criterion validity [51–55, 66, 67, 75, 84, 88] and thirty studies examined convergent validity [51, 56–64, 68–74, 76–83, 85–87, 89, 90] in preschoolers.

Table 5.

Criterion and convergent validity of accelerometer-based methods for preschoolers, sorted by methodological study quality, quality of evidence (level of evidence), and study result rating

	Study	Study population^a	Outcome(s) & Setting	Comparison measure(s)	Method ological study quality^b	Placement	Device-based method^c	Results	Study result rating^d	Quality of evidence^e (level of evidence)
Criterion validity	Adolph et al. (2012) [51]	n = 64 Age = 4.5 ± 0.8 (3 to 5) years Sex = 42.2% girls	cpm Laboratory (structured activities)	Indirect calorimetry using breath-by-breath VO₂ and VCO₂ method to determine AEE	VG	Chest	Ah-cts-uni-60	r = .72***	+	Moderate
						Right hip	RT-cts-VM-60	r = .80**	+
							RT-cts-VA-60	r = .80**	+
							RT-cts-HA-60	r = .78**	+
							RT-cts-DA-60	r = .74**	+
	Zakeri et al. (2013) [55]	n = 69 Age = 4.6 ± 1.0 (3 to 5) years Sex = 50.7% girls	Minute-by-minute EE Laboratory (structured activities)	Indirect calorimetry using breath-by-breath VO₂ and VCO₂ to determine EE	VG	Right hip	AG-cts-VA-60	r = .76***	+	Moderate
							AG-cts-HA-60	r = .74***	+
							AG-cts-DA-60	r = .76***	+
							AG-CSTS-VA/HA/DA/VM-60	MAPE = 0.3 ± 6.9%, CCC = .93	+
							AG-MARS-VA/HA/DA/VM-60	MAPE = 0.3 ± 4.8%, CCC = .95	+
	Bélanger et al. (2013) [66]	n = 12 Age = 3.1 ± 1.0 (2 to 5) years Sex = 66.7% girls	Sleep quality: sleep latency, TST, WASO (number), and sleep efficiency Free-living (at home)	Polysomnography to score sleep stages as sleep and wake	VG	Non-dominant wrist (raw data)	Aw-AS-VA-30	Sleep latency: ICC = .96** TST: ICC = .94, overestimated by 13 min WASO: ICC = .28, underestimated by 20** Sleep efficiency: ICC = .76*, overestimated by 2 min Se = 98.7%, Sp = 58.7%, accuracy = 95.6%, NVP = 81.0%	+	Low
							Aw-ACT40-VA-30	Sleep latency: ICC = .92** TST: ICC = .94, underestimated by 25 min WASO: ICC = .42, overestimated by 37** Sleep efficiency: ICC = .73, underestimated by 4 min Se = 92.7%, Sp = 69.9%, accuracy = 90.7%, NVP = 47.4%	±
							Aw-ACT80-VA-30	Sleep latency: ICC = .92** TST: ICC = .95* WASO: ICC = .32, overestimated by 25** Sleep efficiency: ICC = .70** Se = 95.8%, Sp = 56.7%, accuracy = 92.4%, NVP = 55.0%	±
						Non-dominant wrist (adjusted data)	Aw-ACT40-VA-30	Sleep latency: ICC = .96** TST: ICC = .94, underestimated by 58 min WASO: ICC = .28, overestimated by 35** Sleep efficiency: ICC = .76, underestimated by 9.4 min Se = 87.9%, Sp = 81.0%, accuracy = 87.5%, NVP = 39.4%	+
							Aw-ACT80-VA-30	Sleep latency: ICC = .96** TST: ICC = .97, underestimated by 21.5 min WASO: ICC = .36, overestimated by 34** Sleep efficiency: ICC = .82, underestimated by 3.5 min Se = 93.4%, Sp = 70.9%, accuracy = 91.4%, NVP = 49.6%	+
							Aw-AS-VA-30	Sleep latency: ICC = .93** TST: ICC = .98** WASO: ICC = .36, underestimated by 20** Sleep efficiency: ICC = .90** Se = 97.7%, Sp = 61.2%, accuracy = 95.0%, NVP = 72.1%	±
						Left ankle	Aw-ACT40-VA-30	Sleep latency: ICC = .83** TST: ICC = .94, underestimated by > 25 min WASO: ICC = .01, overestimated by 36** Sleep efficiency: ICC = .80, underestimated by 6.5 min Se = 90.5%, Sp = 75.1%, accuracy = 89.3%, NVP = 41.7%	±
							Aw-ACT80-VA-30	Sleep latency: ICC = .85** TST: ICC = .95** WASO: ICC = .09, overestimated by 28** Sleep efficiency: ICC = .81** Se = 95.0%, Sp = 65.0%, accuracy = 92.1%, NVP = 53.7%	±
							Aw-AS-VA-30	Sleep latency: ICC = .81** TST: ICC = .91** WASO: ICC = .14, underestimated by 20** Sleep efficiency: ICC = .77** Se = 97.6%, Sp = 57.7%, accuracy = 94.6%, NVP = 76.8%	±
	Roscoe, James, & Duncan (2017) [67]	n = 21 Age = 4.7 ± 0.5 (4 to 5) years Sex = 38.1% girls	Activity intensity: SB (< 2 METs), LPA (2–2.99 METs), and MPA (3–5.99 METs) Laboratory (structured activities)	Indirect calorimetry using breath-by-breath VO₂ and VCO₂ to determine EE	VG	Non-dominant wrist	GA-Ro-VM-60	SB: AUC-ROC = .99, 95% CI (.98 to 1.0), Se = 90%, Sp = 90% LPA: AUC-ROC = .75, 95% CI (.65 to .85), Se = 40%, Sp = 20% MPA: AUC-ROC = .92, 95% CI (.86 to .98), Se = 86%, Sp = 40%	± (SB + ; PA -)	Low
	Roscoe, James, & Duncan (2017) [67]				VG	Dominant wrist	GA-Ro-VM-60	SB: AUC-ROC = .99, 95% CI (.97 to 1.0), Se = 100%, Sp = 10% LPA: AUC-ROC = .76, 95% CI (.60 to .86), Se = 10%, Sp = 85% MPA: AUC-ROC = .90, 95% CI (.83 to .96), Se = 76%, Sp = 40%	- (SB ± ; PA -)	Low
	Pfeiffer et al. (2006) [53]	n = 18 Age = 4.4 ± 0.7 (3 to 5) years Sex = 61.0% girls	cpm Laboratory (structured activities)	Indirect calorimetry using breath-by-breath VO₂	VG	Right hip	Ac-cts-omni-60	Validation r = .89, r_sp = .90, R² = .96, SE = 3.02, AIC = 435.9, ICC = .59, r_sp = .80***	+	Very low
	Pfeiffer et al. (2006) [53]		Activity intensity: MPA and VPA Free-living (in- and outdoor activities at preschool)	Indirect calorimetry using breath-by-breath VO₂	D	Right hip	Ac-Pf-omni-60	Cross-validation MPA: accuracy = 73%, κ = .40, modified κ = .46 VPA: accuracy = 85%, κ = .26, modified κ = .71	PA: ±	Very low
	Pate et al. (2006) [52]	n = 29 Age = 4.4 ± 0.8 (3.3 to 5.9) years Sex = 46.2% girls	cpm Laboratory (structured activities)	Indirect calorimetry using breath-by-breath VO₂ and VCO₂ to determine EE	VG	Right hip	AG-cts-VA-60	Validation r_p = .82, R² = .90, SE = 4.70, AIC = 735.8, ICC = .57, r_sp = .66***	±	Very low
	Pate et al. (2006) [52]		Activity intensity: MVPA and VPA Free-living (in- and outdoor activities at preschool)		D	Right hip	AG-Pa-VA-60	Cross-validation MVPA: accuracy = 69%, Se = 96.6%, Sp = 86.2% κ = .36, modified κ = .38 VPA: accuracy = 81%, Se = 65.5%, Sp = 95.4%, κ = .13, modified κ = .62	PA: -	Very low
	Sijtsma et al. (2013) [54]	n = 30 Age = 3.4 ± 0.3 (3.1 to 4.4) years Sex = 60.0% girls	Mean activity cpm (ACM) and mean total activity counts per day (ACD) Free-living (at home for 5 days)	Indirect calorimetry using DLW to determine TEE; breath-by-breath VO₂ and VCO₂ to determine SMR to calculate PAL and AEE	D	Middle lower back	T_D-cts-tri-60	PAL: ACM r_p = .61*, ACD r_p = 46 - ACM, gender & weight, R² = .50; ACD, height, gender & sleep duration, R² = .48 AEE: ACM r_p = .56, ACD r_p = .38 - ACM, R² = .31; ACD, gender & sleep duration, R² = .39* TEE: ACM r_p = .34, ACD r_p = .21 - Multivariate models with ACM & SMR or ACM, ACD & sleep duration not significant Using 5 or 3 days of data did not provide different results for PAL estimation, MD = -0.02 ± 0.07, LoA (-0.16 to 0.11)	-	Very low
	Ahmadi et al. (2020) [84]	n = 31 Age = 4.1 ± 1.0 (3 to 5) years Sex = 20% girls	EE Free-living (free play)	Indirect calorimetry using breath-by-breath VO₂ and VCO₂ to determine EE	A	Non-dominant wrist	AG-RF-VM-10	Free-living (FL) model: RMSE = 0.63 (SD = 0.47) kcal/min, RMSE = 0.96 (SD = 0.67) METs, MAPE = 27.4% (SD = 14.0) Retrained lab (RL) model: RMSE = 0.66 (SD = 0.47) kcal/min, RMSE = 1.01 (SD = 0.67) METs, MAPE = 28.3% (SD = 15.0) Existing lab, FL and RL models: mean bias not different from zero, predicted EE within ± 6% of measured EE	?	Very low
						Non-dominant wrist	AG-SVM-VM-10	Free-living (FL) model: RMSE = 0.64 (SD = 0.51) kcal/min, RMSE = 0.99 (SD = 0.73) METs, MAPE = 25.4% (SD = 12.2) Retrained lab (RL) model: RMSE = 0.65 (SD = 0.54) kcal/min, RMSE = 0.99 (SD = 0.77) METs, MAPE = 26.0% (SD = 12.6) Existing lab, FL and RL models: mean bias not different from zero, predicted EE within ± 6% of measured EE	?
						Right hip	AG-RF-VM-10	Free-living (FL) model: RMSE = 0.63 (SD = 0.42) kcal/min, RMSE = 0.96 (SD = 0.59) METs, MAPE = 28.1% (SD = 12.0) Retrained lab (RL) model: RMSE = 0.67 (SD = 0.41) kcal/min, RMSE = 1.02 (SD = 0.57) METs, MAPE = 28.3% (SD = 12.7) FL and RL models: EE overestimated during play sessions with low total EE and underestimated during sessions with high total EE Existing lab models: mean bias not different from zero, predicted EE within ± 6% of measured EE	?
						Right hip	AG-ANN-VM-10	Free-living (FL) model: RMSE = 0.63 (SD = 0.43) kcal/min, RMSE = 0.96 (SD = 0.61) METs, MAPE = 27.1% (SD = 11.1) Retrained lab (RL) model: RMSE = 0.65 (SD = 0.44) kcal/min, RMSE = 0.99 (SD = 0.62) METs, MAPE = 28.4% (SD = 11.8) Existing lab, FL and RL models: EE overestimated during play sessions with low total EE and underestimated during sessions with high total EE	?
	Steenbock et al. (2019) [88]	n = 41 Age = 4.8 ± 0.8 (3 to 6) years Sex = 46% girls	Absolute (kJ/min) and relative (kg/min/J) EE Free-living (in- and outdoor semi-structured activities at school)	Indirect calorimetry using breath-by-breath VO₂ and VCO₂ to determine EE	A	Right hip	AG-LM- HA/VA/DA-30	RMSE = 2.91 (SD = 0.95) kJ/min, RMSE = 124.91 (SD = 30.58) J/min/kg, RMSE = 1.70 (SD = 0.39) METs	?	Very low
							AG-MLM- HA/VA/DA-30	RMSE = 2.91 (SD = 0.99) kJ/min, RMSE = 125.40 (SD = 31.50) J/min/kg, RMSE = 1.70 (SD = 0.40) METs	?
							AG-RF- HA/VA/DA-30	RMSE = 2.74 (SD = 0.96) kJ/min, RMSE = 115.56 (SD = 27.35) J/min/kg, RMSE = 1.56 (SD = 0.36) METs	?
							AG-ANN- HA/VA/DA-30	RMSE = 2.86 (SD = 0.95) kJ/min, RMSE = 121.51 (SD = 29.57) J/min/kg, RMSE = 1.66 (SD = 0.39) METs	?
							GA-LM- HA/VA/DA-30	RMSE = 2.89 (SD = 0.95) kJ/min, RMSE = 123.79 (SD = 31.28) J/min/kg, RMSE = 1.68 (SD = 0.39) METs	?
							GA-MLM- HA/VA/DA-30	RMSE = 2.90 (SD = 0.98) kJ/min, RMSE = 124.70 (SD = 32.95) J/min/kg, RMSE = 1.69 (SD = 0.42) METs	?
							GA-RF- HA/VA/DA-30	RMSE = 2.73 (SD = 1.00) kJ/min, RMSE = 112.57 (SD = 28.83) J/min/kg, RMSE = 1.53 (SD = 0.38) METs	?
							GA-ANN- HA/VA/DA-30	RMSE = 2.83 (SD = 0.98) kJ/min, RMSE = 118.04 (SD = 29.80) J/min/kg, RMSE = 1.66 (SD = 0.43) METs	?
						Left hip	AG-LM- HA/VA/DA-30	RMSE = 2.84 (SD = 0.94) kJ/min, RMSE = 123.64 (SD = 31.82) J/min/kg, RMSE = 1.67 (SD = 0.41) METs	?
							AG-MLM- HA/VA/DA-30	RMSE = 2.81 (SD = 0.99) kJ/min, RMSE = 123.58 (SD = 31.94) J/min/kg, RMSE = 1.67 (SD = 0.41) METs	?
							AG-RF- HA/VA/DA-30	RMSE = 2.60 (SD = 0.97) kJ/min, RMSE = 112.32 (SD = 28.40) J/min/kg, RMSE = 1.52 (SD = 0.38) METs	?
							AG-ANN- HA/VA/DA-30	RMSE = 2.78 (SD = 1.01) kJ/min, RMSE = 116.89 (SD = 28.41) J/min/kg, RMSE = 1.63 (SD = 0.42) METs	?
						Right wrist	GA-LM- HA/VA/DA-30	RMSE = 2.85 (SD = 0.83) kJ/min, RMSE = 125.60 (SD = 28.52) J/min/kg, RMSE = 1.69 (SD = 0.38) METs	?
							GA-MLM- HA/VA/DA-30	RMSE = 2.83 (SD = 0.87) kJ/min, RMSE = 125.42 (SD = 28.74) J/min/kg, RMSE = 1.69 (SD = 0.38) METs	?
							GA-RF- HA/VA/DA-30	RMSE = 2.56 (SD = 0.83) kJ/min, RMSE = 109.34 (SD = 26.98) J/min/kg, RMSE = 1.48 (SD = 0.37) METs	?
							GA-ANN- HA/VA/DA-30	RMSE = 2.74 (SD = 0.88) kJ/min, RMSE = 115.91 (SD = 28.55) J/min/kg, RMSE = 1.62 (SD = 0.40) METs	?
						Non-dominant wrist	GA-LM- HA/VA/DA-30	RMSE = 2.83 (SD = 0.86) kJ/min, RMSE = 125.21 (SD = 27.38) J/min/kg, RMSE = 1.70 (SD = 0.37) METs	?
							GA-MLM- HA/VA/DA-30	RMSE = 2.83 (SD = 0.89) kJ/min, RMSE = 124.91 (SD = 27.58) J/min/kg, RMSE = 1.69 (SD = 0.38) METs	?
							GA-RF- HA/VA/DA-30	RMSE = 2.56 (SD = 0.83) kJ/min, RMSE = 108.64 (SD = 26.33) J/min/kg, RMSE = 1.47 (SD = 0.36) METs	?
							GA-ANN- HA/VA/DA-30	RMSE = 2.72 (SD = 0.91) kJ/min, RMSE = 114.01 (SD = 27.88) J/min/kg, RMSE = 1.61 (SD = 0.44) METs	?
						Right thigh	aP-LM- HA/VA/DA-30	RMSE = 2.94 (SD = 0.91) kJ/min, RMSE = 126.22 (SD = 29.46) J/min/kg, RMSE = 1.69 (SD = 0.38) METs	?
							aP-MLM- HA/VA/DA-30	RMSE = 2.92 (SD = 0.95) kJ/min, RMSE = 125.96 (SD = 29.86) J/min/kg, RMSE = 1.69 (SD = 0.39) METs	?
							aP-RF- HA/VA/DA-30	RMSE = 2.76 (SD = 0.94) kJ/min, RMSE = 115.61 (SD = 27.93) J/min/kg, RMSE = 1.56 (SD = 0.38) METs	?
							aP-ANN- HA/VA/DA-30	RMSE = 3.08 (SD = 1.00) kJ/min, RMSE = 125.98 (SD = 31.50) J/min/kg, RMSE = 1.81 (SD = 0.40) METs	?
	Butte et al. (2014) [75]	n = 50 Age = 4.5 ± 0.8 (3 to 5) years Sex = 50.0% girls	cpm Laboratory (structured activities)	Indirect calorimetry using breath-by-breath VO₂ and VCO₂ to determine minute-by-minute EE	I	Right hip	AG-CSTS-VA/HA/DA/VM-60	MAPE = -0.2 ± 6.7%, MAE = -0.005 ± 0.07 kcal/min, RMSE = 0.07 kcal/min, R² = .88	+	Very low
			cpm Laboratory (structured activities)		I		AG-MARS-VA/HA/DA/VM-60	MAPE = 1.1 ± 6.6%, MAE = 0.009 ± 0.07 kcal/min, RMSE = 0.07 kcal/min	?
		n = 105 Age = 4.6 ± 0.9 years Sex = 66.7% girls	Activity intensity: SB, LPA, MPA, and VPA Free-living	Indirect calorimetry using DLW to determine AEE	I		AG-B-VM-60	accuracy = 70% SB: Sp = 83% LPA: Sp = 64% MPA: Sp = 35% VPA: Sp = 38% MVPA collapsed: accuracy = 74%	- (SB + ; PA -)
							AG-B-HA-60	accuracy = 68% SB: Sp = 82% LPA: Sp = 58% MPA: Sp = 37% VPA: Sp = 27% MVPA collapsed: accuracy = 71%	- (SB + ; PA -)
						Chest	Ah-B-HA-60	accuracy = 70%; Sp: SB: Sp = 81% LPA: Sp = 64% MPA: Sp = 48% VPA: Sp = 39% MVPA collapsed: accuracy = 74%	- (SB + ; PA -)
Convergent validity	Sirard et al. (2005) [81]	n = 269 (n = 69 3-, n = 125 4-, n = 75 5-year-olds) Sex = 58% 3-, 52% 4-, 52% 5-year-old girls	Activity intensity: SB, LPA, MPA and VPA Free-living (at preschool)	DVO using CARS to score activity type and derive activity intensity (SB: stationary, motionless or with trunk/limb movements; LPA: slow/easy movement; MPA: moderate movement; VPA: fast movement) by a single rater (beginning and end of video by four additional raters)	VG	Right hip	AG-Si-VA-15	Total counts: r_p = .58* SB: r_p = .70, Se (94.4 to 100%), Sp (91.7 to 100%) LPA: r_p = .59 VPA: r_p = .61, Se (95.8 to 100%), Sp (80.0 to 83.3%) MPA: r_p = .50, Se (86.7 to 94.4%), Sp (66.7 to 100%) MVPA: r_p = .46*	+ (SB + ; PA +)	High (level 1)
	Peirera et al. (2020) [74]	n = 60 Age = 2.7 ± 0.4 (1.8 to 3.5) years Sex = 50% girls	% time spent in SB Free-living (at childcare)	aP-PRE-uni-n.r. at right thigh (SB: sitting/lying)	VG	Hip		All outside the ± 10% equivalent interval of (-4.05 to 4.05)		Moderate (level 2)
							AG-C-VA-5	bias = -5.11, 90% CI (-7.90 to -2.32)	?
							AG-Ev-VA-15	bias = 8.15, 90% CI (5.44 to 10.86)	?
							AG-T12-VA-15	bias = 4.46, 90% CI (1.65 to 7.28)	?
							AG-K-VA-15	bias = -7.95, 90% CI (-11.01 to -4.89)	?
							AG-Pa-VA-15	bias = 6.17, 90% CI (3.40 to 8.93)	?
							AG-Pa2-VA-15	bias = -9.26, 90% CI (-12.32 to -6.21)	?
							AG-Re-VA-60	bias = -40.52, 90% CI (-43.82 to -37.22)	?
							AG-Si-VA-15	bias = -15.07, 90% CI (-18.16 to -11.97)	?
							AG-vC-VA-15	bias = -18.71, 90% CI (-21.89 to -15.54)	?
	Dobell et al. (2019) [58]	n = 62 Age = 3.5 ± 0.5 (3 to 5) years Sex = 45.5% girls	Activity intensity: SB, LPA, and MVPA Free-living (free-play in nursery)	DO using OSRAC-P (modified) to score activity type and derive activity intensity (SB: stationary/ motionless; LPA: stationary torso with limb and slow/easy movement; MVPA: moderate and fast movement) by two independent raters	VG	Non-dominant wrist	AG-cts-VM-5/10/15/30	r_sp = .34, R² = .11* (5 s) to r_sp = .48, R² = .23* (30 s) SB: AUC-ROC (.71 to .78), Se (.64 to .68), Sp (.64 to .78) LPA: AUC-ROC (.55 to .56), Se (.74 to .80), Sp (.33 to .43) MVPA: AUC-ROC (.67 to .78), Se (.63 to .80), Sp (.80 to .82)	- (SB -; PA -)	Moderate (level 2)
						Right hip	AG-cts-VA-5/10/15/30	r_sp = .38, R² = .16* (5 s) to r_sp = .52, R² = .27* (10 s) SB: AUC-ROC (.69 to .77), Se (.61 to .72), Sp (.69 to .72) LPA: AUC-ROC (.53 to .54), Se (.59 to .76), Sp (.39 to .50) MVPA: AUC-ROC (.71 to .81), Se (.55 to .70), Sp (.77 to .79)	- (SB -; PA -)
						Right hip	AG-cts-VM-5/10/15/30	r_sp = .40, R² = .16* (5 s) to r_sp = .55, R² = .30* (30 s) SB: AUC-ROC (.73 to .79), Se (.69 to .72), Sp (.67 to .75) LPA: AUC-ROC = .55, Se (.78 to .82), Sp (.35 to .36) MVPA: AUC-ROC (.72 to .82), Se (.68 to .78), Sp (.62 to .79)	- (SB -; PA -)
	Ahmadi, Pavey, & Trost (2020) [85]	n = 31 Age = 4.0 ± 0.9 (3 to 5) years Sex = 29.0% girls	Activity intensity: SB, LPA, MVPA, run and walk Free-living (20 min active play session)	DVO using customized scheme to score activity type and derive activity intensity (SB: sitting/lying down, and stationary, motionless; LPA: standing, stationary, movement of the limbs/trunk, easy translocation, MPA: moderate and fast translocation; walk: translocation, medium speed; run: translocation (very) fast)	VG	Right hip	AG-RF-VM-1	Time and frequency features: F-score = 70.6%, + temporal features: F-score = 75.8% SB: F-score = 73.3%, + temporal: F-score = 80.0% LPA: F-score = 76.0%, + temporal: F-score = 80.4% MVPA: F-score = 61.0%, + temporal: F-score = 65.2% Walk: F-score = 55.7%, + temporal: F-score = 62.9% Run: F-score = 53.1%, + temporal: F-score = 67.2%	-	Low (level 1)
							AG-RF-VM-5	Time and frequency features: F-score = 79.5%, + temporal features: F-score = 82.2% SB: F-score = 80.6%, + temporal: F-score = 85.7% LPA: F-score = 82.7%, + temporal: F-score = 85.6% MVPA: F-score = 75.5%, + temporal: F-score = 76.3% Walk: F-score = 69.8%, + temporal: F-score = 68.8% Run: F-score = 71.2%, + temporal: F-score = 72.8%	- (incl. temp oral ±)
							AG-RF-VM-10	Time and frequency features: F-score = 83.1%, + temporal features: F-score = 85.3% SB: F-score = 82.3%, + temporal: F-score = 87.6% LPA: F-score = 85.7%, + temporal: F-score = 87.9% MVPA: F-score = 77.6%, + temporal: F-score = 76.4% Walk: F-score = 80.7%, + temporal: F-score = 81.0% Run: F-score = 74.4%, + temporal: F-score = 73.6%	±
							AG-RF-VM-15	Time and frequency features: F-score = 84.0%, + temporal features: F-score = 85.9% SB: F-score = 85.0%, + temporal: F-score = 87.7% LPA: F-score = 86.8%, + temporal: F-score = 88.2% MVPA: F-score = 75.3%, + temporal: F-score = 78.5% Walk: F-score = 78.4%, + temporal: F-score = 79.4% Run: F-score = 80.0%, + temporal: F-score = 82.6%	±
						Non-dominant wrist	AG-RF-VM-1	Time and frequency features: F-score = 62.6%, + temporal features: F-score = 68.8% SB: F-score = 63.3%, + temporal: F-score = 69.2% LPA: F-score = 71.4%, + temporal: F-score = 75.5% MVPA: F-score = 45.7%, + temporal: F-score = 57.5% Walk: F-score = 45.9%, + temporal: F-score = 54.8% Run: F-score = 55.1%, + temporal: F-score = 61.6%	-
							AG-RF-VM-5	Time and frequency features: F-score = 70.8%, + temporal features: F-score = 75.5% SB: F-score = 69.3%, + temporal: F-score = 78.% LPA: F-score = 76.5%, + temporal: F-score = 80.1% MVPA: F-score = 60.9%, + temporal: F-score = 66.9% Walk: F-score = 60.7%, + temporal: F-score = 60.3% Run: F-score = 68.5%, + temporal: F-score = 68.8%	-
							AG-RF-VM-10	Time and frequency features: F-score = 74.5%, + temporal features: F-score = 80.0% SB: F-score = 73.7%, + temporal: F-score = 82.4% LPA: F-score = 79.1%, + temporal: F-score = 82.9% MVPA: F-score = 62.0%, + temporal: F-score = 70.3% Walk: F-score = 68.8%, + temporal: F-score = 70.8% Run: F-score = 73.4%, + temporal: F-score = 71.5%	- (incl. temp oral ±)
							AG-RF-VM-15	Time and frequency features: F-score = 77.3%, + temporal features: F-score = 80.6% SB: F-score = 78.2%, + temporal: F-score = 83.3% LPA: F-score = 81.2%, + temporal: F-score = 83.7% MVPA: F-score = 62.1%, + temporal: F-score = 70.7% Walk: F-score = 70.5%, + temporal: F-score = 69.0% Run: F-score = 82.4%, + temporal: F-score = 82.4%	- (incl. temp oral ±)
						Right hip + Non-dominant wrist	AG-RF-VM-1	Time and frequency features: F-score = 72.5%, + temporal features: F-score = 76.8% SB: F-score = 75.8%, + temporal: F-score = 80.8% LPA: F-score = 77.7%, + temporal: F-score = 81.0% MVPA: F-score = 62.1%, + temporal: F-score = 66.7% Walk: F-score = 58.4%, + temporal: F-score = 64.9% Run: F-score = 64.5%, + temporal: F-score = 67.9%	-
							AG-RF-VM-5	Time and frequency features: F-score = 79.6%, + temporal features: F-score = 82.1% SB: F-score = 81.6%, + temporal: F-score = 86.0% LPA: F-score = 82.8%, + temporal: F-score = 85.6% MVPA: F-score = 74.3%, + temporal: F-score = 74.5% Walk: F-score = 70.1%, + temporal: F-score = 69.5% Run: F-score = 71.7%, + temporal: F-score = 72.5%	- (incl. temp oral ±)
							AG-RF-VM-10	Time and frequency features: F-score = 83.2%, + temporal features: F-score = 85.3% SB: F-score = 83.9%, + temporal: F-score = 87.5% LPA: F-score = 85.8%, + temporal: F-score = 87.9% MVPA: F-score = 75.6%, + temporal: F-score = 77.2% Walk: F-score = 79.5%, + temporal: F-score = 80.7% Run: F-score = 74.9%, + temporal: F-score = 73.0%	- (incl. temp oral ±)
							AG-RF-VM-15	Time and frequency features: F-score = 84.3%, + temporal features: F-score = 86.4% SB: F-score = 85.5%, + temporal: F-score = 88.7% LPA: F-score = 86.4%, + temporal: F-score = 88.4% MVPA: F-score = 77.1%, + temporal: F-score = 78.0% Walk: F-score = 78.1%, + temporal: F-score = 79.8% Run: F-score = 86.6%, + temporal: F-score = 86.8%	±
	Hislop et al. (2012) [70]	n = 31 Age = 4.4 ± 0.8 years Sex = 51.6% girls	cpm and % time spent in MVPA Free-living (unstructured outdoor free play during preschool)	DVO using CARS (modified) to score activity type and derive activity intensity (MVPA: moderate/fast translocation) by one rater	VG	Right hip	AG-cts-VA-15	r_sp = .39*	-	Low (level 1)
							RT-cts-VM-15	r_sp = .56**	-
							AG-Si-VA-15	MD = 0.8 min*	?
							RT-Swr-VM-15	MD = -12.2 min*	?
							RT-Slj-VM-15	MD = 0 min	?
	Hislop et al. (2012) [61]	n = 31 Age = 4.4 ± 0.8 years Sex = 51.6% girls	Activity intensity: SB, LPA, and MVPA Free-living (unstructured outdoor free play during preschool)	DVO using CARS to score activity type and derive activity intensity (SB: stationary, motionless or with movement of limbs/trunk; LPA: easy translocation; MVPA: moderate/fast translocation) by one rater	VG	Right hip	AG-Ev-VA-15	SB: MD = 13.2, LoA (-2.2 to 28.6)**	?	Low (level 1)
							AG-Si-VA-15	SB: MD = -7.2, LoA (-20.2 to 5.7)*; LPA: MD = 6.3, LoA* (-6.0 to 18.6)** MVPA: MD = 0.8, LoA (-6.2 to 7.8)	?
							AG-Puy-VA-15	SB: MD = 0.4, LoA (-13.25 to 13.99); LPA: MD = -2.2, LoA (-15.0 to 10.5) MVPA: MD = 1.7, LoA (-8.0 to 11.5)	?
							AG-vC-VA-15	SB: MD = -8.2, LoA (-22.7 to 6.2)*; LPA: MD = 11.4, LoA* (-1.4 to 24.2)** MVPA: MD = -3.3, LoA (-11.8 to 5.3)**	?
							AG-Pa-VA-15	MVPA: MD = -8.7, LoA (-19.9 to 2.5)**	?
							AG-Re-VA-15	SB: MD = -3.6, LoA (-17.6 to 10.4)**	?
	Hislop et al. (2016) [76]	n = 32 Age = 4.20 ± 0.5 (3 to 5) years Sex = 34.4% girls	Activity intensity: SB, LPA, MVPA and total PA Free-living (free-play in nursery)	DVO using CARS (modified) to score activity type and derive activity intensity (SB: stationary, motionless; LPA: stationary, movement of the trunk/limbs and easy translocation; MVPA: moderate and fast translocation)	VG	Non-dominant wrist	AG-J13-VM-5	Total PA: MD = 1.1, LoA (-9.9 to 12.1) Locomotion CARS vs. high intensity: MD = -2.0, LoA (-9.4 to 5.4) No clear agreement MVPA: MD = -9.3, LoA (-20.0 to 1.5); Locomotion vs. high wrist estimate: MD = 3.4, LoA (-3.1 to 10.0); Standing vs. low wrist estimate: MD = 2.8, LoA (-10.3 to 16.4)	?	Low (level 1)
	Hislop et al. (2016) [76]				VG	Waist	AG-Ev-VA-5	No clear agreement: Total PA: MD = 6.3, LoA (-8.8 to 21.4); MVPA: MD = -3.8, LoA (-10.7 to 3.1); Locomotion: MD = 3.4, LoA (-3.1 to 10.0); Standing: MD = 2.8, LoA (-10.3 to 16.4)	?	Low (level 1)
	Martin et al. (2011) [72]	n = 23 Age = 4.5 ± 0.7 years Sex = 60.9% girls	SB and total PA Free-living	aP-PRE-uni-n.r. at right thigh (SB: sitting/lying, and quiet standing)	VG	Right hip	AG-Re-VA-60	r_sp = .66**, MD = -2.1 ± 4.6%, LoA* (-11.4 to 7.2%)* Corrected (counts increased by 9%): r_sp = .68**, MD = -4.3 ± 4.8%, LoA* (-14.0 to 5.4%)***	+	Low (level 2)
	van Cauwenberghe et al. (2012) [64]	n = 49 Age = 4.0 ± 0.5 (3 to 4) years Sex = 55.1% girls	SB and total PA Free-living (at childcare)	aP-PRE-uni-n.r. at right thigh (SB: sitting/lying)	VG	Right hip	Ac-Ev-omni-15	accuracy = 73.0%, κ = .46, 95% CI (.45 to .47) MD = 1.3 min; 0.1%, LoA (-59.4 to 61.9); 39.2%	-	Low (level 2)
	Kahan, Nicaise, & Reuben (2013) [71]	n = 12 Age = 4.7 ± 0.3 years Sex = 75.0% girls	Activity intensity: SB and MVPA Free-living (unstructured outdoor free play)	DO using OSRAC-P to score activity type and derive activity intensity (SB: stationary, motionless or with limb/trunk movement, light activity; MVPA: moderate/ vigorous activity) scored by one rater	VG	Right hip	AG-Si-VA-5	accuracy = 81.2, 95% CI (79.2 to 83.1), κ = .48 SB: Se = 81.4%, Sp = 50.7%, bias = -8.1% MVPA: Se = 35.8%, Sp = 87.6%, bias = 4.7%	± (SB ± ; PA ±)	Low (level 2)
							AG-vC-VA-5	accuracy = 78.5, 95% CI (76.4 to 80.3), κ = .48 SB: Se = 80.0%, Sp = 52.4%, bias = -8.6% MVPA: Se = 53.9%, Sp = 79.1%, bias = -5.3%	- (SB ± ; PA -)
							AG-Ev-VA-5	accuracy = 79.9, 95% CI (77.5 to 82.1), κ = .58 SB: Se = 75.2%, Sp = 66.4%, bias = 36.3%*** MVPA: Se = 54.8%, Sp = 78.9%, bias = -5.9%	- (SB -; PA -)
							AG-Pa-VA-5	accuracy = 77.2, 95% CI (74.9 to 79.4), κ = .53 SB: Se = 50.0%, Sp = 58.7%, bias = 33.2%* MVPA: Se = 62.7%, Sp = 72.2%, bias = -15.8%*	- (SB -; PA -)
	Alhassan et al. (2017) [56]	n = 33 Age = 4.4 ± 0.8 (2.9 to 5) years Sex = 36.4% girls	Counts Free-living (at preschool)	Aw-cts-uni-15 at non-dominant wrist	VG	Waist	AG-cts-VA-15	r_sp = .41***	-	Low (level 2)
			Activity intensity: SB, LPA, MPA, and VPA Free-living (at preschool)	DO using OSRAC-P (modified) to score activity type and derive activity intensity (SB: stationary, motionless or with trunk/limb movement; LPA: easy translocation; MPA: moderate translocation; VPA: vigorous translocation) by one rater	A VG (for MVPA)		AG-cts-VA-15	r_sp = .47***	-	Low (MVPA) Very low (rest) (level 2)
							AG-Si-VA-15	accuracy = 50.3%, κ = .13, κ_w = .19 SB: accuracy = 62.7%, κ = .26, Se = 87.4%, Sp = 38.2% LPA: accuracy = 65.0%, κ = .01, Se = 18.5%, Sp = 82.2% MPA: accuracy = 86.3%, κ = .06, Se = 10.0%, Sp = 95.1% VPA: accuracy = 86.5%, κ = .06, Se = 5.3%, Sp = 98.5% MVPA: accuracy = 76.9%, κ = .15, Se = 16.2%, Sp = 95.2%	- (SB -; PA -)
							AG-Pa-VA-15	accuracy = 44.1%, κ = .21, κ_w = .32 SB: accuracy = 61.5%, κ = 0.40, Se = 60.1%, Sp = 79.7% LPA: accuracy = 60.7%, κ = .13, Se = 51.1%, Sp = 64.3% MPA: accuracy = 80.3%, κ = .09, Se = 24.0%, Sp = 86.8% VPA: accuracy = 86.1%, κ = .19, Se = 18.0%, Sp = 96.2% MVPA: accuracy = 75.3%, κ = .27, Se = 39.8%, Sp = 86.0%	- (SB -; PA -)
						Non-dominant wrist	Aw-cts-uni-15	r_sp = .47***	-
						Non-dominant wrist	Aw-Ek-uni-15	accuracy = 50.3%, κ = .13, κ_w = .19 SB: accuracy = 65.47%, κ = 0.31, Se = 41.5%, Sp = 89.1% LPA: accuracy = 53.8%, κ = .10, Se = 62.3%, Sp = 50.7% MPA: accuracy = 82.8%, κ = .11, Se = 22.0%, Sp = 89.7% VPA: accuracy = 86.3%, κ = .31, Se = 33.9%, Sp = 94.0% MVPA: accuracy = 77.5%, κ = .35, Se = 46.9%, Sp = 86.8%	- (SB -; PA -)
	Ahmadi et al. (2020) [83]	n = 31 Age = 4.0 ± 0.9 (3 to 5) years Sex = 29% girls	Activity intensity: SB, LPA, MVPA, run, and walk Free-living (play session)	DVO using customized scheme to score activity type and derive activity intensity (SB: sitting/lying down, and stationary, motionless; LPA: standing, stationary, movement of the limbs/trunk, easy translocation, MPA: moderate and fast translocation; walk: translocation, medium speed; run: translocation (very) fast)	A	Non-dominant wrist	AG-RF-VM-15	accuracy = 59.1%, 95% CI (57.1 to 61.1%), κ = .37 SB: accuracy = 76.8% LPA: accuracy = 57.5% MVPA: accuracy = 83.6% Walk: accuracy = 14.8% Run: accuracy = 68.7% Excl. multiple activities: walk accuracy (14.8 to 43.8%) and run accuracy (68.7 to 100%)	- (SB -; PA -)	Very low (level 1)
						Non-dominant wrist	AG-SVM-VM-15	accuracy = 59.3%, 95% CI (57.3 to 61.3%), κ = .37 SB: accuracy = 76.8% LPA: accuracy = 58.3% MVPA: accuracy = 82.6% Walk: accuracy = 12.3% Run: accuracy = 70.1% Excl. multiple activities: walk accuracy (12.3 to 45.8%) and run accuracy (70.1 to 100%)	- (SB -; PA -)
						Right hip	AG-RF-VM-15	accuracy = 69.4%, 95% CI (67.4 to 71.2%), κ = .48 SB: accuracy = 71.7% LPA: accuracy = 79.3% MVPA: accuracy = 71.3% Walk: accuracy = 10.4% Run: accuracy = 74.5% Excl. multiple activities: walk accuracy (10.4 to 29.2%) and run accuracy (74.5 to 100%)	- (SB -; PA -)
						Right hip	AG-SVM-VM-15	accuracy = 66.4%, 95% CI (64.4 to 68.3%), κ = .45 SB: accuracy = 77.7% LPA: accuracy = 71.1% MVPA: accuracy = 73.0% Walk: accuracy = 8.9% Run: accuracy = 65.8% Excl. multiple activities: walk accuracy (8.9 to 33.3%) and run accuracy (65.8 to 100%)	- (SB -; PA -)
	Trost et al. (2018) [89]	n = 11 Age = 4.8 ± 0.87 (3 to 6) years Sex = 55% girls	Activity intensity: SB, LPA, MVPA, run, and walk Laboratory (semi-standardized activities	DO to score activity type and determine activity intensity based on preschool-specific MET thresholds (SB < 1.5 METs, LPA 1.5–2.7, MPA 2.8–3.4 VPA ≥ 3.5)	A	Non-dominant wrist	AG-SVM-VM-15	accuracy = 80.4%, 95% CI (78.9 to 81.9%), κ_w = .73, CV κ_w = .78 SB: accuracy = 90.0% LPA: accuracy = 73.7% MVPA: accuracy = 78.6% Walk: accuracy = 70.6% Run: accuracy = 71.2%	+ (SB + ; PA -)	Very low (level 2)
							AG-RF-VM-15	accuracy = 78.1%, 95% CI (76.6 to 79.9%), κ_w = .70, CV κ_w = .76 SB: accuracy = 89.1% LPA: accuracy = 68.7% MVPA: accuracy = 79.1% Walk: accuracy = 61.1% Run: accuracy = 68.8%	± (SB + ; PA -)
							AG-T18-VA-15	κ_w = .64	-
							AG-T18-VM-15	κ_w = .49	-
						Right hip	AG-SVM-VM-15	accuracy = 81.3%, 95% CI (79.9 to 82.8%), κ_w = .74, CV κ_w = .82 SB: accuracy = 94.0% LPA: accuracy = 76.8% MVPA: accuracy = 76.0% Walk: accuracy = 63.5% Run: accuracy = 69.8%	+ (SB + ; PA -)
							AG-RF-VM-15	accuracy = 80.2%, 95% CI (79.9 to 81.6%), κ_w = .72, CV κ_w = .81 SB: accuracy = 93.2% LPA: accuracy = 72.1% MVPA: accuracy = 76.7% Walk: accuracy = 61.1% Run: accuracy = 73.0%	± (SB + ; PA -)
							AG-T18-VA-15	κ_w = .65	-
							AG-T18-VM-15	κ_w = .61	-
						Non-dominant wrist + hip	AG-SVM-VM-15	accuracy = 85.2%, 95% CI (83.8 to 86.5%), κ_w = .80, CV κ_w = .84 SB: accuracy = 94.7% LPA: accuracy = 80.2% MVPA: accuracy = 81.7% Walk: accuracy = 69.7% Run: accuracy = 82.3%	+ (SB + ; PA +)
AG-RF-VM-15						Non-dominant wrist + hip	accuracy = 81.8%, 95% CI (80.4 to 83.2%), κ_w = .75, CV κ_w = .81 SB: accuracy = 93.0% LPA: accuracy = 73.9% MVPA: accuracy = 80.7% Walk: accuracy = 63.0% Run: accuracy = 75.3%	+ (SB + ; PA +)
Finn & Specker (2000) [60]	n = 40 Age = (3 to 4) years Sex = 60% girls	Mean activity counts (3-min) Free-living	DVO using CARS (3-min average) to score activity type and derive activity intensity (SB: stationary, motionless or with movement of limbs/trunk; LPA: easy translocation; MVPA: moderate/fast translocation) by two independent raters	D	Waist	Aw-cts-uni-60	r = .03 to r = .92 (median = .74) More active children higher association between CARS and counts: within-child correlation coefficients vs. CARS r = .37* or counts r = .31	PA: -	Very low (level 1)
Brønd et al. (2020) [86]	n = 29 Age = 4.4 (3 to 6) years Sex = n.r. % girls	Activity type: Sit, stand, run, walk, and bike Free-living (structured activities at school)	DO to score activity type (sit: on floor/chair playing; stand: stand close to table playing; run: run together with instructor; walk: walks together with instructor; bike: on running bike) by one rater	D	Thigh	AX-DT-HA/VA/DA-2	Sit: Se = 100%, Sp = 100% Stand: Se = 100%, Sp = 99.8% Bike (sitting bike): Se = 64.8%, Sp = 100% Walk: Se = 82.6%, Sp = 98.1% Run: Se = 92.4%, Sp = 95.0%	+	Very low (level 2)
Zhao et al. (2013) [90]	n = 69 Age = (3 to 5) years Sex = 55% girls	Activity type: Sleep, rest, quiet play, low active play, moderately active play, and very active play Laboratory (structured activities)	DO to score activity type (sleep: lying and sleeping; rest: watching tv, quiet play: sitting while coloring, puzzle, video game; low active play: stand and play; moderately active play: standing while ball toss, active video game, dance; very active play: running in place)	D	Right hip	AG-SVM -VA/HA/DA/VM-60	accuracy = 75.3% Sleep: accuracy = 91.44% Rest: accuracy = 65.66% Quiet play: accuracy = 74.07% Low active play: accuracy = 68.49% Moderately active play: accuracy = 93.73% Very active play: accuracy = 98.73% Misclassification highest for sleep (31.6%) and quiet play (33.0%)	±	Very low (level 2)
Zhao et al. (2013) [90]	n = 69 Age = (3 to 5) years Sex = 55% girls			D		AG-MLR—VA/HA/DA/VM-60	accuracy (67.2 to 73.2%)	-	Very low (level 2)
Hagenbuchner et al. (2015) [87]	n = 11 Age = 4.8 ± 0.87 (3 to 6) years Sex = 55% girls	Activity intensity: SB, LPA, MVPA, run, and walk Laboratory (activity trials)	DO to score activity type and derive activity intensity (SB: watching tv, tablet/computer, reading, quiet play; LPA: clean up toys, standing art, treasure hunt; MVPA: bicycle riding, obstacle course, active game; walk: walk at self-selected pace; run: run at self-selected speed)	D	Right hip	AG-MLP-VM-60	accuracy = 69.7% SB: accuracy = 81.8% LPA: accuracy = 78.8% MVPA: accuracy = 63.6% Walk: accuracy = 36.4% Run: accuracy = 45.5%	- (SB + ; PA -)	Very low (level 2)
						AG-SOM-VM-60	accuracy = 53.8% SB: accuracy = 59.1% LPA: accuracy = 57.6% MVPA: accuracy = 66.7% Walk: accuracy = 36.4% Run: accuracy = 0.0%	- (SB -; PA -)
						AG-DLEN-VM-60	accuracy = 82.6% SB: accuracy = 84.1% LPA: accuracy = 90.9% MVPA: accuracy = 78.8% Walk: accuracy = 72.7% Run: accuracy = 72.7%	± (SB + ; PA -)
de Bock et al. (2010) [68]	n = 33 Age = 3.5 ± 0.8 (boys); 4.3 ± 1.1 (girls) years Sex = 36.36% girls	Activity intensity: SB and MVPA Free-living (at preschool)	DO using CARS to score activity type and derive activity intensity (SB: stationary, motionless or with trunk/limb movement; MVPA: easy, moderate, and fast translocation) by five independent raters	D	Chest	Ah-dB-VA-15	Calibration MVPA: AUC-ROC = .86 SB: AUC-ROC = .79 Cross-validation SB: accuracy 67% (boys), 69% (girls); Sp = 52% (boys), 61% (girls); Se = 78% (boys), 75% (girls) MVPA: accuracy = 84% (boys), 87% (girls)	- (SB -; PA +)	Very low (level 2)
Fairweather et al. (1999) [59]	n = 11 Age = 3.7 ± 0.5 years Sex = 72.7% girls	Mean cpm Free-living (exercise class)	DO using 1-min average CPAF score (ranging from 1 to 4) scored by one rater	D	Right hip	CSA-cts-uni-60	r = .87*, r_sp = .79	PA: +	Very low (level 2)
Fairweather et al. (1999) [59]	n = 10 Age = 4.0 ± 0.4 years Sex = 90% girls	Mean cpm Free-living (exercise class)	CSA at left hip	D	Right hip	CSA-cts-uni-60	r = .92*, MD = 31 (SD = 64, 95% CI* (1 to 61), r_sp = .97* Relative PA assessment largely unaffected by location	PA: +	Very low (level 2)
Klesges & Klesges (1987) [62]	n = 28 Age = 33.9 ± 8.0 months Sex = 43.3% girls	Total PA (all-day) and hourly PA Free-living (unstructured activities)	DO using FATS to score behavior and intensity	D	Left hip	CPAC-cts-uni-n.r	r = .54* (total PA); r = .62 to .95* (hourly PA) Behavior: lying r_sp = -.37; sitting r_sp = -.26; standing r_sp = .60; walking r_sp = .50; running r_sp = .37 Intensity: minimal r_sp = -.51; moderate r_sp = .51*; extreme r_sp = .40 Summary: correlations higher for older children (< 32.5 months r_sp = .39; > 32.5 months r_sp = .80**), girls (r_sp = ,67, boys r_sp = .43), and obese children (r_sp = .75, normal weight r_sp = .46)	PA: ±	Very low (level 2)
Klesges et al. (1985) [63]	n = 30 Age = 47.7 ± 12.5 months Sex = 40% girls	Total PA Free living (free play)	DO using FATS to score behavior and intensity	D	Left hip	CPAC-cts-uni-n.r	Behavior: r = .38, r_sp = .28: lying r = .17, r_sp = -.18; sitting r = -.31, r_sp = -.32; crawling r = -.22, r_sp = -.20; climbing r = -.06, r_sp = .02; standing r = .35, r_sp = .37; walking r = -.11, r_sp = -.03; running r = .32, r_sp = .22 Intensity: r = .32, r_sp = .29: minimal r = -.27, r_sp = -.23; moderate r = .05, r_sp = .07; extreme r = .44, r_sp = .43* Summary: r = .40*, r_sp = .36	PA: -	Very low (level 2)
Klesges et al. (1985) [63]	n = 30 Age = 47.7 ± 12.5 months Sex = 40% girls	Total PA Free living (free play)	DO using FATS to score behavior and intensity	D	Right hip	LSI-cts-uni-n.r	Behavior: r = .38, r_sp = .28: lying r = .17, r_sp = -.18; sitting r = -.31, r_sp = -.32; crawling r = -.22, r_sp = -.20; climbing r = -.06, r_sp = .02; standing r = .35, r_sp = .37; walking r = -.11, r_sp = -.03; running r = .32, r_sp = .22 Intensity: r = .32, r_sp = .29: minimal r = -.27, r_sp = -.23; moderate r = .05, r_sp = .07; extreme r = .44, r_sp = .43* Summary: r = .40*, r_sp = .36	PA: -	Very low (level 2)
Ettienne et al. (2016) [69]	n = 30 Age = 3.5 ± 0.6 years Sex = 43% girls	Activity intensity: SB, LPA, and MVPA Free-living (during preschool)	DO using (modified) SOFIT to score activity type and derive activity intensity (SB: lying; LPA: sitting, standing; MVPA: walking, very active)	D	Non-dominant wrist	Ac-Sc-omni-60	accuracy = 74%, κ_w = .17*** 391 min classified as MVPA, whereas DO class was LPA; 406 min DO classified as MVPA and accelerometer as LPA	-	Very low (level 2)
Djafarian et al. (2013) [57]	n = 42 Age = 4.1 ± 0.8 (3 to 5) years Sex = 47.6% girls	cpm 1–10-min periods Free-living (structured & unstructured)	DO using 1-min average CARS score	D	Non-dominant wrist	Aw-cts-uni-300	r = .52** (n = 21 cross-validation sample) Measured (DO) vs. predicted (log accelerometry) CARS score MD = 0.025 (± 0.38), CI 95% with 0.74 CARS unit	PA: -	Very low (level 2)
Davies et al. (2012) [82]	n = 30 Age = 4.1 ± 0.5 (3.1 to 4.9) years Sex = 60% girls	Activity type: Sit/lie, stand, and walk Free-living (usual nursery setting)	DVO using to score activity type (walk, sit, stand, lie, other (crouching down, kneeling up, crawling) by one rater	I	Right thigh	aP-PRE-uni-1	Time spent: sitting MD = -4.4%, standing MD = 7.1% Sit/lie: Se = 86.7%, Sp = 97.1%, PPV = 96.3% Stand: Se = 91.8%, Sp = 84.3%, PPV = 75.4% Walk: Se = 80.3%, Sp = 95.9%, PPV = 78.4%	+	Very low (level 1)
Johansson et al. (2015) [77]	Calibration n = 26 Age = 26 ± 6.0 months Sex = 38% girls Cross-validation n = 12 Age = 25 ± 5.6 (15 to 36) months Sex = 50% girls	Activity intensity: SB, LPA, and MVPA Free-living (structured activities and free play at preschool)	DVO using CARS to score activity type and derive activity intensity (SB: stationary, motionless; LPA: stationary, movement of the limbs/trunk, easy translocation; MVPA: moderate and fast translocation)	I	Non-dominant wrist	AG-J13-VM-5	Calibration SB: AUC-ROC = .98, CI (.95 to 1.0) MVPA: AUC-ROC = .90, CI (.81 to .98) Cross-validation SB: r_sp = .91 LPA: r_sp = .77* MVPA: r_sp = .69*	+ (SB + ; PA ±)	Very low (level 1)
Johansson et al. (2015) [77]				I	Non-dominant wrist	AG-J13-VA-5	Calibration SB: AUC-ROC = .98, CI (.95 to 1.0) MVPA: AUC-ROC = .88, CI (.78 to .97) Cross-validation SB: r_sp = .91 LPA: r_sp = .89* MVPA: r_sp = .77*	+ (SB + ; PA +)	Very low (level 1)
Johansson et al. (2016) [78]	n = 30 Age = 49.0 ± 3.7 months Sex = 53% girls	Activity intensity: SB, LPA, and MVPA Free-living (structured indoor activities and free play at preschool)	DO using CARS to score activity type and derive activity intensity (SB: stationary, motionless; LPA: stationary, movement of the limbs/trunk, easy translocation; MVPA: moderate and fast translocation) by one rater	I	Left hip	AG-J16-VM-5	κ_qw = .86, 95% CI (.85 to .87), accuracy = 75% SB: Se = 100%, Sp = 60%, AUC-ROC = .95 MVPA: Se = 70%, Sp = 93%, AUC-ROC = .91	+ (SB ± ; PA ±)	Very low (level 2)
					Left hip	AG-J16-VA-5	κ_qw = .76, 95% CI (.74 to .77), accuracy = 68% SB: Se = 100%, Sp = 60%, AUC-ROC = .93 MVPA: Se = 70%, Sp = 100%, AUC-ROC = .95	± (SB ± ; PA ±)
					Non-dominant wrist	AG-J16-VM-5	κ_qw = .95, 95% CI (.94 to .96), accuracy = 82% SB: Se = 100%, Sp = 60%, AUC-ROC = .99 MVPA: Se = 70%, Sp = 100%, AUC-ROC = .96	+ (SB ± ; PA ±)
					Non-dominant wrist	AG-J16-VA-5	κ_qw = .92, 95% CI (.91 to .93), accuracy = 82% SB: Se = 100%, Sp = 60%, AUC-ROC = .99 MVPA: Se = 70%, Sp = 100%, AUC-ROC = .95	+ (SB ± ; PA ±)
Reilly et al. (2003) [80]	n = 52 Age = 3.5 ± 0.5 (3 to 4) years Sex = 59.62% girls	Activity intensity: SB Free-living	DO using CPAF to score activity type and derive activity intensity (SB: stationary, motionless or with limb movement) by one rater	I	Right hip	CSA-Re-uni-60	Se = 83%, 95% CI (78 to 86%) Sp = 82%, 95% CI (89 to 86%)	SB: +	Very low (level 2)
Adolph et al. (2012) [51]	n = 64 Age = 4.5 ± 0.8 (3 to 5) years Sex = 42.2% girls	Activity intensity: SB, LPA, and MVPA Laboratory	DO using CARS to score activity type and derive CARS levels (1 to 5)	I	Chest	Ah-A-uni-60	accuracy = 72%, TPR = 82% SB: accuracy = 75% LPA: accuracy = 61% MVPA: accuracy = 82%	- (SB -; PA ±)	Very low (level 2)
					Right hip	Ac-A-omni-60	accuracy = 71%, TPR = 69% SB: accuracy = 77% LPA: accuracy = 63% MVPA: accuracy = 69%	- (SB -; PA -)
					Right hip	RT-A-VM-60	accuracy = 73%, TPR = 79% SB: accuracy = 76% LPA: accuracy = 65% MVPA: accuracy = 79%	- (SB -; PA ±)
Li et al. (2020) [79]	n = 34 Age = 4.0 ± 0.5 (3 to 5) years Sex = 58.8% girls	Activity intensity: SB, LPA, MPA, and VPA Free-living (during preschool)	AG-B-VM-60	I	Right hip	AG-Li_ROC-VM-60	κ = .30, accuracy = 59.5% SB: Se = 77.8%, Sp = 77.8%, FPR = 22.2%, FNR = 22.2%, κ = .54 LPA: Se = 26.8%, Sp = 87.5%, FPR = 12.5%, FNR = 73.2%, κ = .16 MPA: Se = 4.5%, Sp = 69.8%, FPR = 3.2%, FNR = 95.5%, κ = .02 VPA: = Se = 79.0%, Sp = 79.0%, FPR = 21.0%, FNR = 21.0%, κ = .11 MVPA collapsed: κ = .35, accuracy = 63.6%, Se = 78.9%, Sp = 78.9%, FPR = 20.1%, FNR = 21.1%, κ = .28	- (SB -; PA -)	Very low (level 2)
			AG-B-VM-60		Right hip	AG-Li_OLR-VM-60	κ = .37, accuracy = 71.5% SB: Se = 91.0%, Sp = 52.8%, FPR = 47.2%, FNR = 9.1%, κ = .47 LPA: Se = 43.3%, Sp = 86.5%, FPR = 56.7%, FNR = 13.5%, κ = .32 MPA: Se = 8.8%, Sp = 99.1%, FPR = 1.0%, FNR = 76.1%, κ = .12 VPA: Se = 23.9%, Sp = 99.0%, FPR = 1.0%, FNR = 76.1%, κ = .28 MVPA collapsed: κ = .39, accuracy = 72.4%, Se = 24.3%, Sp = 99.0%, FPR = 1.0%, FNR = 75.7%, κ = .33	- (SB -; PA -)
			AG-J13-VM-60		Non-dominant wrist	AG-Li_4k-VM-60	κ = .37, accuracy = 63.7% SB: Se = 71.6%, Sp = 85.1%, FPR = 14.9%, FNR = 28.4%, κ = .53 LPA: Se = 50.9%, Sp = 76.8%, FPR = 23.2%, FNR = 49.1%, κ = .27 MPA: Se = 48.4%, Sp = 86.8%, FPR = 13.2%, FNR = 51.6%, κ = .19 VPA: Se = 33.9%, Sp = 98.2%, FPR = 1.8%, FNR = 66.1%, κ = .31 MVPA collapsed: κ = .40, accuracy = 65.6%, Se = 68.4%, Sp = 86.6%, FPR = 13.4%, FNR = 31.6%, κ = .35	- (SB -; PA -)
Pagels, Boldemann, & Raustorp (2011) [73]	n = 55 Age = 4.5 ± 0.6 (3.4 to 5.7) years Sex = 49.1% girls	Activity intensity: LPA and MVPA Free-living (at preschool)	Pedometer (Yamax SW200) to measure daily steps	I	Waist	AG-Si-VA-15	r = .67*, R²_adj = 45%: (3-year-olds: r = .37, 4-year-olds: r = .52, 5-year-olds: r = .84*) LPA: r = .76, R²_adj = 52% MVPA: 3-year-olds (r = .19, R²_adj = 4%); 4-to-5-year-olds (r = .50**, R²_adj = 23%)	PA: -	Very low (level 3)

Open in a new tab

Abbreviations: A Adolph’s cut-points (2012) [51], Ac Actical, ACT40 Actiware software with wake threshold value of 40, ACT80 Actiware software with wake threshold value of 80, AEE activity energy expenditure, AIC Akaike information criterion, AG ActiGraph, Ah Actiheart, ANN artificial neural network, aP activPAL, AS AlgoSmooth [94], AUC-ROC area under the receiver operating curve, Aw Actiwatch, AX Axivity, B Butte’s cut-points (2014) [75], C Costa’s cut-points (2014) [46], CARS children’s activity rating system, CCC concordance correlation coefficient, CI confidence interval, CPAC Caltrac personal activity computer, CPAF children’s physical activity form, cpm counts per minute, cts counts, CSA computer science and applications activity monitor, CSTS cross-sectional time series, CV cross-validation, DA diagonal axis (z-axis), dB de Bock’s cut-points (2010) [68], DLEN deep learning ensemble network, DLW doubly labelled water, DO direct observation, DT decision table, DVO direct video observation, EE energy expenditure, Ek Ekblom’s cut-points (2012) [95], Ev Evenson’s cut-points (2008) [91], FATS Fargo activity time-sampling survey, FNR false negative rate, FPR false positive rate, GA GENEActiv, HA horizontal axis (x-axis), ICC intraclass correlation coefficient, J13 Johansson’s cut-points (2015) [77], J16 Johansson’s cut-points (2016) [78], κ Kappa, K Kelly’s cut-points (2016) [96], κ_w weighted Kappa, κ_qw quadratic weighted Kappa, Li_ROC Li’s cut-points (2020) [79], Li_ORL Li’s cut-points (2020) [79], Li_4k Li’s cut-points (2020) [79], LoA limits of agreement, LM linear model, LPA light physical activity, LSI LSI moving activity monitor, MAE mean absolute error, MAPE mean absolute percentage error, MARS multivariate adaptive regression splines, MD mean difference, METs metabolic equivalents, MLM mixed linear model, MLP multi-layer perceptron network, MLR multinomial logistic regression, MPA moderate physical activity, MVPA moderate-to-vigorous physical activity, n.r. not reported, NVP negative predictive value, omni omnidirectional, OSRAC-P observational system for recording physical activity in children preschool, Pa Pate’s cut-points (2006) [52], PA physical activity, Pa2 Pate’s cut-points (2006) [52], PAL physical activity level, Pf Pfeiffer’s cut-points (2006) [53], PPV positive predictive value, PRE activPAL professional research edition software, Puy Puyau’s cut-points (2002) [97], r correlation coefficient (unknown), Re Reilly’s cut-points (2003) [80], RF random forests, Ro Roscoe’s cut-points (2017) [67], r_p correlation coefficient (Pearson), r_sp correlation coefficient (Spearman rank), R² R-squared value, R.²_adj adjusted R-squared value, RMSE root mean square error, RT Research Tracker 3, SB sedentary behavior, Sc Schaefer’s cut-points (2014) [98], SD standard deviation, Se sensitivity, SE standard error, Si Sirard’s age-specific cut-points (2005) [81], Slj Sun’s cut point (2008) [99], SMR sleeping metabolic rate, SOFIT system for observing fitness instruction time, SOM self-organizing map, Sp specificity, SVM support vector machine, Swr Sun’s cut point (2008) [99], T_D Tracmor_D, TEE total energy expenditure, tri mean of the three axes, TPR true positive rate, TST total sleep time, T12 Trost’s cut-points (2012) [44], T18 Trost’s cut-points (2018) [89], uni uniaxial (axis not specified), VA vertical axis (y-axis), vC van Cauwenberghe’s cut-points (2011) [45], VPA vigorous physical activity, VCO₂ carbon dioxide, VM vector magnitude, VO₂ oxygen consumption, WASO wake after sleep onset, 1 1 s epoch, 2 2 s epoch, 5 5 s epoch, 10 10 s epoch, 15 15 s epoch, 30 30 s epoch, 60 60 s epoch, 300 5 min epochs

^aAge presented as mean ± SD (range)

^bMethodological study quality based on newly developed checklist: VG very good, A adequate, D doubtful, I inadequate

^cDevice-based method described using code combinations of four elements resulting in the following format: brand-axis-approach-epoch length

^dStudy result rating based on COSMIN guideline: + sufficient, ± inconsistent, - insufficient, ? intermediate

^eQuality of evidence based on GRADE approach

^* p < .05

^** p < .01

^*** p < .001

For the assessment of SB, LPA and MVPA, hip [81] and wrist [78] cut-points were evaluated as valid. Sirard and colleagues (2005) evaluated convergent validity of age specific cut-points [81]. Compared to observational scores using the Children’s Activity Rating Scale (CARS) agreement was high. Highest correlation between predicted and observed scores was found for SB, whereas lowest correlation between predicted and observed scores was found for MVPA. These results received high quality of evidence as the methodological study quality was very good and results were retrieved in a sample of 269 preschoolers. Similar results for convergent validity of Sirard’s SB and MVPA cut-points were found, using the Observational System for Recording physical Activity in Children-Preschool (OSRAC-P) as comparison measure [71]. Highest sensitivity was found for SB, while sensitivity was lowest for VPA. However, specificity was lowest for SB and highest for MVPA. Compared to other cut-points based methods, Sirard’s cut-points were most sensitive in detecting SB and converged best with direct observation [45, 52, 71, 91]. However, convergent validity of Sirard’s cut-points was rated as inconsistent with moderate quality of evidence, as these results were retrieved in a sample of 56 preschoolers. Another study evaluated convergent validity of the Sirard MVPA cut-point compared to the same direct observation scheme, i.e., CARS [61]. Bias was lowest when applying the Sirard MVPA cut-point versus other cut-points based methods in a sample of only 32 preschoolers [45, 52, 97]. When applied to accelerometer-derived data in toddlers, results of Sirard’s cut-points were rated as insufficient [45].

Johansson and colleagues (2016) evaluated convergent validity of wrist cut-points (both VA and VM) [78]. Agreement with direct observation (i.e., CARS) was high [78]. However, these results received very low quality of evidence, as the methodological study quality was inadequate, and results were retrieved in a limited sample of 30 preschoolers.

Some multi-parameter methods were also suitable for assessing SB, LPA and MVPA. Convergent validity of a support vector machine was evaluated as sufficient for distinguishing SB, LPA and MVPA, walking and running in laboratory setting [89] but not in free-living settings [83, 85]. Trost and colleagues (2018) developed random forest and support vector machine models to categorize SB, light activities and games (i.e., LPA), moderate-to-vigorous activities and games (i.e., MVPA), walking and running [89]. Almost perfect agreement with direct observation was found (hip and combination hip and wrist), while cut-points for this sample resulted in only moderate to substantial agreement. Using hip or wrist only data, convergent validity of support vector machines was rated as sufficient, while both random forests and support vector machines were rated as sufficient when hip and wrist data were combined. In free-living settings accuracy of these models decreased by 11 to 15% [85]. Hip classifiers had moderate agreement with direct video observation, while agreement of wrist classifiers was lower in free-living. In free-living settings, accuracy of random forests combining hip and wrist data was comparable. However, study results were rated as insufficient. In addition, support vector machine models have not been tested while combining hip and wrist data. Despite very good [85] and adequate [83, 89] methodological study quality, these results received low quality of evidence as the results were retrieved in limited samples of 11 [89] and 31 [83, 85] preschoolers. Convergent validity of a simple decision table was evaluated as sufficient for distinguishing comparable activity types (i.e., sit, stand, run, walk and bike) in a free-living setting [86]. These study results received very low quality of evidence as methodological study quality was doubtful and the results were retrieved in a sample of only 29 preschoolers.

Criterion validity of support vector machine models, random forests, and artificial neural networks was evaluated [84, 88]. Despite intermediate study results, random forests and artificial neural networks seemed to result in equal prediction accuracy of energy expenditure [88]. Results of the other study indicated that predicted energy expenditure of existing lab, free-living and retrained random forests (wrist and hip) and support vector machines (wrist) were within ± 6% of measured energy expenditure, but not for artificial neural networks (hip) [84]. Despite adequate methodological study quality, these study results received very low quality of evidence as they were retrieved in limited samples of 31 [84] and 41 [88] preschoolers. Criterion validity of cross-sectional time series models [55, 75] and multivariate adaptive regression splines [55] was evaluated as sufficient for predicting minute-by-minute energy expenditure. Results indicated a lack of bias and acceptable limits of agreement for these models, however, bias was slightly lower for multivariate adaptive regression splines. These results received moderate [55] and very low [75] quality of evidence.

For sleep assessment, different cut-points based methods (i.e., wake threshold values) were evaluated to distinguish sleep from wake. Moreover, the multi-parameter method AlgoSmooth was evaluated, which smooths data derived from the wake threshold value of 40 before scoring sleep or wake [100]. Criterion validity of wake threshold values of 40 and 80 applied to adjusted wrist data (i.e., regressed using lower activity counts of the ankle device) was rated as sufficient for wake time but poor for wake time after onset, but results were rated as inconsistent when applied to raw data [66]. In contrast, when the raw wrist data was smoothed using the AlgoSmooth algorithm, criterion validity was evaluated as sufficient [66]. Correlations between an Actiwatch and polysomnography were high, except for wake time after onset. Results were worse for ankle data. Despite very good methodological study quality, these results received low quality of evidence as the results were retrieved in a limited sample of 12 preschoolers [66].

Convergence between devices

Few studies examined the convergence between different accelerometers [56, 64, 72, 74]. No studies evaluated convergence between accelerometer-based methods in infants and toddlers. Convergence of both time spent in SB derived by ActiGraph and activPAL (hip vs. thigh) [72, 74], Actical and activPAL (wrist vs. thigh) [64], and ActiGraph and Actiwatch (waist vs. wrist) count data [56] was rated as insufficient as low to moderate agreement was found. Despite very good methodological study quality of all studies, these results received moderate [74] to low [56, 64, 72] quality of evidence due to sample sizes of 23 to 60 preschoolers.

Combining data

Combining data derived from multiple sensors [39, 85, 89] or multiple axes [34, 43, 47, 49, 77, 78] resulted in more valid predictions. Single placed sensors resulted in lower performance and insufficient data for posture and movement classification, compared to combining data from two or four different locations [39]. In line with these results, the combination of hip and wrist data in models for activity intensity prediction resulted in higher performance compared to using hip or wrist data only [85, 89]. Despite very good [85] and adequate [39, 89] methodological study quality, these results received (very) low quality of evidence due to limited sample sizes of 31 [85], 22 [39], and 11 [89] children.

Generally, using data from the VM to categorize activity intensity provided better results (higher agreement between accelerometry and comparison measure) than the VA (y-axis) [43, 47, 77, 78]. In line with these results, for activity type classification (i.e., running walking climbing up/down, crawling, riding on a ride on toy, standing, sitting, stroller, being carried) feature importance was highest for the VM standard deviation versus features of the different axes or other VM features [49]. In addition, combined cut-points for the VA (y-axis) and horizontal axis (x-axis) were used to classify the postures of infants as prone and non-prone [34]. Despite the inconsistent results, the study indicated that combining data from these axes is required to accurately assess horizontal movement behaviors such as tummy time (time spent prone on floor). The combination of the acceleration signal time-domain and frequency features also resulted in better activity type classification [49, 50]. When comparing feature sets, accuracy was higher when frequency-domain features were included in addition to time-domain features. Moreover, activity intensity classification improved when adding temporal features to this base set (i.e., time domain and frequency features) [85].

Discussion

This review summarizes studies that evaluated the measurement properties of accelerometer-based methods for assessing physical behavior in young children (< 5 years old). To assess the methodological quality of the 62 included studies, we developed a new checklist inspired by COSMIN [25–27]. Despite very good to adequate methodological study quality of 58% of the studies, only ten percent of the study results received high or moderate quality of evidence, due to limited samples sizes.

Validated cut-points for the youngest age group (i.e., infants) are still lacking, while multi-parameter methods were evaluated as sufficient to distinguish posture, SB and PA using multiple sensors [39], movement and sleep [40], leg movements [41], and sleep–wake [30, 36, 38]. Despite very good [30] or adequate [39, 40] methodological study quality of some of these studies, quality of evidence was rated as low [30] to very low [36, 38–41].

In toddlers, hip cut-points were considered valid for distinguishing SB and LPA [46, 47] from high intensity PA [43, 44], despite large differences between cut-points based methods. No studies were found for identifying sleep in toddlers using cut-points based or multi-parameter methods. For SB, VM cut-points varied between < 40 counts/5 s [47] and < 97 counts/5 s [46]. For MVPA, cut-points were VM ≥ 208 counts/15 s [43] and VA ≥ 418 counts/15 s [44]. The difference between the SB cut-points is probably due to inconsistency in the definition of SB adopted by the different observational schemes used. Specifically, in the study by Costa and colleagues, SB was defined as “stationary with no movement and stationary with movement of the limbs”, resulting in a higher cut-point [46] than Oftedal and colleagues who did not include limb movement [47]. Notably, the MVPA cut-points were derived in study samples with divergent characteristics. Trost and colleagues included children from an urban area that were around one year older than the children from a rural area included by Pulakka and colleagues [43, 44]. Conversely, cut-points could not identify toddler specific behaviors such as “being carried” as SB. A random forest model was considered valid to distinguish SB (including this toddler specific behavior) from PA [49]. Despite very good methodological study quality of some studies [44, 46, 49], quality of evidence for these cut-points based and multi-parameter methods was low.

For distinguishing different physical intensities (i.e., SB, LPA, and MVPA), we found strongest evidence in preschoolers. Next to age-specific hip cut-points [81], wrist cut-points were also evaluated as valid [78]. For assessing sleep, adjusted wrist [66] cut-points were evaluated as valid. Quality of evidence for hip cut-points was high [81], and these cut-points were also positively evaluated in cross-validation studies [61, 71], but not when applied in toddlers [45]. The wrist cut-points, however, have not been cross-validated and despite very good methodological study quality of the study assessing sleep cut-points [66], quality of evidence was low [66, 78]. Conversely, for identifying SB, LPA, and MVPA, using multi-parameter methods resulted in more promising results compared to cut-points based methods [83, 85, 89]. Although random forest and support vector machine models were rated as sufficient in laboratory setting [89], these were rated as insufficient in a free-living setting [83, 85]. Activity type could be distinguished using a decision table in a free-living setting [86]. In addition, sleep could be distinguished from wake using the AlgoSmooth algorithm applied to (raw) wrist data [66]. Despite very good [66, 85] and adequate [83, 86, 89] methodological study quality, quality of evidence was low [66, 85] or very low [83, 86, 89].

Despite the promising results of multi-parameter methods, only few models were accessible as open-source software [84–86, 88]. This hampers the replication of study results as closed source models cannot be reused. In case software is available, be aware to use the same version, configuration, and implementation (e.g., brand, axis, placement, parameters, target group). It is not recommended to reuse cut-points based methods when deviating from accelerometer specifications (i.e., brand, axis, and placement) or target population (age group), as cut-points need to be re-evaluated. Moreover, the inconvenience of cut-points based methods is related to the derivation of the magnitude of acceleration (count data), which is kept close source by most manufacturers.

Most studies used a single device or axis to measure physical behavior, while more promising results were found when combining data derived from different sensor placements [39, 85, 89] or multiple axes [34, 43, 47, 49, 50, 90]. Movement patterns of young children are sporadic, omnidirectional, and unique per developmental stage (e.g., lying on back or tummy, crawling or walking), and accelerometers capture only the movement of the body segment it is attached to [15, 20, 23]. This requires accelerometers that can capture movement in multiple planes, and placement of accelerometers on different sites (e.g., wrist and hip, legs, and arms).

There are a few reasons that contributed to the (very) low quality of evidence of the studies. Firstly, the quality of evidence was mostly downgraded due to limited sample sizes of < 100 children included in studies, i.e., imprecision [26]. If more studies would have used the same measurement and analyses protocol (i.e., accelerometer brand, accelerometer data analysis approach, axis, and epoch length), this sample size issue could have been resolved by pooling the results. Besides sample size, another important aspect for the generalizability is that the variation of the performed physical behaviors in the target population is captured. Secondly, the doubtful methodological study quality contributed to the downgrading of the quality of evidence, i.e., risk of bias [26]. Common methodological limitations varied per measurement property. Regarding convergent validity, both the unknown or insufficient measurement properties of the comparison instrument and the too long epoch lengths [101] resulted in low methodological study quality. For most studies that used direct observation as comparison measure, interrater agreement was insufficient, or a non-validated observation scheme was adopted. In some studies, physical behavior of toddlers was assessed using observation schemes developed and validated in preschoolers, thereby disregarding developmental specific physical behavior. Thus, physical activities and their intensities may have been misinterpreted. In addition, an epoch length < 60 s is preferred to measure the sporadic and intermittent nature of physical behavior in young children. However, some studies reintegrated the epochs up to three minutes without providing a valid reason, e.g., to align the epochs with the comparison instrument. For studies evaluating validity of cut-points based methods, low methodological study quality was mostly due to validation using data derived under the same circumstances as for the cut-point calibration, e.g., using the same sample instead of an independent sample. Studies that assessed validity of multi-parameter methods, mainly received low methodological study quality due to not reporting statistics suitable for comparing the performance of prediction models.

Next to the low quality of evidence, there were some general study limitations. For instance, in the reliability studies, differences between accelerometer recordings might be due to slightly different placement of the accelerometer, or actual different physical behaviors during the repeated measurements. Differences based on mechanical shaker experiments, on the other hand, are purely device related. Regarding validity studies using observation as comparison measure, it is difficult to estimate what observation time would be sufficient for validation of the targeted physical intensities and representative for physical behavior. This is also dependent on the observed activities and setting. Observation durations of included studies varied from 8 to 180 min. For instance, if observation took place in the childcare setting, this might not be representative for daily life physical behavior. Another general limitation is that activity types were not described in detail, resulting in different activity intensity ranges between studies.

Strengths and limitations

A strength of this review is that the methodological study quality was systematically assessed using the newly developed CAMQAM checklist. Another strength is that our search had no publication date limit. Although this resulted in including studies on devices that are no longer available on the market, this review includes all available studies examining measurement properties of accelerometer-based methods in 0—5-year-olds. A limitation of this review is that due to the great variability of accelerometer-based methods, it was not feasible to pool the study results, resulting in low quality of evidence ratings due to small sample sizes. Appropriate sample sizes are important for precision but also in order to capture adequate variation in physical activities. Another limitation is that we were unable to rate study results of measurement errors, as the minimal important difference or minimal important change is needed to conclude on the magnitude of measurement error. Since this information was not available for 0—5-year-olds, we only described these results. A limitation of the study result rating is that, the ratings were not weighted for the number of reported results. Lastly, our focus was limited to the evaluation and quality of measurement properties and did not include feasibility. Feasibility is context-specific because studies differ in available expertise and computational resources to perform the data analysis. Further, feasibility may change over time as software is subject to ongoing development and maintenance, or lack thereof, and accelerometers may change in price or dimensions as newer models enter the market.

Recommendations for future studies

High quality studies and standardized protocols are required to assess measurement properties (including feasibility) of these accelerometer-based methods and enable pooling of data. To improve methodological study quality of future studies, we recommend using our developed CAMQAM Checklist for Assessing the Methodological Quality of studies using Accelerometer-based Methods. Future studies should incorporate more precise definitions for physical activity types, adapted to the child’s developmental stage. For example, activity types such as crawling can be more precisely defined using inclination angles in video observation or derived from accelerometer-based methods. Additionally, for accelerometer-based methods to be generalizable to young children, ideally in a free-living setting, validation studies should include a variety of physical activity types representative for the target population. Moreover, we recommend that future studies transparently share methods by making these open-source available. Making methods accessible supports the sustained impact of research investments. Given the lack of reliable and/or valid of accelerometer-based methods and the lack of 24-h studies on physical behavior, especially in the youngest age groups (i.e., infants and toddlers), future studies should develop and evaluate methods targeted at these young age groups, including all 24-h physical behaviors, and exploring different sensor placements and axes using raw acceleration data of modern accelerometers.

Conclusions

Validated cut-points are lacking in infants and toddlers, while multi-parameter methods proved valid to distinguish SB and LPA from more vigorous activities. For preschoolers, both valid cut-points based and valid multi-parameter methods were identified, where multi-parameter methods appeared to have better measurement properties. Large heterogeneity and methodological limitations, impedes our ability to draw definitive conclusions about the best available accelerometer-based methods assessing all 24-h physical behaviors combined in young children.

Supplementary Information

12966_2022_1296_MOESM1_ESM.pdf^{(441.8KB, pdf)}

Additional file 1. Search strategy using MEDLINE for studies evaluating accelerometer-based methods.

12966_2022_1296_MOESM2_ESM.pdf^{(1,000.2KB, pdf)}

Additional file 2. CAMQAM: Checklist for Assessing the Methodological Quality of studies using Accelerometer-based Methods.

12966_2022_1296_MOESM3_ESM.pdf^{(999.3KB, pdf)}

Additional file 3. Description of the elements and the corresponding codes to describe accelerometer-based methods.

Acknowledgements

Not applicable.

Authors’ contributions

AL, MC, TA, and VH conceived the study. AL and TA screened all articles for eligibility. AL and JA extracted the data. AL, MC and TA rated the methodological study quality. AL completed data analysis. AL drafted the manuscript. MC, TA, and VH revised and edited significant sections of the manuscript. All authors reviewed and approved the manuscript.

Funding

This review is part of the “My Little Moves” project that is funded by ZonMw (546003008) and the Bernard van Leer foundation. The funding bodies had no role in the design of the study; in the collection, analysis, and interpretation of the data; or in the writing of the manuscript.

Availability of data and materials

The data that support the findings of this review are available from the corresponding author upon reasonable request.

Declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

Annelinde Lettink, Teatske M. Altenburg, Jelle Arts, Vincent T. van Hees, and Mai J. M. Chinapaw declare that they have no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Chaput JP, Carson V, Gray CE, Tremblay MS. Importance of all movement behaviors in a 24 hour period for overall health. Int J Environ Res Public Health. 2014;11(12):12575–81. doi: 10.3390/ijerph111212575. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Kuzik N, Poitras VJ, Tremblay MS, Lee EY, Hunter S, Carson V. Systematic review of the relationships between combinations of movement behaviours and health indicators in the early years (0–4 years) BMC Public Health. 2017;17(5):849. doi: 10.1186/s12889-017-4851-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Rollo S, Antsygina O, Tremblay MS. The whole day matters: understanding 24-hour movement guideline adherence and relationships with health indicators across the lifespan. J Sport Health Sci. 2020;9(6):493–510. doi: 10.1016/j.jshs.2020.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Tremblay MS, Carson V, Chaput J-P, Connor Gorber S, Dinh T, Duggan M, et al. Canadian 24-hour movement guidelines for children and youth: an integration of physical activity, sedentary behaviour, and sleep. Appl Physiol Nutr Metab. 2016;41(6):S311–S27. doi: 10.1139/apnm-2016-0151. [DOI] [PubMed] [Google Scholar]
5.Tremblay MS, Chaput JP, Adamo KB, Aubert S, Barnes JD, Choquette L, et al. Canadian 24-hour movement guidelines for the early years (0–4 years): an integration of physical activity, sedentary behaviour, and sleep. BMC Public Health. 2017;17(5):874. doi: 10.1186/s12889-017-4859-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Ainsworth BE, Haskell WL, Herrmann SD, Meckes N, Bassett DR, Tudor-Locke C, et al. 2011 Compendium of physical activities: a second update of codes and MET values. Med Sci Sports Exerc. 2011;43(8):1575–81. doi: 10.1249/MSS.0b013e31821ece12. [DOI] [PubMed] [Google Scholar]
7.Butte NF, Watson KB, Ridley K, Zakeri IF, McMurray RG, Pfeiffer KA, et al. A youth compendium of physical activities: activity codes and metabolic intensities. Med Sci Sports Exerc. 2018;50(2):246–56. doi: 10.1249/mss.0000000000001430. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Hills AP, Mokhtar N, Byrne NM. Assessment of physical activity and energy expenditure: an overview of objective measures. Front Nutr. 2014;1:5. doi: 10.3389/fnut.2014.00005. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Loprinzi PD, Cardinal BJ. Measuring children's physical activity and sedentary behaviors. J Exerc Sci Fit. 2011;9(1):15–23. doi: 10.1016/s1728-869x(11)60002-6. [DOI] [Google Scholar]
10.Park J, Kazuko IT, Kim E, Kim J, Yoon J. Estimating free-living human energy expenditure: practical aspects of the doubly labeled water method and its applications. Nutr Res Pract. 2014;8(3):241–8. doi: 10.4162/nrp.2014.8.3.241. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Ndahimana D, Kim EK. Measurement methods for physical activity and energy expenditure: a review. Clin Nutr Res. 2017;6(2):68–80. doi: 10.7762/cnr.2017.6.2.68. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Lynch BA, Kaufman TK, Rajjo TI, Mohammed K, Kumar S, Murad MH, et al. Accuracy of accelerometers for measuring physical activity and levels of sedentary behavior in children: a systematic review. J Prim Care Community Health. 2019;10:1–8. doi: 10.1177/2150132719874252. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.de Vries SI, Bakker I, Hopman-Rock M, Hirasing RA, van Mechelen W. Clinimetric review of motion sensors in children and adolescents. J Clin Epidemiol. 2006;59(7):670–80. doi: 10.1016/j.jclinepi.2005.11.020. [DOI] [PubMed] [Google Scholar]
14.Bruijns BA, Truelove S, Johnson AM, Gilliland J, Tucker P. Infants’ and toddlers’ physical activity and sedentary time as measured by accelerometry: a systematic review and meta-analysis. Int J Behav Nutrition Physical Act. 2020;17(1):14. doi: 10.1186/s12966-020-0912-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Cliff DP, Reilly JJ, Okely AD. Methodological considerations in using accelerometers to assess habitual physical activity in children aged 0–5 years. J Sci Med Sport. 2009;12(5):557–67. doi: 10.1016/j.jsams.2008.10.008. [DOI] [PubMed] [Google Scholar]
16.Sarker H, Anderson LN, Borkhoff CM, Abreo K, Tremblay MS, Lebovic G, et al. Validation of parent-reported physical activity and sedentary time by accelerometry in young children. BMC Research Notes. 2015;8(1):735. doi: 10.1186/s13104-015-1648-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Sasaki JE, John D, Freedson PS. Validation and comparison of ActiGraph activity monitors. J Sci Med Sport. 2011;14(5):411–6. doi: 10.1016/j.jsams.2011.04.003. [DOI] [PubMed] [Google Scholar]
18.Weiss AR, Johnson NL, Berger NA, Redline S. Validity of activity-based devices to estimate sleep. J Clin Sleep Med. 2010;6(4):336–342. doi: 10.5664/jcsm.27874. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Welk GJ, Schaben JA, Morrow JR. Reliability of accelerometry-based activity monitors: a generalizability study. Med Sci Sports Exerc. 2004;36(9):1637–1645. [PubMed] [Google Scholar]
20.Oliver M, Schofield GM, Kolt GS. Physical activity in preschoolers. Sports Med. 2007;37(12):1045–70. doi: 10.2165/00007256-200737120-00004. [DOI] [PubMed] [Google Scholar]
21.Galland BC, Taylor BJ, Elder DE, Herbison P. Normal sleep patterns in infant and children: a systematic review of observational studies. Sleep Med Rev. 2012;16(3):213–22. doi: 10.1016/j.smrv.2011.06.001. [DOI] [PubMed] [Google Scholar]
22.Adolph KE, Franchak JM. The development of motor behavior. Wiley Interdiscip Rev Cog Sci. 2017;8(1–2):e1430. doi: 10.1002/wcs.1430. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Gerber RJ, Wilks T, Erdie-Lalena C. Developmental milestones: motor development. Pediatr Rev. 2010;31(7):267–77. doi: 10.1542/pir.31-7-267. [DOI] [PubMed] [Google Scholar]
24.Page M, McKenzie J, Bossuyt P, Boutron I, Hoffmann T, Mulrow C, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. PLoS Med. 2021;18(3):e1003583. doi: 10.1371/journal.pmed.1003583. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Mokkink LB, De Vet HC, Prinsen CA, Patrick DL, Alonso J, Bouter LM, et al. COSMIN risk of bias checklist for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1171–9. doi: 10.1007/s11136-017-1765-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Prinsen CA, Mokkink LB, Bouter LM, Alonso J, Patrick DL, De Vet HC, et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1147–57. doi: 10.1007/s11136-018-1798-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Terwee CB, Prinsen CA, Chiarotto A, Westerman MJ, Patrick DL, Alonso J, et al. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual Life Res. 2018;27(5):1159–70. doi: 10.1007/s11136-018-1829-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Terwee CB, Mokkink LB, van Poppel MN, Chinapaw MJ, van Mechelen W, de Vet HC. Qualitative attributes and measurement properties of physical activity questionnaires. Sports Med. 2010;40(7):525–37. doi: 10.2165/11531370-000000000-00000. [DOI] [PubMed] [Google Scholar]
29.Greenspan B, Cunha AB, Lobo MA. Design and validation of a smart garment to measure positioning practices of parents with young infants. Infant Behav Dev. 2021;62:101530. doi: 10.1016/j.infbeh.2021.101530. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Galland BC, Kennedy GJ, Mitchell EA, Taylor BJ. Algorithms for using an activity-based accelerometer for identification of infant sleep-wake states during nap studies. Sleep Med. 2012;13(6):743–51. doi: 10.1016/j.sleep.2012.01.018. [DOI] [PubMed] [Google Scholar]
31.Insana SP, Gozal D, Montgomery-Downs HE. Invalidity of one actigraphy brand for identifying sleep and wake among infants. Sleep Med. 2010;11(2):191–6. doi: 10.1016/j.sleep.2009.08.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Rioualen S, Roué JM, Lefranc J, Gouillou M, Nowak E, Alavi Z, et al. Actigraphy is not a reliable method for measuring sleep patterns in neonates. Acta Paediatr. 2015;104(11):e478–82. doi: 10.1111/apa.13088. [DOI] [PubMed] [Google Scholar]
33.Camerota M, Tully KP, Grimes M, Gueron-Sela N, Propper CB. Assessment of infant sleep: how well do multiple methods compare? Sleep. 2018;41(10):zsy146. doi: 10.1093/sleep/zsy146. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Hewitt L, Stanley RM, Cliff D, Okely AD. Objective measurement of tummy time in infants (0–6 months): a validation study. PLoS One. 2019;14(2):e0210977. doi: 10.1371/journal.pone.0210977. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Lewicke AT, Sazonov ES, Schuckers SA. Sleep-wake identification in infants : heart rate variability compared to actigraphy. Conf Proc IEEE Eng Med Biol Soc. 2004;1:442–5. doi: 10.1109/IEMBS.2004.1403189. [DOI] [PubMed] [Google Scholar]
36.Gnidovec B, Neubauer D, Zidar J. Actigraphic assessment of sleep-wake rhythm during the first 6 months of life. Clin Neurophysiol. 2002;113(11):1815–21. doi: 10.1016/s1388-2457(02)00287-0. [DOI] [PubMed] [Google Scholar]
37.Horger MN, Marsiliani R, DeMasi A, Allia A, Berger SE. Researcher choices for infant sleep assessment: parent report, actigraphy, and a novel video system. J Genet Psychol. 2021;182(4):218–35. doi: 10.1080/00221325.2021.1905600. [DOI] [PubMed] [Google Scholar]
38.Sadeh A, Acebo C, Seifer R, Aytur S, Carskadon MA. Activity-based assessment of sleep-wake patterns during 1st year of life. Infant Behav Dev. 1995;18(3):329–37. doi: 10.1016/0163-6383(95)90021-7. [DOI] [Google Scholar]
39.Airaksinen M, Räsänen O, Ilén E, Häyrinen T, Kivi A, Marchi V, et al. Automatic posture and movement tracking of infants with wearable movement sensors. Sci Rep. 2020;10(1):169. doi: 10.1038/s41598-019-56862-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Jun K, Choi S. Unsupervised end-to-end deep model for newborn and infant activity recognition. Sensors (Basel) 2020;20(22):6467. doi: 10.3390/s20226467. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Smith BA, Trujillo-Priego IA, Lane CJ, Finley JM, Horak FB. Daily quantity of infant leg movement: wearable sensor algorithm and relationship to walking onset. Sensors (Basel) 2015;15(8):19006–20. doi: 10.3390/s150819006. [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Hager ER, Gormley CE, Latta LW, Treuth MS, Caulfield LE, Black MM. Toddler physical activity study: laboratory and community studies to evaluate accelerometer validity and correlates. BMC Public Health. 2016;16(6):936. doi: 10.1186/s12889-016-3569-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Pulakka A, Cheung YB, Ashorn U, Penpraze V, Maleta K, Phuka JC, et al. Feasibility and validity of the ActiGraph GT3X accelerometer in measuring physical activity of Malawian toddlers. Acta Paediatr. 2013;102(12):1192–8. doi: 10.1111/apa.12412. [DOI] [PubMed] [Google Scholar]
44.Trost SG, Fees BS, Haar SJ, Murray AD, Crowe LK. Identification and validity of accelerometer cut-points for toddlers. Obesity (Silver Spring) 2012;20(11):2317–9. doi: 10.1038/oby.2011.364. [DOI] [PubMed] [Google Scholar]
45.van Cauwenberghe E, Gubbels J, De Bourdeaudhuij I, Cardon G. Feasibility and validity of accelerometer measurements to assess physical activity in toddlers. Int J Behav Nutr Phys Act. 2011;8:67. doi: 10.1186/1479-5868-8-67. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Costa S, Barber SE, Cameron N, Clemes SA. Calibration and validation of the ActiGraph GT3X+ in 2–3 year olds. J Sci Med Sport. 2014;17(6):617–22. doi: 10.1016/j.jsams.2013.11.005. [DOI] [PubMed] [Google Scholar]
47.Oftedal S, Bell KL, Davies PS, Ware RS, Boyd RN. Validation of accelerometer cut points in todlers with and without cerebral palsy. Med Sci Sports Exerc. 2014;46(9):1808–15. doi: 10.1249/MSS.0000000000000299. [DOI] [PubMed] [Google Scholar]
48.Albert MV, Sugianto A, Nickele K, Zavos P, Sindu P, Ali M, et al. Hidden Markov model-based activity recognition for toddlers. Physiol Meas. 2020;41(2):025003. doi: 10.1088/1361-6579/ab6ebb. [DOI] [PubMed] [Google Scholar]
49.Kwon S, Zavos P, Nickele K, Sugianto A, Albert MV. Hip and wrist-worn accelerometer data analysis for toddler activities. Int J Environ Res Public Health. 2019;16(14). 10.3390/ijerph16142598 [DOI] [PMC free article] [PubMed]
50.Nam Y, Park JW. Child activity recognition based on cooperative fusion model of a triaxial accelerometer and a barometric pressure sensor. IEEE J Biomed Health Inform. 2013;17(2):420–6. doi: 10.1109/JBHI.2012.2235075. [DOI] [PubMed] [Google Scholar]
51.Adolph AL, Puyau MR, Vohra FA, Nicklas TA, Zakeri IF, Butte NF. Validation of uniaxial and triaxial accelerometers for the assessment of physical activity in preschool children. J Phys Act Health. 2012;9(7):944–53. doi: 10.1123/jpah.9.7.944. [DOI] [PubMed] [Google Scholar]
52.Pate RR, Almeida MJ, McIver KL, Pfeiffer KA, Dowda M. Validation and calibration of an accelerometer in preschool children. Obesity (Silver Spring) 2006;14(11):2000–6. doi: 10.1038/oby.2006.234. [DOI] [PubMed] [Google Scholar]
53.Pfeiffer KA, McIver KL, Dowda M, Almeida MJ, Pate RR. Validation and calibration of the Actical accelerometer in preschool children. Med Sci Sports Exerc. 2006;38(1):152–7. doi: 10.1249/01.mss.0000183219.44127.e7. [DOI] [PubMed] [Google Scholar]
54.Sijtsma A, Schierbeek H, Goris AH, Joosten KF, van Kessel I, Corpeleijn E, et al. Validation of the TracmorD triaxial accelerometer to assess physical activity in preschool children. Obesity (Silver Spring) 2013;21(9):1877–83. doi: 10.1002/oby.20401. [DOI] [PubMed] [Google Scholar]
55.Zakeri IF, Adolph AL, Puyau MR, Vohra FA, Butte NF. Cross-sectional time series and multivariate adaptive regression splines models using accelerometry and heart rate predict energy expenditure of preschoolers. J Nutr. 2013;143(1):114–22. doi: 10.3945/jn.112.168542. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Alhassan S, Sirard JR, Kurdziel LBF, Merrigan S, Greever C, Spencer RMC. Cross-validation of two accelerometers for assessment of physical pctivity and sedentary time in preschool children. Pediatr Exerc Sci. 2017;29(2):268–77. doi: 10.1123/pes.2016-0074. [DOI] [PubMed] [Google Scholar]
57.Djafarian K, Speakman JR, Stewart J, Jackson DM. Comparison of activity levels measured by a wrist worn accelerometer and direct observation in young children. Open J Pediatr. 2013;3(04):422–7. doi: 10.4236/ojped.2013.34076. [DOI] [Google Scholar]
58.Dobell AP, Eyre ELJ, Tallis J, Chinapaw MJM, Altenburg TM, Duncan MJ. Examining accelerometer validity for estimating physical activity in pre-schoolers during free-living activity. Scand J Med Sci Sports. 2019;29(10):1618–28. doi: 10.1111/sms.13496. [DOI] [PubMed] [Google Scholar]
59.Fairweather SC, Reilly J, Grant S, Whittaker A, Paton JY. Using the Computer Science and Applications (CSA) activity monitor in preschool children. Pediatr Exerc Sci. 1999;11(4):413–20. doi: 10.1123/pes.11.4.413. [DOI] [Google Scholar]
60.Finn KJ, Specker B. Comparison of Actiwatch activity and Children's Activity Rating scale in children. Med Sci Sports Exerc. 2000;32(10):1794–7. doi: 10.1097/00005768-200010000-00021. [DOI] [PubMed] [Google Scholar]
61.Hislop JF, Bulley C, Mercer TH, Reilly JJ. Comparison of accelerometry cut points for physical activity and sedentary behavior in preschool children: a validation study. Pediatr Exerc Sci. 2012;24(4):563–76. doi: 10.1123/pes.24.4.563. [DOI] [PubMed] [Google Scholar]
62.Klesges LM, Klesges RC. The assessment of children's physical activity: a comparison of methods. Med Sci Sports Exerc. 1987;19(5):511–517. doi: 10.1249/00005768-198710000-00015. [DOI] [PubMed] [Google Scholar]
63.Klesges RC, Klesges LM, Swenson AM, Pheley AM. A validation of two motion sensors in the prediction of child and adult physical activity levels. Am J Epidemiol. 1985;122(3):400–10. doi: 10.1093/oxfordjournals.aje.a114121. [DOI] [PubMed] [Google Scholar]
64.van Cauwenberghe E, Wooller L, Mackay L, Cardon G, Oliver M. Comparison of Actical and activPAL measures of sedentary behaviour in preschool children. J Sci Med Sport. 2012;15(6):526–31. doi: 10.1016/j.jsams.2012.03.014. [DOI] [PubMed] [Google Scholar]
65.Aadland E, Johannessen K. Agreement of objectively measured physical activity and sedentary time in preschool children. Prev Med Rep. 2015;2:635–9. doi: 10.1016/j.pmedr.2015.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Bélanger ME, Bernier A, Paquet J, Simard V, Carrier J. Validating actigraphy as a measure of sleep for preschool children. J Clin Sleep Med. 2013;9(7):701–6. doi: 10.5664/jcsm.2844. [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Roscoe CMP, James RS, Duncan MJ. Calibration of GENEActiv accelerometer wrist cut-points for the assessment of physical activity intensity of preschool aged children. Eur J Pediatr. 2017;176(8):1093–8. doi: 10.1007/s00431-017-2948-2. [DOI] [PubMed] [Google Scholar]
68.de Bock F, Menze J, Becker S, Litaker D, Fischer J, Seidel I. Combining accelerometry and HR for assessing preschoolers' physical activity. Med Sci Sports Exerc. 2010;42(12):2237–43. doi: 10.1249/MSS.0b013e3181e27b5d. [DOI] [PubMed] [Google Scholar]
69.Ettienne R, Nigg CR, Li F, Su Y, McGlone K, Luick B, et al. Validation of the Actical accelerometer in multiethnic preschoolers: The Children's Healthy Living (CHL) program. Hawai'i J Med Public Health. 2016;75(4):95–100. [PMC free article] [PubMed] [Google Scholar]
70.Hislop JF, Bulley C, Mercer TH, Reilly JJ. Comparison of epoch and uniaxial versus triaxial accelerometers in the measurement of physical activity in preschool children: a validation study. Pediatr Exerc Sci. 2012;24(3):450–60. doi: 10.1123/pes.24.3.450. [DOI] [PubMed] [Google Scholar]
71.Kahan D, Nicaise V, Reuben K. Convergent validity of four accelerometer cutpoints with direct observation of preschool children's outdoor physical activity. Res Q Exerc Sport. 2013;84(1):59–67. doi: 10.1080/02701367.2013.762294. [DOI] [PubMed] [Google Scholar]
72.Martin A, Mcneil M, Penpraze V, Dall P, Granat M, Paton JY, et al. Objective measurement of habitual sedentary behavior in pre-school children: comparison of activPAL with Actigraph monitors. Pediatr Exerc Sci. 2011;23(4):468–76. doi: 10.1123/pes.23.4.468. [DOI] [PubMed] [Google Scholar]
73.Pagels P, Boldemann C, Raustorp A. Comparison of pedometer and accelerometer measures of physical activity during preschool time on 3- to 5-year-old children. Acta Paediatr. 2011;100(1):116–20. doi: 10.1111/j.1651-2227.2010.01962.x. [DOI] [PubMed] [Google Scholar]
74.Pereira JR, Sousa-Sá E, Zhang Z, Cliff DP, Santos R. Concurrent validity of the ActiGraph GT3X+ and activPAL for assessing sedentary behaviour in 2–3-year-old children under free-living conditions. J Sci Med Sport. 2020;23(2):151–6. doi: 10.1016/j.jsams.2019.08.009. [DOI] [PubMed] [Google Scholar]
75.Butte NF, Wong WW, Lee JS, Adolph AL, Puyau MR, Zakeri IF. Prediction of energy expenditure and physical activity in preschoolers. Med Sci Sports Exerc. 2014;46(6):1216–26. doi: 10.1249/mss.0000000000000209. [DOI] [PMC free article] [PubMed] [Google Scholar]
76.Hislop J, Palmer N, Anand P, Aldin T. Validity of wrist worn accelerometers and comparability between hip and wrist placement sites in estimating physical activity behaviour in preschool children. Physiol Meas. 2016;37(10):1701–14. doi: 10.1088/0967-3334/37/10/1701. [DOI] [PubMed] [Google Scholar]
77.Johansson E, Ekelund U, Nero H, Marcus C, Hagströmer M. Calibration and cross-validation of a wrist-worn Actigraph in young preschoolers. Pediatr Obes. 2015;10(1):1–6. doi: 10.1111/j.2047-6310.2013.00213.x. [DOI] [PubMed] [Google Scholar]
78.Johansson E, Larisch LM, Marcus C, Hagströmer M. Calibration and validation of a wrist- and hip-worn Actigraph accelerometer in 4-year-old children. PLoS One. 2016;11(9):e0162436. doi: 10.1371/journal.pone.0162436. [DOI] [PMC free article] [PubMed] [Google Scholar]
79.Li S, Howard JT, Sosa ET, Cordova A, Parra-Medina D, Yin Z. Calibrating wrist-worn accelerometers for physical activity assessment in prechoolers : machine learning approaches. JMIR Form Res. 2020;4(8):e16727. doi: 10.2196/16727. [DOI] [PMC free article] [PubMed] [Google Scholar]
80.Reilly JJ, Coyle J, Kelly L, Burke G, Grant S, Paton JY. An objective method for measurement of sedentary behavior in 3- to 4-year olds. Obes Res. 2003;11(10):1155–8. doi: 10.1038/oby.2003.158. [DOI] [PubMed] [Google Scholar]
81.Sirard J, Trost S, Pfeiffer K, Dowda M, Pate R. Calibration and evaluation of an objective measure of physical activity in preschool children. J Phys Act Health. 2005;2(3):345–57. doi: 10.1123/jpah.2.3.345. [DOI] [Google Scholar]
82.Davies G, Reilly J, McGowan AJ, Dall P, Granat M, Paton JY. Validity, practical utility, and reliability of the Activpal in preschool children. Med Sci Sports Exerc. 2012;44(4):761–8. doi: 10.1249/MSS.0b013e31823b1dc7. [DOI] [PubMed] [Google Scholar]
83.Ahmadi MN, Brookes D, Chowdhury A, Pavey T, Trost SG. Free-living evaluation of laboratory-based activity classifiers in preschoolers. Med Sci Sports Exerc. 2020;52(5):1227–34. doi: 10.1249/mss.0000000000002221. [DOI] [PubMed] [Google Scholar]
84.Ahmadi MN, Chowdhury A, Pavey T, Trost SG. Laboratory-based and free-living algorithms for energy expenditure estimation in preschool children: a free-living evaluation. PLoS One. 2020;15(5):e0233229. doi: 10.1371/journal.pone.0233229. [DOI] [PMC free article] [PubMed] [Google Scholar]
85.Ahmadi MN, Pavey TG, Trost SG. Machine learning models for classifying physical activity in free-living preschool children. Sensors (Basel). 2020;20(16). 10.3390/s20164364 [DOI] [PMC free article] [PubMed]
86.Brønd JC, Grøntved A, Andersen LB, Arvidsson D, Olesen LG. Simple method for the objective activity type assessment with preschoolers, children and adolescents. Children (Basel). 2020;7(7). 10.3390/children7070072 [DOI] [PMC free article] [PubMed]
87.Hagenbuchner M, Cliff DP, Trost SG, Van Tuc N, Peoples GE. Prediction of activity type in preschool children using machine learning techniques. J Sci Med Sport. 2015;18(4):426–31. doi: 10.1016/j.jsams.2014.06.003. [DOI] [PubMed] [Google Scholar]
88.Steenbock B, Wright MN, Wirsik N, Brandes M. Accelerometry-based prediction of energy expenditure in preschoolers. J Meas Phys Behav. 2019;2(2):94–102. doi: 10.1123/jmpb.2018-0032. [DOI] [Google Scholar]
89.Trost SG, Cliff DP, Ahmadi MN, Tuc NV, Hagenbuchner M. Sensor-enabled activity class recognition in prechoolers: hip versus wrist Data. Med Sci Sports Exerc. 2018;50(3):634–41. doi: 10.1249/MSS.0000000000001460. [DOI] [PubMed] [Google Scholar]
90.Zhao W, Adolph AL, Puyau MR, Vohra FA, Butte NF, Zakeri IF. Support vector machines classifiers of physical activities in preschoolers. Physiol Rep. 2013;1(1):e00006. doi: 10.1002/phy2.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
91.Evenson KR, Catellier DJ, Gill K, Ondrak KS, McMurray RG. Calibration of two objective measures of physical activity for children. J Sports Sci. 2008;26(14):1557–65. doi: 10.1080/02640410802334196. [DOI] [PubMed] [Google Scholar]
92.Sadeh A, Lavie P, Scher A, Tirosh E, Epstein R. Actigraphic home-monitoring sleep-disturbed and control infants and young children: a new method for pediatric assessment of sleep-wake patterns. Pediatrics. 1991;87(4):494–499. [PubMed] [Google Scholar]
93.Redmond DP, Hegge FW. Observations on the design and specification of a wrist-worn human activity monitoring system. Behav Res Methods Instrum Comput. 1985;17(6):659–69. doi: 10.3758/BF03200979. [DOI] [Google Scholar]
94.Sitnick SL, Goodlin-Jones BL, Anders TF. The use of actigraphy to study sleep disorders in preschoolers: some concerns about detection of nighttime awakenings. Sleep. 2008;31(3):395–401. doi: 10.1093/sleep/31.3.395. [DOI] [PMC free article] [PubMed] [Google Scholar]
95.Ekblom O, Nyberg G, Bak EE, Ekelund U, Marcus C. Validity and comparability of a wrist-worn accelerometer in children. J Phys Act Health. 2012;9(3):389–93. doi: 10.1123/jpah.9.3.389. [DOI] [PubMed] [Google Scholar]
96.Kelly LA, Villalpando J, Carney B, Wendt S, Haas R, Ranieri BJ, et al. Development of Actigraph GT1M accelerometer cut-points for young children aged 12–36 months. J Athl Enhanc. 2016;5(4):1–4. doi: 10.4172/2324-9080.1000235. [DOI] [Google Scholar]
97.Puyau MR, Adolph AL, Vohra FA, Butte NF. Validation and calibration of physical activity monitors in children. Obes Res. 2002;10(3):150–7. doi: 10.1038/oby.2002.24. [DOI] [PubMed] [Google Scholar]
98.Schaefer CA, Nigg CR, Hill JO, Brink LA, Browning RC. Establishing and evaluating wrist cutpoints for the GENEActiv accelerometer in youth. Med Sci Sports Exerc. 2014;46(4):826–33. doi: 10.1249/MSS.0000000000000150. [DOI] [PMC free article] [PubMed] [Google Scholar]
99.Sun DX, Schmidt G, Teo-Koh SM. Validation of the RT3 accelerometer for measuring physical activity of children in simulated free-living conditions. Pediatr Exerc Sci. 2008;20(2):181–97. doi: 10.1123/pes.20.2.181. [DOI] [PubMed] [Google Scholar]
100.Sitnick SL, Goodlin-Jones BL, Anders TF. The use of actigraphy to study sleep disorders in preschoolers: some concerns about detection of nighttime awakenings. Sleep. 2008;31(3):395. doi: 10.1093/sleep/31.3.395. [DOI] [PMC free article] [PubMed] [Google Scholar]
101.Altenburg TM, Wang X, van Ekris E, Andersen LB, Møller NC, Wedderkopp N, et al. The consequences of using different epoch lengths on the classification of accelerometer based sedentary behaviour and physical activity. PLOS ONE. 2021;16(7):e0254721. doi: 10.1371/journal.pone.0254721. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12966_2022_1296_MOESM1_ESM.pdf^{(441.8KB, pdf)}

Additional file 1. Search strategy using MEDLINE for studies evaluating accelerometer-based methods.

12966_2022_1296_MOESM2_ESM.pdf^{(1,000.2KB, pdf)}

Additional file 2. CAMQAM: Checklist for Assessing the Methodological Quality of studies using Accelerometer-based Methods.

12966_2022_1296_MOESM3_ESM.pdf^{(999.3KB, pdf)}

Additional file 3. Description of the elements and the corresponding codes to describe accelerometer-based methods.

Data Availability Statement

The data that support the findings of this review are available from the corresponding author upon reasonable request.

[CR1] 1.Chaput JP, Carson V, Gray CE, Tremblay MS. Importance of all movement behaviors in a 24 hour period for overall health. Int J Environ Res Public Health. 2014;11(12):12575–81. doi: 10.3390/ijerph111212575. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Kuzik N, Poitras VJ, Tremblay MS, Lee EY, Hunter S, Carson V. Systematic review of the relationships between combinations of movement behaviours and health indicators in the early years (0–4 years) BMC Public Health. 2017;17(5):849. doi: 10.1186/s12889-017-4851-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Rollo S, Antsygina O, Tremblay MS. The whole day matters: understanding 24-hour movement guideline adherence and relationships with health indicators across the lifespan. J Sport Health Sci. 2020;9(6):493–510. doi: 10.1016/j.jshs.2020.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.Tremblay MS, Carson V, Chaput J-P, Connor Gorber S, Dinh T, Duggan M, et al. Canadian 24-hour movement guidelines for children and youth: an integration of physical activity, sedentary behaviour, and sleep. Appl Physiol Nutr Metab. 2016;41(6):S311–S27. doi: 10.1139/apnm-2016-0151. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Tremblay MS, Chaput JP, Adamo KB, Aubert S, Barnes JD, Choquette L, et al. Canadian 24-hour movement guidelines for the early years (0–4 years): an integration of physical activity, sedentary behaviour, and sleep. BMC Public Health. 2017;17(5):874. doi: 10.1186/s12889-017-4859-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Ainsworth BE, Haskell WL, Herrmann SD, Meckes N, Bassett DR, Tudor-Locke C, et al. 2011 Compendium of physical activities: a second update of codes and MET values. Med Sci Sports Exerc. 2011;43(8):1575–81. doi: 10.1249/MSS.0b013e31821ece12. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Butte NF, Watson KB, Ridley K, Zakeri IF, McMurray RG, Pfeiffer KA, et al. A youth compendium of physical activities: activity codes and metabolic intensities. Med Sci Sports Exerc. 2018;50(2):246–56. doi: 10.1249/mss.0000000000001430. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Hills AP, Mokhtar N, Byrne NM. Assessment of physical activity and energy expenditure: an overview of objective measures. Front Nutr. 2014;1:5. doi: 10.3389/fnut.2014.00005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Loprinzi PD, Cardinal BJ. Measuring children's physical activity and sedentary behaviors. J Exerc Sci Fit. 2011;9(1):15–23. doi: 10.1016/s1728-869x(11)60002-6. [DOI] [Google Scholar]

[CR10] 10.Park J, Kazuko IT, Kim E, Kim J, Yoon J. Estimating free-living human energy expenditure: practical aspects of the doubly labeled water method and its applications. Nutr Res Pract. 2014;8(3):241–8. doi: 10.4162/nrp.2014.8.3.241. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Ndahimana D, Kim EK. Measurement methods for physical activity and energy expenditure: a review. Clin Nutr Res. 2017;6(2):68–80. doi: 10.7762/cnr.2017.6.2.68. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Lynch BA, Kaufman TK, Rajjo TI, Mohammed K, Kumar S, Murad MH, et al. Accuracy of accelerometers for measuring physical activity and levels of sedentary behavior in children: a systematic review. J Prim Care Community Health. 2019;10:1–8. doi: 10.1177/2150132719874252. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.de Vries SI, Bakker I, Hopman-Rock M, Hirasing RA, van Mechelen W. Clinimetric review of motion sensors in children and adolescents. J Clin Epidemiol. 2006;59(7):670–80. doi: 10.1016/j.jclinepi.2005.11.020. [DOI] [PubMed] [Google Scholar]

[CR14] 14.Bruijns BA, Truelove S, Johnson AM, Gilliland J, Tucker P. Infants’ and toddlers’ physical activity and sedentary time as measured by accelerometry: a systematic review and meta-analysis. Int J Behav Nutrition Physical Act. 2020;17(1):14. doi: 10.1186/s12966-020-0912-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Cliff DP, Reilly JJ, Okely AD. Methodological considerations in using accelerometers to assess habitual physical activity in children aged 0–5 years. J Sci Med Sport. 2009;12(5):557–67. doi: 10.1016/j.jsams.2008.10.008. [DOI] [PubMed] [Google Scholar]

[CR16] 16.Sarker H, Anderson LN, Borkhoff CM, Abreo K, Tremblay MS, Lebovic G, et al. Validation of parent-reported physical activity and sedentary time by accelerometry in young children. BMC Research Notes. 2015;8(1):735. doi: 10.1186/s13104-015-1648-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Sasaki JE, John D, Freedson PS. Validation and comparison of ActiGraph activity monitors. J Sci Med Sport. 2011;14(5):411–6. doi: 10.1016/j.jsams.2011.04.003. [DOI] [PubMed] [Google Scholar]

[CR18] 18.Weiss AR, Johnson NL, Berger NA, Redline S. Validity of activity-based devices to estimate sleep. J Clin Sleep Med. 2010;6(4):336–342. doi: 10.5664/jcsm.27874. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Welk GJ, Schaben JA, Morrow JR. Reliability of accelerometry-based activity monitors: a generalizability study. Med Sci Sports Exerc. 2004;36(9):1637–1645. [PubMed] [Google Scholar]

[CR20] 20.Oliver M, Schofield GM, Kolt GS. Physical activity in preschoolers. Sports Med. 2007;37(12):1045–70. doi: 10.2165/00007256-200737120-00004. [DOI] [PubMed] [Google Scholar]

[CR21] 21.Galland BC, Taylor BJ, Elder DE, Herbison P. Normal sleep patterns in infant and children: a systematic review of observational studies. Sleep Med Rev. 2012;16(3):213–22. doi: 10.1016/j.smrv.2011.06.001. [DOI] [PubMed] [Google Scholar]

[CR22] 22.Adolph KE, Franchak JM. The development of motor behavior. Wiley Interdiscip Rev Cog Sci. 2017;8(1–2):e1430. doi: 10.1002/wcs.1430. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Gerber RJ, Wilks T, Erdie-Lalena C. Developmental milestones: motor development. Pediatr Rev. 2010;31(7):267–77. doi: 10.1542/pir.31-7-267. [DOI] [PubMed] [Google Scholar]

[CR24] 24.Page M, McKenzie J, Bossuyt P, Boutron I, Hoffmann T, Mulrow C, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. PLoS Med. 2021;18(3):e1003583. doi: 10.1371/journal.pmed.1003583. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Mokkink LB, De Vet HC, Prinsen CA, Patrick DL, Alonso J, Bouter LM, et al. COSMIN risk of bias checklist for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1171–9. doi: 10.1007/s11136-017-1765-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Prinsen CA, Mokkink LB, Bouter LM, Alonso J, Patrick DL, De Vet HC, et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1147–57. doi: 10.1007/s11136-018-1798-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Terwee CB, Prinsen CA, Chiarotto A, Westerman MJ, Patrick DL, Alonso J, et al. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual Life Res. 2018;27(5):1159–70. doi: 10.1007/s11136-018-1829-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Terwee CB, Mokkink LB, van Poppel MN, Chinapaw MJ, van Mechelen W, de Vet HC. Qualitative attributes and measurement properties of physical activity questionnaires. Sports Med. 2010;40(7):525–37. doi: 10.2165/11531370-000000000-00000. [DOI] [PubMed] [Google Scholar]

[CR29] 29.Greenspan B, Cunha AB, Lobo MA. Design and validation of a smart garment to measure positioning practices of parents with young infants. Infant Behav Dev. 2021;62:101530. doi: 10.1016/j.infbeh.2021.101530. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Galland BC, Kennedy GJ, Mitchell EA, Taylor BJ. Algorithms for using an activity-based accelerometer for identification of infant sleep-wake states during nap studies. Sleep Med. 2012;13(6):743–51. doi: 10.1016/j.sleep.2012.01.018. [DOI] [PubMed] [Google Scholar]

[CR31] 31.Insana SP, Gozal D, Montgomery-Downs HE. Invalidity of one actigraphy brand for identifying sleep and wake among infants. Sleep Med. 2010;11(2):191–6. doi: 10.1016/j.sleep.2009.08.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Rioualen S, Roué JM, Lefranc J, Gouillou M, Nowak E, Alavi Z, et al. Actigraphy is not a reliable method for measuring sleep patterns in neonates. Acta Paediatr. 2015;104(11):e478–82. doi: 10.1111/apa.13088. [DOI] [PubMed] [Google Scholar]

[CR33] 33.Camerota M, Tully KP, Grimes M, Gueron-Sela N, Propper CB. Assessment of infant sleep: how well do multiple methods compare? Sleep. 2018;41(10):zsy146. doi: 10.1093/sleep/zsy146. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.Hewitt L, Stanley RM, Cliff D, Okely AD. Objective measurement of tummy time in infants (0–6 months): a validation study. PLoS One. 2019;14(2):e0210977. doi: 10.1371/journal.pone.0210977. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Lewicke AT, Sazonov ES, Schuckers SA. Sleep-wake identification in infants : heart rate variability compared to actigraphy. Conf Proc IEEE Eng Med Biol Soc. 2004;1:442–5. doi: 10.1109/IEMBS.2004.1403189. [DOI] [PubMed] [Google Scholar]

[CR36] 36.Gnidovec B, Neubauer D, Zidar J. Actigraphic assessment of sleep-wake rhythm during the first 6 months of life. Clin Neurophysiol. 2002;113(11):1815–21. doi: 10.1016/s1388-2457(02)00287-0. [DOI] [PubMed] [Google Scholar]

[CR37] 37.Horger MN, Marsiliani R, DeMasi A, Allia A, Berger SE. Researcher choices for infant sleep assessment: parent report, actigraphy, and a novel video system. J Genet Psychol. 2021;182(4):218–35. doi: 10.1080/00221325.2021.1905600. [DOI] [PubMed] [Google Scholar]

[CR38] 38.Sadeh A, Acebo C, Seifer R, Aytur S, Carskadon MA. Activity-based assessment of sleep-wake patterns during 1st year of life. Infant Behav Dev. 1995;18(3):329–37. doi: 10.1016/0163-6383(95)90021-7. [DOI] [Google Scholar]

[CR39] 39.Airaksinen M, Räsänen O, Ilén E, Häyrinen T, Kivi A, Marchi V, et al. Automatic posture and movement tracking of infants with wearable movement sensors. Sci Rep. 2020;10(1):169. doi: 10.1038/s41598-019-56862-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Jun K, Choi S. Unsupervised end-to-end deep model for newborn and infant activity recognition. Sensors (Basel) 2020;20(22):6467. doi: 10.3390/s20226467. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Smith BA, Trujillo-Priego IA, Lane CJ, Finley JM, Horak FB. Daily quantity of infant leg movement: wearable sensor algorithm and relationship to walking onset. Sensors (Basel) 2015;15(8):19006–20. doi: 10.3390/s150819006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR42] 42.Hager ER, Gormley CE, Latta LW, Treuth MS, Caulfield LE, Black MM. Toddler physical activity study: laboratory and community studies to evaluate accelerometer validity and correlates. BMC Public Health. 2016;16(6):936. doi: 10.1186/s12889-016-3569-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Pulakka A, Cheung YB, Ashorn U, Penpraze V, Maleta K, Phuka JC, et al. Feasibility and validity of the ActiGraph GT3X accelerometer in measuring physical activity of Malawian toddlers. Acta Paediatr. 2013;102(12):1192–8. doi: 10.1111/apa.12412. [DOI] [PubMed] [Google Scholar]

[CR44] 44.Trost SG, Fees BS, Haar SJ, Murray AD, Crowe LK. Identification and validity of accelerometer cut-points for toddlers. Obesity (Silver Spring) 2012;20(11):2317–9. doi: 10.1038/oby.2011.364. [DOI] [PubMed] [Google Scholar]

[CR45] 45.van Cauwenberghe E, Gubbels J, De Bourdeaudhuij I, Cardon G. Feasibility and validity of accelerometer measurements to assess physical activity in toddlers. Int J Behav Nutr Phys Act. 2011;8:67. doi: 10.1186/1479-5868-8-67. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR46] 46.Costa S, Barber SE, Cameron N, Clemes SA. Calibration and validation of the ActiGraph GT3X+ in 2–3 year olds. J Sci Med Sport. 2014;17(6):617–22. doi: 10.1016/j.jsams.2013.11.005. [DOI] [PubMed] [Google Scholar]

[CR47] 47.Oftedal S, Bell KL, Davies PS, Ware RS, Boyd RN. Validation of accelerometer cut points in todlers with and without cerebral palsy. Med Sci Sports Exerc. 2014;46(9):1808–15. doi: 10.1249/MSS.0000000000000299. [DOI] [PubMed] [Google Scholar]

[CR48] 48.Albert MV, Sugianto A, Nickele K, Zavos P, Sindu P, Ali M, et al. Hidden Markov model-based activity recognition for toddlers. Physiol Meas. 2020;41(2):025003. doi: 10.1088/1361-6579/ab6ebb. [DOI] [PubMed] [Google Scholar]

[CR49] 49.Kwon S, Zavos P, Nickele K, Sugianto A, Albert MV. Hip and wrist-worn accelerometer data analysis for toddler activities. Int J Environ Res Public Health. 2019;16(14). 10.3390/ijerph16142598 [DOI] [PMC free article] [PubMed]

[CR50] 50.Nam Y, Park JW. Child activity recognition based on cooperative fusion model of a triaxial accelerometer and a barometric pressure sensor. IEEE J Biomed Health Inform. 2013;17(2):420–6. doi: 10.1109/JBHI.2012.2235075. [DOI] [PubMed] [Google Scholar]

[CR51] 51.Adolph AL, Puyau MR, Vohra FA, Nicklas TA, Zakeri IF, Butte NF. Validation of uniaxial and triaxial accelerometers for the assessment of physical activity in preschool children. J Phys Act Health. 2012;9(7):944–53. doi: 10.1123/jpah.9.7.944. [DOI] [PubMed] [Google Scholar]

[CR52] 52.Pate RR, Almeida MJ, McIver KL, Pfeiffer KA, Dowda M. Validation and calibration of an accelerometer in preschool children. Obesity (Silver Spring) 2006;14(11):2000–6. doi: 10.1038/oby.2006.234. [DOI] [PubMed] [Google Scholar]

[CR53] 53.Pfeiffer KA, McIver KL, Dowda M, Almeida MJ, Pate RR. Validation and calibration of the Actical accelerometer in preschool children. Med Sci Sports Exerc. 2006;38(1):152–7. doi: 10.1249/01.mss.0000183219.44127.e7. [DOI] [PubMed] [Google Scholar]

[CR54] 54.Sijtsma A, Schierbeek H, Goris AH, Joosten KF, van Kessel I, Corpeleijn E, et al. Validation of the TracmorD triaxial accelerometer to assess physical activity in preschool children. Obesity (Silver Spring) 2013;21(9):1877–83. doi: 10.1002/oby.20401. [DOI] [PubMed] [Google Scholar]

[CR55] 55.Zakeri IF, Adolph AL, Puyau MR, Vohra FA, Butte NF. Cross-sectional time series and multivariate adaptive regression splines models using accelerometry and heart rate predict energy expenditure of preschoolers. J Nutr. 2013;143(1):114–22. doi: 10.3945/jn.112.168542. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR56] 56.Alhassan S, Sirard JR, Kurdziel LBF, Merrigan S, Greever C, Spencer RMC. Cross-validation of two accelerometers for assessment of physical pctivity and sedentary time in preschool children. Pediatr Exerc Sci. 2017;29(2):268–77. doi: 10.1123/pes.2016-0074. [DOI] [PubMed] [Google Scholar]

[CR57] 57.Djafarian K, Speakman JR, Stewart J, Jackson DM. Comparison of activity levels measured by a wrist worn accelerometer and direct observation in young children. Open J Pediatr. 2013;3(04):422–7. doi: 10.4236/ojped.2013.34076. [DOI] [Google Scholar]

[CR58] 58.Dobell AP, Eyre ELJ, Tallis J, Chinapaw MJM, Altenburg TM, Duncan MJ. Examining accelerometer validity for estimating physical activity in pre-schoolers during free-living activity. Scand J Med Sci Sports. 2019;29(10):1618–28. doi: 10.1111/sms.13496. [DOI] [PubMed] [Google Scholar]

[CR59] 59.Fairweather SC, Reilly J, Grant S, Whittaker A, Paton JY. Using the Computer Science and Applications (CSA) activity monitor in preschool children. Pediatr Exerc Sci. 1999;11(4):413–20. doi: 10.1123/pes.11.4.413. [DOI] [Google Scholar]

[CR60] 60.Finn KJ, Specker B. Comparison of Actiwatch activity and Children's Activity Rating scale in children. Med Sci Sports Exerc. 2000;32(10):1794–7. doi: 10.1097/00005768-200010000-00021. [DOI] [PubMed] [Google Scholar]

[CR61] 61.Hislop JF, Bulley C, Mercer TH, Reilly JJ. Comparison of accelerometry cut points for physical activity and sedentary behavior in preschool children: a validation study. Pediatr Exerc Sci. 2012;24(4):563–76. doi: 10.1123/pes.24.4.563. [DOI] [PubMed] [Google Scholar]

[CR62] 62.Klesges LM, Klesges RC. The assessment of children's physical activity: a comparison of methods. Med Sci Sports Exerc. 1987;19(5):511–517. doi: 10.1249/00005768-198710000-00015. [DOI] [PubMed] [Google Scholar]

[CR63] 63.Klesges RC, Klesges LM, Swenson AM, Pheley AM. A validation of two motion sensors in the prediction of child and adult physical activity levels. Am J Epidemiol. 1985;122(3):400–10. doi: 10.1093/oxfordjournals.aje.a114121. [DOI] [PubMed] [Google Scholar]

[CR64] 64.van Cauwenberghe E, Wooller L, Mackay L, Cardon G, Oliver M. Comparison of Actical and activPAL measures of sedentary behaviour in preschool children. J Sci Med Sport. 2012;15(6):526–31. doi: 10.1016/j.jsams.2012.03.014. [DOI] [PubMed] [Google Scholar]

[CR65] 65.Aadland E, Johannessen K. Agreement of objectively measured physical activity and sedentary time in preschool children. Prev Med Rep. 2015;2:635–9. doi: 10.1016/j.pmedr.2015.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR66] 66.Bélanger ME, Bernier A, Paquet J, Simard V, Carrier J. Validating actigraphy as a measure of sleep for preschool children. J Clin Sleep Med. 2013;9(7):701–6. doi: 10.5664/jcsm.2844. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR67] 67.Roscoe CMP, James RS, Duncan MJ. Calibration of GENEActiv accelerometer wrist cut-points for the assessment of physical activity intensity of preschool aged children. Eur J Pediatr. 2017;176(8):1093–8. doi: 10.1007/s00431-017-2948-2. [DOI] [PubMed] [Google Scholar]

[CR68] 68.de Bock F, Menze J, Becker S, Litaker D, Fischer J, Seidel I. Combining accelerometry and HR for assessing preschoolers' physical activity. Med Sci Sports Exerc. 2010;42(12):2237–43. doi: 10.1249/MSS.0b013e3181e27b5d. [DOI] [PubMed] [Google Scholar]

[CR69] 69.Ettienne R, Nigg CR, Li F, Su Y, McGlone K, Luick B, et al. Validation of the Actical accelerometer in multiethnic preschoolers: The Children's Healthy Living (CHL) program. Hawai'i J Med Public Health. 2016;75(4):95–100. [PMC free article] [PubMed] [Google Scholar]

[CR70] 70.Hislop JF, Bulley C, Mercer TH, Reilly JJ. Comparison of epoch and uniaxial versus triaxial accelerometers in the measurement of physical activity in preschool children: a validation study. Pediatr Exerc Sci. 2012;24(3):450–60. doi: 10.1123/pes.24.3.450. [DOI] [PubMed] [Google Scholar]

[CR71] 71.Kahan D, Nicaise V, Reuben K. Convergent validity of four accelerometer cutpoints with direct observation of preschool children's outdoor physical activity. Res Q Exerc Sport. 2013;84(1):59–67. doi: 10.1080/02701367.2013.762294. [DOI] [PubMed] [Google Scholar]

[CR72] 72.Martin A, Mcneil M, Penpraze V, Dall P, Granat M, Paton JY, et al. Objective measurement of habitual sedentary behavior in pre-school children: comparison of activPAL with Actigraph monitors. Pediatr Exerc Sci. 2011;23(4):468–76. doi: 10.1123/pes.23.4.468. [DOI] [PubMed] [Google Scholar]

[CR73] 73.Pagels P, Boldemann C, Raustorp A. Comparison of pedometer and accelerometer measures of physical activity during preschool time on 3- to 5-year-old children. Acta Paediatr. 2011;100(1):116–20. doi: 10.1111/j.1651-2227.2010.01962.x. [DOI] [PubMed] [Google Scholar]

[CR74] 74.Pereira JR, Sousa-Sá E, Zhang Z, Cliff DP, Santos R. Concurrent validity of the ActiGraph GT3X+ and activPAL for assessing sedentary behaviour in 2–3-year-old children under free-living conditions. J Sci Med Sport. 2020;23(2):151–6. doi: 10.1016/j.jsams.2019.08.009. [DOI] [PubMed] [Google Scholar]

[CR75] 75.Butte NF, Wong WW, Lee JS, Adolph AL, Puyau MR, Zakeri IF. Prediction of energy expenditure and physical activity in preschoolers. Med Sci Sports Exerc. 2014;46(6):1216–26. doi: 10.1249/mss.0000000000000209. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR76] 76.Hislop J, Palmer N, Anand P, Aldin T. Validity of wrist worn accelerometers and comparability between hip and wrist placement sites in estimating physical activity behaviour in preschool children. Physiol Meas. 2016;37(10):1701–14. doi: 10.1088/0967-3334/37/10/1701. [DOI] [PubMed] [Google Scholar]

[CR77] 77.Johansson E, Ekelund U, Nero H, Marcus C, Hagströmer M. Calibration and cross-validation of a wrist-worn Actigraph in young preschoolers. Pediatr Obes. 2015;10(1):1–6. doi: 10.1111/j.2047-6310.2013.00213.x. [DOI] [PubMed] [Google Scholar]

[CR78] 78.Johansson E, Larisch LM, Marcus C, Hagströmer M. Calibration and validation of a wrist- and hip-worn Actigraph accelerometer in 4-year-old children. PLoS One. 2016;11(9):e0162436. doi: 10.1371/journal.pone.0162436. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR79] 79.Li S, Howard JT, Sosa ET, Cordova A, Parra-Medina D, Yin Z. Calibrating wrist-worn accelerometers for physical activity assessment in prechoolers : machine learning approaches. JMIR Form Res. 2020;4(8):e16727. doi: 10.2196/16727. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR80] 80.Reilly JJ, Coyle J, Kelly L, Burke G, Grant S, Paton JY. An objective method for measurement of sedentary behavior in 3- to 4-year olds. Obes Res. 2003;11(10):1155–8. doi: 10.1038/oby.2003.158. [DOI] [PubMed] [Google Scholar]

[CR81] 81.Sirard J, Trost S, Pfeiffer K, Dowda M, Pate R. Calibration and evaluation of an objective measure of physical activity in preschool children. J Phys Act Health. 2005;2(3):345–57. doi: 10.1123/jpah.2.3.345. [DOI] [Google Scholar]

[CR82] 82.Davies G, Reilly J, McGowan AJ, Dall P, Granat M, Paton JY. Validity, practical utility, and reliability of the Activpal in preschool children. Med Sci Sports Exerc. 2012;44(4):761–8. doi: 10.1249/MSS.0b013e31823b1dc7. [DOI] [PubMed] [Google Scholar]

[CR83] 83.Ahmadi MN, Brookes D, Chowdhury A, Pavey T, Trost SG. Free-living evaluation of laboratory-based activity classifiers in preschoolers. Med Sci Sports Exerc. 2020;52(5):1227–34. doi: 10.1249/mss.0000000000002221. [DOI] [PubMed] [Google Scholar]

[CR84] 84.Ahmadi MN, Chowdhury A, Pavey T, Trost SG. Laboratory-based and free-living algorithms for energy expenditure estimation in preschool children: a free-living evaluation. PLoS One. 2020;15(5):e0233229. doi: 10.1371/journal.pone.0233229. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR85] 85.Ahmadi MN, Pavey TG, Trost SG. Machine learning models for classifying physical activity in free-living preschool children. Sensors (Basel). 2020;20(16). 10.3390/s20164364 [DOI] [PMC free article] [PubMed]

[CR86] 86.Brønd JC, Grøntved A, Andersen LB, Arvidsson D, Olesen LG. Simple method for the objective activity type assessment with preschoolers, children and adolescents. Children (Basel). 2020;7(7). 10.3390/children7070072 [DOI] [PMC free article] [PubMed]

[CR87] 87.Hagenbuchner M, Cliff DP, Trost SG, Van Tuc N, Peoples GE. Prediction of activity type in preschool children using machine learning techniques. J Sci Med Sport. 2015;18(4):426–31. doi: 10.1016/j.jsams.2014.06.003. [DOI] [PubMed] [Google Scholar]

[CR88] 88.Steenbock B, Wright MN, Wirsik N, Brandes M. Accelerometry-based prediction of energy expenditure in preschoolers. J Meas Phys Behav. 2019;2(2):94–102. doi: 10.1123/jmpb.2018-0032. [DOI] [Google Scholar]

[CR89] 89.Trost SG, Cliff DP, Ahmadi MN, Tuc NV, Hagenbuchner M. Sensor-enabled activity class recognition in prechoolers: hip versus wrist Data. Med Sci Sports Exerc. 2018;50(3):634–41. doi: 10.1249/MSS.0000000000001460. [DOI] [PubMed] [Google Scholar]

[CR90] 90.Zhao W, Adolph AL, Puyau MR, Vohra FA, Butte NF, Zakeri IF. Support vector machines classifiers of physical activities in preschoolers. Physiol Rep. 2013;1(1):e00006. doi: 10.1002/phy2.6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR91] 91.Evenson KR, Catellier DJ, Gill K, Ondrak KS, McMurray RG. Calibration of two objective measures of physical activity for children. J Sports Sci. 2008;26(14):1557–65. doi: 10.1080/02640410802334196. [DOI] [PubMed] [Google Scholar]

[CR92] 92.Sadeh A, Lavie P, Scher A, Tirosh E, Epstein R. Actigraphic home-monitoring sleep-disturbed and control infants and young children: a new method for pediatric assessment of sleep-wake patterns. Pediatrics. 1991;87(4):494–499. [PubMed] [Google Scholar]

[CR93] 93.Redmond DP, Hegge FW. Observations on the design and specification of a wrist-worn human activity monitoring system. Behav Res Methods Instrum Comput. 1985;17(6):659–69. doi: 10.3758/BF03200979. [DOI] [Google Scholar]

[CR94] 94.Sitnick SL, Goodlin-Jones BL, Anders TF. The use of actigraphy to study sleep disorders in preschoolers: some concerns about detection of nighttime awakenings. Sleep. 2008;31(3):395–401. doi: 10.1093/sleep/31.3.395. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR95] 95.Ekblom O, Nyberg G, Bak EE, Ekelund U, Marcus C. Validity and comparability of a wrist-worn accelerometer in children. J Phys Act Health. 2012;9(3):389–93. doi: 10.1123/jpah.9.3.389. [DOI] [PubMed] [Google Scholar]

[CR96] 96.Kelly LA, Villalpando J, Carney B, Wendt S, Haas R, Ranieri BJ, et al. Development of Actigraph GT1M accelerometer cut-points for young children aged 12–36 months. J Athl Enhanc. 2016;5(4):1–4. doi: 10.4172/2324-9080.1000235. [DOI] [Google Scholar]

[CR97] 97.Puyau MR, Adolph AL, Vohra FA, Butte NF. Validation and calibration of physical activity monitors in children. Obes Res. 2002;10(3):150–7. doi: 10.1038/oby.2002.24. [DOI] [PubMed] [Google Scholar]

[CR98] 98.Schaefer CA, Nigg CR, Hill JO, Brink LA, Browning RC. Establishing and evaluating wrist cutpoints for the GENEActiv accelerometer in youth. Med Sci Sports Exerc. 2014;46(4):826–33. doi: 10.1249/MSS.0000000000000150. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR99] 99.Sun DX, Schmidt G, Teo-Koh SM. Validation of the RT3 accelerometer for measuring physical activity of children in simulated free-living conditions. Pediatr Exerc Sci. 2008;20(2):181–97. doi: 10.1123/pes.20.2.181. [DOI] [PubMed] [Google Scholar]

[CR100] 100.Sitnick SL, Goodlin-Jones BL, Anders TF. The use of actigraphy to study sleep disorders in preschoolers: some concerns about detection of nighttime awakenings. Sleep. 2008;31(3):395. doi: 10.1093/sleep/31.3.395. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR101] 101.Altenburg TM, Wang X, van Ekris E, Andersen LB, Møller NC, Wedderkopp N, et al. The consequences of using different epoch lengths on the classification of accelerometer based sedentary behaviour and physical activity. PLOS ONE. 2021;16(7):e0254721. doi: 10.1371/journal.pone.0254721. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Systematic review of accelerometer-based methods for 24-h physical behavior assessment in young children (0–5 years old)

Annelinde Lettink

Teatske M Altenburg

Jelle Arts

Vincent T van Hees

Mai J M Chinapaw

Abstract

Background

Methods

Results

Conclusions

PROSPERO Registration number

Supplementary Information

Introduction

Methods

Search strategy

Eligibility criteria

Selection procedures

Data extraction

Methodological quality assessment

Checklist development

Rating study results

Reliability

Validity

Table 1.

Quality of evidence grading

Results

Fig. 1.

Reliability

Table 2.

Validity

Infants

Table 3.

Toddlers

Table 4.

Preschoolers

Table 5.

Convergence between devices

Combining data

Discussion

Strengths and limitations

Recommendations for future studies

Conclusions

Supplementary Information

Acknowledgements

Authors’ contributions

Funding

Availability of data and materials

Declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Footnotes

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases