Skip to main content
Journal of Child and Adolescent Psychopharmacology logoLink to Journal of Child and Adolescent Psychopharmacology
. 2023 Nov 15;33(9):387–392. doi: 10.1089/cap.2023.0038

Machine Learning Identifies Smartwatch-Based Physiological Biomarker for Predicting Disruptive Behavior in Children: A Feasibility Study

Magdalena Romanowicz 1,2,, Kyle S Croarkin 3,*, Rana Elmaghraby 4,*, Michelle Skime 1, Paul E Croarkin 1,2,3, Jennifer L Vande Voort 1,2, Julia Shekunov 1,2, Arjun P Athreya 1,2,3,
PMCID: PMC10698791  PMID: 37966360

Abstract

Objective:

Parents frequently purchase and inquire about smartwatch devices to monitor child behaviors and functioning. This pilot study examined the feasibility and accuracy of using smartwatch monitoring for the prediction of disruptive behaviors.

Methods:

The study enrolled children (N = 10) aged 7–10 years hospitalized for the treatment of disruptive behaviors. The study team completed continuous behavioral phenotyping during study participation. The machine learning protocol examined severe behavioral outbursts (operationalized as episodes that preceded physical restraint) for preparing the training data. Supervised machine learning methods were trained with cross-validation to predict three behavior states—calm, playful, and disruptive.

Results:

The participants had a 90% adherence rate for per protocol smartwatch use. Decision trees derived conditional dependencies of heart rate, sleep, and motor activity to predict behavior. A cross-validation demonstrated 80.89% accuracy of predicting the child's behavior state using these conditional dependencies.

Conclusion:

This study demonstrated the feasibility of 7-day continuous smartwatch monitoring for children with severe disruptive behaviors. A machine learning approach characterized predictive biomarkers of impending disruptive behaviors. Future validation studies will examine smartwatch physiological biomarkers to enhance behavioral interventions, increase parental engagement in treatment, and demonstrate target engagement in clinical trials of pharmacological agents for young children.

Keywords: wearable, smartwatch, prediction, child behavior

Introduction

Evidence-based interventions such as parent–child interaction therapy (PCIT) reduce behavioral and emotional symptoms in children by improving parent–child relationships through the implementation of specific rules taught over a multiweek period (Chaffin et al., 2011; Eyberg et al., 2001; Hood and Eyberg, 2003).

Despite the widespread availability of PCIT, there are significant challenges: (1) the effectiveness of PCIT is contingent upon parents making a concerted effort to remember the rules imparted by a provider in engaging with their children and (2) families from rural areas and under-represented populations are less likely to have access to and utilize evidence-based therapies when it is available (Bauermeister et al., 2003; Bell et al., 2021; McIver et al., 2021; Rubinstein et al., 1985; Shi et al., 2021). Therefore, there is a need for innovations that utilize technologies to enhance PCIT's accessibility and effectiveness in differing evolving social contexts.

In the era of digital health (the use of communication and information technology for passive or remote monitoring to deliver precision health care), there is an opportunity to improve access and effectiveness of behavioral interventions in children and adolescents, including PCIT. Smartwatches measure individual functioning in real time and provide opportunities to identify point-of-care smartwatch biomarkers in young children that can provide prompts for parents of child's impending behavioral outbursts. Accessible digital tools may have the promise of increasing parental engagement for improved outcomes.

The growth in children's smartwatch market (valued at $1.15 billion in 2020) is largely driven by parental interest to track their child's functioning (Leroux et al., 2021). Recent research has demonstrated that wearables (e.g., Google glass) can be used to promote positive behavior in children diagnosed with autism spectrum disorder (Al Mamun et al., 2016; Carpenter et al., 2021; O'Brien et al., 2020; Pollreisz and TaheriNejad, 2017; Stephenson and Limbrick, 2015; Voss et al., 2019).

Other independent studies have demonstrated parental acceptance (Mackintosh et al., 2019), tolerance (Mackintosh et al., 2019), clinical validations of smartwatch activity in children aged 4–7 years (Byun et al., 2018), and other physiological measurements (Chinoy et al., 2021; Evenson and Spade, 2020; Stevens and Siengsukon, 2019; Tedesco et al., 2019).

This pilot study (see Fig. 1A) sought to establish whether physiological data from smartwatches could predict disruptive behaviors in a sample of hospitalized children. Specifically, this pilot study focused on examining the feasibility of smartwatch monitoring in children and whether machine learning approaches with physiological smartwatch data could predict impending disruptive behaviors. It was hypothesized that a predictive model with heart rate and sleep data could predict disruptive behaviors before their manifestation.

FIG. 1.

FIG. 1.

(A) Study design and analytical workflow. (B) Heart rate (solid blue line—instant heart rate; dashed red line—15-minute moving average) and intensity variations during the onset of a disruptive behavior. (C) Variations of moving average heart rate (and associated regression with 95% confidence interval) across behavior phenotypes in a 60-minute window. Solid black line is the average resting heart rate across all patients, and dashed black line is the average heart rate 1 standard deviation above the resting heart rate. (D) Instant heart rates across behavior phenotypes. The solid blue line is the instantaneous heart rate recorded each minute by the smartwatch and the horizontal dashed red line is the 15-minute moving average. The bottom and top black dashed lines are the resting heart rate (reported by smartwatch) and average heart rate 1 standard deviation above the resting heart rate. The solid vertical red lines are the start and end of the period child was in seclusion after an uncontrolled behavior outburst. (E) Variations in sleep on days with and without disruptive behaviors. (F) Decision tree to predict calm, playful, and onset of disruptive behavior. In the tree, within the predicted behavior (colored boxes as leaves of the tree), second row of numbers is proportion of events predicted by that conditional path of the tree, and the first row of numbers is the proportion of behavior events of each class predicted by the tree for the given conditional statement (e.g., in the green Playful box, 29% events are Calm; 0% events are Disruptive; 71% of events are Playful).

Methods

Sample

Hospitalized children with disruptive behaviors (N = 10) aged 7–10 years were enrolled in this IRB-approved study conducted during May 2020 through December 2020. Registered nurses (RNs) of the Mayo Clinic Child and Adolescent Psychiatry inpatient unit continuously assessed and annotated behavior for 24 hours daily during hospitalization (see behavior log in Supplementary Table S1). The annotations serve as the labels for behavior to be predicted using machine learning approaches that utilize data from smartwatches. Additional study inclusion–exclusion criteria are described in Supplementary Data.

Smartwatch enrollment

Participants wore Garmin vivosmart4 smartwatches during the inpatient hospitalization, except for times when they showered, had meals, or refused to wear the watch. The smartwatch collected motor activity (referred to as intensity by Garmin, which is minutes spent being sedentary, active, or highly active), heart rate (beats per minute), and sleep (including duration and sequence of sleep stages) data through the Garmin Connect mobile application. The smartwatch minute-level data were aggregated and collected through Fitabase. Each participant and family had an abbreviated PCIT training intervention during hospitalization.

Behavior phenotyping

Inpatient RNs annotated behavior states as either calm, playful, or disruptive using the behavior code described in Supplementary Table S1. Calm (behavior codes 1–4, 7, 8) indicated that the child was sedentary while performing routine activities such as reading, lying down, watching movie, talking/interacting with others, or doing their daily homework. Playful (behavior codes 5, 6) indicated that the child was engaged in a nondisruptive activity of elevated heart rate such as exercising (e.g., on a treadmill), playing a sport (e.g., basketball), or playing with other children (e.g., outdoor gymnasium).

Disruptive (behavior codes 9–11) indicated that the child showed uncontrolled aggression/behavior (e.g., tantrum) warranting intensive behavioral interventions, time-outs, or restraint for safety. For each of the calm, playful, and disruptive phenotypic annotations on a given day, the corresponding average heart rate in the preceding 60 minutes and the durations of sleep stages from the previous night were captured with the smartwatches (see Supplementary Table S2).

Machine learning for predicting disruptive behavior

Decision trees were trained to use heart rate (60-minute average) and sleep data to predict calm, playful, or impending disruptive behaviors (that warranted physical restraint; see Table 1 for distribution of disruptive behavior across subjects). Decision trees are nonparametric supervised machine learning algorithms that generate conditional dependency statements (that can be programmed into low-cost smartphones connecting to a smartwatch) to indicate the likelihood of an impending event. Tenfold cross-validation with chi-square pruning (using RPART package in R) was conducted to derive the optimal split points for each level of the tree to reach the final decision of whether the physiological data were predictive of either of the three behavior states.

Table 1.

Cohort Summary

Subject ID Gender Age Marital status of parents Days in the study Hours watch worn during day Watch wear time (% per day) Number of disruptive events Diagnosis Medications at discharge
1 F 9 Separated 15 151.5 87 12a ODDb, ADHD Amphetamine/dextroamphetamine salts, clonidine
2 M 10 Separated 6 123.5 95 4 ODDb, ADHD, Developmental Speech Language Disorder Aripiprazole, atomoxetine, clonidine, divalproex sodium, fluoxetine
3 M 8 Married 10 166.6 87 1 ODDb, ADHD Methylphenidate, sertraline, melatonin
4 F 9 Unknown 6 102.4 83 2 ODDb, RAD, ADHD Guanfacine, methylphenidate, quetiapine, melatonin, iron, carbonyl-vitamin C
5 F 8 Separated 8 139.6 93 4 ODDb, ADHD, OCD Amphetamine/dextroamphetamine salts, escitalopram
6 M 7 Separated 4 55.9 90 0 ODDb, ADHD Clonidine, quetiapine, cholecalciferol
7 M 7 Single 5 83.5 86 0 Problem behavioral childb, Anxiety disorder unspecified Fluoxetine, multivitamin
8 M 9 Married 17 356.2 91 17a Conduct disorderb, ADHD, PTSD Lurasidone, methylphenidate
9 F 9 Single 7 87.6 96 3 Adjustment Disorder Mixed Emotion and Conductb, Anxiety Melatonin
10 F 7 Single 5 102.6 97 0 Disruptive Behavior Disorderb, PTSD, Developmental Speech Language Disorder Clonidine, risperidone
a

Disruptive behavior events in subjects who warranted physical restraint and used in training the decision tree.

b

Primary diagnosis is in bold. ODD, oppositional defiant disorder; ADHD, attention deficit/hyperactivity disorder; RAD, reactive attachment disorder; OCD, obsessive compulsive disorder; PTSD, posttraumatic stress disorder.

Results

Smartwatch tolerability and compliance

The smartwatches were worn for an average of 7 days and 90% of the time, resulting in 1369.2 hours of physiological data (see Table 1). The study captured 29 disruptive events that resulted in physical restraint of the child for safety that were used as labels to infer smartwatch biomarkers using supervised machine learning methods. There were 267 and 47 hours where the child was either calm or playful, respectively, with no incidents of disruptive behavior for those time periods.

Smartwatch biomarkers

Motor activity (intensity) variations in 60 minutes of behavior phenotypes

The Garmin vivosmart4 smartwatch provides three levels of intensity of motor activity (Inc; Garmin Data Dictionary 2020). Sedentary: little to no movement; active: engaging in a walk; and highly active: engaging in running or jumping jacks. In the 60-minute duration of when subjects were calm or playful (without disruptive behavior) or leading up to disruptive behavior, the proportion of motor activity intensities was significantly different (p < 2E-16, chi-square test) across the three behavior phenotypes. Figure 1B illustrates a calm period that preceded an episode of dysregulation.

Heart rate variations across behavior phenotypes

The moving heart rate over 60 minutes and related behavioral phenotypes are illustrated in Figure 1C (the 95% confidence interval around the mean as derived using linear regression). The heart rate during the entire 60 minutes was significantly different across the three behavior phenotypes (p < 2E-16). Notably, when participants were characterized as calm, there was no difference (p > 0.8) between the mean heart rate during the 60-minute calm period and the overall resting heart rate (the solid black line shown in Fig. 1D).

Sleep variations across behavior phenotypes

The Garmin vivosmart4 watch provides the duration and sequence of sleep stages (light, deep, rapid eye movement [REM], and awake) (Chinoy et al., 2021). The total sleep durations (in hours) or amount of time awake was not significantly different among participants who exhibited disruptive behavior and those who did not (p > 0.16). However, the durations of REM, light, and deep sleep stages were different between participants who exhibited disruptive behavior versus those who did not (p ≤ 0.04; see Fig. 1E). In participants who exhibited disruptive behavior, the odds (odds ratio 6) of disruptive behavior were significant (p < 0.05) if the duration of light sleep exceeded 4 hours, and there were no significant associations with the total sleep duration or duration of awake, deep, and REM sleep.

Predicting behavior phenotypes by combining heart rate, intensity, and sleep measures

A decision tree derived conditional dependencies with physiological data predictive of the three behavior states (Fig. 1F). If the heart rate is >129 bpm, then there was a 71% chance of the child being playful. If the heart rate was <105 and with ≥7 hours of light sleep (the night before), then there was a 67% chance of a disruptive behavior in the next 60 minutes. In all other instances, the child was likely calm. In the 10-fold cross-validation across 10 participants, the accuracy of predicting the child's behavior state using these conditional dependencies was 80.89% (with a false-positive rate of 7.2% for disruptive behavior).

Discussion

This pilot study demonstrated the feasibility of machine learning methods using data from smartwatches to predict disruptive behavior in children. The 10 subjects in the study had a 90% adherence rate for wearing the smartwatch for the duration of their hospitalization. Decision trees utilizing heart rate, sleep, and motor activity data derived from the smartwatch predicted child's behavior state (i.e., calm, playful, or impending disruptive behavior) with an accuracy of 80.89%.

Prior research has demonstrated parental acceptance (Mackintosh et al., 2019), tolerance (Mackintosh et al., 2019), clinical validations of smartwatch measures in children aged 4–7 years (Byun et al., 2018) and Garmin's measurements (Chinoy et al., 2021; Evenson and Spade, 2020; Stevens and Siengsukon, 2019; Tedesco et al., 2019). This digital biomarker framework will inform the development of ecologically valid decentralized interventions for improved contemporary family engagement and health.

These predictive platforms have the promise of facilitating objective reliable assessments for children with emotional and behavioral symptoms as opposed to standard questionnaires, interviews with parents recreating behavioral events, and the time burdens associated with current approaches (Abernethy et al., 2022; Harvey et al., 2022). Future innovations with the present platform will also increase the accessibility of evidence-based treatment for children and families.

Smartwatches with U.S. Food and Drug Administration (FDA)-approved features (e.g., to detect atrial fibrillation) are becoming ubiquitous across all ages for wellness monitoring and improved health-related outcomes. Cardiology practice has adopted the use of smartwatch devices for monitoring and assessment of remote cardiac monitoring (Perez et al., 2019). For example, benchmarks of clinically meaningful predictive accuracies for atrial fibrillation are established and documented for FDA applications. Child and adolescent psychiatry practice has been slow to evolve in this area despite a considerable need for more objective assessments.

Additional studies (e.g., NCT05077722 and NCT05725525) are underway to establish benchmarks and the utility of wearable devices for monitoring, decision support, augmented interventions, and predictive models. Valid and accessible digital biomarkers will have great utility for future pharmacological trials for young children.

This study has limitations. First, an inpatient monitoring study did not present specific behavioral triggers in outpatient settings (e.g., home, school, and interpersonal stressors). Biomarker thresholds and accuracies of predicted behavior states may vary between patients based on parents' perception of what constitutes disruptive behavior versus nearly homogeneous annotations of disruptive behavior marked by trained nurses. Second, only one smartwatch brand was used and additional studies with trackers from other vendors (e.g., fitbit by Google, Apple, Inc.) are needed. Third, the study used data from hours with complete measurements, hence the impact of time gaps in measurements is yet to be understood.

Fourth, it is possible that more computationally intensive algorithms may outperform prediction performance of decision trees with larger sample sizes and duration of observation. Fifth, although the study psychiatrists and medical records did not identify any medication-related side effects in the sample, the potential impact of concurrent psychotropic medications on physiological measures (including sleep disturbances) is unknown as it is highly possible that the threshold of biomarkers varies with time and improves behavior states.

Therefore, larger studies with longer duration of observation and treatment modalities are needed to further develop and refine time-series algorithms to predict impending disruptive behavior in children. Finally, digital biomarkers derived in this study need validation in an outpatient setting to guide the development of a technology-enhanced version of behavioral interventions such as PCIT.

Clinical Significance

There is a critical societal need to better understand and leverage scalable remote technologies to foster healthy parenting practices within the framework of traditional evidence-based behavioral therapies such as PCIT. This feasibility study demonstrates the capability to introduce precision target-based (e.g., heart rate, sleep, and motor activity) technologies using smartwatch-derived biomarkers to improve effectiveness of psychosocial interventions in young children by predicting disruptive behavior in children before manifestation. Such biomarkers when successful could be utilized in delivering precision care to children and families through remote care capabilities—thereby addressing barriers to accessing specialty pediatric behavior clinics.

Supplementary Material

Supplemental data
Suppl_TableS1.docx (45.6KB, docx)
Supplemental data
Suppl_Data.docx (10.3KB, docx)
Supplemental data
Suppl_TableS2.docx (12.5KB, docx)

Contributor Information

Collaborators: Mayo Clinic Inpatient Child and Adolescent Psychiatry Registered Nurse Staff

Authors' Contributions

A.P.A., M.R., J.S., and P.E.C. contributed to the conceptualization, methodology, data analyses interpretation, writing of the original draft, and visualization. The Mayo Clinic Inpatient Child and Adolescent Psychiatry Registered Nurse Staff K.S.C. and R.E. contributed to data curation. J.S. contributed to study resources. A.P.A. and K.S.C. completed data analyses. A.P.A. and M.R. contributed to writing the original draft and supervision of K.S.C. and R.E. All authors contributed to reviewing, editing, and writing of this article.

Disclosures

Study watches were purchased from commercial vendors at retail price using funding from Mayo Medical Foundation. Garmin International or Fitabase did not fund the study in any form nor did these companies have any input into the design, implementation, conduct, data analyses, and data interpretation of the study.

Supplementary Material

Supplementary Data

Supplementary Table S1

Supplementary Table S2

References

  1. Abernethy A, Adams L, Barrett M, et al. The promise of digital health: Then, now, and the future. NAM Perspect 2022; doi: 10.31478/202206e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Al Mamun KA, Bardhan S, Ullah MA, et al. Smart autism—A mobile, interactive and integrated framework for screening and confirmation of autism. Annu Int Conf IEEE Eng Med Biol Soc 2016; 2016:5989–5992. [DOI] [PubMed] [Google Scholar]
  3. Bauermeister JJ, Canino G, Bravo M, et al. Stimulant and psychosocial treatment of ADHD in Latino/Hispanic children. J Am Acad Child Adolesc Psychiatry 2003;42:851–855. [DOI] [PubMed] [Google Scholar]
  4. Bell S, Deen JF, Fuentes M, et al. Caring for American Indian and Alaska Native children and adolescents. Pediatrics 2021;147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Byun W, Lee JM, Kim Y, et al. Classification accuracy of a wearable activity tracker for assessing sedentary behavior and physical activity in 3-5-year-old children. Int J Environ Res Public Health 2018;15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Carpenter KLH, Hahemi J, Campbell K, et al. Digital behavioral phenotyping detects atypical pattern of facial expression in toddlers with autism. Autism Res 2021;14:488–499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chaffin M, Funderburk B, Bard D, et al. A combined motivation and parent-child interaction therapy package reduces child welfare recidivism in a randomized dismantling field trial. J Consult Clin Psychol 2011;79:84–95. [DOI] [PubMed] [Google Scholar]
  8. Chinoy ED, Cuellar JA, Huwa KE, et al. Performance of seven consumer sleep-tracking devices compared with polysomnography. Sleep 2021;44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Evenson KR, Spade CL. Review of validity and reliability of garmin activity trackers. J Meas Phys Behav 2020;3:170–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Eyberg SM, Funderburk BW, Hembree-Kigin TL, et al. Parent-child interaction therapy with behavior problem children: One and two year maintenance of treatment effects in the family. Child Fam Behav Ther 2001;23:1–20. [Google Scholar]
  11. Garmin Data Dictionary. 2020. Available from: https://www.fitabase.com/media/1781/fitabase-garmin-data-dictionary-11420.pdf [Last accessed: November 7, 2023].
  12. Harvey PD, Depp CA, Rizzo AA, et al. Technology and mental health: State of the art for assessment and treatment. Am J Psychiatry 2022;179:897–914. [DOI] [PubMed] [Google Scholar]
  13. Hood KK, Eyberg SM. Outcomes of parent-child interaction therapy: Mothers' reports of maintenance three to six years after treatment. J Clin Child Adolesc Psychol 2003;32:419–429. [DOI] [PubMed] [Google Scholar]
  14. Leroux J, Bordachar P, Strik M, et al. Recording an ECG with a smartwatch in newborns and young children: Feasibility and perspectives. Can J Cardiol 2021;37(11):1877–1879. [DOI] [PubMed] [Google Scholar]
  15. Mackintosh KA, Chappel SE, Salmon J, et al. Parental perspectives of a wearable activity tracker for children younger than 13 years: Acceptability and Usability Study. JMIR Mhealth Uhealth 2019;7:e13858. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. McIver MB, Colby S, Hansen-Petrik M, et al. Caregiver feeding practices as predictors for child dietary intake in Low-Income, Appalachian Communities. Nutrients 2021;13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. O'Brien AM, Schlosser RW, Shane H, et al. Providing visual directives via a smart watch to a student with Autism Spectrum Disorder: An intervention note. Augment Altern Commun 2020;36:249–257. [DOI] [PubMed] [Google Scholar]
  18. Perez MV, Mahaffey KW, Hedlin H, et al. Large-scale assessment of a smartwatch to identify atrial fibrillation. N Engl J Med 2019;381:1909–1917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Pollreisz D, TaheriNejad N. A simple algorithm for emotion recognition, using physiological signals of a smart watch. Annu Int Conf IEEE Eng Med Biol Soc 2017;2017:2353–2356. [DOI] [PubMed] [Google Scholar]
  20. Rubinstein N, Kopolovic J, Wexler MR, et al. Malignant blue nevus. J Dermatol Surg Oncol 1985;11:921–923. [DOI] [PubMed] [Google Scholar]
  21. Shi Y, Hunter Guevara LR, Dykhoff HJ, et al. Racial disparities in diagnosis of attention-deficit/hyperactivity disorder in a US National Birth Cohort. JAMA Netw Open 2021;4:e210321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Stephenson J, Limbrick L. A review of the use of touch-screen mobile devices by people with developmental disabilities. J Autism Dev Disord 2015;45:3777–3791. [DOI] [PubMed] [Google Scholar]
  23. Stevens S, Siengsukon C.. Commercially-Available Wearable Provides Valid Estimate of Sleep Stages (P3. 6-042). AAN Enterprises: Minnesota, USA; 2019. [Google Scholar]
  24. Tedesco S, Sica M, Ancillao A, et al. Validity Evaluation of the Fitbit Charge2 and the Garmin vivosmart HR+ in free-living environments in an older adult cohort. JMIR Mhealth Uhealth 2019;7:e13084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Voss C, Schwartz J, Daniels J, et al. Effect of wearable digital intervention for improving socialization in children with autism spectrum disorder: A Randomized Clinical Trial. JAMA Pediatr 2019;173:446–454. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental data
Suppl_TableS1.docx (45.6KB, docx)
Supplemental data
Suppl_Data.docx (10.3KB, docx)
Supplemental data
Suppl_TableS2.docx (12.5KB, docx)

Articles from Journal of Child and Adolescent Psychopharmacology are provided here courtesy of Mary Ann Liebert, Inc.

RESOURCES