Skip to main content
JAMA Network logoLink to JAMA Network
. 2022 Jul 11;5(7):e2221325. doi: 10.1001/jamanetworkopen.2022.21325

Development and Validation of a Deep Learning Method to Predict Cerebral Palsy From Spontaneous Movements in Infants at High Risk

Daniel Groos 1, Lars Adde 2,3,, Sindre Aubert 4, Lynn Boswell 5, Raye-Ann de Regnier 5,6, Toril Fjørtoft 2,3, Deborah Gaebler-Spira 6,7, Andreas Haukeland 4, Marianne Loennecken 8, Michael Msall 9,10, Unn Inger Möinichen 8, Aurelie Pascal 11, Colleen Peyton 6,12, Heri Ramampiaro 4, Michael D Schreiber 12, Inger Elisabeth Silberg 8, Nils Thomas Songstad 13, Niranjan Thomas 14, Christine Van den Broeck 11, Gunn Kristin Øberg 8,15, Espen AF Ihlen 1, Ragnhild Støen 2,16
PMCID: PMC9274325  PMID: 35816301

Key Points

Question

What is the external validity of a deep learning–based method to predict cerebral palsy (CP) based on infants’ spontaneous movements at 9 to 18 weeks’ corrected age?

Findings

In this prognostic study of 557 infants with a high risk of perinatal brain injury, a deep learning–based method for early prediction of CP had sensitivity of 71%, specificity of 94%, positive predictive value of 68%, and negative predictive value of 95%. Prognosis of CP based on the deep learning–based method was associated with later functional level and CP subtype in children with CP.

Meaning

This study’s findings suggest that deep learning–based assessments could support early detection of CP in infants at high risk.

Abstract

Importance

Early identification of cerebral palsy (CP) is important for early intervention, yet expert-based assessments do not permit widespread use, and conventional machine learning alternatives lack validity.

Objective

To develop and assess the external validity of a novel deep learning–based method to predict CP based on videos of infants’ spontaneous movements at 9 to 18 weeks’ corrected age.

Design, Setting, and Participants

This prognostic study of a deep learning–based method to predict CP at a corrected age of 12 to 89 months involved 557 infants with a high risk of perinatal brain injury who were enrolled in previous studies conducted at 13 hospitals in Belgium, India, Norway, and the US between September 10, 2001, and October 25, 2018. Analysis was performed between February 11, 2020, and September 23, 2021. Included infants had available video recorded during the fidgety movement period from 9 to 18 weeks’ corrected age, available classifications of fidgety movements ascertained by the general movement assessment (GMA) tool, and available data on CP status at 12 months’ corrected age or older. A total of 418 infants (75.0%) were randomly assigned to the model development (training and internal validation) sample, and 139 (25.0%) were randomly assigned to the external validation sample (1 test set).

Exposure

Video recording of spontaneous movements.

Main Outcomes and Measures

The primary outcome was prediction of CP. Deep learning–based prediction of CP was performed automatically from a single video. Secondary outcomes included prediction of associated functional level and CP subtype. Sensitivity, specificity, positive and negative predictive values, and accuracy were assessed.

Results

Among 557 infants (310 [55.7%] male), the median (IQR) corrected age was 12 (11-13) weeks at assessment, and 84 infants (15.1%) were diagnosed with CP at a mean (SD) age of 3.4 (1.7) years. Data on race and ethnicity were not reported because previous studies (from which the infant samples were derived) used different study protocols with inconsistent collection of these data. On external validation, the deep learning–based CP prediction method had sensitivity of 71.4% (95% CI, 47.8%-88.7%), specificity of 94.1% (95% CI, 88.2%-97.6%), positive predictive value of 68.2% (95% CI, 45.1%-86.1%), and negative predictive value of 94.9% (95% CI, 89.2%-98.1%). In comparison, the GMA tool had sensitivity of 70.0% (95% CI, 45.7%-88.1%), specificity of 88.7% (95% CI, 81.5%-93.8%), positive predictive value of 51.9% (95% CI, 32.0%-71.3%), and negative predictive value of 94.4% (95% CI, 88.3%-97.9%). The deep learning method achieved higher accuracy than the conventional machine learning method (90.6% [95% CI, 84.5%-94.9%] vs 72.7% [95% CI, 64.5%-79.9%]; P < .001), but no significant improvement in accuracy was observed compared with the GMA tool (85.9%; 95% CI, 78.9%-91.3%; P = .11). The deep learning prediction model had higher sensitivity among infants with nonambulatory CP (100%; 95% CI, 63.1%-100%) vs ambulatory CP (58.3%; 95% CI, 27.7%-84.8%; P = .02) and spastic bilateral CP (92.3%; 95% CI, 64.0%-99.8%) vs spastic unilateral CP (42.9%; 95% CI, 9.9%-81.6%; P < .001).

Conclusions and Relevance

In this prognostic study, a deep learning–based method for predicting CP at 9 to 18 weeks’ corrected age had predictive accuracy on external validation, which suggests possible avenues for using deep learning–based software to provide objective early detection of CP in clinical settings.


This prognostic study describes the development of a deep learning–based method to predict cerebral palsy based on videos of infants’ spontaneous movements at 9 to 18 weeks’ corrected age and assesses the external validity of the deep learning prediction model.

Introduction

Cerebral palsy (CP) is the most common physical disability in children, producing functional limitation and co-occurring impairments1 (eg, pain, musculoskeletal deformities, seizures, and communication and sleep disorders) because of injury to the developing brain.2 Cerebral palsy is typically diagnosed between ages 12 and 24 months, and milder forms of CP may be diagnosed even later in childhood.3,4 Early identification of infants with a high risk of CP is important to provide targeted follow-up and interventions during infancy when neuroplasticity is high,5,6 improve access to community services to minimize complications,7 and reassure parents of infants at high risk if their children are unlikely to develop CP.8

Prechtl et al9,10,11 introduced the general movement assessment (GMA) tool 25 years ago as a method to predict CP. From birth until 2 months’ corrected age, general movements (ie, spontaneous movements involving the whole body) have a writhing character and later occur as fidgety movements.12 The GMA is recommended as the most accurate clinical test for CP prognosis among infants younger than 5 months4,13 based on the absence of the fidgety type of general movements.14,15 The GMA is based on clinical experts’ observation of infants’ general movements in video recordings. The method requires training,16 and rater experience may alter GMA reliability.17 These factors hamper widespread clinical use.18

With advancements in the field of artificial intelligence, machine learning techniques have been developed as objective low-cost alternatives to the GMA.18,19,20,21,22 Former machine learning techniques for tracking and classification of infants’ spontaneous movements generally aimed to predict CP by proposing restricted sets of manually selected movement features used in combination with conventional statistical methods (eg, logistic regression analysis and support vector machines).23,24,25,26,27 A recent study28 found that the predictive values of these conventional machine learning–based CP prediction models were similar to the predictive values of the GMA. Despite this progress, there are fundamental challenges yet to be addressed. The restricted set of manually selected movement features has an unknown association with the observational GMA tool, which calls into question the construct validity of conventional machine learning techniques. External validation is consequently lacking because of small samples and short follow-up duration.18,21 As a result, validation is performed using less conservative methods (including leave-one-out cross-validation) and the absence of fidgety movements as a surrogate predictor for CP.18,29

A new field within machine learning, called deep learning, has enabled automatic detection of discriminative movement features through representation learning.30 This process involves dynamically selecting features relevant to the task at hand without any human expert involvement. The accuracy of deep learning improves with increasing amounts of data (eg, videos), and deep learning has the capacity to detect features representing intricate associations in the data, such as complex full-body general movements.

Our primary objective was to develop a deep learning–based early prediction model of CP based on infants’ spontaneous movements during the fidgety movement period from 9 to 18 weeks’ corrected age and to perform external validation using a multicenter sample of infants with a high risk of perinatal brain injury. Our secondary objective was to compare the predictive accuracy of the deep learning–based prediction method with the accuracy of the clinically recommended GMA tool and the conventional machine learning method and to evaluate the ability of the deep learning method to predict functional level and CP subtype.

Methods

Participants

This prognostic study of patients at 13 hospitals was approved by the regional Committee for Medical and Health Research Ethics in Norway and local institutional review boards in Belgium, India, and the US. Written informed consent was obtained from parents before study inclusion (including written parental consent for publication of an infant image). This study followed the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) reporting guideline.

The sample comprised 557 infants with a high risk of perinatal brain injury who were prospectively enrolled in previous studies27,31,32,33 of CP risk prediction between September 10, 2001, and October 25, 2018. Statistical analysis was performed between February 11, 2020, and September 23, 2021. A description of these previous studies is available in eAppendix 1 in the Supplement. All studies included infants with an increased risk of abnormal neurodevelopment, which was identified before discharge from the neonatal intensive care unit (eAppendix 2 and eTable 1 in the Supplement). Infants were included based on the following criteria: (1) available video following the standards of the Prechtl GMA tool34 recorded during the fidgety movement period from 9 to 18 weeks’ corrected age, (2) available GMA classifications of fidgety movements, and (3) available data on CP status at 12 months’ corrected age or older. Two infants with videos recorded at 7 weeks’ and 8 weeks’ corrected age were included. Both were correctly classified by the GMA; 1 infant had intermittent fidgety movements and did not develop CP, and 1 infant had absent fidgety movements and was diagnosed with CP. Data on infants excluded because of missing video recordings, GMA classification, or CP status are reported elsewhere.27,31,32,33 The sample size was determined by the number of infants from the previous studies27,31,32,33 who had available data.

Videos and Classification of General Movements

Infants were recorded in the supine position during active wakefulness for a median of 5 minutes (range, 1-9 minutes) following GMA standards.34 A conventional video camera (Sanyo VPC-HD2000 Xacti dual camera [Funai Corporation] or Sony Handycam DCR-PC100E [Sony Electronics Inc]) at a median recording rate of 30 frames per second (range, 24-60 frames per second) and a median video resolution of 720 × 1280 pixels (range, 576 × 720 to 1080 × 1920 pixels) was used in a standardized setup comprising a mattress and a stationary overhead camera. If more than 1 recording was available, the recording made between 12 weeks’ and 13 weeks’ corrected age was used.

Two certified observers (L.A. and T.F.) who were blinded to the medical history of the infants performed classification of fidgety movements for all infants. Fidgety movements were classified as normal (sporadically, intermittently, or continuously present) or abnormal (absent). Classification of sporadic fidgety movements as normal was based on a previous study that found a low risk of CP among infants with sporadic fidgety movements.31 Infants classified with exaggerated fidgety movements that were excessive in amplitude and speed were excluded a priori from the analysis because of unpredictable outcomes among infants in this category. In cases of disagreement between observers, videos were reassessed by the same 2 observers, and consensus was reached.

Cerebral Palsy Status, Subtype, and Functional Level

The primary outcome of CP was diagnosed by a pediatrician who was unaware of GMA classifications and followed the CP decision tree of the Surveillance of Cerebral Palsy in Europe.35 This diagnosis included classification of CP subtypes into spastic unilateral, spastic bilateral, dyskinetic, and ataxic. Follow-up times differed between studies, ranging from ages 18 months to 5 years.27,31,32,33 The Gross Motor Function Classification System (GMFCS; levels I-V, with level I indicating the ability to walk without limitations; level II, the ability to walk with limitations; level III, the ability to walk using a handheld mobility device; level IV, the ability for self-mobility with limitations [may need to use powered mobility]; and V, the need to be transported in a manual wheelchair)3 was used to classify functional levels into ambulatory CP (levels I, II, and III) and nonambulatory CP (levels IV and V).

Method Development and External Validation

To achieve representative samples for method development (ie, training and internal validation) and external validation, all infants at high risk were stratified into classes based on the study in which they were originally enrolled,27,31,32,33 the country of the center in which the study was conducted (Belgium, India, Norway, or the US) (step 1 in Figure 1), and their CP subtype (spastic bilateral CP, spastic unilateral CP, or no CP) (step 2 in Figure 1). Data on race and ethnicity were not reported because the previous studies (from which the infant samples were derived) used different study protocols with inconsistent collection of these data.

Figure 1. Data Sets for Development and External Validation.

Figure 1.

Infants diagnosed with cerebral palsy (CP) for whom subtype was not available were classified as having spastic unilateral CP (UL CP) if they had a Gross Motor Function Classification System level of I or II and classified as having spastic bilateral CP (BL CP) if they had a Gross Motor Function Classification System level of III, IV, or V. Infants with dyskinetic CP and ataxic CP were classified as having BL CP. A total of 75.0% of infants of each class (orange path in step 3) were randomly assigned to the method development (training and internal validation) sample, and the remaining 25.0% were randomly assigned (blue path in step 3) to the external validation sample. CA indicates corrected age.

A total of 75.0% of infants of each class (orange path in step 3 of Figure 1) were randomly assigned to the method development (training and internal validation) sample, and the remaining 25.0% were randomly assigned (blue path in step 3 of Figure 1) to the external validation sample (1 test set). Infants assigned to the method development sample were further divided into 7 internal validation samples (ie, folds), each comprising 9 infants with CP and 50 or 51 infants without CP. This additional division enabled 7-fold cross-validation for evaluating internal validity. The internal validation samples were constructed using a similar procedure for stratification based on study center and CP subtype (as performed with the external validation test set shown in Figure 1).

Deep Learning Method

The overall concept of the deep learning method for CP prediction is presented in Figure 2. The method comprised 4 steps: (1) motion tracking, (2) creation of a skeleton sequence, (3) development of a deep learning–based prediction model, and (4) prediction of CP.

Figure 2. Steps Involved in Deep Learning–Based Method for Cerebral Palsy Prediction.

Figure 2.

In the deep learning–based method, a video-based infant motion tracker (step 1) constructs a skeleton sequence of 5-second (5s) windows (step 2), in which a deep learning–based prediction model estimates cerebral palsy (CP) risk in each 5-second window by detecting single-joint movements over a few time steps in the initial model layers and whole-body movements over many time steps in the later model layers (step 3). Next, CP risk of the total video is aggregated to classify an infant as having CP or no CP (step 4) based on the decision threshold (dashed line) and yield uncertainty of classification (color coding, with red representing certain classification of CP, orange representing uncertain classification of CP, yellow representing uncertain classification of no CP, and green representing certain classification of no CP). Written parental consent was obtained for publication of the infant image in step 1.

Motion Tracking

The raw video was processed by a motion tracker36 that localized horizontal and vertical coordinates of 19 important body points (forehead, nose, ears, upper neck, shoulders, elbows, wrists, upper chest, right pelvis, left pelvis, midpelvis, knees, and ankles) (step 1 in Figure 2), creating an infant skeleton (ie, a full pose of the infant in the form of a digital skeleton model). The motion tracker had previously been trained and validated on infant videos according to GMA standards following in-motion poses as defined by Groos et al37; further technical details of the motion tracker are described in the original articles by Groos et al.36,37

Skeleton Sequence

The infant skeletons of all video frames composed a spatiotemporal skeleton sequence (step 2 in Figure 2) representing infant movements in the video. The skeleton sequence was divided into 5-second windows, which were processed by the deep learning–based prediction model to estimate CP risk in that particular window.

Deep Learning–Based CP Prediction Model

To automatically detect movement features associated with CP, a novel deep learning procedure was developed. A deep learning model consists of multiple layers (step 3 in Figure 2). The initial layers detect features of movements of a single limb or joint, whereas subsequent layers detect features of whole-body movements. To prevent manual selection bias, the optimal model architecture was set by an automatic search on the training and internal validation data. The first 10 automatically selected models were defined as artificial experts and retrained on the 7 internal validation samples (internal validation results are provided in eTable 2 in the Supplement). Each of the resulting 70 artificial expert instances used the biomechanical properties (position, velocity, and body segment length) in 5-second windows to detect whole-body movement features that distinguished infants with CP from infants without CP. Details on the automatic search procedure and configurations of selected deep learning models are available in eAppendix 3, eTable 3, eTable 4, and eFigure 1 in the Supplement.

With regard to the group of artificial experts and uncertainty of decisions, based on the obtained movement features in each of the 70 artificial expert instances, CP risk was estimated on a continuous scale from low (0%) to high (100%). The median value of the 70 individual artificial expert predictions was used as CP risk in the 5-second window, with uncertainty of CP risk color coded based on the level of agreement across the 70 predictions. Green (with 0-17 agreements [<25.0%] predicting CP) and yellow (with 18-35 agreements [≤50.0%] predicting CP) represented certain and uncertain predictions of no CP, respectively. Orange (with 36-52 agreements [>50.0%] predicting CP) and red (with 53-70 agreements [>75.0%] predicting CP) represented uncertain and certain predictions of CP, respectively.

Prediction of CP

The final score for CP risk in a total video was estimated as the median CP risk across all 5-second windows of the skeleton sequence (step 4 in Figure 2). This score was used to classify an infant as having CP or no CP based on a fixed decision threshold (different thresholds are shown in eAppendix 4, eFigure 2, and eTable 5 in the Supplement). A classification of CP was considered certain (red) if more than 75.0% of the artificial expert predictions were classified as CP and uncertain (orange) if more than 50.0% were classified as CP. A classification of no CP was considered uncertain (yellow) if 50.0% or fewer of the artificial expert predictions were classified as CP and certain (green) if fewer than 25.0% were classified as CP (step 4 in Figure 2).

Conventional Machine Learning Method

To enable objective comparison between the deep learning method and the conventional machine learning method previously described,28 retraining of the conventional machine learning method was performed on skeleton sequences of 19 important body points in the method development data set. Additional details about the conventional machine learning method have been published previously by Ihlen et al.28

Statistical Analysis

The sensitivity of the methods used for external validation was fixed a priori based on the sensitivity level of the GMA tool to ensure objective comparisons. The Clopper-Pearson method was used to calculate 95% CIs for sensitivity, specificity, positive and negative predictive value, and accuracy, which were computed using the conf package in R software, version 4.0 (R Foundation for Statistical Computing). The difference in CP risk between infants with ambulatory CP (GMFCS level I, II, or III) and nonambulatory CP (GMFCS level IV or V) was assessed using a 2-sided Wilcoxon rank sum test and computed using algorithms from the SciPy library in Python, version 3.6 (Python Software Foundation). A Wilcoxon rank sum test was also used to assess the difference in CP risk among infants with spastic unilateral CP vs spastic bilateral CP. The significance threshold was 2-tailed P < .05.

Results

Among 557 infants at high risk, 310 (55.7%) were male, and 247 (44.3%) were female; the median (IQR) corrected age at assessment was 12 (11-13) weeks, and 84 infants (15.1%) had a diagnosis of CP at a mean (SD) age of 3.4 (1.7) years (eTable 1 in the Supplement). The median (IQR) corrected age at which CP status was evaluated was 38 (23-46) months. A total of 418 infants (75.0%) were randomly assigned to the model development sample, and 139 (25.0%) were randomly assigned to the external validation sample.

Predictive accuracies of the deep learning method, the GMA tool, and the conventional machine learning method are presented in the Table. On external validation, the deep learning–based CP prediction method had sensitivity of 71.4% (95% CI, 47.8%-88.7%), specificity of 94.1% (95% CI, 88.2%-97.6%), positive predictive value of 68.2% (95% CI, 45.1%-86.1%), and negative predictive value of 94.9% (95% CI, 89.2%-98.1%). In comparison, the GMA tool had sensitivity of 70.0% (95% CI, 45.7%-88.1%), specificity of 88.7% (95% CI, 81.5%-93.8%), positive predictive value of 51.9% (95% CI, 32.0%-71.3%), and negative predictive value of 94.4% (95% CI, 88.3%-97.9%). The deep learning method achieved higher accuracy than the conventional machine learning method (90.6% [95% CI, 84.5%-94.9%] vs 72.7% [95% CI, 64.5%-79.9%]; P < .001), but no significant improvement in accuracy was observed compared with the GMA tool (85.9%; 95% CI, 78.9%-91.3%; P = .11).

Table. Predictive Values on External Validation Given a Fixed Sensitivity of 70.0%a.

Method Result, No. Validation measure, % (95% CI)
True positive False positive True negative False negative Sensitivity Specificity PPV NPV Accuracy
Deep learning 15 7 111 6 71.4 (47.8-88.7) 94.1 (88.2-97.6) 68.2 (45.1-86.1) 94.9 (89.2-98.1) 90.6 (84.5-94.9)
GMA 14 13 102 6 70.0 (45.7-88.1) 88.7 (81.5-93.8) 51.9 (32.0-71.3) 94.4 (88.3-97.9) 85.9 (78.9-91.3)
Conventional machine learning 15 32 86 6 71.4 (47.8-88.7) 72.9 (63.9-80.7) 31.9 (19.1-47.1) 93.5 (86.3-97.6) 72.7 (64.5-79.9)

Abbreviations: GMA, general movement assessment tool; NPV, negative predictive value; PPV, positive predictive value.

a

The external validation sample included 4 infants (1 with cerebral palsy and 3 without cerebral palsy) with exaggerated fidgety movements (excluded by the GMA), yielding 3 true-negative results and 1 false-negative result, both with deep learning–based and conventional machine learning–based predictions of cerebral palsy. Sensitivity was fixed based on the sensitivity level of the GMA tool.

The external validation sample comprised 139 infants; of those, 21 infants (15.1%) were diagnosed with CP. Among those diagnosed with CP, 12 infants (57.1%) were correctly classified as having certain CP, and 2 infants (9.5%) were incorrectly classified as having certain no CP (red and green box plots in Figure 3C). Of 118 infants without CP, 104 (88.1%) were correctly classified as having certain no CP, and 2 (1.7%) were incorrectly classified as having certain CP (green and red box plots in Figure 3D). Cerebral palsy risk across 5-second windows for 1 representative infant with CP and 1 representative infant without CP, both classified correctly with high certainty, are shown in Figure 3A and B.

Figure 3. Risk Predictions for Infants in the External Validation Sample.

Figure 3.

A total of 139 infants were included. A and B, cerebral palsy (CP) risk in 5-second (5s) windows is shown on the left, and aggregated CP risk across the total video is shown on the right. The dashed horizontal line represents aggregated CP risk. Both of the representative infants at high risk were classified correctly with high classification certainty. C and D, Distribution of individual CP risk and box plots of classification uncertainties of the 70 artificial expert predictions among infants at high risk with and without CP. The dots indicate outlier points. In C, the x-axis displays the Gross Motor Function Classification System level (with levels I, II, and III indicating ambulatory CP and levels IV and V indicating nonambulatory CP) and the CP subtype (spastic unilateral [UL] or spastic bilateral [BL]) at the time of diagnosis. In the box plots, the dashed blue horizontal lines represent aggregated CP risk, the solid black horizontal lines represent median CP risk across artificial experts, lower and upper edges represent IQR, and whiskers represent range (or 1.5 times the IQR). The dashed horizontal line running across each graph represents the decision threshold. Red represents certain classification of CP, orange represents uncertain classification of CP, yellow represents uncertain classification of no CP, and green represents certain classification of no CP. NA indicates not available.

The deep learning–based CP prediction method had higher sensitivity (ie, a greater percentage of infants higher than the decision threshold) among infants with nonambulatory CP (100%; 95% CI, 63.1%-100%) vs ambulatory CP (58.3%; 95% CI, 27.7%-84.8%; P = .02) and among infants with spastic bilateral CP (92.3%; 95% CI, 64.0%-99.8%) vs spastic unilateral CP (42.9%; 95% CI, 9.9%-81.6%; P < .001) (Figure 3C). A significantly higher estimated CP risk was observed among infants with nonambulatory motor function (median [IQR], 0.90 [0.75-0.93]) vs ambulatory motor function (median [IQR], 0.45 [0.24-0.78]; P = .007) and among infants with spastic bilateral CP (median [IQR],0.85 [0.55-0.92]) vs spastic unilateral CP (median [IQR], 0.26 [0.23-0.56]; P = .03) (Figure 4).

Figure 4. Cerebral Palsy (CP) Risk Among Infants in the External Validation Sample With Different Outcomes.

Figure 4.

Distribution of CP risk across 139 infants. In box plots, the solid black horizontal lines represent median aggregated CP risk, lower and upper edges represent IQR, and whiskers represent range (or 1.5 times the IQR). The dots indicate outlier points. The dashed black horizontal lines represent the decision threshold. Gross Motor Function Classification System levels I through III indicate ambulatory CP, and levels IV and V indicate nonambulatory CP.

Discussion

In this prognostic study, a fully automated deep learning method for early CP prediction had high predictive accuracy in an external sample of infants from different countries with heterogenous perinatal risk factors and variation in video quality. Furthermore, the deep learning method differentiated between infants who developed ambulatory vs nonambulatory CP as well as spastic unilateral vs spastic bilateral CP. This novel method may support decisions in early pediatric care by initiating targeted interventions to improve function, prevent complications, and individualize follow-up. This prediction method is a substantial improvement compared with previously published conventional machine learning–based CP prediction methods.28

The improved predictive accuracy of the deep learning method compared with the conventional machine learning method used in a previous study28 may have occurred for several reasons. First, the deep learning method has the capacity to identify intricate associations in the data because it processes data in several layers. This ability suggests that deep learning could handle the high complexity and variation observed in infants’ spontaneous movements. Second, manual selection of movement features, which is required when using conventional machine learning methods,25,26 is eliminated by the automatic feature detection of deep learning methods. Although previous studies have also used deep learning methods for classification of infant motor function,38,39,40,41 these studies have been limited by small samples and lack of external validation and have used surrogate outcomes for CP.18,21,29

More studies are needed to identify which movement features the deep learning method selects as relevant for CP. A step toward this identification could be localization of movement features within skeleton sequences (eFigure 3 in the Supplement).40,42 In the present study, we did not investigate whether the deep learning method used features associated with fidgety movements, other movements, and postural patterns in the early motor repertoire (eg, kicking and body symmetry)43 or as yet unidentified patterns of movement.

The comparable performance of the deep learning method vs the observational GMA tool in this study may reflect an upcoming paradigm shift in early prediction of CP. A recent review by Silva et al18 highlighted the fact that adoption of automated CP prediction in clinical practice has been restricted because existing machine learning methods lack the predictive accuracy of the GMA. The feasibility of home-based smartphone recordings22,44,45,46,47 and associated infant motion tracking37 may be combined with the proposed deep learning method to obtain a fully automated system for clinical decision support.

The sensitivity of the observational GMA tool was lower than reported in some previous reviews13,48 but similar to findings of other studies.49,50 A sensitivity level that was lower than commonly reported4 may, at least in part, be explained by the classification of sporadic fidgety movements as normal. This approach contrasted with the classification method taught in courses by the General Movements Trust, but it may increase the accuracy and positive predictive value of the GMA, as reported in a previous study.31 Furthermore, a single assessment at approximately 12 weeks’ corrected age may have had a role in the lower sensitivity observed in the present study compared with the sensitivity levels reported by studies performing several assessments throughout the fidgety movement period.9,51

The present study included infants recruited from several sites based on a variety of risk factors for perinatal brain injury.27,31,32,33 Despite the diverse set of risk factors and clinical characteristics of infants, the prevalence of CP in each diagnostic group matched numbers found in the literature.52,53,54 This consistency suggests that the results are generalizable to clinical follow-up programs for infants who were previously in the neonatal intensive care unit based on an increased risk of adverse neurodevelopment.

Limitations

This study has several limitations. The use of a separate data set for method development limits the number of infants with CP who can be included in the assessment of external validity. This smaller sample limits the possibility of performing subgroup analyses of CP subtypes and GMFCS levels. Further research could assess the validity of the deep learning method on different types of CP and separate medical risk factors. Few children were assessed for CP before age 2 years, which may have resulted in lack of identification of several children with mild phenotypes. Short follow-up duration may also have produced less accurate GMFCS classification because of lower reliability among children younger than 2 years.55 However, inaccurate GMFCS classification of a few children is unlikely to change the general interpretation of results because classification rarely changes from ambulatory CP to nonambulatory CP and vice versa.55

The present study included videos recorded using a standardized setup; therefore, the deep learning–based CP prediction method requires validation using home-based smartphone recordings. The prediction model may also be refined by extending the skeleton sequence to include facial expressions and fine motor function in fingers and toes that may be associated with CP43 and by including techniques, such as temporal attention,40 to enable varying influence of the CP risk of different 5-second windows in the skeleton sequence.

Conclusions

In this prognostic study, the novel deep learning–based CP prediction model had predictive accuracy comparable with GMA results among an external multicenter sample of infants at high risk. The predictive model also differentiated between infants with ambulatory vs nonambulatory CP and infants with spastic unilateral vs spastic bilateral CP. A fully automated movement analysis for CP prediction may serve as an important decision support for clinicians caring for infants at high risk.18,21 Future research is needed to identify specific movement biomarkers associated with CP outcome and facilitate widespread clinical use.

Supplement.

eAppendix 1. Related Previously Published Articles

eAppendix 2. Characteristics of Included Infants

eAppendix 3. Deep Learning–Based CP Prediction Model

eAppendix 4. Decision Thresholds

eTable 1. Characteristics of Infants in Data Sets

eTable 2. Internal Validation of Deep Learning Method

eTable 3. Search Space of 20 Architectural Choices

eTable 4. Characteristics of Architectures Obtained by K-Best Search

eTable 5. Predictive Values of Decision Thresholds on External Validation

eFigure 1. Overall Architecture of Graph Convolutional Networks

eFigure 2. ROC Curves on External Validation

eFigure 3. Class Activation Mapping

eReferences

References

  • 1.Novak I, Hines M, Goldsmith S, Barclay R. Clinical prognostic messages from a systematic review on cerebral palsy. Pediatrics. 2012;130(5):e1285-e1312. doi: 10.1542/peds.2012-0924 [DOI] [PubMed] [Google Scholar]
  • 2.Rosenbaum P, Paneth N, Leviton A, et al. A report: the definition and classification of cerebral palsy April 2006. Dev Med Child Neurol Suppl. 2007;109:8-14. [PubMed] [Google Scholar]
  • 3.Palisano R, Rosenbaum P, Walter S, Russell D, Wood E, Galuppi B. Development and reliability of a system to classify gross motor function in children with cerebral palsy. Dev Med Child Neurol. 1997;39(4):214-223. doi: 10.1111/j.1469-8749.1997.tb07414.x [DOI] [PubMed] [Google Scholar]
  • 4.Novak I, Morgan C, Adde L, et al. Early, accurate diagnosis and early intervention in cerebral palsy: advances in diagnosis and treatment. JAMA Pediatr. 2017;171(9):897-907. doi: 10.1001/jamapediatrics.2017.1689 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Anderson V, Spencer-Smith M, Wood A. Do children really recover better? neurobehavioural plasticity after early brain insult. Brain. 2011;134(pt 8):2197-2221. doi: 10.1093/brain/awr103 [DOI] [PubMed] [Google Scholar]
  • 6.Morgan C, Fetters L, Adde L, et al. Early intervention for children aged 0 to 2 years with or at high risk of cerebral palsy: international clinical practice guideline based on systematic reviews. JAMA Pediatr. 2021;175(8):846-858. doi: 10.1001/jamapediatrics.2021.0878 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Baird G, McConachie H, Scrutton D. Parents’ perceptions of disclosure of the diagnosis of cerebral palsy. Arch Dis Child. 2000;83(6):475-480. doi: 10.1136/adc.83.6.475 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Guttmann K, Flibotte J, DeMauro SB. Parental perspectives on diagnosis and prognosis of neonatal intensive care unit graduates with cerebral palsy. J Pediatr. 2018;203:156-162. doi: 10.1016/j.jpeds.2018.07.089 [DOI] [PubMed] [Google Scholar]
  • 9.Prechtl HF, Einspieler C, Cioni G, Bos AF, Ferrari F, Sontheimer D. An early marker for neurological deficits after perinatal brain lesions. Lancet. 1997;349(9062):1361-1363. doi: 10.1016/S0140-6736(96)10182-3 [DOI] [PubMed] [Google Scholar]
  • 10.Prechtl HF. Qualitative changes of spontaneous movements in fetus and preterm infant are a marker of neurological dysfunction. Early Hum Dev. 1990;23(3):151-158. doi: 10.1016/0378-3782(90)90011-7 [DOI] [PubMed] [Google Scholar]
  • 11.Einspieler C, Marschik PB, Prechtl HFR. Human motor behavior: prenatal origin and early postnatal development. J Psychol. 2008;216(3):147-153. doi: 10.1027/0044-3409.216.3.147 [DOI] [Google Scholar]
  • 12.Einspieler C, Peharz R, Marschik PB. Fidgety movements—tiny in appearance, but huge in impact. J Pediatr (Rio J). 2016;92(3)(suppl 1):S64-S70. doi: 10.1016/j.jped.2015.12.003 [DOI] [PubMed] [Google Scholar]
  • 13.Bosanquet M, Copeland L, Ware R, Boyd R. A systematic review of tests to predict cerebral palsy in young children. Dev Med Child Neurol. 2013;55(5):418-426. doi: 10.1111/dmcn.12140 [DOI] [PubMed] [Google Scholar]
  • 14.Einspieler C, Yang H, Bartl-Pokorny KD, et al. Are sporadic fidgety movements as clinically relevant as is their absence? Early Hum Dev. 2015;91(4):247-252. doi: 10.1016/j.earlhumdev.2015.02.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Hadders-Algra M. Neural substrate and clinical significance of general movements: an update. Dev Med Child Neurol. 2018;60(1):39-46. doi: 10.1111/dmcn.13540 [DOI] [PubMed] [Google Scholar]
  • 16.Einspieler C, Prechtl HFR. Prechtl’s assessment of general movements: a diagnostic tool for the functional assessment of the young nervous system. Ment Retard Dev Disabil Res Rev. 2005;11(1):61-67. doi: 10.1002/mrdd.20051 [DOI] [PubMed] [Google Scholar]
  • 17.Peyton C, Pascal A, Boswell L, et al. Inter-observer reliability using the general movement assessment is influenced by rater experience. Early Hum Dev. 2021;161:105436. doi: 10.1016/j.earlhumdev.2021.105436 [DOI] [PubMed] [Google Scholar]
  • 18.Silva N, Zhang D, Kulvicius T, et al. The future of general movement assessment: the role of computer vision and machine learning—a scoping review. Res Dev Disabil. 2021;110:103854. doi: 10.1016/j.ridd.2021.103854 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Marcroft C, Khan A, Embleton ND, Trenell M, Plötz T. Movement recognition technology as a method of assessing spontaneous general movements in high risk infants. Front Neurol. 2015;5:284. doi: 10.3389/fneur.2014.00284 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Cabon S, Porée F, Simon A, Rosec O, Pladys P, Carrault G. Video and audio processing in paediatrics: a review. Physiol Meas. 2019;40(2):02TR02. doi: 10.1088/1361-6579/ab0096 [DOI] [PubMed] [Google Scholar]
  • 21.Redd CB, Karunanithi M, Boyd RN, Barber LA. Technology-assisted quantification of movement to predict infants at high risk of motor disability: a systematic review. Res Dev Disabil. 2021;118:104071. doi: 10.1016/j.ridd.2021.104071 [DOI] [PubMed] [Google Scholar]
  • 22.Marschik PB, Pokorny FB, Peharz R, et al. ; BEE-PRI Study Group . A novel way to measure and predict development: a heuristic approach to facilitate the early detection of neurodevelopmental disorders. Curr Neurol Neurosci Rep. 2017;17(5):43. doi: 10.1007/s11910-017-0748-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Stahl A, Schellewald C, Stavdahl Ø, Aamo OM, Adde L, Kirkerød H. An optical flow–based method to predict infantile cerebral palsy. IEEE Trans Neural Syst Rehabil Eng. 2012;20(4):605-614. doi: 10.1109/TNSRE.2012.2195030 [DOI] [PubMed] [Google Scholar]
  • 24.Adde L, Helbostad J, Jensenius AR, Langaas M, Støen R. Identification of fidgety movements and prediction of CP by the use of computer-based video analysis is more accurate when based on two video recordings. Physiother Theory Pract. 2013;29(6):469-475. doi: 10.3109/09593985.2012.757404 [DOI] [PubMed] [Google Scholar]
  • 25.Rahmati H, Martens H, Aamo OM, Stavdahl O, Støen R, Adde L. Frequency analysis and feature reduction method for prediction of cerebral palsy in young infants. IEEE Trans Neural Syst Rehabil Eng. 2016;24(11):1225-1234. doi: 10.1109/TNSRE.2016.2539390 [DOI] [PubMed] [Google Scholar]
  • 26.Orlandi S, Raghuram K, Smith CR, et al. Detection of atypical and typical infant movements using computer-based video analysis. Annu Int Conf IEEE Eng Med Biol Soc. 2018;2018:3598-3601. doi: 10.1109/EMBC.2018.8513078 [DOI] [PubMed] [Google Scholar]
  • 27.Adde L, Helbostad JL, Jensenius AR, Taraldsen G, Grunewaldt KH, Støen R. Early prediction of cerebral palsy by computer-based video analysis of general movements: a feasibility study. Dev Med Child Neurol. 2010;52(8):773-778. doi: 10.1111/j.1469-8749.2010.03629.x [DOI] [PubMed] [Google Scholar]
  • 28.Ihlen EAF, Støen R, Boswell L, et al. Machine learning of infant spontaneous movements for the early prediction of cerebral palsy: a multi-site cohort study. J Clin Med. 2019;9(1):5. doi: 10.3390/jcm9010005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Irshad MT, Nisar MA, Gouverneur P, Rapp M, Grzegorzek M. AI approaches towards Prechtl’s assessment of general movements: a systematic literature review. Sensors (Basel). 2020;20(18):5321. doi: 10.3390/s20185321 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Goodfellow I, Bengio Y, Courville A. Deep Learning. MIT Press; 2016. [Google Scholar]
  • 31.Støen R, Boswell L, de Regnier RA, et al. The predictive accuracy of the general movement assessment for cerebral palsy: a prospective, observational study of high-risk infants in a clinical follow-up setting. J Clin Med. 2019;8(11):1790. doi: 10.3390/jcm8111790 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Pascal A, Govaert P, Ortibus E, et al. Motor outcome after perinatal stroke and early prediction of unilateral spastic cerebral palsy. Eur J Paediatr Neurol. 2020;29:54-61. doi: 10.1016/j.ejpn.2020.09.002 [DOI] [PubMed] [Google Scholar]
  • 33.Aker K, Thomas N, Adde L, et al. Prediction of outcome from MRI and general movements assessment after hypoxic-ischaemic encephalopathy in low-income and middle-income countries: data from a randomised controlled trial. Arch Dis Child Fetal Neonatal Ed. 2022;107(1):32-38. doi: 10.1136/archdischild-2020-321309 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Einspieler C, Prechtl HFR, Bos AF, Ferrari F, Cioni G. Prechtl’s Method on the Qualitative Assessment of General Movements in Preterm, Term and Young Infants. Mac Keith Press; 2004. [DOI] [PubMed] [Google Scholar]
  • 35.Surveillance of Cerebral Palsy in Europe . Surveillance of cerebral palsy in Europe: a collaboration of cerebral palsy surveys and registers. Surveillance of Cerebral Palsy in Europe (SCPE). Dev Med Child Neurol. 2000;42(12):816-824. doi: 10.1017/s0012162200001511 [DOI] [PubMed] [Google Scholar]
  • 36.Groos D, Ramampiaro H, Ihlen EAF. EfficientPose: scalable single-person pose estimation. Applied Intelligence. 2021;51(4):2518-2533. doi: 10.1007/s10489-020-01918-7 [DOI] [Google Scholar]
  • 37.Groos D, Adde L, Støen R, Ramampiaro H, Ihlen EAF. Towards human-level performance on automatic pose estimation of infant spontaneous movements. Comput Med Imaging Graph. 2022;95:102012. doi: 10.1016/j.compmedimag.2021.102012 [DOI] [PubMed] [Google Scholar]
  • 38.McCay KD, Ho ESL, Shum HPH, Fehringer G, Marcroft C, Embleton ND. Abnormal infant movements classification with deep learning on pose-based features. IEEE Access. 2020;8:51582-51592. doi: 10.1109/ACCESS.2020.2980269 [DOI] [Google Scholar]
  • 39.Reich S, Zhang D, Kulvicius T, et al. Novel AI driven approach to classify infant motor functions. Sci Rep. 2021;11(1):9888. doi: 10.1038/s41598-021-89347-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Nguyen-Thai B, Le V, Morgan C, Badawi N, Tran T, Venkatesh S. A spatio-temporal attention-based model for infant movement assessment from videos. IEEE J Biomed Health Inform. 2021;25(10):3911-3920. doi: 10.1109/JBHI.2021.3077957 [DOI] [PubMed] [Google Scholar]
  • 41.Shin HL, Shin HI, Bang MS, et al. Deep learning–based quantitative analyses of spontaneous movements and their association with early neurological development in preterm infants. Sci Rep. 2022;12(1):3138. doi: 10.1038/s41598-022-07139-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Song YF, Zhang Z, Shan C, Wang L. Constructing stronger and faster baselines for skeleton-based action recognition. IEEE Trans Pattern Anal Mach Intell. Published online March 7, 2022. doi: 10.1109/TPAMI.2022.3157033 [DOI] [PubMed] [Google Scholar]
  • 43.Einspieler C, Bos AF, Krieber-Tomantschger M, et al. Cerebral palsy: early markers of clinical phenotype and functional outcome. J Clin Med. 2019;8(10):1616. doi: 10.3390/jcm8101616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Adde L, Brown A, van den Broeck C, et al. In-motion–app for remote general movement assessment: a multi-site observational study. BMJ Open. 2021;11(3):e042147. doi: 10.1136/bmjopen-2020-042147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kwong AKL, Eeles AL, Olsen JE, Zannino D, Kariotis T, Spittle AJ. Instructional guides for filming infant movements at home are effective for the General Movements Assessment. J Paediatr Child Health. 2022;58(5):796-801. doi: 10.1111/jpc.15838 [DOI] [PubMed] [Google Scholar]
  • 46.Spittle AJ, Olsen J, Kwong A, et al. The Baby Moves prospective cohort study protocol: using a smartphone application with the general movements assessment to predict neurodevelopmental outcomes at age 2 years for extremely preterm or extremely low birthweight infants. BMJ Open. 2016;6(10):e013446. doi: 10.1136/bmjopen-2016-013446 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Svensson KA, Örtqvist M, Bos AF, Eliasson AC, Sundelin HE. Usability and inter-rater reliability of the NeuroMotion app: a tool in general movements assessments. Eur J Paediatr Neurol. 2021;33:29-35. doi: 10.1016/j.ejpn.2021.05.006 [DOI] [PubMed] [Google Scholar]
  • 48.Kwong AKL, Fitzgerald TL, Doyle LW, Cheong JLY, Spittle AJ. Predictive validity of spontaneous early infant movement for later cerebral palsy: a systematic review. Dev Med Child Neurol. 2018;60(5):480-489. doi: 10.1111/dmcn.13697 [DOI] [PubMed] [Google Scholar]
  • 49.Constantinou JC, Adamson-Macedo EN, Mirmiran M, Fleisher BE. Movement, imaging and neurobehavioral assessment as predictors of cerebral palsy in preterm infants. J Perinatol. 2007;27(4):225-229. doi: 10.1038/sj.jp.7211664 [DOI] [PubMed] [Google Scholar]
  • 50.Datta AN, Furrer MA, Bernhardt I, et al. ; GM Group . Fidgety movements in infants born very preterm: predictive value for cerebral palsy in a clinical multicentre setting. Dev Med Child Neurol. 2017;59(6):618-624. doi: 10.1111/dmcn.13386 [DOI] [PubMed] [Google Scholar]
  • 51.Bruggink JLM, Einspieler C, Butcher PR, Van Braeckel KNJA, Prechtl HFR, Bos AF. The quality of the early motor repertoire in preterm infants predicts minor neurologic dysfunction at school age. J Pediatr. 2008;153(1):32-39. doi: 10.1016/j.jpeds.2007.12.047 [DOI] [PubMed] [Google Scholar]
  • 52.Grunt S, Mazenauer L, Buerki SE, et al. Incidence and outcomes of symptomatic neonatal arterial ischemic stroke. Pediatrics. 2015;135(5):e1220-e1228. doi: 10.1542/peds.2014-1520 [DOI] [PubMed] [Google Scholar]
  • 53.Zhang S, Li B, Zhang X, Zhu C, Wang X. Birth asphyxia is associated with increased risk of cerebral palsy: a meta-analysis. Front Neurol. 2020;11:704. doi: 10.3389/fneur.2020.00704 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hafström M, Källén K, Serenius F, et al. Cerebral palsy in extremely preterm infants. Pediatrics. 2018;141(1):e20171433. doi: 10.1542/peds.2017-1433 [DOI] [PubMed] [Google Scholar]
  • 55.Gorter JW, Ketelaar M, Rosenbaum P, Helders PJM, Palisano R. Use of the GMFCS in infants with CP: the need for reclassification at age 2 years or older. Dev Med Child Neurol. 2009;51(1):46-52. doi: 10.1111/j.1469-8749.2008.03117.x [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement.

eAppendix 1. Related Previously Published Articles

eAppendix 2. Characteristics of Included Infants

eAppendix 3. Deep Learning–Based CP Prediction Model

eAppendix 4. Decision Thresholds

eTable 1. Characteristics of Infants in Data Sets

eTable 2. Internal Validation of Deep Learning Method

eTable 3. Search Space of 20 Architectural Choices

eTable 4. Characteristics of Architectures Obtained by K-Best Search

eTable 5. Predictive Values of Decision Thresholds on External Validation

eFigure 1. Overall Architecture of Graph Convolutional Networks

eFigure 2. ROC Curves on External Validation

eFigure 3. Class Activation Mapping

eReferences


Articles from JAMA Network Open are provided here courtesy of American Medical Association

RESOURCES