Skip to main content
. 2021 Mar;110:103854. doi: 10.1016/j.ridd.2021.103854

Table 3.

Challenges, limitations, and future directions of different computer vision sensing and data processing approaches.

Current approaches / Problems Future directions / Improvements/ Challenges
Sensors Current approaches
Mostly 2D single cameras
Problems
Only 2D information
Occlusions
Future directions
Multiple 2D cameras
3D (depth) sensors
Pressure mat sensors
Improvements
3D information
Less occlusions
More information due to multi-sensory integration
Data Current approaches
Small datasets
Problems
Not enough data to employ deep learning methods
Not publicly available – no benchmarking possible
Incorrect or incomplete data in some cases, e.g., inaccurate outcome labelling due to lack of longitudinal studies, the inclusion of incorrect age-specificity cases, use of low-inter-rater agreement or small rater-group or lack of experienced raters in data labelling, disorders or gender misrepresentation
Future directions
Collect more data
Make it publicly available
Make use of home videos
Employ DL methods
Make use of transfer learning (e.g., Tan et al., 2018)
Challenges
Need to solve anonymisation issue (automated techniques for face detection and replacement can be applied)
Development of methods which can cope with different light conditions, resolution, frame rate
Body areas of interest Current approaches
Mostly movement of arms, legs, head
Problems
Incomplete information of full-body movement
Future directions
Hand, fingers, feet
Eye movement data
Mimic
Challenges
Integration and analysis of multimodal information
Motion tracking Current approaches
Mostly in 2D space
Problems
Only 2D information
Future directions
Full-body tracking in 3D using well-established methods in DL (e.g., DeepLabCut and OpenPose frameworks)
Challenges
DL methods need to be adapted to infants
Motion encoding Current approaches
Conventional features based on: displacement, distance, velocity, acceleration, speed, and time
Problems
Only 2D features
Future directions
Motion encoding using well-established methods from robotics: Dynamic Movement Primitives (e.g., Ijspeert, Nakanishi, Hoffmann, Pastor, & Schaal, 2013), Gaussian Mixture Models (e.g., Calinon, 2016; Khansari-Zadeh & Billard, 2011), Probabilistic Movement Primitives (Paraschos, Daniel, Peters, & Neumann, 2018)
Learn features from expert knowledge during observation (e.g., Silva et al., 2018, 2019)
Improvements
3D features
New motion encoding and features
Classification algorithms Current approaches
Conventional ML methods, e.g., SVM, Decision Trees, Neural Networks, Hidden Markov Models
Supervised learning without feedback during learning
Future directions
Employ ANN, DL
Employ Interactive Machine Learning (learning with feedback)
Improvements
Better models with more accurate predictions
Challenges
More data is needed