Translational Psychiatry. 2020 Sep 30;10:333. doi: 10.1038/s41398-020-01015-w

Computer vision in autism spectrum disorder research: a systematic review of published studies from 2009 to 2019

Ryan Anthony J de Belen 1, Tomasz Bednarz 1, Arcot Sowmya 2, Dennis Del Favero 1
PMCID: PMC7528087  PMID: 32999273

Abstract

The current state of computer vision methods applied to autism spectrum disorder (ASD) research has not been well established. Increasing evidence suggests that computer vision techniques have a strong impact on autism research. The primary objective of this systematic review is to examine how computer vision analysis has been useful in ASD diagnosis, therapy and autism research in general. A systematic review of publications indexed on PubMed, IEEE Xplore and ACM Digital Library was conducted from 2009 to 2019. Search terms included [‘autis*’ AND (‘computer vision’ OR ‘behavio* imaging’ OR ‘behavio* analysis’ OR ‘affective computing’)]. Results are reported according to the PRISMA statement. A total of 94 studies are included in the analysis. Eligible papers are categorised based on the potential biological/behavioural markers quantified in each study. Then, the different computer vision approaches employed in the included papers are described. Different publicly available datasets are also reviewed in order to rapidly familiarise researchers with datasets applicable to their field and to accelerate both new behavioural and technological work on autism research. Finally, future research directions are outlined. The findings in this review suggest that computer vision analysis is useful for the quantification of behavioural/biological markers, which can further lead to a more objective analysis in autism research.

Subject terms: Diagnostic markers, Human behaviour, Autism spectrum disorders

Introduction

Visual observation and analysis of children’s natural behaviours are instrumental to the early detection of developmental disorders, including autism spectrum disorder (ASD). While a gold standard observational tool is available, there are limitations that hinder the early screening of ASD in children. Interpretative coding of child observations, parent interviews and manual testing1 are costly and time-consuming2. In addition, the reliability and validity of the results obtained from a clinician’s observations can be subjective3, arising from differences in professional training, resources and cultural context. Furthermore, behavioural ratings typically do not capture data from the children in their natural environments. Such limitations, combined with rising incidence rates, call for the development of new methods of ASD diagnosis without compromising accuracy, in order to reduce waiting periods for access to care. This is critical as diagnosis and intervention within the first few years of life can provide long-term improvements for the child and can even have a greater effect on outcomes4.

Early behavioural risk markers of ASD have been discovered with the help of retrospective analysis of home videos5–7. Research studies have documented ASD-related behavioural markers that emerge within the first months of life; these include diminished social engagement and joint attention8,9, atypical visual attention such as difficulty during the response-to-name protocol10, longer latencies to disengage from a stimulus when multiple stimuli are presented11, and non-smooth visual tracking12. Furthermore, children with ASD may exhibit atypical social behaviours such as decreased attention to social scenes, decreased frequency of gaze to faces13 and decreased expression of emotion. In addition, evidence suggests that differences in motor control are an early feature of ASD14–17.

Over the past decade, computer vision has been used in the field of automated medical diagnosis as it can provide unobtrusive objective information on a patient’s condition. A recent finding has shown that utilising computer vision methods to automatically detect symptoms can pre-diagnose over 30 conditions18. For example, computer vision-based facial analysis can be used to monitor vascular pulse, assess pain, detect facial paralysis, diagnose psychiatric disorders and even distinguish ASD individuals from individuals with typical development (TD) through behaviour imaging19. The main rationale for using computer vision for a clinical purpose would be to remove any potential bias, develop a more objective approach to analysis, increase trust towards diagnosis, as well as decrease errors related to human factors in the decision-making process. Furthermore, computer vision-based systems provide a low-cost and non-invasive approach, potentially reducing healthcare expenditures when compared to medical examinations.

Computer vision techniques have been effectively exploited in recent years to automatically and consistently assess existing ASD biomarkers, as well as to discover new ones20. To further examine how computer vision has been useful in ASD research, a systematic review of published studies was conducted on computer vision techniques for ASD diagnosis, therapy and autism research in general. First, eligible papers are categorised based on the quantified behavioural/biological markers. In addition, different publicly available ASD datasets suitable for computer vision research are reviewed. Finally, promising research directions are outlined. To this end, this systematic review can serve as an effective summary resource that researchers can consult when developing computer vision-based assessment tools for automatically quantifying ASD-related markers.

Materials and methods

Eligibility criteria

All titles and abstracts were initially screened to include studies that meet the following inclusion criteria: (1) the study focussed on autism in humans (i.e. animal studies were excluded); (2) the study mainly focussed on the use of computer vision techniques in autism diagnosis, autism therapy or autism research in general; (3) the study explained how behavioural/biological markers can be automatically quantified; and (4) the study included an experiment, a pilot study or a trial with at least one group of individuals with ASD. Finally, results in the form of review, meta-analysis, keynote, narrative, editorial or magazine were excluded.

Search process

An electronic database search of PubMed, IEEE Xplore and ACM Digital Library was conducted by including simple terms and Medical Subject Headings terms for keywords [‘autis*’ AND (‘computer vision’ OR ‘behavio* imaging’ OR ‘behavio* analysis’ OR ‘affective computing’)] in all fields (title, abstract, keywords, full text and bibliography) from January 1, 2009 to December 31, 2019. A snowballing approach was also conducted to identify additional papers. Included peer-reviewed articles followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement21. Duplicates were removed and the title and abstract of each article were scanned for relevance. The full text of potentially relevant studies was assessed for eligibility considering established criteria detailed above. A PRISMA flow diagram was constructed and is shown in Appendix A.

Data items and analysis

Identical variables in eligible studies were extracted where possible into an Excel spreadsheet: (1) quantified behavioural/biological markers; (2) application focus; (3) child diagnosis and size of participants’ groups; (4) age range of the participants or age mean and standard deviation; (5) input data and devices used; (6) computer vision method applied in the study; and (7) dataset used in the study. The 94 eligible studies were categorised based on the behavioural/biological markers that were quantified.

Results

Overview of behavioural/biological markers used in eligible papers

The findings in this survey show that there is an increase in the number of significant contributions of computer vision methods to autism research. Over the last decade, computer vision has been used to capture and quantify different information, such as: (a) magnetic resonance imaging (MRI)/functional MRI (see Table 1), (b) facial expression/emotion (see Table 2), (c) eye gaze data (see Table 3), (d) motor control/movement pattern (see Table 4), (e) stereotyped behaviours (see Table 5) and (f) multimodal data (see Table 6). Identical variables (discussed in ‘Data items and analysis’) were reported for each type of quantified information.

Table 1.

Magnetic resonance imaging (MRI)/functional MRI (fMRI).

Reference Focus N participants Age Input data/device used Method used Dataset
Samson et al.22 fMRI to study the neural bases of complex non-social sound processing 15 ASD, 13 TD ASD: 24.3 ± 6.25, TD: 23.5 ± 7.42 fMRI scans/3 T TRIO MRI system Image processing/ICBM152 (MNI) space and 3D Gaussian Filtering Own dataset
Abdelrahman et al.23 MRI for diagnosis 14 ASD, 28 TD 7-38 years MRI scans/1.5 T Sigma MRI scanner Mesh processing Own dataset
Durrleman et al.24 MRI for biomarker detection 51 ASD, 25 TD and developmentally delayed children 18-35 months MRI, 1.5-T GE Signa MRI scanner 122
Ahmadi et al.25 fMRI for biomarker detection 24 ASD, 27 TD MRI scans/3T MRI scanner Machine learning, independent component analysis Own dataset
Chaddad et al.26 MRI for biomarker detection 34 ASD, 30 TD 4-24 years MRI scans/3T MRI scanner Texture analysis ABIDE I dataset
Chaddad et al.27 MRI for biomarker detection 539 ASD, 573 TD ASD: 17.01 ± 8.36, TD: 17.08 ± 7.72 MRI scans Texture analysis ABIDE I dataset
Eslami and Saeed28 fMRI for diagnosis 187 ASD, 183 TD fMRI scans Deep learning, MLP with 2 hidden layers + SVM Four datasets (NYU, OHSU, USM, UCLA) from ABIDE-I fMRI dataset
Li et al.29 fMRI for diagnosis 149 ASD, 161 TD rs-fMRI scans Deep learning/SSAE 4 datasets (UM, UCLA, USM, LEUVEN) from ABIDE MRI dataset
Crimi et al.30 fMRI for diagnosis 31 ASD, 23 TD Imaging data, GE 3T MR750 scanner Machine Learning/Constrained Autoregressive Model San Diego State University cohort of ABIDE II dataset
Chanel et al.32 fMRI for diagnosis 15 ASD, 14 TD ASD: 28.6 ± 1.87, TD: 31.6 ± 2.61 fMRI/3T MRI scanner Machine learning/SVM Own dataset
Zheng et al.34 MRI for biomarker detection 66 ASD, 66 TD MRI scans multi-feature-based networks (MFN) and SVM 4 datasets (NYU, SBL, KUL, ISMMS) from ABIDE database

Table 2.

Facial expression/emotion.

Reference Focus N participants Age Input data/device used Method used Dataset
Leo et al.47 Facial expression for quantitative assessment 17 ASD, 10 TD 6–13 years Image sequences Deep learning Own dataset
Kalantarian et al.36 Facial emotion for mobile games 8 ASD 6–12 years Mobile phone Ensemble classification (AWS + Sighthound + Azure) Own dataset
Kalantarian et al.37 Facial expression for quantitative assessment 8 ASD, 5 TD ASD: 8.5 ± 1.85, TD: 4.4 ± 0.54 (in years) Video, mobile phone Histogram of Oriented Gradients (HOG) + SVM Own dataset
Han et al.38 Emotional expression recognition 25 ASD Camera Deep learning, CNN 128,129
Tang et al.39 Automatic smile detection 11 ASD, 23 TD 6–24 months Video, two wireless cameras Deep learning, CNN GENKI-4K, CelebA132, RCLA&NBH Smile
Daniels et al.40 Emotion recognition for assistive technology 23 ASD, 20 TD 6–17 years Google Glass n/a
Jazouli et al.41 Emotion recognition for assistive technology 10 ASD 3D image, Microsoft Kinect Own dataset
Washington et al.42 Emotion recognition for assistive technology 14 ASD 9.57 months [3.37, 4–15] Video/Google Glass and mobile phone Machine learning, Histogram of Oriented Gradients (HOG) + SVM 128,139–143
Voss et al.43 Emotion recognition for assistive technology 20 ASD, 20 TD Video/Google Glass and mobile phone Machine learning, Histogram of Oriented Gradients (HOG) + SVM n/a
Vahabzadeh et al.44 Emotion recognition for assistive technology 8 ASD 11.7–20.5 years Video, Google Glass n/a
Leo et al.45 Emotion recognition for behaviour monitoring 3 ASD Video, Robokind R25 Robot 128
Pan et al.46 Facial emotion for behaviour analysis 2 ASD Video, NAO robot Own dataset
Coco et al.48 Facial expression analysis for diagnosis 5 ASD, 5 TD 65.38 months [15.86, 48–65 months] Video, webcam Deep learning, Histogram of Oriented Gradients (HOG) feature combined with a linear classifier, CNN DISFA [24], SEMAINE [26] and BP4D [34] datasets.
Leo et al.49 Facial expression for quantitative assessment 17 ASD 6–13 years Image sequences Deep learning Own dataset
Samad et al.50 3D facial imaging for physiology-based impairment detection 8 ASD, 8 TD 7–20 years 3D images, high resolution 3D facial imaging sensor, 3dMD n/a
Leo et al.51 Facial expression recognition for assistive technology 1 ASD, 1 TD Video Deep learning, Facial Action Coding System (FACS) Own dataset
Guha et al.52 Facial expression for quantitative assessment 20 ASD, 19 TD 9–14 years Motion capture data, 6 infra-red motion-capture cameras Deep learning, Facial Action Coding System (FACS) Own dataset
Ahmed and Goodwin53 Facial expression for predicting engagement and learning performance 7 ASD 8–19 years Video, camera Computer Expression Recognition Toolbox Own dataset
Harrold et al.54 Facial expression for assistive technology 2 ASD, 4 TD 8–10 years Video, Apple iPad n/a
Harrold et al.55 Facial expression for assistive technology 2 ASD, 4 TD 8–10 years Video, Apple iPad n/a
White et al.56 Facial emotion expression and recognition 20 ASD, 20 TD 9–12 years 3D data, Microsoft Kinect n/a
Garcia-Garcia et al.57 Facial expression for learning emotional intelligence 3 ASD 8–10 years Video, mobile phone Affectiva SDK n/a
Jain et al.58 Facial expression recognition for assistive technology 6 ASD 5–12 years Video, webcam 128
Li et al.59 Facial attributes for ASD classification 49 ASD, 39 TD Video, Apple iPad Deep learning, CNN Training: AffectNet133 and EmotioNet134; evaluation: Own dataset

Table 3.

Eye Gaze Data.

Reference Focus N participants Age Input data/device used Method Dataset
Pierce et al.65 Biomarker detection 444 subjects from 6 distinct groups Eye tracking data, Tobii T120 eye tracker Own dataset
Murias et al.66 Biomarker detection 25 ASD 24–72 months Eye tracking data, Tobii TX300 eye tracker Own dataset
Chawarska et al.67 Eye movement to determine prodromal symptoms of ASD 84 ASD 6 months Gaze trajectories, SensoMotoric Instruments IView X RED eye-tracking system Own dataset
Shi et al.68 Visual stimuli design consideration 13 ASD, 20 TD 4–6 years Infra-red eye-tracking recording, EyeLink1000 Own dataset
Shic et al.69 Visual attention preference 28 ASD, 16 DD, 34 TD 20 months Gaze patterns, SMI iView X™ RED dark-pupil 60 Hz eye-tracking system Own dataset
Liu et al.73 Eye movement for diagnosis 29 ASD, 58 TD 4–11 years Gaze data, Tobii T60 eye tracker Machine learning, k nearest neighbours (kNN) Own dataset
Tung et al.61 Eye detection 33 ASD Video, camera Own dataset
Balestra et al.62 Eye tracking to study language impairments and text comprehension and production deficits 1 ASD 25 years Eye tracking data, Tobii 1750 eye tracker n/a
Li et al.63 Identification of fixations and saccades 38 ASD, 179 TD Eye-tracking data Modified DBSCAN Algorithm Own dataset
Matthews et al.64 Eye gaze analysis for affective state recognition 19 ASD, 19 TD ASD: 41.05 ± 32.15, TD: 32.15 ± 9.93 (in years) Video, Gazepoint GP3 eye-tracker Scanpath trend analysis and arousal sensing and detection of focal attention n/a
Campbell et al.70 Gaze pattern for saliency analysis 15 ASD, 13 TD 8–43 months Gaze trajectories, SensoMotoric Instruments iView XRED eye-tracking system Bayesian model n/a
Syeda et al.71 Eye gaze for visual face scanning and emotion analysis 21 ASD, 21 TD 5–17 years Gaze data, Tobii EyeX controller Own dataset
Chrysouli et al.72 Eye gaze analysis for affective state recognition Video, Kinect camera Deep learning, two-stream CNN MaTHiSis
Liu et al.74 Eye movement for diagnosis Children: 20 ASD, 21 TD; adults: 19 ASD, 22 intellectually disabled (ID), 28 TD Children: ASD: 7.85 ± 1.59, TD: 7.73 ± 1.51; adults: ASD: 20.84 ± 3.27, ID: 23.59 ± 3.08, TD: 20.61 ± 2.90 Eye tracking data, Tobii T60 eye tracker Bag-of-Words (BOW) framework and SVM 144,145
Vu et al.75 Gaze pattern for diagnosis 16 ASD, 16 TD 2–10 years Gaze data, Tobii EyeX controller Machine learning, similarity matching + kNN Own dataset
Jiang and Zhao76 Visual attention preference for diagnosis 20 ASD, 19 TD ASD: 30.8 ± 11.1, TD: 32 ± 10.4 (in years) Eye tracking data Deep learning 146
Higuchi et al.77 Gaze direction for behaviour analysis 2 ASD, 2 TD Video, camera OpenFace Toolkit Own dataset
Chong et al.78 Eye contact detection for behaviour analysis 50 ASD, 50 TD Videos, Deep learning Own dataset (subset is from MMDB)
Toshniwal et al.79 Attention recognition for assistive technology 10 ASD, 8 NDD 12–18 years Video, Mobile phone Android Face Detection API Own dataset

Table 4.

Motor control/movement pattern.

Reference Focus N participants Age Input data/device used Method used Dataset
Dawson et al.80 Head movement for digital phenotyping 22 ASD, 82 TD 16–31 months Video, iPad Intraface, model-based object pose Not publicly available
Martin et al.81 Head movement analysis 21 ASD, 21 TD 2.5–6.5 years old Video, camera Zface to track pitch, yaw, and roll of head movement Not publicly available
Zunino et al.82 Grasping actions for diagnosis 20 ASD, 20 TD ASD: 9.8 years, TD: 9.5 years Video, Vicon VUE video camera Deep learning, CNN + LSTM Publicly available
Vyas et al.83 Motion pattern for diagnosis Video, Mobile Phone R-CNN From NODA programme of Behaviour Imaging company
Piana et al.84 Body movement for emotional training 10 ASD Mean age: 9.6 years old Video and motion capture data, Microsoft Kinect v2 n/a
Bartoli et al.85 Movement pattern analysis for game-based therapy 5 ASD 10–12 years old Video, Microsoft Xbox 360 Kinect n/a
Ringland et al.86 Movement pattern analysis to support therapeutic tool 15 with neurodevelopmental disorder 10–14 years old Video, Microsoft Kinect n/a
Magrini et al.87 Gesture tracking for music therapy 4 ASD 5–7 years old Video, camera n/a
Dickstein-Fischer and Fischer88 Robot-assisted therapy Video, Penguin for Autism Behavioural Interventions (PABI) n/a
Bekele et al.89 Head movement analysis for assistive technology 6 ASD, 6 TD ASD: 4.70 ± 0.70, TD: 4.26 ± 1.05 Video, NAO Robot with 2 vertical stereo cameras Image processing n/a
Dimitrova et al.90 Movement analysis for assistive technology 7–9 years old Video, webcam n/a

Table 5.

Stereotyped behaviours.

Reference Focus N participants Age Input data/device used Method used Dataset
Hashemi et al.95 Behaviour analysis 6 ASD, 14 TD 16–30 months Video, iPad IntraFace Own dataset
Hashemi et al.92 Sharing interest, visual tracking, and disengagement of attention detection 3 ASD, 3 TD 6–15 months Video, two GoPro HD cameras Histogram of Orientated Gradients (HOG) and SVM Own dataset
Hashemi et al.93 Behaviour analysis 12 ASD 5–16 months GoPro Hero HD HOG and SVM Own dataset
Bidwell et al.94 Behaviour analysis 15–30 months Video, Camera and Microsoft Kinect Omron OKAO Vision Library Multimodal Dyadic Behaviour (MMDB) Dataset
Campbell et al.96 Atypical orienting and attention behaviours for behavioural observation 22 ASD, 82 TD or DD 16–31 months Tablet Device IntraFace Own dataset
Hashemi et al.97 Engagement, name-call responses, and emotional responses 15 ASD, 18 TD 16–31 months Video, iPad IntraFace Own dataset
Wang et al.98 Attention monitoring for diagnosis 5 ASD, 12 TD Video, two RGB cameras Microsoft SDK Own dataset
Bovery et al.99 Attention monitoring for behavioural assessment 22 ASD, 82 TD 16–31 months Video, iPad IntraFace Own dataset
Rajagopalan and Goecke100 Self-stimulatory behaviour detection YouTube videos Histogram of Dominant Motions (HDM) Self-stimulatory Behaviour Dataset, UCF101 and Weizmann Datasets
Rajagopalan et al.101 Self-stimulatory behaviour detection YouTube videos Space Time Interest Points (STIP) with Harris3D detectors in a BOW framework Self-stimulatory Behaviour Dataset
Rajagopalan102 Self-stimulatory behaviour detection YouTube videos Motion trajectories Self-stimulatory Behaviour Dataset, UCF101, Hollywood 2 Datasets
Winoto et al.103 Behaviour analysis 4 ASD, 4 TD Microsoft Kinect v2
Feil-Seifer and Matarić104 Interaction with robots for behaviour analysis 8 ASD Video, camera Heuristics Own dataset
Moghadas and Moradi105 Interaction with robots for diagnosis 8 ASD, 8 TD ASD: 2.1–4.1 years, TD: 2.11–7.6 years Video, RobotParrot and two cameras Kernelised Correlation Filter (KCF) and cosine similarity and SVM Own dataset

Table 6.

Multimodal data.

Reference Focus N participants Age Input data/device used Method used Dataset
Egger et al.114 Emotion and attention analysis 16–30 months old Video, mobile phone IntraFace and95 BU-3D Facial Expression135 and Cohn-Kanade(CK)127
Rudovic et al.110 Autism therapy 35 ASD 3–13 years old Synchronised video recordings of facial expressions, head and body movements, pose, and gestures, audio recordings, and autonomic physiology Deep learning, Personalised Perception of Affect Network (PPA-net) Own dataset - multimodal data set of children with ASC (MDCA)147
Chen and Zhao106 Attentional and image-viewing preference for diagnosis Photo taking: 22 ASD and 23 controls image-viewing: 20 ASD and 19 controls Photo sequence + Image and Eye fixations Deep learning, ResNet-50 and LSTM Own dataset (photos and eye-tracking data) and124
Wang et al.107 Mutual gaze and gesture recognition for diagnosis 2 ASD, 6 TD Children: mean 25 months; adults: mean 25 years Image/Two Logitech BRIO 4K Pro RGB cameras + Microsoft Kinect Deep learning, VGG + SSD Oxford hand and Egohands dataset
Mazzei et al.108 Robotic social therapy 5 ASD, 15 TD 6–12 years old n/a
Coco et al.109 Face detection, landmark extraction, gaze estimation, head pose estimation and FER for behaviour analysis 8 ASD 47–93 months Mobile tablet and Zeno R25 robot Facial landmark Detection and Tracking: conditional local neural field148 Own dataset
Palestra et al.111 Head pose, body posture, eye contact and facial expression for robotics treatment of autism 3 ASD 8–13 years Robokind Zeno R25 humanoid robot and a Microsoft Kinect 149 Own dataset
Dickstein-Fischer et al.112 Face recognition, head pose and eye gaze estimation for assistive technology 5 ASD 5–8 years old Video, Penguin for Autism Behavioural Intervention (PABI) Face detection: Histogram of Oriented Gradients (HOG) + linear classifier; face recognition: LBPH; feature extraction: regression trees; head pose estimation: Perspective-n-Point problem HELEN dataset
Mehmood et al.113 Analysis of joint attention and imitation accuracy 6 ASD, 2 TD 4–10 years old 2 NAO robots, Microsoft Kinect, and EEG Own dataset
Peters et al.115 Behaviour recognition for assistive technology 2 ASD, 5 NDD 41–56 years Two cameras, flow sensor, x-imu sensor Own dataset
Rehg et al.116 Video, audio, and physiological data for behaviour analysis 121 total 15–30 months Multimodal, cameras Microsoft Kinect, microphone, Q-sensors Smile/Gaze Detection: Omron OKAO Vision Library + SVM Multimodal Dyadic Behaviour (MMDB) Dataset
Liu et al.117 Video and audio for diagnosis 22 ASD, 21 TD 2–3 years old Video and audio, camera Own dataset (‘Response to Name’)
Marinoiu et al.118 Action and emotion for behaviour analysis 7 ASD RGB + Depth/Microsoft Kinect v2 Deep learning DE-ENIGMA dataset
Schwarzkopf et al.119 Study of larger extrastriate population receptive fields in ASD 15 ASD, 12 TD 20–48 years old fMRI & eye gaze/3 T TIM-Trio scanner & EyeLink 1000 MRI compatible eye tracker Own dataset

This review presents consolidated evidence on the effectiveness of using computer vision techniques in (1) determining behavioural/biological markers for diagnosis and characterisation of ASD, (2) developing assistive technologies that aid in emotion recognition and expression for ASD individuals and (3) augmenting existing clinical protocols with vision-based systems for ASD therapy and automatic behaviour analysis. The following subsections discuss in detail how each quantified marker was utilised in autism research.

Magnetic resonance imaging (MRI)/functional MRI (fMRI)

The need for a more quantitative approach to ASD diagnosis has pushed research towards analysing brain imaging data, such as MRI and fMRI. Generally, MRI and fMRI techniques scan different parts of the brain to provide images which are then used as input for further processing. These images have been used to determine potential biomarkers that show differences between ASD and TD subjects. For example, Samson et al.22 used fMRI scans to explore the differences in complex non-social sound processing between ASD and TD subjects. With increasing temporal complexity, TD subjects showed greater activity in the anterolateral superior temporal gyrus, while ASD subjects showed greater effects in Heschl’s gyrus. Abdelrahman et al.23 used MRI scans to generate a 3D model of the brain and accurately calculate the volume of white matter in the segmented brain. Using the white matter volume as a discriminatory feature in a classification step based on the k-nearest neighbour algorithm, their system reached an accuracy of 93%. Durrleman et al.24 examined MRI scans to find differences in the growth of the hippocampus between children with ASD and control subjects. Their findings suggest that group differences may be better identified by maturation speed rather than shape differences at a given age. Ahmadi et al.25 used independent component analysis to show that within-network connections in fMRI images of ASD subjects are lower when compared to TD subjects.

The remaining eligible studies developed new techniques for diagnosing ASD using MRI26,27 and fMRI28–30 data in the ABIDE repository. Based on their recent findings, Chaddad et al.26,27 demonstrated the potential of hippocampal texture features extracted from MRI scans as biomarkers for the diagnosis and characterisation of ASD. They applied a Laplacian-of-Gaussian filter31 across a range of resolution scales and performed statistical analysis to identify regions exhibiting significant textural differences between ASD and TD subjects. They identified asymmetrical differences in the right hippocampus, left choroid plexus and corpus callosum, and symmetrical differences in the cerebellar white matter.
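
To make the texture-analysis idea above concrete, the sketch below applies a Laplacian-of-Gaussian filter to a synthetic region of interest at several scales and summarises the responses with simple first-order statistics. It is only a hedged illustration in Python: the scales, the chosen statistics and the random ‘hippocampus’ patch are assumptions, not the pipeline of Chaddad et al.

```python
import numpy as np
from scipy.ndimage import gaussian_laplace

def log_texture_features(roi, scales=(1.0, 2.0, 4.0)):
    """First-order statistics of Laplacian-of-Gaussian responses over a
    region of interest, computed at several (assumed) scales."""
    features = {}
    for sigma in scales:
        response = gaussian_laplace(roi.astype(float), sigma=sigma)
        features[f"log_sigma{sigma}_mean"] = response.mean()
        features[f"log_sigma{sigma}_std"] = response.std()
        features[f"log_sigma{sigma}_skew"] = (
            ((response - response.mean()) ** 3).mean()
            / (response.std() ** 3 + 1e-8)
        )
    return features

# Toy example: a random 3D patch standing in for a segmented hippocampus.
rng = np.random.default_rng(0)
roi = rng.normal(size=(32, 32, 32))
print(log_texture_features(roi))
```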

Some approaches are based on conventional machine learning techniques, such as Support Vector Machines (SVM). For example, Chanel et al.32 used a multivariate pattern analysis approach in two different fMRI experiments with social stimuli. The method, based on a modified version of the SVM Recursive Feature Elimination algorithm33, was trained independently on each experiment and the outputs were combined to obtain a final classification (e.g. ASD or TD). Their results revealed classification accuracies between 69% and 92.3%. Crimi et al.30 used a constrained autoregressive model followed by an SVM to differentiate individuals with ASD from TD individuals. Zheng et al.34 constructed multi-feature-based networks (MFN) and used an SVM to classify individuals of the two groups. Their results showed that using MFN significantly improved the classification accuracy, by almost 14%, compared to using morphological features. Their findings also demonstrated that variations in cortico-cortical similarities can be used as biomarkers in the diagnostic process.
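
The conventional pipelines described above broadly follow a feature-selection-plus-SVM pattern. The following is a minimal, hedged sketch of that pattern using scikit-learn’s recursive feature elimination wrapped around a linear SVM; the synthetic feature matrix, the number of selected features and the regularisation constant are placeholders rather than values from the cited studies.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.feature_selection import RFE
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(1)
X = rng.normal(size=(60, 500))      # e.g. connectivity or morphological features
y = rng.integers(0, 2, size=60)     # 0 = TD, 1 = ASD (synthetic labels)

# Linear SVM wrapped in recursive feature elimination, loosely analogous
# to the SVM-RFE style of approach referenced above.
model = make_pipeline(
    StandardScaler(),
    RFE(SVC(kernel="linear", C=1.0), n_features_to_select=50, step=0.1),
    SVC(kernel="linear", C=1.0),
)
scores = cross_val_score(model, X, y, cv=5)
print("cross-validated accuracy:", scores.mean())
```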

Deep learning techniques have also been proposed for automating ASD diagnosis by extracting discriminative features from fMRI data and feeding them to a classifier28. In order to increase the number of training samples and avoid overfitting, Eslami and Saeed28 used the Synthetic Minority Over-sampling Technique (SMOTE)35. They also investigated the effectiveness of the extracted features using an SVM classifier. Their model achieved more than 70% classification accuracy for four fMRI datasets, with a highest accuracy of 80%. Attaining similar performance, Li et al.29 adopted a deep transfer learning neural network model for ASD diagnosis. Compared to traditional models, their approach led to improved performance in terms of accuracy, sensitivity, specificity and area under the receiver operating characteristic curve.
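
As a rough illustration of the oversampling strategy mentioned above, the sketch below combines SMOTE (from the imbalanced-learn package) with a small multilayer perceptron inside a cross-validated pipeline so that oversampling is applied only within training folds. The feature matrix, class imbalance and network size are assumptions for illustration, not the configuration used by Eslami and Saeed.

```python
import numpy as np
from imblearn.over_sampling import SMOTE
from imblearn.pipeline import make_pipeline
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import cross_val_score
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(2)
X = rng.normal(size=(100, 300))      # e.g. flattened functional-connectivity features
y = np.array([0] * 70 + [1] * 30)    # imbalanced synthetic labels

# Oversample the minority class on training folds only, then classify with
# a small multilayer perceptron (two hidden layers, as a placeholder).
model = make_pipeline(
    StandardScaler(),
    SMOTE(random_state=0),
    MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=500, random_state=0),
)
print("cross-validated accuracy:", cross_val_score(model, X, y, cv=5).mean())
```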

Facial expression/emotion

Emotion classification focusses on the development of algorithms that produce an emotion label (e.g. happy or sad) from a face in a photo or a video frame. Recent advances in the field of computer vision have contributed to the development of various emotion classifiers that can potentially play a significant role in mobile screening and therapy for ASD children. However, most classifiers are biased towards neurotypical adults and can fail to generalise to children with ASD. To address this, Kalantarian et al.36,37 presented a framework for semi-automatic label frame extraction to crowdsource labelled emotion data from children. The labels consist of six emotions: disgust, neutral, surprise, scared, angry and happy. To improve the generalisation of expression recognition models to children with ASD, Han et al.38 presented a transfer learning approach based on a sparse coding algorithm. Their results showed that their method can more accurately identify the emotional expression of children with ASD. Tang et al.39 proposed a convolutional neural network (CNN)-based method for smile detection of infants in mother–infant interaction. Their results showed that their approach can achieve a mean accuracy of 87.16% and an F1-score of 62.54%.
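
Several of the studies in Table 2 rely on a HOG-plus-SVM recipe for expression recognition. The sketch below shows that generic recipe with scikit-image and scikit-learn on synthetic stand-ins for cropped face images; the image size, HOG parameters and six-class label set are illustrative assumptions rather than any cited study’s exact settings.

```python
import numpy as np
from skimage.feature import hog
from sklearn.svm import LinearSVC
from sklearn.model_selection import train_test_split

def hog_descriptor(face_gray):
    """HOG descriptor of a cropped, grayscale face image (64x64 assumed)."""
    return hog(face_gray, orientations=9, pixels_per_cell=(8, 8),
               cells_per_block=(2, 2), feature_vector=True)

# Synthetic stand-ins for cropped face images and emotion labels.
rng = np.random.default_rng(3)
faces = rng.random(size=(200, 64, 64))
labels = rng.integers(0, 6, size=200)        # six emotion classes, as in the text

X = np.array([hog_descriptor(f) for f in faces])
X_train, X_test, y_train, y_test = train_test_split(X, labels, random_state=0)

clf = LinearSVC(C=1.0, max_iter=5000).fit(X_train, y_train)
print("held-out accuracy:", clf.score(X_test, y_test))
```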

Several papers have focussed on using computer vision to develop assistive technologies for ASD children40–43. For example, researchers40,42,43 developed and evaluated a wearable assistive technology to help ASD children with emotion recognition. Vahabzadeh et al.44 provided initial evidence for the potential of wearable assistive technologies to reduce hyperactivity, inattention and impulsivity in school-aged children, adolescents and young adults with ASD. Leo et al.45 and Pan et al.46 developed automatic emotion recognition systems for robot–child interaction in ASD treatment. Their results suggest that computer vision could help to improve the efficiency of behaviour analysis during interactions with robots.

Most research mainly focusses on qualitative recognition of facial expressions, as the computational analysis of facial expression production is an emerging research topic. There have been a few attempts to automatically quantify the facial expression production of ASD children47–52. For example, Leo et al.47 proposed a framework to computationally analyse how ASD and TD children produce facial expressions. Guha et al.52 investigated differences in the overall and local facial dynamics of TD and ASD children. Their observations showed that there is reduced complexity in the dynamic facial behaviour of ASD children, arising primarily from the eye region. Computer vision has also been used to predict engagement and learning performance. For example, Ahmed and Goodwin53 analysed facial expressions from video recordings obtained while children interacted with a computer-assisted instruction programme. Their results showed that emotional and behavioural engagement can be quantified automatically using computer vision analysis.

Harrold et al.54,55 developed a mobile application that allows children to learn emotions with instant feedback on performance through computer vision. White et al.56 presented results which showed that children with ASD found their system to be acceptable and enjoyable. Similar to this approach, Garcia-Garcia et al.57 presented a system that incorporates emotion recognition and tangible user interfaces to teach children with ASD to identify and express emotions. Jain et al.58 proposed an interactive game that can be used for autism therapy. The system tracks facial features to recognise the facial expressions of the participant and to animate an avatar. Developed as a game, the system attempts to teach children how to recognise and express emotions through facial expressions.

A deep learning approach has also been applied to recognise developmental disorders through facial images. For example, Li et al.59 introduced an end-to-end CNN-based system for ASD classification using facial attributes. Their results show that different facial attributes are statistically significant and improve classification performance by about 7%. A deep convolutional neural network (DCNN) for feature extraction followed by an SVM for classification has been trained by Shukla et al.60 to detect whether a person in an image has ASD, cerebral palsy, Down syndrome, foetal alcohol spectrum disorder, progeria or other intellectual disabilities. Their results indicate that their model has an accuracy of 98.80% and performs better than average human intelligence in distinguishing between different disorders.
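
The DCNN-feature-extraction-followed-by-SVM design mentioned above can be sketched generically as follows, here with a torchvision ResNet-18 backbone used purely as a fixed feature extractor and a scikit-learn SVM on top. The backbone choice, input size and toy labels are assumptions; the actual architecture and training setup of Shukla et al. may differ.

```python
import numpy as np
import torch
import torch.nn as nn
import torchvision.models as models
from sklearn.svm import SVC

# Backbone CNN used purely as a feature extractor. In practice one would load
# pretrained weights (e.g. weights="IMAGENET1K_V1"); None keeps this sketch offline.
backbone = models.resnet18(weights=None)
backbone.fc = nn.Identity()          # drop the classification head, keep 512-d features
backbone.eval()

def extract_features(images):
    """images: float tensor of shape (N, 3, 224, 224)."""
    with torch.no_grad():
        return backbone(images).numpy()

# Synthetic face images and binary labels (0 = TD, 1 = ASD) for illustration only.
images = torch.rand(16, 3, 224, 224)
labels = np.array([0, 1] * 8)

features = extract_features(images)
clf = SVC(kernel="rbf", C=1.0).fit(features, labels)
print("training accuracy on toy data:", clf.score(features, labels))
```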

Eye gaze data

Analysing the attention and psychological factors encoded in the eye movements of individuals could help in ASD diagnosis. Computer vision has been used to automatically analyse children’s gaze and distinguish ASD-related characteristics present in a video61. Research has shown that there is a significant difference in gaze patterns between children with ASD and TD children. Eye tracking technology provides automatic assessment of gaze behaviour in different contexts. For example, Balestra et al.62 showed that it can be used to study language impairments, text comprehension and production deficits. In addition, it can be used to identify fixations and saccades63, recognise affective states64 and even reveal early biomarkers associated with ASD65,66. Furthermore, eye tracking can be used to detect saliency differences between ASD and TD children. Researchers67–70 showed that there is a difference in preference for both social and non-social images. This finding is consistent with a similar published study by Syeda et al.71, which examined face scanning patterns in a controlled experiment. By extracting and analysing gaze data, the study revealed that children with autism spend less time looking at core features of faces (e.g. eyes, nose and mouth). Chrysouli et al.72 proposed a deep learning-based technique to recognise the affective state (e.g. engaged, bored, frustrated) of an individual (e.g. ASD, TD, etc.) from a video sequence.
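
Fixation identification from raw gaze samples, as in the DBSCAN-based approach cited above, can be sketched with a spatial clustering step: samples that cluster tightly are treated as fixations and unclustered samples as saccades or noise. The eps and min_samples values and the toy gaze stream below are assumptions, not the parameters of Li et al.

```python
import numpy as np
from sklearn.cluster import DBSCAN

def identify_fixations(gaze_xy, eps_px=30, min_samples=5):
    """Group raw gaze samples (N, 2) into candidate fixations with DBSCAN.
    Samples labelled -1 are treated as saccades/noise."""
    labels = DBSCAN(eps=eps_px, min_samples=min_samples).fit_predict(gaze_xy)
    fixations = []
    for k in set(labels) - {-1}:
        cluster = gaze_xy[labels == k]
        fixations.append({
            "centroid": cluster.mean(axis=0),
            "n_samples": len(cluster),        # proxy for fixation duration
        })
    return fixations

# Toy gaze stream: two dwell points connected by a saccade.
rng = np.random.default_rng(4)
gaze = np.vstack([
    rng.normal([200, 200], 5, size=(60, 2)),
    np.linspace([200, 200], [600, 400], 15),   # saccade samples
    rng.normal([600, 400], 5, size=(60, 2)),
])
print(identify_fixations(gaze))
```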

Building upon the knowledge of previous research, several studies have concentrated on using the visual attention preferences of children with ASD for diagnosis. For example, Liu et al.73,74 proposed a machine learning-based system to capture discriminative eye movement patterns related to ASD. They also presented a comprehensive set of effective feature extraction methods, prediction frameworks and corresponding scoring frameworks. Vu et al.75 examined the impact of visual stimuli and exposure time on the quantitative accuracy of ASD diagnosis. They showed that using a ‘social scene’ stimulus with a 5-s exposure time gives the best performance at 98.24%. Also using visual attention preference, Jiang and Zhao76 leveraged recent advances in deep learning for superior performance in ASD diagnosis. In particular, they used a DCNN and SVM to achieve an accuracy of 92%. Higuchi et al.77 developed a novel system that provides visualisation of automatic gaze estimation and allows experts to perform further analysis.

Most of the studies have been conducted in a highly controlled environment in which the subjects were asked to view a screen for a short period of time. Recently, Chong et al.78 presented a novel deep learning architecture for eye contact detection in natural social interactions. In their study, eye contact detection was performed during adult–child sessions in which the adult wears a point-of-view camera. Their results showed significant improvement over existing methods, with a reported precision and recall of 76% and 80%, respectively. Toshniwal et al.79 proposed an assistive technology that tracks attention using mobile camera and uses haptic feedback to recapture attention. Their evaluation study with users with various intellectual disabilities showed that it can provide better learning with less intervention.

Motor control/movement pattern

The use of computer vision has also shown potential for a more precise, objective and quantitative assessment of early motor control variations. For example, Dawson et al.80 used computer vision analysis to quantify differences in midline head postural control, as reflected in the rate of spontaneous head movements, between toddlers with and without ASD. Their study followed a response-to-name protocol in which a series of social and non-social stimuli (i.e. in the form of a movie) were shown on a smart tablet while the child sat on a caregiver’s lap. During the protocol, the examiner, standing behind the child, calls the child’s name and the child’s reaction is recorded using the smart tablet. Afterwards, a fully automated computer vision algorithm detects and tracks 49 facial landmarks and estimates head pose angles. Their study revealed that toddlers with ASD exhibited a significantly higher rate of head movement compared to their typically developing counterparts. Using the same approach, Martin et al.81 examined the head movement dynamics of a cohort of children. They found that there is an evident difference in lateral (yaw and roll) head movement between children with ASD and TD children.
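
A simple way to turn tracked head pose angles into a movement-rate summary, loosely in the spirit of the head-movement analyses above, is to average the absolute frame-to-frame angular displacement for each axis. The sketch below assumes an (N, 3) yaw/pitch/roll time series from some landmark or pose tracker; the metric is an illustrative simplification, not the exact measure used by Dawson et al. or Martin et al.

```python
import numpy as np

def head_movement_rate(angles_deg, fps=30.0):
    """Mean absolute frame-to-frame angular displacement (deg/s) for each of
    yaw, pitch and roll, given an (N, 3) time series of head pose angles."""
    deltas = np.abs(np.diff(angles_deg, axis=0))   # per-frame displacement
    return deltas.mean(axis=0) * fps               # degrees per second

# Toy pose track: random-walk jitter standing in for tracker output.
rng = np.random.default_rng(5)
angles = np.cumsum(rng.normal(0, 0.2, size=(300, 3)), axis=0)
yaw_rate, pitch_rate, roll_rate = head_movement_rate(angles)
print(f"yaw {yaw_rate:.2f}, pitch {pitch_rate:.2f}, roll {roll_rate:.2f} deg/s")
```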

Deep learning has also been employed to develop novel screening tools that analyse gestures captured in video sequences. For example, Zunino et al.82 used a CNN to extract features, followed by a long short-term memory (LSTM) model with an attentional mechanism. They demonstrated that it is possible to determine whether a video sequence contains a grasping action performed by ASD or TD subjects. In another study, Vyas et al.83 estimated children’s pose over time by retraining a state-of-the-art pose estimator (2D Mask R-CNN) and trained a CNN to categorise whether a given video clip contains typical (normal) or atypical (ASD) behaviour. Their approach, with an accuracy of 72%, outperformed conventional video classification approaches.
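
The CNN-encoder-plus-LSTM pattern described above can be sketched as a small PyTorch module: per-frame features from a convolutional encoder are pooled over time by an LSTM, and the final hidden state feeds a two-class head. The layer sizes, clip length and the omission of an attention mechanism are simplifying assumptions relative to the cited models.

```python
import torch
import torch.nn as nn

class ClipClassifier(nn.Module):
    """Per-frame CNN features pooled over time by an LSTM, then a binary head."""
    def __init__(self, feat_dim=64, hidden=128):
        super().__init__()
        self.cnn = nn.Sequential(                        # tiny frame encoder (placeholder)
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, feat_dim),
        )
        self.lstm = nn.LSTM(feat_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 2)                 # e.g. ASD vs TD logits

    def forward(self, clips):                            # clips: (B, T, 3, H, W)
        b, t = clips.shape[:2]
        feats = self.cnn(clips.flatten(0, 1)).view(b, t, -1)
        _, (h_n, _) = self.lstm(feats)
        return self.head(h_n[-1])

model = ClipClassifier()
logits = model(torch.rand(2, 16, 3, 64, 64))             # two toy 16-frame clips
print(logits.shape)                                       # torch.Size([2, 2])
```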

Computer vision has also been used to develop motion-based touchless games for ASD therapy. For example, Piana et al.84 conducted an evaluation study of a system designed for helping ASD children to recognise and express emotions by means of their full-body movement captured by RGB-D sensors. Their results showed that there is an increase in task (recognition) accuracy from the beginning to the end of training sessions. Bartoli et al.85 showed the effectiveness of using embodied touchless interaction to promote attention skills during therapy sessions. Similarly, Ringland et al.86 developed SensoryPaint that allows whole-body interactions and showed that it is a promising therapeutic tool. Magrini et al.87 developed an interactive vision-based system which reacts to movements of the human body to produce sounds. Their system has been evaluated by a team of clinical psychologists and parents of young patients.

Computer vision has also been used to develop robot-mediated assistive technologies for ASD therapy. Dickstein-Fischer and Fischer88 developed a robot, named PABI (Penguin for Autism Behavioural Interventions), with augmented vision to interact meaningfully with an autistic child during therapy. Similarly, Bekele et al.89 developed a robot with augmented vision to automatically adapt itself in an individualised manner and to administer joint attention prompts. Their study suggests that robotic systems with augmented vision may be capable of enhancing skills related to attention coordination. This confirms an earlier study by Dimitrova et al.90 in which adaptive robots showed potential for educating children in various complex cognitive and social skills that can eventually produce a substantial developmental impact.

Stereotyped behaviours

In the context of autism research, atypical behaviours are assessed during screening using different clinical tools and protocols. For example, the Autism Observation Scale for Infants (AOSI) consists of a set of protocols designed to assess specific behaviours91. During the last decade, research has been growing towards behavioural imaging to create new capabilities for the quantitative understanding of behavioural signs, such as those outlined in the AOSI. For example, Hashemi et al.92,93 examined the potential benefits that computer vision can provide for measuring and identifying ASD behavioural signs based on two components of the AOSI. In particular, they developed a computer vision tool to assess: (1) disengagement of attention: the ability of children to disengage their attention from one of two competing visual stimuli; and (2) visual tracking: the ability to visually follow a moving object laterally across the midline. Similarly, computer vision analysis has also been explored to automatically detect and analyse atypical attention behaviours in toddlers in a response-to-name protocol. A proof-of-concept system that used marker-less head tracking was presented by Bidwell et al.94, and scalable applications were developed by Hashemi et al.95, Campbell et al.96 and Hashemi et al.97. The latter systems run on a mobile application designed to elicit ASD-related behaviours (e.g. social referencing, smiling while watching a movie and pointing) and use computer vision analysis to automatically code behaviours related to early risk markers of ASD. When compared to a human analyst, computer vision analysis was found to be as reliable in predicting child response latency. Using the response-to-name protocol, Wang et al.98 proposed a non-contact vision system that achieved an average classification score of 92.7% for assisted screening of ASD. The results of these studies show that computer vision tools can capture critical behavioural observations and potentially augment clinical behavioural observations when using the AOSI. Bovery et al.99 also used a mobile application and movie stimuli to measure the attention of toddlers. They used computer vision algorithms to detect head and iris positions and determine the direction of attention. Their results showed that toddlers with ASD paid less attention to the movie, showed less attention to the social as compared to the non-social visual stimuli and often directed their attention to one side of the screen.

Behaviours other than those outlined by the AOSI have also been quantified using computer vision. For example, self-stimulatory behaviours refer to stereotyped, repetitive movements of body parts. Also known as ‘stimming behaviours’, these behaviours are often manifested when a person with autism engages in actions like rocking, pacing or hand flapping. Researchers100–102 have introduced a dataset of stimming behaviours and used computer vision to determine whether these behaviours exist in a video stream. Another quantified behaviour is social interaction and communication among individuals with ASD. Winoto et al.103 developed an unobtrusive sensing system to observe, sense and annotate behavioural cues which can be reviewed by specialists and parents for better tailored assessment and interventions. Similarly, children’s responses when interacting with robots have been quantified using computer vision techniques. Feil-Seifer and Matarić104 showed that computer vision can be used to study behaviours of ASD children towards robots during free-play settings. Moghadas and Moradi105 proposed a computer vision approach to analyse human–robot interaction sessions and to extract features that can be used for ASD diagnosis.
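
One generic way to detect repetitive, self-stimulatory motion in video, broadly in the spirit of the motion-histogram features cited above, is to accumulate a magnitude-weighted histogram of dense optical-flow directions over a clip and classify the resulting descriptor. The OpenCV-based sketch below uses synthetic frames, arbitrary Farneback parameters and synthetic labels; it is not the authors’ implementation.

```python
import cv2
import numpy as np
from sklearn.svm import SVC

def motion_histogram(frames_gray, bins=8):
    """Histogram of dense optical-flow directions, weighted by magnitude,
    accumulated over a clip of grayscale frames (list of HxW uint8 arrays)."""
    hist = np.zeros(bins)
    for prev, curr in zip(frames_gray[:-1], frames_gray[1:]):
        flow = cv2.calcOpticalFlowFarneback(prev, curr, None,
                                            0.5, 3, 15, 3, 5, 1.2, 0)
        mag, ang = cv2.cartToPolar(flow[..., 0], flow[..., 1])
        h, _ = np.histogram(ang, bins=bins, range=(0, 2 * np.pi), weights=mag)
        hist += h
    return hist / (hist.sum() + 1e-8)

# Toy clips standing in for video segments; real use would read frames with cv2.
rng = np.random.default_rng(6)
clips = [[rng.integers(0, 255, (64, 64), dtype=np.uint8) for _ in range(10)]
         for _ in range(20)]
X = np.array([motion_histogram(c) for c in clips])
y = np.array([0, 1] * 10)            # e.g. flapping vs. no flapping (synthetic)
clf = SVC().fit(X, y)
print("training accuracy on toy data:", clf.score(X, y))
```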

Multimodal data

Over the last decade, there has been increasing interest in incorporating multiple behavioural modalities to achieve superior performance and even outperform previous state-of-the-art methods that utilise only a single modality for ASD screening. For example, Chen and Zhao106 proposed a privileged modality framework that integrates information from two different tasks: (1) a photo-taking task, where subjects freely move around the environment and take photos, and (2) an image-viewing task, where their eye movements are recorded by an eye-tracking device. They used a CNN and LSTM to integrate features extracted from these two tasks for more accurate ASD screening. Their results showed that the proposed models can achieve new state-of-the-art results. They also demonstrated that utilising knowledge across the two modalities dramatically improved performance by more than 30%.
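
At its simplest, multimodal fusion can be illustrated by concatenating per-subject feature vectors from two modalities before a single classifier, as in the hedged sketch below with synthetic gaze and image features. This late-fusion baseline is far simpler than the privileged-modality network of Chen and Zhao and is included only to show why combining modalities can help.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(7)
n = 80
gaze_features = rng.normal(size=(n, 40))     # e.g. fixation statistics per subject
image_features = rng.normal(size=(n, 128))   # e.g. CNN embeddings of taken photos
y = rng.integers(0, 2, size=n)               # synthetic ASD/TD labels

# Simplest possible fusion: concatenate per-subject feature vectors and
# train a single classifier on the joint representation.
X_fused = np.hstack([gaze_features, image_features])
for name, X in [("gaze only", gaze_features),
                ("images only", image_features),
                ("fused", X_fused)]:
    acc = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5).mean()
    print(f"{name}: {acc:.2f}")
```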

Wang et al.107 presented a standardised screening protocol, namely Expressing Needs with Index Finger Pointing (ENIFP), to assist in ASD diagnosis. The protocol is administered in a novel non-invasive system trained using deep learning to automatically capture eye gaze and gestures of the participant. Their results showed that the system can record the child’s performance and reliably check mutual attention and gestures during the ENIFP protocol. Computer vision techniques have also been used during robotic social therapy sessions proposed by Mazzei et al.108.

Computer vision systems that incorporate multimodal information have also been used to detect behavioural features during interaction with a humanoid robot. For example, Coco et al.109 proposed a technological framework to automatically build a quantitative report that could help therapists to better achieve either ASD diagnosis or assessment tasks. Furthermore, computer vision has been used to address autism therapy through social robots that automatically adapt their behaviours. For example, researchers110–113 have presented systems that simultaneously include eye contact, joint attention, imitation and emotion recognition for an intervention protocol for ASD children. Egger et al.114 presented the first study showing the feasibility of computer vision techniques to automatically code behaviours in natural environments. Another assistive technology was introduced by Peters et al.115 to assist people with cognitive disabilities in brushing teeth. It uses behaviour recognition and a machine learning network to provide automatic assistance in task execution.

Rehg et al.116 proposed a new action recognition dataset for the analysis of children’s social and communicative behaviours based on video and audio data. Their preliminary experimental results demonstrated the potential of this dataset to drive multi-modal activity recognition. Similarly, Liu et al.117 proposed a ‘Response-to-Name’ dataset and a multimodal ASD auxiliary screening system based on machine learning. Marinoiu et al.118 introduced one of the largest existing multimodal datasets of its kind (i.e. autistic interaction rather than genetic or medical data). They also proposed a fine-grained action classification and emotion prediction task recorded during robot-assisted therapy sessions of children with ASD. Their results showed that machine-predicted scores align closely with human professional diagnosis.

Computer vision has also been applied to multimodal data, such as fMRI and eye gaze information, in order to test differences in the response selectivity of the human visual cortex between individuals with ASD and TD individuals. Schwarzkopf et al.119 have shown that ASD is not characterised by sharper spatial selectivity in the visual cortex.

Datasets used in eligible papers

The dataset requirement typically depends on the target behavioural/biological marker and the computer vision methods to be employed. In this section, the publicly available datasets used by eligible papers are reviewed, with a focus on those containing autistic samples.

Magnetic resonance imaging datasets

The Autism Brain Imaging Data Exchange (ABIDE) initiative has aggregated functional and structural brain imaging data collected from different laboratories to accelerate understanding of the neural basis of autism. ABIDE I represents the first ABIDE initiative120. This effort yielded a total of 1112 records (sets of magnetic resonance imaging (MRI) and functional MRI), including 539 from individuals with ASD and 573 from TD individuals. ABIDE II was established to further promote discovery of the brain connectome in ASD121. It consists of 1114 records from 521 individuals with ASD and 593 TD individuals. Hazlett et al.122 conducted an MRI study with 51 children with ASD and 25 control children (including both developmentally delayed and TD children) between 18 and 35 months of age.
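
For readers who want to experiment with ABIDE I, the nilearn library ships a fetcher for the preprocessed release; a minimal sketch is shown below, building correlation-based connectivity features from ROI time series. It requires a network download, and the derivative name ('rois_cc200'), pipeline choice and the DX_GROUP coding (1 = ASD, 2 = TD) are assumptions to be checked against the ABIDE documentation.

```python
# A minimal sketch of pulling a small preprocessed subset of ABIDE I with
# nilearn's fetch_abide_pcp and building connectivity features; derivative
# names, parameters and the DX_GROUP coding are assumptions to verify.
import numpy as np
from nilearn.datasets import fetch_abide_pcp
from nilearn.connectome import ConnectivityMeasure

data = fetch_abide_pcp(n_subjects=20, derivatives=["rois_cc200"],
                       pipeline="cpac", quality_checked=True)

# Region-by-region correlation matrices, vectorised as per-subject features.
conn = ConnectivityMeasure(kind="correlation", vectorize=True)
X = conn.fit_transform([np.asarray(ts) for ts in data.rois_cc200])
y = np.asarray(data.phenotypic["DX_GROUP"])   # 1 = ASD, 2 = TD in the release notes
print(X.shape, np.unique(y))
```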

Autism spectrum disorder detection dataset

This dataset consists of a set of video clips of reach-to-grasp actions performed by children with ASD and TD82. In the protocol, children were asked to grasp a bottle and perform different subsequent actions (e.g. placing, pouring, passing to pour, and passing to place). A total of 20 children with ASD and 20 TD children participated in the study.

DE-ENIGMA dataset

DE-ENIGMA dataset is a free, large-scale, publicly available multi-modal (e.g. audio, video, and depth) database of autistic children’s interactions that is suitable for behavioural research123. A total of 128 children on the autism spectrum participated in the study. During the experiment, children within each age group were randomly assigned to either a robot-led or a researcher/therapist-led teaching intervention which was implemented across multiple short sessions. This dataset includes ~13 TB of multi-modal data, representing 152 h of interaction. Furthermore, 50 children’s data have been annotated by experts for emotional valence, arousal, audio features and body gestures. The annotated data are in effect ready for future autism-focussed machine learning research.

Multimodal behaviour dataset

The Multimodal Dyadic Behaviour (MMDB) dataset is a unique collection of 160 multimodal (video, audio and physiological) recordings and annotations of the social and communicative behaviours of 121 children aged 15–30 months, gathered in a protocol known as the Rapid-ABC sessions116. This play protocol is an interactive assessment (3–5 min) consisting of five semi-structured play interactions in which the examiner elicits social attention, interaction and non-verbal communication from the child.

Saliency4ASD dataset

Saliency4ASD Grand Challenge aims to align the visual attention modelling community around the application of ASD diagnosis and to provide an open dataset of eye movements recorded from children with ASD and TD. The database consists of 300 images with various animals, buildings, natural scenes and combinations of different visual stimuli124. Each image has corresponding eye-tracking data collected from 28 participants.

Self-stimulatory behaviour dataset

Due to the lack of a database containing self-stimulatory behaviours, Rajagopalan et al.101 searched for and collected videos on public domain websites and video portals (e.g. YouTube). They classified the videos into three categories: arm flapping, head banging and spinning. Compared to other datasets, their dataset is recorded in natural settings. The dataset contains 75 videos with an equal number of videos for each category.

Other datasets

Until recently, autism datasets have been relatively small when compared to other datasets in which machine learning has seen tremendous application. As a result, earlier published research has resorted to using a subset of videos of neurotypical individuals from human action recognition datasets [UCF101125, Weizmann126], facial expression datasets [Cohn-Kanade(CK)127, CK+128, FERET129, Hollywood2130, HELEN131, CelebA132, AffectNet133, EmotioNet134, BU-3D Facial Expression135] and gesture recognition datasets [Oxford Hand Dataset136, Egohands137] to help train systems that analyse autistic behaviours.

Limitations

This review has some limitations: one is linked to the number of included papers and the other to the quality of the papers included. Although an effort was made to make the review as inclusive as possible by following the PRISMA checklist, some studies might not have been included because of the chosen keywords and time period. Nevertheless, as far as is known, this is the first systematic review of the current state of computer vision approaches in autism research.

Because this is a relatively new field of research, few published studies are longitudinal and many included small cohorts of participants; thus, the quality of the results may change as more clinical trials are conducted. Nonetheless, this systematic review suggests that these advances in computer vision are applicable in the ASD domain and can stimulate further research in using computer vision techniques to augment existing clinical methods. However, these approaches require further evaluation before they can be applied in clinical settings.

Discussion

In this work, a systematic review has been provided on the use of computer vision techniques in autism research. Although there has been considerable research in this area, factors such as the reliance on controlled experiments in clinical settings mean that the quantification of human behaviours in real-world scenarios from image or video streams remains challenging. In this paper, publicly available datasets relevant to behaviour analysis have also been reviewed, in order to rapidly familiarise researchers with datasets applicable to their field and to accelerate both new behavioural and technological work on autism. The primary conclusions of this study on computer vision approaches in autism research are provided below:

  1. Different behavioural/biological markers have already been quantified, to some extent, using computer vision analysis with comparable performance to a human analyst.

  2. For feature extraction and classification tasks, deep learning-based approaches have shown superior performance when compared to traditional computer vision approaches.

  3. The growing number of large-scale publicly available datasets provides the required scale of data needed for furthering machine learning and deep learning developments.

  4. Multimodal methods attain superior performance by combining knowledge across different modalities.

In the current state of the art, it is evident that computer vision analysis is useful for the quantification of behavioural/biological markers that can further lead to a non-invasive, objective and automatic tool for autism research. It can also be used to provide effective interventions using robots with augmented vision during therapies. In addition, it can be used to develop technologies that assist individuals with ASD in certain tasks, such as emotion recognition.

To date, most published studies are related to the use of computer vision in a clinical setting. However, in complex scenes outside of clinical protocols, there are many issues with feature learning in single or even multimodal data. In addition, it is challenging to compare the performance of the eligible studies due to the lack of benchmarked datasets that researchers have ‘agreed’ on for the use of deep learning138. Until recently, there have been no large-scale datasets that researchers could use to compare their results. Given the current state of research, researchers in this area should address the following problems:

  1. Multimodal approaches based on fusion methods should be explored further. In current research, most studies have focussed on RGB data from image or video streams. However, an increasing number of studies have shown that superior performance can be achieved through a combination of multimodal information.

  2. Researchers should agree on benchmark datasets and evaluate their models on them for a more reliable comparison of performance. The datasets reviewed in this paper serve as a starting point for researchers to use in computer vision research. Experts can borrow knowledge gained from existing state-of-the-art human activity recognition models trained on neurotypical individuals, apply them to these datasets, and build models that can generalise to individuals with ASD.

  3. Computer vision approaches that address fully unconstrained scenarios. Most published studies require participants to be in clinical settings that typically do not capture data from the children in their natural environments.

  4. Longitudinal studies or collections of large cohorts of individuals with ASD and TD individuals should be conducted to evaluate the performance of subsequent computer vision systems. This requires careful and systematic empirical validation to ensure their accuracy, reliability, interpretability and true clinical utility. This would help determine whether these systems can generalise across different participant groups (e.g. multiple ages, cultural differences) and demonstrate fairness and unbiasedness.

  5. It is also important to gain a deeper understanding of the human factors, user experience and ethical considerations surrounding the application of vision-based systems. This would help develop usable and useful systems and determine whether they can genuinely augment existing behavioural observations in a clinical setting.
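
As an illustration of recommendation 2, the sketch below (Python/PyTorch, assuming the torchvision library is available; newer releases use a weights argument instead of pretrained) loads an R3D-18 action recognition backbone pretrained on generic human activity data and replaces its classification head for a hypothetical two-class fine-tuning task. The clips and labels are placeholders; real experiments would draw them from the publicly available datasets reviewed above and report results on an agreed train/test split so that findings are comparable across studies.

    # Hypothetical transfer-learning sketch: reuse a human activity recognition
    # backbone and fine-tune only a new classification head on placeholder data.
    import torch
    import torch.nn as nn
    from torchvision.models.video import r3d_18

    model = r3d_18(pretrained=True)                 # backbone pretrained on generic human actions
    for param in model.parameters():                # freeze the pretrained backbone
        param.requires_grad = False
    model.fc = nn.Linear(model.fc.in_features, 2)   # new two-class (e.g. ASD/TD) head

    optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-4)
    criterion = nn.CrossEntropyLoss()

    # Placeholder batch: 4 clips of 16 RGB frames at 112x112, shaped (N, C, T, H, W).
    clips = torch.randn(4, 3, 16, 112, 112)
    labels = torch.randint(0, 2, (4,))              # placeholder class labels

    model.train()
    optimizer.zero_grad()
    loss = criterion(model(clips), labels)
    loss.backward()
    optimizer.step()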

Supplementary information

41398_2020_1015_MOESM1_ESM.docx (68.7KB, docx)

Appendix A: PRISMA 2009 flow diagram: computer vision in autism spectrum disorder research: a systematic review of published studies from 2009 to 2019

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information accompanies this paper at (10.1038/s41398-020-01015-w).

References

  • 1.Thabtah, F. & Peebles, D. A new machine learning model based on induction of rules for autism detection. Health Inform. J. 1460458218824711 (2019). [DOI] [PubMed]
  • 2.Wiggins LD, Baio J, Rice C. Examination of the time between first evaluation and first autism spectrum diagnosis in a population-based sample. J. Dev. Behav. Pediatr. 2006;27:S79–S87. doi: 10.1097/00004703-200604002-00005. [DOI] [PubMed] [Google Scholar]
  • 3.Taylor LJ, et al. Brief report: an exploratory study of the diagnostic reliability for autism spectrum disorder. J. Autism Dev. Disord. 2017;47:1551–1558. doi: 10.1007/s10803-017-3054-z. [DOI] [PubMed] [Google Scholar]
  • 4.Pickles A, et al. Parent-mediated social communication therapy for young children with autism (PACT): long-term follow-up of a randomised controlled trial. Lancet. 2016;388:2501–2509. doi: 10.1016/S0140-6736(16)31229-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Adrien JL, et al. Autism and family home movies: preliminary findings. J. Autism Dev. Disord. 1991;21:43–49. doi: 10.1007/BF02206996. [DOI] [PubMed] [Google Scholar]
  • 6.Adrien JL, et al. Early symptoms in autism from family home movies. Evaluation comparison 1st 2nd year life using I.B.S.E. scale. Acta Paedopsychiatr. 1992;55:71–75. [PubMed] [Google Scholar]
  • 7.Werner E, Dawson G. Validation of the phenomenon of autistic regression using home videotapes. Arch. Gen. Psychiatry. 2005;62:889–895. doi: 10.1001/archpsyc.62.8.889. [DOI] [PubMed] [Google Scholar]
  • 8.Mars AE, Mauk JE, Dowrick PW. Symptoms of pervasive developmental disorders as observed in prediagnostic home videos of infants and toddlers. J. Pediatr. 1998;132:500–504. doi: 10.1016/S0022-3476(98)70027-7. [DOI] [PubMed] [Google Scholar]
  • 9.Osterling J, Dawson G. Early recognition of children with autism: a study of first birthday home videotapes. J. Autism Dev. Disord. 1994;24:247–257. doi: 10.1007/BF02172225. [DOI] [PubMed] [Google Scholar]
  • 10.Nadig AS, et al. A prospective study of response to name in infants at risk for autism. Arch. Pediatr. Adolesc. Med. 2007;161:378–383. doi: 10.1001/archpedi.161.4.378. [DOI] [PubMed] [Google Scholar]
  • 11.Elsabbagh M, et al. Disengagement of visual attention in infancy is associated with emerging autism in toddlerhood. Biol. Psychiatry. 2013;74:189–194. doi: 10.1016/j.biopsych.2012.11.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zwaigenbaum L, et al. Behavioral manifestations of autism in the first year of life. Int. J. Dev. Neurosci. 2005;23:143–152. doi: 10.1016/j.ijdevneu.2004.05.001. [DOI] [PubMed] [Google Scholar]
  • 13.Ozonoff S, et al. A prospective study of the emergence of early behavioral signs of autism. J. Am. Acad. Child Adolesc. Psychiatry. 2010;49:256–266.e251–252. [PMC free article] [PubMed] [Google Scholar]
  • 14.Flanagan JE, Landa R, Bhat A, Bauman M. Head lag in infants at risk for autism: a preliminary study. Am. J. Occup. Ther. 2012;66:577–585. doi: 10.5014/ajot.2012.004192. [DOI] [PubMed] [Google Scholar]
  • 15.Esposito G, Venuti P, Apicella F, Muratori F. Analysis of unsupported gait in toddlers with autism. Brain Dev. 2011;33:367–373. doi: 10.1016/j.braindev.2010.07.006. [DOI] [PubMed] [Google Scholar]
  • 16.Gima H, et al. Early motor signs of autism spectrum disorder in spontaneous position and movement of the head. Exp. Brain Res. 2018;236:1139–1148. doi: 10.1007/s00221-018-5202-x. [DOI] [PubMed] [Google Scholar]
  • 17.Brisson J, Warreyn P, Serres J, Foussier S, Adrien-Louis J. Motor anticipation failure in infants with autism: a retrospective analysis of feeding situations. Autism. 2012;16:420–429. doi: 10.1177/1362361311423385. [DOI] [PubMed] [Google Scholar]
  • 18.Thevenot J, López MB, Hadid A. A survey on computer vision for assistive medical diagnosis from faces. IEEE J. Biomed. Health Inform. 2018;22:1497–1511. doi: 10.1109/JBHI.2017.2754861. [DOI] [PubMed] [Google Scholar]
  • 19.Rehg JM. Behavior imaging: using computer vision to study. Autism MVA. 2011;11:14–21. [Google Scholar]
  • 20.Sapiro G, Hashemi J, Dawson G. Computer vision and behavioral phenotyping: an autism case study. Curr. Opin. Biomed. Eng. 2019;9:14–20. doi: 10.1016/j.cobme.2018.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Moher D, Liberati A, Tetzlaff J, Altman DG, Group, a. t. P. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Ann. Intern. Med. 2009;151:264–269. doi: 10.7326/0003-4819-151-4-200908180-00135. [DOI] [PubMed] [Google Scholar]
  • 22.Samson F, et al. Atypical processing of auditory temporal complexity in autistics. Neuropsychologia. 2011;49:546–555. doi: 10.1016/j.neuropsychologia.2010.12.033. [DOI] [PubMed] [Google Scholar]
  • 23.Abdelrahman, M., Ali, A., Farag, A., Casanova, M. F. & Farag, A. New approach for classification of autistic vs. typically developing brain using white matter volumes. In Proc. Ninth Conference on Computer and Robot Vision. 284–289 (2012).
  • 24.Durrleman S, et al. Toward a comprehensive framework for the spatiotemporal statistical analysis of longitudinal shape data. Int. J. Comput. Vis. 2013;103:22–59. doi: 10.1007/s11263-012-0592-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ahmadi, S. M. M., Mohajeri, N. & Soltanian-Zadeh, H. Connectivity abnormalities in autism spectrum disorder patients: a resting state fMRI study. In Proc. 22nd Iranian Conference on Electrical Engineering (ICEE). 1878–1882 (2014).
  • 26.Chaddad A, Desrosiers C, Hassan L, Tanougast C. Hippocampus and amygdala radiomic biomarkers for the study of autism spectrum disorder. BMC Neurosci. 2017;18:52. doi: 10.1186/s12868-017-0373-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chaddad A, Desrosiers C, Toews M. Multi-scale radiomic analysis of sub-cortical regions in MRI related to autism, gender and age. Sci. Rep. 2017;7:45639. doi: 10.1038/srep45639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Eslami, T. & Saeed, F. Auto-ASD-network: a technique based on deep learning and support vector machines for diagnosing autism spectrum disorder using fMRI data. In Proc. 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics. 646–651 (Association for Computing Machinery).
  • 29.Li, H., Parikh, N. A. & He, L. A novel transfer learning approach to enhance deep neural network classification of brain functional connectomes. Front. Neurosci.12, 10.3389/fnins.2018.00491 (2018). [DOI] [PMC free article] [PubMed]
  • 30.Crimi, A., Dodero, L., Murino, V. & Sona, D. Case–control discrimination through effective brain connectivity. In Proc. IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017). 970–973 (2017).
  • 31.Ganeshan B, Miles KA, Young RC, Chatwin CR. In search of biologic correlates for liver texture on portal-phase CT. Acad. Radio. 2007;14:1058–1068. doi: 10.1016/j.acra.2007.05.023. [DOI] [PubMed] [Google Scholar]
  • 32.Chanel G, et al. Classification of autistic individuals and controls using cross-task characterization of fMRI activity. NeuroImage: Clin. 2016;10:78–88. doi: 10.1016/j.nicl.2015.11.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach. Learn. 2002;46:389–422. doi: 10.1023/A:1012487302797. [DOI] [Google Scholar]
  • 34.Zheng, W. et al. Multi-feature based network revealing the structural abnormalities in autism spectrum disorder. IEEE Trans. Affect. Comput. 1–1, 10.1109/TAFFC.2018.2890597 (2018).
  • 35.Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J. Artif. Int. Res. 2002;16:321–357. [Google Scholar]
  • 36.Kalantarian H, et al. Labeling images with facial emotion and the potential for pediatric healthcare. Artif. Intell. Med. 2019;98:77–86. doi: 10.1016/j.artmed.2019.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kalantarian, H. et al. A gamified mobile system for crowdsourcing video for autism research. In Proc. IEEE International Conference on Healthcare Informatics (ICHI). 350–352 (2018).
  • 38.Han, J. et al. Affective computing of childern with authism based on feature transfer In Proc. 5th IEEE International Conference on Cloud Computing and Intelligence Systems (CCIS). 845–849 (2018).
  • 39.Tang, C. et al. Automatic smile detection of infants in mother-infant interaction via CNN-based feature learning. In Proc. Joint Workshop of the 4th Workshop on Affective Social Multimedia Computing and First Multi-modal Affective Computing of Large-scale Multimedia Data. 35–40 (Association for Computing Machinery).
  • 40.Daniels J, et al. Feasibility testing of a wearable behavioral aid for social learning in children with autism. Appl. Clin. Inform. 2018;9:129–140. doi: 10.1055/s-0038-1626727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Jazouli, M., Majda, A. & Zarghili, A. A $P recognizer for automatic facial emotion recognition using Kinect sensor. In Proc. Intelligent Systems and Computer Vision (ISCV). 1–5 (2017).
  • 42.Washington, P. et al. SuperpowerGlass: a wearable aid for the at-home therapy of children with autism. In Proc. ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies1, Article 112, 10.1145/3130977 (2017).
  • 43.Voss, C. et al. Superpower glass: delivering unobtrusive real-time social cues in wearable systems. In Proc. ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct. 1218–1226 (Association for Computing Machinery, 2016).
  • 44.Vahabzadeh A, Keshav NU, Salisbury JP, Sahin NT. Improvement of attention-deficit/hyperactivity disorder symptoms in school-aged children, adolescents, and young adults with autism via a digital smartglasses-based socioemotional coaching aid: short-term, uncontrolled pilot study. JMIR Ment. Health. 2018;5:e25. doi: 10.2196/mental.9631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Leo, M. et al. Automatic emotion recognition in robot-children interaction for ASD treatment. In Proc. IEEE International Conference on Computer Vision Workshop (ICCVW). 537–545 (2015).
  • 46.Pan, Y., Hirokawa, M. & Suzuki, K. Measuring K-degree facial interaction between robot and children with autism spectrum disorders. In Proc. 24th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN). 48–53 (2015).
  • 47.Leo M, et al. Computational analysis of deep visual data for quantifying facial expression production. Appl. Sci. 2019;9:4542. doi: 10.3390/app9214542. [DOI] [Google Scholar]
  • 48.Coco, M. D. et al. A computer vision based approach for understanding emotional involvements in children with autism spectrum disorders. In Proc. IEEE International Conference on Computer Vision Workshops (ICCVW). 1401–1407 (2017).
  • 49.Leo M, et al. Computational assessment of facial expression production in ASD children. Sensors. 2018;18:3993. doi: 10.3390/s18113993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Samad, M. D., Bobzien, J. L., Harrington, J. W. & Iftekharuddin, K. M. Analysis of facial muscle activation in children with autism using 3D imaging. In Proc. IEEE International Conference on Bioinformatics and Biomedicine (BIBM). 337–342 (2015).
  • 51.Leo, M. et al. Towards the automatic assessment of abilities to produce facial expressions: the case study of children with ASD. In Proc. IET Conference 66 (64 pp.) <https://digital-library.theiet.org/content/conferences/10.1049/cp.2018.1675> (2018).
  • 52.Guha T, Yang Z, Grossman RB, Narayanan SS. A computational study of expressive facial dynamics in children with autism. IEEE Trans. Affect. Comput. 2018;9:14–20. doi: 10.1109/TAFFC.2016.2578316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Ahmed, A. A. & Goodwin, M. S. Automated detection of facial expressions during computer-assisted instruction in individuals on the autism spectrum. In Proc. CHI Conference on Human Factors in Computing Systems. 6050–6055 (Association for Computing Machinery, 2017).
  • 54.Harrold, N., Tan, C. T., Rosser, D. & Leong, T. W. CopyMe: a portable real-time feedback expression recognition game for children. In CHI ’14 Extended Abstracts on Human Factors in Computing Systems. 1195–1200 (Association for Computing Machinery, 2014).
  • 55.Harrold, N., Tan, C. T., Rosser, D. & Leong, T. W. CopyMe: an emotional development game for children. In CHI ’14 Extended Abstracts on Human Factors in Computing Systems. 503–506 (Association for Computing Machinery).
  • 56.White SW, et al. Feasibility of automated training for facial emotion expression and recognition in autism. Behav. Ther. 2018;49:881–888. doi: 10.1016/j.beth.2017.12.010. [DOI] [PubMed] [Google Scholar]
  • 57.Garcia-Garcia, J. M., Cabañero, M. d. M., Penichet, V. M. R. & Lozano, M. D. EmoTEA: teaching children with autism spectrum disorder to identify and express emotions. In Proc. XX International Conference on Human Computer Interaction. Article 36 (Association for Computing Machinery).
  • 58.Jain, S., Tamersoy, B., Zhang, Y., Aggarwal, J. K. & Orvalho, V. An interactive game for teaching facial expressions to children with Autism Spectrum Disorders. In Proc. 5th International Symposium on Communications, Control and Signal Processing. 1–4 (2012).
  • 59.Li, B. et al. A facial affect analysis system for autism spectrum disorder. In Proc. IEEE International Conference on Image Processing (ICIP). 4549–4553 (2019).
  • 60.Shukla, P., Gupta, T., Saini, A., Singh, P. & Balasubramanian, R. A Deep Learning frame-work for recognizing developmental disorders. In Proc. IEEE Winter Conference on Applications of Computer Vision (WACV). 705–714 (2017).
  • 61.Tung, K. et al. Eye detection in CSBS-DP evaluation video. In Proc. IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW). 1–2 (2016).
  • 62.Balestra, A. et al. Analyzing text comprehension deficits in autism with eye tracking: a case study. In Proc. 3rd International Conference on Human System Interaction. 230-235.
  • 63.Li, B. et al. Modified DBSCAN algorithm on oculomotor fixation identification. In Proc. Ninth Biennial ACM Symposium on Eye Tracking Research & Applications. 337–338 (Association for Computing Machinery).
  • 64.Matthews, O. et al. Combining trending scan paths with arousal to model visual behaviour on the web: a case study of neurotypical people vs people with autism. In Proc. 27th ACM Conference on User Modeling, Adaptation and Personalization. 86–94 (Association for Computing Machinery).
  • 65.Pierce K, et al. Eye tracking reveals abnormal visual preference for geometric images as an early biomarker of an autism spectrum disorder subtype associated with increased symptom severity. Biol. Psychiatry. 2016;79:657–666. doi: 10.1016/j.biopsych.2015.03.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Murias M, et al. Validation of eye-tracking measures of social attention as a potential biomarker for autism clinical trials. Autism Res. 2018;11:166–174. doi: 10.1002/aur.1894. [DOI] [PubMed] [Google Scholar]
  • 67.Chawarska K, Macari S, Shic F. Decreased spontaneous attention to social scenes in 6-month-old infants later diagnosed with autism spectrum disorders. Biol. Psychiatry. 2013;74:195–203. doi: 10.1016/j.biopsych.2012.11.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Shi L, et al. Different visual preference patterns in response to simple and complex dynamic social stimuli in preschool-aged children with autism spectrum disorders. PLoS One. 2015;10:e0122280. doi: 10.1371/journal.pone.0122280. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Shic F, Bradshaw J, Klin A, Scassellati B, Chawarska K. Limited activity monitoring in toddlers with autism spectrum disorder. Brain Res. 2011;1380:246–254. doi: 10.1016/j.brainres.2010.11.074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Campbell, D. J., Chang, J., Chawarska, K. & Shic, F. Saliency-based Bayesian modeling of dynamic viewing of static scenes. In Proc. Symposium on Eye Tracking Research and Applications. 51–58 (Association for Computing Machinery).
  • 71.Syeda, U. H. et al. Visual face scanning and emotion perception analysis between autistic and typically developing children. In Proc. ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2017 ACM International Symposium on Wearable Computers. 844–853 (Association for Computing Machinery, 2017).
  • 72.Chrysouli, C., Vretos, N. & Daras, P. Affective state recognition based on eye gaze analysis using two–stream convolutional networks. In Proc. IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP). 1–6 (2018).
  • 73.Liu W, Li M, Yi L. Identifying children with autism spectrum disorder based on their face processing abnormality: a machine learning framework. Autism Res. 2016;9:888–898. doi: 10.1002/aur.1615. [DOI] [PubMed] [Google Scholar]
  • 74.Liu, W. et al. Efficient autism spectrum disorder prediction with eye movement: a machine learning framework. In Proc. International Conference on Affective Computing and Intelligent Interaction (ACII). 649-655 (2015).
  • 75.Vu, T. et al. Effective and efficient visual stimuli design for quantitative autism screening: an exploratory study. In Proc. IEEE EMBS International Conference on Biomedical & Health Informatics (BHI). 297–300 (2017).
  • 76.Jiang, M. & Zhao, Q. Learning visual attention to identify people with autism spectrum disorder. In Proc. IEEE International Conference on Computer Vision (ICCV). 3287–3296 (2017).
  • 77.Higuchi, K. et al. Visualizing gaze direction to support video coding of social attention for children with autism spectrum disorder. In Proc. 23rd International Conference on Intelligent User Interfaces. 571–582 (Association for Computing Machinery).
  • 78.Chong, E. et al. Detecting gaze towards eyes in natural social interactions and its use in child assessment. In Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 1, Article 43, 10.1145/3131902 (2017).
  • 79.Toshniwal, S., Dey, P., Rajput, N. & Srivastava, S. VibRein: an engaging and assistive mobile learning companion for students with intellectual disabilities. In Proc. Annual Meeting of the Australian Special Interest Group for Computer Human Interaction. 20–28 (Association for Computing Machinery).
  • 80.Dawson G, et al. Atypical postural control can be detected via computer vision analysis in toddlers with autism spectrum disorder. Sci. Rep. 2018;8:17008. doi: 10.1038/s41598-018-35215-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Martin KB, et al. Objective measurement of head movement differences in children with and without autism spectrum disorder. Mol. Autism. 2018;9:14. doi: 10.1186/s13229-018-0198-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Zunino, A. et al. Video gesture analysis for autism spectrum disorder detection In Proc. 24th International Conference on Pattern Recognition (ICPR). 3421-3426 (2018).
  • 83.Vyas, K. et al. Recognition of atypical behavior in autism diagnosis from video using pose estimation over time. In Proc. IEEE 29th International Workshop on Machine Learning for Signal Processing (MLSP). 1-6 (2019).
  • 84.Piana, S., Malagoli, C., Usai, M. C. & Camurri, A. Effects of computerized emotional training on children with high functioning autism. IEEE Trans. Affect. Comput., 1-1, 10.1109/TAFFC.2019.2916023 (2019).
  • 85.Bartoli, L., Corradi, C., Garzotto, F. & Valoriani, M. Exploring motion-based touchless games for autistic children’s learning. In Proc. 12th International Conference on Interaction Design and Children. 102–111 (Association for Computing Machinery).
  • 86.Ringland, K. et al. SensoryPaint: a natural user interface supporting sensory integration in children with neurodevelopmental disorders. In Proc. Conference on Human Factors in Computing Systems, 10.1145/2559206.2581249 (2014).
  • 87.Magrini, M., Carboni, A., Salvetti, O. & Curzio, O. An auditory feedback based system for treating autism spectrum disorder. In Proc. International Workshop on ICTs for Improving Patients Rehabilitation Research Techniques. 46–58 (Springer).
  • 88.Dickstein-Fischer L, Fischer GS. Combining psychological and engineering approaches to utilizing social robots with children with autism. Conf. Proc. IEEE Eng. Med Biol. Soc. 2014;2014:792–795. doi: 10.1109/EMBC.2014.6943710. [DOI] [PubMed] [Google Scholar]
  • 89.Bekele ET, et al. A step towards developing adaptive robot-mediated intervention architecture (ARIA) for children with autism. IEEE Trans. Neural Syst. Rehabilitation Eng. 2013;21:289–299. doi: 10.1109/TNSRE.2012.2230188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Dimitrova, M., Vegt, N. & Barakova, E. Designing a system of interactive robots for training collaborative skills to autistic children In Proc. 15th International Conference on Interactive Collaborative Learning (ICL). 1–8 (2012).
  • 91.Bryson SE, Zwaigenbaum L, McDermott C, Rombough V, Brian J. The autism observation scale for infants: scale development and reliability data. J. Autism Dev. Disord. 2008;38:731–738. doi: 10.1007/s10803-007-0440-y. [DOI] [PubMed] [Google Scholar]
  • 92.Hashemi, J. et al. A computer vision approach for the assessment of autism-related behavioral markers. In Proc. IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL). 1–7 (2012).
  • 93.Hashemi J, et al. Computer vision tools for low-cost and noninvasive measurement of autism-related behaviors in infants. Autism Res. Treat. 2014;2014:935686. doi: 10.1155/2014/935686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Bidwell, J., Essa, I. A., Rozga, A. & Abowd, G. D. Measuring child visual attention using markerless head tracking from color and depth sensing cameras. In Proc. 16th International Conference on Multimodal Interaction. 447–454 (Association for Computing Machinery).
  • 95.Hashemi, J. et al. A scalable app for measuring autism risk behaviors in young children: a technical validity and feasibility study. In Proc. 5th EAI International Conference on Wireless Mobile Communication and Healthcare. 23–27 (ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering).
  • 96.Campbell K, et al. Computer vision analysis captures atypical attention in toddlers with autism. Autism. 2019;23:619–628. doi: 10.1177/1362361318766247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Hashemi, J. et al. Computer vision analysis for quantification of autism risk behaviors. IEEE Trans. Affect. Comput., 1-1, 10.1109/TAFFC.2018.2868196 (2018). [DOI] [PMC free article] [PubMed]
  • 98.Wang, Z. et al. Screening early children with autism spectrum disorder via response-to-name protocol. IEEE Trans Ind. Inform., 1-1, 10.1109/TII.2019.2958106 (2019).
  • 99.Bovery, M. D. M. J., Dawson, G., Hashemi, J. & Sapiro, G. A scalable off-the-shelf framework for measuring patterns of attention in young children and its application in autism spectrum disorder. IEEE Trans. Affect. Comput., 1–1, 10.1109/TAFFC.2018.2890610 (2018). [DOI] [PMC free article] [PubMed]
  • 100.Rajagopalan, S. S. & Goecke, R. Detecting self-stimulatory behaviours for autism diagnosis. In Proc. IEEE International Conference on Image Processing (ICIP). 1470–1474 (2014).
  • 101.Rajagopalan, S. S., Dhall, A. & Goecke, R. Self-stimulatory behaviours in the wild for autism diagnosis. In Proc. IEEE International Conference on Computer Vision Workshops. 755-761 (2013).
  • 102.Rajagopalan, S. S. Computational behaviour modelling for autism diagnosis. In Proc. 15th ACM on International Conference on Multimodal Interaction. 361–364 (Association for Computing Machinery).
  • 103.Winoto, P., Chen, C. G. & Tang, T. Y. The development of a Kinect-based online socio-meter for users with social and communication skill impairments: a computational sensing approach. In Proc. IEEE International Conference on Knowledge Engineering and Applications (ICKEA). 139–143 (2016).
  • 104.Feil-Seifer, D. & Matarić, M. Using proxemics to evaluate human-robot interaction. In Proc. 5th ACM/IEEE international conference on Human-robot interaction. 143–144 (IEEE Press).
  • 105.Moghadas, M. & Moradi, H. Analyzing human-robot interaction using machine vision for autism screening. In Proc. 6th RSI International Conference on Robotics and Mechatronics (IcRoM). 572–576 (2018).
  • 106.Chen, S. & Zhao, Q. Attention-based autism spectrum disorder screening with privileged modality. In Proc. IEEE/CVF International Conference on Computer Vision (ICCV). 1181–1190 (2019).
  • 107.Wang, Z., Xu, K. & Liu, H. Screening early children with autism spectrum disorder via expressing needs with index finger pointing. In Proc. 13th International Conference on Distributed Smart Cameras. Article 24 (Association for Computing Machinery).
  • 108.Mazzei, D. et al. Robotic social therapy on children with autism: preliminary evaluation through multi-parametric analysis. In Proc. International Conference on Privacy, Security, Risk and Trust and 2012 International Conference on Social Computing. 766–771 (2012).
  • 109.Coco MD, et al. Study of mechanisms of social interaction stimulation in autism spectrum disorder by assisted humanoid robot. IEEE Trans. Cogn. Dev. Syst. 2018;10:993–1004. doi: 10.1109/TCDS.2017.2783684. [DOI] [Google Scholar]
  • 110.Rudovic O, Lee J, Dai M, Schuller B, Picard RW. Personalized machine learning for robot perception of affect and engagement in autism therapy. Sci. Robot. 2018;3:eaao6760. doi: 10.1126/scirobotics.aao6760. [DOI] [PubMed] [Google Scholar]
  • 111.Palestra, G., Varni, G., Chetouani, M. & Esposito, F. A multimodal and multilevel system for robotics treatment of autism in children. In Proc. International Workshop on Social Learning and Multimodal Interaction for Designing Artificial Agents. Article 3 (Association for Computing Machinery).
  • 112.Dickstein-Fischer, L. A., Pereira, R. H., Gandomi, K. Y., Fathima, A. T. & Fischer, G. S. Interactive tracking for robot-assisted autism therapy. In Proc. Companion of the 2017 ACM/IEEE International Conference on Human-Robot Interaction. 107–108 (Association for Computing Machinery).
  • 113.Mehmood F, Ayaz Y, Ali S, Amadeu RDC, Sadia H. Dominance in visual space of ASD children using multi-robot joint attention integrated distributed imitation system. IEEE Access. 2019;7:168815–168827. doi: 10.1109/ACCESS.2019.2951366. [DOI] [Google Scholar]
  • 114.Egger HL, et al. Automatic emotion and attention analysis of young children at home: a ResearchKit autism feasibility study. npj Digit. Med. 2018;1:20. doi: 10.1038/s41746-018-0024-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Peters, C., Hermann, T., Wachsmuth, S. & Hoey, J. Automatic task assistance for people with cognitive disabilities in brushing teeth—a user study with the TEBRA system. ACM Trans. Access. Comput. 5, Article 10, 10.1145/2579700 (2014).
  • 116.Rehg, J. M. et al. Decoding children’s social behavior. In Proc. IEEE Conference on Computer Vision and Pattern Recognition. 3414–3421 (2013).
  • 117.Liu, W., Zhou, T., Zhang, C., Zou, X. & Li, M. Response to name: a dataset and a multimodal machine learning framework towards autism study. In Proc. Seventh International Conference on Affective Computing and Intelligent Interaction (ACII). 178–183 (2017).
  • 118.Marinoiu, E., Zanfir, M., Olaru, V. & Sminchisescu, C. 3D Human sensing, action and emotion recognition in robot assisted therapy of children with autism. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2158–2167 (2018).
  • 119.Schwarzkopf DS, Anderson EJ, de Haas B, White SJ, Rees G. Larger extrastriate population receptive fields in autism spectrum disorders. J. Neurosci. 2014;34:2713. doi: 10.1523/JNEUROSCI.4416-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Di Martino A, et al. The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autism. Mol. Psychiatry. 2014;19:659–667. doi: 10.1038/mp.2013.78. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Di Martino A, et al. Enhancing studies of the connectome in autism using the autism brain imaging data exchange II. Sci. Data. 2017;4:170010. doi: 10.1038/sdata.2017.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Hazlett HC, et al. Magnetic resonance imaging and head circumference study of brain size in autism: birth through age 2 years. Arch. Gen. Psychiatry. 2005;62:1366–1376. doi: 10.1001/archpsyc.62.12.1366. [DOI] [PubMed] [Google Scholar]
  • 123.Baird, A. et al. Automatic classification of autistic child vocalisations: a novel database and results. In Proc. Interspeech 2017 849–853 (2017).
  • 124.Duan, H. et al. A dataset of eye movements for the children with autism spectrum disorder. In Proc. 10th ACM Multimedia Systems Conference. 255–260 (Association for Computing Machinery).
  • 125.Soomro, K., Zamir, A. R., & Shah, M. UCF101: A dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402 (2012).
  • 126.Blank, M., Gorelick, L., Shechtman, E., Irani, M. & Basri, R. Actions as space-time shapes. In Proc. Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1. 1395–1402 (2005).
  • 127.Kanade, T., Cohn, J. F. & Yingli, T. Comprehensive database for facial expression analysis. In Proc. Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580). 46–53.
  • 128.Lucey, P. et al. The Extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In Proc. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition—Workshops. 94–101.
  • 129.Phillips PJ, Wechsler H, Huang J, Rauss PJ. The FERET database and evaluation procedure for face-recognition algorithms. Image Vis. Comput. 1998;16:295–306. doi: 10.1016/S0262-8856(97)00070-X. [DOI] [Google Scholar]
  • 130.Marszalek, M., Laptev, I. & Schmid, C. Actions in context. In Proc. IEEE Conference on Computer Vision and Pattern Recognition. 2929–2936 (2009).
  • 131.Le, V., Brandt, J., Lin, Z., Bourdev, L. & Huang, T. S. Interactive Facial Feature Localization. 679–692 (Springer Berlin Heidelberg).
  • 132.Liu, Z., Luo, P., Wang, X. & Tang, X. Deep learning face attributes in the wild. In Proc. IEEE International Conference on Computer Vision. 3730–3738.
  • 133.Mollahosseini A, Hasani B, Mahoor MH. AffectNet: a database for facial expression, valence, and arousal computing in the wild. IEEE Trans. Affect. Comput. 2019;10:18–31. doi: 10.1109/TAFFC.2017.2740923. [DOI] [Google Scholar]
  • 134.Benitez-Quiroz, C. F., Srinivasan, R. & Martinez, A. M. EmotioNet: an accurate, real-time algorithm for the automatic annotation of a million facial expressions in the wild. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 5562–5570 (2016).
  • 135.Lijun, Y., Xiaozhou, W., Yi, S., Jun, W. & Rosato, M. J. A 3D facial expression database for facial behavior research. In Proc. 7th International Conference on Automatic Face and Gesture Recognition (FGR06). 211–216.
  • 136.Mittal A, Zisserman A, Torr PH. Hand detection using multiple proposals. BMVC. 2011;2:5. [Google Scholar]
  • 137.Bambach, S., Lee, S., Crandall, D. J. & Yu, C. Lending a hand: detecting hands and recognizing activities in complex egocentric interactions. In Proc. IEEE International Conference on Computer Vision. 1949–1957 (2015). [DOI] [PMC free article] [PubMed]
  • 138.Thabtah F. Machine learning in autistic spectrum disorder behavioral research: a review and ways forward. Inform. Health Soc. Care. 2019;44:278–297. doi: 10.1080/17538157.2017.1399132. [DOI] [PubMed] [Google Scholar]
  • 139.Yin, L., Chen, X., Sun, Y., Worm, T. & Reale, M. A high-resolution 3D dynamic facial expression database. In Proc. 8th IEEE International Conference on Automatic Face & Gesture Recognition. 1–6 (2008).
  • 140.Savran, A. et al. Bosphorus Database for 3D Face Analysis. 47–56 (Springer Berlin Heidelberg).
  • 141.Sim, T., Baker, S. & Bsat, M. The CMU pose, illumination, and expression (PIE) database. In Proc. 5th IEEE International Conference on Automatic Face Gesture Recognition. 53–58.
  • 142.Lyons, M., Akamatsu, S., Kamachi, M. & Gyoba, J. Coding facial expressions with Gabor wavelets. In Proc. Third IEEE International Conference on Automatic Face and Gesture Recognition. 200–205.
  • 143.Pantic, M., Valstar, M., Rademaker, R. & Maat, L. Web-based database for facial expression analysis. In Proc. IEEE International Conference on Multimedia and Expo. 5 pp (2005).
  • 144.Yi, L. et al. Abnormality in face scanning by children with autism spectrum disorder is limited to the eye region: evidence from multi-method analyses of eye tracking data. J. Vis. 13, 10.1167/13.10.5 (2013). [DOI] [PMC free article] [PubMed]
  • 145.Yi L, et al. Do individuals with and without autism spectrum disorder scan faces differently? A new multi-method look at an existing controversy. Autism Res. 2014;7:72–83. doi: 10.1002/aur.1340. [DOI] [PubMed] [Google Scholar]
  • 146.Wang S, et al. A typical visual saliency in autism spectrum disorder quantified through model-based eye tracking. Neuron. 2015;88:604–616. doi: 10.1016/j.neuron.2015.09.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Rudovic, O., Lee, J., Mascarell-Maricic, L., Schuller, B. W. & Picard, R. W. Measuring engagement in robot-assisted autism therapy: a cross-cultural study. Front. Robot. AI4, 10.3389/frobt.2017.00036 (2017).
  • 148.Baltrusaitis, T., Robinson, P. & Morency, L.-P. Constrained local neural fields for robust facial landmark detection in the wild. In Proc. IEEE International Conference on Computer Vision Workshops. 354–361.
  • 149.Palestra, G., Pettinicchio, A., Del Coco, M., Carcagnì, P., Leo, M., Distante, C. Improved Performance in Facial Expression Recognition Using 32 Geometric Features. In Image Analysis and Processing—ICIAP 2015 (eds Murino V. & Puppo E.) ICIAP 2015. Lecture Notes in Computer Science, vol 9280. (Springer, Cham, 2015). 10.1007/978-3-319-23234-8_48.
