Detecting self-stimulatory behaviours for autism diagnosis

Autism Spectrum Disorders (ASD), often referred to as autism, are neurological disorders characterised by deficits in cognitive skills, social and communicative behaviours. A common way of diagnosing ASD is by studying behavioural cues expressed by the children. An algorithm for detecting three types of self-stimulatory behaviours from publicly available unconstrained videos is proposed here. The child's body is tracked in the video by a careful selection of poselet bounding box predictions using a nearest neighbour algorithm. A global motion descriptor - Histogram of Dominant Motions (HDM) - is computed using the dominant motion flow in the detected body regions. The motion model built using this descriptor is used for detecting the self-stimulatory behaviours. Experiments conducted on the recently released unconstrained SSBD video dataset show significant improvement in detection accuracy over the baseline approach. The robustness of the method is validated using benchmark action recognition datasets. The proposed poselet bounding box selection algorithm is validated against the ground truth annotation data provided with the UCF101 dataset.

[1]  Vassilios Morellas,et al.  A computer vision approach for the assessment of autism-related behavioral markers , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[2]  James M. Rehg Behavior Imaging: Using Computer Vision to Study Autism , 2011, MVA.

[3]  R. Vidal,et al.  Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Ronen Basri,et al.  Actions as Space-Time Shapes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Roland Göcke,et al.  Self-Stimulatory Behaviours in the Wild for Autism Diagnosis , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[6]  Mubarak Shah,et al.  UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.

[7]  James M. Rehg,et al.  Decoding Children's Social Behavior , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Agata Rozga,et al.  Automatic assessment of problem behavior in individuals with developmental disabilities , 2012, UbiComp.

[9]  Peter Robinson,et al.  Automatic multimodal descriptors of rhythmic body movement , 2013, ICMI '13.

[10]  J. Baio Morbidity and Mortality Weekly Report Prevalence of Autism Spectrum Disorders — Autism and Developmental Disabilities Monitoring Network, Six Sites, United States, 2000; Prevalence of Autism Spectrum Disorders — Autism and Developmental Disabilities Monitoring Network, 14 Sites, United States, 2002; , 2007 .

[11]  Gregory D. Abowd,et al.  Recognizing mimicked autistic self-stimulatory behaviors using HMMs , 2005, Ninth IEEE International Symposium on Wearable Computers (ISWC'05).

[12]  J. Baio Prevalence of autism spectrum disorders--Autism and Developmental Disabilities Monitoring Network, 14 sites, United States, 2008. , 2012, Morbidity and mortality weekly report. Surveillance summaries.

[13]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[14]  Lonnie Zwaigenbaum,et al.  Early identification of autism spectrum disorders , 2013, Behavioural Brain Research.

[15]  Craig Newschaffer,et al.  Timing of identification among children with an autism spectrum disorder: findings from a population-based surveillance study. , 2009, Journal of the American Academy of Child and Adolescent Psychiatry.

[16]  Ivan Laptev,et al.  On Space-Time Interest Points , 2005, International Journal of Computer Vision.

[17]  Jitendra Malik,et al.  Poselets: Body part detectors trained using 3D human pose annotations , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[18]  Cordelia Schmid,et al.  DeepFlow: Large Displacement Optical Flow with Deep Matching , 2013, 2013 IEEE International Conference on Computer Vision.