Feature replacement methods enable reliable home video analysis for machine learning detection of autism

Autism Spectrum Disorder is a neuropsychiatric condition affecting 53 million children worldwide and for which early diagnosis is critical to the outcome of behavior therapies. Machine learning applied to features manually extracted from readily accessible videos (e.g., from smartphones) has the potential to scale this diagnostic process. However, nearly unavoidable variability in video quality can lead to missing features that degrade algorithm performance. To manage this uncertainty, we evaluated the impact of missing values and feature imputation methods on two previously published autism detection classifiers, trained on standard-of-care instrument scoresheets and tested on ratings of 140 children videos from YouTube. We compare the baseline method of listwise deletion to classic univariate and multivariate techniques. We also introduce a feature replacement method that, based on a score, selects a feature from an expanded dataset to fill-in the missing value. The replacement feature selected can be identical for all records (general) or automatically adjusted to the record considered (dynamic). Our results show that general and dynamic feature replacement methods achieve a higher performance than classic univariate and multivariate methods, supporting the hypothesis that algorithmic management can maintain the fidelity of video-based diagnostics in the face of missing values and variable video quality.

[1]  Jon Baio,et al.  Examination of the Time Between First Evaluation and First Autism Spectrum Diagnosis in a Population-based Sample , 2006, Journal of developmental and behavioral pediatrics : JDBP.

[2]  Z. Warren,et al.  Prevalence of autism spectrum disorder among children aged 8 years - autism and developmental disabilities monitoring network, 11 sites, United States, 2010. , 2014, Morbidity and mortality weekly report. Surveillance summaries.

[3]  Sylvia Soldatou,et al.  Publisher Correction: Morphological, genotypic and metabolomic signatures confirm interfamilial hybridization between the ubiquitous kelps Macrocystis (Arthrothamnaceae) and Lessonia (Lessoniaceae) , 2020, Scientific Reports.

[4]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[5]  T. Vos,et al.  The epidemiology and global burden of autism spectrum disorders , 2014, Psychological Medicine.

[6]  D. Wall,et al.  Use of machine learning for behavioral distinction of autism and ADHD , 2016, Translational Psychiatry.

[7]  D. Wall,et al.  Testing the accuracy of an observation-based classifier for rapid detection of autism risk , 2014, Translational Psychiatry.

[8]  Harvey A. Whiteford,et al.  Developmental disabilities among children younger than 5 years in 195 countries and territories, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016 , 2018, The Lancet. Global health.

[9]  Mohd Bakri Adam,et al.  Effect of missing value methods on Bayesian network classification of hepatitis data , 2013 .

[10]  Peter Washington,et al.  Labeling images with facial emotion and the potential for pediatric healthcare , 2019, Artif. Intell. Medicine.

[11]  Stephen J. Blumberg,et al.  Trends in the Prevalence of Developmental Disabilities in US Children, 1997–2008 , 2011, Pediatrics.

[12]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[13]  Haik Kalantarian,et al.  Feature Selection and Dimension Reduction of Social Autism Data , 2019, PSB.

[14]  Peter Washington,et al.  Effect of Wearable Digital Intervention for Improving Socialization in Children With Autism Spectrum Disorder: A Randomized Clinical Trial , 2019, JAMA pediatrics.

[15]  Yordan Penev,et al.  Precision Telemedicine through Crowdsourced Machine Learning: Testing Variability of Crowd Workers for Video-Based Autism Feature Recognition , 2020, Journal of personalized medicine.

[16]  Sebastien Levy,et al.  Sparsifying machine learning models identify stable subsets of predictive features for behavioral detection of autism , 2017, Molecular Autism.

[17]  D. Wall,et al.  Clinical Evaluation of a Novel and Mobile Autism Risk Assessment , 2016, Journal of Autism and Developmental Disorders.

[18]  Peter Washington,et al.  Detecting Developmental Delay and Autism Through Machine Learning Models Using Home Videos of Bangladeshi Children: Development and Validation Study , 2019, Journal of medical Internet research.

[19]  M. E. Warfield,et al.  Unmet Need and Problems Accessing Core Health Care Services for Children with Autism Spectrum Disorder , 2012, Maternal and Child Health Journal.

[20]  Peter Washington,et al.  Feasibility Testing of a Wearable Behavioral Aid for Social Learning in Children with Autism , 2018, Applied Clinical Informatics.

[21]  Vaibhav A. Narayan,et al.  Mobile and pervasive computing technologies and the future of Alzheimer’s clinical trials , 2018, npj Digital Medicine.

[22]  Catalin Voss,et al.  The Potential for Machine Learning-Based Wearables to Improve Socialization in Teenagers and Adults With Autism Spectrum Disorder-Reply. , 2019, JAMA pediatrics.

[23]  A. Couteur,et al.  Autism Diagnostic Interview-Revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders , 1994, Journal of autism and developmental disorders.

[24]  Peter Washington,et al.  Exploratory study examining the at-home feasibility of a wearable tool for social-affective learning in children with autism , 2018, npj Digital Medicine.

[25]  Peter Washington,et al.  SuperpowerGlass , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[26]  Georgina Peacock,et al.  Whittling Down the Wait Time: Exploring Models to Minimize the Delay from Initial Concern to Diagnosis and Treatment of Autism Spectrum Disorder. , 2016, Pediatric clinics of North America.

[27]  Peter Washington,et al.  A Gamified Mobile System for Crowdsourcing Video for Autism Research , 2018, 2018 IEEE International Conference on Healthcare Informatics (ICHI).

[28]  A. Thallaj,et al.  Guess what? , 2011, Saudi journal of anaesthesia.

[29]  Dennis P. Wall,et al.  Machine learning for early detection of autism (and other conditions) using a parental questionnaire and home video screening , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[30]  M. Mildner,et al.  Re-epithelialization and immune cell behaviour in an ex vivo human skin model , 2020, Scientific Reports.

[31]  A. Mack,et al.  Trends in the Prevalence of Developmental Disabilities in US Children, 1997–2008 , 2012 .

[32]  Dennis P. Wall,et al.  Machine learning approach for early detection of autism by combining questionnaire and home video screening , 2017, J. Am. Medical Informatics Assoc..

[33]  Peter Washington,et al.  Guess What? , 2018, Journal of Healthcare Informatics Research.

[34]  C. Lord,et al.  Austism diagnostic observation schedule: A standardized observation of communicative and social behavior , 1989, Journal of autism and developmental disorders.

[35]  The Simons,et al.  Simons Variation in Individuals Project (Simons VIP): A Genetics-First Approach to Studying Autism Spectrum and Related Neurodevelopmental Disorders , 2012, Neuron.

[36]  C. Lord,et al.  The Simons Simplex Collection: A Resource for Identification of Autism Genetic Risk Factors , 2010, Neuron.

[37]  Peter Washington,et al.  The Performance of Emotion Classifiers for Children With Parent-Reported Autism: Quantitative Feasibility Study , 2020, JMIR mental health.

[38]  Dennis P. Wall,et al.  The Quantified Brain: A Framework for Mobile Device-Based Assessment of Behavior and Neurological Function , 2016, Applied Clinical Informatics.

[39]  W. Mandy,et al.  What Is the Male-to-Female Ratio in Autism Spectrum Disorder? A Systematic Review and Meta-Analysis. , 2017, Journal of the American Academy of Child and Adolescent Psychiatry.

[40]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[41]  Peter Washington,et al.  Validity of Online Screening for Autism: Crowdsourcing Study Comparing Paid and Unpaid Diagnostic Tasks , 2019, Journal of medical Internet research.

[42]  Dennis P. Wall,et al.  The Potential of Accelerating Early Detection of Autism through Content Analysis of YouTube Videos , 2014, PloS one.

[43]  S. Spence,et al.  The autism genetic resource exchange: a resource for the study of autism and related neuropsychiatric conditions. , 2001, American journal of human genetics.

[44]  Wei Bao,et al.  Prevalence and Treatment Patterns of Autism Spectrum Disorder in the United States, 2016 , 2019, JAMA pediatrics.

[45]  Todd F. DeLuca,et al.  Use of machine learning to shorten observation-based screening and diagnosis of autism , 2012, Translational Psychiatry.

[46]  Peter Washington,et al.  A Mobile Game for Automatic Emotion-Labeling of Images , 2020, IEEE Transactions on Games.

[47]  Peter Washington,et al.  Mobile detection of autism through machine learning on home video: A development and prospective validation study , 2018, PLoS medicine.

[48]  Stephen J. Blumberg,et al.  The Prevalence of Parent-Reported Autism Spectrum Disorder Among US Children , 2018, Pediatrics.

[49]  D. Wall,et al.  Crowdsourced validation of a machine-learning classification system for autism and ADHD , 2017, Translational Psychiatry.

[50]  Dennis P. Wall,et al.  A Low Rank Model for Phenotype Imputation in Autism Spectrum Disorder , 2018, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[51]  D. Wall,et al.  Searching for a minimal set of behaviors for autism detection through feature selection-based machine learning , 2015, Translational Psychiatry.

[52]  Natalia Kliewer,et al.  Identifying predictive features of autism spectrum disorders in a clinical sample of adolescents and adults using machine learning , 2020, Scientific Reports.

[53]  D. Wall,et al.  Use of Artificial Intelligence to Shorten the Behavioral Diagnosis of Autism , 2012, PloS one.

[54]  Matthew J. McAuliffe,et al.  Sharing Heterogeneous Data: The National Database for Autism Research , 2012, Neuroinformatics.

[55]  Edgar Acuña,et al.  The Treatment of Missing Values and its Effect on Classifier Accuracy , 2004 .

[56]  Terry Winograd,et al.  Toward Continuous Social Phenotyping: Analyzing Gaze Patterns in an Emotion Recognition Task for Children With Autism Through Wearable Smart Glasses , 2020, Journal of medical Internet research.

[57]  Peter Washington,et al.  A Wearable Social Interaction Aid for Children with Autism , 2016, CHI Extended Abstracts.

[58]  Haik Kalantarian,et al.  Data-Driven Diagnostics and the Potential of Mobile Artificial Intelligence for Digital Therapeutic Phenotyping in Computational Psychiatry. , 2019, Biological psychiatry. Cognitive neuroscience and neuroimaging.

[59]  Dennis P. Wall,et al.  Multi-modular AI Approach to Streamline Autism Diagnosis in Young Children , 2020, Scientific Reports.

[60]  Matthew S. Goodwin,et al.  Applying Machine Learning to Facilitate Autism Diagnostics: Pitfalls and Promises , 2014, Journal of Autism and Developmental Disorders.

[61]  Hannah Zeive,et al.  Autism Spectrum Disorder in the United States , 2013 .

[62]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[63]  Amelia Ritahani Ismail,et al.  Performance Analysis Of Machine Learning Algorithms For Missing Value Imputation , 2018 .

[64]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .