Sparsifying machine learning models identify stable subsets of predictive features for behavioral detection of autism

BackgroundAutism spectrum disorder (ASD) diagnosis can be delayed due in part to the time required for administration of standard exams, such as the Autism Diagnostic Observation Schedule (ADOS). Shorter and potentially mobilized approaches would help to alleviate bottlenecks in the healthcare system. Previous work using machine learning suggested that a subset of the behaviors measured by ADOS can achieve clinically acceptable levels of accuracy. Here we expand on this initial work to build sparse models that have higher potential to generalize to the clinical population.MethodsWe assembled a collection of score sheets for two ADOS modules, one for children with phrased speech (Module 2; 1319 ASD cases, 70 controls) and the other for children with verbal fluency (Module 3; 2870 ASD cases, 273 controls). We used sparsity/parsimony enforcing regularization techniques in a nested cross validation grid search to select features for 17 unique supervised learning models, encoding missing values as additional indicator features. We augmented our feature sets with gender and age to train minimal and interpretable classifiers capable of robust detection of ASD from non-ASD.ResultsBy applying 17 unique supervised learning methods across 5 classification families tuned for sparse use of features and to be within 1 standard error of the optimal model, we find reduced sets of 10 and 5 features used in a majority of models. We tested the performance of the most interpretable of these sparse models, including Logistic Regression with L2 regularization or Linear SVM with L1 regularization. We obtained an area under the ROC curve of 0.95 for ADOS Module 3 and 0.93 for ADOS Module 2 with less than or equal to 10 features.ConclusionsThe resulting models provide improved stability over previous machine learning efforts to minimize the time complexity of autism detection due to regularization and a small parameter space. These robustness techniques yield classifiers that are sparse, interpretable and that have potential to generalize to alternative modes of autism screening, diagnosis and monitoring, possibly including analysis of short home videos.

[1]  Dennis P. Wall,et al.  The Potential of Accelerating Early Detection of Autism through Content Analysis of YouTube Videos , 2014, PloS one.

[2]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[3]  C. Lord,et al.  The Autism Diagnostic Observation Schedule: Revised Algorithms for Improved Diagnostic Validity , 2007, Journal of autism and developmental disorders.

[4]  D. Wall,et al.  Use of machine learning for behavioral distinction of autism and ADHD , 2016, Translational Psychiatry.

[5]  D. Wall,et al.  Searching for a minimal set of behaviors for autism detection through feature selection-based machine learning , 2015, Translational Psychiatry.

[6]  Todd F. DeLuca,et al.  Use of machine learning to shorten observation-based screening and diagnosis of autism , 2012, Translational Psychiatry.

[7]  D. Wall,et al.  Use of Artificial Intelligence to Shorten the Behavioral Diagnosis of Autism , 2012, PloS one.

[8]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[9]  J. Pandey,et al.  Diagnostic Stability in Very Young Children with Autism Spectrum Disorders , 2008, Journal of autism and developmental disorders.

[10]  B. Leventhal,et al.  The Autism Diagnostic Observation Schedule—Generic: A Standard Measure of Social and Communication Deficits Associated with the Spectrum of Autism , 2000, Journal of autism and developmental disorders.

[11]  C. Molloy,et al.  Use of the Autism Diagnostic Observation Schedule (ADOS) in a clinical setting , 2011, Autism : the international journal of research and practice.

[12]  Alan Emond,et al.  Autism spectrum disorder and autistic traits in the Avon Longitudinal Study of Parents and Children: precursors and early signs. , 2012, Journal of the American Academy of Child and Adolescent Psychiatry.

[13]  Raphael Bernier,et al.  Psychopathology, families, and culture: autism. , 2010, Child and adolescent psychiatric clinics of North America.

[14]  Maureen S. Durkin,et al.  Prevalence and Characteristics of Autism Spectrum Disorder Among 4-Year-Old Children in the Autism and Developmental Disabilities Monitoring Network , 2016, Journal of developmental and behavioral pediatrics : JDBP.

[15]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[16]  D. Wall,et al.  Testing the accuracy of an observation-based classifier for rapid detection of autism risk , 2014, Translational Psychiatry.

[17]  Jon Baio,et al.  Examination of the Time Between First Evaluation and First Autism Spectrum Diagnosis in a Population-based Sample , 2006, Journal of developmental and behavioral pediatrics : JDBP.