Feature Selection With Weighted Importance Index in an Autism Spectrum Disorder Study

Abstract: Elastic net regularization is a popular statistical tool for variable selection that combines the lasso and ridge regression penalties. Used in combination with ensemble methods, it yields more stable estimates and increases confidence in the results. We proposed and tested a version of this method that takes a measure of each model's goodness of fit into account and produces an importance estimate for each feature, weighted by this measure. The method was applied to an autism spectrum disorder (ASD) study to select a subset of biosensor-based features that can be used to predict the clinical scores of study participants. In this study, the participants' responses to visual and audio stimuli were captured by the Janssen Autism Knowledge Engine (JAKE®) biosensors and used to construct approximately 50,000 features. We examined how well changes in these features mirrored changes in the Social Responsiveness Scale (SRS), a quantitative assessment of ASD individuals by clinicians. As a result, we isolated the feature changes most strongly associated with changes in SRS and built predictive models using these features.
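The idea of a fit-weighted importance index can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the weighting rule below is an assumption: each ensemble member votes for the features it retained (nonzero coefficient), and each vote is weighted by that member's goodness of fit, with non-positive fits clipped to zero. The function name `weighted_importance`, the feature names, and the toy numbers are all hypothetical.

```python
from typing import Dict, List


def weighted_importance(
    coefs: List[Dict[str, float]],
    fit_scores: List[float],
) -> Dict[str, float]:
    """Fit-weighted selection frequency of each feature across an ensemble.

    coefs[m] maps feature name -> coefficient in model m (0.0 = not selected).
    fit_scores[m] is a goodness-of-fit measure for model m (e.g. held-out R^2).
    """
    if len(coefs) != len(fit_scores):
        raise ValueError("need exactly one fit score per model")

    # Assumption: models with non-positive fit contribute no weight.
    weights = [max(score, 0.0) for score in fit_scores]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("no model has a positive fit score")

    features = sorted({name for model in coefs for name in model})
    index = {}
    for name in features:
        # Sum the weights of the models that kept this feature.
        selected_weight = sum(
            w for model, w in zip(coefs, weights) if model.get(name, 0.0) != 0.0
        )
        index[name] = selected_weight / total
    return index


# Hypothetical toy ensemble of three sparse models and their fit scores.
models = [
    {"pupil_delta": 0.8, "gaze_delta": 0.0},
    {"pupil_delta": 0.5, "gaze_delta": 0.3},
    {"pupil_delta": 0.0, "gaze_delta": 0.4},
]
scores = [0.5, 0.25, 0.25]
index = weighted_importance(models, scores)
for name, value in sorted(index.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {value:.2f}")
```

With equal fit scores this reduces to plain selection frequency; unequal scores shift importance toward features retained by the better-fitting models. In practice the coefficient dictionaries would come from elastic net fits on resampled data, which this sketch leaves out.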
