Machine learning, waveform preprocessing and feature extraction methods for classification of acoustic startle waveforms

The acoustic startle response (ASR) is an involuntary muscle reflex that occurs in response to a transient loud sound and is a highly-utilized method of assessing hearing status in animal models. Currently, a high level of variability exists in the recording and interpretation of ASRs due to the lack of standardization for collecting and analyzing these measures. An ensembled machine learning model was trained to predict whether an ASR waveform is a startle or non-startle using highly-predictive features extracted from normalized ASR waveforms collected from young adult CBA/CaJ mice. Features were extracted from the normalized waveform as well as the power spectral density estimates and continuous wavelet transforms of the normalized waveform. Machine learning models utilizing methods from different families of algorithms were individually trained and then ensembled together, resulting in an extremely robust model.• ASR waveforms were normalized using the mean and standard deviation computed before the startle elicitor was presented• 9 machine learning algorithms from 4 different families of algorithms were individually trained using features extracted from the normalized ASR waveforms• Trained machine learning models were ensembled to produce an extremely robust classifier

[1]  Side He,et al.  A comparison of the wavelet and short-time fourier transforms for Doppler spectral analysis. , 2003, Medical engineering & physics.

[2]  J. Turner,et al.  Exposure to an augmented acoustic environment alters auditory function in hearing-impaired DBA/2J mice , 1998, Hearing Research.

[3]  J. Zarate,et al.  Provisional Mapping of Quantitative Trait Loci Modulating the Acoustic Startle Response and Prepulse Inhibition of Acoustic Startle , 2002, Neuropsychopharmacology.

[4]  Jennifer E. Jeskey,et al.  Modulation of prepulse inhibition by an augmented acoustic environment in DBA/2J mice. , 2000, Behavioral neuroscience.

[5]  José Antonio Lozano,et al.  Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Wlodzislaw Duch,et al.  Feature Selection for High-Dimensional Data - A Pearson Redundancy Based Filter , 2008, Computer Recognition Systems 2.

[7]  Ryan J. Longenecker,et al.  Automated classification of acoustic startle reflex waveforms in young CBA/CaJ mice using machine learning , 2020, Journal of Neuroscience Methods.

[8]  Rich Caruana,et al.  Ensemble selection from libraries of models , 2004, ICML.

[9]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[10]  Huan Liu,et al.  Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution , 2003, ICML.

[11]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[12]  R. Frisina,et al.  Long-term treatment with aldosterone slows the progression of age-related hearing loss , 2016, Hearing Research.

[13]  Max Kuhn,et al.  caret: Classification and Regression Training , 2015 .

[14]  Max Kuhn,et al.  Applied Predictive Modeling , 2013 .

[15]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[16]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[17]  Marios Michailidis Investigating machine learning methods in recommender systems , 2017 .