Sleepiness detection on read speech using simple features

This paper is about automatic sleepiness state detection using speech samples. Following previous research carried out for the Interspeech 2011 challenge, we use the Sleepy Language Corpus (SLC) for our experiments. However, as we are willing to record our own subjects within a collaboration project with the Bordeaux hospital, we focus only on the read speech samples of that database. Furthermore, we are looking for understandable cues that can guide clinicians to provide a diagnostic. Hence, we devised a set of meaningful features that are close to the signal and restrict the feature selection process to methods that do not use feature combinations. Thus, using simple correlations and a grid search procedure on the training and development parts of the database, we selected a final set of 23 features, reaching a performance on par with state-of-the-art systems. A discussion is proposed on the subjective ground truth used for the boundary between sleepy and non sleepy speech in this database. Finally, we discuss on the interpretation of the features and provide hints on the physiological causes.

[1]  Jarek Krajewski,et al.  Sleepiness detection from speech by perceptual features , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Nicholas Cummins,et al.  Speech analysis for health: Current state-of-the-art and the increasing impact of deep learning. , 2018, Methods.

[3]  Björn W. Schuller,et al.  Cross-Domain Classification of Drowsiness in Speech: The Case of Alcohol Intoxication and Sleep Deprivation , 2017, INTERSPEECH.

[4]  Mo Chen,et al.  Feature Fusion for the Detection of Microsleep Events , 2007, J. VLSI Signal Process..

[5]  George Christodoulides Effects of cognitive load on speech production and perception , 2016 .

[6]  Jean-Luc Rouas,et al.  Automatic Classification of Phonation Modes in Singing Voice: Towards Singing Style Characterisation and Application to Ethnomusicological Recordings , 2016, INTERSPEECH.

[7]  Sebastian Schnieder,et al.  Large Sleepy Reading Corpus (LSRC): Applying Read Speech for Detecting Sleepiness , 2016, ITG Symposium on Speech Communication.

[8]  Haizhou Li,et al.  Analysis and detection of speech under sleep deprivation , 2006, INTERSPEECH.

[9]  Oliver Keelan,et al.  Feature Engineering and Machine Learning for Driver Sleepiness Detection , 2017 .

[10]  Björn W. Schuller,et al.  The INTERSPEECH 2011 Speaker State Challenge , 2011, INTERSPEECH.

[11]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[12]  Björn W. Schuller,et al.  Medium-term speaker states - A review on intoxication, sleepiness and the first challenge , 2014, Comput. Speech Lang..

[13]  Aurobinda Routray,et al.  A novel drowsiness detection scheme based on speech analysis with validation using simultaneous EEG recordings , 2010, 2010 IEEE International Conference on Automation Science and Engineering.

[14]  Keng-Hao Chang,et al.  The effect of sleep deprivation on vocal expression of emotion in adolescents and adults. , 2011, Sleep.

[16]  François Pellegrino,et al.  Automatic language identification: an alternative approach to phonetic modelling , 2000, Signal Process..

[17]  S. D. Jong SIMPLS: an alternative approach to partial least squares regression , 1993 .

[18]  John Kane,et al.  COVAREP — A collaborative voice analysis repository for speech technologies , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Jarek Krajewski,et al.  Medium term speaker state detection by perceptually masked spectral features , 2015, Speech Communication.

[20]  Hideki Kashioka,et al.  Feature normalization and selection for robust speaker state recognition , 2011, 2011 International Conference on Speech Database and Assessments (Oriental COCOSDA).

[21]  Martin Golz,et al.  Acoustic sleepiness detection: Framework and validation of a speech-adapted pattern recognition approach , 2009, Behavior research methods.

[22]  Shuzhi Sam Ge,et al.  Speaker state classification based on fusion of asymmetric simple partial least squares (SIMPLS) and support vector machines , 2014, Comput. Speech Lang..

[23]  H.P. Greeley,et al.  Detecting Fatigue From Voice Using Speech Recognition , 2006, 2006 IEEE International Symposium on Signal Processing and Information Technology.

[24]  Björn W. Schuller,et al.  Applying multiple classifiers and non-linear dynamics features for detecting sleepiness from speech , 2012, Neurocomputing.

[25]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality. , 1994, Journal of speech and hearing research.