Sleepiness detection from speech by perceptual features

We propose a two-class classification scheme with a small number of features for sleepiness detection. Unlike conventional methods that rely on the linguistic content of speech, we work with prosodic features extracted by psychoacoustic masking in the spectral and temporal domains. Our features also model the variations between non-sleepy and sleepy modes in a quasi-continuum space with the help of code words learned by a bag-of-features scheme. Together, these properties improve the unweighted recall rates for unseen speakers and minimize language dependence. Recall rates based on the Karolinska Sleepiness Scale (KSS), reported for Support Vector Machine and Learning Vector Quantization classifiers, show that the developed system enables efficient sleepiness monitoring with lower complexity than the benchmark results reported for the Sleepy Language Corpus.
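The unweighted recall mentioned above is commonly computed as the mean of the per-class recalls, so that the sleepy and non-sleepy classes count equally even when one class has far fewer samples. The following is a minimal sketch under that assumption; the function name and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; every class gets equal weight."""
    hits = defaultdict(int)    # correctly predicted samples per true class
    totals = defaultdict(int)  # number of samples per true class
    for t, p in zip(y_true, y_pred):
        totals[t] += 1
        if t == p:
            hits[t] += 1
    recalls = [hits[c] / totals[c] for c in totals]
    return sum(recalls) / len(recalls)


# Hypothetical toy labels: 1 = sleepy, 0 = non-sleepy (imbalanced on purpose).
y_true = [0, 0, 0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 0, 0, 1, 1, 0]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.3f}, accuracy = {accuracy:.3f}")
```

In this toy case the plain accuracy is higher than the unweighted recall because the majority class dominates the accuracy, which is why an unweighted measure is often preferred for imbalanced two-class tasks.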
