Analysis of Speech Under Stress and Cognitive Load in USAR Operations

This paper presents ongoing work on analysis of speech under stress and cognitive load in speech recordings of Urban Search and Rescue (USAR) training operations. During the training operations several team members communicate with other members on the field and members on the control command using only one radio channel. The type of stress encountered in the USAR domain, more specifically on the human team communication, includes both physical or psychological stress and cognitive task load. Physical stress due to the real situation and cognitive task load due to tele-operation of robots and equipment. We were able to annotate and identify the acoustic correlates of these two types of stress on the recordings. Traditional prosody features and acoustic features extracted at sub-band level probed to be robust to discriminate among the different types of stress and neutral data.

[1]  Klaus R. Scherer,et al.  Acoustic correlates of task load and stress , 2002, INTERSPEECH.

[2]  W·M·贝尔特曼,et al.  Speech audio process , 2011 .

[3]  Marc Schröder,et al.  The Vocal Effort of Dominance in Scenario Meetings , 2011, INTERSPEECH.

[4]  Neerincx,et al.  Geo-collaboration under stress , 2007 .

[5]  John H. L. Hansen,et al.  Getting started with SUSAS: a speech under simulated and actual stress database , 1997, EUROSPEECH.

[6]  Tim Polzehl,et al.  Detecting real life anger , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  John H. L. Hansen,et al.  Speech Under Stress: Analysis, Modeling and Recognition , 2007, Speaker Classification.

[8]  John H. L. Hansen,et al.  Nonlinear feature based classification of speech under stress , 2001, IEEE Trans. Speech Audio Process..

[9]  Christian A. Müller,et al.  Assessment of a User's Time Pressure and Cognitive Load on the Basis of Features of Speech , 2011, Resource-Adaptive Cognitive Processes.

[10]  John H. L. Hansen,et al.  Detection of speech under physical stress: model development, sensor selection, and feature fusion , 2008, INTERSPEECH.

[11]  Hervé Bourlard,et al.  Multi-resolution spectral entropy feature for robust ASR , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..