VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments

Public speaking is a widely requested professional skill, and at the same time an activity that causes one of the most common adult phobias (Miller and Stone, 2009). It is also known that the study of stress under laboratory conditions, as it is most commonly done, may provide only limited ecological validity (Wilhelm and Grossman, 2010). Previously, we introduced an inter-disciplinary methodology to enable collecting a large amount of recordings under consistent conditions (Aguiar et al., 2013). This paper introduces the VOCE corpus of speech annotated with stress indicators under naturalistic public speaking (PS) settings. The novelty of this corpus is that the recordings are carried out in objectively stressful PS situations, as recommended in (Zanstra and Johnston, 2011). The current database contains a total of 38 recordings, 13 of which contain full psychological and physiologic annotation. We show that the collected recordings validate the assumptions of the methodology, namely that participants experience stress during the PS events. We describe the various metrics that can be used for physiologic and psychological annotation, and we characterise the sample collected so far, providing evidence that demographics do not affect the relevant psychological or physiologic annotation. The collection activities are on-going, and we expect to increase the number of complete recordings in the corpus to 30 by June 2014.

[1]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[2]  Loïc Kessous,et al.  Multimodal emotion recognition in speech-based interaction using facial expression, body gesture and acoustic analysis , 2010, Journal on Multimodal User Interfaces.

[3]  Klaus R. Scherer,et al.  Vocal communication of emotion: A review of research paradigms , 2003, Speech Commun..

[4]  Pascale Fung,et al.  A Multilingual Natural Stress Emotion Database , 2012, LREC.

[5]  C. Spielberger,et al.  Manual for the State-Trait Anxiety Inventory , 1970 .

[6]  Daniel Gatica-Perez,et al.  StressSense: detecting stress in unconstrained acoustic environments using smartphones , 2012, UbiComp.

[7]  John T. Cacioppo,et al.  Heart Rate Variability: Stress and Psychiatric Conditions , 2007 .

[8]  S. Folkman,et al.  Stress, appraisal, and coping , 1974 .

[9]  Wendi B. Heinzelman,et al.  Speech-based emotion classification using multiclass SVM with hybrid kernel and thresholding fusion , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).

[10]  A. Boquet,et al.  Cluster analyses of cardiovascular responsivity to three laboratory stressors. , 1991, Psychosomatic medicine.

[11]  D. Johnston,et al.  Cardiovascular reactivity in real life settings: Measurement, mechanisms and meaning , 2011, Biological Psychology.

[12]  F. Wilhelm,et al.  Emotions beyond the laboratory: Theoretical fundaments, study design, and analytic strategies for advanced ambulatory assessment , 2010, Biological Psychology.

[13]  John H. L. Hansen,et al.  The Impact of Speech Under `Stress''on Military Speech Technology , 2000 .

[14]  S. Shiffman,et al.  Capturing momentary, self-report data: A proposal for reporting guidelines , 2002, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[15]  Constantine Kotropoulos,et al.  Emotional speech recognition: Resources, features, and methods , 2006, Speech Commun..

[16]  Timothy C. Miller,et al.  Public Speaking Apprehension (PSA), Motivation, and Affect among Accounting Majors: A Proof‐of‐Concept Intervention , 2009 .

[17]  A. Malliani,et al.  Heart rate variability. Standards of measurement, physiological interpretation, and clinical use , 1996 .

[18]  Ana Aguiar,et al.  Speech stress assessment using physiological and psychological measures , 2013, UbiComp.