Cross-validation of bimodal health-related stress assessment

This study explores the feasibility of objective and ubiquitous stress assessment. 25 post-traumatic stress disorder patients participated in a controlled storytelling (ST) study and an ecologically valid reliving (RL) study. The two studies were meant to represent an early and a late therapy session, and each consisted of a "happy" and a "stress triggering" part. Two instruments were chosen to assess the stress level of the patients at various point in time during therapy: (i) speech, used as an objective and ubiquitous stress indicator and (ii) the subjective unit of distress (SUD), a clinically validated Likert scale. In total, 13 statistical parameters were derived from each of five speech features: amplitude, zero-crossings, power, high-frequency power, and pitch. To model the emotional state of the patients, 28 parameters were selected from this set by means of a linear regression model and, subsequently, compressed into 11 principal components. The SUD and speech model were cross-validated, using 3 machine learning algorithms. Between 90% (2 SUD levels) and 39% (10 SUD levels) correct classification was achieved. The two sessions could be discriminated in 89% (for ST) and 77% (for RL) of the cases. This report fills a gap between laboratory and clinical studies, and its results emphasize the usefulness of Computer Aided Diagnostics (CAD) for mental health care.

[1]  Patrick W. Corrigan,et al.  A stress-coping model of mental illness stigma: II. Emotional stress responses, coping behavior and outcome , 2009, Schizophrenia Research.

[2]  J. Wolpe,et al.  Psychotherapy by reciprocal inhibition , 1958, Conditional reflex.

[3]  Richard G. Lyons,et al.  Understanding Digital Signal Processing , 1996 .

[4]  R. Kessler,et al.  The effects of stressful life events on depression. , 1997, Annual review of psychology.

[5]  Constantine Kotropoulos,et al.  Emotional speech recognition: Resources, features, and methods , 2006, Speech Commun..

[6]  Iain R. Murray,et al.  Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion. , 1993, The Journal of the Acoustical Society of America.

[7]  P. Boersma ACCURATE SHORT-TERM ANALYSIS OF THE FUNDAMENTAL FREQUENCY AND THE HARMONICS-TO-NOISE RATIO OF A SAMPLED SOUND , 1993 .

[8]  R. Blonk,et al.  The Depression Anxiety Stress Scales (DASS): detecting anxiety disorder and depression in employees absent from work because of mental health problems , 2003, Occupational and environmental medicine.

[9]  Nilanjan Sarkar,et al.  Online stress detection using psychophysiological signals for implicit human-robot cooperation , 2002, Robotica.

[10]  G Tröster,et al.  Pervasive Healthcare , 2009, Methods of Information in Medicine.

[11]  Edmund Husserl,et al.  Besprechung von: A. Marty, Untersuchungen zur Grundlegung der allgemeinen Grammatik und Sprachphilosophie, I. Band, Halle 1908 , 1979 .

[12]  Tara E. Galovski,et al.  Gender differences in recovery from posttraumatic stress disorder: A critical review , 2010 .

[13]  Klaus R. Scherer,et al.  Vocal communication of emotion: A review of research paradigms , 2003, Speech Commun..

[14]  B. Kedem,et al.  Spectral analysis and discrimination by zero-crossings , 1986, Proceedings of the IEEE.

[15]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[16]  Richard G. Lyons,et al.  Understanding Digital Signal Processing (2nd Edition) , 2004 .

[17]  G. Riva,et al.  Biofeedback, virtual reality and mobile phones in the treatment of generalized anxiety disorder (gad): A phase-2 controlled clinical trial , 2009 .

[18]  Egon L. van den Broek,et al.  Ubiquitous emotion-aware computing , 2011, Personal and Ubiquitous Computing.

[19]  E. Vesterinen,et al.  Affective Computing , 2009, Encyclopedia of Biometrics.

[20]  Jan M. Van Campenhout,et al.  On the Possible Orderings in the Measurement Selection Problem , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  E. Frommer Speech Evaluation in Psychiatry , 1982 .

[22]  Ruili Wang,et al.  Ensemble methods for spoken emotion recognition in call-centres , 2007, Speech Commun..

[23]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[24]  Léon J. M. Rothkrantz,et al.  Voice Stress Analysis , 2004, TSD.

[25]  David Haussler,et al.  What Size Net Gives Valid Generalization? , 1989, Neural Computation.

[26]  D L Roth,et al.  The effects of aerobic exercise on cardiovascular, facial EMG, and self‐report responses to emotional imagery. , 1992, Psychosomatic medicine.

[27]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[28]  Fakhri Karray,et al.  Survey on speech emotion recognition: Features, classification schemes, and databases , 2011, Pattern Recognit..

[29]  Patrick W. Corrigan,et al.  A stress-coping model of mental illness stigma: I. Predictors of cognitive stress appraisal , 2009, Schizophrenia Research.

[30]  A. Przeworski,et al.  A review of technology-assisted self-help and minimal contact therapies for anxiety and depression: is human contact necessary for therapeutic efficacy? , 2011, Clinical psychology review.

[31]  Joyce H. D. M. Westerink,et al.  Sensing Emotions: The impact of context on experience measurements (Philips Research Book Series) , 2011 .

[32]  S. Rice Mathematical analysis of random noise , 1944 .

[33]  J. Westerink,et al.  Considerations for emotion-aware consumer products. , 2009, Applied ergonomics.

[34]  Mohan M. Trivedi,et al.  Speech Emotion Analysis: Exploring the Role of Context , 2010, IEEE Transactions on Multimedia.

[35]  K. Scherer,et al.  Acoustic profiles in vocal emotion expression. , 1996, Journal of personality and social psychology.

[36]  Hajime Kobayashi,et al.  Weighted autocorrelation for pitch extraction of noisy speech , 2001, IEEE Trans. Speech Audio Process..

[37]  Thomas R. Kosten,et al.  Sustained urinary norepinephrine and epinephrine elevation in post-traumatic stress disorder , 1987, Psychoneuroendocrinology.

[38]  Kim E. A. Silverman,et al.  Evidence for the independent function of intonation contour type, voice quality, and F0 range in signaling speaker affect , 1985 .

[39]  Jennifer Healey,et al.  Detecting stress during real-world driving tasks using physiological sensors , 2005, IEEE Transactions on Intelligent Transportation Systems.

[40]  Peter L. Bartlett,et al.  The Sample Complexity of Pattern Classification with Neural Networks: The Size of the Weights is More Important than the Size of the Network , 1998, IEEE Trans. Inf. Theory.

[41]  Julio Sánchez-Meca,et al.  Psychological treatment of panic disorder with or without agoraphobia: a meta-analysis. , 2010, Clinical psychology review.

[42]  J. Westerink,et al.  Emotional and psychophysiological responses to tempo, mode, and percussiveness , 2011 .

[43]  Esam N. Khalil,et al.  Communicating affect in news stories: The case of the lead sentence , 2006 .

[44]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[45]  J. Darby Speech evaluation in psychiatry , 1981 .

[46]  Oleksandr Makeyev,et al.  Neural network with ensembles , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[47]  George N. Votsis,et al.  Emotion recognition in human-computer interaction , 2001, IEEE Signal Process. Mag..

[48]  Egon L. van den Broek,et al.  Tune in to your emotions: a robust personalized affective music player , 2012, User Modeling and User-Adapted Interaction.

[49]  Egon L. van den Broek,et al.  Unobtrusive Sensing of Emotions (USE) , 2009, J. Ambient Intell. Smart Environ..

[50]  Björn W. Schuller,et al.  Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge , 2011, Speech Commun..

[51]  A. Craig How do you feel? Interoception: the sense of the physiological condition of the body , 2002, Nature Reviews Neuroscience.

[52]  Rod McCall,et al.  Infinite Reality: Avatars, Eternal Life, New Worlds, and the Dawn of the Virtual Revolution , 2011, PRESENCE: Teleoperators and Virtual Environments.

[53]  Rosalind W. Picard Affective computing: (526112012-054) , 1997 .

[54]  Donald M Hilty,et al.  Clinical and Educational Telepsychiatry Applications: A Review , 2004, Canadian journal of psychiatry. Revue canadienne de psychiatrie.

[55]  R. Likert “Technique for the Measurement of Attitudes, A” , 2022, The SAGE Encyclopedia of Research Design.

[56]  O. Mayora,et al.  Pervasive or Ubiquitous Healthcare? , 2010, Methods of Information in Medicine.

[57]  D. Kibler,et al.  Instance-based learning algorithms , 2004, Machine Learning.

[58]  K. Domschke,et al.  Interoceptive sensitivity in anxiety and anxiety disorders: an overview and integration of neurobiological findings. , 2010, Clinical psychology review.

[59]  E. Foa,et al.  Sex differences in trauma and posttraumatic stress disorder: a quantitative review of 25 years of research. , 2006, Psychological bulletin.

[60]  John Krumm,et al.  Ubiquitous Computing Fundamentals , 2009 .

[61]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[62]  Tiago H. Falk,et al.  Automatic speech emotion recognition using modulation spectral features , 2011, Speech Commun..

[63]  R. Chao,et al.  DSM‐IV‐TR: Diagnostic and Statistical Manual of Mental Disorders , 2013 .