Face-speech sensor fusion for non-invasive stress detection

We describe a human-centered multimodal framework for automatically measuring cognitive changes. As a proof-ofconcept, we test our approach on the use case of stress detection. We contribute a method that combines non-intrusive behavioral analysis of facial expressions with speech data, enabling detection without the use of wearable devices. We compare these modalities’ effectiveness against galvanic skin response (GSR) collected simultaneously from the subject group using a wristband sensor. Data was collected with a modified version of the Stroop test, in which subjects perform the test both with and without the inclusion of stressors. Our study attempts to distinguish stressed and unstressed behaviors during constant cognitive load. The best improvement in accuracy over the majority class baseline was 38%, which was only 5% behind the best GSR result on the same data. This suggests that reliable markers of cognitive changes can be captured by behavioral data that are more suitable for group settings than wearable devices, and that combining modalities is beneficial.

[1]  S. Kong,et al.  Block-Based Neural Networks for Personalized ECG Signal Classification , 2007, IEEE Transactions on Neural Networks.

[2]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[3]  Colin M. Macleod Half a century of research on the Stroop effect: an integrative review. , 1991, Psychological bulletin.

[4]  P. Lieberman,et al.  Fundamental frequency of phonation and perceived emotional stress. , 1997, The Journal of the Acoustical Society of America.

[5]  G. Chrousos Stress and disorders of the stress system , 2009, Nature Reviews Endocrinology.

[6]  Stanley Peters,et al.  Detection of time-pressure induced stress in speech via acoustic indicators , 2010, SIGDIAL Conference.

[7]  An evaluation of facial expression displayed by patients with chest pain. , 1999, Heart & lung : the journal of critical care.

[8]  Frédo Durand,et al.  Detecting Pulse from Head Motions in Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Kenneth D. Craig,et al.  Expressing pain: The communication and interpretation of facial pain signals , 1995 .

[10]  C. Kirschbaum,et al.  Salivary cortisol in psychobiological research: an overview. , 1989, Neuropsychobiology.

[11]  Minh Hoai Nguyen,et al.  Personalized Stress Detection from Physiological Measurements , 2010 .

[12]  George R. Doddington,et al.  Recognition of speech under stress and in noise , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[14]  Hao Li,et al.  Realtime performance-based facial animation , 2011, ACM Trans. Graph..

[15]  P. Ekman,et al.  Facial signs of emotional experience. , 1980 .

[16]  John H. L. Hansen,et al.  Speech Under Stress: Analysis, Modeling and Recognition , 2007, Speaker Classification.

[17]  B. McEwen,et al.  Effects of stress throughout the lifespan on the brain, behaviour and cognition , 2009, Nature Reviews Neuroscience.

[18]  N. Minois Longevity and aging: beneficial effects ofexposure to mild stress , 2004, Biogerontology.

[19]  P. Ekman Facial expression and emotion. , 1993, The American psychologist.

[20]  Kimihiro Yamanaka,et al.  Convenient Evaluation of Mental Stress With Pupil Diameter , 2009, International journal of occupational safety and ergonomics : JOSE.

[21]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[22]  K. Sato,et al.  Analysis of psychological stress factors and facial parts effect on intentional facial expressions , 2012, 2012 Proceedings of SICE Annual Conference (SICE).

[23]  Robert E. Babe No Time: Stress and the Crisis of Modern Life , 2006 .

[24]  John H. L. Hansen,et al.  Evaluation of acoustic correlates of speech under stress for robust speech recognition , 1989, Proceedings of the Fifteenth Annual Northeast Bioengineering Conference.

[25]  John H. L. Hansen,et al.  Nonlinear analysis and classification of speech under stressed conditions , 1994 .

[26]  John H. L. Hansen,et al.  Analysis and compensation of stressed and noisy speech with application to robust automatic recognition , 1988 .

[27]  F. Dhabhar,et al.  Stress-induced changes in blood leukocyte distribution. Role of adrenal steroid hormones. , 1996, Journal of immunology.

[28]  Gerhard Tröster,et al.  Discriminating Stress From Cognitive Load Using a Wearable EDA Device , 2010, IEEE Transactions on Information Technology in Biomedicine.