Investigating Word Affect Features and Fusion of Probabilistic Predictions Incorporating Uncertainty in AVEC 2017

Predicting emotion intensity and severity of depression are both challenging and important problems within the broader field of affective computing. As part of the AVEC 2017, we developed a number of systems to accomplish these tasks. In particular, word affect features, which derive human affect ratings (e.g. arousal and valence) from transcripts, were investigated for predicting depression severity and liking, showing great promise. A simple system based on the word affect features achieved an RMSE of 6.02 on the test set, yielding a relative improvement of 13.6% over the baseline. For the emotion prediction sub-challenge, we investigated multimodal fusion, which incorporated a measure of uncertainty associated with each prediction within an Output-Associative fusion framework for arousal and valence prediction, whilst liking prediction systems mainly focused on text-based features. Our best emotion prediction systems provided significant relative improvements over the baseline on the test set of 39.5%, 17.6%, and 29.3% for arousal, valence, and liking. Of particular note is that consistent improvements were observed when incorporating prediction uncertainty across various system configurations for predicting arousal and valence, suggesting the importance of taking into consideration prediction uncertainty for fusion and more broadly the advantages of probabilistic predictions.

[1]  Klaus R. Scherer,et al.  Antecedents of and Reactions to Emotions in the United States and Japan , 1988 .

[2]  Thomas E Joiner,et al.  Do major depressive disorder and dysthymic disorder confer differential risk for suicide? , 2009, Journal of affective disorders.

[3]  Shrikanth S. Narayanan,et al.  Support Vector Regression for Automatic Recognition of Spontaneous Emotions in Speech , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[4]  Roland Göcke,et al.  Elicitation Design for Acoustic Depression Classification: An Investigation of Articulation Effort, Linguistic Complexity, and Word Affect , 2017, INTERSPEECH.

[5]  John Kane,et al.  COVAREP — A collaborative voice analysis repository for speech technologies , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  J. Pennebaker,et al.  Language use of depressed and depression-vulnerable college students , 2004 .

[7]  Erik Cambria,et al.  SenticNet: A Publicly Available Semantic Resource for Opinion Mining , 2010, AAAI Fall Symposium: Commonsense Knowledge.

[8]  Birgitta Ojamaa,et al.  Sentiment analysis on conversational texts , 2015, NODALIDA.

[9]  Vidhyasaharan Sethu,et al.  Analysis of acoustic space variability in speech affected by depression , 2015, Speech Commun..

[10]  Peter Kulchyski and , 2015 .

[11]  T. Pozzo,et al.  Psychomotor Retardation in Depression: A Systematic Review of Diagnostic, Pathophysiologic, and Therapeutic Implications , 2013, BioMed research international.

[12]  N. Freedman,et al.  The language of depression. , 1981, Bulletin of the Menninger Clinic.

[13]  Athanasios Katsamanis,et al.  Tracking changes in continuous emotion states using body language and prosodic cues , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  J. Sayers The world health report 2001 - Mental health: new understanding, new hope , 2001 .

[15]  David J. Fleet,et al.  Erratum: "Gaussian process dynamical models for human motion" (IEEE Transactions on Pattern analysis and Machine Intelligenc (292)) , 2008 .

[16]  David DeVault,et al.  The Distress Analysis Interview Corpus of human and computer interviews , 2014, LREC.

[17]  Laurence Devillers,et al.  Multimodal Sentiment Analysis in the Wild: Ethical considerations on Data Collection, Annotation, and Exploitation , 2016 .

[18]  Björn W. Schuller,et al.  The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing , 2016, IEEE Transactions on Affective Computing.

[19]  S. Flint,et al.  Sequence stratigraphy of Cretaceous shallow marine sandstones, Book Cliffs, Utah: application to reservoir modelling , 1993 .

[20]  Carlos Busso,et al.  Interpreting ambiguous emotional expressions , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[21]  Danielle S McNamara,et al.  Sentiment Analysis and Social Cognition Engine (SEANCE): An automatic tool for sentiment, social cognition, and social-order analysis , 2017, Behavior research methods.

[22]  J. Darby,et al.  Speech and voice parameters of depression: a pilot study. , 1984, Journal of communication disorders.

[23]  Byron Reeves,et al.  The effects of animated characters on anxiety, task performance, and evaluations of user interfaces , 2000, CHI.

[24]  Thomas F. Quatieri,et al.  Vocal biomarkers of depression based on motor incoordination , 2013, AVEC@ACM Multimedia.

[25]  Michele R. Dudash,et al.  Predicting the Probability of Outbreeding Depression , 2011, Conservation biology : the journal of the Society for Conservation Biology.

[26]  Robert Philip Weber,et al.  Dynamics of culture , 1986 .

[27]  Björn W. Schuller,et al.  LSTM-Modeling of continuous emotions in an audiovisual affect recognition framework , 2013, Image Vis. Comput..

[28]  M. Bradley,et al.  Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings , 1999 .

[29]  Zhaocheng Huang,et al.  A PLLR and multi-stage Staircase Regression framework for speech-based emotion prediction , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[30]  Ting Dang,et al.  An Investigation of Emotion Prediction Uncertainty Using Gaussian Mixture Regression , 2017, INTERSPEECH.

[31]  Carl E. Rasmussen,et al.  Additive Gaussian Processes , 2011, NIPS.

[32]  Scott A. Crossley,et al.  Automatically Assessing Lexical Sophistication: Indices, Tools, Findings, and Application , 2015 .

[33]  Lenore J Launer,et al.  Intensive vs Standard Blood Pressure Control and Cardiovascular Disease Outcomes in Adults Aged ≥75 Years: A Randomized Clinical Trial. , 2016, JAMA.

[34]  Ting Dang,et al.  Staircase Regression in OA RVM, Data Selection and Gender Dependency in AVEC 2016 , 2016, AVEC@ACM Multimedia.

[35]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[36]  Balthasar Bickel,et al.  Language evolution: syntax before phonology? , 2014, Proceedings of the Royal Society B: Biological Sciences.

[37]  David J. Fleet,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Gaussian Process Dynamical Model , 2007 .

[38]  Fabien Ringeval,et al.  AVEC 2017: Real-life Depression, and Affect Recognition Workshop and Challenge , 2017, AVEC@ACM Multimedia.

[39]  T. Strine,et al.  The PHQ-8 as a measure of current depression in the general population. , 2009, Journal of affective disorders.

[40]  Katharina Eggensperger,et al.  Towards an Empirical Foundation for Assessing Bayesian Optimization of Hyperparameters , 2013 .

[41]  Ting Dang,et al.  An Investigation of Annotation Delay Compensation and Output-Associative Fusion for Multimodal Continuous Emotion Prediction , 2015, AVEC@ACM Multimedia.

[42]  Laura K. Allen,et al.  Analyzing Discourse Processing Using a Simple Natural Language Processing Tool , 2014 .

[43]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[44]  Jean-Claude Martin,et al.  Human computer interfaces for autism: assessing the influence of task assignment and output modalities , 2005, CHI Extended Abstracts.

[45]  Fabien Ringeval,et al.  At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech , 2016, INTERSPEECH.

[46]  A. Feinstein,et al.  The link between multiple sclerosis and depression , 2014, Nature Reviews Neurology.

[47]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[48]  B. Pfohl,et al.  Linguistic analysis of speech in affective disorders. , 1976, Archives of general psychiatry.

[49]  W. Iacono,et al.  Risk for recurrence in depression. , 2007, Clinical psychology review.

[50]  Hatice Gunes,et al.  Output-associative RVM regression for dimensional and continuous emotion prediction , 2011, Face and Gesture 2011.