Random Discriminative Projection Based Feature Selection with Application to Conflict Recognition

Computational paralinguistics deals with underlying meaning of the verbal messages, which is of interest in manifold applications ranging from intelligent tutoring systems to affect sensitive robots. The state-of-the-art pipeline of paralinguistic speech analysis utilizes brute-force feature extraction, and the features need to be tailored according to the relevant task. In this work, we extend a recent discriminative projection based feature selection method using the power of stochasticity to overcome local minima and to reduce the computational complexity. The proposed approach assigns weights both to groups and to features individually in many randomly selected contexts and then combines them for a final ranking. The efficacy of the proposed method is shown in a recent paralinguistic challenge corpus to detect level of conflict in dyadic and group conversations. We advance the state-of-the-art in this corpus using the INTERSPEECH 2013 Challenge protocol.

[1]  Elmar Nöth,et al.  Associating children's non-verbal and verbal behaviour: Body movements, emotions, and laughter in a human-robot interaction , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Albert Ali Salah,et al.  Ensemble CCA for Continuous Emotion Prediction , 2014, AVEC '14.

[3]  Fabio Valente,et al.  Predicting the conflict level in television political debates: an approach based on crowdsourcing, nonverbal communication and gaussian processes , 2012, ACM Multimedia.

[4]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[5]  Björn Schuller,et al.  Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[6]  Albert Ali Salah,et al.  Eyes Whisper Depression: A CCA based Multimodal Approach , 2014, ACM Multimedia.

[7]  Fabio Valente,et al.  Collecting data for socially intelligent surveillance and monitoring approaches: The case of conflict in competitive conversations , 2012, 2012 5th International Symposium on Communications, Control and Signal Processing.

[8]  Björn W. Schuller,et al.  CCA based feature selection with application to continuous depression recognition from acoustic speech features , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Fabio Valente,et al.  Predicting Continuous Conflict Perceptionwith Bayesian Gaussian Processes , 2014, IEEE Transactions on Affective Computing.

[10]  Albert Ali Salah,et al.  Canonical correlation analysis and local fisher discriminant analysis based multi-view acoustic feature reduction for physical load prediction , 2014, INTERSPEECH.

[11]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[12]  Antonio Barrientos Cruz,et al.  Emotion recognition in non-structured utterances for human-robot interaction , 2005, ROMAN 2005. IEEE International Workshop on Robot and Human Interactive Communication, 2005..

[13]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[14]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[15]  Johanna D. Moore,et al.  Word-Level Emotion Recognition Using High-Level Features , 2014, CICLing.

[16]  Björn W. Schuller,et al.  Paralinguistics in speech and language - State-of-the-art and the challenge , 2013, Comput. Speech Lang..

[17]  Fabio Valente,et al.  The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism , 2013, INTERSPEECH.

[18]  J. Schoentgen Vocal cues of disordered voices: an overview , 2006 .

[19]  T. Paus,et al.  Functional abnormalities in the primary orofacial sensorimotor cortex during speech in Parkinson's disease , 2007, Movement disorders : official journal of the Movement Disorder Society.

[20]  Gerasimos Potamianos,et al.  A hierarchical approach with feature selection for emotion recognition from speech , 2012, LREC.

[21]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[22]  Luis Villaseñor Pineda,et al.  Bilingual acoustic feature selection for emotion estimation using a 3D continuous model , 2011, Face and Gesture 2011.

[23]  Björn W. Schuller,et al.  AVEC 2014: 3D Dimensional Affect and Depression Recognition Challenge , 2014, AVEC '14.

[24]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[25]  Björn W. Schuller,et al.  Voice and Speech Analysis in Search of States and Traits , 2011, Computer Analysis of Human Behavior.

[26]  Björn W. Schuller,et al.  Affect recognition in real-life acoustic conditions - a new perspective on feature selection , 2013, INTERSPEECH.

[27]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[28]  Masashi Sugiyama,et al.  Local Fisher discriminant analysis for supervised dimensionality reduction , 2006, ICML.

[29]  Björn W. Schuller,et al.  AVEC 2013: the continuous audio/visual emotion and depression recognition challenge , 2013, AVEC@ACM Multimedia.

[30]  Jeff A. Bilmes,et al.  Classification of developmental disorders from speech signals using submodular feature selection , 2013, INTERSPEECH.

[31]  Andrew Rosenberg,et al.  Let me finish: automatic conflict detection using speaker overlap , 2013, INTERSPEECH.

[32]  Okko Johannes Räsänen,et al.  Random subset feature selection in automatic recognition of developmental disorders, affective states, and level of conflict from speech , 2013, INTERSPEECH.

[33]  Nasrollah Moghaddam Charkari,et al.  Audiovisual emotion recognition using ANOVA feature selection method and multi-classifier neural networks , 2014, Neural Computing and Applications.