Speech Emotion Recognition Based on Sparse Representation

Speech emotion recognition is deemed to be a meaningful and intractable issue among a number of do- mains comprising sentiment analysis, computer science, pedagogy, and so on. In this study, we investigate speech emotion recognition based on sparse partial least squares regression (SPLSR) approach in depth. We make use of the sparse partial least squares regression method to implement the feature selection and dimensionality reduction on the whole acquired speech emotion features. By the means of exploiting the SPLSR method, the component parts of those redundant and meaningless speech emotion features are lessened to zero while those serviceable and informative speech emotion features are maintained and selected to the following classification step. A number of tests on Berlin database reveal that the recogni- tion rate of the SPLSR method can reach up to 79.23% and is superior to other compared dimensionality reduction methods.

[1]  Philippe Besse,et al.  Sparse canonical methods for biological data integration: application to a cross-platform study , 2009, BMC Bioinformatics.

[2]  Jiawei Han,et al.  Spectral Regression: A Unified Approach for Sparse Subspace Learning , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[3]  Wee Ser,et al.  Speech Emotion Recognition Using Canonical Correlation Analysis and Probabilistic Neural Network , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[4]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[5]  Jian Yang,et al.  Sparse two-dimensional local discriminant projections for feature extraction , 2011, Neurocomputing.

[6]  Anthony Randal McIntosh,et al.  Partial Least Squares (PLS) methods for neuroimaging: A tutorial and review , 2011, NeuroImage.

[7]  Wee Ser,et al.  A Hybrid PNN-GMM classification scheme for speech emotion recognition , 2008, 2008 19th International Conference on Pattern Recognition.

[8]  Lijiang Chen,et al.  Speech emotion recognition: Features and classification models , 2012, Digit. Signal Process..

[9]  Larry S. Davis,et al.  Vehicle Detection Using Partial Least Squares , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Astrid Paeschke,et al.  A database of German emotional speech , 2005, INTERSPEECH.

[11]  D. Tritchler,et al.  Sparse Canonical Correlation Analysis with Application to Genomic Data Integration , 2009, Statistical applications in genetics and molecular biology.

[12]  Ja-Chen Lin,et al.  A new LDA-based face recognition system which can solve the small sample size problem , 1998, Pattern Recognit..

[13]  S. Keleş,et al.  Sparse partial least squares regression for simultaneous dimension reduction and variable selection , 2010, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[14]  Wenming Zheng,et al.  Generalized Maximal Margin Discriminant Analysis for Speech Emotion Recognition , 2013 .

[15]  Xiaoyan Zhou,et al.  Speech emotion recognition based on kernel reduced-rank regression , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[16]  V. Kshirsagar,et al.  Face recognition using Eigenfaces , 2011, 2011 3rd International Conference on Computer Research and Development.

[17]  Philippe Besse,et al.  Statistical Applications in Genetics and Molecular Biology A Sparse PLS for Variable Selection when Integrating Omics Data , 2011 .

[18]  D Gianola,et al.  Dimension reduction and variable selection for genomic selection: application to predicting milk yield in Holsteins. , 2011, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[19]  Roman Rosipal,et al.  Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.

[20]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[21]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[22]  Jianhua Z. Huang,et al.  Sparse principal component analysis via regularized low rank matrix approximation , 2008 .

[23]  Sunduz Keles,et al.  Sparse Partial Least Squares Classification for High Dimensional Data , 2010, Statistical applications in genetics and molecular biology.

[24]  Giovanni Montana,et al.  Sparse partial least squares regression for on‐line variable selection with multivariate data streams , 2010, Stat. Anal. Data Min..

[25]  Kim-Anh Lê Cao,et al.  Integration and variable selection of ‘omics’ data sets with PLS: a survey , 2011 .

[26]  Philippe Besse,et al.  Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems , 2011, BMC Bioinformatics.

[27]  Fakhri Karray,et al.  Survey on speech emotion recognition: Features, classification schemes, and databases , 2011, Pattern Recognit..

[28]  Rongshan Yu,et al.  Detecting Intelligibility by Linear Dimensionality Reduction and Normalized Voice Quality Hierarchical Features , 2012, INTERSPEECH.

[29]  Tiago H. Falk,et al.  Automatic speech emotion recognition using modulation spectral features , 2011, Speech Commun..

[30]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[31]  Xiaoyan Zhou,et al.  Improving CCA via spectral components selection for facial expression recognition , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[32]  Jangsun Baek,et al.  Face recognition using partial least squares components , 2004, Pattern Recognit..

[33]  Larry S. Davis,et al.  Human detection using partial least squares analysis , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[34]  R. Manne Analysis of two partial-least-squares algorithms for multivariate calibration , 1987 .

[35]  Jianhua Z. Huang,et al.  Sparse Linear Discriminant Analysis with Applications to High Dimensional Low Sample Size Data , 2009 .