Research on Speech Emotion Recognition Technology based on Deep and Shallow Neural Network

Speech emotion recognition has become a hot topic in the field of human-computer interaction. In order to improve the accuracy of emotion recognition, this paper proposes a new speech emotion recognition technology based on the combination of deep and shallow neural networks. First, the speech signal is preprocessed, then the parallel training sample set is established, and the Deep Belief Network (DBN) is used to automatically extract and recognize the speech emotion features. Finally, the shallow neural network is used to obtain the final recognition results. In order to evaluate the quality of the new method, we compared three systems to identify five emotions, and the experimental results show that the proposed method can effectively improve the accuracy of emotion recognition.

[1]  Ping Liu,et al.  Facial Expression Recognition via a Boosted Deep Belief Network , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Wang Jian Speech Emotion Recognition Based on Genetic Wavelet Neural Network , 2013 .

[3]  Pierre Dumouchel,et al.  Anchor Models for Emotion Recognition from Speech , 2013, IEEE Transactions on Affective Computing.

[4]  Carlos Busso,et al.  Domain Adversarial for Acoustic Emotion Recognition , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[5]  Song Peng,et al.  Joint subspace learning and feature selection method for speech emotion recognition , 2018 .

[6]  Clifford Nass,et al.  The media equation - how people treat computers, television, and new media like real people and places , 1996 .

[7]  Zdravko Kacic,et al.  Context-Independent Multilingual Emotion Recognition from Speech Signals , 2003, Int. J. Speech Technol..

[8]  Ryohei Nakatsu,et al.  Emotion Recognition in Speech Using Neural Networks , 2000, Neural Computing & Applications.

[9]  Chen Ming-yi Study on emotion feature analysis and recognition in speech signal: an overview , 2007 .

[10]  Ning An,et al.  Speech Emotion Recognition Using Fourier Parameters , 2015, IEEE Transactions on Affective Computing.

[11]  C. P. Latha,et al.  A Review on Deep Learning Algorithms for Speech and Facial Emotion Recognition , 2016, APTIKOM Journal on Computer Science and Information Technologies.

[12]  Wenming Zheng,et al.  A Novel Speech Emotion Recognition Method via Incomplete Sparse Least Square Regression , 2014, IEEE Signal Processing Letters.