Emotion Recognition Using Speaker Cues

The focus of the current research is on emotion identification of a speaker using his/her cues. In this work, emotion identification depends on a two-stage framework. The first stage recognizes the speaker whose emotion is undetermined, while the second stage recognizes the unidentified emotion which was spoken by the speaker whose identity was recognized in the preceding stage. Our proposed architecture has been assessed on an Arabic Emirati-accented speech corpus expressed by fifteen Emirati speakers for every gender. As a classifier, Hidden Markov Model is exploited in this study. Our results show that the proposed two-stage framework is superior to the one-stage framework and the “state-of-the-art classifiers such as Gaussian Mixture Model, Support Vector Machine, and Vector Quantization”.

[1]  Ismail Shahin,et al.  Employing Emotion Cues to Verify Speakers in Emotional Talking Environments , 2017, J. Intell. Syst..

[2]  Haizhou Li,et al.  An overview of text-independent speaker recognition: From features to supervectors , 2010, Speech Commun..

[3]  Ismail Shahin Employing Second-Order Circular Suprasegmental Hidden Markov Models to Enhance Speaker Identification Performance in Shouted Talking Environments , 2010, EURASIP J. Audio Speech Music. Process..

[4]  Ismail Shahin Speaking Style Authentication Using Suprasegmental Hidden Markov Models , 2017, ArXiv.

[5]  Ismail Shahin,et al.  Employing both gender and emotion cues to enhance speaker identification performance in emotional talking environments , 2013, International Journal of Speech Technology.

[6]  Ziad Osman,et al.  Ensemble Models for Enhancement of an Arabic Speech Emotion Recognition System , 2019 .

[7]  Ismail Shahin,et al.  Speaker Identification in a Shouted Talking Environment Based on Novel Third-Order Circular Suprasegmental Hidden Markov Models , 2015, Circuits, Systems, and Signal Processing.

[8]  Valery A. Petrushin,et al.  Emotion recognition in speech signal: experimental study, development, and application , 2000, INTERSPEECH.

[9]  Ismail Shahin,et al.  Talking condition recognition in stressful and emotional talking environments based on CSPHMM2s , 2015, Int. J. Speech Technol..

[10]  Ismail Shahin,et al.  Emotion Recognition Using Hybrid Gaussian Mixture Model and Deep Neural Network , 2019, IEEE Access.

[11]  Ismail Shahin,et al.  Studying and enhancing talking condition recognition in stressful and emotional talking environments based on HMMs, CHMM2s and SPHMMs , 2012, Journal on Multimodal User Interfaces.

[12]  George N. Votsis,et al.  Emotion recognition in human-computer interaction , 2001, IEEE Signal Process. Mag..

[13]  Analysis and investigation of emotion identification in biased emotional talking environments , 2011 .

[14]  Douglas A. Reynolds,et al.  Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[15]  Khaled Shaalan,et al.  Speech Recognition Using Deep Neural Networks: A Systematic Review , 2019, IEEE Access.

[16]  William M. Campbell,et al.  Support vector machines for speaker verification and identification , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).

[17]  Ruili Wang,et al.  Ensemble methods for spoken emotion recognition in call-centres , 2007, Speech Commun..

[18]  Torky I. Sultan,et al.  A Computational Approach for Analyzing and Detecting Emotions in Arabic Text , 2022 .

[19]  Ismail Shahin,et al.  Speaker identification investigation and analysis in unbiased and biased emotional talking environments , 2012, International Journal of Speech Technology.

[20]  Ismail Shahin Emotion Recognition based on Third-Order Circular Suprasegmental Hidden Markov Model , 2019, 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT).

[21]  Shahin Ismail,et al.  Utilizing Third-Order Hidden Markov Models for Emotional Talking Condition Recognition , 2018, 2018 14th IEEE International Conference on Signal Processing (ICSP).