Emirati-Accented Emotion Verification based on HMM3s, HMM2s, and HMM1s

The proposed research is dedicated to verifying the claimed emotion of speaker-independent and text-independent formed on three dissimilar classifiers. The HMM3 short for Third-Order Hidden Markov Model, HMM2 short for Second-Order Hidden Markov Model, and HMM1 short for First-Order Hidden Markov Model are the three classifiers utilized in this study. Our work has been evaluated on our collected Emirati-accented speech corpus which entails 50 speakers of Emirati origin (25 female and 25 male) uttering sentences in six emotions by means of the extracted features by Mel-Frequency Cepstral Coefficients (MFCCs). Our outcomes prove that HMM3 is superior to each of HMM1 and HMM2 to authenticate the claimed emotion. The achieved results formed on HMM3 are very similar to the outcomes attained in the subjective valuation by Arab listeners.

[1]  Biing-Hwang Juang,et al.  Hidden Markov Models for Speech Recognition , 1991 .

[2]  Khaled Shaalan,et al.  Speech Recognition Using Deep Neural Networks: A Systematic Review , 2019, IEEE Access.

[3]  Valery A. Petrushin,et al.  Emotion recognition in speech signal: experimental study, development, and application , 2000, INTERSPEECH.

[4]  Douglas A. Reynolds,et al.  Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[5]  K. YogeshC.,et al.  A new hybrid PSO assisted biogeography-based optimization for emotion and stress recognition from speech signal , 2017, Expert Syst. Appl..

[6]  Ismail Shahin Employing Second-Order Circular Suprasegmental Hidden Markov Models to Enhance Speaker Identification Performance in Shouted Talking Environments , 2010, EURASIP J. Audio Speech Music. Process..

[7]  Ruili Wang,et al.  Ensemble methods for spoken emotion recognition in call-centres , 2007, Speech Commun..

[8]  Abdelaziz Kriouile,et al.  Automatic word recognition based on second-order hidden Markov models , 1994, IEEE Trans. Speech Audio Process..

[9]  Analysis and investigation of emotion identification in biased emotional talking environments , 2011 .

[10]  Torky I. Sultan,et al.  A Computational Approach for Analyzing and Detecting Emotions in Arabic Text , 2022 .

[11]  Ismail Shahin,et al.  Novel third-order hidden Markov models for speaker identification in shouted talking environments , 2014, Eng. Appl. Artif. Intell..

[12]  William M. Campbell,et al.  Support vector machines for speaker verification and identification , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).

[13]  Haizhou Li,et al.  An overview of text-independent speaker recognition: From features to supervectors , 2010, Speech Commun..

[14]  Ismail Shahin,et al.  Enhancing speaker identification performance under the shouted talking condition using second-order circular hidden Markov models , 2006, Speech Commun..

[15]  Ismail Shahin,et al.  Employing both gender and emotion cues to enhance speaker identification performance in emotional talking environments , 2013, International Journal of Speech Technology.

[16]  Ismail Shahin,et al.  Emotion Recognition Using Hybrid Gaussian Mixture Model and Deep Neural Network , 2019, IEEE Access.

[17]  P. Alam ‘G’ , 2021, Composites Engineering: An A–Z Guide.

[18]  Ismail Shahin,et al.  Studying and enhancing talking condition recognition in stressful and emotional talking environments based on HMMs, CHMM2s and SPHMMs , 2012, Journal on Multimodal User Interfaces.

[19]  Ismail Shahin Speaking Style Authentication Using Suprasegmental Hidden Markov Models , 2017, ArXiv.

[20]  Ismail Shahin,et al.  Speaker identification investigation and analysis in unbiased and biased emotional talking environments , 2012, International Journal of Speech Technology.

[21]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[22]  Ziad Osman,et al.  Ensemble Models for Enhancement of an Arabic Speech Emotion Recognition System , 2019 .

[23]  George N. Votsis,et al.  Emotion recognition in human-computer interaction , 2001, IEEE Signal Process. Mag..

[24]  Ismail Shahin,et al.  Speaker Identification in a Shouted Talking Environment Based on Novel Third-Order Circular Suprasegmental Hidden Markov Models , 2015, Circuits, Systems, and Signal Processing.

[25]  Ismail Shahin,et al.  Talking condition recognition in stressful and emotional talking environments based on CSPHMM2s , 2015, Int. J. Speech Technol..

[26]  Ismail Shahin,et al.  Employing Emotion Cues to Verify Speakers in Emotional Talking Environments , 2017, J. Intell. Syst..

[27]  Shahin Ismail,et al.  Utilizing Third-Order Hidden Markov Models for Emotional Talking Condition Recognition , 2018, 2018 14th IEEE International Conference on Signal Processing (ICSP).