A Hybrid Ensemble Deep Learning Approach for Emotion Classification

Speech processing, the field of analysing input speech signals and methods of processing them has emerged in the recent days. Additionally, the development of a speech processing system involves several components in the design phase with probabilistic approximations for enhanced audio sampling and de-noising. In this work, we focus into use of Gaussian random variables while modelling and filtering noise that gets added after being passed through an additive noise channel in a communication system, and the applications of Hidden Markov models. Moreover, we apply deep learning methods for emotion classification via a robust and accurate ensemble learning scheme that is applied to a joint deep network which incorporates audiovisual inputs and generates the emotion prediction effectively reaching satisfactory accuracy.

[1]  S. Sioutas,et al.  Peer to Peer Federated Learning: Towards Decentralized Machine Learning on Edge Devices , 2022, 2022 7th South-East Europe Design Automation, Computer Engineering, Computer Networks and Social Media Conference (SEEDA-CECNSM).

[2]  S. Sioutas,et al.  Distributed Gibbs Sampling and LDA Modelling for Large Scale Big Data Management on PySpark , 2022, 2022 7th South-East Europe Design Automation, Computer Engineering, Computer Networks and Social Media Conference (SEEDA-CECNSM).

[3]  N. Ramzan,et al.  On the benefits of using Hidden Markov Models to predict emotions , 2022, UMAP.

[4]  Aristeidis Karras,et al.  Integrating User and Item Reviews in Deep Cooperative Neural Networks for Movie Recommendation , 2022, 2205.06296.

[5]  Christos Karras,et al.  DBSOP: An Efficient Heuristic for Speedy MCMC Sampling on Polytopes , 2022, ArXiv.

[6]  John A. Stankovic,et al.  Robustness to noise for speech emotion classification using CNNs and attention mechanisms , 2020, Smart Health.

[7]  Z. Tan,et al.  An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8]  Ruiyu Liang,et al.  Speech Emotion Classification Using Attention-Based LSTM , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[9]  Tan Lee,et al.  Revisiting Hidden Markov Models for Speech Emotion Recognition , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Rahul Chauhan,et al.  Convolutional Neural Network (CNN) for Image Detection and Recognition , 2018, 2018 First International Conference on Secure Cyber Computing and Communication (ICSCCC).

[11]  Marcus Liwicki,et al.  DeXpression: Deep Convolutional Neural Network for Expression Recognition , 2015, ArXiv.

[12]  Jian Sun,et al.  Accelerating Very Deep Convolutional Networks for Classification and Detection , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Emily Mower Provost,et al.  Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[14]  S. Sioutas,et al.  An Overview of MCMC Methods: From Theory to Applications , 2022, AIAI Workshops.

[15]  S. Sioutas,et al.  Maximum Likelihood Estimators on MCMC Sampling Algorithms for Decision Making , 2022, AIAI Workshops.

[16]  S. Sioutas,et al.  Expanding Queries with Maximum Likelihood Estimators and Language Models , 2022, ICR.

[17]  Abhishek Verma,et al.  Compressed residual-VGG16 CNN model for big data places image recognition , 2018, 2018 IEEE 8th Annual Computing and Communication Workshop and Conference (CCWC).