Multimodal Approach of Speech Emotion Recognition Using Multi-Level Multi-Head Fusion Attention-Based Recurrent Neural Network
Hyung-Jeong Yang | Soo-Hyung Kim | Gueesang Lee | Ngoc-Huynh Ho
[1] Kornel Laskowski,et al. Emotion recognition in spontaneous speech using GMMs , 2006, INTERSPEECH.
[2] Jinkyu Lee,et al. High-level feature representation using recurrent neural network for speech emotion recognition , 2015, INTERSPEECH.
[3] Pushpak Bhattacharyya,et al. Multi-task Gated Contextual Cross-Modal Attention Framework for Sentiment and Emotion Analysis , 2019, ICONIP.
[4] Ruili Wang,et al. Ensemble methods for spoken emotion recognition in call-centres , 2007, Speech Commun.
[5] Mohammed Bennamoun,et al. Learning-Based Confidence Estimation for Multi-modal Classifier Fusion , 2019, ICONIP.
[6] Chengxin Li,et al. Speech emotion recognition with acoustic and lexical features , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[8] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[9] Che-Wei Huang,et al. Attention Assisted Discovery of Sub-Utterance Structure in Speech Emotion Recognition , 2016, INTERSPEECH.
[10] Carlos Busso,et al. IEMOCAP: interactive emotional dyadic motion capture database , 2008, Lang. Resour. Evaluation.
[11] Laurence Devillers,et al. CNN+LSTM Architecture for Speech Emotion Recognition with Data Augmentation , 2018, Workshop on Speech, Music and Mind (SMM 2018).
[12] Chunyan Miao,et al. Knowledge-Enriched Transformer for Emotion Detection in Textual Conversations , 2019, EMNLP.
[13] Erik Cambria,et al. Conversational Memory Network for Emotion Recognition in Dyadic Dialogue Videos , 2018, NAACL.
[14] Björn W. Schuller,et al. Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.
[15] Efthymios Tzinis,et al. Segment-based speech emotion recognition using recurrent neural networks , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).
[16] Ngoc Thang Vu,et al. Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech , 2017, INTERSPEECH.
[17] Lun-Wei Ku,et al. EmotionLines: An Emotion Corpus of Multi-Party Conversations , 2018, LREC.
[18] Dong Yu,et al. Speech emotion recognition using deep neural network and extreme learning machine , 2014, INTERSPEECH.
[19] Ryohei Nakatsu,et al. Emotion Recognition in Speech Using Neural Networks , 2000, Neural Computing & Applications.
[20] Chan Woo Lee,et al. Convolutional Attention Networks for Multimodal Emotion Recognition from Speech and Text Data , 2018, ArXiv.
[21] Rada Mihalcea,et al. MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations , 2018, ACL.
[22] Shiqing Zhang,et al. Spoken emotion recognition via locality-constrained kernel sparse representation , 2014, Neural Computing and Applications.
[23] Rada Mihalcea,et al. DialogueRNN: An Attentive RNN for Emotion Detection in Conversations , 2018, AAAI.
[24] Shashidhar G. Koolagudi,et al. SVM Scheme for Speech Emotion Recognition using MFCC Feature , 2013.
[25] P. Kleinginna,et al. A categorized list of emotion definitions, with suggestions for a consensual definition , 1981.
[26] Shiqing Zhang,et al. Robust emotion recognition in noisy speech via sparse representation , 2013, Neural Computing and Applications.
[27] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[28] Fakhri Karray,et al. Survey on speech emotion recognition: Features, classification schemes, and databases , 2011, Pattern Recognit.
[29] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[30] Guodong Zhou,et al. Modeling both Context- and Speaker-Sensitive Dependence for Emotion Detection in Multi-speaker Conversations , 2019, IJCAI.
[31] Erik Cambria,et al. Multi-attention Recurrent Network for Human Communication Comprehension , 2018, AAAI.
[32] Pushpak Bhattacharyya,et al. Multi-task Learning for Multi-modal Emotion Recognition and Sentiment Analysis , 2019, NAACL.
[33] Rui Xia,et al. Multimodal Relational Tensor Network for Sentiment and Emotion Classification , 2018, ArXiv.
[34] Frank Dellaert,et al. Recognizing emotion in speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[35] Philip N. Garner,et al. Context-Aware Attention Mechanism for Speech Emotion Recognition , 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[36] Jennifer Williams,et al. Recognizing Emotions in Video Using Multimodal DNN Feature Fusion , 2018.
[37] Homayoon S. M. Beigi,et al. Multi-Modal Emotion recognition on IEMOCAP Dataset using Deep Learning , 2018, ArXiv.
[38] Björn W. Schuller,et al. Categorical and dimensional affect analysis in continuous input: Current trends and future directions , 2013, Image Vis. Comput.
[39] Diego H. Milone,et al. Spoken emotion recognition using hierarchical classifiers , 2011, Comput. Speech Lang.
[40] Albino Nogueiras,et al. Speech emotion recognition using hidden Markov models , 2001, INTERSPEECH.
[41] Grigoriy Sterling,et al. Emotion Recognition From Speech With Recurrent Neural Networks , 2017, ArXiv.
[42] Michael R. Lyu,et al. Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network , 2019, AAAI.
[43] Seyedmahdad Mirsamadi,et al. Automatic speech emotion recognition using recurrent neural networks with local attention , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[44] Yuanyuan Zhang,et al. Attention Based Fully Convolutional Network for Speech Emotion Recognition , 2018, 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).
[45] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[46] Carlos Busso,et al. Emotion recognition using a hierarchical binary decision tree approach , 2011, Speech Commun.
[47] M. Sreeshakthy,et al. Classification of Human Emotion from Deap EEG Signal Using Hybrid Improved Neural Networks with Cuckoo Search , 2016.