Multimodal Affective Analysis Using Hierarchical Attention Strategy with Word-Level Alignment
暂无分享,去创建一个
Ivan Marsic | Shuhong Chen | Xinyu Li | Yue Gu | Kangning Yang | Shiyu Fu | I. Marsic | Xinyu Li | Kangning Yang | Yue Gu | Shuhong Chen | Shiyu Fu
[1] John Kane,et al. COVAREP — A collaborative voice analysis repository for speech technologies , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Eric P. Xing,et al. Select-Additive Learning: Improving Cross-individual Generalization in Multimodal Sentiment Analysis , 2016, ArXiv.
[3] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[4] Diane J. Litman,et al. Predicting Student Emotions in Computer-Human Tutoring Dialogues , 2004, ACL.
[5] Ivan Marsic,et al. Deep Mul Timodal Learning for Emotion Recognition in Spoken Language , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Louis-Philippe Morency,et al. MOSI: Multimodal Corpus of Sentiment Intensity and Subjectivity Analysis in Online Opinion Videos , 2016, ArXiv.
[7] Wen Gao,et al. Learning Affective Features With a Hybrid Deep Model for Audio–Visual Emotion Recognition , 2018, IEEE Transactions on Circuits and Systems for Video Technology.
[8] Rohit Kumar,et al. Ensemble of SVM trees for multimodal emotion recognition , 2012, Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference.
[9] Verónica Pérez-Rosas,et al. Multimodal Sentiment Analysis of Spanish Online Videos , 2013, IEEE Intelligent Systems.
[10] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[11] Arman Savran,et al. Combining video, audio and lexical indicators of affect in spontaneous conversation via particle filtering , 2012, ICMI '12.
[12] Erik Cambria,et al. Deep Convolutional Neural Network Textual Features and Multiple Kernel Learning for Utterance-level Multimodal Sentiment Analysis , 2015, EMNLP.
[13] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[14] Erik Cambria,et al. A review of affective computing: From unimodal analysis to multimodal fusion , 2017, Inf. Fusion.
[15] Diane J. Litman,et al. Predicting Emotion in Spoken Dialogue from Multiple Knowledge Sources , 2004, NAACL.
[16] Ngoc Thang Vu,et al. Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech , 2017, INTERSPEECH.
[17] Björn W. Schuller,et al. Patterns, prototypes, performance: classifying emotional user states , 2008, INTERSPEECH.
[18] Che-Wei Huang,et al. Attention Assisted Discovery of Sub-Utterance Structure in Speech Emotion Recognition , 2016, INTERSPEECH.
[19] Ivan Marsic,et al. Speech Intention Classification with Multimodal Deep Learning , 2017, Canadian Conference on AI.
[20] Eric P. Xing,et al. Select-additive learning: Improving generalization in multimodal sentiment analysis , 2016, 2017 IEEE International Conference on Multimedia and Expo (ICME).
[21] Diyi Yang,et al. Hierarchical Attention Networks for Document Classification , 2016, NAACL.
[22] Björn W. Schuller,et al. On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues , 2009, Journal on Multimodal User Interfaces.
[23] Erik Cambria,et al. Convolutional MKL Based Multimodal Emotion Recognition and Sentiment Analysis , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).
[24] Sen Wang,et al. Multimodal sentiment analysis with word-level fusion and reinforcement learning , 2017, ICMI.
[25] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.
[26] Carlos Busso,et al. IEMOCAP: interactive emotional dyadic motion capture database , 2008, Lang. Resour. Evaluation.
[27] Gerald Penn,et al. Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[28] Chengxin Li,et al. Speech emotion recognition with acoustic and lexical features , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[29] Seyedmahdad Mirsamadi,et al. Automatic speech emotion recognition using recurrent neural networks with local attention , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[30] Björn Schuller,et al. Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.
[31] Erik Cambria,et al. Context-Dependent Sentiment Analysis in User-Generated Videos , 2017, ACL.
[32] Erik Cambria,et al. Tensor Fusion Network for Multimodal Sentiment Analysis , 2017, EMNLP.
[33] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .
[34] Björn W. Schuller,et al. YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context , 2013, IEEE Intelligent Systems.
[35] Rada Mihalcea,et al. Towards multimodal sentiment analysis: harvesting opinions from the web , 2011, ICMI '11.