Pooling acoustic and lexical features for the prediction of valence
暂无分享,去创建一个
Emily Mower Provost | Dimitrios Dimitriadis | Zakaria Aldeneh | Soheil Khorram | E. Provost | D. Dimitriadis | Zakaria Aldeneh | S. Khorram
[1] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.
[2] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.
[3] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.
[4] Lianhong Cai,et al. Combining CNN and BLSTM to Extract Textual and Acoustic Features for Recognizing Stances in Mandarin Ideological Debate Competition , 2016, INTERSPEECH.
[5] Trevor Darrell,et al. Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding , 2016, EMNLP.
[6] Yang Gao,et al. Compact Bilinear Pooling , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Verónica Pérez-Rosas,et al. Utterance-Level Multimodal Sentiment Analysis , 2013, ACL.
[8] Björn Schuller,et al. Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.
[9] Carlos Busso,et al. Using neutral speech models for emotional speech analysis , 2007, INTERSPEECH.
[10] Stefan Scherer,et al. A Multimodal Predictive Model of Successful Debaters or How I Learned to Sway Votes , 2015, ACM Multimedia.
[11] Charless C. Fowlkes,et al. Bilinear classifiers for visual recognition , 2009, NIPS.
[12] Stefan Scherer,et al. Learning representations of emotional speech with deep convolutional generative adversarial networks , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] Andrew Rosenberg,et al. Classifying Skewed Data: Importance Weighting to Optimize Average Recall , 2012, INTERSPEECH.
[14] Subhransu Maji,et al. Bilinear CNN Models for Fine-Grained Visual Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[15] Björn W. Schuller,et al. Paralinguistics in speech and language - State-of-the-art and the challenge , 2013, Comput. Speech Lang..
[16] Carlos Busso,et al. IEMOCAP: interactive emotional dyadic motion capture database , 2008, Lang. Resour. Evaluation.
[17] Erik Cambria,et al. Deep Convolutional Neural Network Textual Features and Multiple Kernel Learning for Utterance-level Multimodal Sentiment Analysis , 2015, EMNLP.
[18] Tara N. Sainath,et al. Learning filter banks within a deep neural network framework , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[19] Rasmus Pagh,et al. Fast and scalable polynomial kernels via explicit feature maps , 2013, KDD.
[20] Louis-Philippe Morency,et al. Learning Representations of Affect from Speech , 2015, ICLR 2015.
[21] Chengxin Li,et al. Speech emotion recognition with acoustic and lexical features , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).