Modeling multimodal cues in a deep learning-based framework for emotion recognition in the wild
暂无分享,去创建一个
[1] Albert Newen,et al. Cognitive penetrability and emotion recognition in human facial expressions , 2015, Front. Psychol..
[2] Ya Li,et al. Multi-scale Temporal Modeling for Dimensional Emotion Recognition in Video , 2014, AVEC '14.
[3] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[4] Tara N. Sainath,et al. The shared views of four research groups ) , 2012 .
[5] Erhardt Barth,et al. Recurrent Dropout without Memory Loss , 2016, COLING.
[6] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[7] Jesse Hoey,et al. From individual to group-level emotion recognition: EmotiW 5.0 , 2017, ICMI.
[8] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] L. de Haan,et al. Specificity of facial emotion recognition impairments in patients with multi-episode schizophrenia , 2015, Schizophrenia Research: Cognition.
[10] Wojciech Zaremba,et al. Recurrent Neural Network Regularization , 2014, ArXiv.
[11] Benoit Huet,et al. Towards multimodal emotion recognition: a new approach , 2010, CIVR '10.
[12] Shiliang Zhang,et al. Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition , 2016, ICMR.
[13] Tamás D. Gedeon,et al. Collecting Large, Richly Annotated Facial-Expression Databases from Movies , 2012, IEEE MultiMedia.
[14] Thomas S. Huang,et al. How deep neural networks can improve emotion recognition on video data , 2016, 2016 IEEE International Conference on Image Processing (ICIP).
[15] Nitish Srivastava,et al. Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.
[16] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[17] Luc Van Gool,et al. AENet: Learning Deep Audio Features for Video Analysis , 2017, IEEE Transactions on Multimedia.
[18] Dongmei Jiang,et al. Multimodal Affective Dimension Prediction Using Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks , 2015, AVEC@ACM Multimedia.
[19] Ioannis Pitas,et al. The eNTERFACE05 Audio-Visual Emotion Database , 2006, 22nd International Conference on Data Engineering Workshops (ICDEW'06).
[20] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[21] Haizhou Li,et al. Audio and face video emotion recognition in the wild using deep neural networks and small datasets , 2016, ICMI.
[22] S. Tiwari. Deep features for multimodal emotion classification , 2016 .
[23] Wootaek Lim,et al. Speech emotion recognition using convolutional and Recurrent Neural Networks , 2016, 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).
[24] Razvan Pascanu,et al. Combining modality specific deep neural networks for emotion recognition in video , 2013, ICMI '13.
[25] A. Corvin,et al. Detecting facial emotion recognition deficits in schizophrenia using dynamic stimuli of varying intensities , 2016, Neuroscience Letters.
[26] Qin Jin,et al. Multi-modal Dimensional Emotion Recognition using Recurrent Neural Networks , 2015, AVEC@ACM Multimedia.
[27] Antonio Torralba,et al. SoundNet: Learning Sound Representations from Unlabeled Video , 2016, NIPS.
[28] Benoit Huet,et al. EURECOM@MediaEval 2017: Media Genre Inference for Predicting Media Interestingness , 2017, MediaEval.
[29] Fabien Ringeval,et al. AV+EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data , 2015, AVEC@ACM Multimedia.
[30] Ping Hu,et al. HoloNet: towards robust emotion recognition in the wild , 2016, ICMI.
[31] Yongzhao Zhan,et al. Speech Emotion Recognition Using CNN , 2014, ACM Multimedia.
[32] Christopher Joseph Pal,et al. Recurrent Neural Networks for Emotion Recognition in Video , 2015, ICMI.
[33] Yuanliu Liu,et al. Video-based emotion recognition using CNN-RNN and C3D hybrid networks , 2016, ICMI.
[34] Antonio Torralba,et al. Anticipating Visual Representations from Unlabeled Video , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[35] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.
[36] Sergey Ioffe,et al. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.
[37] Tomás Pajdla,et al. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[38] Yu Qiao,et al. Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks , 2016, IEEE Signal Processing Letters.
[39] Wen Gao,et al. Learning Affective Features With a Hybrid Deep Model for Audio–Visual Emotion Recognition , 2018, IEEE Transactions on Circuits and Systems for Video Technology.
[40] Yoshua Bengio,et al. Challenges in representation learning: A report on three machine learning contests , 2013, Neural Networks.
[41] Honglak Lee,et al. Deep learning for robust feature generation in audiovisual emotion recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[42] John R. Smith,et al. Harnessing A.I. for Augmenting Creativity: Application to Movie Trailer Creation , 2017, ACM Multimedia.
[43] Ya Li,et al. Long short term memory recurrent neural network based encoding method for emotion recognition in video , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[44] Yongzhao Zhan,et al. Learning Salient Features for Speech Emotion Recognition Using Convolutional Neural Networks , 2014, IEEE Transactions on Multimedia.
[45] Albert Ali Salah,et al. Video-based emotion recognition in the wild using deep transfer learning and score fusion , 2017, Image Vis. Comput..
[46] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[47] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[48] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[49] Cordelia Schmid,et al. Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[50] Dong Yu,et al. Speech emotion recognition using deep neural network and extreme learning machine , 2014, INTERSPEECH.