Multi-modal Fusion Using Spatio-temporal and Static Features for Group Emotion Recognition
暂无分享,去创建一个
Yi Yang | Jian Tang | Jieping Ye | Hui Feng | Jian Li | Haifeng Shen | Mo Sun | Wei Gou
[1] Yu Qiao,et al. Frame Attention Networks for Facial Expression Recognition in Videos , 2019, 2019 IEEE International Conference on Image Processing (ICIP).
[2] Haifeng Shen,et al. PropagationNet: Propagate Points to Curve to Learn Structure Information , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[3] Xingyi Zhou,et al. Objects as Points , 2019, ArXiv.
[4] Dima Damen,et al. EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[5] Zhiyuan Li,et al. Group-Level Emotion Recognition using Deep Models with A Four-stream Hybrid Network , 2018, ICMI.
[6] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Luc Van Gool,et al. Temporal Segment Networks: Towards Good Practices for Deep Action Recognition , 2016, ECCV.
[8] Chuang Gan,et al. TSM: Temporal Shift Module for Efficient Video Understanding , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[9] Irene Kotsia,et al. RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Enhua Wu,et al. Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[11] Jitendra Malik,et al. SlowFast Networks for Video Recognition , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).