Filtration network: A frame sampling strategy via deep reinforcement learning for video captioning
Xue Mei | Tiancheng Qian | Pengxiang Xu | Kangqi Ge | Zhelei Qiu