Video Description Model Based on Temporal-Spatial and Channel Multi-Attention Mechanisms
暂无分享,去创建一个
Jie Xu | Jinhong Guo | Linke Li | Haoliang Wei | Qiuru Fu | Jinhong Guo | Jie Xu | Haoliang Wei | Qiuru Fu | Linke Li
[1] Yingming Li,et al. Recurrent convolutional video captioning with global and local attention , 2019, Neurocomputing.
[2] Bin Zhao,et al. CAM-RNN: Co-Attention Model Based RNN for Video Captioning , 2019, IEEE Transactions on Image Processing.
[3] Yi Yang,et al. Convolutional Reconstruction-to-Sequence for Video Captioning , 2020, IEEE Transactions on Circuits and Systems for Video Technology.
[4] Yi Yang,et al. Two-Stream Multirate Recurrent Neural Network for Video-Based Pedestrian Reidentification , 2018, IEEE Transactions on Industrial Informatics.
[5] Timothy K. Shih,et al. Modelling a Spatial-Motion Deep Learning Framework to Classify Dynamic Patterns of Videos , 2020 .
[6] Heng Tao Shen,et al. Video Captioning by Adversarial LSTM , 2018, IEEE Transactions on Image Processing.
[7] Ghulam Muhammad,et al. Automatic Fruit Classification Using Deep Learning for Industrial Applications , 2019, IEEE Transactions on Industrial Informatics.
[8] Yongdong Zhang,et al. STAT: Spatial-Temporal Attention Mechanism for Video Captioning , 2020, IEEE Transactions on Multimedia.
[9] Alireza Behrad,et al. Video captioning using boosted and parallel Long Short-Term Memory networks , 2020, Comput. Vis. Image Underst..
[10] Hyedong Jung,et al. Variational Autoencoder-Based Multiple Image Captioning Using a Caption Attention Map , 2019, Applied Sciences.