论文信息 - Video Description Model Based on Temporal-Spatial and Channel Multi-Attention Mechanisms - 字舞流文

Video Description Model Based on Temporal-Spatial and Channel Multi-Attention Mechanisms

Jie Xu | Jinhong Guo | Linke Li | Haoliang Wei | Qiuru Fu | Jinhong Guo | Jie Xu | Haoliang Wei | Qiuru Fu | Linke Li

[1] Yingming Li,et al. Recurrent convolutional video captioning with global and local attention , 2019, Neurocomputing.

[2] Bin Zhao,et al. CAM-RNN: Co-Attention Model Based RNN for Video Captioning , 2019, IEEE Transactions on Image Processing.

[3] Yi Yang,et al. Convolutional Reconstruction-to-Sequence for Video Captioning , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[4] Yi Yang,et al. Two-Stream Multirate Recurrent Neural Network for Video-Based Pedestrian Reidentification , 2018, IEEE Transactions on Industrial Informatics.

[5] Timothy K. Shih,et al. Modelling a Spatial-Motion Deep Learning Framework to Classify Dynamic Patterns of Videos , 2020 .

[6] Heng Tao Shen,et al. Video Captioning by Adversarial LSTM , 2018, IEEE Transactions on Image Processing.

[7] Ghulam Muhammad,et al. Automatic Fruit Classification Using Deep Learning for Industrial Applications , 2019, IEEE Transactions on Industrial Informatics.

[8] Yongdong Zhang,et al. STAT: Spatial-Temporal Attention Mechanism for Video Captioning , 2020, IEEE Transactions on Multimedia.

[9] Alireza Behrad,et al. Video captioning using boosted and parallel Long Short-Term Memory networks , 2020, Comput. Vis. Image Underst..

[10] Hyedong Jung,et al. Variational Autoencoder-Based Multiple Image Captioning Using a Caption Attention Map , 2019, Applied Sciences.