Attention Mechanism for Fashion Image Captioning

Om Prakash | Bao T. Nguyen | Anh H. Vo

[1] Andrew Zisserman, et al. Very Deep Convolutional Networks for Large-Scale Image Recognition, 2014, ICLR.

[2] Kota Yamaguchi, et al. Attention to describe products with attributes, 2017, 2017 Fifteenth IAPR International Conference on Machine Vision Applications (MVA).

[3] Lei Zhang, et al. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering, 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4] Van-Huy Pham, et al. Video-Based Vietnamese Sign Language Recognition Using Local Descriptors, 2019, ACIIDS.

[5] Fei-Fei Li, et al. Deep visual-semantic alignments for generating image descriptions, 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Jian Sun, et al. Deep Residual Learning for Image Recognition, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Shuang Bai, et al. A survey on automatic image caption generation, 2018, Neurocomputing.

[8] Santosh Chapaneri, et al. Encoder-Decoder Architecture for Image Caption Generation, 2020, 2020 3rd International Conference on Communication System, Computing and IT Applications (CSCITA).

[9] C. Lawrence Zitnick, et al. CIDEr: Consensus-based image description evaluation, 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Yejin Choi, et al. Baby talk: Understanding and generating simple image descriptions, 2011, CVPR 2011.

[11] Salim Roukos, et al. Bleu: a Method for Automatic Evaluation of Machine Translation, 2002, ACL.

[12] Bao T. Nguyen, et al. Deep Learning for Vietnamese Sign Language Recognition in Video Sequence, 2019.

[13] Wei Xu, et al. Dual Learning for Cross-domain Image Captioning, 2017, CIKM.

[14] Anh Vo, et al. Facial Expression Recognition Based on Salient Regions, 2018, 2018 4th International Conference on Green Technology and Sustainable Development (GTSD).

[15] Samy Bengio, et al. Show and tell: A neural image caption generator, 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Ngoc Quoc Ly, et al. Facial Expression Recognition Using Pyramid Local Phase Quantization Descriptor, 2014, KSE.

[17] Tat-Seng Chua, et al. SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning, 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Ying Zhang, et al. Fashion-Gen: The Generative Fashion Dataset and Challenge, 2018, ArXiv.

[19] Vaibhava Goel, et al. Self-Critical Sequence Training for Image Captioning, 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Trang Nguyen, et al. A hybrid framework for smile detection in class imbalance scenarios, 2019, Neural Computing and Applications.

[21] Gang Sun, et al. Squeeze-and-Excitation Networks, 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22] Changki Lee, et al. Image Caption Generation using Recurrent Neural Network, 2016.

[23] Richard Socher, et al. Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning, 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Nazli Ikizler-Cinbis, et al. Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures, 2016, J. Artif. Intell. Res.

[25] Siqi Liu, et al. Optimization of image description metrics using policy gradient methods, 2016, ArXiv.

[26] Yoshua Bengio, et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, 2015, ICML.

[27] Xiaogang Wang, et al. DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).