暂无分享,去创建一个
[1] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[2] Chuang Gan,et al. Recurrent Topic-Transition GAN for Visual Paragraph Generation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[3] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[4] Alon Lavie,et al. Meteor Universal: Language Specific Translation Evaluation for Any Target Language , 2014, WMT@ACL.
[5] Hongwei Liu,et al. Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC , 2017, ICML.
[6] Tao Mei,et al. Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation , 2019, IJCAI.
[7] Rita Cucchiara,et al. Meshed-Memory Transformer for Image Captioning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[9] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[11] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[12] Chang Zhou,et al. Show and Tell More: Topic-Oriented Multi-Sentence Image Captioning , 2018, IJCAI.
[13] Fei Sha,et al. Aligning Where to See and What to Tell: Image Captioning with Region-Based Attention and Scene-Specific Contexts , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[14] Ting Wang,et al. What Topics Do Images Say: A Neural Image Captioning Model with Topic Representation , 2019, 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).
[15] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[16] Alexander G. Schwing,et al. Diverse and Coherent Paragraph Generation from Images , 2018, ECCV.
[17] Mihai Surdeanu,et al. The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.
[18] Jie Chen,et al. Attention on Attention for Image Captioning , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[19] Mingyuan Zhou,et al. Augmentable Gamma Belief Networks , 2016, J. Mach. Learn. Res..
[20] Zejian Yuan,et al. Topic-Guided Attention for Image Captioning , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).
[21] Michael S. Bernstein,et al. Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations , 2016, International Journal of Computer Vision.
[22] Jonathan Krause,et al. A Hierarchical Approach for Generating Descriptive Image Paragraphs , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[24] Alexander M. Rush,et al. Training for Diversity in Image Paragraph Captioning , 2018, EMNLP.
[25] Karl Stratos,et al. Large Scale Retrieval and Generation of Image Descriptions , 2015, International Journal of Computer Vision.
[26] Jian Yang,et al. Topic-Oriented Image Captioning Based on Order-Embedding , 2019, IEEE Transactions on Image Processing.
[27] Alec Radford,et al. Improving Language Understanding by Generative Pre-Training , 2018 .
[28] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Hao Zhang,et al. Variational Hetero-Encoder Randomized Generative Adversarial Networks for Joint Image-Text Modeling , 2019, ArXiv.
[30] Bo Chen,et al. Recurrent Hierarchical Topic-Guided RNN for Language Generation , 2020, ICML.
[31] C. Lawrence Zitnick,et al. CIDEr: Consensus-based image description evaluation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[32] Hao Zhang,et al. WHAI: Weibull Hybrid Autoencoding Inference for Deep Topic Modeling , 2018, ICLR.
[33] Michael I. Jordan,et al. Hierarchical Dirichlet Processes , 2006 .
[34] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[35] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[36] Chengming Li,et al. Interactive Key-Value Memory-augmented Attention for Image Paragraph Captioning , 2020, COLING.
[37] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[38] Chong Wang,et al. Stochastic variational inference , 2012, J. Mach. Learn. Res..
[39] Lei Zhang,et al. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[40] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[41] Yi Yang,et al. Entangled Transformer for Image Captioning , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[42] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.