[1] Chong Wang,et al. Stochastic variational inference , 2012, J. Mach. Learn. Res.
[2] Karol Gregor,et al. Neural Variational Inference and Learning in Belief Networks , 2014, ICML.
[3] Mahdieh Soleymani Baghshah,et al. Jointly Measuring Diversity and Quality in Text Generation Models , 2019, Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural Language Generation.
[4] Samy Bengio,et al. Generating Sentences from a Continuous Space , 2015, CoNLL.
[5] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[6] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Ben Poole,et al. Categorical Reparameterization with Gumbel-Softmax , 2016, ICLR.
[8] Jacopo Staiano,et al. What BERT Sees: Cross-Modal Transfer for Visual Question Generation , 2020, INLG.
[9] Ruslan Salakhutdinov,et al. Importance Weighted Autoencoders , 2015, ICLR.
[10] Alexander M. Rush,et al. Latent Alignment and Variational Attention , 2018, NeurIPS.
[11] Rémi Louf,et al. Transformers: State-of-the-art Natural Language Processing , 2019.
[12] Kilian Q. Weinberger,et al. BERTScore: Evaluating Text Generation with BERT , 2019, ICLR.
[13] Lucia Specia,et al. Probing the Need for Visual Context in Multimodal Machine Translation , 2019, NAACL.
[14] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[15] Oriol Vinyals,et al. Neural Discrete Representation Learning , 2017, NIPS.
[16] Peng Xu,et al. Variational Transformers for Diverse Response Generation , 2020, ArXiv.
[17] Iryna Gurevych,et al. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks , 2019, EMNLP.
[18] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[19] Max Welling,et al. Semi-supervised Learning with Deep Generative Models , 2014, NIPS.
[20] Bolei Zhou,et al. Visual Question Generation as Dual Task of Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[21] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[22] Phil Blunsom,et al. Language as a Latent Variable: Discrete Generative Models for Sentence Compression , 2016, EMNLP.
[23] Eric P. Xing,et al. Toward Controlled Generation of Text , 2017, ICML.
[24] Desmond Elliott,et al. Findings of the Third Shared Task on Multimodal Machine Translation , 2018, WMT.
[25] Khalil Sima'an,et al. A Shared Task on Multimodal Machine Translation and Crosslingual Image Description , 2016, WMT.
[26] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.
[27] Xuanjing Huang,et al. A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators , 2018, COLING.
[28] Vinay P. Namboodiri,et al. Deep Bayesian Network for Visual Question Generation , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).
[29] Lei Zhang,et al. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[30] Yue Zheng,et al. Intention Oriented Image Captions With Guiding Objects , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[31] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.
[32] Dhruv Batra,et al. Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[33] Phil Blunsom,et al. Neural Variational Inference for Text Processing , 2015, ICML.
[34] Tat-Seng Chua,et al. Recent Advances in Neural Question Generation , 2019, ArXiv.
[35] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[36] C. Lawrence Zitnick,et al. CIDEr: Consensus-based image description evaluation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[37] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[38] Alexander G. Schwing,et al. Creativity: Generating Diverse Questions Using Variational Autoencoders , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Sandeep Kumar,et al. Multimodal Differential Network for Visual Question Generation , 2018, EMNLP.
[40] Rita Cucchiara,et al. Meshed-Memory Transformer for Image Captioning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[41] Jianfeng Gao,et al. Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks , 2020, ECCV.
[42] Lysandre Debut,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.
[43] Radu Soricut,et al. Understanding Guided Image Captioning Performance across Domains , 2021, CoNLL.
[44] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[45] Michael S. Bernstein,et al. Information Maximizing Visual Question Generation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[46] Alon Lavie,et al. METEOR: An Automatic Metric for MT Evaluation with High Levels of Correlation with Human Judgments , 2007, WMT@ACL.
[47] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[48] Oriol Vinyals,et al. Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.
[49] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.
[50] Shaodi You,et al. Automatic Generation of Grounded Visual Questions , 2016, IJCAI.
[51] Desmond Elliott,et al. Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description , 2017, WMT.