Towards Building Large Scale Multimodal Domain-Aware Conversation Systems
Mitesh M. Khapra | Amrita Saha | Karthik Sankaranarayanan
[1] Yoshua Bengio, et al. Generative Adversarial Nets, 2014, NIPS.
[2] Hugo Larochelle, et al. GuessWhat?! Visual Object Discovery through Multi-modal Dialogue, 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[3] Yoshua Bengio, et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling, 2014, ArXiv.
[4] Wei Xu, et al. Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Bernt Schiele, et al. Generative Adversarial Text to Image Synthesis, 2016, ICML.
[6] Kam-Fai Wong, et al. An Attentional Neural Conversation Model with Improved Specificity, 2016, ArXiv.
[7] Juan Carlos Niebles, et al. Leveraging Video Descriptions to Learn Video Question Answering, 2016, AAAI.
[8] Alan Ritter, et al. Unsupervised Modeling of Twitter Conversations, 2010, NAACL.
[9] Andrew Zisserman, et al. Very Deep Convolutional Networks for Large-Scale Image Recognition, 2014, ICLR.
[10] Quoc V. Le, et al. A Neural Conversational Model, 2015, ArXiv.
[11] Margaret Mitchell, et al. VQA: Visual Question Answering, 2015, International Journal of Computer Vision.
[12] Tegan Maharaj, et al. A Dataset and Exploration of Models for Understanding Video Data through Fill-in-the-Blank Question-Answering, 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Meeyoung Cha, et al. Fashion Conversation Data on Instagram, 2017, ICWSM.
[14] Joelle Pineau, et al. The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems, 2015, SIGDIAL Conference.
[15] José M. F. Moura, et al. Visual Dialog, 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[16] Jason Weston, et al. Learning End-to-End Goal-Oriented Dialog, 2016, ICLR.
[17] Joelle Pineau, et al. A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues, 2016, AAAI.
[18] Yoshua Bengio, et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, 2015, ICML.
[19] Joelle Pineau, et al. Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models, 2015, AAAI.
[20] Jianfeng Gao, et al. Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation, 2017, IJCNLP.