AlignVE: Visual Entailment Recognition Based on Alignment Relations
暂无分享,去创建一个
J. Kwok | Bo Liu | Lei He | Jiuxin Cao | Biwei Cao | Jie Gui | Yuan Yan Tang | Jiayun Shen
[1] Ilya Sutskever,et al. Learning Transferable Visual Models From Natural Language Supervision , 2021, ICML.
[2] Thomas Lukasiewicz,et al. e-SNLI-VE: Corrected Visual-Textual Entailment with Natural Language Explanations , 2020, 2004.03744.
[3] Xinlei Chen,et al. In Defense of Grid Features for Visual Question Answering , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.
[5] Yu Cheng,et al. UNITER: UNiversal Image-TExt Representation Learning , 2019, ECCV.
[6] Yuqing Guo,et al. Multi-grained Attention with Object-level Grounding for Visual Question Answering , 2019, ACL.
[7] Koji Mineshima,et al. Multimodal Logical Inference System for Visual-Textual Entailment , 2019, ACL.
[8] Zhou Yu,et al. Deep Modular Co-Attention Networks for Visual Question Answering , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Asim Kadav,et al. Visual Entailment: A Novel Task for Fine-Grained Image Understanding , 2019, ArXiv.
[10] Peng Gao,et al. Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Asim Kadav,et al. Visual Entailment Task for Visually-Grounded Language Learning , 2018, ArXiv.
[12] Takayuki Okatani,et al. Improved Fusion of Visual and Language Representations by Dense Symmetric Co-attention for Visual Question Answering , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[13] Tamir Hazan,et al. High-Order Attention Models for Visual Question Answering , 2017, NIPS.
[14] Jian Zhang,et al. Natural Language Inference over Interaction Space , 2017, ICLR.
[15] Lei Zhang,et al. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[16] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[17] Pascual Martínez-Gómez,et al. On-demand Injection of Lexical Knowledge for Recognising Textual Entailment , 2017, EACL.
[18] Zhifang Sui,et al. Reading and Thinking: Re-read LSTM Unit for Textual Entailment Recognition , 2016, COLING.
[19] Zhen-Hua Ling,et al. Enhanced LSTM for Natural Language Inference , 2016, ACL.
[20] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.
[21] Jiasen Lu,et al. Hierarchical Question-Image Co-Attention for Visual Question Answering , 2016, NIPS.
[22] Yang Liu,et al. Learning Natural Language Inference using Bidirectional LSTM model and Inner-Attention , 2016, ArXiv.
[23] Shuohang Wang,et al. Learning Natural Language Inference with LSTM , 2015, NAACL.
[24] Rui Yan,et al. Natural Language Inference by Tree-Based Convolution and Heuristic Matching , 2015, ACL.
[25] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Licheng Yu,et al. Visual Madlibs: Fill in the Blank Description Generation and Question Answering , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[27] Saurabh Singh,et al. Where to Look: Focus Regions for Visual Question Answering , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Yash Goyal,et al. Yin and Yang: Balancing and Answering Binary Visual Questions , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Alexander J. Smola,et al. Stacked Attention Networks for Image Question Answering , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[30] Phil Blunsom,et al. Reasoning about Entailment with Neural Attention , 2015, ICLR.
[31] Christopher Potts,et al. A large annotated corpus for learning natural language inference , 2015, EMNLP.
[32] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[33] Wei Xu,et al. Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question , 2015, NIPS.
[34] Mario Fritz,et al. Ask Your Neurons: A Neural-Based Approach to Answering Questions about Images , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[35] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.
[36] Zhi Jin,et al. Discriminative Neural Sentence Modeling by Tree-Based Convolution , 2015, EMNLP.
[37] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[38] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[39] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[40] Dan Roth,et al. A Unified Representation and Inference Paradigm for Natural Language , 2008 .
[41] Ido Dagan,et al. The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.
[42] Christopher D. Manning,et al. Learning to recognize features of valid textual entailments , 2006, NAACL.
[43] S. Hochreiter,et al. Long Short-Term Memory , 1997, Neural Computation.
[44] Yorick Wilks,et al. Natural language inference. , 1973 .
[45] Ming-Wei Chang,et al. Relation Alignment for Textual Entailment Recognition , 2009, TAC.
[46] Diana Inkpen,et al. Machine Learning Experiments for Textual Entailment , 2006 .
[47] Ido Dagan,et al. Investigating a Generic Paraphrase-Based Approach for Relation Extraction , 2006, EACL.
[48] Ido Dagan,et al. Web Based Probabilistic Textual Entailment , 2005 .