暂无分享,去创建一个
[1] Mohit Bansal,et al. LXMERT: Learning Cross-Modality Encoder Representations from Transformers , 2019, EMNLP.
[2] G. Boole. An Investigation of the Laws of Thought: On which are founded the mathematical theories of logic and probabilities , 2007 .
[3] A. Gopnik,et al. The scientist in the crib : minds, brains, and how children learn , 1999 .
[4] Luke S. Zettlemoyer,et al. Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars , 2005, UAI.
[5] Mario Fritz,et al. A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input , 2014, NIPS.
[6] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.
[7] S. Carey. Conceptual Change in Childhood , 1985 .
[8] Yash Goyal,et al. Yin and Yang: Balancing and Answering Binary Visual Questions , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] R'emi Louf,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.
[10] Qing Li,et al. Why Does a Visual Question Have Different Answers? , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[11] M. Wolff,et al. Hegel's Science of Logic , 2013 .
[12] Yash Goyal,et al. Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Andrew McCallum,et al. Compositional Vector Space Models for Knowledge Base Completion , 2015, ACL.
[14] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[15] Richard Evans,et al. Learning Explanatory Rules from Noisy Data , 2017, J. Artif. Intell. Res..
[16] Ali Farhadi,et al. From Recognition to Cognition: Visual Commonsense Reasoning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Richard S. Zemel,et al. Exploring Models and Data for Image Question Answering , 2015, NIPS.
[18] Chitta Baral,et al. Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning , 2020, EMNLP.
[19] Daniel Jurafsky,et al. Distant supervision for relation extraction without labeled data , 2009, ACL.
[20] Zhou Yu,et al. Deep Modular Co-Attention Networks for Visual Question Answering , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[21] Hannaneh Hajishirzi,et al. Logic-Guided Data Augmentation and Regularization for Consistent Question Answering , 2020, ACL.
[22] Xinlei Chen,et al. Pythia v0.1: the Winning Entry to the VQA Challenge 2018 , 2018, ArXiv.
[23] Christopher Potts,et al. Recursive Neural Networks Can Learn Logical Semantics , 2014, CVSC.
[24] Wendy Grace Lehnert,et al. The Process of Question Answering , 2022 .
[25] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[26] Christopher D. Manning,et al. GQA: a new dataset for compositional question answering over real-world images , 2019, ArXiv.
[27] Ali Farhadi,et al. OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Lei Zhang,et al. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[29] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[30] Danqi Chen,et al. Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.
[31] H. Carr. Tractatus Logico-Philosophicus , 1923, Nature.
[32] John Corcoran,et al. Completeness of an ancient logic , 1972, Journal of Symbolic Logic.
[33] Chuang Gan,et al. The Neuro-Symbolic Concept Learner: Interpreting Scenes Words and Sentences from Natural Supervision , 2019, ICLR.
[34] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.
[35] W. Johnston,et al. Hegel's Science of Logic , 1931 .
[36] Mitchell P. Marcus,et al. Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.
[37] Massimo Piattelli-Palmarini,et al. Language and Learning: The Debate Between Jean Piaget and Noam Chomsky , 1980 .
[38] Yoav Artzi,et al. A Corpus for Reasoning about Natural Language Grounded in Photographs , 2018, ACL.
[39] Sameer Singh,et al. Low-Dimensional Embeddings of Logic , 2014, ACL 2014.
[40] Li Fei-Fei,et al. CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[41] Luca L. Bonatti,et al. Precursors of logical reasoning in preverbal human infants , 2018, Science.
[42] Fan Yang,et al. Differentiable Learning of Logical Rules for Knowledge Base Reasoning , 2017, NIPS.
[43] Daniel G. Bobrow,et al. Natural Language Input for a Computer Problem Solving System , 1964 .
[44] Chitta Baral,et al. Integrating Knowledge and Reasoning in Image Understanding , 2019, IJCAI.
[45] Xinlei Chen,et al. Towards VQA Models That Can Read , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[46] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[47] Roser Morante,et al. Modality and Negation: An Introduction to the Special Issue , 2012, CL.
[48] Jianfeng Gao,et al. Unified Vision-Language Pre-Training for Image Captioning and VQA , 2020, AAAI.
[49] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[50] Laurence R. Horn,et al. Negation and polarity : syntactic and semantic perspectives , 2000 .
[51] Stefan Lee,et al. ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks , 2019, NeurIPS.
[52] M. Fréchet. Généralisation du théorème des probabilités totales , 1935 .
[53] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[54] Jason Weston,et al. Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.
[55] Andrew McCallum,et al. Relation Extraction with Matrix Factorization and Universal Schemas , 2013, NAACL.
[56] Mark Steedman,et al. Combined Distributional and Logical Semantics , 2013, TACL.