Large-scale Simple Question Answering with Memory Networks

Training large-scale question answering systems is complicated because training sources usually cover a small portion of the range of possible questions. This paper studies the impact of multitask and transfer learning for simple question answering; a setting for which the reasoning required to answer is quite easy, as long as one can retrieve the correct evidence given a question, which can be difficult in large-scale conditions. To this end, we introduce a new dataset of 100k questions that we use in conjunction with existing benchmarks. We conduct our study within the framework of Memory Networks (Weston et al., 2015) because this perspective allows us to eventually scale up to more complex reasoning, and show that Memory Networks can be successfully trained to achieve excellent performance.

[1]  Ellen M. Voorhees,et al.  Overview of the TREC-9 Question Answering Track , 2000, TREC.

[2]  Jimmy J. Lin,et al.  Web question answering: is more always better? , 2002, SIGIR '02.

[3]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[4]  Jason Weston,et al.  Large scale image annotation: learning to rank with joint word-image embeddings , 2010, Machine Learning.

[5]  Stephen J. Wright,et al.  Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent , 2011, NIPS.

[6]  Oren Etzioni,et al.  Identifying Relations for Open Information Extraction , 2011, EMNLP.

[7]  Gerhard Weikum,et al.  Natural Language Questions for the Web of Data , 2012, EMNLP.

[8]  Oren Etzioni,et al.  Entity Linking at Web Scale , 2012, AKBC-WEKEX@NAACL-HLT.

[9]  Jens Lehmann,et al.  Template-based question answering over RDF data , 2012, WWW.

[10]  Alexander Yates,et al.  Large-scale Semantic Parsing via Schema Matching and Lexicon Extension , 2013, ACL.

[11]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[12]  Eunsol Choi,et al.  Scaling Semantic Parsers with On-the-Fly Ontology Matching , 2013, EMNLP.

[13]  Oren Etzioni,et al.  Paraphrase-Driven Learning for Open Question Answering , 2013, ACL.

[14]  Mark Steedman,et al.  Large-scale Semantic Parsing without Question-Answer Pairs , 2014, TACL.

[15]  Hae-Chang Rim,et al.  Joint Relational Embeddings for Knowledge-based Question Answering , 2014, EMNLP.

[16]  Xuchen Yao,et al.  Information Extraction over Structured Data: Question Answering with Freebase , 2014, ACL.

[17]  Jason Weston,et al.  Question Answering with Subgraph Embeddings , 2014, EMNLP.

[18]  Jonathan Berant,et al.  Semantic Parsing via Paraphrasing , 2014, ACL.

[19]  Jason Weston,et al.  Open Question Answering with Weakly Supervised Embedding Models , 2014, ECML/PKDD.

[20]  Oren Etzioni,et al.  Open question answering over curated and extracted knowledge bases , 2014, KDD.

[21]  Jason Weston,et al.  End-To-End Memory Networks , 2015, NIPS.

[22]  Jason Weston,et al.  Memory Networks , 2014, ICLR.

[23]  Jason Weston,et al.  Weakly Supervised Memory Networks , 2015, ArXiv.