Framing QA as Building and Ranking Intersentence Answer Justifications

We propose a question answering (QA) approach for standardized science exams that both identifies correct answers and produces compelling human-readable justifications for why those answers are correct. Our method first identifies the information need of a question using psycholinguistic concreteness norms, then uses this information need to construct answer justifications by aggregating multiple sentences from different knowledge bases using syntactic and lexical information. We then jointly rank answers and their justifications using a reranking perceptron that treats justification quality as a latent variable. We evaluate our method on 1,000 multiple-choice questions from elementary school science exams, and empirically demonstrate that it performs better than several strong baselines, including neural network approaches. Our best configuration answers 44% of the questions correctly, and for 57% of these correct answers the top justifications include a compelling human-readable explanation of the inference required to arrive at the correct answer. We include a detailed characterization of the justification quality for both our method and a strong baseline, and show that information aggregation is key to addressing the information need in complex questions.
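To make the joint ranking step concrete, the following is a minimal sketch of a latent-variable reranking perceptron, not the authors' implementation. It assumes each candidate answer comes with several candidate justifications, each already encoded as a feature vector. An answer is scored by its best justification, and on a mistake the weights move toward the gold answer's best justification and away from the predicted one. All function names, the feature encoding, and the update rule are illustrative assumptions.

```python
# Hypothetical sketch: latent-variable reranking perceptron.
# The choice of justification for each answer is the latent variable.

def dot(w, f):
    """Inner product of a weight vector and a feature vector."""
    return sum(wi * fi for wi, fi in zip(w, f))

def best_justification(w, justifications):
    """Return (score, index) of the highest-scoring justification."""
    scores = [dot(w, f) for f in justifications]
    idx = max(range(len(scores)), key=scores.__getitem__)
    return scores[idx], idx

def predict(w, candidates):
    """candidates[a] is the list of justification feature vectors for answer a.
    Returns (answer_index, justification_index)."""
    best = [best_justification(w, js) for js in candidates]
    a = max(range(len(best)), key=lambda i: best[i][0])
    return a, best[a][1]

def train(questions, n_features, epochs=10):
    """questions: list of (candidates, gold_answer_index) pairs."""
    w = [0.0] * n_features
    for _ in range(epochs):
        for candidates, gold in questions:
            pred, pred_j = predict(w, candidates)
            if pred != gold:
                # Pick the gold answer's best justification under the
                # current weights, then apply a perceptron update.
                _, gold_j = best_justification(w, candidates[gold])
                f_gold = candidates[gold][gold_j]
                f_pred = candidates[pred][pred_j]
                w = [wi + g - p for wi, g, p in zip(w, f_gold, f_pred)]
    return w

# Toy data: two questions, two answers each, two features per justification.
toy_questions = [
    ([[[1.0, 0.0], [0.0, 0.0]], [[0.0, 1.0]]], 1),
    ([[[0.0, 1.0], [0.5, 0.5]], [[1.0, 0.0]]], 0),
]
weights = train(toy_questions, n_features=2)
```

Because the justification is chosen by the same weights that rank the answers, no justification-level labels are needed for training; only the correct answer index is supervised.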
