Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering

Many Question-Answering (QA) datasets contain unanswerable questions, but their treatment in QA systems remains primitive. Our analysis of the Natural Questions (Kwiatkowski et al., 2019) dataset reveals that a substantial portion of unanswerable questions (∼21%) can be explained based on the presence of unverifiable presuppositions. We discuss the shortcomings of current models in handling such questions, and describe how an improved system could handle them. Through a user preference study, we demonstrate that the oracle behavior of our proposed system that provides responses based on presupposition failure is preferred over the oracle behavior of existing QA systems. Then we discuss how our proposed system could be implemented, presenting a novel framework that breaks down the problem into three steps: presupposition generation, presupposition verification and explanation generation. We report our progress in tackling each subproblem, and present a preliminary approach to integrating these steps into an existing QA system. We find that adding presuppositions and their verifiability to an existing model yields modest gains in downstream performance and unanswerability detection. The biggest bottleneck is the verification component, which needs to be substantially improved for the integrated system to approach ideal behavior—even transfer from the best entailment models currently falls short.

[1]  Li Yang,et al.  ETC: Encoding Long and Structured Inputs in Transformers , 2020, EMNLP.

[2]  Jackie Chi Kit Cheung,et al.  Let’s do it “again”: A First Computational Approach to Detecting Adverbial Presupposition Triggers , 2018, ACL.

[3]  C. R.,et al.  On referring , 1950 .

[4]  Benjamin Van Durme,et al.  Reporting bias and knowledge acquisition , 2013, AKBC '13.

[5]  Christopher D. Manning,et al.  Presupposed Content and Entailments in Natural Language Inference , 2009, TextInfer@ACL.

[6]  Adina Williams,et al.  Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition , 2020, ACL.

[7]  Samuel R. Bowman,et al.  A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , 2017, NAACL.

[8]  Jason Weston,et al.  ELI5: Long Form Question Answering , 2019, ACL.

[9]  Haejun Lee,et al.  NeurQuRI: Neural Question Requirement Inspector for Answerability Prediction in Machine Reading Comprehension , 2020, ICLR.

[10]  Lauri Karttunen,et al.  Presupposition: What went wrong? , 2016 .

[11]  Mark Yatskar,et al.  A Qualitative Comparison of CoQA, SQuAD 2.0 and QuAC , 2018, NAACL.

[12]  Julian Michael,et al.  AmbigQA: Answering Ambiguous Open-domain Questions , 2020, EMNLP.

[13]  木村 和夫 Pragmatics , 1997, Language Teaching.

[14]  Anette Frank,et al.  Extending Fine-Grained Semantic Relation Classification to Presupposition Relations between Verbs , 2011 .

[15]  Omer Levy,et al.  GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding , 2018, BlackboxNLP@EMNLP.

[16]  Eunsol Choi,et al.  QED: A Framework and Dataset for Explanations in Question Answering , 2020, Transactions of the Association for Computational Linguistics.

[17]  Furu Wei,et al.  Learning to Ask Unanswerable Questions for Machine Reading Comprehension , 2019, ACL.

[18]  Christopher Clark,et al.  Simple and Effective Multi-Paragraph Reading Comprehension , 2017, ACL.

[19]  Ming-Wei Chang,et al.  Natural Questions: A Benchmark for Question Answering Research , 2019, TACL.

[20]  Philippe Schlenker,et al.  Presupposition projection: Explanatory strategies , 2008 .

[21]  M. Lyons Presupposition , 2021, Encyclopedia of Autism Spectrum Disorders.

[22]  Márta Abrusán,et al.  Presuppositional and negative islands: a semantic account , 2011 .

[23]  Furu Wei,et al.  Read + Verify: Machine Reading Comprehension with Unanswerable Questions , 2018, AAAI.

[24]  Eunsol Choi,et al.  Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval , 2020, ACL.

[25]  Kevin Gimpel,et al.  ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.

[26]  Florian Schwarz,et al.  Experimental Work in Presupposition and Presupposition Projection , 2016 .

[27]  Andreas Vlachos,et al.  FEVER: a Large-scale Dataset for Fact Extraction and VERification , 2018, NAACL.

[28]  Hai Zhao,et al.  Retrospective Reader for Machine Reading Comprehension , 2020, AAAI.

[29]  Jonathan Berant,et al.  Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies , 2021, Transactions of the Association for Computational Linguistics.

[30]  Danqi Chen,et al.  Dense Passage Retrieval for Open-Domain Question Answering , 2020, EMNLP.

[31]  D. Beaver,et al.  Weak Uniqueness: The Only Difference Between Definites and Indefinites , 2012 .

[32]  Rob A. van der Sandt,et al.  Presupposition Projection as Anaphora Resolution , 1992, J. Semant..

[33]  Nadine Theiler,et al.  An Epistemic Bridge for Presupposition Projection in Questions , 2021 .

[34]  Percy Liang,et al.  Know What You Don’t Know: Unanswerable Questions for SQuAD , 2018, ACL.

[35]  Dan Klein,et al.  Learning Accurate, Compact, and Interpretable Tree Annotation , 2006, ACL.

[36]  Bart Geurts Presupposing , 1995 .

[37]  Alexandra Simonenko,et al.  Decomposing universal projection in questions , 2018 .