Question Answering Over Knowledge Graphs: Question Understanding Via Template Decomposition

The gap between unstructured natural language and structured data makes it challenging to build a system that supports using natural language to query large knowledge graphs. Many existing methods construct a structured query for the input question based on a syntactic parser. Once the input question is parsed incorrectly, a false structured query will be generated, which may result in false or incomplete answers. The problem gets worse especially for complex questions. In this paper, we propose a novel systematic method to understand natural language questions by using a large number of binary templates rather than semantic parsers. As sufficient templates are critical in the procedure, we present a low-cost approach that can build a huge number of templates automatically. To reduce the search space, we carefully devise an index to facilitate the online template decomposition. Moreover, we design effective strategies to perform the two-level disambiguations (i.e., entity-level ambiguity and structure-level ambiguity) by considering the query semantics. Extensive experiments over several benchmarks demonstrate that our proposed approach is effective as it significantly outperforms state-of-the-art methods in terms of both precision and recall.

[1]  Xuemin Lin,et al.  Top-k Set Similarity Joins , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[2]  Dan I. Moldovan,et al.  On the Role of Information Retrieval and Information Extraction in Question Answering Systems , 2002, SCIE.

[3]  Jens Lehmann,et al.  Template-based question answering over RDF data , 2012, WWW.

[4]  Lei Zou,et al.  How to Build Templates for RDF Question/Answering: An Uncertain Graph Similarity Join Approach , 2015, SIGMOD Conference.

[5]  Ming Zhou,et al.  Question Answering over Freebase with Multi-Column Convolutional Neural Networks , 2015, ACL.

[6]  Seung-won Hwang,et al.  KBQA: Learning Question Answering over QA Corpora and Knowledge Bases , 2019, Proc. VLDB Endow..

[7]  Guoliang Li,et al.  An Efficient Partition Based Method for Exact Set Similarity Joins , 2015, Proc. VLDB Endow..

[8]  Oren Etzioni,et al.  Open question answering over curated and extracted knowledge bases , 2014, KDD.

[9]  Yanghua Xiao,et al.  KBQA: An Online Template Based Question Answering System over Freebase , 2016, IJCAI.

[10]  Gerhard Weikum,et al.  Fine-grained Semantic Typing of Emerging Entities , 2013, ACL.

[11]  Elena Cabrio,et al.  Question Answering over Linked Data (QALD-5) , 2014, CLEF.

[12]  Pablo N. Mendes,et al.  Improving efficiency and accuracy in multilingual entity extraction , 2013, I-SEMANTICS '13.

[13]  Ying Zhang,et al.  An Efficient Framework for Exact Set Similarity Search Using Tree Structure Indexes , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[14]  Gerhard Weikum,et al.  Automated Template Generation for Question Answering over Knowledge Graphs , 2017, WWW.

[15]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[16]  Dongyan Zhao,et al.  Question Answering on Freebase via Relation Extraction and Textual Evidence , 2016, ACL.

[17]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[18]  Jongik Kim,et al.  Efficient Exact Similarity Searches Using Multiple Token Orderings , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[19]  Xuchen Yao,et al.  Information Extraction over Structured Data: Question Answering with Freebase , 2014, ACL.

[20]  Michael Stonebraker,et al.  Approximate String Joins with Abbreviations , 2017, Proc. VLDB Endow..

[21]  Yong Shi,et al.  Entity Disambiguation with Textual and Connection Information , 2012, ICCS.

[22]  Dan Klein,et al.  Decentralized Entity-Level Modeling for Coreference Resolution , 2013, ACL.

[23]  Rodolfo Delmonte,et al.  Hybrid Systems for Information Extraction and Question Answering , 2006 .

[24]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[25]  Mark Steedman,et al.  Transforming Dependency Structures to Logical Forms for Semantic Parsing , 2016, TACL.

[26]  Jonathan Berant,et al.  Semantic Parsing via Paraphrasing , 2014, ACL.

[27]  Raghav Kaushik,et al.  Efficient exact set-similarity joins , 2006, VLDB.

[28]  Gerhard Weikum,et al.  Natural Language Questions for the Web of Data , 2012, EMNLP.

[29]  Eunsol Choi,et al.  Scaling Semantic Parsers with On-the-Fly Ontology Matching , 2013, EMNLP.

[30]  Hannah Bast,et al.  More Accurate Question Answering on Freebase , 2015, CIKM.

[31]  Danqi Chen,et al.  A Fast and Accurate Dependency Parser using Neural Networks , 2014, EMNLP.

[32]  Dongyan Zhao,et al.  Natural language question answering over RDF: a graph data driven approach , 2014, SIGMOD Conference.

[33]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[34]  Xuchen Yao,et al.  Lean Question Answering over Freebase from Scratch , 2015, NAACL.

[35]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[36]  Mirella Lapata,et al.  Learning to Paraphrase for Question Answering , 2017, EMNLP.

[37]  Dongyan Zhao,et al.  Answering Natural Language Questions via Phrasal Semantic Parsing , 2014, CLEF.

[38]  Christopher D. Manning,et al.  Improving Coreference Resolution by Learning Entity-Level Distributed Representations , 2016, ACL.

[39]  Ming-Wei Chang,et al.  Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base , 2015, ACL.

[40]  Guoliang Li,et al.  String similarity search and join: a survey , 2016, Frontiers of Computer Science.

[41]  Xiaoming Li,et al.  Type-Aware Question Answering over Knowledge Base with Attention-Based Tree-Structured Neural Networks , 2017, Journal of Computer Science and Technology.

[42]  Vanessa López,et al.  Core techniques of question answering systems over knowledge bases: a survey , 2017, Knowledge and Information Systems.

[43]  Abraham Bernstein,et al.  Evaluating the usability of natural language query languages and interfaces to Semantic Web knowledge bases , 2010, J. Web Semant..

[44]  Traian Rebedea,et al.  QAnswer - Enhanced Entity Matching for Question Answering over Linked Data , 2015, CLEF.

[45]  Guoliang Li,et al.  Can we beat the prefix filtering?: an adaptive framework for similarity join and search , 2012, SIGMOD Conference.

[46]  Fei Li,et al.  Constructing an Interactive Natural Language Interface for Relational Databases , 2014, Proc. VLDB Endow..

[47]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[48]  Anne-Laure Ligozat,et al.  SemGraphQA@QALD5: LIMSI participation at QALD5@CLEF , 2015, CLEF.

[49]  Dongyan Zhao,et al.  Question Answering via Phrasal Semantic Parsing , 2015, CLEF.

[50]  Jason Weston,et al.  Question Answering with Subgraph Embeddings , 2014, EMNLP.

[51]  Christian Bizer,et al.  DBpedia: A Multilingual Cross-domain Knowledge Base , 2012, LREC.

[52]  Silviu Cucerzan,et al.  Large-Scale Named Entity Disambiguation Based on Wikipedia Data , 2007, EMNLP.

[53]  Daniel Jurafsky,et al.  Same Referent, Different Words: Unsupervised Mining of Opaque Coreferent Mentions , 2013, NAACL.