Interactive natural language question answering over knowledge graphs

Abstract As many real-world data are constructed into knowledge graphs, providing effective and convenient query techniques for end users is an urgent and important task. Although structured query languages, such as SPARQL, offer a powerful expression ability to query RDF datasets, they are difficult to use. Keywords are simple but have a very limited expression ability. Natural language question (NLQ) is promising for querying knowledge graphs. A huge challenge is how to understand the question clearly so as to translate the unstructured question into a structured query. In this paper, we present a data + oracle approach to answer NLQs over knowledge graphs. We let users verify the ambiguities during the query understanding. To reduce the interaction cost, we formalize an interaction problem and design an efficient strategy to solve the problem. We also propose a query prefetching technique by exploiting the latency in the interactions with users. Moreover, we devise a hybrid approach that incorporates NLP-based, data-driven, and interaction techniques together to complete the question understanding. Extensive experiments over real datasets demonstrate that our proposed approach is effective as it outperforms state-of-the-art methods significantly.

[1]  Fei Li,et al.  Constructing an Interactive Natural Language Interface for Relational Databases , 2014, Proc. VLDB Endow..

[2]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[3]  Dongyan Zhao,et al.  Natural language question answering over RDF: a graph data driven approach , 2014, SIGMOD Conference.

[4]  Shuigeng Zhou,et al.  QUBLE: towards blending interactive visual subgraph search queries on large networks , 2014, The VLDB Journal.

[5]  Ian Horrocks,et al.  OptiqueVQS: Ontology-Based Visual Querying , 2015, VOILA@ISWC.

[6]  Alex Endert,et al.  VISAGE: Interactive Visual Graph Querying , 2016, AVI.

[7]  Wayne Goddard,et al.  Independent domination in graphs: A survey and recent results , 2013, Discret. Math..

[8]  Lei Zou,et al.  How to Build Templates for RDF Question/Answering: An Uncertain Graph Similarity Join Approach , 2015, SIGMOD Conference.

[9]  Ambuj K. Singh,et al.  Graphs-at-a-time: query language and access methods for graph databases , 2008, SIGMOD Conference.

[10]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[11]  Miloslav Konopík,et al.  SWSNL: Semantic Web Search Using Natural Language , 2013, Expert Syst. Appl..

[12]  Mudhakar Srivatsa,et al.  Exploiting Relevance Feedback in Knowledge Graph Search , 2015, KDD.

[13]  Ming-Wei Chang,et al.  Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base , 2015, ACL.

[14]  Gerhard Weikum,et al.  Automated Template Generation for Question Answering over Knowledge Graphs , 2017, WWW.

[15]  Chengkai Li,et al.  VIIQ: Auto-Suggestion Enabled Visual Interface for Interactive Graph Query Formulation , 2015, Proc. VLDB Endow..

[16]  Dekang Lin,et al.  Phrase Clustering for Discriminative Learning , 2009, ACL.

[17]  Shuigeng Zhou,et al.  PRAGUE: Towards Blending Practical Visual Subgraph Query Formulation and Query Processing , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[18]  Brian M. Sadler,et al.  On Generating Characteristic-rich Question Sets for QA Evaluation , 2016, EMNLP.

[19]  Guoliang Li,et al.  String similarity search and join: a survey , 2016, Frontiers of Computer Science.

[20]  Abraham Bernstein,et al.  Evaluating the usability of natural language query languages and interfaces to Semantic Web knowledge bases , 2010, J. Web Semant..

[21]  Wen-Syan Li,et al.  String Similarity Joins: An Experimental Evaluation , 2014, Proc. VLDB Endow..

[22]  Yinghui Wu,et al.  Schemaless and Structureless Graph Querying , 2014, Proc. VLDB Endow..

[23]  Christian Bizer,et al.  DBpedia: A Multilingual Cross-domain Knowledge Base , 2012, LREC.

[24]  Jonathan Berant,et al.  Semantic Parsing via Paraphrasing , 2014, ACL.

[25]  Dongyan Zhao,et al.  Question Answering on Freebase via Relation Extraction and Textual Evidence , 2016, ACL.

[26]  Gerhard Weikum,et al.  Natural Language Questions for the Web of Data , 2012, EMNLP.

[27]  Dragomir R. Radev,et al.  Mining the web for answers to natural language questions , 2001, CIKM '01.

[28]  Seung-won Hwang,et al.  KBQA: Learning Question Answering over QA Corpora and Knowledge Bases , 2019, Proc. VLDB Endow..

[29]  Oren Etzioni,et al.  Open question answering over curated and extracted knowledge bases , 2014, KDD.

[30]  Sourav S. Bhowmick,et al.  GBLENDER: towards blending visual query formulation and query processing in graph databases , 2010, SIGMOD Conference.

[31]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[32]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[33]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[34]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[35]  Ulf Leser,et al.  A query language for biological networks , 2005, ECCB/JBI.

[36]  Ramez Elmasri,et al.  GQBE: Querying knowledge graphs by example entity tuples , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[37]  Elena Cabrio,et al.  Question Answering over Linked Data (QALD-5) , 2014, CLEF.

[38]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[39]  Gerhard Weikum,et al.  Robust question answering over the web of linked data , 2013, CIKM.

[40]  Themis Palpanas,et al.  Exemplar Queries: Give me an Example of What You Need , 2014, Proc. VLDB Endow..