Knowledge Graph Question Answering using Graph-Pattern Isomorphism

Knowledge Graph Question Answering (KGQA) systems are often based on machine learning algorithms, requiring thousands of question-answer pairs as training examples or natural language processing pipelines that need module fine-tuning. In this paper, we present a novel QA approach, dubbed TeBaQA. Our approach learns to answer questions based on graph isomorphisms from basic graph patterns of SPARQL queries. Learning basic graph patterns is efficient due to the small number of possible patterns. This novel paradigm reduces the amount of training data necessary to achieve state-of-the-art performance. TeBaQA also speeds up the domain adaption process by transforming the QA system development task into a much smaller and easier data compilation task. In our evaluation, TeBaQA achieves state-of-the-art performance on QALD-8 and delivers comparable results on QALD-9 and LC-QuAD v1. Additionally, we performed a fine-grained evaluation on complex queries that deal with aggregation and superlative questions as well as an ablation study, highlighting future research challenges.

[1]  Muhammad Saleem,et al.  9th Challenge on Question Answering over Linked Data (QALD-9) (invited paper) , 2018, Semdeep/NLIWoD@ISWC.

[2]  Muhammad Saleem,et al.  Question Answering Over Linked Data: What is Difficult to Answer? What Affects the F scores? , 2017, BLINK/NLIWoD3@ISWC.

[3]  Jens Lehmann,et al.  Survey on challenges of Question Answering in the Semantic Web , 2017, Semantic Web.

[4]  Alexander G. Gray,et al.  Question Answering over Knowledge Bases by Leveraging Semantic Parsing and Neuro-Symbolic Reasoning , 2020, ArXiv.

[5]  Dongyan Zhao,et al.  Natural language question answering over RDF: a graph data driven approach , 2014, SIGMOD Conference.

[6]  Seung-won Hwang,et al.  KBQA: Learning Question Answering over QA Corpora and Knowledge Bases , 2019, Proc. VLDB Endow..

[7]  Gerhard Weikum,et al.  Never-Ending Learning for Open-Domain Question Answering over Knowledge Bases , 2018, WWW.

[8]  Diego Esteves,et al.  Neural Machine Translation for Query Construction and Composition , 2018, ArXiv.

[9]  M. de Rijke,et al.  Message Passing for Complex Question Answering over Knowledge Graphs , 2019, CIKM.

[10]  Philipp Cimiano,et al.  A Guided Template-Based Question Answering System over Knowledge Graphs , 2018, EKAW.

[11]  Gorjan Alagic,et al.  #p , 2019, Quantum information & computation.

[12]  Ricardo Usbeck,et al.  Template-based Question Answering using Recursive Neural Networks , 2020, 2021 IEEE 15th International Conference on Semantic Computing (ICSC).

[13]  Vanessa López,et al.  Core techniques of question answering systems over knowledge bases: a survey , 2017, Knowledge and Information Systems.

[14]  Pierre Maret,et al.  Towards a Question Answering System over the Semantic Web , 2018, Semantic Web.

[15]  Jens Lehmann,et al.  LC-QuAD: A Corpus for Complex Question Answering over Knowledge Graphs , 2017, SEMWEB.

[16]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[17]  Yuzhong Qu,et al.  Leveraging Frequent Query Substructures to Generate Formal Queries for Complex Question Answering , 2019, EMNLP.

[18]  Marcelo Arenas,et al.  Semantics and complexity of SPARQL , 2006, TODS.

[19]  Axel-Cyrille Ngonga Ngomo,et al.  Entity Linking in 40 Languages using MAG , 2018, ESWC.

[20]  George Hripcsak,et al.  Technical Brief: Agreement, the F-Measure, and Reliability in Information Retrieval , 2005, J. Am. Medical Informatics Assoc..

[21]  Pablo N. Mendes,et al.  Improving efficiency and accuracy in multilingual entity extraction , 2013, I-SEMANTICS '13.

[22]  Kuldeep Singh,et al.  Frankenstein: A Platform Enabling Reuse of Question Answering Components , 2018, ESWC.

[23]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[24]  Felix Conrads,et al.  Benchmarking question answering systems , 2019, Semantic Web.

[25]  Sebastian Rudolph,et al.  Neural Machine Translating from Natural Language to SPARQL , 2019, Future Gener. Comput. Syst..

[26]  Carlo Zaniolo,et al.  Machine Learning of SPARQL Templates for Question Answering Over LinkedSpending , 2019, 2019 IEEE 28th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE).

[27]  Gerhard Weikum,et al.  Automated Template Generation for Question Answering over Knowledge Graphs , 2017, WWW.

[28]  Caio Viktor S. Avila,et al.  CONQUEST: A Framework for Building Template-Based IQA Chatbots for Enterprise Knowledge Graphs , 2020, NLDB.

[29]  Brian M. Sadler,et al.  Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases , 2020, WWW.

[30]  Weiguo Zheng,et al.  Question Answering over Knowledge Graphs via Structural Query Patterns , 2019, ArXiv.

[31]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[32]  Xiang Cheng,et al.  Learning to Order Sub-questions for Complex Question Answering , 2019, ArXiv.

[33]  Felix Conrads,et al.  8th Challenge on Question Answering over Linked Data (QALD-8) (invited paper) , 2018, Semdeep/NLIWoD@ISWC.

[34]  Hao Liu,et al.  Pattern-revising Enhanced Simple Question Answering over Knowledge Bases , 2018, COLING.

[35]  Axel-Cyrille Ngonga Ngomo,et al.  7th Open Challenge on Question Answering over Linked Data (QALD-7) , 2017, SemWebEval@ESWC.

[36]  Jonathan Berant,et al.  Building a Semantic Parser Overnight , 2015, ACL.

[37]  Marina Gavrilova Computational science and its applications - ICCSA 2006 : International Conference, Glasgow, UK, May 8-11, 2006 : proceedings , 2006 .

[38]  Jens Lehmann,et al.  Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs , 2019, ArXiv.

[39]  Jens Lehmann,et al.  Template-based question answering over RDF data , 2012, WWW.