HCqa: Hybrid and Complex Question Answering on Textual Corpus and Knowledge Graph

Question Answering (QA) systems provide easy access to the vast amount of knowledge without having to know the underlying complex structure of the knowledge. The research community has provided ad hoc solutions to the key QA tasks, including named entity recognition and disambiguation, relation extraction and query building. Furthermore, some have integrated and composed these components to implement many tasks automatically and efficiently. However, in general, the existing solutions are limited to simple and short questions and still do not address complex questions composed of several sub-questions. Exploiting the answer to complex questions is further challenged if it requires integrating knowledge from unstructured data sources, i.e., textual corpus, as well as structured data sources, i.e., knowledge graphs. In this paper, an approach (HCqa) is introduced for dealing with complex questions requiring federating knowledge from a hybrid of heterogeneous data sources (structured and unstructured). We contribute in developing (i) a decomposition mechanism which extracts sub-questions from potentially long and complex input questions, (ii) a novel comprehensive schema, first of its kind, for extracting and annotating relations, and (iii) an approach for executing and aggregating the answers of sub-questions. The evaluation of HCqa showed a superior accuracy in the fundamental tasks, such as relation extraction, as well as the federation task.

[1]  赵 徳鑫 英语语法手册 = A handbook of English grammar , 1978 .

[2]  Jens Lehmann,et al.  Why Reinvent the Wheel: Let's Build Question Answering Systems Together , 2018, WWW.

[3]  Sanda M. Harabagiu,et al.  Experiments with Interactive Question Answering in Complex Scenarios , 2004, Workshop On Pragmatics Of Question Answering.

[4]  Marie-Francine Moens,et al.  A survey on question answering technology from an information retrieval perspective , 2011, Inf. Sci..

[5]  Seonyeong Park,et al.  ISOFT at QALD-5: Hybrid Question Answering System over Linked Data and Text Data , 2015, CLEF.

[6]  Eric Mayer Problem Solving With Algorithms And Data Structures Using Python , 2016 .

[7]  Kuldeep Singh,et al.  Frankenstein: A Platform Enabling Reuse of Question Answering Components , 2018, ESWC.

[8]  Stefan Decker,et al.  Framework for the Semantic Web: An RDF Tutorial , 2000, IEEE Internet Comput..

[9]  Richard A. Frost,et al.  An Event-Driven Approach for Querying Graph-Structured Data Using Natural Language , 2014, EDBT/ICDT Workshops.

[10]  Rafael Muñoz,et al.  Splitting Complex Temporal Questions for Question Answering Systems , 2004, ACL.

[11]  Enrico Motta,et al.  Cross ontology query answering on the semantic web: an initial evaluation , 2009, K-CAP '09.

[12]  Hannah Bast,et al.  Broccoli: Semantic Full-Text Search at your Fingertips , 2012, ArXiv.

[13]  Ebrahim Bagheri,et al.  Self-training on refined clause patterns for relation extraction , 2017, Inf. Process. Manag..

[14]  Oren Etzioni,et al.  Open Language Learning for Information Extraction , 2012, EMNLP.

[15]  Sören Auer,et al.  SINA: Semantic interpretation of user queries for question answering on interlinked data , 2015, J. Web Semant..

[16]  Enrico Motta,et al.  Merging and Ranking Answers in the Semantic Web: The Wisdom of Crowds , 2009, ASWC.

[17]  Razvan C. Bunescu,et al.  A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[18]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[19]  Elena Cabrio,et al.  Question Answering over Linked Data (QALD-5) , 2014, CLEF.

[20]  Pablo N. Mendes,et al.  Improving efficiency and accuracy in multilingual entity extraction , 2013, I-SEMANTICS '13.

[21]  Anne-Laure Ligozat,et al.  A Corpus for Hybrid Question Answering Systems , 2018, WWW.

[22]  N. Shepherd Cambridge Grammar of English , 2007 .

[23]  R. Carter,et al.  Cambridge Grammar of English , 2006 .

[24]  Oren Etzioni,et al.  Identifying Relations for Open Information Extraction , 2011, EMNLP.

[25]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[26]  Elena Cabrio,et al.  6th Open Challenge on Question Answering over Linked Data (QALD-6) , 2016, SemWebEval@ESWC.

[27]  Axel-Cyrille Ngonga Ngomo,et al.  HAWK - Hybrid Question Answering Using Linked Data , 2015, ESWC.

[28]  Lei Zou,et al.  A State-transition Framework to Answer Complex Questions over Knowledge Base , 2018, EMNLP.

[29]  O. Jespersen Essentials of English Grammar : 25th impression, 1987 , 2003 .

[30]  Paolo Ferragina,et al.  TAGME: on-the-fly annotation of short text fragments (by wikipedia entities) , 2010, CIKM.

[31]  Luciano Del Corro,et al.  ClausIE: clause-based open information extraction , 2013, WWW.

[32]  Fabian M. Suchanek,et al.  ESTER: efficient search on text, entities, and relations , 2007, SIGIR.

[33]  Oren Etzioni,et al.  TextRunner: Open Information Extraction on the Web , 2007, NAACL.

[34]  Sung-Hyon Myaeng,et al.  Compositional question answering: A divide and conquer approach , 2011, Inf. Process. Manag..

[35]  Dragomir R. Radev,et al.  Nested Propositions in Open Information Extraction , 2016, EMNLP.

[36]  Sören Auer,et al.  Question answering on interlinked data , 2013, WWW.

[37]  Daniel S. Weld,et al.  Open Information Extraction Using Wikipedia , 2010, ACL.

[38]  Christopher D. Manning,et al.  Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks , 2016, LREC.

[39]  Dongyan Zhao,et al.  Hybrid Question Answering over Knowledge Base and Free Text , 2016, COLING.

[40]  Sanda M. Harabagiu,et al.  Employing Two Question Answering Systems in TREC 2005 , 2005, TREC.

[41]  Otto Jespersen,et al.  Essentials of English Grammar , 1933 .

[42]  Gerhard Weikum,et al.  Automated Template Generation for Question Answering over Knowledge Graphs , 2017, WWW.

[43]  Oren Etzioni,et al.  Paraphrase-Driven Learning for Open Question Answering , 2013, ACL.