QAestro - Semantic-Based Composition of Question Answering Pipelines

The demand for interfaces that allow users to interact with computers in an intuitive, effective, and efficient way is increasing. Question Answering (QA) systems address this need by answering questions posed by humans using knowledge bases. In recent years, many QA systems and related components have been developed both by practitioners and the research community. Since QA involves a vast number of (partially overlapping) subtasks, existing QA components can be combined in various ways to build tailored QA systems that perform better in terms of scalability and accuracy in specific domains and use cases. However, to the best of our knowledge, no systematic way exists to formally describe and automatically compose such components. Thus, in this work, we introduce QAestro, a framework for semantically describing both QA components and developer requirements for QA component composition. QAestro relies on a controlled vocabulary and the Local-as-View (LAV) approach to model QA tasks and components, respectively. Furthermore, the problem of QA component composition is mapped to the problem of LAV query rewriting, and state-of-the-art SAT solvers are utilized to efficiently enumerate the solutions. We have formalized 51 existing QA components implemented in 20 QA systems using QAestro. Our empirical results suggest that QAestro enumerates the combinations of QA components that effectively implement QA developer requirements.

[1]  André Freitas,et al.  An Introduction to Question Answering over Linked Data , 2014, Reasoning Web.

[2]  Sören Auer,et al.  AGDISTIS - Graph-Based Disambiguation of Named Entities Using Linked Data , 2014, International Semantic Web Conference.

[3]  Günter Neumann,et al.  The QALL-ME Framework: A specifiable-domain multilingual Question Answering architecture , 2011, J. Web Semant..

[4]  Fabien L. Gandon,et al.  QAKiS: an Open Domain QA System based on Relational Patterns , 2012, SEMWEB.

[5]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[6]  Jens Lehmann,et al.  Towards an open question answering architecture , 2014, SEM '14.

[7]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[8]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[9]  Kuldeep Singh,et al.  Qanary - A Methodology for Vocabulary-Driven Open Question Answering Systems , 2016, ESWC.

[10]  Jens Lehmann,et al.  Template-based question answering over RDF data , 2012, WWW.

[11]  Peter Thanisch,et al.  Natural language interfaces to databases – an introduction , 1995, Natural Language Engineering.

[12]  Enrico Motta,et al.  Is Question Answering fit for the Semantic Web?: A survey , 2011, Semantic Web.

[13]  José Luis Ambite,et al.  Scalable query rewriting: a graph-based approach , 2011, SIGMOD '11.

[14]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[15]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[16]  Maria-Esther Vidal,et al.  An Expressive and Efficient Solution to the Service Selection Problem , 2010, SEMWEB.

[17]  Bart Selman,et al.  Satisfiability Solvers , 2008, Handbook of Knowledge Representation.

[18]  Giuseppe De Giacomo,et al.  Automatic Service Composition via Simulation , 2008, Int. J. Found. Comput. Sci..

[19]  Jens Lehmann,et al.  AskNow: A Framework for Natural Language Query Formalization in SPARQL , 2016, ESWC.

[20]  Maria-Esther Vidal,et al.  Compilation of Query-Rewriting Problems into Tractable Fragments of Propositional Logic , 2006, AAAI.

[21]  Jens Lehmann,et al.  Survey on challenges of Question Answering in the Semantic Web , 2017, Semantic Web.

[22]  Leonard Bolc,et al.  Natural Language Communication with Computers , 1978, Lecture Notes in Computer Science.

[23]  Kuldeep Singh,et al.  Towards a Message-Driven Vocabulary for Promoting the Interoperability of Question Answering Systems , 2016, 2016 IEEE Tenth International Conference on Semantic Computing (ICSC).