Ontology-Based Translation of Natural Language Queries to SPARQL

We present an implemented approach to transform natural language sentences into SPARQL, using background knowledge from ontologies and lexicons. Therefore, eligible technologies and data storage possibilities are analyzed and evaluated. The contributions of this paper are twofold. Firstly, we describe the motivation and current needs for a natural language access to industry data. We describe several scenarios where the proposed solution is required. Resulting in an architectural approach based on automatic SPARQL query construction for effective natural language queries. Secondly, we analyze the performance of RDBMS, RDF and Triple Stores for the knowledge representation. The proposed approach will be evaluated on the basis of a query catalog by means of query efficiency, accuracy, and data storage performance. The results show, that natural language access to industry data using ontologies and lexicons, is a simple but effective approach to improve the diagnosis process and the data search for a broad range of users. Furthermore, virtual RDF graphs do support the DB-driven knowledge graph representation process, but do not perform efficient under industry conditions in terms of performance and scalability.

[1]  Sebastian Rudolph,et al.  Ontology-Based Interpretation of Keywords for Semantic Search , 2007, ISWC/ASWC.

[2]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[3]  Judith Hurwitz,et al.  Big Data For Dummies , 2013 .

[4]  Jens Lehmann,et al.  Template-based question answering over RDF data , 2012, WWW.

[5]  Michael Schmidt,et al.  Foundations of SPARQL query optimization , 2008, ICDT '10.

[6]  Hamish Cunningham,et al.  FREyA: An Interactive Way of Querying Linked Data Using Natural Language , 2011, ESWC Workshops.

[7]  Rajendra Akerkar,et al.  Big Data Computing , 2013 .

[8]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[9]  Archana P Kumar,et al.  A Comprehensive Comparative study of SPARQL and SQL , 2011 .

[10]  Jens Lehmann,et al.  AutoSPARQL: Let Users Query Your Knowledge Base , 2011, ESWC.

[11]  Dominique Estival,et al.  Towards Ontology-based Natural Language Processing , 2004, NLPXML@ACL.

[12]  Abraham Bernstein,et al.  OptARQ: A SPARQL Optimization Approach based on Triple Pattern Selectivity Estimation , 2007 .

[13]  Bi Liu,et al.  A Normalized Levenshtein Distance Metric , 2007, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Kalina Bontcheva,et al.  A Text-based Query Interface to OWL Ontologies , 2008, LREC.

[15]  Philippe Smets,et al.  Belief functions: The disjunctive rule of combination and the generalized Bayesian theorem , 1993, Int. J. Approx. Reason..

[16]  Sean Bechhofer,et al.  SKOS Simple Knowledge Organization System Reference , 2009 .

[17]  Chong Wang,et al.  SPARK: Adapting Keyword Query to Semantic Search , 2007, ISWC/ASWC.

[18]  Diego Calvanese Scalable End-User Access to Big Data , 2014 .

[19]  Sean Sullivan,et al.  USI Answers: Natural Language Question Answering Over (Semi-) Structured Industry Data , 2013, IAAI.