A German Natural Language Interface for Semantic Search

Semantic data is the key for an efficient information retrieval. It relies on a well-defined structure and enables automated processing. Therefore, more and more ontologies are specified, extended and interlinked. By now, only the query language SPARQL provides a precise access to semantic data. Since most common users are overstrained in formulating queries, which satisfy the structure of semantic data, more search-interface approaches emerge aiming at good usability and correct answers. We implemented a Natural Language Interface (NLI), that answers questions formulated in German natural language. In order to query the domain ontology, the user query is translated into SPARQL first. Since domain-ontology resources are required for the SPARQL-query formulation, this paper introduces an approach for the identification of resources in user query. We show a path-based identification of semantically similar resources and a similarity measure. After running 100 test questions, our system achieves a precision and recall of 66%.

[1]  Hamish Cunningham,et al.  Natural Language Interfaces to Ontologies: Combining Syntactic Analysis and Ontology-Based Lookup through the User Interaction , 2010, ESWC.

[2]  Jens Lehmann,et al.  DBpedia - A crystallization point for the Web of Data , 2009, J. Web Semant..

[3]  Philipp Cimiano,et al.  Porting natural language interfaces between domains: an experimental user study with the ORAKEL system , 2007, IUI '07.

[4]  Philipp Cimiano,et al.  Linking Lexical Resources and Ontologies on the Semantic Web with Lemon , 2011, ESWC.

[5]  Chong Wang,et al.  PANTO: A Portable Natural Language Interface to Ontologies , 2007, ESWC.

[6]  Esther Kaufmann Talking to the Semantic Web - Query Interfaces to Ontologies for the Casual User , 2006, International Semantic Web Conference.

[7]  Joakim Nivre,et al.  MaltParser: A Language-Independent System for Data-Driven Dependency Parsing , 2007, Natural Language Engineering.

[8]  Frank van Harmelen,et al.  A Semantic Web Primer, 2nd Edition (Cooperative Information Systems) , 2008 .

[9]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[10]  Christopher D. Manning,et al.  Parsing Three German Treebanks: Lexicalized and Unlexicalized Baselines , 2008 .

[11]  Abraham Bernstein,et al.  How Useful Are Natural Language Interfaces to the Semantic Web for Casual End-Users? , 2007, ISWC/ASWC.

[12]  Helmut Schmid,et al.  Improvements in Part-of-Speech Tagging with an Application to German , 1999 .

[13]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[14]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[15]  Thomas Ertl,et al.  Facet Graphs: Complex Semantic Querying Made Easy , 2010, ESWC.

[16]  Giovanni Tummarello,et al.  A Node Indexing Scheme for Web Entity Retrieval , 2010, ESWC.

[17]  Abraham Bernstein,et al.  Querix: A Natural Language Interface to Query Ontologies Based on Clarification Dialogs , 2006 .

[18]  Sanda M. Harabagiu,et al.  The Structure and Performance of an Open-Domain Question Answering System , 2000, ACL.

[19]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[20]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[21]  Sandra Kübler,et al.  Why is German Dependency Parsing More Reliable than Constituent Parsing , 2006 .