Automatic Construction of Benchmarks for RDF Keyword Search Systems Evaluation

Keyword search systems offer users a friendly alternative for accessing Resource Description Framework (RDF) datasets. Evaluating such systems requires adequate benchmarks, consisting of RDF datasets and keyword queries together with their correct answers. However, the sets of correct answers that such benchmarks provide for each query are often incomplete, mostly because they are built manually with experts’ help. The central contribution of this paper is an offline method that helps build RDF keyword search benchmarks automatically, leading to more complete sets of correct answers, called solution generators. The paper focuses on computing sets of generators and describes heuristics that circumvent the combinatorial nature of the problem. It then describes five benchmarks constructed with the proposed method, based on three real datasets (DBpedia, IMDb, and Mondial) and two synthetic datasets (LUBM and BSBM). Finally, the paper compares the constructed benchmarks with keyword search benchmarks published in the literature.
