Skyline Queries over Knowledge Graphs

With the continuously growing amount of data offered in the form of knowledge graphs, users are often overwhelmed by the amount of potentially relevant information and entities. Hence, helping users find relevant data is a problem that becomes more and more important. Skyline queries are typically used in multi-criteria decision making applications to find a set of objects that are of interest to a user. This type of queries has been extensively studied over relational data in the database community. But only little attention has yet been paid to investigating if and how the skyline principle can help identifying sets of interesting entities in knowledge graphs. In this paper, we therefore show how the skyline principle can be applied to RDF knowledge graphs and help the user find interesting entities. In particular, we present algorithms using commonly used standard interfaces for accessing RDF data and a lightweight extension of existing interfaces (SkyTPF) to process skyline queries. Our experiments show that the proposed algorithms enable efficient and scalable skyline query processing over knowledge graphs.

[1]  Katja Hose,et al.  A survey of skyline processing in highly distributed environments , 2011, The VLDB Journal.

[2]  Jiawei Han,et al.  The Multi-Relational Skyline Operator , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[3]  Christos Doulkeridis,et al.  Skyline query processing over joins , 2011, SIGMOD '11.

[4]  Lei Zou,et al.  Online Subgraph Skyline Analysis over Knowledge Graphs , 2016, IEEE Transactions on Knowledge and Data Engineering.

[5]  Miguel A. Martínez-Prieto,et al.  Exchange and Consumption of Huge RDF Data , 2012, ESWC.

[6]  K. Selçuk Candan,et al.  Skyline-sensitive joins with LR-pruning , 2012, EDBT '12.

[7]  Wolf-Tilo Balke,et al.  Efficient Distributed Skylining for Web Information Systems , 2004, EDBT.

[8]  Ruben Verborgh,et al.  Triple Pattern Fragments: A low-cost knowledge graph interface for the Web , 2016, J. Web Semant..

[9]  Katja Hose,et al.  A Relaxed But Not Necessarily Constrained Way from the Top to the Sky , 2007, OTM Conferences.

[10]  Antonis Troumpoukis,et al.  An extension of SPARQL for expressing qualitative preferences , 2017, SEMWEB.

[11]  Thomas Lukasiewicz,et al.  Preference-Based Query Answering in Datalog+/- Ontologies , 2013, Description Logics.

[12]  Rik Van de Walle,et al.  Querying Datasets on the Web with High Availability , 2014, SEMWEB.

[13]  Gerhard Weikum,et al.  YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia: Extended Abstract , 2013, IJCAI.

[14]  Katja Hose,et al.  Processing relaxed skylines in PDMS using distributed data summaries , 2006, CIKM '06.

[15]  Huan Liu,et al.  Resource description framework: metadata and its applications , 2001, SKDD.

[16]  Anthony K. H. Tung,et al.  Efficient Skyline Query Processing on Peer-to-Peer Networks , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[17]  Peter F. Patel-Schneider,et al.  Comparative Preferences in SPARQL , 2018, EKAW.

[18]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[19]  Ilaria Bartolini,et al.  SaLSa: computing the skyline without scanning the whole sky , 2006, CIKM '06.

[20]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[21]  Jan Chomicki,et al.  Skyline with presorting , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[22]  Yin Yang,et al.  Skyline Processing on Distributed Vertical Decompositions , 2013, IEEE Transactions on Knowledge and Data Engineering.

[23]  Christos Doulkeridis,et al.  SKYPEER: Efficient Subspace Skyline Computation over Distributed Data , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[24]  Olaf Hartig,et al.  Bindings-Restricted Triple Pattern Fragments , 2016, OTM Conferences.

[25]  K. Selçuk Candan,et al.  Efficient Processing of Skyline-Join Queries over Multiple Data Sources , 2015, TODS.

[26]  Thomas Lukasiewicz,et al.  Preference-Based Query Answering in Probabilistic Datalog+/– Ontologies , 2013, Journal on Data Semantics.

[27]  Jeff Z. Pan,et al.  Querying the Semantic Web with Preferences , 2006, SEMWEB.

[28]  Ling Chen,et al.  Efficiently Evaluating Skyline Queries on RDF Databases , 2011, ESWC.