WBSum: Workload-based Summaries for RDF/S KBs

Semantic summaries try to extract compact information from the original RDF graph, while reducing its size. State of the art structural semantic summaries, focus primarily on the graph structure of the data, trying to maximize the summary’s utility for a specific purpose, such as indexing, query answering and source selection. In this paper, we present an approach that is able to construct high quality summaries, exploiting a small part of the query workload, maximizing their utility for query answering, i.e. the query coverage. We demonstrate our approach using two real world datasets and the corresponding query workloads and we show that we strictly dominates current state of the art in terms of query coverage.

[1]  Haridimos Kondylakis,et al.  SOFOS: Demonstrating the Challenges of Materialized View Selection on Knowledge Graphs , 2021, SIGMOD Conference.

[2]  Nikos Papadakis,et al.  EvoRDF: A Framework for Exploring Ontology Evolution , 2017, ESWC.

[3]  Enrico Motta,et al.  Identifying Key Concepts in an Ontology, through the Integration of Cognitive Principles with Statistical and Topological Measures , 2008, ASWC.

[4]  Kenza Kellou-Menouer,et al.  SchemaDecrypt++: Parallel on-line Versioned Schema Inference for Large Semantic Web Data sources , 2020, Inf. Syst..

[5]  Dimitris Plexousakis,et al.  Ontology Evolution in Data Integration: Query Rewriting to the Rescue , 2011, ER.

[6]  Kleber Xavier Sampaio de Souza,et al.  Visualization of ontologies through hypertrees , 2003, CLIHC '03.

[7]  Dimitris Plexousakis,et al.  Exploring Importance Measures for Summarizing RDF/S KBs , 2017, ESWC.

[8]  Xiang Zhang,et al.  Ontology summarization based on rdf sentence graph , 2007, WWW '07.

[9]  Dimitris Plexousakis,et al.  Ontology Evolution: Assisting Query Migration , 2012, ER.

[10]  Kenza Kellou-Menouer,et al.  HInT: Hybrid and Incremental Type Discovery for Large RDF Data Sources , 2021, SSDBM.

[11]  Kostas Stefanidis,et al.  Exploring RDFS KBs Using Summaries , 2018, International Semantic Web Conference.

[12]  Gang Wu,et al.  Identifying Potentially Important Concepts and Relations in an Ontology , 2008, International Semantic Web Conference.

[13]  François Goasdoué,et al.  Summarizing semantic graphs: a survey , 2018, The VLDB Journal.

[14]  Dimitris Plexousakis,et al.  Ontology understanding without tears: The summarization approach , 2017, Semantic Web.

[15]  Kostas Stefanidis,et al.  RDF Query Answering Using Apache Spark: Review and Assessment , 2018, 2018 IEEE 34th International Conference on Data Engineering Workshops (ICDEW).

[16]  Stefan Voß,et al.  Steiner's Problem in Graphs: Heuristic Methods , 1992, Discret. Appl. Math..

[17]  Ioana Manolescu,et al.  RDF graph summarization: principles, techniques and applications , 2019, EDBT.

[18]  Muhammad Saleem,et al.  FEASIBLE: A Feature-Based SPARQL Benchmark Generation Framework , 2015, SEMWEB.