ELITE: An Entailment-Based Federated Query Engine for Complete and Transparent Semantic Data Integration

In recent years the core of the semantic web has evolved to a conceptual layer built by a set of ontologies mapped onto data distributed in numerous data sources, interlinked, interpreted and processed in terms of its semantics. One of the central issues in this context became the federated querying of such linked data. This paper presents the federated query engine ELITE that facilitates a complete and transparent integration and querying of distributed autonomous data sources. To achieve this aim a combination of existing approaches for Ontologybased Data Access (OBDA) and federated query processing on Linked Open Data (LOD) are applied. Consolidating technologies like entailment regimes, the DL-Lite formalism, query rewriting, mapping relational data to RDF and an improved implementation of R-Tree based indexing contributes to the unique features of this federation engine. ELITE thereby enables the integration of various kinds of data sources, for example as relational databases or triple stores, simplicity of query design, guaranteed completeness of query results and highly efficient query processing. The federation engine has been developed and evaluated in the domain of carbon reduction in urban planning. Keywords—Federated Querying, Entailment Regime, Query Rewriting, Ontology-based Data Access (OBDA), DL-Lite, OWL QL, SPARQL, Linked Open Data (LOD), Indexing, R-Tree Index

[1]  S. Christodoulakis,et al.  SPARQL Rewriting for Query Mediation over Mapped Ontologies , 2010 .

[2]  Amit P. Sheth,et al.  Alignment-Based Querying of Linked Open Data , 2012, OTM Conferences.

[3]  Christian Bizer,et al.  Executing SPARQL Queries over the Web of Linked Data , 2009, SEMWEB.

[4]  Joan Horvath Links and Resources , 2014 .

[5]  Diego Calvanese,et al.  Tractable Reasoning and Efficient Query Answering in Description Logics: The DL-Lite Family , 2007, Journal of Automated Reasoning.

[6]  Diego Calvanese,et al.  High Performance Query Answering over DL-Lite Ontologies , 2012, KR.

[7]  Jérôme Euzenat,et al.  Ontology Matching: State of the Art and Future Challenges , 2013, IEEE Transactions on Knowledge and Data Engineering.

[8]  Steffen Staab,et al.  SPLENDID: SPARQL Endpoint Federation Exploiting VOID Descriptions , 2011, COLD.

[9]  Jens Lehmann,et al.  Triplify: light-weight linked data publication from relational databases , 2009, WWW '09.

[10]  Jeff Heflin,et al.  A federated query answering system for semantic web data , 2013 .

[11]  Nektarios Gioldasis,et al.  Ontology Mapping and SPARQL Rewriting for Querying Federated RDF Data Sources - (Short Paper) , 2010, OTM Conferences.

[12]  Wolfram Wöß,et al.  A Semantic Web middleware for Virtual Data Integration on the Web , 2008, ESWC.

[13]  Diego Calvanese,et al.  The MASTRO system for ontology-based data access , 2011, Semantic Web.

[14]  Zahir Tari,et al.  On the Move to Meaningful Internet Systems. OTM 2018 Conferences , 2018, Lecture Notes in Computer Science.

[15]  Amit P. Sheth,et al.  LOQUS: Linked Open Data SPARQL Querying System , 2010 .

[16]  Sebastian Rudolph,et al.  Foundations of Semantic Web Technologies , 2009 .

[17]  Diego Calvanese,et al.  The DL-Lite Family and Relations , 2009, J. Artif. Intell. Res..

[18]  Manfred Hauswirth,et al.  Scalable distributed indexing and query processing over Linked Data , 2012, J. Web Semant..

[19]  Christian Bizer,et al.  D2R Server - Publishing Relational Databases on the Semantic Web , 2004 .

[20]  Mariana Damova,et al.  Storing the Semantic Web: Repositories , 2011, Handbook of Semantic Web Technologies.

[21]  Jeff Heflin,et al.  Using Reformulation Trees to Optimize Queries over Distributed Heterogeneous Sources , 2010, International Semantic Web Conference.

[22]  Ulf Leser,et al.  Querying Distributed RDF Data Sources with SPARQL , 2008, ESWC.

[23]  Peter F. Patel-Schneider,et al.  OWL 2 Web Ontology Language Primer (Second Edition) , 2012 .

[24]  Katja Hose,et al.  FedX: A Federation Layer for Distributed Query Processing on Linked Open Data , 2011, ESWC.

[25]  Abraham Bernstein,et al.  Avalanche: Putting the Spirit of the Web back into Semantic Web Querying , 2010, ISWC Posters&Demos.

[26]  Stanislav Kikot,et al.  Long Rewritings, Short Rewritings , 2012, Description Logics.

[27]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[28]  Marco A. Casanova,et al.  Query Processing in a Mediator Based Framework for Linked Data Integration , 2011, Int. J. Bus. Data Commun. Netw..

[29]  Diego Calvanese,et al.  Linking Data to Ontologies , 2008, J. Data Semant..

[30]  Steffen Staab,et al.  Federated Data Management and Query Optimization for Linked Open Data , 2011, New Directions in Web Data Management 1.

[31]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[32]  Diego Calvanese,et al.  MASTRO-I: Efficient Integration of Relational Data through DL Ontologies , 2007, Description Logics.

[33]  Jürgen Umbrich,et al.  Data summaries for on-demand queries over linked data , 2010, WWW '10.

[34]  Fabian Prasser,et al.  Efficient distributed query processing for autonomous RDF databases , 2012, EDBT '12.

[35]  Carsten Lutz,et al.  The Combined Approach to Ontology-Based Data Access , 2011, IJCAI.

[36]  Álvaro Sicilia,et al.  Data integration driven ontology design, case study smart city , 2013, WIMS '13.

[37]  Katja Hose,et al.  FedX: Optimization Techniques for Federated Query Processing on Linked Data , 2011, SEMWEB.

[38]  Hugh Glaser,et al.  SPARQL query rewriting for implementing data integration over linked data , 2010, EDBT '10.

[39]  Diego Calvanese,et al.  Quest, an OWL 2 QL Reasoner for Ontology-based Data Access , 2012, OWLED.

[40]  Birte Glimm,et al.  Using SPARQL with RDFS and OWL Entailment , 2011, Reasoning Web.

[41]  Maribel Acosta,et al.  ANAPSID: An Adaptive Query Processing Engine for SPARQL Endpoints , 2011, SEMWEB.

[42]  Stanislav Kikot,et al.  On (In)Tractability of OBDA with OWL 2 QL , 2011, Description Logics.