Full-Text Support for Publish/Subscribe Ontology Systems

In this work, we envision a publish/subscribe ontology system that is able to index large numbers of expressive continuous queries and filter them against RDF data that arrive in a streaming fashion. To this end, we propose a SPARQL extension that supports the creation of full-text continuous queries and propose a family of main-memory query indexing algorithms which perform matching at low complexity and minimal filtering time. We experimentally compare our approach against a state-of-the-art competitor extended to handle indexing of full-text queries both on structural and full-text tasks using real-world data. Our approach proves two orders of magnitude faster than the competitor in all types of filtering tasks.

[1]  Hans-Arno Jacobsen,et al.  Predicate-based Filtering of XPath Expressions , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[2]  Andrew Trotman,et al.  Sound and complete relevance assessment for XML retrieval , 2008, TOIS.

[3]  Hao Zhang,et al.  Path sharing and predicate evaluation for high-performance XML filtering , 2003, TODS.

[4]  Chin-Wan Chung,et al.  iBroker: An Intelligent Broker for Ontology Based Publish/Subscribe Systems , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[5]  Dan Suciu,et al.  Processing XML streams with deterministic automata and stream indexes , 2004, TODS.

[6]  Wolfgang Nejdl,et al.  Publish/Subscribe for RDF-based P2P Networks , 2004, ESWS.

[7]  Hector Garcia-Molina,et al.  The SIFT information dissemination system , 1999, TODS.

[8]  Dieter Fensel,et al.  Sparkwave: continuous schema-enhanced pattern matching over RDF data streams , 2012, DEBS.

[9]  Manolis Koubarakis,et al.  Information filtering and query indexing for an information retrieval model , 2009, TOIS.

[10]  Hans-Arno Jacobsen,et al.  G-ToPSS: fast filtering of graph-based metadata , 2005, WWW '05.

[11]  Daniele Braga,et al.  C-SPARQL: a Continuous Query Language for RDF Data Streams , 2010, Int. J. Semantic Comput..

[12]  Rajeev Rastogi,et al.  Efficient filtering of XML documents with XPath expressions , 2002, The VLDB Journal.

[13]  Kevin Chen-Chuan Chang,et al.  Predicate rewriting for translating Boolean queries in a heterogeneous information system , 1999, TOIS.

[14]  Manolis Koubarakis,et al.  RDFS Reasoning and Query Answering on Top of DHTs , 2008, SEMWEB.

[15]  Françoise Baude,et al.  A Distributed Publish/Subscribe System for RDF Data , 2013, Globe.

[16]  Sudarshan S. Chawathe,et al.  XPath queries on streaming data , 2003, SIGMOD '03.

[17]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[18]  Jing Li,et al.  An Ontology-Based Publish/Subscribe System , 2004, Middleware.

[19]  Manolis Koubarakis,et al.  Publish/Subscribe with RDF Data over Large Structured Overlay Networks , 2005, DBISP2P.

[20]  Hans-Arno Jacobsen,et al.  S-ToPSS: Semantic Toronto Publish/Subscribe System , 2003, VLDB.