Distributed Stream Consistency Checking

Dealing with noisy data is one of the major challenges in stream processing. While noise has been widely studied in settings where streams have simple schemas, e.g. time series, few solutions have focused on streams characterized by complex data structures. This paper studies how to check consistency over large volumes of complex streams. Our proposed methods exploit reasoning to assess whether portions of the streams comply with a reference conceptual model. To achieve scalability, our methods run on state-of-the-art distributed stream processing platforms, e.g. Apache Storm or Twitter Heron. Our first method computes the closure of Negative Inclusions (NIs) for DL-Lite ontologies and registers the NIs as queries. The second method compiles the ontology into a processing pipeline to distribute the workload evenly. Experiments compare the two methods and show that the second one improves throughput by up to 139% with the LUBM ontology and by up to 330% with the NPD ontology.
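To make the first method more concrete, the following is a minimal sketch of the idea of closing the NIs under the positive inclusions and then checking a window of stream assertions against the result. It is an illustration under strong assumptions, not the implementation evaluated in the paper: it handles atomic concepts only (no roles, no existential restrictions), it runs on a single machine rather than on Storm or Heron, and the function names, the toy axioms, and the individuals are all hypothetical.

```python
from collections import defaultdict
from itertools import product


def subsumees(concept, positive_inclusions):
    """Return every concept C with C ⊑* concept (reflexive and transitive)."""
    children = defaultdict(set)
    for sub, sup in positive_inclusions:
        children[sup].add(sub)
    seen, stack = {concept}, [concept]
    while stack:
        for child in children[stack.pop()]:
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


def ni_closure(positive_inclusions, negative_inclusions):
    """Close the NIs under the positive inclusions (atomic concepts only).

    From A ⊑ ¬B, A' ⊑* A and B' ⊑* B it follows that A' ⊑ ¬B'.
    Disjointness is symmetric, so each derived NI is stored as a frozenset.
    """
    closure = set()
    for a, b in negative_inclusions:
        subs_a = subsumees(a, positive_inclusions)
        subs_b = subsumees(b, positive_inclusions)
        for a2, b2 in product(subs_a, subs_b):
            closure.add(frozenset((a2, b2)))
    return closure


def violations(window, closure):
    """Treat each NI in the closure as a query over one window.

    `window` is an iterable of (individual, concept) assertions.
    Returns the (individual, NI) pairs that witness an inconsistency.
    """
    types = defaultdict(set)
    for individual, concept in window:
        types[individual].add(concept)
    return {
        (individual, ni)
        for individual, concepts in types.items()
        for ni in closure
        if ni <= concepts
    }


# Hypothetical toy axioms in the spirit of a university ontology.
PIS = [("Professor", "Faculty"), ("GraduateStudent", "Student")]
NIS = [("Student", "Faculty")]  # Student ⊑ ¬Faculty (assumed for illustration)

CLOSURE = ni_closure(PIS, NIS)
WINDOW = [
    ("alice", "Professor"),
    ("alice", "GraduateStudent"),
    ("bob", "Student"),
]
FOUND = violations(WINDOW, CLOSURE)
```

In this sketch the closure contains four disjointness pairs, and the only violation in the window is `alice`, who is asserted to be both a `Professor` and a `GraduateStudent`. Precomputing the closure means that each window can be checked with simple lookups rather than by invoking a reasoner at run time.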
