Reasoning Web. Semantic Technologies for Advanced Query Answering

Description Logics (DLs) play a central role as formalisms for representing ontologies and reasoning about them. This lecture introduces the basics of DLs. We discuss the knowledge modeling capabilities of some of the most prominent DLs, including expressive ones, and present some DL reasoning services. Particular attention is devoted to the query answering problem, and to the increasingly popular framework in which data repositories are queried through DL ontologies. We give an overview of the main challenges that arise in this setting, survey some query answering techniques for both lightweight and expressive DLs, and give an overview of the computational complexity landscape.

[1]  Lukasz Golab,et al.  Indexing Time-Evolving Data With Variable Lifetimes , 2006, 18th International Conference on Scientific and Statistical Database Management (SSDBM'06).

[2]  Jae-Gil Lee,et al.  Continuous query processing in data streams using duality of data and queries , 2006, SIGMOD Conference.

[3]  Kirk Pruhs,et al.  Algorithms and metrics for processing multiple heterogeneous continuous queries , 2008, TODS.

[4]  Franz Baader Description Logics , 2009, Reasoning Web.

[5]  Markus Krötzsch,et al.  Unchain My EL Reasoner , 2011, Description Logics.

[6]  Ian Horrocks,et al.  A Description Logic Primer , 2012, ArXiv.

[7]  Ciro Cattuto,et al.  Semantics, Sensors, and the Social Web: The Live Social Semantics Experiments , 2010, ESWC.

[8]  Kirk Pruhs,et al.  Freshness-Aware Scheduling of Continuous Queries in the Dynamic Web , 2005, WebDB.

[9]  Jeffrey F. Naughton,et al.  Rate-based query optimization for streaming information sources , 2002, SIGMOD '02.

[10]  Samuel Madden,et al.  Continuously adaptive continuous queries over streams , 2002, SIGMOD '02.

[11]  Michael Sipser,et al.  Introduction to the Theory of Computation , 1996, SIGA.

[12]  Philip S. Yu,et al.  Interval query indexing for efficient stream processing , 2004, CIKM '04.

[13]  Abraham Bernstein,et al.  Hexastore: sextuple indexing for semantic web data management , 2008, Proc. VLDB Endow..

[14]  Joseph M. Hellerstein,et al.  Using state modules for adaptive query processing , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[15]  Sharma Chakravarthy,et al.  Scheduling Strategies for Processing Continuous Queries over Streams , 2004, BNCOD.

[16]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .

[17]  Óscar Corcho,et al.  Linked Stream Data: A Position Paper , 2009, SSN.

[18]  Ulrike Sattler,et al.  Reasoning in Description Logics: Basics, Extensions, and Relatives , 2007, Reasoning Web.

[19]  Frank van Harmelen,et al.  WebPIE: A Web-scale Parallel Inference Engine using MapReduce , 2012, J. Web Semant..

[20]  Franz Baader,et al.  Pushing the EL Envelope Further , 2008, OWLED.

[21]  Walid G. Aref,et al.  Hash-merge join: a non-blocking join algorithm for producing fast and early join results , 2004, Proceedings. 20th International Conference on Data Engineering.

[22]  Dejing Dou,et al.  Ontology-based information extraction: An introduction and a survey of current approaches , 2010, J. Inf. Sci..

[23]  Franz Baader,et al.  CEL - A Polynomial-Time Reasoner for Life Science Ontologies , 2006, IJCAR.

[24]  Theodore Johnson,et al.  Out-of-order processing: a new architecture for high-performance stream systems , 2008, Proc. VLDB Endow..

[25]  Joseph M. Hellerstein,et al.  The Case for Precision Sharing , 2004, VLDB.

[26]  Dan Brickley,et al.  Rdf vocabulary description language 1.0 : Rdf schema , 2004 .

[27]  John Miles Smith,et al.  Optimizing the performance of a relational algebra database interface , 1975, CACM.

[28]  Jeffrey F. Naughton,et al.  Evaluating window joins over unbounded streams , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[29]  David Maier,et al.  Exploiting Punctuation Semantics in Continuous Data Streams , 2003, IEEE Trans. Knowl. Data Eng..

[30]  Sebastian Rudolph,et al.  Complexities of Horn Description Logics , 2013, TOCL.

[31]  Wolfgang Lehner,et al.  QStream: Deterministic Querying of Data Streams , 2004, VLDB.

[32]  S. Kotoulas,et al.  High-performance Distributed Stream Reasoning using S4 , 2011 .

[33]  Yanlei Diao,et al.  High-performance complex event processing over streams , 2006, SIGMOD Conference.

[34]  Diego Calvanese,et al.  The DL-Lite Family and Relations , 2009, J. Artif. Intell. Res..

[35]  Theodore Johnson,et al.  Prefilter: predicate pushdown at streaming speeds , 2008, SSPS '08.

[36]  Jeffrey F. Naughton,et al.  Maximizing the Output Rate of Multi-Way Join Queries over Streaming Information Sources , 2003, VLDB.

[37]  Sebastian Rudolph,et al.  Foundations of Semantic Web Technologies , 2009 .

[38]  Anni-Yasmin Turhan,et al.  Implementing Completion-Based Inferences for the EL-family , 2011, Description Logics.

[39]  Gerhard Weikum,et al.  The RDF-3X engine for scalable management of RDF data , 2010, The VLDB Journal.

[40]  Hector Garcia-Molina,et al.  Wave-indices: indexing evolving databases , 1997, SIGMOD '97.

[41]  Jianzhong Li,et al.  Processing Sliding Window Join Aggregate in Continuous Queries over Data Streams , 2004, ADBIS.

[42]  Calton Pu,et al.  Continual Queries for Internet Scale Event-Driven Information Delivery , 1999, IEEE Trans. Knowl. Data Eng..

[43]  Magdalena Ortiz,et al.  Reasoning and Query Answering in Description Logics , 2012, Reasoning Web.

[44]  Norman W. Paton,et al.  Adaptive Query Processing: A Survey , 2002, BNCOD.

[45]  Weifeng Su,et al.  ODE: Ontology-assisted data extraction , 2009, TODS.

[46]  Umberto Straccia,et al.  AnQL: SPARQLing Up Annotated RDFS , 2010, SEMWEB.

[47]  Franz Baader,et al.  Pushing the EL Envelope , 2005, IJCAI.

[48]  Laurent Amsaleg,et al.  Cost-based query scrambling for initial delays , 1998, SIGMOD '98.

[49]  Diego Calvanese,et al.  Tractable Reasoning and Efficient Query Answering in Description Logics: The DL-Lite Family , 2007, Journal of Automated Reasoning.

[50]  Bernardo Cuenca Grau,et al.  OWL 2 Web Ontology Language: Profiles , 2009 .

[51]  Bernhard Seeger,et al.  A Temporal Foundation for Continuous Queries over Data Streams , 2005, COMAD.

[52]  Sharma Chakravarthy,et al.  Queueing analysis of relational operators for continuous data streams , 2003, CIKM '03.

[53]  Samuel Madden,et al.  Fjording the stream: an architecture for queries over streaming sensor data , 2002, Proceedings 18th International Conference on Data Engineering.

[54]  Hongjun Lu,et al.  Stabbing the sky: efficient skyline computation over sliding windows , 2005, 21st International Conference on Data Engineering (ICDE'05).

[55]  David J. DeWitt,et al.  Tuple Routing Strategies for Distributed Eddies , 2003, VLDB.

[56]  Andrew Heybey,et al.  Tribeca: A System for Managing Large Databases of Network Traffic , 1998, USENIX Annual Technical Conference.

[57]  Sebastian Rudolph,et al.  Foundations of Description Logics , 2011, Reasoning Web.

[58]  Carsten Lutz,et al.  Conjunctive Query Answering in the Description Logic EL Using a Relational Database System , 2009, IJCAI.

[59]  Wolfgang Lehner,et al.  Robust Real-time Query Processing with QStream , 2005, VLDB.

[60]  Beng Chin Ooi,et al.  Multiple aggregations over data streams , 2005, SIGMOD '05.

[61]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[62]  Carlo Zaniolo,et al.  Query Languages and Data Models for Database Sequences and Data Streams , 2004, VLDB.

[63]  Helen J. Wang,et al.  Online aggregation , 1997, SIGMOD '97.

[64]  Walid G. Aref,et al.  Optimizing In-Order Execution of Continuous Queries over Streamed Sensor Data , 2005, International Conference on Statistical and Scientific Database Management.

[65]  Boris Motik,et al.  A Comparison of Query Rewriting Techniques for DL-lite , 2009, Description Logics.

[66]  Heiner Stuckenschmidt,et al.  Index structures and algorithms for querying distributed RDF repositories , 2004, WWW '04.

[67]  Marcelo Arenas,et al.  Semantics and Complexity of SPARQL , 2006, International Semantic Web Conference.

[68]  Carsten Lutz,et al.  The Combined Approach to Ontology-Based Data Access , 2011, IJCAI.

[69]  Charu C. Aggarwal,et al.  gSketch: On Query Estimation in Graph Streams , 2011, Proc. VLDB Endow..

[70]  Jeremy J. Carroll,et al.  OWL 2 Web Ontology Language RDF-Based Semantics , 2009 .

[71]  Yufei Tao,et al.  RPJ: producing fast join results on streams through rate-based optimization , 2005, SIGMOD '05.

[72]  Georg Lausen,et al.  SP2Bench: A SPARQL Performance Benchmark , 2008, Semantic Web Information Management.

[73]  Ge Yu,et al.  Tick Scheduling: A Deadline Based Optimal Task Scheduling Approach for Real-Time Data Stream Systems , 2005, WAIM.

[74]  Jennifer Widom,et al.  Flexible time management in data stream systems , 2004, PODS.

[75]  Markus Krötzsch,et al.  Concurrent Classification of EL Ontologies , 2011, International Semantic Web Conference.

[76]  Markus Krötzsch,et al.  Practical Reasoning with Nominals in the EL Family of Description Logics , 2012, KR.

[77]  Herman J. ter Horst,et al.  Completeness, decidability and complexity of entailment for RDF Schema and a semantic extension involving the OWL vocabulary , 2005, J. Web Semant..

[78]  Janusz R. Getta,et al.  Processing of Continuous Queries over Unlimited Data Streams , 2002, DEXA.

[79]  Peter J. Haas,et al.  Ripple joins for online aggregation , 1999, SIGMOD '99.

[80]  Michael J. Franklin,et al.  Dynamic Pipeline Scheduling for Improving Interactive Query Performance , 2001, VLDB.

[81]  Donald Kossmann,et al.  The state of the art in distributed query processing , 2000, CSUR.

[82]  David J. DeWitt,et al.  Efficient mid-query re-optimization of sub-optimal query execution plans , 1998, SIGMOD '98.

[83]  Jürgen Umbrich,et al.  YARS2: A Federated Repository for Querying Graph Structured Data from the Web , 2007, ISWC/ASWC.

[84]  Jeffrey F. Naughton,et al.  A non-blocking parallel spatial join algorithm , 2002, Proceedings 18th International Conference on Data Engineering.

[85]  Carlo Zaniolo,et al.  ATLAS: A Small but Complete SQL Extension for Data Mining and Data Streams , 2003, VLDB.

[86]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[87]  Jianzhong Li,et al.  Dynamic Adjustment of Sliding Windows over Data Streams , 2004, WAIM.

[88]  Danh Le Phuoc,et al.  A Native and Adaptive Approach for Unified Processing of Linked Streams and Linked Data , 2011, SEMWEB.

[89]  Walid G. Aref,et al.  Stream window join: tracking moving objects in sensor-network databases , 2003, 15th International Conference on Scientific and Statistical Database Management, 2003..

[90]  Wee Hyong Tok,et al.  Efficient and Adaptive Processing of Multiple Continuous Queries , 2002, EDBT.

[91]  Samuel Madden,et al.  ZStream: a cost-based query processor for adaptively detecting composite events , 2009, SIGMOD Conference.

[92]  A. N. Wilschut,et al.  Dataflow query execution in a parallel main-memory environment , 1991, [1991] Proceedings of the First International Conference on Parallel and Distributed Information Systems.

[93]  Jürgen Umbrich,et al.  Towards Understanding the Changing Web: Mining the Dynamics of Linked-Data Sources and Entities , 2010, LWA.

[94]  Giorgio Orsi,et al.  Ontological queries: Rewriting and optimization , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[95]  JÜRGEN KRÄMER,et al.  Semantics and implementation of continuous sliding window queries over data streams , 2009, TODS.

[96]  Theodore Johnson,et al.  Query-Aware Partitioning for Monitoring Massive Network Data Streams , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[97]  Elke A. Rundensteiner,et al.  State-slice: new paradigm of multi-query optimization of window-based stream queries , 2006, VLDB.

[98]  Frank van Harmelen,et al.  Mind the data skew: distributed inferencing by speeddating in elastic regions , 2010, WWW '10.

[99]  Carsten Lutz,et al.  The Combined Approach to Query Answering in DL-Lite , 2010, KR.

[100]  Sebastian Rudolph,et al.  Conjunctive Queries for a Tractable Fragment of OWL 1.1 , 2007, ISWC/ASWC.

[101]  David Maier,et al.  Semantics and evaluation techniques for window aggregates in data streams , 2005, SIGMOD '05.

[102]  Yufei Tao,et al.  Maintaining sliding window skylines on data streams , 2006, IEEE Transactions on Knowledge and Data Engineering.

[103]  Lukasz Golab,et al.  Update-pattern-aware modeling and processing of continuous queries , 2005, SIGMOD '05.

[104]  Michael Stonebraker,et al.  Optimization of parallel query execution plans in XPRS , 1991, [1991] Proceedings of the First International Conference on Parallel and Distributed Information Systems.

[105]  Christos H. Papadimitriou,et al.  Computational complexity , 1993 .

[106]  Ken C. K. Lee,et al.  QUAY: a data stream processing system using chunking , 2004, Proceedings. International Database Engineering and Applications Symposium, 2004. IDEAS '04..

[107]  Lukasz Golab,et al.  Data Stream Management , 2017, Data Stream Management.

[108]  Miron Livny,et al.  SEQ: A model for sequence databases , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[109]  Walid G. Aref,et al.  Efficient Execution of Sliding-Window Queries Over Data Streams , 2003 .

[110]  Lukasz Golab,et al.  Processing Sliding Window Multi-Joins in Continuous Queries over Data Streams , 2003, VLDB.

[111]  Kyriakos Mouratidis,et al.  Continuous monitoring of top-k queries over sliding windows , 2006, SIGMOD Conference.

[112]  Michael J. Franklin,et al.  On-the-fly sharing for streamed aggregation , 2006, SIGMOD Conference.

[113]  Oren Etzioni,et al.  TextRunner: Open Information Extraction on the Web , 2007, NAACL.

[114]  Amit P. Sheth,et al.  Semantic Sensor Web , 2008, IEEE Internet Computing.

[115]  Jian Pei,et al.  Can we learn a template-independent wrapper for news article extraction from a single training site? , 2009, KDD.

[116]  Kamin Whitehouse,et al.  Semantic Streams: A Framework for Composable Semantic Interpretation of Sensor Data , 2006, EWSN.

[117]  Andreas Harth,et al.  Scalable Authoritative OWL Reasoning for the Web , 2011, Semantic Services, Interoperability and Web Applications.

[118]  Claudio Gutiérrez,et al.  Introducing Time into RDF , 2007, IEEE Transactions on Knowledge and Data Engineering.

[119]  Viktor K. Prasanna,et al.  Parallel Inferencing for OWL Knowledge Bases , 2008, 2008 37th International Conference on Parallel Processing.