Reasoning on regular path queries

Current information systems must handle data that are more complex than traditional relational data. The database community has already proposed abstractions for such data, in particular semistructured data models. A semistructured model treats a database essentially as a finite directed labeled graph whose nodes represent objects and whose edges represent relationships between objects. Just as conjunctive queries form the core of any query language for the relational model, regular path queries (RPQs) and their variants are considered the basic querying mechanisms for semistructured data.

Besides the basic task of query answering, i.e., evaluating a query over a database, databases should support other reasoning services related to querying. One of the most important is query containment, i.e., verifying whether, for all databases, the answer to one query is a subset of the answer to a second query. Another important reasoning service, which has received considerable attention in recent years, is view-based query processing, which amounts to processing queries based on a set of materialized views rather than on the raw data in the database.

The goal of this paper is to describe basic results and techniques concerning query containment and view-based query processing for the class of two-way regular path queries (which extend RPQs with the inverse operator). We demonstrate that the basic services for reasoning about two-way regular path queries are decidable, thus showing that the limited form of recursion expressible by these queries does not endanger the decidability of reasoning. Beyond the specific results, our methods show the power of two-way automata in reasoning on complex queries.
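To make the notions of a graph database and of answering a two-way regular path query concrete, the following is a minimal sketch in Python. It is not the construction used in the paper: it covers only plain query answering (not containment or view-based processing), and it assumes a representation chosen purely for illustration. The database is a set of (source, label, target) triples, the query is given as a nondeterministic finite automaton, and a symbol ending in "-" (a hypothetical convention) means that an edge is traversed backwards. The function name `eval_2rpq` is likewise an assumption.

```python
from collections import deque


def eval_2rpq(edges, nfa_delta, start_state, final_states):
    """Return all pairs (x, y) linked by a path whose word the automaton accepts.

    edges: iterable of (source, label, target) triples (the graph database).
    nfa_delta: dict mapping (state, symbol) to a set of successor states.
               A symbol is either a label "a" (edge followed forwards) or
               "a-" (edge followed backwards); this naming is an assumption.
    """
    # Index every edge in both directions over the two-way alphabet.
    steps = {}
    nodes = set()
    for src, label, dst in edges:
        nodes.update((src, dst))
        steps.setdefault(src, []).append((label, dst))
        steps.setdefault(dst, []).append((label + "-", src))

    answers = set()
    for origin in nodes:
        # Breadth-first search over (graph node, automaton state) pairs.
        seen = {(origin, start_state)}
        queue = deque(seen)
        while queue:
            node, state = queue.popleft()
            if state in final_states:
                answers.add((origin, node))
            for symbol, next_node in steps.get(node, []):
                for next_state in nfa_delta.get((state, symbol), ()):
                    pair = (next_node, next_state)
                    if pair not in seen:
                        seen.add(pair)
                        queue.append(pair)
    return answers


# Tiny database: node "a" has two children, "b" and "c".
edges = [("a", "child", "b"), ("a", "child", "c")]

# Automaton for the query child- . child (go up to a parent, then down to a child).
sibling_delta = {(0, "child-"): {1}, (1, "child"): {2}}

siblings = eval_2rpq(edges, sibling_delta, start_state=0, final_states={2})
```

On this small database, `siblings` contains exactly the pairs of nodes that share a parent, including each child paired with itself. The query cannot be expressed as a one-way RPQ over the same labels, since it needs to follow a `child` edge against its direction; this is what the inverse operator adds.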
