Independence of Containing Patterns Property and Its Application in Tree Pattern Query Rewriting Using Views

We show that several classes of tree patterns observe the independence of containing patterns property, that is, if a pattern is contained in the union of several patterns, then it is contained in one of them. We apply this property to two related problems on tree pattern rewriting using views. First, given view V and query Q, is it possible for Q to have an equivalent rewriting using V which is the union of two or more tree patterns, but not an equivalent rewriting which is a single pattern? This problem is of both theoretical and practical importance because, if the answer is no, then, to find an equivalent rewriting of a tree pattern using a view, we should use more efficient methods, such as the polynomial time algorithm of Xu and Özsoyoglu (2005), rather than try to find the union of all contained rewritings (which takes exponential time in the worst case) and test its equivalence to Q. Second, given a set S of views, we want to know under what conditions a subset S′ of S is redundant in the sense that for any queryQ, the contained rewritings of Q using the views in S′ are contained in those using the views in S − S′. Solving this problem can help us to, for example, choose the minimum number of views to be cached, or better design the virtual schema in a mediated data integration system, or avoid repeated calculation in query optimization. For the first problem, we identify several classes of tree patterns for which the equivalent rewriting can be expressed as a single tree pattern. For the second problem, we present necessary and sufficient conditions for S′ to be redundant with respect to some classes of tree patterns. For both problems we consider extension to cases where there are rewritings using the intersection of multiple views and/or where a schema graph is present.

[1]  Chen Li,et al.  Minimizing View Sets without Losing Query-Answering Power , 2001, ICDT.

[2]  Keishi Tajima,et al.  Answering XPath Queries over Networks by Sending Minimal Views , 2004, VLDB.

[3]  Laks V. S. Lakshmanan,et al.  Answering tree pattern queries using views , 2006, VLDB.

[4]  Alin Deutsch,et al.  Rewriting nested XML queries using nested views , 2006, SIGMOD Conference.

[5]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[6]  Dan Suciu,et al.  Containment and equivalence for a fragment of XPath , 2004, JACM.

[7]  Dan Suciu,et al.  Query Caching and View Selection for XML Databases , 2005, VLDB.

[8]  Jean-Louis Lassez,et al.  Independence of Negative Constraints , 1989, TAPSOFT, Vol.1.

[9]  Kam-Fai Wong,et al.  Multiple Materialized View Selection for XPath Query Rewriting , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[10]  Dan Suciu,et al.  Containment and equivalence for an XPath fragment , 2002, PODS.

[11]  Hamid Pirahesh,et al.  A Framework for Using Materialized XPath Views in XML Query Processing , 2004, VLDB.

[12]  Cong Yu,et al.  Constraint-based XML query rewriting for data integration , 2004, SIGMOD '04.

[13]  Z. Meral Özsoyoglu,et al.  Rewriting XPath Queries Using Materialized Views , 2005, VLDB.

[14]  Laks V. S. Lakshmanan,et al.  Minimization of tree pattern queries , 2001, SIGMOD '01.

[15]  Shuigeng Zhou,et al.  A Theoretic Framework for Answering XPath Queries Using Views , 2005, XSym.

[16]  Jeffrey Xu Yu,et al.  Contained Rewritings of XPath Queries Using Views Revisited , 2008, WISE.

[17]  Ioana Manolescu,et al.  Structured Materialized Views for XML Queries , 2007, VLDB.

[18]  Alan Nash,et al.  Determinacy and Rewriting of Conjunctive Queries Using Views: A Progress Report , 2007, ICDT.

[19]  Alin Deutsch,et al.  Reformulation of XML Queries and Constraints , 2003, ICDT.

[20]  Jeffrey Xu Yu,et al.  XPath Rewriting Using Multiple Views , 2008, DEXA.

[21]  Michael J. Maher,et al.  Rewriting Union Queries Using Views , 2005, Constraints.