Efficient evaluation of query rewriting plan over materialized XML view

The query rewriting plan generation over XML views has received wide attention recently. However, little work has been done on efficient evaluation of the query rewriting plans, which is not trivial since the plan may contain an exponential size of sub-plans. This paper investigates the reason for the potentially exponential number of sub-plans, and then proposes a new space-efficient form called ABCPlan (Plan with Automata Based Combinations) to equivalently represent the original query rewriting plan. ABCPlan contains a set of buckets containing suffix paths in the query tree and an automata to indicate the combination of the suffix paths from different buckets as valid query rewriting sub-plans. We also design an evaluation method called ABCScan, which constructs a unified evaluation tree for the ABCPlan and handles the evaluation tree in one scan of the XML view. In the evaluation, we introduce node existence automata to encode the structure of the sub-tree and convert the satisfaction of the ABCPlan into the intersection problem of deterministic finite automata. The experiments show that ABCPlan based method outperforms existing methods significantly in terms of scalability and efficiency.

[1]  Hao Zhang,et al.  Path sharing and predicate evaluation for high-performance XML filtering , 2003, TODS.

[2]  Z. Meral Özsoyoglu,et al.  Rewriting XPath Queries Using Materialized Views , 2005, VLDB.

[3]  Elke A. Rundensteiner,et al.  XCache: a semantic caching system for XML queries , 2002, SIGMOD '02.

[4]  Wanhong Xu,et al.  The Framework of an XML Semantic Caching System , 2005, WebDB.

[5]  Boris Chidlovskii,et al.  Semantic caching of Web queries , 2000, The VLDB Journal.

[6]  Dan Suciu,et al.  Query Caching and View Selection for XML Databases , 2005, VLDB.

[7]  Kam-Fai Wong,et al.  Multiple Materialized View Selection for XPath Query Rewriting , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[8]  Dan Suciu,et al.  Containment and equivalence for an XPath fragment , 2002, PODS.

[9]  Hamid Pirahesh,et al.  A Framework for Using Materialized XPath Views in XML Query Processing , 2004, VLDB.

[10]  Xiaoying Wu,et al.  Answering XML queries using materialized views revisited , 2009, CIKM.

[11]  Chen Li,et al.  Generating efficient plans for queries using views , 2001, SIGMOD '01.

[12]  Byron Choi,et al.  What are real DTDs like? , 2002, WebDB.

[13]  Alin Deutsch,et al.  Rewriting nested XML queries using nested views , 2006, SIGMOD Conference.

[14]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[15]  Gao Jun,et al.  QUERY REWRITING FOR SEMI-STRUCTURED DATA , 2002 .

[16]  Alin Deutsch,et al.  Efficient Rewriting of XPath Queries Using Query Set Specifications , 2009, Proc. VLDB Endow..

[17]  Jun Gao,et al.  MQTree Based Query Rewriting over Multiple XML Views , 2007, DEXA.

[18]  Diego Calvanese,et al.  Rewriting of regular expressions and regular path queries , 1999, PODS '99.

[19]  Alin Deutsch,et al.  XPath Rewriting Using Multiple Views: Achieving Completeness and Efficiency , 2008, WebDB.

[20]  Yannis Papakonstantinou,et al.  Query rewriting for semistructured data , 1999, SIGMOD '99.

[21]  Thomas Schwentick,et al.  XPath Containment in the Presence of Disjunction, DTDs, and Variables , 2003, ICDT.

[22]  Keishi Tajima,et al.  Answering XPath Queries over Networks by Sending Minimal Views , 2004, VLDB.

[23]  Ioana Manolescu,et al.  Structured Materialized Views for XML Queries , 2007, VLDB.

[24]  Jeffrey Xu Yu,et al.  XPath Rewriting Using Multiple Views , 2008, DEXA.