Semantic Query Optimization for XQuery over XML Streams

We study XML stream-specific schema-based optimization. We assume a widely-adopted automata-based execution model for XQuery evaluation. Criteria are established regarding what schema constraints are useful to a particular query. How to apply multiple optimization techniques on an XQuery is then addressed. Finally we present how to correctly and efficiently execute a plan enhanced with our SQO techniques. Our experimentation on both real and synthetic data illustrates that these techniques bring significant performance improvement with little overhead.

[1]  Dan Suciu,et al.  Optimizing regular path expressions using graph schemas , 1998, Proceedings 14th International Conference on Data Engineering.

[2]  Sudarshan S. Chawathe,et al.  XPath queries on streaming data , 2003, SIGMOD '03.

[3]  Laks V. S. Lakshmanan,et al.  On the evaluation of tree pattern queries , 2006, ICSOFT.

[4]  John Grant,et al.  Logic-based approach to semantic query optimization , 1990, TODS.

[5]  D. G. George,et al.  Protein sequence database. , 2001, Methods in enzymology.

[6]  Xin Zhang,et al.  XAT: XML Algebra for the Rainbow System , 2002 .

[7]  Winona C. Barker,et al.  Protein sequence database. , 1990 .

[8]  Dan Suciu,et al.  Type inference for queries on semistructured data , 1999, PODS '99.

[9]  Dan Suciu,et al.  Processing XML Streams with Deterministic Automata , 2003, ICDT.

[10]  Dan Suciu,et al.  Stream processing of XPath queries with predicates , 2003, SIGMOD '03.

[11]  Rajeev Rastogi,et al.  Efficient filtering of XML documents with XPath expressions , 2002, The VLDB Journal.

[12]  Ioana Manolescu,et al.  XMark: A Benchmark for XML Data Management , 2002, VLDB.

[13]  Stefanie Scherzinger,et al.  FluXQuery: An Optimizing XQuery Processor for Streaming XML Data , 2004, VLDB.

[14]  Laks V. S. Lakshmanan,et al.  Minimization of tree pattern queries , 2001, SIGMOD '01.

[15]  Louiqa Raschid,et al.  Semantic query optimization for object databases , 1997, Proceedings 13th International Conference on Data Engineering.

[16]  Denilson Barbosa,et al.  ToXgene: a template-based data generator for XML , 2002, SIGMOD '02.

[17]  Alon Y. Halevy,et al.  An XML query engine for network-bound data , 2002, The VLDB Journal.

[18]  Bertram Ludäscher,et al.  A Transducer-Based XML Query Processor , 2002, VLDB.

[19]  Yanlei Diao,et al.  Query Processing for High-Volume XML Message Brokering , 2003, VLDB.

[20]  Qi Cheng,et al.  Implementation of Two Semantic Query Optimization Techniques in DB2 Universal Database , 1999, VLDB.

[21]  Elke A. Rundensteiner,et al.  Raindrop: a uniform and layered algebraic framework for XQueries on XML streams , 2003, CIKM '03.

[22]  Michael J. Carey,et al.  The BEA/XQRL Streaming XQuery Processor , 2003, VLDB.