Using Structural Joins and Holistic Twig Joins for Native XML Query Optimization

One of the most important factors for success of native XML database systems is a powerful query optimizer. Surprisingly, little has been done to develop cost models to enable cost-based optimization in such systems. Since the entire optimization process is so complex, only a stepwise approach will lead to a satisfying (future) solution. In this work, we are paving the way for cost-based XML query optimization by developing cost formulae for two important join operators, which allow to perform join reordering and join fusion in a cost-aware way, and, therefore, make joint application of Structural Joins and Holistic Twig Joins possible.

[1]  Chun Zhang,et al.  Cost-based optimization in DB2 XML , 2006, IBM Syst. J..

[2]  Ioana Manolescu,et al.  XMark: A Benchmark for XML Data Management , 2002, VLDB.

[3]  Christian Mathis,et al.  Rules for query rewrite in native XML databases , 2008, DataX '08.

[4]  Theo Härder,et al.  Framework-Based Development and Evaluation of Cost-Based Native XML Query Optimization Techniques , 2009, VLDB PhD Workshop.

[5]  Peter J. Haas,et al.  Statistical Learning Techniques for Costing XML Queries , 2005, VLDB.

[6]  Jignesh M. Patel,et al.  Structural join order selection for XML query optimization , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[7]  Eugene Wong,et al.  Query optimization by simulated annealing , 1987, SIGMOD '87.

[8]  Divesh Srivastava,et al.  Holistic twig joins: optimal XML pattern matching , 2002, SIGMOD '02.

[9]  Neoklis Polyzotis,et al.  Structure and Value Synopses for XML Data Graphs , 2002, VLDB.

[10]  Hongjun Lu,et al.  Holistic Twig Joins on Indexed XML Documents , 2003, VLDB.

[11]  Jennifer Widom,et al.  Query Optimization for XML , 1999, VLDB.

[12]  Christian Mathis,et al.  Node labeling schemes for dynamic XML documents reconsidered , 2007, Data Knowl. Eng..

[13]  Hongjun Lu,et al.  Bloom Histogram: Path Selectivity Estimation for XML Data with Updates , 2004, VLDB.

[14]  Patricia G. Selinger,et al.  Access path selection in a relational database management system , 1979, SIGMOD '79.

[15]  José de Aguiar Moraes Filho,et al.  EXsum: an XML summarization framework , 2008, IDEAS '08.

[16]  Jignesh M. Patel,et al.  Structural joins: a primitive for efficient XML query pattern matching , 2002, Proceedings 18th International Conference on Data Engineering.

[17]  Theo Härder,et al.  An efficient infrastructure for native transactional XML processing , 2007, Data Knowl. Eng..