Optimization of XQuery Queries Including FOR Clauses

XML has been established as a format for data exchange on the Internet, which has a large volume scale. By the increase of the volume of XML data, applications need to have a standard query language for efficient retrieving and integrating XML data. XQuery, which is proposed by W3C, is one of the standard query languages for XML data which is widely accepted by many applications. In relation to this, the efficient processing of XQuery queries has become a new research topic and a few researchers have proposed the methods for optimizing XQuery queries to resolve the problem of efficiency. A number of previous researches refer to other information like XML schema or DTD for optimizing the queries. However, our optimization technique does not use any other information except the XQuery query instance itself. Also, some researches just define system dependant optimization methods. In other words, they optimize the execution plan of XQuery queries therefore their methods depend on their XQuery execution engine. However our method can be used in any XQuery engine and in any middleware like a mediator over distributed XQuery engines, because our input is an XQuery query and output also is an optimized XQuery query- The express power of XQuery language is abundant but its syntax is very complex because XQuery borrowed features from several other languages. Therefore, we think that it is necessary to classify the queries and give an optimization method according to the characteristics of classified queries. For these reasons, we consider FOR clauses in XQuery queries because FOR clauses can generate some redundant execution of unnecessary operations. In this paper, we classify the general cases of XQuery queries including FOR clauses, and propose the rewriting rules based on these cases. The focus of our optimization is to reduce the frequency of iterations generated by FOR clauses, of course, without semantic modification of the original query. We implement a prototype system for our optimization algorithm and evaluate the performance between original XQuery queries and optimized XQuery queries.

[1]  Kevin P. Hinshaw,et al.  Distributed XQuery , 2004 .

[2]  Xin Zhang,et al.  XAT: XML Algebra for the Rainbow System , 2002 .

[3]  Torsten. Grust,et al.  Accelerating XPath location steps , 2002, SIGMOD '02.

[4]  Dimitrios Gunopulos,et al.  Architecture and Implementation of an XQuery-based Information Integration Platform. , 2002 .

[5]  Flavius Frasincar,et al.  XAL: An Algebra For XML Query Optimization , 2002, Australasian Database Conference.

[6]  Alon Y. Halevy,et al.  An XML query engine for network-bound data , 2002, The VLDB Journal.

[7]  Ioana Manolescu,et al.  Answering XML Queries on Heterogeneous Data Sources , 2001, VLDB.

[8]  Maxim N. Grinev,et al.  Rewriting-based optimization for XQuery transformational queries , 2005, 9th International Database Engineering & Application Symposium (IDEAS'05).

[9]  Ling Feng,et al.  Web-Based Distributed XML Query Processing , 2003, Intelligent Search on XML Data.

[10]  Tim Furche,et al.  XPath: Looking Forward , 2002, EDBT Workshops.

[11]  Ge Yu,et al.  Effective schema-based XML query optimization techniques , 2003, Seventh International Database Engineering and Applications Symposium, 2003. Proceedings..

[12]  Sven Groppe,et al.  Schema-based Query Optimization for XQuery Queries , 2005, ADBIS Research Communications.

[13]  Ioana Manolescu,et al.  XMark: A Benchmark for XML Data Management , 2002, VLDB.

[14]  Steven J. DeRose,et al.  XML Path Language (XPath) Version 1.0 , 1999 .

[15]  Stefanie Scherzinger,et al.  FluXQuery: An Optimizing XQuery Processor for Streaming XML Data , 2004, VLDB.

[16]  Michael J. Carey,et al.  The BEA streaming XQuery processor , 2004, The VLDB Journal.

[17]  Bradford Pielech,et al.  XML Algebra Optimization , 2002 .

[18]  Elke A. Rundensteiner,et al.  Honey, I shrunk the XQuery!: an XML algebra optimization approach , 2002, WIDM '02.

[19]  Maxim N. Grinev,et al.  XQuery Function Inlining for Optimizing XQuery Queries , 2004, ADBIS.