Determined: a system with novel techniques for XML query optimization and evaluation

Purpose – Efficient processing of XML queries is critical for XML data management and related applications. Previously proposed techniques are unsatisfactory. The purpose of this paper is to present Determined – a new prototype system designed for XML query processing and optimization from a system perspective. With Determined, a number of novel techniques for XML query processing are proposed and demonstrated.Design/methodology/approach – The methodology emphasizes on query pattern minimization, logic‐level optimization, and efficient query execution. Accordingly, three lines of investigation have been pursued in the context of Determined: XML tree pattern query (TPQ) minimization; logic‐level XML query optimization utilizing deterministic transformation; and specialized algorithms for fast XML query execution.Findings – Developed and demonstrated were: a runtime optimal and powerful algorithm for XML TPQ minimization; a unique logic‐level XML query optimization approach that solely pursues deterministic...

[1]  Michael J. Carey,et al.  The BEA/XQRL Streaming XQuery Processor , 2003, VLDB.

[2]  Vishu Krishnamurthy,et al.  Performance Challenges in Object-Relational DBMSs , 1999, IEEE Data Eng. Bull..

[3]  Yangjun Chen,et al.  Efficient Processing of XML Tree Pattern Queries , 2006, J. Adv. Comput. Intell. Intell. Informatics.

[4]  Hongjun Lu,et al.  Bloom Histogram: Path Selectivity Estimation for XML Data with Updates , 2004, VLDB.

[5]  Patricia G. Selinger,et al.  Access path selection in a relational database management system , 1979, SIGMOD '79.

[6]  Flavius Frasincar,et al.  XAL: An Algebra For XML Query Optimization , 2002, Australasian Database Conference.

[7]  Cong Yu,et al.  TIMBER: A native XML database , 2002, The VLDB Journal.

[8]  Jignesh M. Patel,et al.  Structural joins: a primitive for efficient XML query pattern matching , 2002, Proceedings 18th International Conference on Data Engineering.

[9]  Ranjan K. Dash,et al.  A Fully Pipelined XQuery Processor , 2006, XIME-P.

[10]  Jignesh M. Patel,et al.  Structural join order selection for XML query optimization , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[11]  Hyoung-Joo Kim,et al.  XML query processing using document type definitions , 2002, J. Syst. Softw..

[12]  Yangjun Chen,et al.  Minimization of XML Tree Pattern Queries in the Presence of Integrity Constraints , 2006, J. Adv. Comput. Intell. Intell. Informatics.

[13]  Karl Aberer,et al.  Query optimization in XML structured-document databases , 2005, The VLDB Journal.

[14]  Dunren Che,et al.  An efficient algorithm for tree pattern query minimization under broad integrity constraints , 2007, Int. J. Web Inf. Syst..

[15]  Tok Wang Ling,et al.  TwigStackList-: A Holistic Twig Join Algorithm for Twig Query with Not-Predicates on XML Data , 2006, DASFAA.

[16]  Laks V. S. Lakshmanan,et al.  TAX: A Tree Algebra for XML , 2001, DBPL.

[17]  Roy Goldman,et al.  Lore: a database management system for semistructured data , 1997, SGMD.

[18]  Norman May,et al.  Nested queries and quantifiers in an ordered context , 2004, Proceedings. 20th International Conference on Data Engineering.

[19]  Prakash V. Ramanan,et al.  Efficient algorithms for minimizing tree pattern queries , 2002, SIGMOD '02.

[20]  Hongjun Lu,et al.  Holistic Twig Joins on Indexed XML Documents , 2003, VLDB.

[21]  Hongjun Lu,et al.  Efficient Processing of XML Twig Queries with All Predicates , 2004, 2009 Eighth IEEE/ACIS International Conference on Computer and Information Science.

[22]  Mathias Weske,et al.  NaXDB - Realizing Pipelined XQuery Processing in a Native XML Database System , 2005, XIME-P.

[23]  Jignesh M. Patel,et al.  Using histograms to estimate answer sizes for XML queries , 2003, Inf. Syst..

[24]  M. Tamer Özsu,et al.  XSEED: Accurate and Fast Cardinality Estimation for XPath Queries , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[25]  Divesh Srivastava,et al.  Holistic twig joins: optimal XML pattern matching , 2002, SIGMOD '02.

[26]  Hongjun Lu,et al.  Efficient Processing of Twig Queries with OR-Predicates. , 2004, ACM SIGMOD Conference.

[27]  Norman May,et al.  Three Cases for Query Decorrelation in XQuery , 2003, Xsym.

[28]  Laks V. S. Lakshmanan,et al.  Minimization of tree pattern queries , 2001, SIGMOD '01.

[29]  Chun Zhang,et al.  Cost-based optimization in DB2 XML , 2006, IBM Syst. J..

[30]  Tok Wang Ling,et al.  On boosting holism in XML twig pattern matching using structural indexing techniques , 2005, SIGMOD '05.

[31]  Vikas Arora,et al.  Native Xquery processing in oracle XMLDB , 2005, SIGMOD '05.

[32]  Airi Salminen PAT expressions: an algebra for text search , 2007 .

[33]  David J. DeWitt,et al.  On supporting containment queries in relational database management systems , 2001, SIGMOD '01.

[34]  David J. DeWitt,et al.  The EXODUS optimizer generator , 1987, SIGMOD '87.

[35]  Jennifer Widom,et al.  Query Optimization for XML , 1999, VLDB.

[36]  Mengchi Liu,et al.  Query Processing and Optimization for Regular Path Expressions , 2003, CAiSE.

[37]  Hongjun Lu,et al.  Containment join size estimation: models and methods , 2003, SIGMOD '03.