XML Query Optimisation: Specify your Selectivity

The problem of efficiently evaluating XPath and XQuery queries has become increasingly significant since more and more XML data is stored in its native form. We propose a novel optimisation technique for XML queries that is based on the semantic properties exhibited by XML data. In sharp contrast to previous studies on selectivity estimation we propose to specify bounds on the number of element nodes in an XML tree that form the root of isomorphic subtrees. It turns out that efficient reasoning about these constraints provides effective means to predict the number of XPath and XQuery query answers, to predict the number of updates using the XQuery update facility, to predict the number of en(de)cryptions using XML encryption, and to optimise XML queries.

[1]  Alin Deutsch,et al.  Query reformulation with constraints , 2006, SGMD.

[2]  Elke A. Rundensteiner,et al.  Semantic Query Optimization for XQuery over XML Streams , 2005, VLDB.

[3]  Karl Aberer,et al.  Query optimization in XML structured-document databases , 2005, The VLDB Journal.

[4]  Stéphane Bressan,et al.  Answering Queries in the Presence of XML Keys , 2006, 17th International Workshop on Database and Expert Systems Applications (DEXA'06).

[5]  Jennifer Widom,et al.  Query Optimization for XML , 1999, VLDB.

[6]  Maurizio Lenzerini,et al.  On The Satisfiability of Dependency Constraints in Entity-Relationship Schemata , 1987, VLDB.

[7]  G. S. V. Radha Krishna Rao,et al.  Query Optimization Techniques for XML Databases , 2008 .

[8]  Jeffrey F. Naughton,et al.  Estimating the Selectivity of XML Path Expressions for Internet Scale Applications , 2001, VLDB.

[9]  Sven Hartmann,et al.  Unlocking Keys for XML Trees , 2007, ICDT.

[10]  Jignesh M. Patel,et al.  Estimating Answer Sizes for XML Queries , 2002, EDBT.

[11]  Neoklis Polyzotis,et al.  Selectivity estimation for XML twigs , 2004, Proceedings. 20th International Conference on Data Engineering.

[12]  Wenfei Fan,et al.  Keys for XML , 2002, Comput. Networks.

[13]  David W. Embley,et al.  Cardinality Constraints in Semantic Data Models , 1993, Data Knowl. Eng..