Materialized view selection for XQuery workloads

Efficient XQuery processing still poses significant challenges. A particularly effective technique for improving XQuery performance is to answer queries using materialized views. In this work, we consider the problem of choosing the best views to materialize within a given space budget in order to improve the performance of a query workload. This paper is the first to address the view selection problem for queries and views with value joins and multiple return nodes. The challenges stem from the expressive power and features of both the query and view languages, and from the size of the search space of candidate views to materialize. Although the general problem has prohibitive complexity, we propose and study a heuristic algorithm and demonstrate its superior performance compared to the state of the art.
