On the Discovery of Conserved XML Query Patterns for Evolution-Conscious Caching

Existing xml query pattern-based caching strategies focus on extracting the set of frequently issued query pattern trees ( qpt ) based on the support of the qpt s in the history. These approaches ignore the evolutionary features of the qpt s. In this paper, we propose a novel type of query pattern called conserved query paths ( cqp ) for efficient caching by integrating the support and evolutionary features together. cqp s are paths in qpt s that never change or do not change significantly most of the time (if not always) in terms of their support values during a specific time period. We proposed a set of algorithms to extract frequent cqp s ( fcqp s) and infrequent cqp s ( icqp s) and rank these query paths using evolution-conscious ranking functions . Then, these ranked query paths are used in evolution-conscious caching strategy for efficient xml query processing. Finally, we report our experimental results to show that our strategy is superior to previous qpt -based caching approaches.

[1]  S. Weisberg,et al.  Applied Linear Regression (2nd ed.). , 1986 .

[2]  Mong-Li Lee,et al.  2PXMiner: an efficient two pass mining of frequent XML query patterns , 2004, KDD.

[3]  Dan Suciu,et al.  Query Caching and View Selection for XML Databases , 2005, VLDB.

[4]  Yong Zhang,et al.  Incremental Mining of Frequent Query Patterns from XML Queries for Caching , 2006, Sixth International Conference on Data Mining (ICDM'06).

[5]  Quanzhong Li,et al.  Indexing and Querying XML Data for Regular Path Expressions , 2001, VLDB.

[6]  Malik Beshir Malik,et al.  Applied Linear Regression , 2005, Technometrics.

[7]  Mong-Li Lee,et al.  Mining frequent query patterns from XML queries , 2003, Eighth International Conference on Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings..

[8]  Elke A. Rundensteiner,et al.  XCache: a semantic caching system for XML queries , 2002, SIGMOD '02.

[9]  Elke A. Rundensteiner,et al.  Replacement strategies for XQuery caching systems , 2004, Data Knowl. Eng..

[10]  Vagelis Hristidis,et al.  Semantic Caching of XML Databases , 2002, WebDB.

[11]  Mohammed J. Zaki Efficiently mining frequent trees in a forest , 2002, KDD.

[12]  Mong-Li Lee,et al.  Efficient Mining of XML Query Patterns for Caching , 2003, VLDB.

[13]  Petra Perner,et al.  Advances in Data Mining , 2002, Lecture Notes in Computer Science.

[14]  Jignesh M. Patel,et al.  Structural joins: a primitive for efficient XML query pattern matching , 2002, Proceedings 18th International Conference on Data Engineering.

[15]  I. V. Ramakrishnan,et al.  Nonlinear pattern matching in trees , 1988, JACM.

[16]  Yi Chen,et al.  Incremental mining of frequent XML query patterns , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).