Processing XML Twig Pattern Query with Wildcards

In this paper, we present a novel and complementary technique to optimize XML twig pattern queries with wildcards(*). Our approach is based on utilizing a new axis called AD-dis, to equivalently rewrite a query with wildcards (non-branching as well as branching wildcards) into a single query without any wildcards. We present efficient rewriting algorithms and also twig pattern matching algorithms to process the rewritten queries with AD-dis, which is proven to be I/O and CPU optimal. In addition, the experimental results not only verify the scalability and efficiency of our extended matching algorithms, but also demonstrate the effectiveness of our rewriting algorithms.

[1]  Tok Wang Ling,et al.  From Region Encoding To Extended Dewey: On Efficient Processing of XML Twig Pattern Matching , 2005, VLDB.

[2]  Bernhard Thalheim,et al.  Conceptual Modeling - ER 2007 , 2007, Lecture Notes in Computer Science.

[3]  Tok Wang Ling,et al.  Effective XML Keyword Search with Relevance Oriented Ranking , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[4]  Hongjun Lu,et al.  Efficient Processing of XML Twig Queries with All Predicates , 2004, 2009 Eighth IEEE/ACIS International Conference on Computer and Information Science.

[5]  Wenfei Fan,et al.  Taming XPath Queries by Minimizing Wildcard Steps , 2004, VLDB.

[6]  Chen Wang,et al.  Extended XML Tree Pattern Matching: Theories and Algorithms , 2011, IEEE Transactions on Knowledge and Data Engineering.

[7]  David J. DeWitt,et al.  On supporting containment queries in relational database management systems , 2001, SIGMOD '01.

[8]  Tok Wang Ling,et al.  VERT: A Semantic Approach for Content Search and Content Extraction in XML Query Processing , 2007, ER.

[9]  Divesh Srivastava,et al.  Holistic twig joins: optimal XML pattern matching , 2002, SIGMOD '02.

[10]  Tok Wang Ling,et al.  TwigStackList-: A Holistic Twig Join Algorithm for Twig Query with Not-Predicates on XML Data , 2006, DASFAA.

[11]  Yannis Papakonstantinou,et al.  Supporting top-K keyword search in XML databases , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[12]  Tok Wang Ling,et al.  Efficient processing of XML twig patterns with parent child edges: a look-ahead approach , 2004, CIKM '04.