Answering ordered tree pattern queries over fuzzy XML data

The widespread use of XML calls for efficient methods that support advanced tree pattern queries. In addition, in real-world applications that involve the analysis and management of databases, uncertainty is often an inherent property of the data. Although a large body of research has been devoted to modeling uncertain XML and answering queries over uncertain XML data, how to conduct order-based tree pattern queries on fuzzy XML data remains an open problem. In this work, we aim to fill this gap and tackle the problem of order-based queries on fuzzy XML data. Building on node encoding, we propose efficient tree pattern matching algorithms for ordered queries over fuzzy XML data. In particular, our algorithms adopt a two-phase processing technique: the first phase generates the root-to-leaf path solutions, and the second phase merges these path solutions to produce the desired matches to the ordered tree pattern. Typically, our approach needs to access the data related to the query nodes in the tree pattern only once to obtain all the matching answers. Finally, a comprehensive experimental evaluation is carried out to investigate the performance of the proposed approaches.
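
The two-phase idea described above can be illustrated with a minimal sketch. It is not the algorithm proposed in this work: it is a naive, quadratic illustration that assumes a region encoding (`start`, `end`) for nodes, a single membership degree per node, the minimum as the aggregation of degrees along a match, and a query consisting of one root with an ordered list of descendant leaves. All names (`Node`, `path_solutions`, `ordered_matches`) are hypothetical.

```python
from dataclasses import dataclass
from itertools import product

@dataclass(frozen=True)
class Node:
    tag: str
    start: int      # assumed region encoding: start position
    end: int        # assumed region encoding: end position
    degree: float   # assumed membership degree of the node, in (0, 1]

def is_ancestor(a, d):
    # a contains d if d's region lies strictly inside a's region
    return a.start < d.start and d.end < a.end

def precedes(a, b):
    # a comes entirely before b in document order
    return a.end < b.start

def path_solutions(nodes, root_tag, leaf_tag):
    # Phase 1: solutions for one root-to-leaf query path (root // leaf)
    roots = [n for n in nodes if n.tag == root_tag]
    leaves = [n for n in nodes if n.tag == leaf_tag]
    return [(r, l) for r in roots for l in leaves if is_ancestor(r, l)]

def ordered_matches(nodes, root_tag, leaf_tags, threshold=0.0):
    # Phase 2: merge path solutions that share the same root and whose
    # leaves respect the left-to-right order of the query leaves
    if not leaf_tags:
        return []
    per_path = [path_solutions(nodes, root_tag, t) for t in leaf_tags]
    matches = []
    for combo in product(*per_path):
        root = combo[0][0]
        if any(r != root for r, _ in combo):
            continue
        leaves = [l for _, l in combo]
        if all(precedes(a, b) for a, b in zip(leaves, leaves[1:])):
            # assumption: a match's degree is the minimum node degree
            degree = min([root.degree] + [l.degree for l in leaves])
            if degree >= threshold:
                matches.append((root, tuple(leaves), degree))
    return matches

# Hypothetical fuzzy document: two books with authors and titles
doc = [
    Node("book", 1, 12, 1.0),
    Node("author", 2, 3, 0.8),
    Node("title", 4, 5, 0.9),
    Node("author", 6, 7, 0.6),
    Node("book", 13, 20, 0.7),
    Node("title", 14, 15, 1.0),
    Node("author", 16, 17, 0.9),
]

# Ordered query: a book with an author occurring before a title
author_then_title = ordered_matches(doc, "book", ["author", "title"])
# Ordered query with the sibling order reversed
title_then_author = ordered_matches(doc, "book", ["title", "author"])
```

In this sketch the same data yields different answers depending on the sibling order in the query, which is the distinction an ordered tree pattern adds over an unordered one. Unlike the approach described above, the sketch scans the node list repeatedly instead of accessing the relevant data once.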
