Dynamic labeling scheme for XML updates

Several labeling schemes have been proposed to facilitate XML query processing; they allow structural relationships among nodes to be determined quickly without accessing the original XML documents. However, existing schemes often run into problems when updates take place: a large number of labels must be re-labeled, the updated labels require substantial space, and structural relationships become inefficient to determine. In this paper, we propose a novel labeling scheme that not only completely avoids re-labeling but also improves the performance of determining structural relationships when XML documents are frequently updated at arbitrary positions. The fundamental difference between our scheme and previous ones is that its gain in update performance does not come at the expense of label size or query performance. In particular, instead of always assigning entirely new labels to inserted nodes, our scheme reuses deleted labels to encode newly inserted nodes, which effectively reduces label size. Moreover, we formally analyze the effectiveness of the proposed labeling scheme. Finally, we complement our analysis with experimental results on a range of real XML data.
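To illustrate what it means to determine structural relationships from labels alone, the following is a minimal sketch of a classic containment (start, end, level) labeling. It is not the scheme proposed in this paper; the function names `label_tree`, `is_ancestor`, and `is_parent`, the sample document, and the use of tag names as node keys are assumptions made for illustration only.

```python
import xml.etree.ElementTree as ET


def label_tree(root):
    """Assign (start, end, level) containment labels by depth-first traversal.

    Illustrative assumption: nodes are keyed by tag name, so every tag in
    the sample document must be unique.
    """
    labels = {}
    counter = 0

    def visit(node, level):
        nonlocal counter
        counter += 1
        start = counter              # number taken when the node is entered
        for child in node:
            visit(child, level + 1)
        counter += 1                 # number taken when the node is left
        labels[node.tag] = (start, counter, level)

    visit(root, 0)
    return labels


def is_ancestor(a, b):
    """a is an ancestor of b iff a's interval strictly contains b's."""
    return a[0] < b[0] and b[1] < a[1]


def is_parent(a, b):
    """a is the parent of b iff a contains b and is exactly one level above."""
    return is_ancestor(a, b) and a[2] + 1 == b[2]


doc = ET.fromstring("<book><title/><chapter><section/></chapter></book>")
labels = label_tree(doc)

for tag, label in labels.items():
    print(tag, label)
print(is_ancestor(labels["book"], labels["section"]))
print(is_parent(labels["chapter"], labels["section"]))
```

In this sketch, `title` ends at 3 and `chapter` starts at 4, so inserting a new sibling between them leaves no free integer and would force the following labels to be renumbered. This is the kind of re-labeling cost that dynamic labeling schemes aim to avoid.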
