History-Offset Implementation Scheme of XML Documents and Its Evaluations

This paper presents a novel implementation scheme of XML documents. First, we describe a labeling scheme for dynamic XML trees, in which no relabeling is necessary against the structural update of trees by the help of small auxiliary data structure. Second, two kinds of encoding/decoding data structure are proposed for implementing XML documents based on history-offset encoding, which is designed for encoding multidimensional datasets. One is for XML tree structure and the other is for path expressions from the root node. By cross-referencing the encoded/decoded results obtained by using these data structures, the structural retrieval using both axis specification and path expressions can be performed very efficiently. Finally, using the constructed prototype system, the performance of our implementation scheme is evaluated and compared with eXist-db, a native XMLDB system.

[1]  Torsten. Grust,et al.  Accelerating XPath location steps , 2002, SIGMOD '02.

[2]  Wolfgang Meier,et al.  eXist: An Open Source Native XML Database , 2002, Web, Web-Services, and Database Systems.

[3]  Tatsuo Tsuji,et al.  An Efficient Implementation for MOLAP Basic Data Structure and Its Evaluation , 2007, DASFAA.

[4]  Erhard Rahm,et al.  Supporting Efficient Streaming and Insertion of XML Data in RDBMS , 2004, DIWeb.

[5]  Mong-Li Lee,et al.  A Prime Number Labeling Scheme for Dynamic Ordered XML Trees , 2004, ICDE.

[6]  Tatsuo Tsuji,et al.  Implementing Vertical Splitting for Large Scale Multidimensional Datasets and Its Evaluations , 2011, DaWaK.

[7]  Patrick E. O'Neil,et al.  ORDPATHs: insert-friendly XML node labels , 2004, SIGMOD '04.

[8]  Erhard Rahm,et al.  Web, Web-Services, and Database Systems , 2003, Lecture Notes in Computer Science.

[9]  Dong Jin,et al.  An Incremental Maintenance Scheme of Data Cubes , 2008, DASFAA.

[10]  Mukesh K. Mohania,et al.  Advances in Databases: Concepts, Systems and Applications , 2007 .

[11]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[12]  Tok Wang Ling,et al.  QED: a novel quaternary encoding to completely avoid re-labeling in XML updates , 2005, CIKM '05.

[13]  Toshiyuki Amagasa,et al.  XRel: a path-based approach to storage and retrieval of XML documents using relational databases , 2001, ACM Trans. Internet Techn..

[14]  Tatsuo Tsuji,et al.  A Labeling Scheme for Dynamic XML Trees Based on History-offset Encoding , 2010 .

[15]  Doron Rotem,et al.  Efficient Storage Allocation of Large-Scale Extendible Multi-dimensional Scientific Datasets , 2006, 18th International Conference on Scientific and Statistical Database Management (SSDBM'06).