Incremental fusion of XML fragments through semantic identifiers

Many applications, like materialized view maintenance and stream query processing, construct views incrementally over data sources. This results in computed pieces of objects that need to be merged by fusing corresponding objects together. This problem is challenging when dealing with XML data for many reasons including the hierarchical and semi-structured nature of XML data. Also XML query languages (e.g., XQuery) are capable of performing complex operations and transformations such as arbitrary nesting and result reconstruction. Moreover, since XML is an ordered data model, XML order has to be taken into consideration when constructing XML results incrementally. In this paper we study the problem of how to fuse XML pieces (fragments) generated by incrementally processing XML data into XML results. We consider an expressive subset of XQuery language transformations and propose an id-based solution for this problem that supports XML order. We prove the correctness of our approach, in particular that using our mechanism we can correctly yet incrementally merge XML result fragments. We have implemented our proposed semantic identifiers solution. Our experimental results show that it comes with a very small overhead to the query execution time.

[1]  Elke A. Rundensteiner,et al.  Rainbow: multi-XQuery optimization using materialized XML views , 2003, SIGMOD '03.

[2]  Wenfei Fan,et al.  Incremental evaluation of schema-directed XML publishing , 2004, SIGMOD '04.

[3]  Alin Deutsch,et al.  Workshop on Query Processing for Semistructured Data and Non-Standard Data Formats , 1999 .

[4]  Susan B. Davidson,et al.  View Maintenance for Hierarchical Semistructured Data , 2000, DaWaK.

[5]  Alon Y. Halevy,et al.  An XML query engine for network-bound data , 2002, The VLDB Journal.

[6]  David Levine,et al.  Query processing of streamed XML data , 2002, CIKM '02.

[7]  Dan Suciu,et al.  An overview of semistructured data , 1998, SIGA.

[8]  Scott Boag,et al.  XQuery 1.0 : An XML Query Language , 2007 .

[9]  Elke A. Rundensteiner,et al.  Efficiently supporting order in XML query processing , 2005, Data Knowl. Eng..

[10]  Ronald Fagin,et al.  Translating Web Data , 2002, VLDB.

[11]  Elke A. Rundensteiner,et al.  Efficiently supporting order in XML query processing , 2003, WIDM '03.

[12]  Elke A. Rundensteiner,et al.  MASS: a multi-axis storage structure for large XML documents , 2003, CIKM '03.

[13]  Yannis Papakonstantinou,et al.  Object Fusion in Mediator Systems , 1996, VLDB.

[14]  Catriel Beeri,et al.  SAL: An Algebra for Semistructured Data and XML , 1999, WebDB.

[15]  Yue Zhuge,et al.  Graph structured views and their incremental maintenance , 1998, Proceedings 14th International Conference on Data Engineering.

[16]  Elke A. Rundensteiner,et al.  Honey, I shrunk the XQuery!: an XML algebra optimization approach , 2002, WIDM '02.

[17]  Norman May,et al.  Nested queries and quantifiers in an ordered context , 2004, Proceedings. 20th International Conference on Data Engineering.

[18]  Serge Abiteboul,et al.  Incremental Maintenance for Materialized Views over Semistructured Data , 1998, VLDB.

[19]  Norman W. Paton,et al.  MOVIE: An incremental maintenance system for materialized object views , 2003, Data Knowl. Eng..

[20]  Ioana Manolescu,et al.  XMark: A Benchmark for XML Data Management , 2002, VLDB.

[21]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.