Synchronizing XPath views

The increasing availability of XML-based data sources, e.g., for publishing data on the WWW, means that more and more applications (data consumers) rely on accessing and using XML data. Typically, the access is achieved by defining views over the XML data, and accessing data through these views. However, the XML data sources are often independent of the data consumers and may change their schemas without notification, invalidating the XML views defined by the data consumers. This requires the view definitions to be updated to reflect the new structure of the data sources, a process termed view synchronization. XPath is the most commonly used language for retrieving parts of XML documents, and is thus an important cornerstone for XML view definitions. This work presents techniques for discovering schema changes in XML data sources and synchronizing XPath-based views to reflect these schema changes. In many cases, this allows the XML data consumers to continue their operation without interruption. Experiments show that the techniques work well even if both schema and data change at the same time. To our knowledge, this is the first presented technique for synchronizing views over XML data.

[1]  Amélie Marian,et al.  Change-Centric Management of Versions in an XML Warehouse , 2001, VLDB.

[2]  Elke A. Rundensteiner,et al.  Evolvable view environment (EVE): non-equivalent view maintenance under schema changes , 1999, SIGMOD '99.

[3]  Elke A. Rundensteiner,et al.  The CVS Algorithm for View Synchronization in Evolvable Large-Scale Information Systems , 1998, EDBT.

[4]  Torben Bach Pedersen,et al.  Integrating XML data in the TARGIT OLAP system , 2004, Proceedings. 20th International Conference on Data Engineering.

[5]  Serge Abiteboul,et al.  Detecting changes in XML documents , 2002, Proceedings 18th International Conference on Data Engineering.

[6]  Yue Zhuge,et al.  Graph structured views and their incremental maintenance , 1998, Proceedings 14th International Conference on Data Engineering.

[7]  Nabil R. Adam,et al.  Detecting data and schema changes in scientific documents , 2000, Proceedings IEEE Advances in Digital Libraries 2000.

[8]  Jennifer Widom,et al.  Change detection in hierarchically structured information , 1996, SIGMOD '96.

[9]  Roy Goldman,et al.  DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases , 1997, VLDB.

[10]  Proceedings International Database Engineering and Applications Symposium , 2003, Seventh International Database Engineering and Applications Symposium, 2003. Proceedings..

[11]  Serge Abiteboul,et al.  Monitoring XML data on the Web , 2001, SIGMOD '01.

[12]  M. Hascoet,et al.  Xyleme, a dynamic warehouse for XML data of the Web , 2001, Proceedings 2001 International Database Engineering and Applications Symposium.

[13]  Torben Bach Pedersen,et al.  XML-extended OLAP querying , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[14]  Dan Suciu,et al.  Data on the Web: From Relations to Semistructured Data and XML , 1999 .