Achieving adaptivity for OLAP-XML federations

Motivated by the need for more flexible OLAP systems, this paper presents work on the logical integration of external data in OLAP databases, carried out in cooperation between the Danish OLAP client vendor TARGIT and Aalborg University. Flexibility is ensured by supporting XML as the external data format, since almost all data sources can be efficiently wrapped in XML. Earlier work has resulted in an extension of the TARGIT system that allows external XML data to be used as dimensions and measures in OLAP databases. That work has led to a number of new ideas for improving the system's ability to adapt to changes in its surroundings. This paper describes the potential problems that may interrupt the operation of the integration system, in particular those caused by the often autonomous and unreliable nature of external XML data sources, and presents methods for handling these problems. Specifically, we describe techniques for handling changes in external XML data sources. We also describe techniques for improving the reliability of external XML sources, e.g., when these are found on the Internet, by dynamically trying to locate alternative sources during the evaluation of a query. Finally, we discuss solutions to a number of other possible problems and show how the techniques can be integrated into the TARGIT architecture. Experiments performed with a prototype implementation of the central functionality show the viability of the proposed solutions.
