Analytical Processing Over XML and XLink

Current commercial and academic OLAP tools do not process XML data that contains XLink. To overcome this limitation, this paper proposes an analytical system based on LMDQL, an analytical query language. The XLDM metamodel is also presented; it models cubes of XML documents with XLink and handles the syntactic, semantic and structural heterogeneities commonly found in XML documents. Because current W3C query languages for navigating XML documents do not support XLink, XLPath is discussed to provide features for LMDQL query processing. A prototype system that enables analytical processing of XML documents using XLink is also detailed. The prototype includes a driver, named sql2xquery, which maps SQL queries into XQuery. To validate the proposed system, a case study and its performance evaluation are presented, analyzing the impact of analytical processing over XML/XLink documents.
