OLAP Visualization: Models, Issues, and Techniques

INTRODUCTION The problem of efficiently visualizing multidimensional data sets produced by scientific and statistical tasks/ processes is becoming increasingly challenging, and is attracting the attention of a wide multidisciplinary community of researchers and practitioners. Basically, this problem consists in visualizing multidimensional data sets by capturing the dimensionality of data, which is the most difficult aspect to be considered. Human analysts interacting with high-dimensional data often experience disorientation and cognitive overload. Analysis of high-dimensional data is a challenge encountered in a wide set of real-life applications such as (i) biological databases storing massive gene and protein data sets, (ii) real-time monitoring systems accumulating data sets produced by multiple, multi-rate streaming sources, (iii) advanced Business Intelligence (BI) systems collecting business data for decision making purposes etc. Traditional DBMS front-end tools, which are usually tuple-bag-oriented, are completely inadequate to fulfill the requirements posed by an interactive exploration of high-dimensional data sets due to two major reasons: (i) DBMS implement the OLTP paradigm, which is optimized for transaction processing and deliberately neglects the dimensionality of data; (ii) DBMS operators are very poor and offer nothing beyond the capability of conventional SQL statements, what makes such tools very inefficient with respect to the goal of visualizing and, above all, interacting with multidimensional data sets embedding a large number of dimensions. Despite the above-highlighted practical relevance of the problem of visualizing multidimensional data sets, the literature in this field is rather scarce, due to the fact that, for many years, this problem has been of relevance for life science research communities only, and interaction of the latter with the computer science research community has been insufficient. Following the enormous growth of scientific disciplines like Bio-Informatics, this problem has then become a fundamental field in the computer science academic as well as industrial research. At the same time, a number of proposals dealing with the multidimensional data visualization problem appeared in literature, with the amenity of stimulating novel and exciting application fields such as the visualization of Data Mining results generated by challenging techniques like clustering and association rule discovery. The above-mentioned issues are meant to facilitate understanding of the high relevance and attractiveness of the problem of visualizing multidimensional data sets at present and in the future, with challenging research findings accompanied by significant spin-offs in the Information Technology (IT) industrial field. A possible solution to tackle this problem is represented by well-known OLAP techniques (Codd …

[1]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[2]  Alfredo Cuzzocrea,et al.  A Hierarchy-Driven Compression Technique for Advanced OLAP Visualization of Multidimensional Data Cubes , 2006, DaWaK.

[3]  Hing-Yan Lee,et al.  A New Visualisation Technique for Knowledge Discovery in OLAP (Abstract) , 1995, KDOOD/TDOOD.

[4]  Pat Hanrahan,et al.  Multiscale Visualization Using Data Cubes , 2003, IEEE Trans. Vis. Comput. Graph..

[5]  Panos Vassiliadis,et al.  Advanced visualization for OLAP , 2003, DOLAP '03.

[6]  Amitava Datta,et al.  Interactive Visualization for OLAP , 2005, ICCSA.

[7]  E. F. Codd,et al.  Providing OLAP to User-Analysts: An IT Mandate , 1998 .

[8]  Dominique Laurent,et al.  Computing appropriate representations for multidimensional data , 2003 .

[9]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[10]  Marc H. Scholl,et al.  Exploring OLAP aggregates with hierarchical visualization techniques , 2007, SAC '07.

[11]  Pat Hanrahan,et al.  Polaris: a system for query, analysis and visualization of multi-dimensional relational databases , 2000, IEEE Symposium on Information Visualization 2000. INFOVIS 2000. Proceedings.

[12]  Dominique Laurent,et al.  Computing appropriate representations for multidimensional data , 2001, DOLAP '01.

[13]  David P. Tegarden,et al.  Business Information Visualization , 1999, Commun. Assoc. Inf. Syst..

[14]  Randy Goebel,et al.  Towards a Novel OLAP Interface for Distributed Data Warehouses , 2001, DaWaK.

[15]  Hans-Peter Kriegel,et al.  VisDB: database exploration using multidimensional visualization , 1994, IEEE Computer Graphics and Applications.

[16]  Matthias Jarke,et al.  A toolkit for negotiation support interfaces to multi-dimensional data , 1997, SIGMOD '97.

[17]  Stephen G. Eick Visualizing multi-dimensional data , 2000, SIGGRAPH 2000.

[18]  Alfred Inselberg Visualization and knowledge discovery for high dimensional data , 2001, Proceedings Second International Workshop on User Interfaces in Data Intensive Systems. UIDIS 2001.

[19]  Mark Sifer A visual interface technique for exploring OLAP data with coordinated dimension hierarchies , 2003, CIKM '03.

[20]  Panos Vassiliadis,et al.  CPM: A Cube Presentation Model for OLAP , 2003, DaWaK.

[21]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[22]  Tosiyasu L. Kunii,et al.  Bending and creasing virtual paper , 1994, IEEE Computer Graphics and Applications.

[23]  Alfredo Cuzzocrea,et al.  Semantics-Aware Advanced OLAP Visualization of Multidimensional Data Cubes , 2007, Int. J. Data Warehous. Min..