A data model for supporting on-line analytical processing

A database application, called “on-line analytical processing” (or OLAP) and aimed at providing business intelligence through on-line multidimensional data analysis, has become increasingly important due to the existence of huge amounts of on-line data. This paper formalizes a multidimensional data (MDD) model for OLAP, and develops an algebraic query language called grouping algebra. The basic component of the MDD model is a multidimensional cube, consisting of a number of relations (called dimensions) and for each combination of tuples (called a coordinate), one from each dimension, there is an associated data value. Each dimension is viewed as a basic grouping, i.e., each tuple in the dimension comesponds to the group consisting of all the coordinates that contain this tuple. In order to express user queries, relational algebra expressions are then extended to those on basic groupings for obtaining complex groupings, including orderoriented groupings (for expressing, e.g., cumulative sum). The paper then considers the environment where the multidimensional cubes are materialized views derived from base data situated at remote sites. A multidimensional cube algebra is introduced in order to facilitate the data derivation. The putpose of the paper is to establish a formal foundation for further research regarding databasesupport for OLAP applications.

[1]  Marianne Winslett,et al.  Physical schemas for large multidimensional arrays in scientific computing applications , 1994, Seventh International Working Conference on Scientific and Statistical Database Management.

[2]  Chinya V. Ravishankar,et al.  A physical storage model for efficient statistical query processing , 1994, Seventh International Working Conference on Scientific and Statistical Database Management.

[3]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[4]  Maurizio Rafanelli,et al.  Mefisto: A Functional Model for Statistical Entities , 1993, IEEE Trans. Knowl. Data Eng..

[5]  Sanju K. Bansal Real world requirements for decision support—implications for RDBMS , 1995, SIGMOD '95.

[6]  Meng Chang Chen,et al.  On the Data Model and Access Method of Summary Data Management , 1989, IEEE Trans. Knowl. Data Eng..

[7]  Gultekin Özsoyoglu,et al.  A language and a physical organization technique for summary tables , 1985, SIGMOD Conference.

[8]  J. Davenport Editor , 1960 .

[9]  Michael Stonebraker,et al.  Efficient Organization of Large Multidimensional , 1993 .

[10]  Michael Stonebraker,et al.  Efficient organization of large multidimensional arrays , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[11]  K. Reach,et al.  What is a Language , 1939 .

[12]  Stanley Y. W. Su,et al.  SAM*: A Semantic Association Model for Corporate and Scientific/Statistical Databases , 1983, Inf. Sci..