Normal forms for multidimensional databases

In the area of online analytical processing (OLAP), the concept of multidimensional databases has gained considerable popularity. Consequently, several multidimensional data models have been proposed by both the research community and commercial product vendors, each emphasizing a different perspective. However, very little work has investigated guidelines for good schema design within such a multidimensional data model. Based on a logical reconstruction of multidimensional schema design, this paper proposes two multidimensional normal forms. These normal forms define modeling constraints for the summary attributes that describe the cells of a multidimensional data cube, as well as constraints for modeling complex dimensional structures appropriately. Multidimensional schemas that comply with these normal forms not only ensure the validity of analytical computations on the multidimensional database, but also favor an efficient physical database design.
