Querying Multidimensional Databases

Multidimensional databases are large collections of data, often historical, used for sophisticated analysis oriented to decision making. This activity is supported by an emerging category of software technology, called On-Line Analytical Processing (OLAP). In spite of a lot of commercial tools already available, a fundamental study for OLAP systems is still lacking. In this paper we introduce a model and a query language to establish a theoretical basis for multi-dimensional data. The model is based on the notions of dimension and f-table. Dimensions are linguistic categories corresponding to different ways of looking at the information. F-tables are the constructs used to represent factual data, and are the logical counterpart of multi-dimensional arrays, the way in which current analytical tools store data. The query language is a calculus for f-tables, and as such it offers a high-level support to multi-dimensional data analysis. Scalar and aggregate functions can be embedded in calculus expressions in a natural way. We discuss on conceptual problems related with the design of multidimensional query languages, and compare our model and language with other approaches.

[1]  Richard Hull,et al.  Safety and translation of calculus queries with scalar functions , 1993, PODS.

[2]  Arie Shoshani,et al.  OLAP and statistical databases: similarities and differences , 1997, PODS '97.

[3]  Kenneth A. Ross,et al.  Querying Multiple Features of Groups in Relational Databases , 1996, VLDB.

[4]  Anthony C. Klug Equivalence of Relational Algebra and Relational Calculus Query Languages Having Aggregate Functions , 1982, JACM.

[5]  Stephen R. Gardner Building the data warehouse , 1998, CACM.

[6]  Venky Harinarayan,et al.  Implementing Data Cubes E ciently , 1996 .

[7]  Surajit Chaudhuri,et al.  Optimization of queries with user-defined predicates , 1996, TODS.

[8]  Limsoon Wong,et al.  A query language for multidimensional arrays: design, implementation, and optimization techniques , 1996, SIGMOD '96.

[9]  Catriel Beeri,et al.  On the power of languages for manipulation of complex objects , 1987, VLDB 1987.

[10]  Jeffrey F. Naughton,et al.  On the Computation of Multidimensional Aggregates , 1996, VLDB.

[11]  Limsoon Wong,et al.  Aggregate Functions, Conservative Extensions, and Linear Orders , 1993, DBPL.

[12]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[13]  George Colliat,et al.  OLAP, relational, and multidimensional database systems , 1996, SGMD.

[14]  Surajit Chaudhuri,et al.  Data warehousing and OLAP for decision support , 1997, SIGMOD '97.

[15]  Divesh Srivastava,et al.  Answering Queries with Aggregation Using Views , 1996, VLDB.

[16]  Sunita Sarawagi,et al.  Modeling multidimensional databases , 1997, Proceedings 13th International Conference on Data Engineering.

[17]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[18]  Antonio Badia,et al.  Providing better support for a class of decision support queries , 1996, SIGMOD '96.

[19]  Henry C. Lucas,et al.  Toward a logical/physical theory of spreadsheet modeling , 1992, TOIS.

[20]  Laks V. S. Lakshmanan,et al.  Tables as a paradigm for querying and restructuring (extended abstract) , 1996, PODS '96.