Meta Galaxy: A Flexible and Efficient Cube Model for Data Retrieval in OLAP

OLAP is widely used in data analysis. The existing design models, such as star schema and snowflake schema, are not flexible when the data model is changed. For example, the task for inserting a dimension may involve complex operations over model and application implementation. To deal with this problem, a new cube model, called Meta Galaxy, is proposed. The main contributions of this work include: (1) analyzing the shortcoming of traditional design method, (2) proposing a new cube model which is flexible for dimension changes, and (3) designing an index structure and an algorithm to accelerate the cube query. The time complexity of query algorithm is linear. The extensive experiments on the real application and synthetic dataset show that Meta Galaxy is effective and efficient for cube query. Specifically, our method decreases the storage size by 95.12%, decreases the query time by 89.89% in average compared with SQL Server 2005, and has good scalability on data size.

[1]  Elaheh Pourabbas,et al.  Efficient estimation of joint queries from multiple OLAP databases , 2007, TODS.

[2]  Ruoming Jin,et al.  Communication and memory optimal parallel data cube construction , 2005, IEEE Transactions on Parallel and Distributed Systems.

[3]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[4]  T. S. Jayram,et al.  OLAP over uncertain and imprecise data , 2007, The VLDB Journal.

[5]  Elisa Bertino,et al.  PARALLEL AND DISTRIBUTED SYSTEMS , 2010 .

[6]  Qing He,et al.  MSMiner - a developing platform for OLAP , 2007, Decis. Support Syst..

[7]  W. S. Cho,et al.  An Efficient OLAP Query Processing Technique Using Measure Attribute Indexes , 2004, WISE.

[8]  Thomas Neumuth,et al.  OLAP Technology for Business Process Intelligence: Challenges and Solutions , 2007, DaWaK.

[9]  Torben Bach Pedersen,et al.  R-Cubes: OLAP Cubes Contextualized with Documents , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[10]  Lotfi Lakhal,et al.  Emerging Cubes for Trends Analysis in OlapDatabases , 2007, DaWaK.

[11]  Erhard Rahm,et al.  Multi-Dimensional Database Allocation for Parallel Data Warehouses , 2000, VLDB.

[12]  Rada Chirkova,et al.  Exact and inexact methods for selecting views and indexes for OLAP performance improvement , 2008, EDBT '08.

[13]  Yon Dohn Chung,et al.  An efficient, robust method for processing of partial top-k/bottom-k queries using the RD-Tree in OLAP , 2007, Decis. Support Syst..

[14]  Torben Bach Pedersen,et al.  Multidimensional Database Technology , 2001, Computer.