A Neural-Based Approach for Extending OLAP to Prediction

In the Data Warehouse (DW) technology, On-line Analytical Processing (OLAP) is a good applications package that empowers decision makers to explore and navigate into a multidimensional structure of precomputed measures, which is referred to as a Data Cube. Though, OLAP is poorly equipped for forecasting and predicting empty measures of data cubes. Usually, empty measures translate inexistent facts in the DW and in most cases are a source of frustration for enterprise managements, especially when strategic decisions need to be taken. In the recent years, various studies have tried to add prediction capabilities to OLAP applications. For this purpose, generally, Data Mining and Machine Learning methods have been widely used to predict new measures' values in DWs. In this paper, we introduce a novel approach attempting to extend OLAP to a prediction application. Our approach operates in two main stages. The first one is a preprocessing one that makes use of the Principal Component Analysis (PCA) to reduce the dimensionality of the data cube and then generates ad hoc training sets. The second stage proposes a novel OLAP oriented architecture of Multilayer Perceptron Networks (MLP) that learns from each training set and comes out with predicted measures of inexistent facts. Carried out experiments demonstrate the effectiveness of our proposal and the performance of its predictive capabilities.

[1]  Nimrod Megiddo,et al.  Discovery-Driven Exploration of OLAP Data Cubes , 1998, EDBT.

[2]  Tshilidzi Marwala,et al.  Computational Intelligence for Missing Data Imputation, Estimation, and Management - Knowledge Optimization Techniques , 2009, Computational Intelligence for Missing Data Imputation, Estimation, and Management.

[3]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[4]  Rokia Missaoui,et al.  OLEMAR: An Online Environment for Mining Association Rules in Multidimensional Data , 2008 .

[5]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[6]  Sabine Loudcher,et al.  Vers l'intégration de la prédiction dans les cubes OLAP , 2008, EGC.

[7]  Alberto O. Mendelzon,et al.  Using datacube aggregates for approximate querying and deviation detection , 2005, IEEE Transactions on Knowledge and Data Engineering.

[8]  Yan Zhang,et al.  Using the Method Combining PCA with BP Neural Network to Predict Water Demand for Urban Development , 2009, 2009 Fifth International Conference on Natural Computation.

[9]  Isidro Ramos,et al.  Advances in Database Technology — EDBT'98 , 1998, Lecture Notes in Computer Science.

[10]  Yixin Chen,et al.  Regression Cubes with Lossless Compression and Aggregation , 2006, IEEE Transactions on Knowledge and Data Engineering.

[11]  W. H. Inmon,et al.  Building the data warehouse , 1992 .

[12]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[13]  Yi Lin,et al.  Prediction Cubes , 2005, VLDB.

[14]  Deepak Agarwal,et al.  Latent OLAP: data cubes over latent variables , 2011, SIGMOD '11.

[15]  W. H. Inmon,et al.  Building the data warehouse (2nd ed.) , 1996 .

[16]  Sadok Ben Yahia,et al.  Yet Another Approach for Completing Missing Values , 2006, CLA.

[17]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[18]  Fred Collopy,et al.  How effective are neural networks at forecasting and prediction? A review and evaluation , 1998 .

[19]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .