Materialized View Selection for a Data Warehouse Using Frequent Itemset Mining

Data warehouses are subject oriented, consolidated, integrated, and time variant repository of possibly heterogeneous data. A data warehouse is used to response to on-line analytical queries over the millions records of data in an acceptable time. Since a data warehouse often has millions of records of data, it is an important challenge how we can reduce the time of on-line analytical processing. One of the most important issues which address this problem is the view materialization. Each sub-query results an intermediate table, called virtual view, which is used to find final result of the analytical query. These virtual views often are commonly used to response to several analytical queries. We can materialize such views to prevent multiple redundant computations and thus lead to reduction in response time of queries. The constraint of storage memory on one hand, and the maintenance cost of materialized views when the source data are updated on the other hand, cause that it is impossible to materialize all or even large part of views. Therefore, selection of a proper set of views to materialization plays a major role in performance. There are many methods of view selection to materialization which uses different techniques and frameworks to select optimal set of views to materialization. In this paper, we present a new efficient method to conduct selecting proper set of views to materialization using a frequent itemset mining approach. In our algorithm, the set of given queries is transformed to a transaction database where a transaction corresponds to a query and items of a transaction are the original query's predicates. Our performance study showed that this algorithm outperformed substantially the best former algorithms.

[1]  Ahmad Abdollahzadeh Barforoush,et al.  Parallel frequent itemset mining using systolic arrays , 2013, Knowl. Based Syst..

[2]  P. R. Vishwanath,et al.  An Association Rule Mining for Materialized View Selection and View Maintanance , 2015 .

[3]  Jian Yang,et al.  Algorithms for Materialized View Design in Data Warehousing Environment , 1997, VLDB.

[4]  Jiratta Phuboon-ob,et al.  Two-Phase Optimization for Selecting Materialized Views in a Data Warehouse , 2007 .

[5]  Vahid Ghods,et al.  Top-down vertical itemset mining , 2015, International Conference on Graphic and Image Processing.

[6]  Surajit Chaudhuri,et al.  Automated Selection of Materialized Views and Indexes in SQL Databases , 2000, VLDB.

[7]  Zhou Lijuan,et al.  Efficient Materialized View Selection Dynamic Improvement Algorithm , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[8]  Jérôme Darmont,et al.  Clustering-Based Materialized View Selection in Data Warehouses , 2006, ADBIS.

[9]  T. Nalini,et al.  A novel algorithm with IM-LSI index for incremental maintenance of materialized view , 2012 .

[10]  Inderpal Singh Mumick,et al.  Selection of Views to Materialize in a Data Warehouse , 2005, IEEE Trans. Knowl. Data Eng..

[11]  Bela Stantic,et al.  Parallel Simulated Annealing for Materialized View Selection in Data Warehousing Environments , 2008, ICA3PP.

[12]  Ahmad Abdollahzadeh Barforoush,et al.  Efficient colossal pattern mining in high dimensional datasets , 2012, Knowl. Based Syst..

[13]  Xin Yao,et al.  An evolutionary approach to materialized views selection in a data warehouse environment , 2001, IEEE Trans. Syst. Man Cybern. Part C.

[14]  Kamalakar Karlapalem,et al.  View Relevance Driven Materialized View Selection in Data Warehousing Environment , 2002, Australasian Database Conference.

[15]  Raweewan Auepanwiriyakul,et al.  Re-Optimization MVPP Using Common Subexpression for Materialized View Selection , 2013 .

[16]  W. H. Inmon,et al.  Building the data warehouse (2nd ed.) , 1996 .

[17]  Hongjun Lu,et al.  A/sup */ search: an efficient and flexible approach to materialized view selection , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[18]  Jin-Hyuk Yang,et al.  ASVMRT: Materialized View Selection Algorithm in Data Warehouse , 2006, J. Inf. Process. Syst..

[19]  T. Nalini,et al.  An Efficient I-MINE Algorithm for Materialized Views in a Data Warehouse Environment , 2011 .