Efficient maintenance of basic statistical functions in data warehouses

Simple but meaningful statistical functions are often used to retrieve valuable summary information from corporate databases. Computing such information, however, can be slow, because the underlying information systems must process a large volume of collected data. In practice, the data are usually stored in a data warehouse, in which a large number of summary tables, or materialized aggregate views, are built to improve system performance. When changes occur, most notably when new transactional data are collected from various data sources, all summary tables in the data warehouse that correspond to the transactional data must be updated accordingly. Because the number of summary tables to be maintained is often large, maintaining them efficiently is a critical issue in managing a data warehouse. This study proposes an efficient maintenance approach that enhances data warehouse performance: additional auxiliary tables are kept inside the data warehouse to speed up the maintenance of statistical functions such as MIN, MAX, MEAN, and MEDIAN. Finally, a comparative analysis is performed to verify the effectiveness of the proposed method.
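The abstract does not describe the layout of the auxiliary tables, so the following is only a minimal sketch of the general idea, not the method proposed in the study. It assumes that each summary row keeps a running count and sum (enough for MEAN) plus a value-frequency table (enough for MIN, MAX, and MEDIAN), so that an inserted or deleted transaction updates the summary without rescanning the base data. The names `SummaryRow`, `freq`, and `values` are hypothetical.

```python
from bisect import bisect_left, insort
from collections import Counter


class SummaryRow:
    """Hypothetical summary row with auxiliary state for one group."""

    def __init__(self):
        self.count = 0          # number of base rows in the group
        self.total = 0.0        # running sum, used for MEAN
        self.freq = Counter()   # assumed auxiliary table: value -> occurrences
        self.values = []        # sorted distinct values present in freq

    def insert(self, x):
        """Apply one inserted transaction value."""
        self.count += 1
        self.total += x
        if self.freq[x] == 0:
            insort(self.values, x)
        self.freq[x] += 1

    def delete(self, x):
        """Apply one deleted transaction value."""
        if self.freq[x] == 0:
            raise KeyError(x)
        self.count -= 1
        self.total -= x
        self.freq[x] -= 1
        if self.freq[x] == 0:
            del self.freq[x]
            self.values.pop(bisect_left(self.values, x))

    def min(self):
        return self.values[0] if self.values else None

    def max(self):
        return self.values[-1] if self.values else None

    def mean(self):
        return self.total / self.count if self.count else None

    def _kth(self, k):
        """Return the k-th smallest value (0-based), counting duplicates."""
        seen = 0
        for v in self.values:
            seen += self.freq[v]
            if seen > k:
                return v
        raise IndexError(k)

    def median(self):
        if self.count == 0:
            return None
        lo = self._kth((self.count - 1) // 2)
        hi = self._kth(self.count // 2)
        return (lo + hi) / 2


row = SummaryRow()
for amount in (5, 1, 9, 3):
    row.insert(amount)
row.delete(1)  # the old minimum is removed without rescanning base data
```

The point of the frequency table in this sketch is the deletion case: when the current MIN or MAX is deleted, the next value is read from the auxiliary state instead of being recomputed from the transactional data. The median lookup here walks the distinct values linearly; a real design would likely use an indexed structure.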
