Performance Optimization of Analysis Rules in Real-Time Active Data Warehouses

Analysis rule is an important component of a real-time active data warehouse. Performance optimization of analysis rules may greatly improve the system response time when a new event occurs. In this paper, we carry out the optimization work through the following three ways: (1)initiating non-real-time analysis rules as less as possible during rush hour of real-time analysis rules; (2) executing non-real-time analysis rules using the same cube at the same time interval; and (3) preparing frequent cubes for the use of real-time analysis rules ahead of time. We design the LADE system to get all the reference information required by optimization work. A new algorithm, called ARPO, is proposed to carry out the optimization work. Empirical studies show that our methods can effectively improve the performance of analysis rules.

[1]  A Min Tjoa,et al.  Capturing Delays and Valid Times in Data Warehouses—Towards Timely Consistent Analyses , 2002, Journal of Intelligent Information Systems.

[2]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[3]  David Taniar,et al.  Towards Near Real-Time Data Warehousing , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[4]  Chen Lin,et al.  Maintaining Internal Consistency of Report for Real-Time OLAP with Layer-Based View , 2011, APWeb.

[5]  Panos Vassiliadis,et al.  Supporting Streaming Updates in an Active Data Warehouse , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[6]  Kyuseok Shim,et al.  Web Technologies and Applications , 2014, Lecture Notes in Computer Science.

[7]  Michael Schrefl,et al.  Active data warehouses: complementing OLAP with analysis rules , 2001, Data Knowl. Eng..

[8]  Michael Schrefl,et al.  On Making Data Warehouses Active , 2000, DaWaK.

[9]  Tan Hong-xing Dynamic Selection of Materialized Views of Multi-Dimensional Data , 2002 .

[10]  A Min Tjoa,et al.  Zero-Latency Data Warehousing for Heterogeneous Data Sources and Continuous Data Streams , 2003, iiWAS.