Cooperative caching for grid-enabled OLAP

In this paper, we propose a grid-based On-Line Analytical Processing (OLAP) application that distributes query computation across an enterprise grid. The application answers queries in two tiers: it first shares cached OLAP data among the users at the local grid site, and then uses grid scheduling to execute the remaining parts of the query across a distributed set of OLAP servers. We also propose a new technique for extracting and aggregating shared cached OLAP data, along with an efficient, aggregate-aware cache controller. An experimental evaluation of the proposed query processing and cooperative caching methods shows a significant reduction in query times compared to previously proposed methods.
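
The two-tier idea can be illustrated with a minimal Python sketch. It is not the implementation evaluated in the paper: the class `AggregateAwareCache`, the function `answer_query`, and the callback `run_on_servers` are hypothetical names introduced here, the only measure is a SUM, and a query is either answered entirely from the local cache or sent entirely to the servers, whereas the abstract describes executing only the remaining parts of a query remotely. "Aggregate-aware" is taken to mean that a cached group-by over more dimensions can be rolled up to answer a coarser query.

```python
from collections import defaultdict


class AggregateAwareCache:
    """Toy cache of SUM group-by results (hypothetical, for illustration only)."""

    def __init__(self):
        # frozenset of dimension names -> (ordered dims, {value tuple: sum})
        self._views = {}

    def put(self, dims, rows):
        """Store a group-by result; each key in rows is ordered like dims."""
        dims = tuple(dims)
        self._views[frozenset(dims)] = (dims, dict(rows))

    def lookup(self, dims):
        """Return the group-by over dims, rolling up a finer cached view if needed."""
        wanted = tuple(dims)
        # Any cached view whose dimensions include the wanted ones can answer the query.
        candidates = [view for key, view in self._views.items() if set(wanted) <= key]
        if not candidates:
            return None
        # Prefer the view with the fewest rows, since it is cheapest to roll up.
        src_dims, src_rows = min(candidates, key=lambda view: len(view[1]))
        positions = [src_dims.index(d) for d in wanted]
        rolled_up = defaultdict(int)
        for key, total in src_rows.items():
            rolled_up[tuple(key[i] for i in positions)] += total
        return dict(rolled_up)


def answer_query(dims, local_cache, run_on_servers):
    """Tier 1: try the shared local cache. Tier 2: fall back to the OLAP servers."""
    result = local_cache.lookup(dims)
    if result is None:
        # Stand-in for grid scheduling across the distributed OLAP servers.
        result = run_on_servers(dims)
        local_cache.put(dims, result)
    return result
```

For example, after `cache.put(("region", "year"), {("EU", 2006): 10, ("EU", 2007): 5, ("US", 2006): 7})`, a call to `cache.lookup(("region",))` is answered by rolling up the cached view, without contacting any server.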
