Opportunistic Prioritised Clustering Framework (OPCF)

Ever since the 'early days' of database management systems, clustering has proven to be one of the most effective performance enhancement techniques for object oriented database management systems. The bulk of the work in the area has been on static clustering algorithms which re-cluster the object base when the database is static. However, this type of re-clustering cannot be used when 24-hour database access is required. In such situations dynamic clustering is required, which allows the object base to be reclustered while the database is in operation. We believe that most existing dynamic clustering algorithms lack three important properties. These include: the use of opportunism to imposes the smallest I/O footprint for re-organisation; the re-use of prior research on static clustering algorithms; and the prioritisation of re-clustering so that the worst clustered pages are re-clustered first. In this paper, we present OPCF, a framework in which any existing static clustering algorithm can be made dynamic and given the desired properties of I/O opportunism and clustering prioritisation. In addition, this paper presents a performance evaluation of the ideas suggested above.The main contribution of this paper is the observation that existing static clustering algorithms, when transformed via a simple transformation framework such as OPCF, can produce dynamic clustering algorithms that out-perform complex existing dynamic algorithms, in a variety of situations. This makes the solution presented in this paper particularly attractive to real OODBMS system implementers who often prefer to opt for simpler solutions.

[1]  Michel Schneider,et al.  VOODB: A Generic Discrete-Event Random Simulation Model To Evaluate the Performances of OODBs , 1999, VLDB.

[2]  Guido Moerkotte,et al.  Partition-Based Clustering in Object Bases: From Theory to Practice , 1993, FODO.

[3]  Mehmet A. Orgun,et al.  Dynamic reorganization of object databases , 1999, Proceedings. IDEAS'99. International Database Engineering and Applications Symposium (Cat. No.PR00265).

[4]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[5]  David J. DeWitt,et al.  DBMSs on a Modern Processor: Where Does Time Go? , 1999, VLDB.

[6]  Rajeev Rastogi,et al.  On-line reorganization in object databases , 2000, SIGMOD 2000.

[7]  Guido Moerkotte,et al.  On the cost of monitoring and reorganization of object bases for clustering , 1996, SGMD.

[8]  Michel Schneider,et al.  Dynamic Clustering in Object Databases Exploiting Effective Use of Relationships Between Objects , 1996, ECOOP.

[9]  Roger King,et al.  Cactis: a self-adaptive, concurrent implementation of an object-oriented database management system , 1989, ACM Trans. Database Syst..

[10]  Jeffrey F. Naughton,et al.  On the performance of object clustering techniques , 1992, SIGMOD '92.

[11]  Mehmet A. Orgun,et al.  Clustering Techniques for Minimizing Object Access Time , 1998, ADBIS.

[12]  Guido Moerkotte,et al.  Clustering in Object Bases , 1992 .

[13]  Manolis M. Tsangaris Principles of Static Clustering for Object Oriented Databases , 1992 .

[14]  Roger King,et al.  Self-adaptive, on-line reclustering of complex object data , 1994, SIGMOD '94.

[15]  J. Banerjee,et al.  Clustering a DAG for CAD Databases , 1988, IEEE Trans. Software Eng..

[16]  Jeffrey F. Naughton,et al.  A stochastic approach for clustering in object bases , 1991, SIGMOD '91.

[17]  Michel Schneider,et al.  OCB: A Generic Benchmark to Evaluate the Performances of Object-Oriented Database Systems , 1998, EDBT.

[18]  Véronique Benzaken,et al.  Enhancing Performance in a Persistent Object Store: Clustering Strategies in O2 , 1990, POS.

[19]  Nils Knafla,et al.  Prefetching techniques for client server object-oriented database systems , 1999 .

[20]  Patrick Valduriez,et al.  Open issues in parallel query optimization , 1996, SGMD.

[21]  Chak-Kuen Wong,et al.  On the Optimality of the Probability Ranking Scheme in Storage Applications , 1973, JACM.

[22]  Roger King,et al.  The Performance and Utility of the Cactis Implementation Algorithms , 1990, VLDB.