Dynamic Declustering Methods for Parallel Grid Files

Several declustering functions for distributing multi-attribute data on a set of disks have been proposed in recent years. Since these functions map grid regions to disks in a static way, performance deteriorates in case of dynamic datasets and/or non-stationary data distributions. We first analyze how declustering functions can be extended in order to deal with dynamic datasets without requiring periodic reorganizations. In order to support dynamic declustering, we propose to organize the directory as a parallel Multilevel Grid File. On this structure we experiment five dynamic declustering functions and two index-based allocation methods that only use locally available information. This first comparison among the two approaches reveals that methods based on local criteria always yield better results.

[1]  Jack A. Orenstein Spatial query processing in an object-oriented database system , 1986, SIGMOD '86.

[2]  Christos Faloutsos,et al.  Disk Allocation Methods Using Error Correcting Codes , 1991, IEEE Trans. Computers.

[3]  Christos Faloutsos,et al.  Declustering using fractals , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[4]  Jürg Nievergelt,et al.  The Grid File: An Adaptable, Symmetric Multikey File Structure , 1984, TODS.

[5]  Benoit B. Mandelbrot,et al.  Fractal Geometry of Nature , 1984 .

[6]  Shashi Shekhar,et al.  A similarity graph-based approach to declustering problems and its application towards parallelizing grid files , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[7]  David A. Patterson Massive Parallelism and Massive Storage: Trends and Predictions for 1995 to 2000 , 1993, PDIS.

[8]  Pavel Zezula,et al.  Declustering of key-based partitioned signature files , 1996, TODS.

[9]  Christos Faloutsos,et al.  Parallel R-trees , 1992, SIGMOD '92.

[10]  Jaideep Srivastava,et al.  Algorithms for loading parallel grid files , 1993, SIGMOD Conference.

[11]  Jaideep Srivastava,et al.  CMD: A Multidimensional Declustering Method for Parallel Data Systems , 1992, VLDB.

[12]  David J. DeWitt,et al.  Parallel database systems: the future of high performance database systems , 1992, CACM.

[13]  Shashi Shekhar,et al.  Disk allocation methods for parallelizing grid files , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[14]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[15]  John S. Sobolewski,et al.  Disk allocation for Cartesian product files on multiple-disk systems , 1982, TODS.

[16]  Jaideep Srivastava,et al.  Performance evaluation of grid based multi-attribute record declustering methods , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[17]  Sakti Pramanik,et al.  Optimal file distribution for partial match retrieval , 1988, SIGMOD '88.

[18]  Jianzhong Li,et al.  CMD : A Multidimensional Declustering Method for Parallel Database Systems 1 , 1992 .