Algorithms for Multidimensional Partitioning of Static Files

The problem of multidimensional file partitioning (MDFP) arises in large databases that are subject to frequent range queries on one or more attributes. In an MDFP scheme, the search attribute space is partitioned into cells, which are mapped to physical disk locations. This mapping preserves the order of the search attribute values so that range queries can be answered most efficiently, while maintaining good performance for other types of queries. Recently, MDFP schemes have been suggested to include both dynamic and static file organizations. Optimal and heuristic MDFP algorithms are developed for the static case. The results of extensive computational experiments show that the proposed heuristics perform better than known static ones. It is also shown that incorporating a static algorithm into a dynamic MDFP such as a grid file at conversion and/or periodical reorganization points significantly improves the resulting storage utilization of the data file and decreases the size of the directory file. >

[1]  Arie Segev,et al.  Optimization of join operations in horizontally partitioned database systems , 1986, TODS.

[2]  Aris M. Ouksel,et al.  Dynamic and Order Preserving Data Partitioning for Database Machines , 1985, VLDB.

[3]  Ellis Horowitz,et al.  Fundamentals of Computer Algorithms , 1978 .

[4]  Jürg Nievergelt,et al.  The Grid File: An Adaptable, Symmetric Multikey File Structure , 1984, TODS.

[5]  Azad Bolour Optimality Properties of Multiple-Key Hashing Functions , 1979, JACM.

[6]  Jon Louis Bentley,et al.  Multidimensional Binary Search Trees in Database Applications , 1979, IEEE Transactions on Software Engineering.

[7]  Arie Shoshani,et al.  Temporal Data Management , 1986, VLDB.

[8]  Alfred V. Aho,et al.  Optimal partial-match retrieval when fields are independently specified , 1979, ACM Trans. Database Syst..

[9]  T H Merrett,et al.  Relational Information Systems , 1983 .

[10]  Michael Stonebraker,et al.  B-trees re-examined , 1978, CACM.

[11]  Aris M. Ouksel,et al.  Multidimensional B-trees for associative searching in database systems , 1982, Inf. Syst..

[12]  Esen A. Ozkarahan Database machines and database management , 1986 .

[13]  J. T. Robinson,et al.  The K-D-B-tree: a search structure for large multidimensional dynamic indexes , 1981, SIGMOD '81.

[14]  T. H. Merrett,et al.  Dynamic Multipaging: A Storage Structure for Large Shared Data Banks , 1982, JCDKB.

[15]  Mireille Régnier,et al.  Analysis of grid file algorithms , 1985, BIT.

[16]  Patrick Valduriez,et al.  A multikey hashing scheme using predicate trees , 1984, SIGMOD '84.