Genetic algorithms based approach to database vertical partition

Vertical partition clusters attributes of a relation to generate fragments suitable for subsequent allocation over a distributed platform with the goal of improving performance. Vertical partition is an optimization problem that can resort to genetic algorithms (GA). However, the performance of the classical GA application to vertical partition as well as to similar problems such as clustering and grouping suffers from two major drawbacks—redundant encoding and non-group oriented genetic operations. This paper applies the restricted growth (RG) string Ruskey (1993) constraint to manipulate the chromosomes so that redundant chromosomes are excluded during the GA process. On RG string compliant chromosomes, the group oriented crossover and mutation become realizable. We thus propose a novel approach called Group oriented Restricted Growth String GA (GRGS-GA) which incorporates the two above features. Finally, we compare the proposed approach with a rudimental RG string based approach and a classical GA based approach. The conducted experiments demonstrate a significant improvement of GRGS-GA on partition speed and result, especially for large size vertical partition problems.

[1]  Wesley W. Chu,et al.  A Transaction-Based Approach to Vertical Partitioning for Relational Database Systems , 1993, IEEE Trans. Software Eng..

[2]  Philip S. Yu,et al.  An Effective Approach to Vertical Partitioning for Physical Design of Relational Databases , 1990, IEEE Trans. Software Eng..

[3]  E. M. Wright,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[4]  Derek Smith,et al.  Bin Packing with Adaptive Search , 1985, ICGA.

[5]  Shamkant B. Navathe,et al.  A Mixed Fragmentation Methodology For Initial Distributed Database Design , 1995 .

[6]  Emanuel Falkenauer,et al.  A hybrid grouping genetic algorithm for bin packing , 1996, J. Heuristics.

[7]  Shamkant B. Navathe,et al.  Vertical partitioning algorithms for database design , 1984, TODS.

[8]  Shamkant B. Navathe,et al.  A formal approach to the vertical partitioning problem in distributed database design , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[9]  Narasimhaiah Gorla,et al.  A Genetic Algorithm for Vertical Fragmentation and Access Path Selection , 2000, Comput. J..

[10]  Michael Hammer,et al.  A heuristic approach to attribute partitioning , 1979, SIGMOD '79.

[11]  Robert T. Sumichrast,et al.  CPG EA : a grouping genetic algorithm for material cutting plan generation , 2003 .

[12]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[13]  Dennis G. Severance,et al.  The use of cluster analysis in physical data base design , 1975, VLDB '75.

[14]  Emanuel Falkenauer,et al.  Genetic Algorithms and Grouping Problems , 1998 .

[15]  Nicholas J. Radcliffe,et al.  Forma Analysis and Random Respectful Recombination , 1991, ICGA.

[16]  Shamkant B. Navathe,et al.  Vertical partitioning for database design: a graphical algorithm , 1989, SIGMOD '89.

[17]  Frank Ruskey,et al.  Simple Combinatorial Gray Codes Constructed by Reversing Sublists , 1993, ISAAC.

[18]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .