Disk Allocation Methods Using Genetic Allgorithm

The disk allocation problem examined in this paper is finding a method to distribute a Binary Cartesian Product File on multiple disks to maximize parallel disk I/O accesses for partial match retrieval. This problem is known to be NP-hard, and heuristic approaches have been applied to obtain suboptimal solutions. Recently, efficient methods such as Binary Disk Modulo (BDM) and Error Correcting Code (ECC) methods have been proposed along with the restrictions that the number of disks in which files are stored should be a power of 2. In this paper, a new Disk Allocation method based on Genetic Algorithm (DAGA) is proposed. The DAGA does not place restrictions on the number of disks to be applied and it can allocate the disks adaptively by taking into account the data access patterns. Using the schema theory, it is proven that the DAGA can realize a near-optimal solution with high probability. Comparing the quality of solution derived by the DAGA with the General Disk Modulo (GDM), BDM, and ECC methods through the simulation, shows that 1) the DAGA is superior to the GDM method in all the cases and 2) with the restrictions being placed on the number of disks, the average response time of the DAGA is always less than that of the BDM method and greater than that of the ECC method in the absence of data skew and 3) when data skew is considered, the DAGA performs better than or equal to both BDM and ECC methods, even when restrictions on the number of disks are enforced. key words: disk allocation, genetic algorithm, parallel I/O, partial match query, cartesian product file

[1]  Hava T. Siegelmann,et al.  On the allocation of documents in multiprocessor information retrieval systems , 1991, SIGIR '91.

[2]  J. A. Bergstra Disk allocation for cartesian product files, a counterexample , 1983 .

[3]  Tag Gon Kim,et al.  Multidisk Partial Match File Design with Known Access Pattern , 1993, Inf. Process. Lett..

[4]  Jens Lienig,et al.  A Parallel Genetic Algorithm for , 1997 .

[5]  Heinz Mühlenbein,et al.  The parallel genetic algorithm as function optimizer , 1991, Parallel Comput..

[6]  Chin-Chen Chang,et al.  Performance of two-disk partition data allocations , 1987, BIT.

[7]  A. L. Narasimha Reddy,et al.  An Evaluation of Multiple-Disk I/O Systems , 1989, IEEE Trans. Computers.

[8]  David E. Goldberg,et al.  Alleles, loci and the traveling salesman problem , 1985 .

[9]  Michelle Y. Kim,et al.  Synchronized Disk Interleaving , 1986, IEEE Transactions on Computers.

[10]  H KatzRandy,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988 .

[11]  Sam Yuan Sung Performance Analysis of Disk Modulo Allocation Method for Cartesian Product Files , 1987, IEEE Transactions on Software Engineering.

[12]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[13]  Khaled A. S. Abdel-Ghaffar,et al.  Optimal disk allocation for partial match queries , 1993, TODS.

[14]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[15]  Christos Faloutsos,et al.  Disk Allocation Methods Using Error Correcting Codes , 1991, IEEE Trans. Computers.

[16]  Chin-Chen Chang,et al.  Performance Analysis of the Generalised Disc Modulo Allocation method for Multiple Key Hashing Files on Multi-Disc Systems , 1987, Comput. J..

[17]  Sakti Pramanik,et al.  Optimal file distribution for partial match retrieval , 1988, SIGMOD '88.

[18]  David B. Fogel,et al.  An introduction to simulated evolutionary optimization , 1994, IEEE Trans. Neural Networks.

[19]  Hung-Chang Du Disk allocation methods for binary Cartesian product files , 1986, BIT Comput. Sci. Sect..

[20]  Kejitan Domas Discovery of Maximal Distance Codes Using Genetic Algorithms , 1990 .

[21]  Yuan Y. Sung Parallel searching for binary Cartesian product files , 1985, CSC '85.

[22]  Lalit M. Patnaik,et al.  Genetic algorithms: a survey , 1994, Computer.

[23]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .