Memetic NSGA - a multi-objective genetic algorithm for classification of microarray data

In Gene Expression studies, the identification of gene subsets responsible for classifying available samples to two or more classes is an important task. One major difficulty in identifying these gene subsets is the availability of only a few samples compared to the number of genes in the samples. Here we treat this problem as a Multi-objective optimization problem of minimizing the gene subset size and minimizing the number of misclassified samples. We present a new elitist non-dominated sorting-based genetic algorithm (NSGA) called memetic- NSGA which uses the concept of memes. Memes are a group of genes which have a particular functionality at the phenotype level. We have chosen a 50 gene Leukemia dataset to evaluate our algorithm. A comparative study between Memetic-NSGA and another non-dominated sorting genetic algorithm, called NSGA-II, is presented. Memetic-NSGA is found to perform better in terms of execution time and gene-subset length identified.

[1]  Gary B. Lamont,et al.  Multiobjective Evolutionary Algorithms: Analyzing the State-of-the-Art , 2000, Evolutionary Computation.

[2]  Johan Andersson,et al.  A survey of multiobjective optimization in engineering design , 2001 .

[3]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[4]  C R Appledorn,et al.  A new approach to the interpolation of sampled data , 1996, IEEE Trans. Medical Imaging.

[5]  R. Lewontin ‘The Selfish Gene’ , 1977, Nature.

[6]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[7]  Everardo Gutiérrez,et al.  Multi-objective Go with the Winners Algorithm: A Preliminary Study , 2005, EMO.

[8]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[9]  Sandip Sarkar,et al.  Early Vision and Image Processing: Evidences Favouring a Dynamic Receptive Field Model , 2006, ICVGIP.

[10]  Bþ KHI,et al.  Classification of Two-Class Cancer Data Reliably Using , .

[11]  Sandip Sarkar,et al.  Understanding image structure from a new multi-scale representation of higher order derivative filters , 2007, Image Vis. Comput..

[12]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[13]  Pedro Oliveira,et al.  An evolution strategy for multiobjective optimization , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[14]  Hussein A. Abbass,et al.  A Memetic Pareto Evolutionary Approach to Artificial Neural Networks , 2001, Australian Joint Conference on Artificial Intelligence.

[15]  S. Sarkar,et al.  A Weighted Sum of Multi-scale Gaussians Generates New Near-ideal Interpolation Functions , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[16]  R. K. Ursem Multi-objective Optimization using Evolutionary Algorithms , 2009 .

[17]  Carlos A. Coello Coello,et al.  Evolutionary multiobjective optimization using a cultural algorithm , 2003, Proceedings of the 2003 IEEE Swarm Intelligence Symposium. SIS'03 (Cat. No.03EX706).

[18]  Sandip Sarkar,et al.  A possible mechanism of zero-crossing detection using the concept of the extended classical receptive field of retinal ganglion cells , 2005, Biological Cybernetics.

[19]  Songde Ma,et al.  Derivative computation by multiscale filters , 1998, Image Vis. Comput..

[20]  Shawki Areibi,et al.  A COMPARISON OF GENETIC / MEMETIC ALGORITHMS AND OTHER HEURISTIC , 2001 .