Towards an efficient genetic algorithm optimizer for sequential projection pursuit

Sequential projection pursuit (SPP) is a useful tool for revealing interesting structures hidden in high-dimensional data. SPP constructs sequentially the bases of a low-dimensional space where the projected data evidence such structures. Genetic algorithms (GAs) are promising finders of these bases, but their performance is determined by the choice of the crossover operator. Until now it is not clear which operator is more suitable for SPP. In this paper we compare the performance of eight crossover operators: three available in literature (arithmetic, single-point and multi-point) and five newly proposed here (two hyperconic, two fitness-biased and one extension of arithmetic crossover). The results on five benchmark datasets showed that the proposed hyperconic operators have the best performance in finding high-fitness projections. The performance of a canonical GA with one of these hyperconic operators was compared against two representative SPP optimizers, the PSO and the RSSA algorithms. We found that our GA with the hyperconic operator tends to find better solutions than the other methods at different numbers of fitness computations. These results suggest that the optimization of SPP can be improved with GAs by taking advantage of the exploratory capabilities of the proposed hyperconic operators.

[1]  Zbigniew Michalewicz,et al.  Evolutionary Algorithms in Engineering Applications , 1997, Springer Berlin Heidelberg.

[2]  Lester Ingber,et al.  Simulated annealing: Practice versus theory , 1993 .

[3]  Shin Ishii,et al.  A Bayesian missing value estimation method for gene expression profile data , 2003, Bioinform..

[4]  G. Nason Three‐Dimensional Projection Pursuit , 1995 .

[5]  C. Posse Projection pursuit exploratory data analysis , 1995 .

[6]  Wojtek J. Krzanowski,et al.  Projection Pursuit Clustering for Exploratory Data Analysis , 2003 .

[8]  Zbigniew Michalewicz,et al.  Genetic algorithms + data structures = evolution programs (3rd ed.) , 1996 .

[9]  B. W. Wright,et al.  An improved optimization algorithm and a Bayes factor termination criterion for sequential projection pursuit , 2005 .

[10]  Alain Berro,et al.  Genetic algorithms and particle swarm optimization for exploratory projection pursuit , 2010, Annals of Mathematics and Artificial Intelligence.

[11]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[12]  A. Buja,et al.  Projection Pursuit Indexes Based on Orthonormal Function Expansions , 1993 .

[13]  Alden H. Wright,et al.  Genetic Algorithms for Real Parameter Optimization , 1990, FOGA.

[14]  Mancang Liu,et al.  Prediction of ozone tropospheric degradation rate constants by projection pursuit regression. , 2007, Analytica chimica acta.

[15]  Eun-Kyung Lee,et al.  Projection Pursuit for Exploratory Supervised Classification , 2005 .

[16]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[17]  Chein-I Chang,et al.  Unsupervised target detection in hyperspectral images using projection pursuit , 2001, IEEE Trans. Geosci. Remote. Sens..

[18]  C. Posse An effective two-dimensional projection pursuit algorithm , 1990 .

[19]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[20]  John W. Tukey,et al.  A Projection Pursuit Algorithm for Exploratory Data Analysis , 1974, IEEE Transactions on Computers.

[21]  Jason F. Ralph,et al.  Automatic Induction of Projection Pursuit Indices , 2010, IEEE Transactions on Neural Networks.

[22]  Kenneth A. De Jong,et al.  An Analysis of Multi-Point Crossover , 1990, FOGA.

[23]  F. Prieto,et al.  Cluster Identification Using Projections , 2001 .

[24]  J. Friedman Exploratory Projection Pursuit , 1987 .

[25]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[26]  Vince D. Calhoun,et al.  A projection pursuit algorithm to classify individuals using fMRI data: Application to schizophrenia , 2008, NeuroImage.

[27]  J. Kruskal TOWARD A PRACTICAL METHOD WHICH HELPS UNCOVER THE STRUCTURE OF A SET OF MULTIVARIATE OBSERVATIONS BY FINDING THE LINEAR TRANSFORMATION WHICH OPTIMIZES A NEW “INDEX OF CONDENSATION” , 1969 .

[28]  Gilbert Syswerda,et al.  Uniform Crossover in Genetic Algorithms , 1989, ICGA.

[29]  Y. Heyden,et al.  Robust statistics in data analysis — A review: Basic concepts , 2007 .

[30]  R. Eberhart,et al.  Fuzzy adaptive particle swarm optimization , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[31]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[32]  Mauro Birattari,et al.  Swarm Intelligence , 2012, Lecture Notes in Computer Science.

[33]  D. Massart,et al.  Sequential projection pursuit using genetic algorithms for data mining of analytical data. , 2000, Analytical chemistry.

[34]  Kathryn A. Dowsland,et al.  Simulated Annealing , 1989, Encyclopedia of GIS.

[35]  Desire L. Massart,et al.  Feature selection in sequential projection pursuit , 2001 .

[36]  C. Posse Tools for Two-Dimensional Exploratory Projection Pursuit , 1995 .

[37]  Lalit M. Patnaik,et al.  Genetic algorithms: a survey , 1994, Computer.