On the Crossover Operator for GA-based Optimizers in Sequential Projection Pursuit

Sequential Projection Pursuit (SPP) is a useful tool to uncover structures hidden in high-dimensional data by constructing sequentially the basis of a low-dimensional projection space where the structure is exposed. Genetic algorithms (GAs) are promising finders of optimal basis for SPP, but their performance is determined by the choice of the crossover operator. It is unknown until now which operator is more suitable for SPP. In this paper we compare, over four public datasets, the performance of eight crossover operators: three available in literature (arithmetic, single-point and multi-point) and five new proposed here (two hyperconic, two fitnessbiased and one extension of arithmetic crossover). The proposed hyperconic operators and the multi-point operator showed the best performance, finding high-fitness projections. However, it was noted that the final selection is dependent on the dataset dimension and the timeframe allowed to get the answer. Some guidelines to select the most appropriate operator for each situation are presented.

[1]  J. Kruskal TOWARD A PRACTICAL METHOD WHICH HELPS UNCOVER THE STRUCTURE OF A SET OF MULTIVARIATE OBSERVATIONS BY FINDING THE LINEAR TRANSFORMATION WHICH OPTIMIZES A NEW “INDEX OF CONDENSATION” , 1969 .

[2]  John W. Tukey,et al.  A Projection Pursuit Algorithm for Exploratory Data Analysis , 1974, IEEE Transactions on Computers.

[3]  Vince D. Calhoun,et al.  A projection pursuit algorithm to classify individuals using fMRI data: Application to schizophrenia , 2008, NeuroImage.

[4]  C. Posse An effective two-dimensional projection pursuit algorithm , 1990 .

[5]  P. J. Huber Projection Pursuit for , 2022 .

[6]  B. W. Wright,et al.  An improved optimization algorithm and a Bayes factor termination criterion for sequential projection pursuit , 2005 .

[7]  A. Buja,et al.  Projection Pursuit Indexes Based on Orthonormal Function Expansions , 1993 .

[8]  Jason F. Ralph,et al.  Automatic Induction of Projection Pursuit Indices , 2010, IEEE Transactions on Neural Networks.

[9]  Y. Heyden,et al.  Robust statistics in data analysis — A review: Basic concepts , 2007 .

[10]  Chein-I Chang,et al.  Unsupervised target detection in hyperspectral images using projection pursuit , 2001, IEEE Trans. Geosci. Remote. Sens..

[11]  Desire L. Massart,et al.  Feature selection in sequential projection pursuit , 2001 .

[12]  C. Posse Tools for Two-Dimensional Exploratory Projection Pursuit , 1995 .

[13]  Mancang Liu,et al.  Prediction of ozone tropospheric degradation rate constants by projection pursuit regression. , 2007, Analytica chimica acta.

[14]  Alain Berro,et al.  Genetic algorithms and particle swarm optimization for exploratory projection pursuit , 2010, Annals of Mathematics and Artificial Intelligence.

[15]  F. Prieto,et al.  Cluster Identification Using Projections , 2001 .

[16]  Wojtek J. Krzanowski,et al.  Projection Pursuit Clustering for Exploratory Data Analysis , 2003 .

[17]  Lalit M. Patnaik,et al.  Genetic algorithms: a survey , 1994, Computer.

[18]  D. Massart,et al.  Sequential projection pursuit using genetic algorithms for data mining of analytical data. , 2000, Analytical chemistry.

[19]  G. Nason Three‐Dimensional Projection Pursuit , 1995 .

[20]  C. Posse Projection pursuit exploratory data analysis , 1995 .

[21]  Eun-Kyung Lee,et al.  Projection Pursuit for Exploratory Supervised Classification , 2005 .

[22]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[23]  J. Friedman Exploratory Projection Pursuit , 1987 .

[24]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .