Sequential Projection Pursuit with Kernel Matrix Update and Symbolic Model Selection

This paper proposes a novel way for generating reliable low-dimensional features with improved class separability in a kernel-induced feature space. The feature projections rely on a very efficient sequential projection pursuit method, adapted to support nonlinear projections using a new kernel matrix update scheme. This enables the gradual removal of structure from the space of residual dimensions to allow the recovery of multiple projections. An adaptive kernel function is employed to unfold different types of data characteristics. We follow a holistic model selection procedure that, together with the optimal projections, dimensionality, and kernel parameters, additionally optimizes symbolically the projection index that controls the actual measurement of the data interestingness without user interaction. We tackle the underlying complex bi-level optimization model as a mixture of evolutionary and gradient search. The effectiveness of the proposed algorithm over existing approaches is demonstrated with benchmark evaluations and comparisons.

[1]  D. Massart,et al.  Sequential projection pursuit using genetic algorithms for data mining of analytical data. , 2000, Analytical chemistry.

[2]  Liwei Wang,et al.  On Feature Extraction via Kernels , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Tingting Mu,et al.  Adaptive Data Embedding Framework for Multiclass Classification , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[5]  Shingo Mabu,et al.  Enhanced decision making mechanism of rule-based genetic network programming for creating stock trading signals , 2013, Expert Syst. Appl..

[6]  Jing Liu,et al.  A multiagent evolutionary algorithm for constraint satisfaction problems , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[7]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[8]  Frank Y. Shih,et al.  Genetic algorithm based methodology for breaking the steganalytic systems , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  Yuxiao Hu,et al.  Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Tom Diethe,et al.  Online Learning with (Multiple) Kernels: A Review , 2013, Neural Computation.

[11]  Asoke K. Nandi,et al.  Automatic Modulation Classification Using Combination of Genetic Programming and KNN , 2012, IEEE Transactions on Wireless Communications.

[12]  Robert P. W. Duin,et al.  A Generalized Kernel Approach to Dissimilarity-based Classification , 2002, J. Mach. Learn. Res..

[13]  MengChu Zhou,et al.  Deadlock-Free Genetic Scheduling Algorithm for Automated Manufacturing Systems Based on Deadlock Control Policy , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14]  Tingting Mu,et al.  Proximity-Based Frameworks for Generating Embeddings from Multi-Output Data , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[16]  Jenq-Neng Hwang,et al.  Tracking Human Under Occlusion Based on Adaptive Multiple Kernels With Projected Gradients , 2013, IEEE Transactions on Multimedia.

[17]  Yang Zhang,et al.  A generic optimising feature extraction method using multiobjective genetic programming , 2011, Appl. Soft Comput..

[18]  Francisco Herrera,et al.  A Survey on the Application of Genetic Programming to Classification , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[19]  Jason F. Ralph,et al.  Automatic Induction of Projection Pursuit Indices , 2010, IEEE Transactions on Neural Networks.

[20]  Shiro Usui,et al.  Mutation-based genetic neural network , 2005, IEEE Transactions on Neural Networks.

[21]  Sebastián Ventura,et al.  Using Ant Programming Guided by Grammar for Building Rule-Based Classifiers , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems , 2002, Genetic Algorithms and Evolutionary Computation.

[23]  Jon Atli Benediktsson,et al.  A Novel Technique for Optimal Feature Selection in Attribute Profiles Based on Genetic Algorithms , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[24]  Jing Huang,et al.  Rotation invariant iris feature extraction using Gaussian Markov random fields with non-separable wavelet , 2010, Neurocomputing.

[25]  Dit-Yan Yeung,et al.  Robust path-based spectral clustering , 2008, Pattern Recognit..

[26]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[27]  John W. Tukey,et al.  A Projection Pursuit Algorithm for Exploratory Data Analysis , 1974, IEEE Transactions on Computers.

[28]  Jian-Huang Lai,et al.  GA-fisher: a new LDA-based face recognition algorithm with selection of principal components , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[29]  Gunnar Rätsch,et al.  Input space versus feature space in kernel-based methods , 1999, IEEE Trans. Neural Networks.

[30]  Tao Jiang,et al.  Efficient and robust feature extraction by maximum margin criterion , 2003, IEEE Transactions on Neural Networks.

[31]  Erkki Oja,et al.  Independent Component Analysis , 2001 .

[32]  Hiok Chai Quek,et al.  eFSM—A Novel Online Neural-Fuzzy Semantic Memory Model , 2010, IEEE Transactions on Neural Networks.

[33]  R. Spang,et al.  Predicting the clinical status of human breast cancer by using gene expression profiles , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Nikhil R. Pal,et al.  Genetic programming for simultaneous feature selection and classifier design , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[35]  Jieping Ye,et al.  Hypergraph spectral learning for multi-label classification , 2008, KDD.

[36]  Charles T. Zahn,et al.  Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters , 1971, IEEE Transactions on Computers.

[37]  Stephen Lin,et al.  Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Sophia Ananiadou,et al.  DISCOVERING ROBUST EMBEDDINGS IN (DIS)SIMILARITY SPACE FOR HIGH‐DIMENSIONAL LINGUISTIC FEATURES , 2014, Comput. Intell..

[39]  Chein-I Chang,et al.  Unsupervised target detection in hyperspectral images using projection pursuit , 2001, IEEE Trans. Geosci. Remote. Sens..

[40]  Asoke K. Nandi,et al.  Feature generation using genetic programming with comparative partner selection for diabetes classification , 2013, Expert Syst. Appl..

[41]  Elias Kyriakides,et al.  Hybrid Ant Colony-Genetic Algorithm (GAAPI) for Global Continuous Optimization , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[42]  Wei-Chang Yeh,et al.  New Parameter-Free Simplified Swarm Optimization for Artificial Neural Network Training and its Application in the Prediction of Time Series , 2013, IEEE Transactions on Neural Networks and Learning Systems.