Multiclass Classification Through Multidimensional Clustering

Classification is one of the most important machine learning tasks in science and engineering. However, it can be a difficult task, in particular when a high number of classes is involved. Genetic Programming, despite its recognized successfulness in so many different domains, is one of the machine learning methods that typically struggles, and often fails, to provide accurate solutions for multiclass classification problems. We present a novel algorithm for tree based GP that incorporates some ideas on the representation of the solution space in higher dimensions, and can be generalized to other types of GP. We test three variants of this new approach on a large set of benchmark problems from several different sources, and observe their competitiveness against the most successful state-of-the-art classifiers like Random Forests, Random Subspaces and Multilayer Perceptron.

[1]  Ivanoe De Falco,et al.  Discovering interesting classification rules with genetic programming , 2002, Appl. Soft Comput..

[2]  Venu Govindaraju,et al.  Issues in evolving GP based classifiers for a pattern recognition task , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[3]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[4]  Luis Muñoz,et al.  M3GP - Multiclass Classification with GP , 2015, EuroGP.

[5]  Stephan M. Winkler,et al.  Advanced Genetic Programming Based Machine Learning , 2007, J. Math. Model. Algorithms.

[6]  Yang Zhang,et al.  A Generic Multi-dimensional Feature Extraction Method Using Multiobjective Genetic Programming , 2009, Evolutionary Computation.

[7]  Feiping Nie,et al.  Learning a Mahalanobis distance metric for data clustering and classification , 2008, Pattern Recognit..

[8]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[9]  W. A. Sandham,et al.  A new approach to brain tumour diagnosis using fuzzy logic based genetic programming , 2003, Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No.03CH37439).

[10]  Thomas Haynes,et al.  Collective Adaptation: The Exchange of Coding Segments , 1998, Evolutionary Computation.

[11]  Chih-Jen Lin,et al.  A Comparison of Methods for Multi-class Support Vector Machines , 2015 .

[12]  Francisco Herrera,et al.  A Survey on the Application of Genetic Programming to Classification , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[13]  John R. Koza,et al.  Human-competitive results produced by genetic programming , 2010, Genetic Programming and Evolvable Machines.

[14]  Jesús Alcalá-Fdez,et al.  KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework , 2011, J. Multiple Valued Log. Soft Comput..

[15]  Nikhil R. Pal,et al.  A novel approach to design classifiers using genetic programming , 2004, IEEE Transactions on Evolutionary Computation.

[16]  Walter Alden Tackett,et al.  Genetic Programming for Feature Discovery and Image Discrimination , 1993, ICGA.

[17]  Leonardo Vanneschi,et al.  A Multi-dimensional Genetic Programming Approach for Multi-class Classification Problems , 2014, EuroGP.

[18]  M. C. Sinclair,et al.  Classification rule mining for automatic credit approval using genetic programming , 2007, 2007 IEEE Congress on Evolutionary Computation.

[19]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[20]  Wei-Pang Yang,et al.  Designing a classifier by a layered multi-population genetic programming approach , 2007, Pattern Recognit..

[21]  Lalit M. Patnaik,et al.  Application of genetic programming for multicategory pattern classification , 2000, IEEE Trans. Evol. Comput..

[22]  Arthur Tay,et al.  Mining multiple comprehensible classification rules using genetic programming , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[23]  Mengjie Zhang,et al.  Multiclass Object Classification Using Genetic Programming , 2004, EvoWorkshops.

[24]  Celia C. Bojarczuk,et al.  Genetic programming for knowledge discovery in chest-pain diagnosis. , 2000, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[25]  Wei-Pang Yang,et al.  Classifier design with feature selection and feature extraction using layered genetic programming , 2008, Expert Syst. Appl..

[26]  Leonardo Vanneschi,et al.  Land Cover/Land Use Multiclass Classification Using GP with Geometric Semantic Operators , 2013, EvoApplications.

[27]  Victor Ciesielski,et al.  Genetic Programming for Multiple Class Object Detection , 1999, Australian Joint Conference on Artificial Intelligence.

[28]  Sara Silva,et al.  Classification of Seafloor Habitats Using Genetic Programming , 2008, EvoWorkshops.

[29]  Sara Silva,et al.  Reassembling operator equalisation: a secret revealed , 2011, GECCO '11.

[30]  Riccardo Poli,et al.  Foundations of Genetic Programming , 1999, Springer Berlin Heidelberg.

[31]  Abdul Rauf Baig,et al.  Two-stage learning for multi-class classification using genetic programming , 2013, Neurocomputing.