A genetic algorithm for improving the induction of attribute-based decision graph classifiers

Attribute-based Decision Graphs (AbDG) have been recently proposed as a novel and effective way to represent data as weighted labeled graphs. However, for some domains, the definition of a graph structure that best fits the data can be a hard task. In machine learning it is very common to rely on evolutionary algorithms to guide the model selection phase of learning processes. Particularly, as far as classification tasks are concerned, evolutionary algorithms can be of great help when searching for characteristics which would promote the induction of a suitable classifier, without the need to exhaustively test all possibilities. This paper proposes a genetic algorithm (GA-AbDG) which explores the possible benefits of inducing part of the structure of AbDGs, by evolving suitable edge sets for them. In addition, the paper shows that the results obtained with the GA-AbDG algorithm outperform a prior proposal, with a fixed p-partite structure of AbDGs, as well as the C4.5.

[1]  Huan Liu,et al.  Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[2]  Michaël Aupetit,et al.  High-dimensional labeled data analysis with topology representing graphs , 2005, Neurocomputing.

[3]  Maria do Carmo Nicoletti,et al.  An investigation of the use of three selection-based genetic algorithm families when minimizing the production cost of hollow core slabs , 2005 .

[4]  Nor Ashidi Mat Isa,et al.  Adaptive Evolutionary Artificial Neural Networks for Pattern Classification , 2011, IEEE Transactions on Neural Networks.

[5]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[6]  Maria do Carmo Nicoletti,et al.  Attribute-based Decision Graphs for multiclass data classification , 2013, 2013 IEEE Congress on Evolutionary Computation.

[7]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[8]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[9]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[10]  Raymond Chiong,et al.  Nature-Inspired Algorithms for Optimisation , 2009, Nature-Inspired Algorithms for Optimisation.

[11]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[12]  Maria do Carmo Nicoletti,et al.  Ensemble of complete P-partite graph classifiers for non-stationary environments , 2013, 2013 IEEE Congress on Evolutionary Computation.

[13]  Hong Zhao,et al.  Pattern recognition with weighted complex networks. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Shiro Usui,et al.  Mutation-based genetic neural network , 2005, IEEE Transactions on Neural Networks.

[15]  Lakhmi C. Jain,et al.  Evolutionary computation in data mining , 2005 .

[16]  Victor O. K. Li,et al.  Evolutionary artificial neural network based on Chemical Reaction Optimization , 2011, 2011 IEEE Congress of Evolutionary Computation (CEC).

[17]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[18]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[19]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1992, Artificial Intelligence.

[20]  Duoqian Miao,et al.  A graph-theoretical clustering method based on two rounds of minimum spanning trees , 2010, Pattern Recognit..

[21]  Raymond Chiong,et al.  Novel evolutionary algorithms for supervised classification problems: an experimental study , 2011, Evol. Intell..