A Genetic Algorithm for Optimizing the Label Ordering in Multi-label Classifier Chains

First proposed in 2009, the classifier chains model (CC) has become one of the most influential algorithms for multi-label classification. It is distinguished by its simple and effective approach to exploit label dependencies. The CC method involves the training of q single-label binary classifiers, where each one is solely responsible for classifying a specific label in ll, ..., lq. These q classifiers are linked in a chain, such that each binary classifier is able to consider the labels predicted by the previous ones as additional information at classification time. The label ordering has a strong effect on predictive accuracy, however it is decided at random and/or combining random orders via an ensemble. A disadvantage of the ensemble approach consists of the fact that it is not suitable when the goal is to generate interpretable classifiers. To tackle this problem, in this work we propose a genetic algorithm for optimizing the label ordering in classifier chains. Experiments on diverse benchmark datasets, followed by the Wilcoxon test for assessing statistical significance, indicate that the proposed strategy produces more accurate classifiers.

[1]  Amanda Clare,et al.  Knowledge Discovery in Multi-label Phenotype Data , 2001, PKDD.

[2]  Grigorios Tsoumakas,et al.  MULAN: A Java Library for Multi-Label Learning , 2011, J. Mach. Learn. Res..

[3]  Lior Rokach,et al.  Taxonomy for characterizing ensemble methods in classification tasks: A review and annotated bibliography , 2009, Comput. Stat. Data Anal..

[4]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[5]  Eyke Hüllermeier,et al.  Bayes Optimal Multilabel Classification via Probabilistic Classifier Chains , 2010, ICML.

[6]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[7]  Adriano Lorena Inácio de Oliveira,et al.  Comparative Study of FOREX Trading Systems Built with SVR+GHSOM and Genetic Algorithms Optimization of Technical Indicators , 2012, 2012 IEEE 24th International Conference on Tools with Artificial Intelligence.

[8]  Pablo Hernandez-Leal,et al.  Hybrid Binary-Chain Multi-label Classifiers , 2012 .

[9]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[10]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[11]  Luca Martino,et al.  Efficient monte carlo optimization for multi-label classifier chains , 2012, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[12]  Pedro M. Domingos The Role of Occam's Razor in Knowledge Discovery , 1999, Data Mining and Knowledge Discovery.

[13]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[14]  Ian Witten,et al.  Data Mining , 2000 .

[15]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[16]  Dan Boneh,et al.  On genetic algorithms , 1995, COLT '95.

[17]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[18]  Dayou Liu,et al.  Genetic Algorithm with Local Search for Community Mining in Complex Networks , 2010, 2010 22nd IEEE International Conference on Tools with Artificial Intelligence.

[19]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[20]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[21]  Charles Elkan,et al.  Beam search algorithms for multilabel learning , 2013, Machine Learning.

[22]  Giovanni Seni,et al.  Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions , 2010, Ensemble Methods in Data Mining.

[23]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[24]  Ivo Gonçalves,et al.  Balancing Learning and Overfitting in Genetic Programming with Interleaved Sampling of Training Data , 2013, EuroGP.

[25]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[26]  Alex Alves Freitas,et al.  A Tutorial on Multi-label Classification Techniques , 2009, Foundations of Computational Intelligence.

[27]  Ian H. Witten,et al.  Data Mining: Practical Machine Learning Tools and Techniques, 3/E , 2014 .

[28]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[29]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[30]  Mukund Seshadri,et al.  Comprehensibility & Overfitting Avoidance in Genetic Programming for Technical Trading Rules , 2003 .

[31]  Dr. Alex A. Freitas Data Mining and Knowledge Discovery with Evolutionary Algorithms , 2002, Natural Computing Series.

[32]  Concha Bielza,et al.  Bayesian Chain Classifiers for Multidimensional Classification , 2011, IJCAI.

[33]  Sunita Sarawagi,et al.  Discriminative Methods for Multi-labeled Classification , 2004, PAKDD.

[34]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[35]  Mohak Shah,et al.  Evaluating Learning Algorithms: A Classification Perspective , 2011 .

[36]  Concha Bielza,et al.  Multi-dimensional classification with Bayesian networks , 2011, Int. J. Approx. Reason..