A Case-Based Meta-Learning and Reasoning Framework for Classifiers Selection

In machine learning area, a large number of classification algorithms are available that can be used for solving the problems of prediction and classification in different domains. These classifiers perform differently on different learning problems. For example, if one algorithm perform better on one dataset, the same algorithm may perform badly on another dataset. The reason is that each dataset has its own nature in terms of its local and global characteristics. Similarly, the number of candidate algorithms are also large in number and is therefore very hard for a machine learning practitioner to know the intrinsic behaviors of the algorithms on different kinds of datasets and are therefore unable to select a right algorithm for his problem in-hand. To overcome the issue, this study proposes an automatic classifier selection methodology. A case-based meta-learning and reasoning (CB-MLR) framework is designed and implemented to recommend appropriate classifier for mining the new dataset. The framework exploits inherit characteristics of the datasets mapped against the algorithms performance. The key contributions of CB-MLR include: (a) design of a flexible and incremental meta-learning and reasoning framework using multi-view learning, and (b) implementation of the CBR methodology to accurately recommend most relevant top-3 classifiers as the suggested algorithms for the new data mining problem. The proposed framework is tested for 9 decision tree classifiers, from Weka environment, and 52 datasets from UCI repository over a case-base of 100 resolved cases. The accuracy obtained is 94% within the scope of top-3 most relevant classifiers.

[1]  Carlos Soares,et al.  Ranking Learning Algorithms: Using IBL and Meta-Learning on Accuracy and Time Results , 2003, Machine Learning.

[2]  João Gama,et al.  Characterization of Classification Algorithms , 1995, EPIA.

[3]  David W. Aha,et al.  Generalizing from Case studies: A Case Study , 1992, ML.

[4]  Carla E. Brodley,et al.  Addressing the Selective Superiority Problem: Automatic Algorithm/Model Class Selection , 1993 .

[5]  Kate Smith-Miles,et al.  Cross-disciplinary perspectives on meta-learning for algorithm selection , 2009, CSUR.

[6]  Dirk Söffker,et al.  Case indexing in Case-Based Reasoning by applying Situation Operator Model as knowledge representation model , 2015 .

[7]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[8]  Qinbao Song,et al.  Automatic recommendation of classification algorithms based on data set characteristics , 2012, Pattern Recognit..

[9]  R. Geoff Dromey,et al.  An algorithm for the selection problem , 1986, Softw. Pract. Exp..

[10]  Ian H. Witten,et al.  WEKA - Experiences with a Java Open-Source Project , 2010, J. Mach. Learn. Res..

[11]  Qinbao Song,et al.  A Generic Multilabel Learning-Based Classification Algorithm Recommendation Method , 2014, TKDD.

[12]  Tin Kam Ho,et al.  Domain of competence of XCS classifier system in complexity measurement space , 2005, IEEE Transactions on Evolutionary Computation.

[13]  Kate Smith-Miles,et al.  On learning algorithm selection for classification , 2006, Appl. Soft Comput..

[14]  Pedro A. González-Calero,et al.  JColibri: An Object-Oriented Framework for Building CBR Systems , 2004, ECCBR.

[15]  Luís Torgo,et al.  OpenML: A Collaborative Science Platform , 2013, ECML/PKDD.

[16]  João Gama,et al.  Characterizing the Applicability of Classification Algorithms Using Meta-Level Learning , 1994, ECML.

[17]  Carla E. Brodley,et al.  Automatic Algorith/Model Class Selection , 1993, International Conference on Machine Learning.