A Study of Algorithm Selection in Data Mining using Meta-Learning

This article discusses the algorithm selection problem in data mining with the help of meta-learning. We present the issue with the help of the classification and clustering problems. In this study, we have analyzed the working of a metalearning system in connection with the classical algorithm selection problem. Various ranking combination methods available in the literature have been explored from the perspective of the measurement system. Discussion about two new ranking combination methods namely the relative ranking and the percentage ranking have been included. The study also identifies few potential challenges in relation to algorithm selection in data mining using meta-learning.

[1]  Carlos Soares,et al.  A Comparison of Ranking Methods for Classification Algorithm Selection , 2000, ECML.

[2]  Teresa Bernarda Ludermir,et al.  Selective generation of training examples in active meta-learning , 2008, Int. J. Hybrid Intell. Syst..

[3]  Neena Sondhi,et al.  Research methodology concepts and cases , 2012 .

[4]  Hilan Bensusan,et al.  Tell me who can learn you and I can tell you who you are: Landmarking Various Learning Algorithms , 2000 .

[5]  Francisco de A. T. de Carvalho,et al.  An Analysis of Meta-learning Techniques for Ranking Clustering Algorithms Applied to Artificial Data , 2009, ICANN.

[6]  Ricardo Vilalta,et al.  A Perspective View and Survey of Meta-Learning , 2002, Artificial Intelligence Review.

[7]  Chuleui Hong,et al.  A Meta-Learning Approach for Combining Multiple Classifiers , 2013 .

[8]  Teresa Bernarda Ludermir,et al.  Combining Meta-learning and Active Selection of Datasetoids for Algorithm Selection , 2011, HAIS.

[9]  Christophe G. Giraud-Carrier Metalearning - A Tutorial , 2008 .

[10]  David W. Aha,et al.  Generalizing from Case studies: A Case Study , 1992, ML.

[11]  Jong-Seok Lee,et al.  A meta-learning approach for determining the number of clusters with consideration of nearest neighbors , 2013, Inf. Sci..

[12]  Christophe G. Giraud-Carrier,et al.  The data mining advisor: meta-learning at the service of practitioners , 2005, Fourth International Conference on Machine Learning and Applications (ICMLA'05).

[13]  Alexandros Kalousis,et al.  NOEMON: Design, implementation and performance results of an intelligent assistant for classifier selection , 1999, Intell. Data Anal..

[14]  Andreas Dengel,et al.  Automatic classifier selection for non-experts , 2012, Pattern Analysis and Applications.

[15]  John R. Rice,et al.  The Algorithm Selection Problem , 1976, Adv. Comput..

[16]  Alexander Schliep,et al.  Ranking and selecting clustering algorithms using a meta-learning approach , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[17]  Carlos Soares,et al.  Zoomed Ranking: Selection of Classification Algorithms Based on Relevant Performance Information , 2000, PKDD.

[18]  Ferrante Neri,et al.  Linear Algebra for Computational Sciences and Engineering , 2019 .

[19]  Leandro Nunes de Castro,et al.  Clustering algorithm selection by meta-learning systems: A new distance-based problem characterization and ranking combination methods , 2015, Inf. Sci..

[20]  Kate Smith-Miles,et al.  Cross-disciplinary perspectives on meta-learning for algorithm selection , 2009, CSUR.

[21]  Cao Feng,et al.  STATLOG: COMPARISON OF CLASSIFICATION ALGORITHMS ON LARGE REAL-WORLD PROBLEMS , 1995 .

[22]  Leandro Nunes de Castro,et al.  Clustering Algorithm Recommendation: A Meta-learning Approach , 2012, SEMCCO.

[23]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[24]  Amit Thakkar,et al.  A Survey & Current Research Challenges in Meta Learning Approaches based on Dataset Characteristics , 2012 .

[25]  Ricardo Vilalta,et al.  Using Meta-Learning to Support Data Mining , 2004, Int. J. Comput. Sci. Appl..

[26]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[27]  Kenneth E. Anderson,et al.  MEASUREMENT IN SCIENCE , 1948 .

[28]  Peter A. Flach,et al.  Decision Tree-Based Data Characterization for Meta-Learning , 2002 .

[29]  Ricardo Vilalta,et al.  Metalearning - Applications to Data Mining , 2008, Cognitive Technologies.