SA Tabu Miner: A hybrid heuristic algorithm for rule induction

This paper presents a Hybrid Heuristic algorithm for induction of classification rules called SA Tabu Miner Simulated Annealing and Tabu Search based Data Miner. The proposed procedure is inspired by both research on heuristic optimization algorithms and rule induction data mining concepts and principles. A comparison is made of the performance of SA Tabu Miner with CN2 and C4.5, well-known data mining algorithms for classification, and Ant-Miner, a recently proposed Ant Colony Optimization based algorithm, over public domain data sets. The results provide evidence that: our algorithm is comparable with CN2, C4.5 and Ant-Miner in terms of predictive accuracy; and the rule lists discovered by our algorithm are considerably simpler smaller than those discovered by other algorithms.

[1]  Christian Voigtmann,et al.  Evolving Classifiers - Evolutionary Algorithms in Data Mining , 2007 .

[2]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[3]  Sotiris B. Kotsiantis,et al.  Locally application of cascade generalization for classification problems , 2008, Intell. Decis. Technol..

[4]  Alex Alves Freitas,et al.  Mining Very Large Databases with Parallel Processing , 1997, The Kluwer International Series on Advances in Database Systems.

[5]  Rema Padman,et al.  Tabu Search Enhanced Markov Blanket Classifier for High Dimensional Data Sets , 2005 .

[6]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.

[7]  Reda Alhajj,et al.  Discovering Accurate and Interesting Classification Rules Using Genetic Algorithm , 2006, DMIN.

[8]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[9]  Maria do Carmo Nicoletti,et al.  The influence of search mechanisms in feature subset selection processes , 2008, Intell. Decis. Technol..

[10]  Vili Podgorelec,et al.  Knowledge discovery with classification rules in a cardiovascular dataset , 2005, Comput. Methods Programs Biomed..

[11]  Fernando E. B. Otero,et al.  Genetic Programming for Attribute Construction in Data Mining , 2002, EuroGP.

[12]  Abbes Amira,et al.  A Novel Prostate Cancer Classification Technique Using Intermediate Memory Tabu Search , 2005, EURASIP J. Adv. Signal Process..

[13]  Jing Liu,et al.  A traveling salesman approach for predicting protein functions , 2006, Source Code for Biology and Medicine.

[14]  Heitor Silvério Lopes,et al.  AN EVOLUTIONARY APPROACH TO SIMULATE COGNITIVE FEEDBACK LEARNING IN MEDICAL DOMAIN , 1997 .

[15]  John H. Holland,et al.  Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems , 1995 .

[16]  Lakhmi C. Jain,et al.  Building a decision making framework using agent teams , 2007, Intell. Decis. Technol..

[17]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[18]  Celia C. Bojarczuk,et al.  Genetic programming for knowledge discovery in chest-pain diagnosis. , 2000, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[19]  Alex A. Freitas,et al.  An Ant Colony Algorithm for Classification Rule Discovery , 2002 .