Mining classification rules with Reduced MEPAR-miner Algorithm

Abstract In this study, a new classification technique based on rough set theory and MEPAR-miner algorithm for association rule mining is introduced. Proposed method is called as ‘Reduced MEPAR-miner Algorithm’. In the method being improved rough sets are used in the preprocessing stage in order to reduce the dimensionality of the feature space and improved MEPAR-miner algorithms are then used to extract the classification rules. Besides, a new and an effective default class structure is also defined in this proposed method. Integrating rough set theory and improved MEPAR-miner algorithm, an effective rule mining structure is acquired. The effectiveness of our approach is tested on eight publicly available binary and n -ary classification data sets. Comprehensive experiments are performed to demonstrate that Reduced MEPAR-miner Algorithm can discover effective classification rules which are as good as (or better) the other classification algorithms. These promising results show that the rough set approach is a useful tool for preprocessing of data for improved MEPAR-miner algorithm.

[1]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[2]  Eiichiro Tazaki,et al.  Rough neural classifier system , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[3]  T. Ravindra Babu,et al.  Hybrid learning scheme for data mining applications , 2004, Fourth International Conference on Hybrid Intelligent Systems (HIS'04).

[4]  Mihai Oltean,et al.  Evolving digital circuits using multi expression programming , 2004, Proceedings. 2004 NASA/DoD Conference on Evolvable Hardware, 2004..

[5]  Julian F. Miller,et al.  Genetic and Evolutionary Computation — GECCO 2003 , 2003, Lecture Notes in Computer Science.

[6]  Sankar K. Pal,et al.  Case generation using rough sets with fuzzy representation , 2004, IEEE Transactions on Knowledge and Data Engineering.

[7]  Alex A. Freitas,et al.  A survey of evolutionary algorithms for data mining and knowledge discovery , 2003 .

[8]  Alex A. Freitas,et al.  An ant colony based system for data mining: applications to medical data , 2001 .

[9]  Lale Özbakir,et al.  MEPAR-miner: Multi-expression programming for classification rule mining , 2007, Eur. J. Oper. Res..

[10]  David R. Gilbert,et al.  An Empirical Comparison of Supervised Machine Learning Techniques in Bioinformatics , 2003, APBC.

[11]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[12]  Ching-Chang Wong,et al.  Fuzzy rules extraction by a hybrid method for pattern classification , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[13]  Darrell Whitley,et al.  Genitor: a different genetic algorithm , 1988 .

[14]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[15]  Te-sheng Li,et al.  A hybrid approach of rough set theory and genetic algorithm for fault diagnosis , 2005 .

[16]  Renpu Li,et al.  Mining classification rules using rough sets and neural networks , 2004, Eur. J. Oper. Res..

[17]  Gexiang Zhang,et al.  A Hybrid Classifier Based on Rough Set Theory and Support Vector Machines , 2005, FSKD.

[18]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[19]  Alex Alves Freitas,et al.  Data mining with an ant colony optimization algorithm , 2002, IEEE Trans. Evol. Comput..

[20]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[21]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[22]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.

[23]  Staal A. Vinterbo,et al.  Minimal approximate hitting sets and rule templates , 2000, Int. J. Approx. Reason..

[24]  M. Oltean,et al.  Multi Expression Programming , 2021 .

[25]  Tong Heng Lee,et al.  A distributed evolutionary classifier for knowledge discovery in data mining , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[26]  Eiichiro Tazaki,et al.  INDUCTION OF KNOWLEDGE USING EVOLUTIONARY ROUGH SET THEORY , 2003, Cybern. Syst..

[27]  Andrzej Skowron,et al.  Rudiments of rough sets , 2007, Inf. Sci..

[28]  Pasi Luukka Similarity Classifier using Measure Derived from Yus Norms Applied to Medical Data Sets , 2006, 2006 IEEE International Conference on Fuzzy Systems.

[29]  Witold Pedrycz,et al.  Data Mining Methods for Knowledge Discovery , 1998, IEEE Trans. Neural Networks.

[30]  Jaroslaw Stepaniuk,et al.  Hybrid Classifier Based on Rough Sets and Neural Networks , 2003, Electron. Notes Theor. Comput. Sci..

[31]  Alex A. Freitas,et al.  New Results for a Hybrid Decision Tree/Genetic Algorithm for Data Mining , 2004 .