A Novel Attribute Reduction Algorithm of Decomposition Based on Rough Sets

Attribute reduction is a key task for the research of rough sets. However, when dealing with large-scale data, many existing proposals based on rough set theory get worse performance. In this paper, we propose a novel attribute reduction algorithm of decomposition based on rough sets. The idea of decomposition is to break down a complex table into a super-table and several sub-tables that are simpler, more manageable and solvable by using existing induction methods, then joining them together in order to solve the original table. Compared with the traditional methods, experiments with some standard datasets from UCI database are done and experimental results illustrate that the algorithm of this paper improve computational efficiency.

[1]  Lior Rokach,et al.  Decomposition Methodology for Knowledge Discovery and Data Mining - Theory and Applications , 2005, Series in Machine Perception and Artificial Intelligence.

[2]  Byung Ro Moon,et al.  Hybrid Genetic Algorithms for Feature Selection , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Lior Rokach,et al.  Decomposition methodology for classification tasks: a meta decomposer framework , 2006, Pattern Analysis and Applications.

[4]  Andrzej Skowron,et al.  Decomposition of Task Specification Problems , 1999, ISMIS.

[5]  Qinghua Hu,et al.  Mixed feature selection based on granulation and approximation , 2008, Knowl. Based Syst..

[6]  Andrzej Skowron,et al.  Rough set methods in feature selection and recognition , 2003, Pattern Recognit. Lett..

[7]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[8]  Masahiro Inuiguchi,et al.  Fuzzy rough sets and multiple-premise gradual decision rules , 2006, Int. J. Approx. Reason..

[9]  Zhang Qizhong,et al.  An Approach to Rough Set Decomposition of Incomplete Information Systems , 2007, 2007 2nd IEEE Conference on Industrial Electronics and Applications.

[10]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[11]  Jerzy W. Grzymala-Busse,et al.  Handling Missing Attribute Values , 2010, Data Mining and Knowledge Discovery Handbook.

[12]  Ming Yang,et al.  A novel condensing tree structure for rough set feature selection , 2008, Neurocomputing.

[13]  Miao Duo-qian,et al.  Information-based algorithm for reduction of knowledge , 1997, 1997 IEEE International Conference on Intelligent Processing Systems (Cat. No.97TH8335).

[14]  Jiawei Han,et al.  Data Mining: Concepts and Techniques, Second Edition , 2006, The Morgan Kaufmann series in data management systems.

[15]  J. Grzymala-Busse Data reduction: discretization of numerical attributes , 2002 .

[16]  Rafal Latkowski,et al.  Missing Template Decomposition Method and Its Implementation in Rough Set Exploration System , 2006, RSCTC.