A Heuristic Algorithm Based on Attribute Importance for Feature Selection

In this paper we devote to study some feature selection of an information system in which redundant or insignificant attributes in data sets can be eliminated. An approach of importance gain function is suggested to evaluate the global average information gain associated with a subset of features. A heuristic algorithm on iterative criterion of feature selection on the significance of attributes is proposed to get the least reduction of attribute set in knowledge discovery. The feasibility of feature selection proposed here is validated by some of examples.

[1]  M. E. Jernigan,et al.  Texture Analysis and Discrimination in Additive Noise , 1990, Comput. Vis. Graph. Image Process..

[2]  Andrzej Skowron,et al.  Rough set methods in feature selection and recognition , 2003, Pattern Recognit. Lett..

[3]  B.F. Momin,et al.  Reduct Generation and Classification of Gene Expression Data , 2006, 2006 International Conference on Hybrid Information Technology.

[4]  Marco Pastori,et al.  Information mining in remote sensing image archives: system concepts , 2003, IEEE Trans. Geosci. Remote. Sens..

[5]  Dominik Slezak,et al.  Approximate Entropy Reducts , 2002, Fundam. Informaticae.

[6]  Dominik Slezak,et al.  Normalized Decision Functions and Measures for Inconsistent Decision Tables Analysis , 2000, Fundam. Informaticae.

[7]  Shusaku Tsumoto Statistical extension of rough set rule induction , 2001, SPIE Defense + Commercial Sensing.

[8]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[9]  Sushmita Mitra,et al.  Evolutionary Rough Feature Selection in Gene Expression Data , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[10]  T. Medhat,et al.  Topological reduction of information systems , 2005 .

[11]  Renpu Li,et al.  Mining classification rules using rough sets and neural networks , 2004, Eur. J. Oper. Res..

[12]  Xinyu Shao,et al.  Integrating data mining and rough set for customer group-based discovery of product configuration rules , 2006 .

[13]  Roman Slowinski,et al.  Rough Classification of Patients After Highly Selective Vagotomy for Duodenal Ulcer , 1986, Int. J. Man Mach. Stud..