Feature Selection with Positive Region Constraint for Test-Cost-Sensitive Data

In many data mining and machine learning applications, data are not free, and each data item has a test cost. For economic, technological, and legal reasons, it is neither possible nor necessary to obtain a classifier with 100% accuracy. In this paper, we consider this situation and propose a new constraint satisfaction problem to address it: one has to minimize the test cost while keeping the classification accuracy within a given bound. The constraint is expressed by the positive region, whereas the objective is to minimize the total test cost. The new problem is essentially a dual of the test cost constraint attribute reduction problem, which has been addressed recently. We propose a heuristic algorithm based on the information gain, the test cost, and a user-specified parameter \(\lambda \) to deal with the new problem. The algorithm is tested on four University of California, Irvine (UCI) datasets with various test cost settings. Experimental results indicate that the algorithm finds an optimal feature subset in most cases, that the rational setting of \(\lambda \) differs among datasets, and that the algorithm is especially stable when the test cost follows the Pareto distribution.
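To make the problem statement concrete, the following is a minimal Python sketch of a greedy heuristic of the kind described above. It is not the paper's algorithm. Several details are assumptions made for illustration only: the weighting `gain * cost ** lam` with a non-positive `lam`, the parameter `min_ratio` (the fraction of the full positive region that must be preserved), the final step that drops redundant attributes, and all function names. Test costs are assumed to be strictly positive.

```python
from collections import Counter, defaultdict
from math import log2


def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())


def positive_region_size(rows, labels, attrs):
    """Count objects whose equivalence class has a single decision value."""
    return sum(len(b) for b in partition(rows, attrs)
               if len({labels[i] for i in b}) == 1)


def conditional_entropy(rows, labels, attrs):
    """Entropy of the decision given the partition induced by attrs."""
    n = len(rows)
    h = 0.0
    for b in partition(rows, attrs):
        for count in Counter(labels[i] for i in b).values():
            h -= (count / n) * log2(count / len(b))
    return h


def select_features(rows, labels, costs, lam=-1.0, min_ratio=1.0):
    """Greedy sketch: add attributes by gain * cost**lam until the positive
    region constraint holds, then drop attributes that are not needed."""
    all_attrs = list(range(len(costs)))
    # Assumed form of the constraint: keep at least min_ratio of the
    # positive region obtained with all attributes.
    target = min_ratio * positive_region_size(rows, labels, all_attrs)
    chosen = []
    while positive_region_size(rows, labels, chosen) < target:
        base = conditional_entropy(rows, labels, chosen)

        def score(a):
            gain = base - conditional_entropy(rows, labels, chosen + [a])
            return gain * costs[a] ** lam  # lam <= 0 penalizes costly tests

        remaining = [a for a in all_attrs if a not in chosen]
        chosen.append(max(remaining, key=score))
    # Try to remove attributes, most expensive first, while the constraint holds.
    for a in sorted(chosen, key=lambda x: -costs[x]):
        rest = [x for x in chosen if x != a]
        if positive_region_size(rows, labels, rest) >= target:
            chosen = rest
    return sorted(chosen), sum(costs[a] for a in chosen)


# Toy decision table (hypothetical data): attributes 0 and 2 both determine
# the decision, but attribute 0 is five times as expensive as attribute 2.
rows = [(0, 0, 0), (0, 1, 0), (1, 0, 1), (1, 1, 1)]
labels = [0, 0, 1, 1]
costs = [10, 1, 2]
print(select_features(rows, labels, costs, lam=-1.0))
print(select_features(rows, labels, costs, lam=0.0))
```

In this toy table, `lam=0.0` ignores test costs, so the tie between attributes 0 and 2 is broken by position and the expensive attribute is selected. With `lam=-1.0` the cheaper attribute 2 wins. This mirrors the role the abstract assigns to \(\lambda \) as a trade-off between information gain and test cost.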
