Feature Selection with Neighborhood Entropy-Based Cooperative Game Theory

Feature selection plays an important role in machine learning and data mining. In recent years, various feature measurements have been proposed to select significant features from high-dimensional datasets. However, most traditional feature selection methods will ignore some features which have strong classification ability as a group but are weak as individuals. To deal with this problem, we redefine the redundancy, interdependence, and independence of features by using neighborhood entropy. Then the neighborhood entropy-based feature contribution is proposed under the framework of cooperative game. The evaluative criteria of features can be formalized as the product of contribution and other classical feature measures. Finally, the proposed method is tested on several UCI datasets. The results show that neighborhood entropy-based cooperative game theory model (NECGT) yield better performance than classical ones.

[1]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[2]  Takeshi Mita,et al.  Discriminative Feature Co-Occurrence Selection for Object Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Kun She,et al.  Multi-Granulation Entropy and Its Applications , 2013, Entropy.

[4]  Eytan Ruppin,et al.  Feature Selection via Coalitional Game Theory , 2007, Neural Computation.

[5]  Kun She,et al.  A Universal neighbourhood rough sets model for knowledge discovering from incomplete heterogeneous data , 2013, Expert Syst. J. Knowl. Eng..

[6]  Xiaofeng Liao,et al.  (Corr. to) Delay-dependent exponential stability analysis of delayed neural networks: an LMI approach , 2002, Neural Networks.

[7]  Hongxia Wang,et al.  Centroid-based semi-fragile audio watermarking in hybrid domain , 2010, Science China Information Sciences.

[8]  Zhou Ming-dan Semantic Web-Oriented Specification of Logic Descriptive Primitives , 2002 .

[9]  Jin Li,et al.  Feature evaluation and selection with cooperative game theory , 2012, Pattern Recognit..

[10]  Salvatore Greco,et al.  Rough sets theory for multicriteria decision analysis , 2001, Eur. J. Oper. Res..

[11]  Jin Li,et al.  Using cooperative game theory to optimize the feature selection problem , 2012, Neurocomputing.

[12]  Ling Liu,et al.  Computing infrastructure for big data processing , 2013, Frontiers of Computer Science.

[13]  William Zhu,et al.  Rough matroids based on relations , 2013, Inf. Sci..

[14]  Qinghua Hu,et al.  Neighborhood rough set based heterogeneous feature subset selection , 2008, Inf. Sci..

[15]  Witold Pedrycz,et al.  Measuring relevance between discrete and continuous features based on neighborhood mutual information , 2011, Expert Syst. Appl..

[16]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[18]  Xiaofeng Liao,et al.  (Corr. to) Delay-dependent exponential stability analysis of delayed neural networks: an LMI approach , 2003, Neural Networks.