Feature selection via normative fuzzy information weight with application into tumor classification

Abstract Feature selection via mutual information has been widely used in data analysing. Mutual information with monotonous is an effective tool to analyse the correlation and redundancy of features. However, the mutual information, which adopted in most of existing feature selection criterions, can’t explain the correlation and redundancy of features in the fuzzy situation well. Therefore, we propose feature selection strategy via normative fuzzy information weight based on fuzzy conditional mutual information in this paper. Firstly, the monotone fuzzy metric structure is defined, and some theoretical properties are proved. Secondly, we put forward the concept of fuzzy independent classification information based on fuzzy conditional mutual information, and propose a feature selection method via fuzzy independent classification information. Thirdly, considering the proportion of new classification information provided by the selected feature in its own information, we introduce the concept of normative fuzzy information weight and propose an improved feature selection method. Finally, the availability of the two proposed methods is tested by comparative experiments, and the improved feature selection method is applied to tumor classification. This work provides an alternative strategy for feature selection in real-world data applications.

[1]  Qinghua Hu,et al.  Discrete particle swarm optimization approach for cost sensitive attribute reduction , 2016, Knowl. Based Syst..

[2]  Qiang Shen,et al.  Fuzzy-Rough Sets Assisted Attribute Selection , 2007, IEEE Transactions on Fuzzy Systems.

[3]  Zdzislaw Pawlak,et al.  Rough Set Theory and its Applications to Data Analysis , 1998, Cybern. Syst..

[4]  Jacek M. Zurada,et al.  Normalized Mutual Information Feature Selection , 2009, IEEE Transactions on Neural Networks.

[5]  Qinghua Hu,et al.  Attribute Selection for Partially Labeled Categorical Data By Rough Set Approach , 2017, IEEE Transactions on Cybernetics.

[6]  Zdzislaw Pawlak,et al.  Rough sets and intelligent data analysis , 2002, Inf. Sci..

[7]  Liang Liu,et al.  Attribute selection based on a new conditional entropy for incomplete decision systems , 2013, Knowl. Based Syst..

[8]  Hong Chen,et al.  PARA: A positive-region based attribute reduction accelerator , 2019, Inf. Sci..

[9]  Mohamed Elhoseny,et al.  Feature selection based on artificial bee colony and gradient boosting decision tree , 2019, Appl. Soft Comput..

[10]  Hamido Fujita,et al.  An incremental attribute reduction approach based on knowledge granularity with a multi-granulation view , 2017, Inf. Sci..

[11]  Jianhua Dai,et al.  Entropy measures and granularity measures for set-valued information systems , 2013, Inf. Sci..

[12]  Ming-Wen Shao,et al.  Fuzzy rough set-based attribute reduction using distance measures , 2019, Knowl. Based Syst..

[13]  Jiye Liang,et al.  A new method for measuring uncertainty and fuzziness in rough set theory , 2002, Int. J. Gen. Syst..

[14]  Qiang Shen,et al.  New Approaches to Fuzzy-Rough Feature Selection , 2009, IEEE Transactions on Fuzzy Systems.

[15]  N. Iizuka,et al.  MECHANISMS OF DISEASE Mechanisms of disease , 2022 .

[16]  Xizhao Wang,et al.  Attributes Reduction Using Fuzzy Rough Sets , 2008, IEEE Transactions on Fuzzy Systems.

[17]  Witold Pedrycz,et al.  Kernelized Fuzzy Rough Sets and Their Applications , 2011, IEEE Transactions on Knowledge and Data Engineering.

[18]  Bin Zhao,et al.  Prediction of service life of large centrifugal compressor remanufactured impeller based on clustering rough set and fuzzy Bandelet neural network , 2019, Appl. Soft Comput..

[19]  Hui Wang,et al.  Granular Computing Based on Fuzzy Similarity Relations , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[20]  Degang Chen,et al.  Measures of general fuzzy rough sets on a probabilistic space , 2008, Inf. Sci..

[21]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Jianhua Dai,et al.  Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification , 2013, Appl. Soft Comput..

[23]  Xu Weihua,et al.  Knowledge granulation, knowledge entropy and knowledge uncertainty measure in ordered information systems , 2009 .

[24]  Qinghua Hu,et al.  Attribute reduction in interval-valued information systems based on information entropies , 2016, Frontiers of Information Technology & Electronic Engineering.

[25]  Xizhao Wang,et al.  On the generalization of fuzzy rough sets , 2005, IEEE Transactions on Fuzzy Systems.

[26]  Qinghua Hu,et al.  Neighbor Inconsistent Pair Selection for Attribute Reduction by Rough Set Approach , 2018, IEEE Transactions on Fuzzy Systems.

[27]  Vinod Kumar Jain,et al.  Correlation feature selection based improved-Binary Particle Swarm Optimization for gene selection and cancer classification , 2018, Appl. Soft Comput..

[28]  Bernhard Moser,et al.  On the T , 2006, Fuzzy Sets Syst..

[29]  Lei Zhang,et al.  Sample Pair Selection for Attribute Reduction with Rough Set , 2012, IEEE Transactions on Knowledge and Data Engineering.

[30]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[31]  Javier Puente,et al.  Fuzzy inference suitability to determine the utilitarian quality of B2C websites , 2017, Appl. Soft Comput..

[32]  Jun Wang,et al.  Feature Selection by Maximizing Independent Classification Information , 2017, IEEE Transactions on Knowledge and Data Engineering.

[33]  Qinghua Hu,et al.  Parameterized attribute reduction with Gaussian kernel based fuzzy rough sets , 2011, Inf. Sci..

[34]  Jianhua Dai,et al.  Knowledge granularity based incremental attribute reduction for incomplete decision systems , 2020, Int. J. Mach. Learn. Cybern..

[35]  Chen Degang,et al.  Local reduction of decision system with fuzzy rough sets , 2010 .

[36]  Bernhard Moser,et al.  On Representing and Generating Kernels by Fuzzy Equivalence Relations , 2006, J. Mach. Learn. Res..

[37]  Youlong Yang,et al.  Fuzzy rule-based oversampling technique for imbalanced and incomplete data learning , 2018, Knowl. Based Syst..

[38]  Roberto Battiti,et al.  Using mutual information for selecting features in supervised neural net learning , 1994, IEEE Trans. Neural Networks.

[39]  Jianhua Dai,et al.  Fuzzy rough set model for set-valued data , 2013, Fuzzy Sets Syst..

[40]  Qinghua Hu,et al.  Fuzzy Probabilistic Approximation Spaces and Their Information Measures , 2006, IEEE Trans. Fuzzy Syst..

[41]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[42]  Jianhua Dai,et al.  Uncertainty measurement for interval-valued decision systems based on extended conditional entropy , 2012, Knowl. Based Syst..

[43]  Qinghua Hu,et al.  Uncertainty measures for fuzzy relations and their applications , 2007, Appl. Soft Comput..

[44]  Jemal H. Abawajy,et al.  A rough set approach for selecting clustering attribute , 2010, Knowl. Based Syst..

[45]  Lotfi A. Zadeh,et al.  Similarity relations and fuzzy orderings , 1971, Inf. Sci..

[46]  Witold Pedrycz,et al.  Gaussian kernel based fuzzy rough sets: Model, uncertainty measures and applications , 2010, Int. J. Approx. Reason..

[47]  Guo-Jun Wang,et al.  An axiomatic approach of fuzzy rough sets based on residuated lattices , 2009, Comput. Math. Appl..

[48]  Wei-Zhi Wu,et al.  Maximal-Discernibility-Pair-Based Approach to Attribute Reduction in Fuzzy Rough Sets , 2018, IEEE Transactions on Fuzzy Systems.

[49]  F. Fleuret Fast Binary Feature Selection with Conditional Mutual Information , 2004, J. Mach. Learn. Res..

[50]  Lei Lei,et al.  Wavelet Neural Network Prediction Method of Stock Price Trend Based on Rough Set Attribute Reduction , 2018, Appl. Soft Comput..

[51]  Wei-Zhi Wu,et al.  Generalized fuzzy rough sets , 2003, Inf. Sci..

[52]  Siddhartha Bhattacharyya,et al.  A group incremental feature selection for classification using rough set theory based genetic algorithm , 2018, Appl. Soft Comput..

[53]  Asit Kumar Das,et al.  Rough set based incremental crime report labelling in dynamic environment , 2019, Appl. Soft Comput..

[54]  Àngela Nebot,et al.  Fuzzy inductive reasoning forecasting strategies able to cope with missing data: A smart grid application , 2017, Appl. Soft Comput..