A graph approach for fuzzy-rough feature selection

Abstract Rough sets, especially fuzzy-rough sets, have proven to be a powerful tool for dealing with vagueness and uncertainty in data analysis. Fuzzy-rough feature selection has been shown to be highly useful for reducing data dimensionality. However, many fuzzy-rough feature selection algorithms remain time-consuming on large-scale data sets. In this paper, feature selection with fuzzy-rough sets is studied in the framework of graph theory, and a new mechanism for fuzzy-rough feature selection is proposed. It is shown that finding an attribute reduct of a fuzzy decision system can be translated into finding a transversal of a derivative hypergraph. Based on this graph-representation model, a novel graph-theoretic algorithm for fuzzy-rough feature selection is proposed. The performance of the proposed method is compared with that of state-of-the-art methods on various classification tasks. Experimental results show that the proposed technique outperforms the compared feature selection methods in terms of computation time, with the advantage being most pronounced on large-scale data sets. Moreover, the proposed method achieves better classification accuracy while using a small number of features.
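To make the hypergraph view concrete, the following is a minimal sketch of finding a transversal (a vertex set that intersects every hyperedge). It is not the algorithm proposed in the paper, whose details the abstract does not give; it is a generic greedy hitting-set heuristic followed by a pruning pass. The function name `greedy_transversal`, the attribute names, and the example hyperedges are hypothetical. The sketch assumes each vertex is an attribute and each hyperedge is the set of attributes that discern one pair of samples.

```python
from typing import FrozenSet, Iterable, List, Set


def greedy_transversal(hyperedges: Iterable[Iterable[str]]) -> Set[str]:
    """Return a minimal (not necessarily minimum) transversal.

    Illustrative heuristic only: repeatedly pick the vertex that hits the
    most not-yet-covered hyperedges, then drop any redundant picks.
    Empty hyperedges cannot be hit by any vertex and are ignored here.
    """
    edges: List[FrozenSet[str]] = [frozenset(e) for e in hyperedges]
    edges = [e for e in edges if e]

    chosen: List[str] = []
    remaining = list(edges)
    while remaining:
        # Count how many uncovered hyperedges each vertex belongs to.
        counts = {}
        for edge in remaining:
            for vertex in edge:
                counts[vertex] = counts.get(vertex, 0) + 1
        # Sorting first makes tie-breaking deterministic (alphabetical).
        best = max(sorted(counts), key=counts.get)
        chosen.append(best)
        remaining = [e for e in remaining if best not in e]

    # Pruning pass: remove a vertex if the rest still hit every hyperedge.
    result = set(chosen)
    for vertex in chosen:
        trial = result - {vertex}
        if all(edge & trial for edge in edges):
            result = trial
    return result


if __name__ == "__main__":
    # Hypothetical hyperedges over attributes a, b, c, d.
    example = [{"a", "b"}, {"b", "c"}, {"c", "d"}]
    print(sorted(greedy_transversal(example)))
```

In this toy case the selected attributes form a set that intersects all three hyperedges, and removing either one leaves some hyperedge uncovered. Under the assumed encoding, such a minimal transversal plays the role of a reduct: a subset of attributes that still discerns every sample pair the full attribute set discerns.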
