Feature selection based on composition of rough sets induced by feature granulation

Abstract The term “feature selection” refers to the problem of selecting the most predictive features for a given outcome. Reducing the number of features is important to lower the computation cost of algorithms and also to achieve better generalization capabilities. The rough set theory has rapidly established itself as an effective tool for finding the smallest sets of features without any loss of information. This paper introduces a new approach based on the idea of exploiting rough set theory to compose partitions of the data induced by feature granulation. The partitions are iteratively aggregated in a single representative partition and, at each iteration, the obtained partition is used to evaluate the quality of the subset of features selected, thus reducing the cost of each evaluation. The consequence of using rough set theory is an algorithm that presents good reduction capability with less computational complexity w.r.t. Quickreduct algorithm. The proposed approach, called Roughinement, has been compared with approaches recently appeared in literature yielding comparable to better results.

[1]  Nan Zhang,et al.  Attribute reduction for sequential three-way decisions under dynamic granulation , 2017, Int. J. Approx. Reason..

[2]  Richard Jensen,et al.  Towards scalable fuzzy-rough feature selection , 2015, Inf. Sci..

[3]  Qiang Shen,et al.  New Approaches to Fuzzy-Rough Feature Selection , 2009, IEEE Transactions on Fuzzy Systems.

[4]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[5]  Sankar K. Pal,et al.  The Role of Soft Computing in Image Analysis , 2012, Handbook of Soft Computing for Video Surveillance.

[6]  Xiao Zhang,et al.  Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy , 2016, Pattern Recognit..

[7]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[8]  Yiyu Yao,et al.  A Comparative Study of Formal Concept Analysis and Rough Set Theory in Data Analysis , 2004, Rough Sets and Current Trends in Computing.

[9]  Qiang Shen,et al.  Semantics-preserving dimensionality reduction: rough and fuzzy-rough-based approaches , 2004, IEEE Transactions on Knowledge and Data Engineering.

[10]  Degang Chen,et al.  An incremental algorithm for attribute reduction with variable precision rough sets , 2016, Appl. Soft Comput..

[11]  Xizhao Wang,et al.  Attributes Reduction Using Fuzzy Rough Sets , 2008, IEEE Transactions on Fuzzy Systems.

[12]  Sankar K. Pal,et al.  Feature Selection Using f-Information Measures in Fuzzy Approximation Spaces , 2010, IEEE Transactions on Knowledge and Data Engineering.

[13]  Qinghua Hu,et al.  Information-preserving hybrid data reduction based on fuzzy-rough techniques , 2006, Pattern Recognit. Lett..

[14]  Usman Qamar,et al.  Feature selection using rough set-based direct dependency calculation by avoiding the positive region , 2018, Int. J. Approx. Reason..

[15]  Richard Jensen,et al.  Unsupervised fuzzy-rough set-based dimensionality reduction , 2013, Inf. Sci..

[16]  Alberto Apostolico,et al.  C-calculus: An elementary approach to some problems in pattern recognition , 1978, Pattern Recognit..

[17]  Qiang Shen,et al.  Finding rough and fuzzy-rough set reducts with SAT , 2014, Inf. Sci..

[18]  Xiaojun Xie,et al.  A novel incremental attribute reduction approach for dynamic incomplete decision systems , 2018, Int. J. Approx. Reason..

[19]  Chris Cornelis,et al.  Attribute selection with fuzzy decision reducts , 2010, Inf. Sci..

[20]  Huan Liu,et al.  A Probabilistic Approach to Feature Selection - A Filter Solution , 1996, ICML.

[21]  Francesco Camastra,et al.  Data dimensionality estimation methods: a survey , 2003, Pattern Recognit..

[22]  Mohammad Masoud Javidi,et al.  Online streaming feature selection using rough sets , 2016, Int. J. Approx. Reason..

[23]  Richard Bellman,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[24]  Qiang Shen,et al.  A modular approach to generating fuzzy rules with reduced attributes for the monitoring of complex systems , 2000 .

[25]  Nikhil R. Pal,et al.  Feature Selection With Controlled Redundancy in a Fuzzy Rule Based Framework , 2018, IEEE Transactions on Fuzzy Systems.

[26]  Xizhao Wang,et al.  Comparison of reduction in formal decision contexts , 2017, Int. J. Approx. Reason..

[27]  Z. Pawlak Granularity of knowledge, indiscernibility and rough sets , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[28]  Bing Huang,et al.  Sequential three-way decision and granulation for cost-sensitive face recognition , 2016, Knowl. Based Syst..

[29]  Yiyu Yao,et al.  On Reduct Construction Algorithms , 2006, RSKT.

[30]  Yiyu Yao,et al.  Granular Computing and Sequential Three-Way Decisions , 2013, RSKT.

[31]  Kaushal Kumar Shukla,et al.  Tolerance-based intuitionistic fuzzy-rough set approach for attribute reduction , 2018, Expert Syst. Appl..

[32]  Yuhua Qian,et al.  Three-way cognitive concept learning via multi-granularity , 2017, Inf. Sci..

[33]  Qiang Shen,et al.  Computational Intelligence and Feature Selection - Rough and Fuzzy Approaches , 2008, IEEE Press series on computational intelligence.

[34]  Yong Wang,et al.  BSSReduce an $O(\left|U\right|)$ Incremental Feature Selection Approach for Large-Scale and High-Dimensional Data , 2018, IEEE Transactions on Fuzzy Systems.

[35]  Jinhai Li,et al.  A heuristic knowledge-reduction method for decision formal contexts , 2011, Comput. Math. Appl..

[36]  Wojciech Ziarko,et al.  Variable Precision Rough Set Model , 1993, J. Comput. Syst. Sci..

[37]  Alfredo Petrosino,et al.  Feature Discovery through Hierarchies of Rough Fuzzy Sets , 2011 .

[38]  Marko Robnik-Sikonja,et al.  Overcoming the Myopia of Inductive Learning Algorithms with RELIEFF , 2004, Applied Intelligence.

[39]  Jan G. Bazan,et al.  Rough set algorithms in classification problem , 2000 .

[40]  Tianrui Li,et al.  An incremental attribute reduction approach based on knowledge granularity under the attribute generalization , 2016, Int. J. Approx. Reason..

[41]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[42]  C. Cornelis,et al.  Vaguely Quantified Rough Sets , 2009, RSFDGrC.

[43]  Xiao Zhang,et al.  A fuzzy rough set-based feature selection method using representative instances , 2018, Knowl. Based Syst..

[44]  Alfredo Petrosino,et al.  Rough fuzzy set-based image compression , 2009, Fuzzy Sets Syst..

[45]  Andrzej Skowron,et al.  The Discernibility Matrices and Functions in Information Systems , 1992, Intelligent Decision Support.

[46]  Tao Feng,et al.  Uncertainty and reduction of variable precision multigranulation fuzzy rough sets based on three-way decisions , 2017, Int. J. Approx. Reason..

[47]  Lloyd A. Smith,et al.  Practical feature subset selection for machine learning , 1998 .

[48]  Witold Pedrycz,et al.  Building the fundamentals of granular computing: A principle of justifiable granularity , 2013, Appl. Soft Comput..