Granular maximum decision entropy-based monotonic uncertainty measure for attribute reduction

Abstract Attribute reduction is considered an important preprocessing step in machine learning, pattern recognition, and data mining, and several attribute reduction measures based on rough set theory have been introduced to deal with vague, imprecise, or uncertain data. However, some of the measures inherently suffer from nonmonotonicity and redundancy. In this paper, a monotonic uncertainty measure, called granular maximum decision entropy (GMDE), is proposed. Specifically, we first develop a notion of maximum decision entropy. By integrating the uncertainty of the maximum decision entropy with the granulation of knowledge, a novel uncertainty measure is then presented, and its monotonicity is theoretically proved. We also provide a forward heuristic attribute reduction algorithm based on the proposed uncertainty measure, which could simultaneously select the informative attributes and remove the unnecessary attributes in the procedure of attribute reduction, thus resulting in high efficiency. The experiments conducted on several UCI data sets demonstrate that the proposed measure and algorithm are effective and computationally inexpensive and are superior to the representatives in terms of classification performance and efficiency.

[1]  Wang Guo,et al.  Decision Table Reduction based on Conditional Information Entropy , 2002 .

[2]  Yiyu Yao,et al.  Discernibility matrix simplification for constructing attribute reducts , 2009, Inf. Sci..

[3]  Jie Duan,et al.  Multi-label feature selection based on neighborhood mutual information , 2016, Appl. Soft Comput..

[4]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[5]  Zhenmin Tang,et al.  On an optimization representation of decision-theoretic rough set model , 2014, Int. J. Approx. Reason..

[6]  Wei-Zhi Wu,et al.  Approaches to knowledge reduction based on variable precision rough set model , 2004, Inf. Sci..

[7]  Yiyu Yao,et al.  The superiority of three-way decisions in probabilistic rough set models , 2011, Inf. Sci..

[8]  Miao Duo,et al.  A HEURISTIC ALGORITHM FOR REDUCTION OF KNOWLEDGE , 1999 .

[9]  Witold Pedrycz,et al.  Selecting Discrete and Continuous Features Based on Neighborhood Decision Error Minimization , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[10]  Duoqian Miao,et al.  Three-way attribute reducts , 2017, Int. J. Approx. Reason..

[11]  Witold Pedrycz,et al.  Granular Computing: Perspectives and Challenges , 2013, IEEE Transactions on Cybernetics.

[12]  Tao Li,et al.  Recent advances in feature selection and its applications , 2017, Knowledge and Information Systems.

[13]  Yiyu Yao,et al.  Attribute reduction in decision-theoretic rough set models , 2008, Inf. Sci..

[14]  S. K. Michael Wong,et al.  Rough Sets: Probabilistic versus Deterministic Approach , 1988, Int. J. Man Mach. Stud..

[15]  Yiyu Yao,et al.  Relative reducts in consistent and inconsistent decision tables of the Pawlak rough set model , 2009, Inf. Sci..

[16]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[17]  Jian Yang,et al.  Rotational Invariant Dimensionality Reduction Algorithms , 2017, IEEE Transactions on Cybernetics.

[18]  Jiye Liang,et al.  The Information Entropy, Rough Entropy And Knowledge Granulation In Rough Set Theory , 2004, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[19]  Xizhao Wang,et al.  Comparison of reduction in formal decision contexts , 2017, Int. J. Approx. Reason..

[20]  Zhenmin Tang,et al.  Minimum cost attribute reduction in decision-theoretic rough set models , 2013, Inf. Sci..

[21]  Can Gao,et al.  Rough possibilistic C-means clustering based on multigranulation approximation regions and shadowed sets , 2018, Knowl. Based Syst..

[22]  Xinye Cai,et al.  Neighborhood based decision-theoretic rough set models , 2016, Int. J. Approx. Reason..

[23]  Wojciech Ziarko,et al.  Variable Precision Rough Set Model , 1993, J. Comput. Syst. Sci..

[24]  Xiaodong Yue,et al.  Tri-partition neighborhood covering reduction for robust classification , 2017, Int. J. Approx. Reason..

[25]  Yiyu Yao,et al.  Probabilistic rough set approximations , 2008, Int. J. Approx. Reason..

[26]  Andrzej Skowron,et al.  The Discernibility Matrices and Functions in Information Systems , 1992, Intelligent Decision Support.

[27]  Can Gao,et al.  Multigranulation rough-fuzzy clustering based on shadowed sets , 2020, Inf. Sci..

[28]  Yiyu Yao,et al.  Generalized attribute reduct in rough set theory , 2016, Knowl. Based Syst..

[29]  Yiyu Yao,et al.  Three-Way Decisions and Cognitive Computing , 2016, Cognitive Computation.

[30]  Guoyin Wang,et al.  Decision region distribution preservation reduction in decision-theoretic rough set model , 2014, Inf. Sci..

[31]  Decui Liang,et al.  A novel three-way decision model based on incomplete information system , 2016, Knowl. Based Syst..

[32]  Nan Zhang,et al.  Attribute reduction for sequential three-way decisions under dynamic granulation , 2017, Int. J. Approx. Reason..

[33]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[34]  Duoqian Miao,et al.  Reduction target structure-based hierarchical attribute reduction for two-category decision-theoretic rough sets , 2014, Inf. Sci..

[35]  Yiyu Yao,et al.  A Decision Theoretic Framework for Approximating Concepts , 1992, Int. J. Man Mach. Stud..

[36]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[37]  K. Thangavel,et al.  Dimensionality reduction based on rough set theory: A review , 2009, Appl. Soft Comput..

[38]  Yiyu Yao,et al.  Three-way decisions with probabilistic rough sets , 2010, Inf. Sci..

[39]  Wei-Zhi Wu,et al.  Approaches to knowledge reductions in inconsistent systems , 2003, Int. J. Intell. Syst..

[40]  Hamido Fujita,et al.  An incremental attribute reduction approach based on knowledge granularity with a multi-granulation view , 2017, Inf. Sci..

[41]  Majid Komeili,et al.  Local Feature Selection for Data Classification , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Jiye Liang,et al.  Local multigranulation decision-theoretic rough sets , 2017, Int. J. Approx. Reason..

[43]  Qinghua Hu,et al.  Attribute Selection for Partially Labeled Categorical Data By Rough Set Approach , 2017, IEEE Transactions on Cybernetics.

[44]  Qingxin Zhu,et al.  Cost-Sensitive Attribute Reduction in Decision-Theoretic Rough Set Models , 2014 .

[45]  Dun Liu,et al.  Non-Monotonic Attribute Reduction in Decision-Theoretic Rough Sets , 2013, Fundam. Informaticae.

[46]  Zhifei Zhang,et al.  International Journal of Approximate Reasoning Diverse Reduct Subspaces Based Co-training for Partially Labeled Data , 2022 .

[47]  XIAOHUA Hu,et al.  LEARNING IN RELATIONAL DATABASES: A ROUGH SET APPROACH , 1995, Comput. Intell..

[48]  Duoqian Miao,et al.  Region-based quantitative and hierarchical attribute reduction in the two-category decision theoretic rough set model , 2014, Knowl. Based Syst..

[49]  Witold Pedrycz,et al.  Granular multi-label feature selection based on mutual information , 2017, Pattern Recognit..

[50]  Witold Pedrycz,et al.  Decision-theoretic rough set approaches to multi-covering approximation spaces based on fuzzy probability measure , 2018, J. Intell. Fuzzy Syst..