Attribute reduction based on max-decision neighborhood rough set model

Abstract The neighborhood rough set model only focuses on the consistent samples whose neighborhoods are completely contained in some decision classes, and ignores the divisibility of the boundary samples whose neighborhoods can not be contained in any decision classes. In this paper, we pay close attention to the boundary samples, and enlarge the positive region by adding the samples whose neighborhoods have maximal intersection with some decision classes. Applying the mentioned idea, we introduce a new neighborhood rough set model, named max-decision neighborhood rough set model. An attribute reduction algorithm is designed based on the model. Both theoretical analysis and experimental results show that the proposed algorithm is effective for removing most redundant attributes without loss of classification accuracy.

[1]  Lawrence Carin,et al.  Cost-sensitive feature acquisition and classification , 2007, Pattern Recognit..

[2]  Baoli Wang,et al.  Determining decision makers’ weights in group ranking: a granular computing method , 2015, Int. J. Mach. Learn. Cybern..

[3]  Qinghua Hu,et al.  Cost-sensitive feature selection based on adaptive neighborhood granularity with multi-level confidence , 2016, Inf. Sci..

[4]  Ming-Wen Shao,et al.  Feature subset selection based on fuzzy neighborhood rough sets , 2016, Knowl. Based Syst..

[5]  Witold Pedrycz,et al.  Measuring relevance between discrete and continuous features based on neighborhood mutual information , 2011, Expert Syst. Appl..

[6]  Ming Zhang,et al.  Neighborhood systems-based rough sets in incomplete information system , 2011, Knowl. Based Syst..

[7]  Qinghua Hu,et al.  Adaptive neighborhood granularity selection and combination based on margin distribution optimization , 2013, Inf. Sci..

[8]  Jiang Yunliang,et al.  Quick attribute reduct algorithm for neighborhood rough set model , 2014 .

[9]  Adam Gacek,et al.  Granular modelling of signals: A framework of Granular Computing , 2013, Inf. Sci..

[10]  Zhi-Hua Zhou,et al.  Ieee Transactions on Knowledge and Data Engineering 1 Training Cost-sensitive Neural Networks with Methods Addressing the Class Imbalance Problem , 2022 .

[11]  Yanqing Zhang,et al.  Constructive granular systems with universal approximation and fast knowledge discovery , 2005, IEEE Transactions on Fuzzy Systems.

[12]  Daijin Kim,et al.  Data classification based on tolerant rough set , 2001, Pattern Recognit..

[13]  Yuhua Qian,et al.  Test-cost-sensitive attribute reduction , 2011, Inf. Sci..

[14]  Hong-Ying Zhang,et al.  On inclusion measures of intuitionistic and interval-valued intuitionistic fuzzy values and their applications to group decision making , 2016, Int. J. Mach. Learn. Cybern..

[15]  William Zhu,et al.  Generalized rough sets based on relations , 2007, Inf. Sci..

[16]  Hong Zhao,et al.  Optimal cost-sensitive granularization based on rough sets for variable costs , 2014, Knowl. Based Syst..

[17]  Shi-Jinn Horng,et al.  Matrix-based dynamic updating rough fuzzy approximations for data mining , 2017, Knowl. Based Syst..

[18]  Witold Pedrycz,et al.  Hierarchical Granular Clustering: An Emergence of Information Granules of Higher Type and Higher Order , 2015, IEEE Transactions on Fuzzy Systems.

[19]  Lin Sun,et al.  A Gene selection approach based on the fisher linear discriminant and the neighborhood rough set , 2017, Bioengineered.

[20]  T. Y. Lin,et al.  NEIGHBORHOOD SYSTEMS AND APPROXIMATION IN RELATIONAL DATABASES AND KNOWLEDGE BASES , 1989 .

[21]  Yiyu Yao,et al.  A Partition Model of Granular Computing , 2004, Trans. Rough Sets.

[22]  Zhenmin Tang,et al.  Minimum cost attribute reduction in decision-theoretic rough set models , 2013, Inf. Sci..

[23]  Magdalene Marinaki,et al.  A bumble bees mating optimization algorithm for the feature selection problem , 2016, Int. J. Mach. Learn. Cybern..

[24]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[25]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[26]  Wei-Zhi Wu,et al.  Neighborhood operator systems and approximations , 2002, Inf. Sci..

[27]  Qinghua Hu,et al.  A Fitting Model for Feature Selection With Fuzzy Rough Sets , 2017, IEEE Transactions on Fuzzy Systems.

[28]  Haiyan Zhao,et al.  An approach to emergency decision making based on decision-theoretic rough set over two universes , 2016, Soft Comput..

[29]  Qinghua Hu,et al.  A novel method for attribute reduction of covering decision systems , 2014, Inf. Sci..

[30]  Geoff Holmes,et al.  Benchmarking Attribute Selection Techniques for Discrete Class Data Mining , 2003, IEEE Trans. Knowl. Data Eng..

[31]  Jie Gui,et al.  Tumor classification by combining PNN classifier ensemble with neighborhood rough set based gene reduction , 2010, Comput. Biol. Medicine.

[32]  Qingguo Li,et al.  Incremental approaches to knowledge reduction based on characteristic matrices , 2017, Int. J. Mach. Learn. Cybern..

[33]  Yu Xue,et al.  Gene selection for tumor classification using neighborhood rough sets and entropy measures , 2017, J. Biomed. Informatics.

[34]  Zhang Yi,et al.  Incremental rough set approach for hierarchical multicriteria classification , 2018, Inf. Sci..

[35]  Duoqian Miao,et al.  A rough set approach to feature selection based on ant colony optimization , 2010, Pattern Recognit. Lett..

[36]  Hong Zhao,et al.  A Backtracking Approach to Minimal Cost Feature Selection of Numerical Data , 2013 .

[37]  Guido Dedene,et al.  Cost-sensitive learning and decision making revisited , 2005, Eur. J. Oper. Res..

[38]  Ming-Wen Shao,et al.  A unified information measure for general binary relations , 2017, Knowl. Based Syst..

[39]  Xizhao Wang,et al.  Building a Rule-Based Classifier—A Fuzzy-Rough Set Approach , 2010, IEEE Transactions on Knowledge and Data Engineering.

[40]  Witold Pedrycz,et al.  A Granular Description of ECG Signals , 2006, IEEE Transactions on Biomedical Engineering.

[41]  Huan Liu,et al.  Consistency-based search in feature selection , 2003, Artif. Intell..

[42]  Zhifei Zhang,et al.  Incremental approaches for updating reducts in dynamic covering information systems , 2017, Knowl. Based Syst..

[43]  Qinghua Hu,et al.  Neighborhood classifiers , 2008, Expert Syst. Appl..

[44]  Qinghua Hu,et al.  An efficient gene selection technique for cancer recognition based on neighborhood mutual information , 2010, Int. J. Mach. Learn. Cybern..

[45]  Yuhua Qian,et al.  Multigranulation fuzzy rough set over two universes and its application to decision making , 2017, Knowl. Based Syst..

[46]  Zexuan Zhu,et al.  Wrapper–Filter Feature Selection Algorithm Using a Memetic Framework , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[47]  Yiyu Yao,et al.  A unified model of sequential three-way decisions and multilevel incremental processing , 2017, Knowl. Based Syst..

[48]  Qinghua Hu,et al.  A Novel Algorithm for Finding Reducts With Fuzzy Rough Sets , 2012, IEEE Transactions on Fuzzy Systems.

[49]  Qinghua Hu,et al.  Feature Selection Based on Neighborhood Discrimination Index , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[50]  Lin Sun,et al.  Feature selection using rough entropy-based uncertainty measures in incomplete decision systems , 2012, Knowl. Based Syst..

[51]  Yiyu Yao,et al.  MGRS: A multi-granulation rough set , 2010, Inf. Sci..

[52]  Xizhao Wang,et al.  Induction of multiple fuzzy decision trees based on rough set technique , 2008, Inf. Sci..

[53]  William Zhu,et al.  Attribute reduction of data with error ranges and test costs , 2012, Inf. Sci..

[54]  Fan Min,et al.  A hierarchical model for test-cost-sensitive decision systems , 2009, Inf. Sci..

[55]  Qinghua Hu,et al.  Neighborhood rough set based heterogeneous feature subset selection , 2008, Inf. Sci..

[56]  Rajen B. Bhatt,et al.  On fuzzy-rough sets approach to feature selection , 2005, Pattern Recognit. Lett..

[57]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.