Incremental approaches for feature selection from dynamic data with the variation of multiple objects

Abstract Owing to the dynamic nature of data in the big data era, multiple objects of a decision system often vary over time as new information arrives in real-world applications. However, many feature selection algorithms are designed for static decision systems, and some dynamic feature selection algorithms treat the variation of multiple objects as the cumulative variation of a single object. When multiple objects vary over time, these algorithms are often time-consuming, so more efficient strategies for feature selection are needed. Incremental updating is an efficient technique for dynamic learning tasks because it reuses previously acquired knowledge to obtain new knowledge. In this paper, we focus on incremental updating to select a new feature subset under the variation of multiple objects. First, the dependency function is updated incrementally to evaluate the quality of candidate features. Then, two incremental feature selection algorithms are developed for the cases where multiple objects are added to or deleted from a decision system. Experiments on different UCI data sets show that the proposed algorithms select a new feature subset in much less computation time than the compared algorithms without losing classification performance.
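To make the idea of incrementally updating a dependency function concrete, the following is a minimal sketch and not the algorithm proposed in the paper. It assumes the classical rough-set dependency degree, gamma_B(D) = |POS_B(D)| / |U|, where POS_B(D) is the union of the equivalence classes induced by the feature subset B whose objects all share one decision value. The class name `IncrementalDependency`, its methods, and the toy data are hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict


class IncrementalDependency:
    """Maintains gamma_B(D) = |POS_B(D)| / |U| while objects are added or removed.

    Illustrative sketch only: assumes categorical features and the classical
    (Pawlak) positive region; the paper's own update rules may differ.
    """

    def __init__(self, feature_idx):
        self.feature_idx = tuple(feature_idx)   # indices of the features in B
        self.classes = defaultdict(Counter)     # B-values -> decision counts
        self.pos = 0                            # |POS_B(D)|
        self.n = 0                              # |U|

    def _consistent_size(self, key):
        """Size of the equivalence class if it has a single decision, else 0."""
        counts = self.classes.get(key)
        if not counts:
            return 0
        return sum(counts.values()) if len(counts) == 1 else 0

    def _update(self, rows, sign):
        for features, decision in rows:
            key = tuple(features[i] for i in self.feature_idx)
            if sign < 0 and self.classes.get(key, {}).get(decision, 0) == 0:
                raise KeyError("object to remove is not in the decision system")
            # Only the touched equivalence class can change the positive region.
            self.pos -= self._consistent_size(key)
            counts = self.classes[key]
            counts[decision] += sign
            if counts[decision] == 0:
                del counts[decision]
            if not counts:
                del self.classes[key]
            self.pos += self._consistent_size(key)
            self.n += sign

    def add(self, rows):
        """Add a batch of (features, decision) objects."""
        self._update(rows, +1)

    def remove(self, rows):
        """Remove a batch of (features, decision) objects."""
        self._update(rows, -1)

    def gamma(self):
        return self.pos / self.n if self.n else 0.0


# Toy decision system: each object is (feature tuple, decision value).
rows = [((0, 0), "a"), ((0, 1), "a"), ((0, 1), "b"), ((1, 1), "b")]
dep = IncrementalDependency([0, 1])
dep.add(rows)
gamma_initial = dep.gamma()

dep.remove([((0, 1), "b")])                    # multiple objects may be deleted at once
gamma_after_delete = dep.gamma()

dep.add([((1, 1), "a"), ((2, 0), "a")])        # a batch of new objects arrives
gamma_after_add = dep.gamma()
```

Each update touches only the equivalence classes of the arriving or departing objects, so the cost of a batch grows with the batch size rather than with the size of the whole decision system. This is the general reason incremental updating can be cheaper than recomputing the dependency from scratch.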
