Incremental feature selection based on fuzzy rough sets

Abstract Incremental feature selection can improve learning of accumulated data. We focus on incremental feature selection based on rough sets, which along with their generalizations (e.g., fuzzy rough sets), reduce dimensionality without requiring domain knowledge, such as data distributions. By analyzing the basic concepts of fuzzy rough sets on incremental datasets, we propose incremental mechanisms of information measure. Moreover, we introduce a key instance set containing representative instances to select supplementary features when new instances arrive. As the key instance set is much smaller than the whole dataset, the proposed incremental feature selection mostly suppresses redundant computations. We experimentally compare the proposed method with various non-incremental and two state-of-the-art incremental methods on a variety of datasets. The comparison results demonstrate that the proposed method achieves compact results with reduced computation time, especially on high-dimensional datasets.

[1]  Hamido Fujita,et al.  An efficient selector for multi-granularity attribute reduction , 2019, Inf. Sci..

[2]  Yang Ming An Incremental Updating Algorithm for Attribute Reduction Based on Improved Discernibility Matrix , 2007 .

[3]  Qinghua Hu,et al.  On Robust Fuzzy Rough Set Models , 2012, IEEE Transactions on Fuzzy Systems.

[4]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[5]  Gert Cauwenberghs,et al.  Incremental and Decremental Support Vector Machine Learning , 2000, NIPS.

[6]  Bing Huang,et al.  Sequential three-way decision based on multi-granular autoencoder features , 2020, Inf. Sci..

[7]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[8]  Xianzhong Zhou,et al.  Cost-sensitive dual-bidirectional linear discriminant analysis , 2020, Inf. Sci..

[9]  Siddhartha Bhattacharyya,et al.  A group incremental feature selection for classification using rough set theory based genetic algorithm , 2018, Appl. Soft Comput..

[10]  Jing Wang,et al.  A survey on online feature selection with streaming features , 2018, Frontiers of Computer Science.

[11]  D. Coomans,et al.  Alternative k-nearest neighbour rules in supervised pattern recognition : Part 1. k-Nearest neighbour classification by using alternative voting rules , 1982 .

[12]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[13]  Zhi-Hua Zhou,et al.  Least Square Incremental Linear Discriminant Analysis , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[14]  Giancarlo Fortino,et al.  Intelligent temporal classification and fuzzy rough set-based feature selection algorithm for intrusion detection system in WSNs , 2019, Inf. Sci..

[15]  Witold Pedrycz,et al.  Granular Computing: Perspectives and Challenges , 2013, IEEE Transactions on Cybernetics.

[16]  Sanjoy Dasgupta,et al.  Incremental Clustering: The Case for Extra Clusters , 2014, NIPS.

[17]  Jing Wang,et al.  Online Feature Selection with Group Structure Analysis , 2015, IEEE Transactions on Knowledge and Data Engineering.

[18]  Carlo Batini,et al.  On the Meaningfulness of “Big Data Quality” (Invited Paper) , 2015, Data Science and Engineering.

[19]  Paul E. Utgoff,et al.  Incremental Induction of Decision Trees , 1989, Machine Learning.

[20]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[21]  J. C. Schlimmer,et al.  Incremental learning from noisy data , 2004, Machine Learning.

[22]  James Theiler,et al.  Online Feature Selection using Grafting , 2003, ICML.

[23]  Yiyu Yao,et al.  On Reduct Construction Algorithms , 2006, Trans. Comput. Sci..

[24]  Witold Pedrycz,et al.  Positive approximation: An accelerator for attribute reduction in rough set theory , 2010, Artif. Intell..

[25]  James Theiler,et al.  Grafting: Fast, Incremental Feature Selection by Gradient Descent in Function Space , 2003, J. Mach. Learn. Res..

[26]  Xizhao Wang,et al.  Attributes Reduction Using Fuzzy Rough Sets , 2008, IEEE Transactions on Fuzzy Systems.

[27]  Andrzej Skowron,et al.  Rough sets: Some extensions , 2007, Inf. Sci..

[28]  Guoyin Wang,et al.  Incremental Attribute Reduction Based on Elementary Sets , 2005, RSFDGrC.

[29]  Stephen Grossberg,et al.  Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps , 1992, IEEE Trans. Neural Networks.

[30]  Xizhao Wang,et al.  Incremental Perspective for Feature Selection Based on Fuzzy Rough Sets , 2018, IEEE Transactions on Fuzzy Systems.

[31]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[32]  Jiye Liang,et al.  Combination Entropy and Combination Granulation in Rough Set Theory , 2008, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[33]  Hong-Ying Zhang,et al.  Feature selection based on generalized variable-precision $$(\vartheta ,\sigma )$$(ϑ,σ)-fuzzy granular rough set model over two universes , 2019, Int. J. Mach. Learn. Cybern..

[34]  Wei Wei,et al.  Accelerating incremental attribute reduction algorithm by compacting a decision table , 2018, Int. J. Mach. Learn. Cybern..

[35]  Christophe G. Giraud-Carrier,et al.  A Note on the Utility of Incremental Learning , 2000, AI Commun..

[36]  Steven Guan,et al.  An incremental approach to genetic-algorithms-based classification , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[37]  Ju-Sheng Mi,et al.  A novel approach for learning label correlation with application to feature selection of multi-label data , 2020, Inf. Sci..

[38]  Hui Wang,et al.  Fuzzy rough set based incremental attribute reduction from dynamic data with sample arriving , 2017, Fuzzy Sets Syst..

[39]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[40]  Jing Zhou,et al.  Streamwise Feature Selection , 2006, J. Mach. Learn. Res..

[41]  Hao Wang,et al.  Online Feature Selection with Streaming Features , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[43]  Xiao Zhang,et al.  Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy , 2016, Pattern Recognit..

[44]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[45]  Yuan Yan Tang,et al.  New Incremental Learning Algorithm With Support Vector Machines , 2019, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[46]  Wang Guo-yin,et al.  Incremental algorithms for attribute reduction in decision table , 2007 .

[47]  Jiye Liang,et al.  Ieee Transactions on Knowledge and Data Engineering 1 a Group Incremental Approach to Feature Selection Applying Rough Set Technique , 2022 .

[48]  Qinghua Hu,et al.  A Fitting Model for Feature Selection With Fuzzy Rough Sets , 2017, IEEE Transactions on Fuzzy Systems.