An incremental approach to attribute reduction from dynamic incomplete decision systems in rough set theory

Attribute reduction is an important preprocessing step in data mining and knowledge discovery. The effective computation of an attribute reduct has a direct bearing on the efficiency of knowledge acquisition and various related tasks. In real-world applications, some attribute values for an object may be incomplete and an object set may vary dynamically in the knowledge representation systems, also called decision systems in rough set theory. There are relatively few studies on attribute reduction in such systems. This paper mainly focuses on this issue. For the immigration and emigration of a single object in the incomplete decision system, an incremental attribute reduction algorithm is developed to compute a new attribute reduct, rather than to obtain the dynamic system as a new one that has to be computed from scratch. In particular, for the immigration and emigration of multiple objects in the system, another incremental reduction algorithm guarantees that a new attribute reduct can be computed on the fly, which avoids some re-computations. Compared with other attribute reduction algorithms, the proposed algorithms can effectively reduce the time required for reduct computations without losing the classification performance. Experiments on different real-life data sets are conducted to test and demonstrate the efficiency and effectiveness of the proposed algorithms.

[1]  Witold Pedrycz,et al.  Positive approximation: An accelerator for attribute reduction in rough set theory , 2010, Artif. Intell..

[2]  Da Ruan,et al.  Rough sets based matrix approaches with dynamic attribute variation in set-valued information systems , 2012, Int. J. Approx. Reason..

[3]  Wojciech Ziarko Incremental Learning and Evaluation of Structures of Rough Decision Tables , 2005, Trans. Rough Sets.

[4]  Mahesh Viswanathan,et al.  A counterexample-guided abstraction-refinement framework for markov decision processes , 2008, TOCL.

[5]  Jiye Liang,et al.  Information entropy, rough entropy and knowledge granulation in incomplete information systems , 2006, Int. J. Gen. Syst..

[6]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[7]  Arkadiusz Wojna,et al.  Constraint Based Incremental Learning of Classification Rules , 2000, Rough Sets and Current Trends in Computing.

[8]  Da Ruan,et al.  An Incremental Approach for Inducing Knowledge from Dynamic Information Systems , 2009, Fundam. Informaticae.

[9]  Xiao-Jun Zeng,et al.  Core-generating approximate minimum entropy discretization for rough set feature selection in pattern classification , 2011, Int. J. Approx. Reason..

[10]  Andrzej Skowron,et al.  Rough sets and Boolean reasoning , 2007, Inf. Sci..

[11]  Roman Słowiński,et al.  Dealing with Missing Data in Rough Set Analysis of Multi-Attribute and Multi-Criteria Decision Problems , 2000 .

[12]  Jianhui Lin,et al.  A Rough-Set-Based Incremental Approach for Updating Approximations under Dynamic Maintenance Environments , 2013, IEEE Transactions on Knowledge and Data Engineering.

[13]  Jerzy W. Grzymala-Busse,et al.  An Experimental Comparison of Three Rough Set Approaches to Missing Attribute Values , 2007, Trans. Rough Sets.

[14]  Jiye Liang,et al.  Approximation reduction in inconsistent incomplete decision tables , 2010, Knowl. Based Syst..

[15]  Wenhao Shu,et al.  A fast approach to attribute reduction from perspective of attribute measures in incomplete decision systems , 2014, Knowl. Based Syst..

[16]  Yang Ming An Incremental Updating Algorithm for Attribute Reduction Based on Improved Discernibility Matrix , 2007 .

[17]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[18]  Jiye Liang,et al.  Attribute reduction for dynamic data sets , 2013, Appl. Soft Comput..

[19]  Andrzej Skowron,et al.  The Discernibility Matrices and Functions in Information Systems , 1992, Intelligent Decision Support.

[20]  Lei Zhang,et al.  Sample Pair Selection for Attribute Reduction with Rough Set , 2012, IEEE Transactions on Knowledge and Data Engineering.

[21]  Huan Liu,et al.  Consistency-based search in feature selection , 2003, Artif. Intell..

[22]  Shaojie Qiao,et al.  A rough set based dynamic maintenance approach for approximations in coarsening and refining attribute values , 2010 .

[23]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[24]  Xizhao Wang,et al.  Attributes Reduction Using Fuzzy Rough Sets , 2008, IEEE Transactions on Fuzzy Systems.

[25]  Md. Aquil Khan,et al.  Logics for information systems and their dynamic extensions , 2011, TOCL.

[26]  Yi Cheng,et al.  The incremental method for fast computing the rough fuzzy approximations , 2011, Data Knowl. Eng..

[27]  Zdzislaw Pawlak,et al.  Rough Set Theory and its Applications to Data Analysis , 1998, Cybern. Syst..

[28]  Ching-Hsue Cheng,et al.  Fuzzy time series model based on probabilistic approach and rough set rule induction for empirical research in stock markets , 2008, Data Knowl. Eng..

[29]  Qinghua Hu,et al.  Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation , 2007, Pattern Recognit..

[30]  Wojciech Ziarko,et al.  DATA‐BASED ACQUISITION AND INCREMENTAL MODIFICATION OF CLASSIFICATION RULES , 1995, Comput. Intell..

[31]  Jennifer Blackhurst,et al.  MMR: An algorithm for clustering categorical data using Rough Set Theory , 2007, Data Knowl. Eng..

[32]  Jiye Liang,et al.  Incomplete Multigranulation Rough Set , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[33]  Zhongzhi Shi,et al.  A fast approach to attribute reduction in incomplete decision systems with tolerance relation-based rough sets , 2009, Inf. Sci..

[34]  Yiyu Yao,et al.  Data analysis based on discernibility and indiscernibility , 2007, Inf. Sci..

[35]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[36]  Guoyin Wang,et al.  Incremental Attribute Reduction Based on Elementary Sets , 2005, RSFDGrC.

[37]  Jerzy W. Grzymala-Busse,et al.  Characteristic Relations for Incomplete Data: A Generalization of the Indiscernibility Relation , 2005, Trans. Rough Sets.

[38]  Jing-Yu Yang,et al.  Dominance-based rough set approach to incomplete interval-valued information system , 2009, Data Knowl. Eng..

[39]  Yitian Xu,et al.  A dynamic attribute reduction algorithm based on 0-1 integer programming , 2011, Knowl. Based Syst..

[40]  Geert Wets,et al.  A rough sets based characteristic relation approach for dynamic attribute generalization in data mining , 2007, Knowl. Based Syst..

[41]  Qiang Shen,et al.  A Distance Measure Approach to Exploring the Rough Set Boundary Region for Attribute Reduction , 2010, IEEE Transactions on Knowledge and Data Engineering.

[42]  Jiye Liang,et al.  The Algorithm on Knowledge Reduction in Incomplete Information Systems , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[43]  Pradeep Kumar,et al.  Rough clustering of sequential data , 2007, Data Knowl. Eng..

[44]  Witold Pedrycz,et al.  An efficient accelerator for attribute reduction from incomplete data in rough set framework , 2011, Pattern Recognit..

[45]  Marzena Kryszkiewicz,et al.  Rules in Incomplete Information Systems , 1999, Inf. Sci..

[46]  Wei-Zhi Wu,et al.  Knowledge reduction in random information systems via Dempster-Shafer theory of evidence , 2005, Inf. Sci..

[47]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[48]  Jiye Liang,et al.  Ieee Transactions on Knowledge and Data Engineering 1 a Group Incremental Approach to Feature Selection Applying Rough Set Technique , 2022 .

[49]  Yee Leung,et al.  Maximal consistent block technique for rule acquisition in incomplete information systems , 2003, Inf. Sci..

[50]  Jiye Liang,et al.  On the evaluation of the decision performance of an incomplete decision table , 2008, Data Knowl. Eng..

[51]  Yang Xu,et al.  An approach to attribute generalization in incomplete information systems , 2003, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.03EX693).

[52]  Marzena Kryszkiewicz,et al.  Rough Set Approach to Incomplete Information Systems , 1998, Inf. Sci..

[53]  Bingru Yang,et al.  Feature Selection using Compact Discernibility Matrix-based Approach in Dynamic Incomplete Decision System , 2015, J. Inf. Sci. Eng..

[54]  Lin Sun,et al.  Feature selection using rough entropy-based uncertainty measures in incomplete decision systems , 2012, Knowl. Based Syst..