Dynamic Updating Rough Approximations in Distributed Information Systems

Rough set theory is an effective mathematical tool for processing the uncertainty and inexact data. In some real-life applications, data stores in information systems distributively which are called as Distributed Information Systems (DIS). It is hard to centralize the large-scale data in DIS for data mining tasks. Furthermore, knowledge needs updating as the attributes dynamically increase in size in DIS. In this paper, we present an incremental approach for maintaining rough approximations in DIS under attribute generalization. Firstly, a matrix-based approach is presented to compute approximations. Then, an incremental approach for updating rough approximations in DIS is proposed, which does not need to centralize data from different locations and recompute the whole data sets from scratch. Finally, a case study is provided for validating the efficiency and effectiveness of the proposed method.

[1]  Yong Qi,et al.  Updating multigranulation rough approximations with increasing of granular structures , 2014, Knowl. Based Syst..

[2]  Yiyu Yao,et al.  A Comparative Study of Fuzzy Sets and Rough Sets , 1998 .

[3]  Da Ruan,et al.  Rough sets based matrix approaches with dynamic attribute variation in set-valued information systems , 2012, Int. J. Approx. Reason..

[4]  Hong Shen,et al.  Incremental feature selection based on rough set in dynamic incomplete data , 2014, Pattern Recognit..

[5]  Tianrui Li,et al.  Fast algorithms for computing rough approximations in set-valued decision systems while updating criteria values , 2015, Inf. Sci..

[6]  Jerzy W. Grzymala-Busse,et al.  Characteristic Relations for Incomplete Data: A Generalization of the Indiscernibility Relation , 2005, Trans. Rough Sets.

[7]  Yaojin Lin,et al.  Fast approach to knowledge acquisition in covering information systems using matrix operations , 2015, Knowl. Based Syst..

[8]  Andrzej Skowron,et al.  The Discernibility Matrices and Functions in Information Systems , 1992, Intelligent Decision Support.

[9]  Mihaela van der Schaar,et al.  A fast online learning algorithm for distributed mining of BigData , 2014, PERV.

[10]  Chris Clifton,et al.  Privacy-preserving distributed mining of association rules on horizontally partitioned data , 2004, IEEE Transactions on Knowledge and Data Engineering.

[11]  Lin Sun,et al.  Feature selection using rough entropy-based uncertainty measures in incomplete decision systems , 2012, Knowl. Based Syst..

[12]  Silvia Calegari,et al.  Granular computing applied to ontologies , 2010, Int. J. Approx. Reason..

[13]  S. R,et al.  Data Mining with Big Data , 2017, 2017 11th International Conference on Intelligent Systems and Control (ISCO).

[14]  Anastasios Tefas,et al.  A distributed framework for trimmed Kernel k-Means clustering , 2015, Pattern Recognit..

[15]  Chris Clifton,et al.  Tools for privacy preserving distributed data mining , 2002, SKDD.

[16]  Yuan Li,et al.  Approaches to attribute reductions based on rough set and matrix computation in inconsistent ordered information systems , 2012, Knowl. Based Syst..

[17]  Jiye Liang,et al.  Attribute reduction: A dimension incremental strategy , 2013, Knowl. Based Syst..

[18]  Gui-Long Liu,et al.  The Axiomatization of the Rough Set Upper Approximation Operations , 2006, Fundam. Informaticae.

[19]  Da Ruan,et al.  A parallel method for computing rough set approximations , 2012, Inf. Sci..

[20]  Caihui Liu,et al.  Hierarchical attribute reduction algorithms for big data using MapReduce , 2015, Knowl. Based Syst..

[21]  Xinyu Dai,et al.  Incremental Versus Non-incremental: Data and Algorithms Based on Ordering Relations , 2011, Int. J. Comput. Intell. Syst..

[22]  Salvatore Greco,et al.  Rough approximation of a preference relation by dominance relations , 1999, Eur. J. Oper. Res..

[23]  Alexis Tsoukiàs,et al.  On the Extension of Rough Sets under Incomplete Information , 1999, RSFDGrC.

[24]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[25]  Qingxin Zhu,et al.  Characteristic matrix of covering and its application to Boolean matrix decomposition , 2012, Inf. Sci..

[26]  Hongmei Chen,et al.  Dynamic maintenance of approximations in set-valued ordered decision systems under the attribute generalization , 2014, Inf. Sci..

[27]  TsumotoShusaku Automated extraction of medical expert system rules from clinical databases based on rough set theory , 1998 .

[28]  Witold Pedrycz,et al.  Granular Computing: Analysis and Design of Intelligent Systems , 2013 .

[29]  Guoyin Wang,et al.  A Decision-Theoretic Rough Set Approach for Dynamic Data Mining , 2015, IEEE Transactions on Fuzzy Systems.

[30]  Dun Liu,et al.  Incremental approaches for updating approximations in set-valued ordered information systems , 2013, Knowl. Based Syst..

[31]  Tianrui Li,et al.  Composite rough sets for dynamic data mining , 2014, Inf. Sci..

[32]  Hillol Kargupta,et al.  Distributed Data Mining: Algorithms, Systems, and Applications , 2003 .

[33]  Jiye Liang,et al.  Ieee Transactions on Knowledge and Data Engineering 1 a Group Incremental Approach to Feature Selection Applying Rough Set Technique , 2022 .