On Robust Fuzzy Rough Set Models

Rough sets, especially fuzzy rough sets, are supposedly a powerful mathematical tool to deal with uncertainty in data analysis. This theory has been applied to feature selection, dimensionality reduction, and rule learning. However, it is pointed out that the classical model of fuzzy rough sets is sensitive to noisy information, which is considered as a main source of uncertainty in applications. This disadvantage limits the applicability of fuzzy rough sets. In this paper, we reveal why the classical fuzzy rough set model is sensitive to noise and how noisy samples impose influence on fuzzy rough computation. Based on this discussion, we study the properties of some current fuzzy rough models in dealing with noisy data and introduce several new robust models. The properties of the proposed models are also discussed. Finally, a robust classification algorithm is designed based on fuzzy lower approximations. Some numerical experiments are given to illustrate the effectiveness of the models. The classifiers that are developed with the proposed models achieve good generalization performance.

[1]  Marina V. Fomina,et al.  Problem of knowledge discovery in noisy databases , 2011, Int. J. Mach. Learn. Cybern..

[2]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[3]  S. Pal,et al.  Rough-Fuzzy C-Medoids Algorithm and Selection of Bio-Basis for Amino Acid Sequence Analysis , 2007, IEEE Transactions on Knowledge and Data Engineering.

[4]  Witold Pedrycz,et al.  The Development of Fuzzy Rough Sets with the Use of Structures and Algebras of Axiomatic Fuzzy Sets , 2009, IEEE Transactions on Knowledge and Data Engineering.

[5]  Haoyang Wu,et al.  An Interval Type-2 Fuzzy Rough Set Model for Attribute Reduction , 2009, IEEE Transactions on Fuzzy Systems.

[6]  Qiang Shen,et al.  Fuzzy-Rough Sets Assisted Attribute Selection , 2007, IEEE Transactions on Fuzzy Systems.

[7]  T. Hong,et al.  Learning a coverage set of maximally general fuzzy rules by rough sets , 2000 .

[8]  Yixin Chen,et al.  Outlier Detection with the Kernelized Spatial Depth Function , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Lei Zhou,et al.  On characterization of intuitionistic fuzzy rough sets based on intuitionistic fuzzy implicators , 2009, Inf. Sci..

[10]  Witold Pedrycz,et al.  Gaussian kernel based fuzzy rough sets: Model, uncertainty measures and applications , 2010, Int. J. Approx. Reason..

[11]  Wei-Zhi Wu,et al.  Constructive and axiomatic approaches of fuzzy approximation operators , 2004, Inf. Sci..

[12]  Xi-Zhao Wang,et al.  Improving Generalization of Fuzzy IF--THEN Rules by Maximizing Fuzzy Entropy , 2009, IEEE Transactions on Fuzzy Systems.

[13]  Michael Kearns,et al.  Efficient noise-tolerant learning from statistical queries , 1993, STOC.

[14]  Xizhao Wang,et al.  Learning fuzzy rules from fuzzy samples based on rough set technique , 2007, Inf. Sci..

[15]  Malcolm J. Beynon,et al.  Reducts within the variable precision rough sets model: A further investigation , 2001, Eur. J. Oper. Res..

[16]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[17]  Jun-Hai Zhai,et al.  Fuzzy decision tree based on fuzzy-rough technique , 2011, Soft Comput..

[18]  Jonathan Lawry,et al.  Granular Knowledge Representation and Inference Using Labels and Label Expressions , 2010, IEEE Transactions on Fuzzy Systems.

[19]  William Zhu,et al.  Matroidal approaches to generalized rough sets based on relations , 2011, Int. J. Mach. Learn. Cybern..

[20]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[21]  Wen-Xiu Zhang,et al.  An axiomatic characterization of a fuzzy generalization of rough sets , 2004, Inf. Sci..

[22]  F. Mosteller,et al.  Understanding robust and exploratory data analysis , 1985 .

[23]  Jesús Manuel Fernández Salido,et al.  Rough set analysis of a general type of fuzzy data using transitive aggregations of fuzzy similarity relations , 2003, Fuzzy Sets Syst..

[24]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[25]  Witold Pedrycz,et al.  Fuzzy Clustering With Viewpoints , 2010, IEEE Transactions on Fuzzy Systems.

[26]  Xindong Wu,et al.  Mining With Noise Knowledge: Error-Aware Data Mining , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[27]  Qinghua Hu,et al.  Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation , 2007, Pattern Recognit..

[28]  Qinghua Hu,et al.  Soft fuzzy rough sets for robust feature evaluation and selection , 2010, Inf. Sci..

[29]  Lotfi A. Zadeh,et al.  Fuzzy logic = computing with words , 1996, IEEE Trans. Fuzzy Syst..

[30]  Rajen B. Bhatt,et al.  FRCT: fuzzy-rough classification trees , 2007, Pattern Analysis and Applications.

[31]  Wojciech Ziarko,et al.  Variable Precision Rough Set Model , 1993, J. Comput. Syst. Sci..

[32]  Amir Globerson,et al.  Nightmare at test time: robust learning by feature deletion , 2006, ICML.

[33]  Shie Mannor,et al.  Robustness and Regularization of Support Vector Machines , 2008, J. Mach. Learn. Res..

[34]  C. Cornelis,et al.  Vaguely Quantified Rough Sets , 2009, RSFDGrC.

[35]  Xizhao Wang,et al.  On the generalization of fuzzy rough sets , 2005, IEEE Transactions on Fuzzy Systems.

[36]  Yong Liu,et al.  Modeling Complex Architectures Based on Granular Computing on Ontology , 2010, IEEE Transactions on Fuzzy Systems.

[37]  Chris Cornelis,et al.  Ordered Weighted Average Based Fuzzy Rough Sets , 2010, RSKT.

[38]  Qiang Shen,et al.  New Approaches to Fuzzy-Rough Feature Selection , 2009, IEEE Transactions on Fuzzy Systems.

[39]  Wei-Zhi Wu,et al.  Approaches to knowledge reduction based on variable precision rough set model , 2004, Inf. Sci..

[40]  De-gang Chen,et al.  The Model of Fuzzy Variable Precision Rough Sets , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[41]  Qiang Shen,et al.  Semantics-preserving dimensionality reduction: rough and fuzzy-rough-based approaches , 2004, IEEE Transactions on Knowledge and Data Engineering.

[42]  Xingquan Zhu,et al.  Class Noise vs. Attribute Noise: A Quantitative Study , 2003, Artificial Intelligence Review.

[43]  Zhaohong Deng,et al.  Robust Relief-Feature Weighting, Margin Maximization, and Fuzzy Optimization , 2010, IEEE Transactions on Fuzzy Systems.

[44]  Hans-Dieter Kochs,et al.  Adapted variable precision rough set approach for EEG analysis , 2009, Artif. Intell. Medicine.

[45]  Chris Cornelis,et al.  Fuzzy Rough Sets: The Forgotten Step , 2007, IEEE Transactions on Fuzzy Systems.

[46]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[47]  Lotfi A. Zadeh,et al.  Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic , 1997, Fuzzy Sets Syst..

[48]  Jiye Liang,et al.  Information Granularity in Fuzzy Binary GrC Model , 2011, IEEE Transactions on Fuzzy Systems.

[49]  Xizhao Wang,et al.  Building a Rule-Based Classifier—A Fuzzy-Rough Set Approach , 2010, IEEE Transactions on Knowledge and Data Engineering.

[50]  Xizhao Wang,et al.  Attributes Reduction Using Fuzzy Rough Sets , 2008, IEEE Transactions on Fuzzy Systems.

[51]  Witold Pedrycz,et al.  Kernelized Fuzzy Rough Sets and Their Applications , 2011, IEEE Transactions on Knowledge and Data Engineering.

[52]  Yiyu Yao,et al.  Attribute reduction in decision-theoretic rough set models , 2008, Inf. Sci..

[53]  Xizhao Wang,et al.  Induction of multiple fuzzy decision trees based on rough set technique , 2008, Inf. Sci..

[54]  Melanie Hilario,et al.  Knowledge and Information Systems , 2007 .

[55]  Albert Fornells,et al.  A study of the effect of different types of noise on the precision of supervised learning techniques , 2010, Artificial Intelligence Review.

[56]  Chris Cornelis,et al.  A noise-tolerant approach to fuzzy-rough feature selection , 2008, 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence).

[57]  Alicja Mieszkowicz-Rolka,et al.  Variable Precision Fuzzy Rough Sets , 2004, Trans. Rough Sets.

[58]  Saso Dzeroski,et al.  Noise detection and elimination in data preprocessing: Experiments in medical domains , 2000, Appl. Artif. Intell..

[59]  Shourya Roy,et al.  How Much Noise Is Too Much: A Study in Automatic Text Classification , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).