Data-Distribution-Aware Fuzzy Rough Set Model and its Application to Robust Classification

Fuzzy rough sets (FRSs) are considered to be a powerful model for analyzing uncertainty in data. This model encapsulates two types of uncertainty: 1) fuzziness coming from the vagueness in human concept formation and 2) roughness rooted in the granulation coming with human cognition. The rough set theory has been widely applied to feature selection, attribute reduction, and classification. However, it is reported that the classical FRS model is sensitive to noisy information. To address this problem, several robust models have been developed in recent years. Nevertheless, these models do not consider a statistical distribution of data, which is an important type of uncertainty. Data distribution serves as crucial information for designing an optimal classification or regression model. Thus, we propose a data-distribution-aware FRS model that considers distribution information and incorporates it in computing lower and upper fuzzy approximations. The proposed model considers not only the similarity between samples, but also the probability density of classes. In order to demonstrate the effectiveness of the proposed model, we design a new sample evaluation index for prototype-based classification based on the model, and a prototype selection algorithm is developed using this index. Furthermore, a robust classification algorithm is constructed with prototype covering and nearest neighbor classification. Experimental results confirm the robustness and effectiveness of the proposed model.

[1]  Qinghua Hu,et al.  Fuzzy probabilistic approximation spaces and their information measures , 2006, IEEE Transactions on Fuzzy Systems.

[2]  Qinghua Hu,et al.  Soft Minimum-Enclosing-Ball Based Robust Fuzzy Rough Sets , 2012, Fundam. Informaticae.

[3]  F. Wilcoxon SOME RAPID APPROXIMATE STATISTICAL PROCEDURES , 1950 .

[4]  Francisco Herrera,et al.  OWA-FRPS: A Prototype Selection Method Based on Ordered Weighted Average Fuzzy Rough Set Theory , 2013, RSFDGrC.

[5]  Wei-Zhi Wu,et al.  Constructive and axiomatic approaches of fuzzy approximation operators , 2004, Inf. Sci..

[6]  Naresh Manwani,et al.  Noise Tolerance Under Risk Minimization , 2011, IEEE Transactions on Cybernetics.

[7]  De-gang Chen,et al.  The Model of Fuzzy Variable Precision Rough Sets , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[8]  Yiyu Yao,et al.  Two Semantic Issues in a Probabilistic Rough Set Model , 2011, Fundam. Informaticae.

[9]  Partha Garai,et al.  IT2 Fuzzy-Rough Sets and Max Relevance-Max Significance Criterion for Attribute Selection , 2015, IEEE Transactions on Cybernetics.

[10]  Yiyu Yao,et al.  Decision-Theoretic Rough Set Models , 2007, RSKT.

[11]  Qinghua Hu,et al.  Soft fuzzy rough sets for robust feature evaluation and selection , 2010, Inf. Sci..

[12]  Xizhao Wang,et al.  Building a Rule-Based Classifier—A Fuzzy-Rough Set Approach , 2010, IEEE Transactions on Knowledge and Data Engineering.

[13]  Qinghua Hu,et al.  Noise model based v-support vector regression with its application to short-term wind speed forecasting , 2014, Neural Networks.

[14]  Jianwu Dang,et al.  Fuzzy rough regression with application to wind speed prediction , 2014, Inf. Sci..

[15]  G. J. Gibson,et al.  On the decision regions of multilayer perceptrons , 1990, Proc. IEEE.

[16]  Yiyu Yao,et al.  A Decision Theoretic Framework for Approximating Concepts , 1992, Int. J. Man Mach. Stud..

[17]  Nehad N. Morsi,et al.  Axiomatics for fuzzy rough sets , 1998, Fuzzy Sets Syst..

[18]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[19]  Jiye Liang,et al.  International Journal of Approximate Reasoning Multigranulation Decision-theoretic Rough Sets , 2022 .

[20]  Partha Garai,et al.  Fuzzy–Rough Simultaneous Attribute Selection and Feature Extraction Algorithm , 2013, IEEE Transactions on Cybernetics.

[21]  Matthew Franchetti,et al.  Bayes Net Classifiers for Prediction of Renal Graft Status and Survival Period , 2010 .

[22]  Jianhua Dai,et al.  An Uncertainty Measure for Incomplete Decision Tables and Its Applications , 2013, IEEE Transactions on Cybernetics.

[23]  Dominik Slezak,et al.  Variable Precision Bayesian Rough Set Model , 2003, RSFDGrC.

[24]  Qinghua Hu,et al.  On Robust Fuzzy Rough Set Models , 2012, IEEE Transactions on Fuzzy Systems.

[25]  Patrick Haffner,et al.  Support vector machines for histogram-based image classification , 1999, IEEE Trans. Neural Networks.

[26]  Zhoujun Li,et al.  A novel variable precision (θ, σ)-fuzzy rough set model based on fuzzy granules , 2014, Fuzzy Sets Syst..

[27]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[28]  Witold Pedrycz,et al.  Granular Computing: Perspectives and Challenges , 2013, IEEE Transactions on Cybernetics.

[29]  Wen-Xiu Zhang,et al.  An axiomatic characterization of a fuzzy generalization of rough sets , 2004, Inf. Sci..

[30]  Qinghua Hu,et al.  A Novel Algorithm for Finding Reducts With Fuzzy Rough Sets , 2012, IEEE Transactions on Fuzzy Systems.

[31]  Bingzhen Sun,et al.  Soft fuzzy rough sets and its application in decision making , 2011, Artificial Intelligence Review.

[32]  Z. Pawlak,et al.  Rough membership functions , 1994 .

[33]  C. Cornelis,et al.  Vaguely Quantified Rough Sets , 2009, RSFDGrC.

[34]  Xizhao Wang,et al.  On the generalization of fuzzy rough sets , 2005, IEEE Transactions on Fuzzy Systems.

[35]  W. Marsden I and J , 2012 .

[36]  Alicja Mieszkowicz-Rolka,et al.  Variable Precision Fuzzy Rough Sets , 2004, Trans. Rough Sets.

[37]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[38]  Bingzhen Sun,et al.  Probabilistic rough set over two universes and rough entropy , 2012, Int. J. Approx. Reason..

[39]  Chris Cornelis,et al.  Ordered Weighted Average Based Fuzzy Rough Sets , 2010, RSKT.

[40]  Fei Chao,et al.  Feature Selection Inspired Classifier Ensemble Reduction , 2014, IEEE Transactions on Cybernetics.

[41]  Francisco Herrera,et al.  FRPS: A Fuzzy Rough Prototype Selection method , 2013, Pattern Recognit..

[42]  Jesús Manuel Fernández Salido,et al.  Rough set analysis of a general type of fuzzy data using transitive aggregations of fuzzy similarity relations , 2003, Fuzzy Sets Syst..