Attribute selection with fuzzy decision reducts

Rough set theory provides a methodology for data analysis based on the approximation of concepts in information systems. It revolves around the notion of discernibility: the ability to distinguish between objects, based on their attribute values. It allows to infer data dependencies that are useful in the fields of feature selection and decision model construction. In many cases, however, it is more natural, and more effective, to consider a gradual notion of discernibility. Therefore, within the context of fuzzy rough set theory, we present a generalization of the classical rough set framework for data-based attribute selection and reduction using fuzzy tolerance relations. The paper unifies existing work in this direction, and introduces the concept of fuzzy decision reducts, dependent on an increasing attribute subset measure. Experimental results demonstrate the potential of fuzzy decision reducts to discover shorter attribute subsets, leading to decision models with a better coverage and with comparable, or even higher accuracy.

[1]  Martine De Cock,et al.  On (un)suitable fuzzy relations to model approximate equality , 2003, Fuzzy Sets Syst..

[2]  Chris Cornelis,et al.  Feature Selection with Fuzzy Decision Reducts , 2008, RSKT.

[3]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[4]  Andrzej Skowron,et al.  Rudiments of rough sets , 2007, Inf. Sci..

[5]  Jorma Rissanen,et al.  Minimum Description Length Principle , 2010, Encyclopedia of Machine Learning.

[6]  Salvatore Greco,et al.  Rough Set Processing of Vague Information Using Fuzzy Similarity Relations , 2000, Finite Versus Infinite.

[7]  Yiyu Yao,et al.  Data analysis based on discernibility and indiscernibility , 2007, Inf. Sci..

[8]  S. Tsumoto,et al.  Rough set methods and applications: new developments in knowledge discovery in information systems , 2000 .

[9]  Qiang Shen,et al.  Rough set-aided keyword reduction for text categorization , 2001, Appl. Artif. Intell..

[10]  Jing-Yu Yang,et al.  Dominance-based rough set approach and knowledge reductions in incomplete ordered information system , 2008, Inf. Sci..

[11]  Tsau Young Lin,et al.  Rough Sets and Data Mining: Analysis of Imprecise Data , 1996 .

[12]  Yiyu Yao Combination of Rough and Fuzzy Sets Based on α-Level Sets , 1997 .

[13]  Vladik Kreinovich,et al.  Handbook of Granular Computing , 2008 .

[14]  Xizhao Wang,et al.  Attribute Reduction Based on Fuzzy Rough Sets , 2007, RSEISP.

[15]  Jan G. Bazan,et al.  Rough set algorithms in classification problem , 2000 .

[16]  Dominik Ślęzak,et al.  Various approaches to reasoning with frequency based decision reducts: a survey , 2000 .

[17]  Ming Yang,et al.  A Novel Approach of Rough Set-Based Attribute Reduction Using Fuzzy Discernibility Matrix , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[18]  Xizhao Wang,et al.  Attributes Reduction Using Fuzzy Rough Sets , 2008, IEEE Transactions on Fuzzy Systems.

[19]  Alexis Tsoukiàs,et al.  Incomplete Information Tables and Rough Classification , 2001, Comput. Intell..

[20]  Yiyu Yao,et al.  Attribute reduction in decision-theoretic rough set models , 2008, Inf. Sci..

[21]  Hung Son Nguyen,et al.  Approximate Boolean Reasoning: Foundations and Applications in Data Mining , 2006, Trans. Rough Sets.

[22]  Jacques Teghem,et al.  Some Experiments to Compare Rough Sets Theory and Ordinal Statistical Methods , 1992, Intelligent Decision Support.

[23]  Andrzej Skowron,et al.  Rough sets and Boolean reasoning , 2007, Inf. Sci..

[24]  Jaroslaw Stepaniuk,et al.  Tolerance Information Granules , 2004, MSRAS.

[25]  Marko Robnik-Sikonja,et al.  Overcoming the Myopia of Inductive Learning Algorithms with RELIEFF , 2004, Applied Intelligence.

[26]  Lech Polkowski,et al.  Rough Mereology as a Link Between Rough and Fuzzy Set Theories. A Survey , 2004, Trans. Rough Sets.

[27]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[28]  Ian Witten,et al.  Data Mining , 2000 .

[29]  R. Słowiński Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory , 1992 .

[30]  Salvatore Greco,et al.  Rough approximation by dominance relations , 2002, Int. J. Intell. Syst..

[31]  Xizhao Wang,et al.  OFFSS: optimal fuzzy-valued feature subset selection , 2003, IEEE Trans. Fuzzy Syst..

[32]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[33]  Qiang Shen,et al.  Centre for Intelligent Systems and Their Applications Fuzzy Rough Attribute Reduction with Application to Web Categorization Fuzzy Rough Attribute Reduction with Application to Web Categorization Fuzzy Sets and Systems ( ) – Fuzzy–rough Attribute Reduction with Application to Web Categorization , 2022 .

[34]  David Maier,et al.  The Theory of Relational Databases , 1983 .

[35]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[36]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[37]  Chris Cornelis,et al.  Fuzzy Rough Sets: The Forgotten Step , 2007, IEEE Transactions on Fuzzy Systems.

[38]  Qinghua Hu,et al.  A new approach to attribute reduction of consistent and inconsistent covering decision systems with covering rough sets , 2007, Inf. Sci..

[39]  Anna Maria Radzikowska,et al.  A comparative study of fuzzy rough sets , 2002, Fuzzy Sets Syst..

[40]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[41]  Witold Pedrycz,et al.  Feature analysis through information granulation and fuzzy sets , 2002, Pattern Recognit..

[42]  Pawan Lingras,et al.  Survey of Rough and Fuzzy Hybridization , 2007, 2007 IEEE International Fuzzy Systems Conference.

[43]  P. Grünwald The Minimum Description Length Principle (Adaptive Computation and Machine Learning) , 2007 .

[44]  Eric C. C. Tsang,et al.  On fuzzy approximation operators in attribute reduction with fuzzy rough sets , 2008, Inf. Sci..

[45]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[46]  Sebastian Widz,et al.  Approximation Degrees in Decision Reduct-Based MRI Segmentation , 2007, 2007 Frontiers in the Convergence of Bioscience and Information Technologies.

[47]  Xizhao Wang,et al.  Learning fuzzy rules from fuzzy samples based on rough set technique , 2007, Inf. Sci..

[48]  Degang Chen,et al.  An approach of attributes reduction based on fuzzy TL rough sets , 2007, 2007 IEEE International Conference on Systems, Man and Cybernetics.

[49]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[50]  Andrzej Skowron,et al.  Rough sets: Some extensions , 2007, Inf. Sci..

[51]  Cristian S. Calude,et al.  Finite Versus Infinite: Contributions to an Eternal Dilemma , 2000 .

[52]  Dominik Lzak Degrees of conditional (in)dependence: A framework for approximate Bayesian networks and examples related to the rough set-based feature selection , 2009 .

[53]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[54]  Didier Dubois,et al.  Putting Rough Sets and Fuzzy Sets Together , 1992, Intelligent Decision Support.

[55]  Klaus Nordhausen,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition by Trevor Hastie, Robert Tibshirani, Jerome Friedman , 2009 .

[56]  Andrzej Skowron,et al.  The Discernibility Matrices and Functions in Information Systems , 1992, Intelligent Decision Support.

[57]  Qiang Shen,et al.  Fuzzy-Rough Sets Assisted Attribute Selection , 2007, IEEE Transactions on Fuzzy Systems.

[58]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[59]  Qinghua Hu,et al.  Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation , 2007, Pattern Recognit..

[60]  Chen Degang,et al.  A new approach to attribute reduction of consistent and inconsistent covering decision systems with covering rough sets , 2007 .

[61]  Salvatore Greco,et al.  Fuzzy Similarity Relation as a Basis for Rough Approximations , 1998, Rough Sets and Current Trends in Computing.

[62]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[63]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[64]  Jakub Wroblewski,et al.  Theoretical Foundations of Order-Based Genetic Algorithms , 1996, Fundam. Informaticae.

[65]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[66]  SunBingzhen,et al.  Fuzzy rough set theory for the interval-valued fuzzy information systems , 2008 .

[67]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[68]  Bernhard Ganter,et al.  Formal Concept Analysis: Mathematical Foundations , 1998 .

[69]  Yee Leung,et al.  Generalized fuzzy rough sets determined by a triangular norm , 2008, Inf. Sci..

[70]  Qiang Shen,et al.  New Approaches to Fuzzy-Rough Feature Selection , 2009, IEEE Transactions on Fuzzy Systems.

[71]  Chris Cornelis,et al.  Fuzzy Rough Sets: from Theory into Practice , 2008, GrC 2008.

[72]  Degang Chen,et al.  Fuzzy rough set theory for the interval-valued fuzzy information systems , 2008, Inf. Sci..