Rough Set Theory with Applications to Data Mining

This paper is an introduction to rough set theory with an emphasis on applications to data mining. First, consistent data are discussed, including blocks of attribute-value pairs, reducts of information tables, indiscernibility relation, decision tables, and global and local coverings. Rule induction algorithms LEM1 and LEM2 are presented. Then the rough set approach to inconsistent data is introduced, with lower and upper approximations and certain and possible rule sets. The last topic is a rough set approach to incomplete data. How to define modified blocks of attribute-value pairs, characteristic sets, and characteristic relation are explained. Additionally, two definitions of definability and three definitions of approximations are presented. Finally, some remarks about applications of the LERS data mining system are included.

[1]  Jerzy W. Grzymala-Busse,et al.  Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction , 2004, Trans. Rough Sets.

[2]  Jerzy W. Grzymala-Busse,et al.  Characteristic Relations for Incomplete Data: A Generalization of the Indiscernibility Relation , 2005, Trans. Rough Sets.

[3]  Daniel Vanderpooten,et al.  A Generalized Definition of Rough Approximations Based on Similarity , 2000, IEEE Trans. Knowl. Data Eng..

[4]  T. Y. Lin,et al.  NEIGHBORHOOD SYSTEMS AND APPROXIMATION IN RELATIONAL DATABASES AND KNOWLEDGE BASES , 1989 .

[5]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[6]  Jerzy W. Grzymala-Busse,et al.  Rough Set Strategies to Data with Missing Attribute Values , 2006, Foundations and Novel Approaches in Data Mining.

[7]  Jerzy W. Grzymala-Busse,et al.  Knowledge acquisition under uncertainty — a rough set approach , 1988, J. Intell. Robotic Syst..

[8]  Jerzy W. Grzymala-Busse,et al.  A New Version of the Rule Induction System LERS , 1997, Fundam. Informaticae.

[9]  Roman Słowiński,et al.  Dealing with Missing Data in Rough Set Analysis of Multi-Attribute and Multi-Criteria Decision Problems , 2000 .

[10]  Jerzy W. Grzymala-Busse,et al.  Rough Set Approach to Incomplete Data , 2004, ICAISC.

[11]  Jerzy W. Grzymala-Busse,et al.  A Comparison of Several Approaches to Missing Attribute Values in Data Mining , 2000, Rough Sets and Current Trends in Computing.

[12]  Marzena Kryszkiewicz,et al.  Rules in Incomplete Information Systems , 1999, Inf. Sci..

[13]  Yiyu Yao,et al.  Relational Interpretations of Neigborhood Operators and Rough Set Approximation Operators , 1998, Inf. Sci..

[14]  Alexis Tsoukiàs,et al.  On the Extension of Rough Sets under Incomplete Information , 1999, RSFDGrC.

[15]  Yiyu Yao,et al.  Two views of the theory of rough sets in finite universes , 1996, Int. J. Approx. Reason..

[16]  Jerzy W. Grzymala-Busse,et al.  On the Unknown Attribute Values in Learning from Examples , 1991, ISMIS.

[17]  J. Grzymala-Busse Managing uncertainty in expert systems , 1991 .

[18]  Nada Lavrac,et al.  The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains , 1986, AAAI.

[19]  Marzena Kryszkiewicz,et al.  Rough Set Approach to Incomplete Information Systems , 1998, Inf. Sci..

[20]  Alexis Tsoukiàs,et al.  Incomplete Information Tables and Rough Classification , 2001, Comput. Intell..

[21]  Jerzy W. Grzymala-Busse,et al.  LERS-A System for Learning from Examples Based on Rough Sets , 1992, Intelligent Decision Support.

[22]  Roman Słowiński,et al.  Intelligent Decision Support , 1992, Theory and Decision Library.

[23]  Yiyu Yao,et al.  On Generalizing Rough Set Theory , 2003, RSFDGrC.

[24]  J. Grzymala-Busse,et al.  Rough Set Approaches to Rule Induction from Incomplete Data , 2004 .

[25]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .