Supervised Learning: A Generalized Rough Set Approach

Classification rules induction is a central problem addressed by machine learning and data mining. Rough sets theory is an important tool for data classification. Traditional rough sets approach, however, pursuits the fully correct or certain classification rules without considering other factors such as uncertain class labeling, importance of examples, as well as the uncertainty of the final rules. A generalized rough sets model, GRS, is proposed and a classification rules induction approach based on GRS is suggested. Our approach extends the variable precision rough sets model and attempts to reduce the influence of noise by considering the importance of each training example and handling the uncertain class labels. The final classification rules are also measured with the uncertainty factor.

[1]  C. J. V. Rijsbergen,et al.  Rough Sets, Fuzzy Sets and Knowledge Discovery , 1994, Workshops in Computing.

[2]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[3]  Xiaohua Hu,et al.  Mining knowledge rules from databases: a rough set approach , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[4]  T. Y. Lin,et al.  Rough Sets and Data Mining , 1997, Springer US.

[5]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[6]  S. K. Michael Wong,et al.  Comparison of Rough-Set and Statistical Methods in Inductive Learning , 1986, Int. J. Man Mach. Stud..

[7]  Wojciech Ziarko,et al.  Variable Precision Rough Set Model , 1993, J. Comput. Syst. Sci..