Rule Evaluations, Attributes, and Rough Sets: Extension and a Case Study

Manually evaluating important and interesting rules generated from data is generally infeasible due to the large number of rules extracted. Different approaches such as rule interestingness measures and rule quality measures have been proposed and explored previously to extract interesting and high quality association rules and classification rules. Rough sets theory was originally presented as an approach to approximate concepts under uncertainty. In this paper, we explore rough sets based rule evaluation approaches in knowledge discovery. We demonstrate rule evaluation approaches through a real-world geriatric care data set from Dalhousie Medical School. Rough set based rule evaluation approaches can be used in a straightforward way to rank the importance of the rules. One interesting system developed along these lies in HYRIS (HYbrid Rough sets Intelligent System). We introduce HYRIS through a case study on survival analysis using the geriatric care data set.

[1]  J. Klein,et al.  Survival Analysis: Techniques for Censored and Truncated Data , 1997 .

[2]  Andrew Kusiak,et al.  Predicting survival time for kidney dialysis patients: a data mining approach , 2005, Comput. Biol. Medicine.

[3]  Nick Cercone,et al.  ELEM2: A Learning System for More Accurate Classifications , 1998, Canadian Conference on AI.

[4]  Tsau Young Lin,et al.  A New Rough Sets Model Based on Database Systems , 2003, Fundam. Informaticae.

[5]  Andrzej Skowron,et al.  Transactions on Rough Sets V , 2006, Trans. Rough Sets.

[6]  Xiaohua Hu,et al.  DBROUGH: A Rough Set Based Knowledge Discovery System , 1994, ISMIS.

[7]  Andrzej Skowron,et al.  Searching for the Complex Decision Reducts: The Case Study of the Survival Analysis , 2003, ISMIS.

[8]  Jerzy W. Grzymala-Busse,et al.  Analyzing the relation between heart rate, problem behavior, and environmental events using data mining system LERS , 2001, Proceedings 14th IEEE Symposium on Computer-Based Medical Systems. CBMS 2001.

[9]  Elisa T. Lee,et al.  Statistical Methods for Survival Data Analysis , 1994, IEEE Transactions on Reliability.

[10]  Aleksander Øhrn,et al.  Discernibility and Rough Sets in Medicine: Tools and Applications , 2000 .

[11]  S. Tsumoto,et al.  Rough set methods and applications: new developments in knowledge discovery in information systems , 2000 .

[12]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[13]  Nick Cercone,et al.  Rule Quality Measures for Rule Induction Systems: Description and Evaluation , 2001, Comput. Intell..

[14]  Sadaaki Miyamoto,et al.  Rough Sets and Current Trends in Computing , 2012, Lecture Notes in Computer Science.

[15]  Shusaku Tsumoto,et al.  Foundations of Intelligent Systems, 15th International Symposium, ISMIS 2005, Saratoga Springs, NY, USA, May 25-28, 2005, Proceedings , 2005, ISMIS.

[16]  Nick Cercone,et al.  Hybrid intelligent systems: selecting attributes for soft-computing analysis , 2005, 29th Annual International Computer Software and Applications Conference (COMPSAC'05).

[17]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[18]  Szymon Wilk,et al.  Rough Set Based Data Exploration Using ROSE System , 1999, ISMIS.

[19]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[20]  Zbigniew W. Ras,et al.  Methodologies for Intelligent Systems , 1991, Lecture Notes in Computer Science.

[21]  Jan G. Bazan,et al.  Rough set algorithms in classification problem , 2000 .

[22]  Gholamreza Nakhaeizadeh,et al.  Machine learning and statistics: the interface , 1996 .

[23]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[24]  Andrzej Skowron,et al.  Rough Set Approach to the Survival Analysis , 2002, Rough Sets and Current Trends in Computing.

[25]  Nick Cercone,et al.  Rule analysis with rough sets theory , 2006, 2006 IEEE International Conference on Granular Computing.

[26]  Ivo Düntsch,et al.  The Rough Set Engine GROBIAN , 1999 .

[27]  Marzena Kryszkiewicz,et al.  Finding Reducts in Composed Information Systems , 1993, RSKD.

[28]  Jiye Li,et al.  Introducing a Rule Importance Measure , 2006, Trans. Rough Sets.

[29]  Christian Borgelt,et al.  EFFICIENT IMPLEMENTATIONS OF APRIORI AND ECLAT , 2003 .

[30]  Jiye Li,et al.  Assigning missing attribute values based on rough sets theory , 2006, 2006 IEEE International Conference on Granular Computing.

[31]  Qiang Shen,et al.  Rough set-aided keyword reduction for text categorization , 2001, Appl. Artif. Intell..

[32]  Nick Cercone,et al.  Selecting Attributes for Soft-Computing Analysis in Hybrid Intelligent Systems , 2005, RSFDGrC.

[33]  Jiye Li,et al.  Discovering and ranking important rules , 2005, 2005 IEEE International Conference on Granular Computing.