Choosing the best knowledge acquisition for dengue dataset based on rough set approach

Knowledge based system need quality knowledge to provide an accurate decision. Due to irrelevant and incorrect information of expert during knowledge acquisition, the information extraction from data mining approach can reduce the biases and misconceptions of multiple experts. Moreover, the real world data which is huge and incomplete has made it complicated to extract only the quality knowledge. Hence, this paper is to investigate the capability of rough set (RS) in generating quality knowledge for dengue dataset problem. Four RS reduct algorithms i.e Genetic Algorithm (GA), Johnson Reducer (JR), Exhaustive Calculation (EC) and Dynamic Reduct (DR) are compared. The comparisons are made based on the accuracy, rules quantity, and rules length. The dengue dataset contains 182 is obtained from Sungai Petani Hospital, Kedah, Malaysia. The result shows that the performances of all reduct algorithms have comparable accuracy and in several algorithms, they obviously generates more and longer rule. GA is the most accurate classifier with shorter rule. Similarly, JA produces comparable accuracy but distinctly generate lower number and shorter rule. EC and DR are capable to produce good accuracy but use more and longer rules to obtain the good accuracy. In conclusion, JA is chosen as the best method for knowledge acquisition for dengue dataset.

[1]  Jan G. Bazan Dynamic Reducts and Statistical Inference , 1996 .

[2]  Shusaku Tsumoto,et al.  Automated extraction of hierarchical decision rules from clinical databases using rough set model , 2003, Expert Syst. Appl..

[3]  Zdzisław Pawlak,et al.  Rough sets and data analysis , 1996, Soft Computing in Intelligent Systems and Information Processing. Proceedings of the 1996 Asian Fuzzy Systems Symposium.

[4]  F. Skov,et al.  Rule-based expert system. , 1990 .

[5]  Wojciech Ziarko,et al.  Discovery through rough set theory , 1999, Commun. ACM.

[6]  Heikki Mannila,et al.  A Perspective on Databases and Data Mining , 1995, KDD.

[7]  Aleksander Øhrn ROSETTA Technical Reference Manual , 2001 .

[8]  Sally Jo Cunningham,et al.  Using data mining to support the construction and maintenance of expert systems , 1993, Proceedings 1993 The First New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems.

[9]  Jiawei Han,et al.  Data Mining: Concepts and Techniques, Second Edition , 2006, The Morgan Kaufmann series in data management systems.

[10]  Azuraliza Abu Bakar Propositional Satisfiability Method in Rough Classification Modeling for Data Mining , 2002 .

[11]  Thomas W. Miller Data and Text Mining: A Business Applications Approach , 2004 .