Mining A Primary Biliary Cirrhosis Dataset Using Rough Sets and a Probabilistic Neural Network

In this paper, a decision support system based on rough sets and a probabilistic neural network is presented. Rough sets were employed as they have the capacity to reduce the dimensionality of the dataset and also produce a set of readily understandable rules. A probabilistic neural network was also employed to classify this dataset, comparing the classification accuracy to that obtained with rough sets. We firstly evaluate the effectiveness of these machine learning algorithms on a real-life small biomedical dataset. The classification results indicate that both classifiers produce a high level of accuracy (87% or better). The rough sets algorithm produced a set of rules that are readily interpretable by a domain expert. The PNN algorithm produced a classifier that was robust to noise and missing values. These preliminary results indicate that the both rough sets and PNN machine learning approaches can be successfully applied synergistically to biomedical datasets that contain a variety of attribute types, missing values and multiple decision classes

[1]  Kenneth Revett,et al.  A Rough Sets Based Breast Cancer Decision Support System , 2005, METMBS.

[2]  Jakub Wroblewski,et al.  Theoretical Foundations of Order-Based Genetic Algorithms , 1996, Fundam. Informaticae.

[3]  P. Grambsch,et al.  Primary biliary cirrhosis: Prediction of short‐term survival based on repeated patient visits , 1994, Hepatology.

[4]  D. F. Specht,et al.  Probabilistic neural networks for classification, mapping, or associative memory , 1988, IEEE 1988 International Conference on Neural Networks.

[5]  K. Revett,et al.  A Breast Cancer Diagnosis System: A Combined Approach Using Rough Sets and Probabilistic Neural Networks , 2005, EUROCON 2005 - The International Conference on "Computer as a Tool".

[6]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[7]  A. Tromm,et al.  Long-term response of primary biliary cirrhosis (stage I) to therapy with ursodeoxycholic acid. , 2005, Hepato-gastroenterology.

[8]  K. Revett,et al.  Data mining the PIMA dataset using rough set theory with a special emphasis on rule reduction , 2004, 8th International Multitopic Conference, 2004. Proceedings of INMIC 2004..

[9]  Dominik Slezak,et al.  Approximate Entropy Reducts , 2002, Fundam. Informaticae.