Using Genetic Algorithms to Optimise Rough Set Partition Sizes for HIV Data Analysis

In this paper, we present a method to optimise rough set partition sizes, to which rule extraction is performed on HIV (Human Immunodeficiency Virus) data. The genetic algorithm optimisation technique is used to determine the partition sizes of a rough set in order to maximise the rough sets prediction accuracy. The proposed method is tested on a set of six demographic properties of individuals obtained from the South African antenatal survey, with the outcome or decision being either HIV positive or negative. Rough set theory is chosen based on the fact that it is easy to interpret the extracted rules. The prediction accuracy of equal width bin partitioning is 69.8% while the accuracy achieved after optimising the partitions is 87.5%.

[1]  A. Ohrn,et al.  Rough sets: a knowledge discovery technique for multifactorial medical outcomes. , 2000, American journal of physical medicine & rehabilitation.

[2]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[3]  Frank Witlox,et al.  The application of rough sets analysis in activity-based modelling. Opportunities and constraints , 2004, Expert Syst. Appl..

[4]  B. L. Betechuoh,et al.  Autoencoder networks for HIV classification , 2006 .

[5]  Lucila Ohno-Machado,et al.  Comparison of multiple prediction models for ambulation following spinal cord injury , 1998, AMIA.

[6]  Michael W. Carter,et al.  Multi-level resource allocation for HIV prevention: A model for developing countries , 2007, Eur. J. Oper. Res..

[7]  Yangsheng Xu,et al.  Computational Intelligence for Modeling Human Sensations in Virtual Environments , 2004, J. Adv. Comput. Intell. Intell. Informatics.

[8]  Lixiang Shen,et al.  Fault diagnosis based on Rough Set Theory , 2003 .

[9]  Peter Noerr The Difficulty of Search and an Interesting Solution , 2007, IDC.

[10]  D. Celentano,et al.  The social epidemiology of human immunodeficiency virus/acquired immunodeficiency syndrome. , 2004, Epidemiologic reviews.

[11]  José M. Peña,et al.  Application of Rough Sets Algorithms to Prediction of Aircraft Component Failure , 1999, IDA.

[12]  Tshilidzi Marwala,et al.  Prediction of HIV Status from Demographic Data Using Neural Networks , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[13]  Aleksander Øhrn,et al.  Discernibility and Rough Sets in Medicine: Tools and Applications , 2000 .

[14]  Rob Law,et al.  Incorporating the rough sets theory into travel demand analysis , 2003 .

[15]  Reha Uzsoy,et al.  A genetic algorithm for minimizing maximum lateness on parallel identical batch processing machines with dynamic job arrivals and incompatible job families , 2007, Comput. Oper. Res..

[16]  James Cannady,et al.  Artificial Neural Networks for Misuse Detection , 1998 .

[17]  Masahiro Inuiguchi,et al.  Rough set based rule induction from two decision tables , 2007, Eur. J. Oper. Res..

[18]  N. Krieger,et al.  Understanding AIDS: historical interpretations and the limits of biomedical individualism. , 1993, American journal of public health.

[19]  Tshilidzi Marwala,et al.  Using Inverse Neural Network for HIV Adaptive Control , 2007 .

[20]  Alicja Wakulicz-Deja,et al.  Applying Rough Set Theory to Multi Stage Medical Diagnosing , 2003, Fundam. Informaticae.

[21]  Wojciech Ziarko,et al.  A methodology for stock market analysis utilizing rough set theory , 1995, Proceedings of 1995 Conference on Computational Intelligence for Financial Engineering (CIFEr).

[22]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.