Review on Data Mining Methods for Tuberculosis Diagnosis

Tuberculosis, which is the oldest human disease wit h the highest mortality rates among infectious diseases, continues to be the worl d's attention. Previous methods to diagnose tuberculosis are tuberculin test, Sputum-s mear microscopy and chest radiography. Unfortunately, these methods are time consuming and perform poorly. Furthermore, they require varied sensitivity, Mycob acterium tuberculosis bacilli alive, sputum which is difficult to obtain from chi ldren, trained personnel to avoid human error, and hence, high cost. Researchers keep developing accurate data mining methods for rapid Tuberculosis diagnosis to reduce the rate of growth of the world population of tuberculosis patients. This paper aims to provide state-of-the-art of data mining methods in diagnosing Tuberculosis u sing clinical symptoms as input parameters. First, it introduces tuberculosis and c urrent methods used for tuberculosis diagnosis. Then it discusses technique s for preprocessing data and data mining methods for tuberculosis diagnosis currently used. The result shows that the most frequently used variables are sweating at nigh t, more than 3 weeks of cough, fever, weight loss, age, and chest pain respectivel y. Support Vector Machine and Bayesian Network gave the highest accuracy compared to other methods.

[1]  P. Cardona,et al.  Understanding Tuberculosis - Global Experiences and Innovative Approaches to the Diagnosis , 2012 .

[2]  Adem Karahoca,et al.  Predicting existence of Mycobacterium tuberculosis on patients using data mining approaches , 2011, WCIT.

[3]  S. Kant,et al.  Pulmonary tuberculosis as differential diagnosis of lung cancer , 2012, South Asian Journal of Cancer.

[4]  Jiawei Han,et al.  Data Mining: Concepts and Techniques, Second Edition , 2006, The Morgan Kaufmann series in data management systems.

[5]  Wahidah Mansor,et al.  Review of mycobacterium tuberculosis detection , 2011, 2011 IEEE Control and System Graduate Research Colloquium.

[6]  Haihe Wang,et al.  Identification of M. tuberculosis complex by a novel hybridization signal amplification method , 2011, Proceedings 2011 International Conference on Human Health and Biomedical Engineering.

[7]  R. Lestari,et al.  The lung diseases diagnosis software: Influenza and Tuberculosis case studies in the cloud computing environment , 2012, 2012 International Conference on Cloud Computing and Social Networking (ICCCSN).

[8]  K. N. Balasubramanya Murthy,et al.  A Data Mining Approach to the Diagnosis of Tuberculosis by Cascading Clustering and Classification , 2011, ArXiv.

[9]  M. Y. Mashor,et al.  Detection of mycobacterium tuberculosis in Ziehl-Neelsen stained tissue images using Zernike moments and hybrid multilayered perceptron network , 2010, 2010 IEEE International Conference on Systems, Man and Cybernetics.

[10]  Adem Karahoca,et al.  Tuberculosis disease diagnosis by using adaptive neuro fuzzy inference system and rough sets , 2012, Neural Computing and Applications.

[11]  L. Gabbasova,et al.  Global tuberculosis report (2014) , 2014 .

[12]  M. Y. Mashor,et al.  A genetic algorithm-neural network approach for Mycobacterium tuberculosis detection in Ziehl-Neelsen stained tissue slide images , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[13]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[14]  Y. H. Kimbi A Decision Support System for Tuberculosis Diagnosis. , 2011 .

[15]  Andrew Kusiak,et al.  Data Mining: Medical and Engineering Case Studies , 2000 .

[16]  Dimitris Kanellopoulos,et al.  Data Preprocessing for Supervised Leaning , 2007 .

[17]  M. Bahadori,et al.  Common Challenges in Laboratory Diagnosis and Management of Tuberculosis , 2012 .

[18]  Brian Tracey,et al.  Cough detection algorithm for monitoring patient recovery from pulmonary tuberculosis , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[19]  S. Natarajan,et al.  Data Mining Techniques in the Diagnosis of Tuberculosis , 2012 .

[20]  H. Dag,et al.  Comparison of feature selection algorithms for medical data , 2012, 2012 International Symposium on Innovations in Intelligent Systems and Applications.

[21]  Nor Hazlyna Harun,et al.  Performance comparison between RGB and HSI linear stretching for tuberculosis bacilli detection in Ziehl-Neelsen tissue slide images , 2009, 2009 IEEE International Conference on Signal and Image Processing Applications.

[22]  M. Y. Mashor,et al.  Compact single hidden layer feedforward network for mycobacterium tuberculosis detection , 2011, 2011 IEEE International Conference on Control System, Computing and Engineering.

[23]  Faith-Michael E. Uzoka,et al.  A framework for cell phone based diagnosis and management of priority tropical diseases , 2011, 2011 IST-Africa Conference Proceedings.

[24]  Okure Udo Obot,et al.  Clinical decision support system (DSS) in the diagnosis of malaria: A case comparison of two soft computing methodologies , 2011, Expert Syst. Appl..