Majority Voting Algorithm for Diagnosing of Imbalanced Malaria Disease

Vector-borne diseases such as malaria are among the most pressing problems in the medical domain. Accurately identifying and classifying patients from a given set of samples is challenging when the dataset is imbalanced. Many conventional machine learning and data mining algorithms perform poorly on skewed data because they are trained predominantly on majority-class samples and therefore learn the minority class poorly. This paper proposes an ensemble method, majority voting, built from a set of machine learning algorithms: the C4.5 decision tree, Naive Bayes, and K-Nearest Neighbor (KNN) classifiers. Each sample is assigned the class predicted by the majority of the classifiers. Experimental results show that the voting ensemble achieves a classification accuracy of 95.2% on imbalanced malaria data and 92.1% on balanced malaria data. The voting ensemble also reports 100% precision, recall, and F1-score on the imbalanced malaria data sets, and 96% precision, recall, and F1-score on the balanced malaria data.
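The voting rule described in the abstract can be illustrated with a minimal sketch. It assumes hard (label-level) voting over the outputs of three already-trained base classifiers; the abstract does not say whether votes are weighted or how ties are broken. The function name `majority_vote` and the prediction lists are hypothetical and exist only for illustration; they are not taken from the paper.

```python
from collections import Counter


def majority_vote(per_classifier_predictions):
    """Combine label predictions by hard majority voting.

    per_classifier_predictions: a list of equal-length label lists,
    one list per base classifier.
    """
    n_samples = len(per_classifier_predictions[0])
    if any(len(p) != n_samples for p in per_classifier_predictions):
        raise ValueError("all classifiers must predict the same number of samples")

    combined = []
    # zip(*...) groups the votes cast for each sample across classifiers.
    for votes in zip(*per_classifier_predictions):
        # Pick the label with the most votes. With three classifiers and
        # two classes, a tie cannot occur.
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined


# Hypothetical outputs of the three base classifiers on five samples
# (1 = malaria positive, 0 = negative); illustrative values, not paper data.
tree_pred = [1, 0, 1, 1, 0]   # stand-in for the C4.5 decision tree
bayes_pred = [1, 0, 0, 1, 0]  # stand-in for Naive Bayes
knn_pred = [0, 0, 1, 1, 1]    # stand-in for KNN

ensemble_pred = majority_vote([tree_pred, bayes_pred, knn_pred])
print(ensemble_pred)  # [1, 0, 1, 1, 0]
```

In this sketch a single dissenting classifier is overruled whenever the other two agree. That is the intuition behind combining base learners that may each be biased toward the majority class in different ways.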
