XGBoost Model for Chronic Kidney Disease Diagnosis

Chronic Kidney Disease (CKD) is a menace that is affecting 10 percent of the world population and 15 percent of the South African population. The early and cheap diagnosis of this disease with accuracy and reliability will save 20,000 lives in South Africa per year. Scientists are developing smart solutions with Artificial Intelligence (AI). In this paper, several typical and recent AI algorithms are studied in the context of CKD and the extreme gradient boosting (XGBoost) is chosen as our base model for its high performance. Then, the model is optimized and the optimal full model trained on all the features achieves a testing accuracy, sensitivity, and specificity of 1.000, 1.000, and 1.000, respectively. Note that, to cover the widest range of people, the time and monetary costs of CKD diagnosis have to be minimized with fewest patient tests. Thus, the reduced model using fewer features is desirable while it should still maintain high performance. To this end, the set-theory based rule is presented which combines a few feature selection methods with their collective strengths. The reduced model using about a half of the original full features performs better than the models based on individual feature selection methods and achieves accuracy, sensitivity and specificity, of 1.000, 1.000, and 1.000, respectively.

[1]  John A. Stankovic,et al.  Detection of Chronic Kidney Disease and Selecting Important Predictive Attributes , 2016, 2016 IEEE International Conference on Healthcare Informatics (ICHI).

[2]  Meng Joo Er,et al.  A Survey of Adaptive Fuzzy Controllers: Nonlinearities and Classifications , 2016, IEEE Transactions on Fuzzy Systems.

[3]  Qing-Guo Wang,et al.  Enhanced XGBoost-Based Automatic Diagnosis System for Chronic Kidney Disease , 2018, 2018 IEEE 14th International Conference on Control and Automation (ICCA).

[4]  Lixun Zhang,et al.  Experiment study of fuzzy impedance control on horizontal lower limbs rehabilitation robot , 2011, 2011 International Conference on Electronics, Communications and Control (ICECC).

[5]  Majid A. Al-Taee,et al.  Clinical decision support system for diagnosis and management of Chronic Renal Failure , 2013, 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT).

[6]  Sudan Basnet,et al.  A novel fuzzy-logic controller for an artificial heart , 2009, 2009 IEEE Control Applications, (CCA) & Intelligent Control, (ISIC).

[7]  Samuel Oluwarotimi Williams,et al.  Hybrid Intelligent System for the Diagnosis of Typhoid Fever , 2013 .

[8]  N.K. Swain,et al.  A Survey of Application of Fuzzy Logic in Intelligent Transportation Systems (ITS) and Rural ITS , 2006, Proceedings of the IEEE SoutheastCon 2006.

[9]  De-Shuang Huang,et al.  Identifying Stages of Kidney Renal Cell Carcinoma by Combining Gene Expression and DNA Methylation Data , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[10]  Abdulkader Helwan,et al.  Neural network diagnosis of heart disease , 2015, 2015 International Conference on Advances in Biomedical Engineering (ICABME).

[11]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[12]  S. Naicker,et al.  End-stage renal disease in sub-Saharan and South Africa. , 2003, Kidney international. Supplement.

[13]  Mei-Hui Wang,et al.  A Fuzzy Expert System for Diabetes Decision Support Application , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14]  Rajeshwar Singh,et al.  Use of soft computing techniques in medical decision making: A survey , 2014, 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[15]  Shawn K. McGuire,et al.  Automated microscopy and machine learning for expert-level malaria field diagnosis , 2015, 2015 IEEE Global Humanitarian Technology Conference (GHTC).

[16]  Nikola Bogunovic,et al.  A review of feature selection methods with applications , 2015, 2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[17]  S. Naicker,et al.  Important causes of chronic kidney disease in South Africa. , 2015, South African medical journal = Suid-Afrikaanse tydskrif vir geneeskunde.

[18]  Rashidah Funke Olanrewaju,et al.  Application of neural networks in early detection and diagnosis of Parkinson's disease , 2014, 2014 International Conference on Cyber and IT Service Management (CITSM).

[19]  Jun Shi,et al.  Fused Hierarchical Neural Networks for Cardiovascular Disease Diagnosis , 2012, IEEE Sensors Journal.

[20]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[21]  Dimitrios I. Fotiadis,et al.  A Framework for Fuzzy Expert System Creation—Application to Cardiovascular Diseases , 2007, IEEE Transactions on Biomedical Engineering.

[22]  Qiang Zheng,et al.  Transfer learning for diagnosis of congenital abnormalities of the kidney and urinary tract in children based on ultrasound imaging data , 2017, 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018).

[23]  A. Parrish,et al.  Advances in Chronic Kidney Disease , 2016, International journal of molecular sciences.

[24]  Leonardo Yunda,et al.  A Web-based Fuzzy Inference System Based Tool for Cardiovascular Disease Risk Assessment , 2015 .

[25]  Kemal Maulana Alhasa,et al.  Modeling of Tropospheric Delays Using ANFIS , 2015 .

[26]  Ritu Tiwari,et al.  Expert system for breast cancer diagnosis: A survey , 2016, 2016 International Conference on Computer Communication and Informatics (ICCCI).

[27]  Kwang Sun Ryu,et al.  Determining minimum feature number of classification on clear cell renal cell carcinoma clinical dataset , 2016, 2016 International Conference on Machine Learning and Cybernetics (ICMLC).

[28]  Vladik Kreinovich,et al.  Fuzzy logic and its applications in medicine , 2001, Int. J. Medical Informatics.

[29]  Kemal Polat,et al.  Detection of ECG Arrhythmia using a differential expert system approach based on principal component analysis and least square support vector machine , 2007, Appl. Math. Comput..

[30]  Qi Xuan,et al.  Multiview Generative Adversarial Network and Its Application in Pearl Classification , 2019, IEEE Transactions on Industrial Electronics.

[31]  Sonali Agarwal,et al.  Clinical decision support system for diabetes disease diagnosis using optimized neural network , 2014, 2014 Students Conference on Engineering and Systems.

[32]  B. Spyropoulos,et al.  A Neuro-Fuzzy Controller for the estimation of Tidal Volume and Respiration Frequency ventilator settings for COPD patients ventilated in control mode , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[33]  T. Łukaszuk,et al.  APPLICATION OF THE RECURSIVE FEATURE ELIMINATION AND THE RELAXED LINEAR SEPARABILITY FEATURE SELECTION ALGORITHMS TO GENE EXPRESSION DATA ANALYSIS , 2013 .

[34]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[35]  Matti Pietikäinen,et al.  Thorax disease diagnosis using deep convolutional neural network , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).