Diagnosis of Chronic Kidney Disease Based on Support Vector Machine by Feature Selection Methods

As Chronic Kidney Disease progresses slowly, early detection and effective treatment are the only cure to reduce the mortality rate. Machine learning techniques are gaining significance in medical diagnosis because of their classification ability with high accuracy rates. The accuracy of classification algorithms depend on the use of correct feature selection algorithms to reduce the dimension of datasets. In this study, Support Vector Machine classification algorithm was used to diagnose Chronic Kidney Disease. To diagnose the Chronic Kidney Disease, two essential types of feature selection methods namely, wrapper and filter approaches were chosen to reduce the dimension of Chronic Kidney Disease dataset. In wrapper approach, classifier subset evaluator with greedy stepwise search engine and wrapper subset evaluator with the Best First search engine were used. In filter approach, correlation feature selection subset evaluator with greedy stepwise search engine and filtered subset evaluator with the Best First search engine were used. The results showed that the Support Vector Machine classifier by using filtered subset evaluator with the Best First search engine feature selection method has higher accuracy rate (98.5%) in the diagnosis of Chronic Kidney Disease compared to other selected methods.

[1]  Asha Gowda Karegowda,et al.  Feature Subset Selection Problem using Wrapper Approach in Supervised Learning , 2010 .

[2]  Tommaso Di Noia,et al.  An end stage kidney disease predictor based on an artificial neural networks ensemble , 2013, Expert Syst. Appl..

[3]  T. John Peter,et al.  Study and Development of Novel Feature Selection Framework for Heart Disease Prediction , 2012 .

[4]  Jian Huang,et al.  Penalized feature selection and classification in bioinformatics , 2008, Briefings Bioinform..

[5]  Taghi M. Khoshgoftaar,et al.  Optimizing Wrapper-Based Feature Selection for Use on Bioinformatics Data , 2014, FLAIRS.

[6]  Manas Ranjan Patra,et al.  Augmenting Weighted Average with Confusion Matrix to Enhance Classification Accuracy , 2014 .

[7]  L. Ladha,et al.  FEATURE SELECTION METHODS AND ALGORITHMS , 2011 .

[8]  J. Ramírez,et al.  SVM-based computer-aided diagnosis of the Alzheimer's disease using t-test NMSE feature selection with feature correlation weighting , 2009, Neuroscience Letters.

[9]  Patrick Van Damme,et al.  Application of genetic algorithm and greedy stepwise to select input variables in classification tree models for the prediction of habitat requirements of Azolla filiculoides (Lam.) in Anzali wetland, Iran , 2013 .

[10]  Tauseef Ibne Mamun,et al.  An Analytical Comparison on Filter Feature Extraction Method in Data Mining using J48 Classifier , 2015 .

[11]  José Neves,et al.  A Soft Computing Approach to Kidney Diseases Evaluation , 2015, Journal of Medical Systems.

[12]  S. Appavu alias Balamurugan,et al.  A Novel Feature Selection Technique for Improved Survivability Diagnosis of Breast Cancer , 2015 .

[13]  Osiris Villacampa Feature Selection and Classification Methods for Decision Making: A Comparative Analysis , 2015 .

[14]  K. Hajian‐Tilaki,et al.  Receiver Operating Characteristic (ROC) Curve Analysis for Medical Diagnostic Test Evaluation. , 2013, Caspian journal of internal medicine.

[15]  Jennifer S Yeom,et al.  Textile Fingerprinting for Dismount Analysis in the Visible, Near, and Shortwave Infrared Domain , 2014 .

[16]  Laetitia Vermeulen-Jourdan,et al.  Feature Selection Using Tabu Search with Learning Memory: Learning Tabu Search , 2016, LION.

[17]  Ayu Purwarianti,et al.  Analyzing bandung public mood using Twitter data , 2016, 2016 4th International Conference on Information and Communication Technology (ICoICT).

[18]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[19]  Yanling Zhang,et al.  Prediction of hepatotoxicity of traditional Chinese medicine compounds by support vector machine approach , 2014, 2014 8th International Conference on Systems Biology (ISB).

[20]  Chao-Ton Su,et al.  Feature selection for the SVM: An application to hypertension diagnosis , 2008, Expert Syst. Appl..

[21]  Lizhuang Ma,et al.  Feature selection and syndrome prediction for liver cirrhosis in traditional Chinese medicine , 2009, Comput. Methods Programs Biomed..

[22]  Mehmet Fatih Akay,et al.  Support vector machines combined with feature selection for breast cancer diagnosis , 2009, Expert Syst. Appl..

[23]  Juanying Xie,et al.  Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases , 2011, Expert Syst. Appl..

[24]  Mu-Yen Chen,et al.  Integrating data mining with case-based reasoning for chronic diseases prognosis and diagnosis , 2007, Expert Syst. Appl..

[25]  Sri Ramakrishna,et al.  FEATURE SELECTION METHODS AND ALGORITHMS , 2011 .

[26]  Juan de Oña,et al.  A method for simplifying the analysis of traffic accidents injury severity on two-lane highways using Bayesian networks. , 2011, Journal of safety research.

[27]  M. Sivabalakrishnan,et al.  Feature Selection of Gene Expression Data for Cancer Classification: A Review , 2015 .

[28]  Sun I. Kim,et al.  Application of irregular and unbalanced data to predict diabetic nephropathy using visualization and feature selection methods , 2008, Artif. Intell. Medicine.

[29]  Tripti Swarnkar,et al.  Filter versus Wrapper Feature Subset Selection in Large Dimensionality Micro array : A Review , 2011 .

[30]  Charles E. McCulloch,et al.  Chronic kidney disease and the risks of death, cardiovascular events, and hospitalization. , 2004, The New England journal of medicine.

[31]  Andreas Zell,et al.  Prediction of breast cancer by profiling of urinary RNA metabolites using Support Vector Machine-based feature selection , 2009, BMC Cancer.

[32]  Ljiljana Trajkovic,et al.  Performance evaluation of BGP anomaly classifiers , 2015, 2015 Third International Conference on Digital Information, Networking, and Wireless Communications (DINWC).

[33]  Rina Dechter,et al.  Generalized best-first search strategies and the optimality of A* , 1985, JACM.

[34]  Arif Gülten,et al.  Genetic algorithm wrapped Bayesian network feature selection applied to differential diagnosis of erythemato-squamous diseases , 2013, Digit. Signal Process..

[35]  K. Usha Rani,et al.  ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS , 2011 .

[36]  Zhuoyong Zhang,et al.  Clinical risk assessment of patients with chronic kidney disease by using clinical data and multivariate models , 2016, International Urology and Nephrology.