A new proposed feature selection method to predict kidney transplantation outcome

Kidney transplantation graft survival prediction is important because of the difficulty of finding the organs. The exact prediction of kidney transplantation outcome is still not accurate even with the enhancements in acute rejection results. Machine learning methods introduce many ways to solve the kidney transplantation prediction problem than that of other methods. The power of any prediction method relies on the choosing of the proper variables. Feature selection is one of the important preprocessing procedures. It is the method that selects the minimal suitable variables that introduced in a set of features. This paper introduced a new proposed feature selection method that combines statistical methods with classification procedures of data mining technology to predict the probability of graft survival after kidney transplantation. Univariate analysis using Kaplan-Meier survival analysis method combined with Naive Bayes classifier was used to specify the significant variables. Three data mining tools, namely naive Bayes, decision tree and K-nearest neighbor classifiers were utilized to examine the instances of kidney transplantation, and their accuracy was compared with using the new proposed feature selection method and without using it. Experimental results have presented that the new proposed feature selection method have better results than other techniques.

[1]  M. Kattan Comparison of Cox regression with other methods for determining prediction models and nomograms. , 2003, The Journal of urology.

[2]  Dursun Delen,et al.  Predicting the graft survival for heart-lung transplantation patients: An integrated data mining methodology , 2009, Int. J. Medical Informatics.

[3]  Joyce A. Mitchell,et al.  Countering imbalanced datasets to improve adverse drug event predictive models in labor and delivery , 2009, J. Biomed. Informatics.

[4]  James F. Antaki,et al.  Prognosis of Right Ventricular Failure in Patients With Left Ventricular Assist Device Based on Decision Tree With SMOTE , 2012, IEEE Transactions on Information Technology in Biomedicine.

[5]  Mevlut Ture,et al.  Comparing classification techniques for predicting essential hypertension , 2005, Expert Syst. Appl..

[6]  Ali Selamat,et al.  A survey on software fault detection based on different prediction approaches , 2014, Vietnam Journal of Computer Science.

[7]  H. Tsubouchi,et al.  Algorithm to determine the outcome of patients with acute liver failure: a data-mining analysis using decision trees , 2012, Journal of Gastroenterology.

[8]  M. J. Norušis,et al.  SPSS 14.0 Advanced Statistical Procedures Companion , 2005 .

[9]  Ali Dag,et al.  Predicting heart transplantation outcomes through data analytics , 2017, Decis. Support Syst..

[10]  Sergey Krikov,et al.  Predicting Kidney Transplant Survival Using Tree-Based Modeling , 2007, ASAIO journal.

[11]  Rolf Stadler,et al.  Discovering Data Mining: From Concept to Implementation , 1997 .

[12]  Aise Zülal Sevkli,et al.  Predicting quality of life for lung transplant recipients: A hybrid genetic algorithms-based methodology , 2017, 2017 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT).

[13]  Ayman El-Sayed,et al.  Predicting kidney transplantation outcome based on hybrid feature selection and KNN classifier , 2019, Multimedia Tools and Applications.

[14]  Mesut Remzi,et al.  Novel artificial neural network for early detection of prostate cancer. , 2002, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[15]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[16]  M. Zand,et al.  Minimizing Morbidity of Organ Donation: Analysis of Factors for Perioperative Complications After Living-Donor Nephrectomy in the United States , 2008, Transplantation.

[17]  Andrew Kusiak,et al.  Predicting survival time for kidney dialysis patients: a data mining approach , 2005, Comput. Biol. Medicine.

[18]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.

[19]  G. Eknoyan,et al.  National Kidney Foundation Practice Guidelines for Chronic Kidney Disease: Evaluation, Classification, and Stratification , 2003, Annals of Internal Medicine.

[20]  R. Wolfe,et al.  Comparison of mortality in all patients on dialysis, patients on dialysis awaiting transplantation, and recipients of a first cadaveric transplant. , 1999, The New England journal of medicine.

[21]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[22]  Ali Serhan Koyuncugil,et al.  Detecting Road Maps for Capacity Utilization Decisions by Clustering Analysis and CHAID Decision Tress , 2010, Journal of Medical Systems.

[23]  A. Monaco,et al.  Utility of a Mathematical Nomogram to Predict Delayed Graft Function: A Single-Center Experience , 2006, Transplantation.

[24]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[25]  B. Brenner,et al.  Early experience with dual kidney transplantation in adults using expanded donor criteria. Double Kidney Transplant Group (DKG). , 1999, Journal of the American Society of Nephrology : JASN.