Applying data mining techniques to determine important parameters in chronic kidney disease and the relations of these parameters to each other

Introduction: Chronic kidney disease (CKD) includes a wide range of pathophysiological processes which will be observed along with abnormal function of kidneys and progressive decrease in glomerular filtration rate (GFR). According to the definition decreasing GFR must have been present for at least three months. CKD will eventually result in end-stage kidney disease. In this process different factors play role and finding the relations between effective parameters in this regard can help to prevent or slow progression of this disease. There are always a lot of data being collected from the patients’ medical records. This huge array of data can be considered a valuable source for analyzing, exploring and discovering information. Objectives: Using the data mining techniques, the present study tries to specify the effective parameters and also aims to determine their relations with each other in Iranian patients with CKD. Material and Methods: The study population includes 31996 patients with CKD. First, all of the data is registered in the database. Then data mining tools were used to find the hidden rules and relationships between parameters in collected data. Results: After data cleaning based on CRISP-DM (Cross Industry Standard Process for Data Mining) methodology and running mining algorithms on the data in the database the relationships between the effective parameters was specified. Conclusion: This study was done using the data mining method pertaining to the effective factors on patients with CKD.

[1]  Rüdiger Wirth,et al.  CRISP-DM: Towards a Standard Process Model for Data Mining , 2000 .

[2]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[3]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[4]  Andrew Kusiak,et al.  Predicting survival time for kidney dialysis patients: a data mining approach , 2005, Comput. Biol. Medicine.

[5]  G. Eknoyan,et al.  Definition and classification of chronic kidney disease: a position statement from Kidney Disease: Improving Global Outcomes (KDIGO). , 2005, Kidney international.

[6]  A. Kribben,et al.  Management of advanced chronic kidney disease in primary care – current data from Germany , 2006, International journal of clinical practice.

[7]  MusílekPetr,et al.  A survey of Knowledge Discovery and Data Mining process models , 2006 .

[8]  Selected data on chronic kidney disease in North Carolina. , 2008, North Carolina medical journal.

[9]  Illhoi Yoo,et al.  Data Mining in Healthcare and Biomedicine: A Survey of the Literature , 2012, Journal of Medical Systems.

[10]  Soni Jyoti,et al.  Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction , 2011 .

[11]  Mohammad Mehdi Sepehri,et al.  Implementation of Predictive Data Mining Techniques for Identifying Risk Factors of Early AVF Failure in Hemodialysis Patients , 2013, Comput. Math. Methods Medicine.

[12]  Understanding risks associated with chronic kidney disease: translating observational data to patient care. , 2013, Clinical chemistry.

[13]  W. Hörl Anaemia management and mortality risk in chronic kidney disease , 2013, Nature Reviews Nephrology.

[14]  D. de Zeeuw,et al.  LDL cholesterol in CKD--to treat or not to treat? , 2013, Kidney international.

[15]  Marcello Tonelli,et al.  Using linked administrative data to study periprocedural mortality in obesity and chronic kidney disease (CKD). , 2013, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[16]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[17]  Data Mining Based On Real World Data In Chronic Kidney Disease Patients Not On Dialysis: The Key Role Of Early Hemoglobin Levels Control. , 2015, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[18]  Adler J. Perotte,et al.  Risk prediction for chronic kidney disease progression using heterogeneous electronic health record data and time series analysis , 2015, J. Am. Medical Informatics Assoc..

[19]  T. Celik,et al.  Relationship of Systolic Blood Pressure and Body Mass Index With Left Ventricular Mass and Mass Index in Adolescents , 2016, Angiology.

[20]  N. Pallet,et al.  Rule-Mining for the Early Prediction of Chronic Kidney Disease Based on Metabolomics and Multi-Source Data , 2016, PloS one.

[21]  P. Aljama,et al.  A New Data Analysis System to Quantify Associations between Biochemical Parameters of Chronic Kidney Disease-Mineral Bone Disease , 2016, PloS one.