Data Mining in Healthcare – A Review

Abstract The knowledge discovery in database (KDD) is alarmed with development of methods and techniques for making use of data. One of the most important step of the KDD is the data mining. Data mining is the process of pattern discovery and extraction where huge amount of data is involved. Both the data mining and healthcare industry have emerged some of reliable early detection systems and other various healthcare related systems from the clinical and diagnosis data. In regard to this emerge, we have reviewed the various paper involved in this field in terms of method, algorithms and results. This review paper has consolidated the papers reviewed inline to the disciplines, model, tasks and methods. Results and evaluation methods are discussed for selected papers and a summary of the finding is presented to conclude the paper.

[1]  Kung-Min Wang,et al.  An improved survivability prognosis of breast cancer by using sampling and feature selection technique to solve imbalanced patient classification data , 2013, BMC Medical Informatics and Decision Making.

[2]  Daniel J. Licht,et al.  Prediction of periventricular leukomalacia. Part I: Selection of hemodynamic features using logistic regression and decision tree algorithms , 2009, Artif. Intell. Medicine.

[3]  Hari Om,et al.  Data mining models for predicting oral cancer survivability , 2013, Network Modeling Analysis in Health Informatics and Bioinformatics.

[4]  Longbing Cao,et al.  SVDD-based outlier detection on uncertain data , 2012, Knowledge and Information Systems.

[5]  Concha Bielza,et al.  Unveiling relevant non-motor Parkinson's disease severity symptoms using a machine learning approach , 2013, Artif. Intell. Medicine.

[6]  Jianqiang Li,et al.  Emerging information technologies for enhanced healthcare , 2015, Comput. Ind..

[7]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[8]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[9]  Bernard C. Jiang,et al.  Application of classification techniques on development an early-warning system for chronic illnesses , 2012, Expert Syst. Appl..

[10]  Chun-hung Li,et al.  L0-Constrained Regression for Data Mining , 2007, PAKDD.

[11]  Chi-Ren Shyu,et al.  Identifying key variables in African American adherence to colorectal cancer screening: the application of data mining , 2014, BMC Public Health.

[12]  Sang Won Yoon,et al.  Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms , 2014, Expert Syst. Appl..

[13]  Pa-Chun Wang,et al.  Data Mining Techniques for Assisting the Diagnosis of Pressure Ulcer Development in Surgical Patients , 2012, Journal of Medical Systems.

[14]  Kemal Polat,et al.  A new hybrid method based on fuzzy-artificial immune system and k-nn algorithm for breast cancer diagnosis , 2007, Comput. Biol. Medicine.

[15]  Kuhu Pal,et al.  Breast cancer detection using rank nearest neighbor classification rules , 2003, Pattern Recognit..

[16]  Filipe Portela,et al.  A Clustering Approach for Predicting Readmissions in Intensive Medicine , 2014 .

[17]  Ayaz Isazadeh,et al.  Data-mining by probability-based patterns , 2008, ITI 2008 - 30th International Conference on Information Technology Interfaces.

[18]  Pedro Abreu,et al.  Missing data imputation on the 5-year survival prediction of breast cancer patients with unknown discrete values , 2015, Comput. Biol. Medicine.

[19]  Sheng-wei Fei,et al.  Diagnostic study on arrhythmia cordis based on particle swarm optimization-based support vector machine , 2010, Expert Syst. Appl..

[20]  David L. Buckeridge,et al.  Towards probabilistic decision support in public health practice: Predicting recent transmission of tuberculosis from patient attributes , 2015, J. Biomed. Informatics.

[21]  Wei-Chang Yeh,et al.  A new hybrid approach for mining breast cancer pattern using discrete particle swarm optimization and statistical method , 2009, Expert Syst. Appl..

[22]  Sungzoon Cho,et al.  An efficient and effective ensemble of support vector machines for anti-diabetic drug failure prediction , 2015, Expert Syst. Appl..

[23]  Tzung-Pei Hong,et al.  Fuzzy data mining for interesting generalized association rules , 2003, Fuzzy Sets Syst..

[24]  Davar Giveki,et al.  Automatic detection of erythemato-squamous diseases using PSO-SVM based on association rules , 2013, Eng. Appl. Artif. Intell..

[25]  Amir Hassan Zadeh,et al.  Predicting overall survivability in comorbidity of cancers: A data mining approach , 2015, Decis. Support Syst..

[26]  Thomas W. Ferratt,et al.  The use of computer-based information systems by German managers to support decision making , 2004, Inf. Manag..

[27]  Shu-Hsien Liao,et al.  Data mining techniques and applications - A decade review from 2000 to 2011 , 2012, Expert Syst. Appl..

[28]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[29]  David Madigan,et al.  Large-Scale Bayesian Logistic Regression for Text Categorization , 2007, Technometrics.

[30]  Gediminas Adomavicius,et al.  Data mining for censored time-to-event data: a Bayesian network model for predicting cardiovascular risk from electronic health record data , 2014, Data Mining and Knowledge Discovery.