Bayesian Models for Healthcare Data Analysis

The rapid increasing amount of healthcare data poses great challenges to data mining and machine learning study and applications. Recently a large number of algorithms and models have been proposed to discover knowledge and information from large scale healthcare datasets. In medical applications, confidence measured by posterior probability is well accepted since it can quantify the certainty or severity of targets. In this article, we propose a sparse Bayesian model for healthcare data analysis. The proposed model utilizes a set of basic functions and it learns a sparse weight vector to combine them together. Our model is a fully Bayesian method which can incoporate a prior and derive a likelihood function from a given training data set. Working with the images of Pulmonary Embolism diagnosis dataset and Breast Cancer clinical dataset from KDDCup, our experiments demonstrate that the Bayesian approach lead to 83% and 80% test accuracy in modeling principles of healthcare data and it significantly improves the performance of its couterparts.

[1]  Gediminas Adomavicius,et al.  Data mining for censored time-to-event data: a Bayesian network model for predicting cardiovascular risk from electronic health record data , 2014, Data Mining and Knowledge Discovery.

[2]  A. Govardhan,et al.  Data Mining Issues and Challenges in Healthcare Domain , 2014 .

[3]  Divya Tomar,et al.  A survey on Data Mining approaches for Healthcare , 2013, BSBT 2013.

[4]  Peter J. F. Lucas,et al.  Multilevel Bayesian networks for the analysis of hierarchical health care data , 2013, Artif. Intell. Medicine.

[5]  Bhavani M. Thuraisingham,et al.  Sparse Bayesian Adversarial Learning Using Relevance Vector Machine Ensembles , 2012, 2012 IEEE 12th International Conference on Data Mining.

[6]  Xiaohua Hu,et al.  A Bayesian-based prediction model for personalized medical health care , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine.

[7]  Yanqing Niu,et al.  A Bayesian regression approach to the prediction of MHC-II binding affinity , 2008, Comput. Methods Programs Biomed..

[8]  Sellappan Palaniappan,et al.  Intelligent heart disease prediction system using data mining techniques , 2008, 2008 IEEE/ACS International Conference on Computer Systems and Applications.

[9]  E. El-Darzi,et al.  Healthcare Data Mining: Prediction Inpatient Length of Stay , 2006, 2006 3rd International IEEE Conference Intelligent Systems.

[10]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[11]  Peng Liu,et al.  An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset , 2005, ADMA.

[12]  Peter J.F. Lucas Bayesian analysis, pattern analysis, and data mining in health care , 2004, Current opinion in critical care.

[13]  Wei Tang,et al.  Selective Ensemble of Decision Trees , 2003, RSFDGrC.

[14]  Shamsher Bahadur Patel,et al.  A Literature Review in Health Informatics Using Data Mining Techniques , 2014 .

[15]  Davar Giveki,et al.  Automatic detection of erythemato-squamous diseases using PSO-SVM based on association rules , 2013, Eng. Appl. Artif. Intell..

[16]  Ruben D. Canlas Data Mining in Healthcare : Current Applications and Issues By , 2010 .

[17]  Dan Zhu,et al.  Applications of Data Mining in the Healthcare Industry , 2008 .

[18]  Greg Rogers,et al.  MINING YOUR DATA FOR HEALTH CARE QUALITY IMPROVEMENT , 1997 .