Latent Patient Profile Modelling and Applications with Mixed-Variate Restricted Boltzmann Machine

Efficient management of chronic diseases is critical in modern health care. We consider diabetes mellitus, and our ongoing goal is to examine how machine learning can deliver information for clinical efficiency. The challenge is to aggregate highly heterogeneous sources including demographics, diagnoses, pathologies and treatments, and extract similar groups so that care plans can be designed. To this end, we extend our recent model, the mixed-variate restricted Boltzmann machine (MV.RBM), as it seamlessly integrates multiple data types for each patient aggregated over time and outputs a homogeneous representation called “latent profile” that can be used for patient clustering, visualisation, disease correlation analysis and prediction. We demonstrate that the method outperforms all baselines on these tasks - the primary characteristics of patients in the same groups are able to be identified and the good result can be achieved for the diagnosis codes prediction.

[1]  Svetha Venkatesh,et al.  Mixed-Variate Restricted Boltzmann Machines , 2014, ACML.

[2]  Geoffrey E. Hinton,et al.  Replicated Softmax: an Undirected Topic Model , 2009, NIPS.

[3]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[4]  David Haussler,et al.  Unsupervised learning of distributions on binary vectors using two layer networks , 1991, NIPS 1991.

[5]  Fei Wang,et al.  SOR: Scalable Orthogonal Regression for Low-Redundancy Feature Selection and its Healthcare Applications , 2012, SDM.

[6]  Shahram Ebadollahi,et al.  Toward personalized care management of patients at risk: the diabetes case study , 2011, KDD.

[7]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[8]  Yu Cao,et al.  An integrated machine learning approach to stroke prediction , 2010, KDD.

[9]  B. Wu,et al.  Copula‐based regression models for a bivariate mixed discrete and continuous outcome , 2011, Statistics in medicine.

[10]  Svetha Venkatesh,et al.  Cumulative Restricted Boltzmann Machines for Ordinal Matrix Data Analysis , 2014, ACML.

[11]  Svetha Venkatesh,et al.  Ordinal Boltzmann Machines for Collaborative Filtering , 2009, UAI.

[12]  Svetha Venkatesh,et al.  Embedded Restricted Boltzmann Machines for fusion of mixed data types and applications in social measurements analysis , 2012, 2012 15th International Conference on Information Fusion.

[13]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[14]  Mingyao Li,et al.  Joint Regression Analysis of Correlated Data Using Gaussian Copulas , 2009, Biometrics.

[15]  D. Dunson,et al.  Bayesian latent variable models for mixed discrete outcomes. , 2005, Biostatistics.

[16]  Geoffrey E. Hinton,et al.  Semantic hashing , 2009, Int. J. Approx. Reason..

[17]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[18]  C. McCulloch Joint modelling of mixed outcome types using latent variables , 2008, Statistical methods in medical research.