Deep learning for prediction of population health costs

Accurate prediction of healthcare costs is important for optimally managing health costs. However, methods leveraging the medical richness from data such as health insurance claims or electronic health records are missing. Here, we developed a deep neural network to predict future cost from health insurance claims records. We applied the deep network and a ridge regression model to a sample of 1.4 million German insurants to predict total one-year health care costs. Both methods were compared to Morbi-RSA models with various performance measures and were also used to predict patients with a change in costs and to identify relevant codes for this prediction. We showed that the neural network outperformed the ridge regression as well as all Morbi-RSA models for cost prediction. Further, the neural network was superior to ridge regression in predicting patients with cost change and identified more specific codes. In summary, we showed that our deep neural network can leverage the full complexity of the patient records and outperforms standard approaches. We suggest that the better performance is due to the ability to incorporate complex interactions in the model and that the model might also be used for predicting other health phenotypes.

[1]  Martine De Cock,et al.  Population Cost Prediction on Public Healthcare Datasets , 2015, Digital Health.

[2]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[3]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[4]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[5]  Santosh S. Vempala,et al.  Algorithmic Prediction of Health-Care Costs , 2008, Oper. Res..

[6]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[7]  D Y Lin,et al.  Methods for analyzing health care utilization and costs. , 1999, Annual review of public health.

[8]  Bibudh Lahiri,et al.  Predicting Healthcare Expenditure Increase for an Individual from Medicare Data , 2014 .

[9]  Ian Duncan,et al.  Testing Alternative Regression Frameworks for Predictive Modeling of Healthcare Costs , 2015 .

[10]  Jeffrey Dean,et al.  Scalable and accurate deep learning with electronic health records , 2018, npj Digital Medicine.

[11]  Jaron J. R. Chong,et al.  Canadian Association of Radiologists White Paper on Artificial Intelligence in Radiology , 2018, Canadian Association of Radiologists journal = Journal l'Association canadienne des radiologistes.

[12]  Ankur Taly,et al.  Axiomatic Attribution for Deep Networks , 2017, ICML.

[13]  F. Andersohn,et al.  Characteristics and external validity of the German Health Risk Institute (HRI) Database , 2016, Pharmacoepidemiology and drug safety.

[14]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[15]  Edward W. Frees,et al.  Actuarial Applications of Multivariate Two-Part Regression Models , 2013, Annals of Actuarial Science.