Predicting death by suicide using administrative health care system data: Can feedforward neural network models improve upon logistic regression models?

BACKGROUND Suicide is a leading cause of death worldwide. With the increasing volume of administrative health care data, there is an opportunity to evaluate whether machine learning models can improve upon statistical models for quantifying suicide risk. OBJECTIVE To compare the relative performance of logistic regression and single hidden layer feedforward neural network models that quantify suicide risk with predictors available in administrative health care system data. METHODS The modeling dataset contained 3548 persons that died by suicide and 35,480 persons that did not die by suicide between 2000 and 2016. 101 predictors were selected, and these were assembled for each of the 40 quarters (10 years) prior to the quarter of death, resulting in 4040 predictors in total for each person. Logistic regression and single hidden layer feedforward neural network model configurations were evaluated using 10-fold cross-validation. RESULTS The optimal feedforward neural network model configuration (AUC: 0.8352) outperformed logistic regression (AUC: 0.8179). LIMITATIONS Many important predictors are not available in administrative data and this likely places a limit on how well prediction models developed with administrative data can perform. CONCLUSIONS Although the models developed in this study showed promise, further research is needed to determine the performance limits of statistical and machine learning models that quantify suicide risk, and to develop prediction models optimized for implementation in clinical settings.

[1]  Nancy Fugate Woods,et al.  A Global Imperative: Development, Safety, and Health from Girl Child to Woman , 2009, Health care for women international.

[2]  Tim Kendall,et al.  Predicting suicide following self-harm: systematic review of risk factors and risk scales. , 2016, The British journal of psychiatry : the journal of mental science.

[3]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[4]  Jane Pirkis,et al.  Predicting suicidal behaviours using clinical instruments: Systematic review and meta-analysis of positive predictive values for risk scales , 2017, British Journal of Psychiatry.

[5]  K. Saunders,et al.  The sad truth about the SADPERSONS Scale: an evaluation of its clinical utility in self-harm patients , 2013, Emergency Medicine Journal.

[6]  M. Large,et al.  Meta-Analysis of Longitudinal Cohort Studies of Suicide Risk Assessment among Psychiatric Patients: Heterogeneity in Results and Lack of Improvement over Time , 2016, PloS one.

[7]  Evan M. Kleiman,et al.  Self-injurious thoughts and behaviors as risk factors for future suicide ideation, attempts, and death: a meta-analysis of longitudinal studies , 2015, Psychological Medicine.

[8]  W. Leslie,et al.  Predicting suicide with the SAD PERSONS scale , 2017, Depression and anxiety.

[9]  Evan M. Kleiman,et al.  Letter to the Editor: Suicide as a complex classification problem: machine learning and related techniques can advance suicide prediction - a reply to Roaldset (2016) , 2016, Psychological Medicine.

[10]  J. Ribeiro,et al.  Demographics as predictors of suicidal thoughts and behaviors: A meta-analysis , 2017, PloS one.

[11]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[12]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[13]  Svetha Venkatesh,et al.  Predicting Risk of Suicide Attempt Using History of Physical Illnesses From Electronic Medical Records , 2016, JMIR mental health.

[14]  Daniela M. Witten,et al.  An Introduction to Statistical Learning: with Applications in R , 2013 .

[15]  Daniel C. Murrie,et al.  Reformulating Suicide Risk Formulation: From Prediction to Prevention , 2015, Academic Psychiatry.

[16]  Jeremy W Coid,et al.  The futility of risk prediction in psychiatry. , 2016, The British journal of psychiatry : the journal of mental science.

[17]  Wei Luo,et al.  Risk stratification using data from electronic medical records better predicts suicide risks than clinician assessments , 2014, BMC Psychiatry.

[18]  J. Sareen,et al.  Clinician Prediction of Future Suicide Attempts , 2016 .