Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach

Background Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum.
Objective This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events.
Methods We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words–based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance.
Results The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events, was 0.999, with an F1 score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an F1 score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance.
Conclusions The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data.
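
The two metrics reported above, AUC and F1 score, can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names, the toy labels and scores, and the 0.5 decision threshold are all assumptions made for illustration. AUC is computed here as the probability that a randomly chosen case receives a higher score than a randomly chosen control, and F1 as the harmonic mean of precision and recall at the assumed threshold.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (case, control) pairs ranked correctly.

    Ties between a case score and a control score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def f1_score(labels: Sequence[int], scores: Sequence[float],
             threshold: float = 0.5) -> float:
    """F1 at a fixed decision threshold (0.5 is an assumed default)."""
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: 1 = self-harm case, 0 = control.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1, 0.05]

if __name__ == "__main__":
    print(f"AUC = {roc_auc(toy_labels, toy_scores):.3f}")
    print(f"F1  = {f1_score(toy_labels, toy_scores):.3f}")
```

AUC depends only on how the scores rank cases against controls, whereas F1 depends on the chosen threshold, so a model can show a larger gap between the two than the AUC alone would suggest.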
