Leveraging Contextual Relatedness to Identify Suicide Documentation in Clinical Notes through Zero Shot Learning

 Abstract — Objectives: Identifying suicidality including suicidal ideation, attempts, and risk factors in electronic health record data in clinical notes is difficult. A major difficulty is the lack of training samples given the small number of true positive instances among the increasingly large number of patients being screened. This paper describes a novel methodology that identifies suicidality in clinical notes by addressing this data sparsity issue through zero-shot learning. Materials and Methods: U.S. Veterans Affairs clinical notes served as data. The training dataset label was determined using diagnostic codes of suicide attempt and self-harm. A base string associated with the target label of suicidality was used to provide auxiliary information by narrowing the positive training cases to those containing the base string. A deep neural network was trained by mapping the training documents’ contents to a semantic space. For comparison, we trained another deep neural network using the identical training dataset labels and bag-of-words features. Results: The zero shot learning model outperformed the baseline model in terms of AUC, sensitivity, specificity, and positive predictive value at multiple probability thresholds. In applying a 0.90 probability threshold, the methodology identified notes not associated with a relevant ICD-10-CM code that documented suicidality, with 94% accuracy. Conclusion: This new method can effectively identify suicidality without requiring manual annotation.

[1]  Senait Gebremichael Tesfagergish,et al.  Zero-Shot Emotion Detection for Semi-Supervised Sentiment Analysis Using Sentence Transformers and Ensemble Learning , 2022, Applied Sciences.

[2]  T. McCoy,et al.  Natural Language Processing of Admission Notes to Predict Severe Maternal Morbidity during the Delivery Encounter. , 2022, American Journal of Obstetrics and Gynecology.

[3]  Yanshan Wang,et al.  HealthPrompt: A Zero-shot Learning Paradigm for Clinical Natural Language Processing , 2022, AMIA.

[4]  K. Verspoor,et al.  Detection of self-harm and suicidal ideation in emergency department triage notes , 2021, J. Am. Medical Informatics Assoc..

[5]  Jyotishman Pathak,et al.  Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation. , 2021, Journal of psychiatric research.

[6]  Colin G. Walsh,et al.  Natural language processing and machine learning of electronic health records for prediction of first-time suicide attempts , 2021, JAMIA open.

[7]  M. Voracek,et al.  Suicide mortality in the United States following the suicides of Kate Spade and Anthony Bourdain , 2020, The Australian and New Zealand journal of psychiatry.

[8]  C. Brandt,et al.  A Prototype Application to Identify LGBT Patients in Clinical Notes , 2020, 2020 IEEE International Conference on Big Data (Big Data).

[9]  Q. Zeng-Treitler,et al.  Clinical Sublanguage Trend and Usage Analysis from a Large Clinical Corpus , 2020, 2020 IEEE International Conference on Big Data (Big Data).

[10]  D. Carroll,et al.  Addressing Suicide in the Veteran Population: Engaging a Public Health Approach , 2020, Frontiers in Psychiatry.

[11]  M. Abdar,et al.  A Review of Generalized Zero-Shot Learning Methods , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Jinan Gu,et al.  Research progress of zero-shot learning , 2020, Applied Intelligence.

[13]  Jihad S Obeid,et al.  Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach , 2020, JMIR medical informatics.

[14]  J. Gui,et al.  Natural language processing of clinical mental health notes may add predictive value to existing suicide risk models , 2020, Psychological Medicine.

[15]  Guy Divita,et al.  Discovering Sublanguages in a Large Clinical Corpus through Unsupervised Machine Learning and Information Gain , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[16]  Benjamin Lê Cook,et al.  Identification of suicidal behavior among psychiatrically hospitalized adolescents using natural language processing and machine learning of electronic health records , 2019, PloS one.

[17]  T. Takiguchi,et al.  Semantic embeddings of generic objects for zero-shot learning , 2019, EURASIP J. Image Video Process..

[18]  Hyoun-Joong Kong,et al.  Managing Unstructured Big Data in Healthcare System , 2019, Healthcare informatics research.

[19]  Yaoyun Zhang,et al.  Psychiatric stressor recognition from clinical notes to reveal association with suicide , 2018, Health Informatics J..

[20]  Tianxi Cai,et al.  Screening pregnant women for suicidal behavior in electronic medical records: diagnostic codes vs. clinical notes processed by natural language processing , 2018, BMC Medical Informatics and Decision Making.

[21]  S. Velupillai,et al.  Identifying Suicide Ideation and Suicidal Attempts in a Psychiatric Clinical Research Database using Natural Language Processing , 2018, Scientific Reports.

[22]  S. Mérelle,et al.  Characteristics Associated with Non-Disclosure of Suicidal Ideation in Adults , 2018, International journal of environmental research and public health.

[23]  H. Hedegaard,et al.  Issues in Developing a Surveillance Case Definition for Nonfatal Suicide Attempt and Intentional Self-harm Using International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM) Coded Data. , 2018, National health statistics reports.

[24]  Michele Filannino,et al.  A natural language processing challenge for clinical records: Research Domains Criteria (RDoC) for psychiatry. , 2017, Journal of biomedical informatics.

[25]  P. A. Bradley,et al.  Developing a practical suicide risk prediction model for targeting high‐risk patients in the Veterans health Administration , 2017, International journal of methods in psychiatric research.

[26]  Christopher R. Ratto,et al.  Feature Selection Methods for Zero-Shot Learning of Neural Activity , 2017, Front. Neuroinform..

[27]  Martin Wattenberg,et al.  Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation , 2016, TACL.

[28]  Enrique Baca-García,et al.  Novel Use of Natural Language Processing (NLP) to Predict Suicidal Ideation and Psychiatric Symptoms in a Text-Based Mental Health Intervention in Madrid , 2016, Comput. Math. Methods Medicine.

[29]  J. Bostwick,et al.  Suicide Attempt as a Risk Factor for Completed Suicide: Even More Lethal Than We Knew. , 2016, The American journal of psychiatry.

[30]  Rosa L. Figueroa,et al.  Extracting Information from Electronic Medical Records to Identify the Obesity Status of a Patient Based on Comorbidities and Bodyweight Measures , 2016, Journal of Medical Systems.

[31]  Robert Bossarte,et al.  Does Suicidal Ideation as Measured by the PHQ-9 Predict Suicide Among VA Patients? , 2016, Psychiatric services.

[32]  I. Katz,et al.  Predictive Modeling and Concentration of the Risk of Suicide: Implications for Preventive Interventions in the US Department of Veterans Affairs. , 2015, American journal of public health.

[33]  Philip H. S. Torr,et al.  An embarrassingly simple approach to zero-shot learning , 2015, ICML.

[34]  C. Henning‐Smith,et al.  Disparities in Health and Disability Among Older Adults in Same-Sex Cohabiting Relationships , 2015, Journal of aging and health.

[35]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[36]  Georgiana Dinu,et al.  Improving zero-shot learning by mitigating the hubness problem , 2014, ICLR.

[37]  Dilek Z. Hakkani-Tür,et al.  Zero-Shot Learning for Semantic Utterance Classification , 2013, ICLR 2014.

[38]  Keith Hawton,et al.  Risk factors for suicide in individuals with depression: a systematic review. , 2013, Journal of affective disorders.

[39]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[40]  I. Katz Lessons learned from mental health enhancement and suicide prevention activities in the Veterans Health Administration. , 2012, American journal of public health.

[41]  A. Berman Estimating the population of survivors of suicide: seeking an evidence base. , 2011, Suicide & life-threatening behavior.

[42]  Christoph H. Lampert,et al.  Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Yoshua Bengio,et al.  Zero-data Learning of New Tasks , 2008, AAAI.

[44]  Jacqueline A. Moss,et al.  An Analysis of Narrative Nursing Documentation in an Otherwise Structured Intensive Care Clinical Information System , 2007, AMIA.

[45]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[46]  S. Stack,et al.  Suicide: a 15-year review of the sociological literature. Part I: cultural and economic factors. , 2000, Suicide & life-threatening behavior.

[47]  R. Hu Zero-Shot Image Classification Guided by Natural Language Descriptions of Classes : A Meta-Learning Approach , 2019 .

[48]  A. Haas,et al.  The American Foundation for Suicide Prevention , 2009 .

[49]  E. Chiocca Suicidal Ideation , 2020, Definitions.