Ascertaining Depression Severity by Extracting Patient Health Questionnaire-9 (PHQ-9) Scores from Clinical Notes

The Patient Health Questionnaire-9 (PHQ-9) is a validated instrument for assessing depression severity. While some electronic health record (EHR) systems capture PHQ-9 scores in a structured format, unstructured clinical notes remain the only source in many settings, which presents data retrieval challenges for research and clinical decision support. To address this gap, we extended the open-source Leo natural language processing (NLP) platform to extract PHQ-9 scores from clinical notes and evaluated performance using EHR data for n=123,703 patients who were prescribed antidepressants. Compared to a reference standard, the NLP method exhibited high accuracy (97%), sensitivity (98%), precision (97%), and F-score (97%). Furthermore, of patients with PHQ-9 scores identified by the NLP method, 31% (n=498) had at least one PHQ-9 score clinically indicative of major depressive disorder (MDD), but lacked a structured ICD-9/10 diagnosis code for MDD. This NLP technique may facilitate accurate identification and stratification of patients with depression.

[1]  Stephen B. Johnson,et al.  From Sour Grapes to Low-Hanging Fruit: A Case Study Demonstrating a Practical Strategy for Natural Language Processing Portability , 2018, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[2]  A V Gundlapalli,et al.  Development and Validation of a Natural Language Processing Tool to Identify Patients Treated for Pneumonia across VA Emergency Departments , 2018, Applied Clinical Informatics.

[3]  Sherri Rose,et al.  Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings , 2017, J. Am. Medical Informatics Assoc..

[4]  Scott L. DuVall,et al.  Unlocking echocardiogram measurements for heart disease research through natural language processing , 2017, BMC Cardiovascular Disorders.

[5]  S. Grover,et al.  Clinical Practice Guidelines for the management of Depression , 2017, Indian journal of psychiatry.

[6]  Stephen B. Johnson,et al.  Secondary Use of Patients' Electronic Records (SUPER): An Approach for Meeting Specific Data Needs of Clinical and Translational Researchers , 2017, AMIA.

[7]  Mark Olfson,et al.  Treatment of Adult Depression in the United States. , 2016, JAMA internal medicine.

[8]  Adi V. Gundlapalli,et al.  v3NLP Framework: Tools to Build Applications for Extracting Concepts from Clinical Text , 2016, EGEMS.

[9]  Rachel L. Richesson,et al.  A Framework to Support the Sharing and Reuse of Computable Phenotype Definitions Across Health Care Delivery and Clinical Research Applications , 2016, EGEMS.

[10]  Joshua C. Denny,et al.  Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance , 2016, J. Am. Medical Informatics Assoc..

[11]  Paul A. Harris,et al.  Desiderata for computable representations of electronic health records-driven phenotype algorithms , 2015, J. Am. Medical Informatics Assoc..

[12]  I. Kohane,et al.  Development of phenotype algorithms using electronic medical records and incorporating natural language processing , 2015, BMJ : British Medical Journal.

[13]  Laura Manea,et al.  A diagnostic meta-analysis of the Patient Health Questionnaire-9 (PHQ-9) algorithm scoring method as a screen for depression. , 2015, General hospital psychiatry.

[14]  Shamkant B. Navathe,et al.  Identifying Patients with Depression Using Free-text Clinical Documents , 2015, MedInfo.

[15]  K. E. Ravikumar,et al.  Automated chart review for asthma cohort identification using natural language processing: an exploratory study. , 2013, Annals of allergy, asthma & immunology : official publication of the American College of Allergy, Asthma, & Immunology.

[16]  G. Freedman,et al.  Burden of Depressive Disorders by Country, Sex, Age, and Year: Findings from the Global Burden of Disease Study 2010 , 2013, PLoS medicine.

[17]  Sue E Bowman Impact of electronic health record systems on information integrity: quality and safety implications. , 2013, Perspectives in health information management.

[18]  Shuying Shen,et al.  Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure , 2012, J. Am. Medical Informatics Assoc..

[19]  Hongfang Liu,et al.  Clinical decision support with automated text processing for cervical cancer screening , 2012, J. Am. Medical Informatics Assoc..

[20]  Joanne Rodda,et al.  Depression in older adults , 2011, BMJ : British Medical Journal.

[21]  M. Fava,et al.  Using electronic medical records to enable large-scale studies in psychiatry: treatment resistant depression as a model , 2011, Psychological Medicine.

[22]  Christopher G Chute,et al.  Analyzing the heterogeneity and complexity of Electronic Health Record oriented phenotyping algorithms. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[23]  Hua Xu,et al.  Extracting timing and status descriptors for colonoscopy testing from electronic medical records , 2010, J. Am. Medical Informatics Assoc..

[24]  Hans-Ulrich Wittchen,et al.  Mental disorders in primary care , 2003, Dialogues in clinical neuroscience.

[25]  Derek D. Reed,et al.  Major depression. , 1998, Managed care interface.