Using natural language processing to identify problem usage of prescription opioids

BACKGROUND Accurate and scalable surveillance methods are critical to understand widespread problems associated with misuse and abuse of prescription opioids and for implementing effective prevention and control measures. Traditional diagnostic coding incompletely documents problem use. Relevant information for each patient is often obscured in vast amounts of clinical text. OBJECTIVES We developed and evaluated a method that combines natural language processing (NLP) and computer-assisted manual review of clinical notes to identify evidence of problem opioid use in electronic health records (EHRs). METHODS We used the EHR data and text of 22,142 patients receiving chronic opioid therapy (≥70 days' supply of opioids per calendar quarter) during 2006-2012 to develop and evaluate an NLP-based surveillance method and compare it to traditional methods based on International Classification of Disease, Ninth Edition (ICD-9) codes. We developed a 1288-term dictionary for clinician mentions of opioid addiction, abuse, misuse or overuse, and an NLP system to identify these mentions in unstructured text. The system distinguished affirmative mentions from those that were negated or otherwise qualified. We applied this system to 7336,445 electronic chart notes of the 22,142 patients. Trained abstractors using a custom computer-assisted software interface manually reviewed 7751 chart notes (from 3156 patients) selected by the NLP system and classified each note as to whether or not it contained textual evidence of problem opioid use. RESULTS Traditional diagnostic codes for problem opioid use were found for 2240 (10.1%) patients. NLP-assisted manual review identified an additional 728 (3.1%) patients with evidence of clinically diagnosed problem opioid use in clinical notes. Inter-rater reliability among pairs of abstractors reviewing notes was high, with kappa=0.86 and 97% agreement for one pair, and kappa=0.71 and 88% agreement for another pair. CONCLUSIONS Scalable, semi-automated NLP methods can efficiently and accurately identify evidence of problem opioid use in vast amounts of EHR text. Incorporating such methods into surveillance efforts may increase prevalence estimates by as much as one-third relative to traditional methods.

[1]  Scott R. Halgrim,et al.  Using natural language processing to improve efficiency of manual chart abstraction in research: the case of breast cancer recurrence. , 2014, American journal of epidemiology.

[2]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[3]  Matt Schiller,et al.  Societal costs of prescription opioid abuse, dependence, and misuse in the United States. , 2011, Pain medicine.

[4]  James C. Reed Book Reviews : Visual Perceptual Abilities and Early Reading Progress by Jean Turner Goins, Supplementary Educational Monographs, #87, Chicago: University of Chicago Press, 1958, Pp. x + 108 , 1960 .

[5]  R. Chou,et al.  Opioids for chronic noncancer pain: prediction and identification of aberrant drug-related behaviors: a review of the evidence for an American Pain Society and American Academy of Pain Medicine clinical practice guideline. , 2009, The journal of pain : official journal of the American Pain Society.

[6]  Robert V. Tauxe,et al.  Public Health Surveillance: A Tool for Targeting and Monitoring Interventions , 2006 .

[7]  Sarah M. Greene,et al.  Building a virtual cancer research organization. , 2005, Journal of the National Cancer Institute. Monographs.

[8]  Peter L. Elkin,et al.  Comparison of Natural Language Processing Biosurveillance Methods for Identifying Influenza From Encounter Notes , 2012, Annals of Internal Medicine.

[9]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[10]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[11]  Goran Nenadic,et al.  Text mining of cancer-related information: Review of current status and future directions , 2014, Int. J. Medical Informatics.

[12]  Jimeng Sun,et al.  Automatic identification of heart failure diagnostic criteria, using text analysis of clinical notes from electronic health records , 2014, Int. J. Medical Informatics.

[13]  Lucila Ohno-Machado,et al.  Natural language processing: an introduction , 2011, J. Am. Medical Informatics Assoc..

[14]  Judith W. Dexheimer,et al.  Natural Language Processing – The Basics , 2012 .

[15]  P. Buckley,et al.  Risks for possible and probable opioid misuse among recipients of chronic opioid therapy in commercial and medicaid insurance plans: The TROUP Study , 2012 .

[16]  S B Thacker,et al.  Public health surveillance in the United States. , 1988, Epidemiologic reviews.

[17]  Vihang N. Vahia,et al.  Diagnostic and statistical manual of mental disorders 5: A quick glance , 2013, Indian journal of psychiatry.

[18]  Lin Chen,et al.  Importance of multi-modal approaches to effectively identify cataract cases from electronic health records , 2012, J. Am. Medical Informatics Assoc..

[19]  S. McGuire Frayar, D.C., Ervin, R.B. Caloric intake from fast food among adults: United States, 2007-2010. NCHS Data Brief, No. 114, February 2013. Hyattsville, MD: National Center for Health Statistics, 2013. , 2013, Advances in nutrition.

[20]  Paea LePendu,et al.  Pharmacovigilance using Clinical Text , 2013, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[21]  Steven H. Brown,et al.  Automated identification of postoperative complications within an electronic medical record using natural language processing. , 2011, JAMA.

[22]  Nigam H. Shah,et al.  Mining clinical text for signals of adverse drug-drug interactions , 2014, J. Am. Medical Informatics Assoc..

[23]  Joseph A Boscarino,et al.  Risk factors for drug dependence among out-patients on opioid therapy in a large US health-care system. , 2010, Addiction.

[24]  Christopher G Chute,et al.  Invited commentary: Observational research in the age of the electronic health record. , 2014, American journal of epidemiology.

[25]  Nigam H. Shah,et al.  Practice-Based Evidence: Profiling the Safety of Cilostazol by Text-Mining of Clinical Notes , 2013, PloS one.

[26]  Douglas D. Heckathorn,et al.  Respondent-driven sampling : A new approach to the study of hidden populations , 1997 .

[27]  Shuying Shen,et al.  Optimizing A Syndromic Surveillance Text Classifier for Influenza-like Illness: Does Document Source Matter? , 2008, AMIA.

[28]  Kathleen Saunders,et al.  The prevalence of problem opioid use in patients receiving chronic opioid therapy: computer-assisted review of electronic health record clinical notes , 2015, Pain.

[29]  Marlon P Mundt,et al.  Substance use disorders in a primary care sample receiving daily opioid therapy. , 2007, The journal of pain : official journal of the American Pain Society.

[30]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[31]  Peter L. Elkin,et al.  Detection of infectious symptoms from VA emergency department and primary care clinical documentation , 2012, Int. J. Medical Informatics.

[32]  Robert J. Gatchel,et al.  Predicting Opioid Misuse by Chronic Pain Patients: A Systematic Review and Literature Synthesis , 2008, The Clinical journal of pain.

[33]  N. Shah,et al.  Pharmacovigilance Using Clinical Notes , 2013, Clinical pharmacology and therapeutics.

[34]  Judith W. Dexheimer,et al.  Natural Language Processing: Applications in Pediatric Research , 2016 .

[35]  Alan G. White,et al.  Direct Costs of Opioid Abuse in an Insured Population in the United States , 2005, Journal of managed care pharmacy : JMCP.

[36]  Bruce M Psaty,et al.  Use of administrative data to estimate the incidence of statin-related rhabdomyolysis. , 2012, JAMA.

[37]  Jeanmarie Mayer,et al.  Inductive Creation of an Annotation Schema and a Reference Standard for De-identification of VA Electronic Clinical Notes , 2009, AMIA.