Identifying QT prolongation from ECG impressions using a general-purpose Natural Language Processor

OBJECTIVE QT interval prolongation, typically detected via electrocardiograms (ECGs), is a known risk factor for sudden cardiac death. Because medications can promote or exacerbate the condition, detecting QT interval prolongation is important for clinical decision support. We investigated the accuracy of natural language processing (NLP) for identifying QT prolongation from cardiologist-generated, free-text ECG impressions compared to corrected QT (QTc) thresholds reported by ECG machines.
METHODS After integrating negation detection into a locally developed natural language processor, the KnowledgeMap concept identifier, we evaluated NLP-based detection of QT prolongation against the calculated QTc on a set of 44,318 ECGs obtained from hospitalized patients. We also created a string query using regular expressions to identify QT prolongation. We calculated the sensitivity and specificity of the methods using manual physician review of the cardiologist-generated reports as the gold standard. To investigate causes of "false positive" calculated QTc values, we manually reviewed randomly selected ECGs with a long calculated QTc but no mention of QT prolongation. Separately, before developing the NLP query for QT prolongation, we validated the performance of the negation detection algorithm on 5000 manually categorized ECG phrases covering any medical concept (not limited to QT prolongation).
RESULTS The NLP query for QT prolongation correctly identified 2364 of 2373 ECGs with QT prolongation, for a sensitivity of 0.996 and a positive predictive value of 1.000; there were no false positives. The regular expression query had a sensitivity of 0.999 and a positive predictive value of 0.982. In contrast, common QTc thresholds derived from ECG machines had positive predictive values of 0.07-0.25, with corresponding sensitivities of 0.994-0.046. The negation detection algorithm had a recall of 0.973 and a precision of 0.982 for 10,490 concepts found within ECG impressions.
CONCLUSION NLP and regular expression queries of cardiologists' ECG interpretations can more effectively identify QT prolongation than the automated QTc intervals reported by ECG machines. Future clinical decision support could employ NLP queries to detect QTc prolongation and other reported ECG abnormalities.
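
As an illustration only, the regular expression query and the evaluation metrics described above could be sketched as follows. The patterns, the negation cue list, and the function names are hypothetical assumptions for this sketch; they are not the query or the KnowledgeMap negation algorithm used in the study. Only the counts in the usage lines (2364 of 2373 ECGs identified, no false positives) come from the reported results.

```python
import re

# Hypothetical pattern for mentions of QT prolongation (not the study's query).
QT_PATTERN = re.compile(
    r"\b(?:prolonged|long)\s+qtc?\b"
    r"|\bqtc?\s+(?:interval\s+)?(?:is\s+)?prolong(?:ed|ation)\b",
    re.IGNORECASE,
)

# Hypothetical negation cues; a cue earlier in the same sentence negates the mention.
NEGATION_CUES = re.compile(r"\b(?:no|not|without)\b", re.IGNORECASE)


def mentions_qt_prolongation(impression: str) -> bool:
    """Return True if the text has a QT prolongation mention with no preceding negation cue."""
    for match in QT_PATTERN.finditer(impression):
        preceding = impression[: match.start()]
        # Restrict the negation search to the current sentence or clause.
        sentence_start = max(preceding.rfind("."), preceding.rfind(";")) + 1
        if not NEGATION_CUES.search(preceding[sentence_start:]):
            return True
    return False


def sensitivity_ppv(tp: int, fp: int, fn: int) -> tuple[float, float]:
    """Compute sensitivity (recall) and positive predictive value (precision) from counts."""
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    return sensitivity, ppv


# Usage with the counts reported for the NLP query.
sens, ppv = sensitivity_ppv(tp=2364, fp=0, fn=2373 - 2364)
print(f"sensitivity={sens:.3f}, PPV={ppv:.3f}")
```

This sketch only handles negation cues that precede the mention within the same sentence; phrases such as "QT prolongation no longer present" would need additional rules.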
