Use of Radcube for Extraction of Finding Trends in a Large Radiology Practice

The purpose of our study was to demonstrate the use of Natural Language Processing (Leximer), along with Online Analytic Processing, (NLP-OLAP), for extraction of finding trends in a large radiology practice. Prior studies have validated the Natural Language Processing (NLP) program, Leximer for classifying unstructured radiology reports based on the presence of positive radiology findings (FPOS) and negative radiology findings (FNEG). The FPOS included new relevant radiology findings and any change in status from prior imaging. Electronic radiology reports from 1995–2002 and data from analysis of these reports with NLP-Leximer were saved in a data warehouse and exported to a multidimensional structure called the Radcube. Various relational queries on the data in the Radcube were performed using OLAP technique. Thus, NLP-OLAP was applied to determine trends of FPOS in different radiology exams for different patient and examination attributes. Pivot tables were exported from NLP-OLAP interface to Microsoft Excel for statistical analysis. Radcube allowed rapid and comprehensive analysis of FPOS and FNEG trends in a large radiology report database. Trends of FPOS were extracted for different patient attributes such as age groups, gender, clinical indications, diseases with ICD codes, patient types (inpatient, ambulatory), imaging characteristics such as imaging modalities, referring physicians, radiology subspecialties, and body regions. Data analysis showed substantial differences between FPOS rates for different imaging modalities ranging from 23.1% (mammography, 49,163/212,906) to 85.8% (nuclear medicine, 93,852/109,374; p < 0.0001). In conclusion, NLP-OLAP can help in analysis of yield of different radiology exams from a large radiology report database.

[1]  S. Muluk,et al.  A decade of change in abdominal aortic aneurysm repair in the United States: Have we improved outcomes equally between men and women? , 2006, Journal of vascular surgery.

[2]  George Hripcsak,et al.  Coding Neuroradiology Reports for the Northern Manhattan Stroke Study: A Comparison of Natural Language Processing and Manual Review , 2000, Comput. Biomed. Res..

[3]  George Hripcsak,et al.  Medical text representations for inductive learning , 2000, AMIA.

[4]  Peter J. Haug,et al.  Research Paper: Automatic Detection of Acute Bacterial Pneumonia from Chest X-ray Reports , 2000, J. Am. Medical Informatics Assoc..

[5]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[6]  W. DuMouchel,et al.  Unlocking Clinical Data from Narrative Reports: A Study of Natural Language Processing , 1995, Annals of Internal Medicine.

[7]  Priyanka Gupta,et al.  BioWarehouse: a bioinformatics database warehouse toolkit , 2006, BMC Bioinformatics.

[8]  Mythreyi Bhargavan,et al.  Utilization of radiology services in the United States: levels and trends in modalities, regions, and populations. , 2005, Radiology.

[9]  Tao Xu,et al.  Atlas – a data warehouse for integrative bioinformatics , 2005, BMC Bioinformatics.

[10]  Donald P. Frush,et al.  Pediatric CT: practical approach to diminish the radiation dose , 2002, Pediatric Radiology.

[11]  F. Frizelle,et al.  Colorectal cancer treated at Christchurch Hospital, New Zealand: a comparison of 1993 and 1998 cohorts. , 2005, The New Zealand medical journal.

[12]  Henry J. Lowe,et al.  Selective Automated Indexing of Findings and Diagnoses in Radiology Reports , 2001, J. Biomed. Informatics.

[13]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[14]  James H Thrall,et al.  Application of Recently Developed Computer Algorithm for Automatic Classification of Unstructured Radiology Reports: Validation Study 1 , 2004 .

[15]  Clement J. McDonald,et al.  Automated Extraction and Normalization of Findings from Cancer-Related Free-Text Radiology Reports , 2003, AMIA.

[16]  William E Field,et al.  Analysis of Factors Contributing to 674 Agricultural Driveline-Related Injuries and Fatalities Documented Between 1970 and 2003 , 2005, Journal of agromedicine.

[17]  George Hripcsak,et al.  Research Paper: The Role of Domain Knowledge in Automating Medical Text Report Classification , 2003, J. Am. Medical Informatics Assoc..

[18]  Christoph Wick,et al.  Augmented Reality Simulator for Training in Two-Dimensional Echocardiography , 2000, Comput. Biomed. Res..

[19]  Everett F. Cataldo The House Committee on Ways and Means , 1965 .

[20]  J. Austin,et al.  Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports. , 2002, Radiology.

[21]  E. Bradbury,et al.  Large-scale quantitative proteomic study of PUMA-induced apoptosis using two-dimensional liquid chromatography-mass spectrometry coupled with amino acid-coded mass tagging. , 2004, Journal of proteome research.

[22]  Ramin Khorasani,et al.  Inpatient radiology utilization: trends over the past decade. , 2006, AJR. American journal of roentgenology.

[23]  Chad Creighton,et al.  Mining gene expression databases for association rules , 2003, Bioinform..

[24]  C. Cowan,et al.  Health spending projections through 2016: modest changes obscure part D's impact. , 2007, Health affairs.

[25]  R F Newland,et al.  Electronic data processing: the pathway to automated quality control of cardiopulmonary bypass. , 2006, The journal of extra-corporeal technology.

[26]  J. Lubitz Health, technology, and medical care spending. , 2005, Health affairs.

[27]  N L Jain,et al.  Identification of suspected tuberculosis patients based on natural language processing of chest radiograph reports. , 1996, Proceedings : a conference of the American Medical Informatics Association. AMIA Fall Symposium.

[28]  S. Diederich,et al.  Solitary pulmonary nodule: detection and management , 2006, Cancer imaging : the official publication of the International Cancer Imaging Society.

[29]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[30]  Dan M. Spengler,et al.  Web client and ODBC access to legacy database information: a low cost approach , 1997, AMIA.

[31]  Rajul Patel,et al.  Safety of elective--including "high risk"--percutaneous coronary interventions without on-site cardiac surgery. , 2005, American heart journal.