Research Paper: A Pilot Study of Contextual UMLS Indexing to Improve the Precision of Concept-based Representation in XML-structured Clinical Radiology Reports

OBJECTIVE Despite the advantages of structured data entry, much of the patient record is still stored as unstructured or semistructured narrative text, and representing the content of clinical documents remains problematic. The authors' prior work using an automated UMLS document indexing system has been encouraging but has been limited by the generally low indexing precision of such systems. To improve precision, the authors developed a context-sensitive document indexing model that calculates the optimal subset of UMLS source vocabularies to use when indexing each document section. This pilot study evaluated the utility of this indexing approach on a set of clinical radiology reports.

DESIGN A set of clinical radiology reports that had been indexed manually using UMLS concept descriptors was indexed automatically by the SAPHIRE indexing engine. Using the data generated by this process, the authors developed a system that simulated indexing of the same document set, at the document section level, using many permutations of a subset of the UMLS constituent vocabularies.

MEASUREMENTS The precision and recall scores generated by simulated indexing were determined for each permutation of two or three UMLS constituent vocabularies.

RESULTS While precision and recall values varied considerably across the different subtypes of radiology reports, the overall effect of this indexing strategy, using the best combination of two or three UMLS constituent vocabularies, was an improvement in precision without a significant impact on recall.

CONCLUSION In this pilot study, a contextual indexing strategy improved overall precision in a set of clinical radiology reports.
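
The simulated indexing described under DESIGN and MEASUREMENTS can be illustrated with a minimal sketch. It is not the authors' system: it assumes the automatic indexer's output for one document section is available as (concept, source vocabulary) pairs and the manual index as a set of concepts. The function names, the vocabulary labels "A" to "D", and the concept identifiers are all hypothetical. Because the order of vocabularies does not change which concepts are retained, the sketch enumerates unordered combinations of two or three vocabularies.

```python
from itertools import combinations


def precision_recall(auto, manual):
    """Precision and recall of an automatic concept set against a manual one."""
    true_positives = len(auto & manual)
    precision = true_positives / len(auto) if auto else 0.0
    recall = true_positives / len(manual) if manual else 0.0
    return precision, recall


def simulate_section_indexing(auto_concepts, manual_concepts, sizes=(2, 3)):
    """Score every vocabulary subset of the given sizes for one section.

    auto_concepts: iterable of (concept_id, source_vocabulary) pairs
                   (assumed shape of the automatic indexer's output).
    manual_concepts: set of concept_ids assigned by a human indexer.
    Returns {vocabulary_subset: (precision, recall)}.
    """
    vocabularies = sorted({vocab for _, vocab in auto_concepts})
    results = {}
    for size in sizes:
        for subset in combinations(vocabularies, size):
            allowed = set(subset)
            # Keep only concepts contributed by the allowed vocabularies.
            kept = {c for c, vocab in auto_concepts if vocab in allowed}
            results[subset] = precision_recall(kept, manual_concepts)
    return results


# Hypothetical data for a single report section.
auto = [("C1", "A"), ("C2", "A"), ("C3", "B"), ("C4", "C"), ("C5", "D")]
manual = {"C1", "C3", "C6"}

baseline = precision_recall({c for c, _ in auto}, manual)
scores = simulate_section_indexing(auto, manual)
# Pick the subset with the highest precision, breaking ties on recall.
best_subset = max(scores, key=lambda subset: scores[subset])

print("all vocabularies:", baseline)
print("best subset:", best_subset, scores[best_subset])
```

In this toy section, restricting indexing to the best-scoring pair of vocabularies raises precision while leaving recall unchanged, which is the kind of effect the study measures. Choosing the subset by precision with recall as a tie-breaker is an assumption of the sketch; the abstract does not state the selection criterion.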
