Feedback-Driven Radiology Exam Report Retrieval with Semantics

Clinical documents are vital resources for radiologists to have a better understanding of patient history. The use of clinical documents can complement the often brief reasons for exams that are provided by physicians in order to perform more informed diagnoses. With the large number of study exams that radiologists have to perform on a daily basis, it becomes too time-consuming for radiologists to sift through each patient's clinical documents. It is therefore important to provide a capability that can present contextually relevant clinical documents, and at the same time satisfy the diverse information needs among radiologists from different specialties. In this work, we propose a knowledge-based semantic similarity approach that uses domain-specific relationships such as part-of along with taxonomic relationships such as is-a to identify relevant radiology exam records. Our approach also incorporates explicit relevance feedback to personalize radiologists information needs. We evaluated our approach on a corpus of 6,265 radiology exam reports through study sessions with radiologists and demonstrated that the retrieval performance of our approach yields an improvement of 5% over the baseline. We further performed intra-class and inter-class similarities using a subset of 2,384 reports spanning across 10 exam codes. Our result shows that intra-class similarities are always higher than the inter-class similarities and our approach was able to obtain 6% percent improvement in intra-class similarities against the baseline. Our results suggest that the use of domain-specific relationships together with relevance feedback provides a significant value to improve the accuracy of the retrieval of radiology exam reports.

[1]  Paul G Nagy,et al.  The future of the radiology information system. , 2013, AJR. American journal of roentgenology.

[2]  Alexander F. Gelbukh,et al.  Advanced Relevance Feedback Query Expansion Strategy for Information Retrieval in MEDLINE , 2004, CIARP.

[3]  Jonathan Weese,et al.  UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems , 2013, *SEMEVAL.

[4]  Noémie Elhadad,et al.  A hybrid knowledge-based and data-driven approach to identifying semantically similar concepts , 2012, J. Biomed. Informatics.

[5]  David Sánchez,et al.  An ontology-based measure to compute semantic similarity in biomedicine , 2011, J. Biomed. Informatics.

[6]  Carolyn R. Watters,et al.  Extending the Rocchio Relevance Feedback Algorithm to Provide Contextual Retrieval , 2004, AWIC.

[7]  Daniel L. Rubin,et al.  Creating and Curating a Terminology for Radiology: Ontology Modeling and Analysis , 2008, Journal of Digital Imaging.

[8]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[9]  Fei Wang,et al.  Medical prognosis based on patient similarity and expert feedback , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[10]  Yuval Shahar,et al.  Vaidurya: A multiple-ontology, concept-based, context-sensitive clinical-guideline search engine , 2009, J. Biomed. Informatics.

[11]  Thusitha De Silva Mabotuwana,et al.  An ontology-based similarity measure for biomedical data - Application to radiology reports , 2013, J. Biomed. Informatics.

[12]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[13]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[14]  Michael K. Ng,et al.  A Comparative Study of Ontology Based Term Similarity Measures on PubMed Document Clustering , 2007, DASFAA.

[15]  Martin Chodorow,et al.  Combining local context and wordnet similarity for word sense identification , 1998 .

[16]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[17]  Thorsten Joachims,et al.  A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization , 1997, ICML.

[18]  Carlo Strapparava,et al.  Corpus-based and Knowledge-based Measures of Text Semantic Similarity , 2006, AAAI.

[19]  Ted Pedersen,et al.  Measures of semantic similarity and relatedness in the biomedical domain , 2007, J. Biomed. Informatics.

[20]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[21]  Vagelis Hristidis,et al.  Ontology-Aware Search on XML-based Electronic Medical Records , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[22]  L. Dekang,et al.  Extracting collocations from text corpora , 1998 .

[23]  Ted Pedersen,et al.  UMLS-Interface and UMLS-Similarity : Open Source Software for Measuring Paths and Semantic Similarity , 2009, AMIA.

[24]  Xiangji Huang,et al.  York University at TREC 2011: Medical Records Track , 2011, TREC.

[25]  Mobyen Uddin Ahmed,et al.  Similarity of Medical Cases in Health Care Using Cosine Similarity and Ontology , 2007 .

[26]  Emine Yilmaz,et al.  A geometric interpretation and analysis of R-precision , 2005, CIKM '05.

[27]  Eric Brill,et al.  Improving web search ranking by incorporating user behavior information , 2006, SIGIR.

[28]  Samuel Fernando,et al.  A Semantic Similarity Approach to Paraphrase Detection , 2008 .

[29]  George Hripcsak,et al.  Inter-patient distance metrics using SNOMED CT defining relationships , 2006, J. Biomed. Informatics.