Design and Development of a Multimodal Biomedical Information Retrieval System

The search for relevant and actionable information is a key to achieving clinical and research goals in biomedicine. Biomedical information exists in different forms: as text and illustrations in journal articles and other documents, in images stored in databases, and as patients’ cases in electronic health records. This paper presents ways to move beyond conventional text-based searching of these resources, by combining text and visual features in search queries and document representation. A combination of techniques and tools from the fields of natural language processing, information retrieval, and content-based image retrieval allows the development of building blocks for advanced information services. Such services enable searching by textual as well as visual queries, and retrieving documents enriched by relevant images, charts, and other illustrations from the journal literature, patient records and image databases.

[1]  Marti A. Hearst,et al.  Full Text and Figure Display Improves Bioscience Literature Search , 2010, PloS one.

[2]  L. Rodney Long,et al.  Multi-modal Query Expansion Based on Local Analysis for Medical Image Retrieval , 2009, MCBR-CDS.

[3]  Thomas Martin Deserno,et al.  Geographically Distributed Complementary Content-Based Image Retrieval Systems for Biomedical Image Informatics , 2007, MedInfo.

[4]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[5]  Jung-jae Kim,et al.  Automatic Suggestion for PubMed Query Reformulation , 2012, J. Comput. Sci. Eng..

[6]  Shih-Fu Chang,et al.  Overview of the MPEG-7 standard , 2001, IEEE Trans. Circuits Syst. Video Technol..

[7]  Michael Krauthammer,et al.  Yale Image Finder (YIF): a new search engine for retrieving biomedical images , 2008, Bioinform..

[8]  Mathias Lux,et al.  Lire: lucene image retrieval: an extensible java CBIR library , 2008, ACM Multimedia.

[9]  Tefko Saracevic,et al.  Evaluation of evaluation in information retrieval , 1995, SIGIR '95.

[10]  Dina Demner-Fushman,et al.  Application of Information Technology: Essie: A Concept-based Search Engine for Structured Biomedical Text , 2007, J. Am. Medical Informatics Assoc..

[11]  Dina Demner-Fushman,et al.  Evaluating the Importance of Image-related Text for Ad-hoc and Case-based Biomedical Article Retrieval. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[12]  Yiannis S. Boutalis,et al.  CEDD: Color and Edge Directivity Descriptor: A Compact Descriptor for Image Indexing and Retrieval , 2008, ICVS.

[13]  Venu Govindaraju,et al.  Detecting Figure-Panel Labels in Medical Journal Articles Using MRF , 2011, 2011 International Conference on Document Analysis and Recognition.

[14]  Dina Demner-Fushman,et al.  Towards Automatic Image Region Annotation - Image Region Textual Coreference Resolution , 2009, HLT-NAACL.

[15]  Dina Demner-Fushman,et al.  UMLS content views appropriate for NLP processing of the biomedical literature vs. clinical text , 2010, J. Biomed. Informatics.

[16]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[17]  George R. Thoma,et al.  Automatically identifying health outcome information in MEDLINE records. , 2006, Journal of the American Medical Informatics Association : JAMIA.

[18]  Preslav Nakov,et al.  BioText Search Engine: beyond abstract search , 2007, Bioinform..

[19]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  D. Lindberg,et al.  Unified Medical Language System , 2020, Definitions.

[21]  Venu Govindaraju,et al.  Biomedical article retrieval using multimodal features and image annotations in region-based CBIR , 2010, Electronic Imaging.

[22]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[23]  Amanda Spink,et al.  How are we searching the World Wide Web? A comparison of nine search engine transaction logs , 2006, Inf. Process. Manag..