Multimodal biomedical image retrieval using hierarchical classification and modality fusion

Images are frequently used in articles to convey essential information in context with correlated text. However, searching images in a task-specific way poses significant challenges. To minimize limitations of low-level feature representations in content-based image retrieval (CBIR), and to complement text-based search, we propose a multi-modal image search approach that exploits hierarchical organization of modalities and employs both intra and inter-modality fusion techniques. For the CBIR search, several visual features were extracted to represent the images. Modality-specific information was used for similarity fusion and selection of a relevant image subset. Intra-modality fusion of retrieval results was performed by searching images for specific informational elements. Our methods use text extracted from relevant components in a document to create structured representations as “enriched citations” for the text-based search approach. Finally, the multi-modal search consists of a weighted linear combination of similarity scores of independent output results from textual and visual search approaches (inter modality). Search results were evaluated using a standard ImageCLEFmed 2012 evaluation dataset of 300,000 images with associated annotations. We achieved a mean average precision (MAP) score of 0.2533, which is statistically significant, and better in performance (7 % improvement) over comparable results in ImageCLEFmed 2012.

[1]  Henning Müller,et al.  Overview of the CLEF 2009 Medical Image Retrieval Track , 2009, CLEF.

[2]  Shih-Fu Chang,et al.  Overview of the MPEG-7 standard , 2001, IEEE Trans. Circuits Syst. Video Technol..

[3]  Michael Krauthammer,et al.  Yale Image Finder (YIF): a new search engine for retrieving biomedical images , 2008, Bioinform..

[4]  George R. Thoma,et al.  Annotation and retrieval of clinically relevant images , 2009, Int. J. Medical Informatics.

[5]  George R. Thoma,et al.  Text- and Content-based Approaches to Image Modality Detection and Retrieval for the ImageCLEF 2010 Medical Retrieval Track , 2010, CLEF.

[6]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[7]  Carla E. Brodley,et al.  ASSERT: A Physician-in-the-Loop Content-Based Retrieval System for HRCT Image Databases , 1999, Comput. Vis. Image Underst..

[8]  George R. Thoma,et al.  Text and Content-based Approaches to Image Modality Classification and Retrieval for the ImageCLEF 2011 Medical Retrieval Track , 2011, CLEF.

[9]  Henning Müller,et al.  Log analysis to understand medical professionals' image searching behaviour. , 2012, Studies in health technology and informatics.

[10]  Nawei Chen,et al.  A Survey of Indexing and Retrieval of Multimodal Documents: Text and Images , 2006 .

[11]  Hermann Ney,et al.  Automatic categorization of medical images for content-based retrieval and data mining. , 2005, Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society.

[12]  Ami N. Rubinowitz,et al.  Recognizing lung disease in patients with rheumatoid arthritis, part 2 , 2008 .

[13]  Ellen M Voorhes,et al.  IT: The Thirteenth Text Retrieval Conference, TREC 2004 , 2005 .

[14]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[16]  Sameer Antani,et al.  Creating a classification of image types in the medical literature for visual categorization , 2012, Other Conferences.

[17]  David A. Hanauer,et al.  Enhanced identification of eligibility for depression research using an electronic medical record search engine , 2009, Int. J. Medical Informatics.

[18]  Preslav Nakov,et al.  BioText Search Engine: beyond abstract search , 2007, Bioinform..

[19]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[20]  Daekeun You,et al.  Image retrieval from scientific publications: Text and image content processing to separate multipanel figures , 2013, J. Assoc. Inf. Sci. Technol..

[21]  Carla E. Brodley,et al.  ASSERT: A PHYSICIAN-IN-THE-LOOP CONTENT-BASED IMAGE RETRIEVAL SYSTEM FOR HRCT IMAGE DATABASES , 1999 .

[22]  Henning Müller,et al.  Overview of the ImageCLEF 2012 Medical Image Retrieval and Classification Tasks , 2012, CLEF.

[23]  Mathias Lux,et al.  Lire: lucene image retrieval: an extensible java CBIR library , 2008, ACM Multimedia.

[24]  Peter Ingwersen,et al.  Cognitive Perspectives of Information Retrieval Interaction: Elements of a Cognitive IR Theory , 1996, J. Documentation.

[25]  Henning Müller,et al.  Overview of the CLEF 2011 Medical Image Classification and Retrieval Tasks , 2011, CLEF.

[26]  George R. Thoma,et al.  Design and Development of a Multimodal Biomedical Information Retrieval System , 2012, J. Comput. Sci. Eng..

[27]  Daekeun You,et al.  ITI's Participation in the ImageCLEF 2012 Medical Retrieval and Classification Tasks , 2012, CLEF.

[28]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[29]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[30]  L. Rodney Long,et al.  SPIRS: A Web-based image retrieval system for large biomedical databases , 2009, Int. J. Medical Informatics.

[31]  Topi Mäenpää,et al.  The local binary pattern approach to texture analysis - extensions and applications , 2003 .

[32]  Claudio Gutierrez,et al.  Survey of graph database models , 2008, CSUR.

[33]  Sameer Antani,et al.  Text- and content-based biomedical image modality classification , 2013, Medical Imaging.

[34]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[35]  George R. Thoma,et al.  A Learning-Based Similarity Fusion and Filtering Approach for Biomedical Image Retrieval Using SVM Classification and Relevance Feedback , 2011, IEEE Transactions on Information Technology in Biomedicine.

[36]  James Allan,et al.  A comparison of statistical significance tests for information retrieval evaluation , 2007, CIKM '07.

[37]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[38]  Dina Demner-Fushman,et al.  Application of Information Technology: Essie: A Concept-based Search Engine for Structured Biomedical Text , 2007, J. Am. Medical Informatics Assoc..

[39]  Marti A. Hearst,et al.  TREC 2007 Genomics Track Overview , 2007, TREC.