Multimodal medical image retrieval system

In this paper we depict an implemented system for medical image retrieval. Our system performs retrieval based on both textual and visual content, separately and combined, using advanced encoding and quantization techniques. The text-based retrieval subsystem uses textual data acquired from an image’s corresponding article to generate a suitable representation. Using a vector space model, the generated representations structure is altered to increase performance. Query expansion with pseudo-relevance feedback is applied to fine-tune the results. The content-based retrieval subsystem performs retrieval based on visual features extracted from the images. A Gaussian Mixture Model is constructed from the extracted visual features, in our case - RGB histograms, and is used in encoding the same features into Fisher Vectors. With scalability and speed in mind, we utilized a product quantization technique over the generated vectors, which provides fast response times over large image collections. Product quantization drastically reduces the size of the image representation at almost no cost to accuracy, thus improving the scalability factor of our system. Our system uses modality classification to further improve retrieval results. This subsystem labels the image modality based on their visual content. The images are described using state-of-the-art opponentSIFT visual features. Classification was performed using Support Vector Machines (SVMs). The predictions from the SVMs are used for re-ranking the resulting images based on their modality and the modality of the query. The system was evaluated against the standardized ImageCLEF 2013, 2012 and 2011 medical datasets and it reported state-of-the-art performance for all datasets.

[1]  Stéfan Jacques Darmoni,et al.  Natural language processing versus content-based image analysis for medical document retrieval , 2009, J. Assoc. Inf. Sci. Technol..

[2]  Henning Müller,et al.  Comparing the quality of accessing medical literature using content-based visual and textual information retrieval , 2009, Medical Imaging.

[3]  Craig MacDonald,et al.  University of Glasgow at WebCLEF 2005: Experiments in per-field Normalisation and Language Specific Stemming , 2005, CLEF.

[4]  Sébastien Ourselin,et al.  Immediate ROI Search for 3-D Medical Images , 2012, MCBR-CDS.

[5]  C. Begley,et al.  Selecting a database for literature searches in nursing: MEDLINE or CINAHL? , 1996, Journal of advanced nursing.

[6]  Lei Zheng,et al.  Design and analysis of a content-based pathology image retrieval system , 2003, IEEE Transactions on Information Technology in Biomedicine.

[7]  Javed A. Aslam,et al.  Relevance score normalization for metasearch , 2001, CIKM '01.

[8]  Daekeun You,et al.  Multimodal biomedical image retrieval using hierarchical classification and modality fusion , 2013, International Journal of Multimedia Information Retrieval.

[9]  Hsuan-Tien Lin,et al.  A note on Platt’s probabilistic outputs for support vector machines , 2007, Machine Learning.

[10]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[11]  Cordelia Schmid,et al.  Combining attributes and Fisher vectors for efficient image retrieval , 2011, CVPR 2011.

[12]  Ivan Kitanovski,et al.  Multimodal Medical Image Retrieval , 2012, ICT Innovations.

[13]  Mohan S. Kankanhalli,et al.  Multimodal fusion for multimedia analysis: a survey , 2010, Multimedia Systems.

[14]  Barbara Caputo,et al.  Discriminative cue integration for medical image annotation , 2008, Pattern Recognit. Lett..

[15]  Gjorgji Madjarov,et al.  Medical Image Retrieval Using Multimodal Data , 2014, Discovery Science.

[16]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  William R. Hersh,et al.  Effectiveness of global features for automatic medical image classification and retrieval - The experiences of OHSU at ImageCLEFmed , 2008, Pattern Recognit. Lett..

[18]  Payel Ghosh,et al.  Review of medical image retrieval systems and future directions , 2011, 2011 24th International Symposium on Computer-Based Medical Systems (CBMS).

[19]  L. Christophorou Science , 2018, Emerging Dynamics: Science, Energy, Society and Values.

[20]  Henning Müller,et al.  Overview of the ImageCLEF 2013 Medical Tasks , 2013, CLEF.

[21]  Henning Müller,et al.  Overview of the CLEF 2011 Medical Image Classification and Retrieval Tasks , 2011, CLEF.

[22]  Nikolas P. Galatsanos,et al.  A similarity learning approach to content-based image retrieval: application to digital mammography , 2004, IEEE Transactions on Medical Imaging.

[23]  Henning Müller,et al.  The medGIFT Group in ImageCLEFmed 2013 , 2013, CLEF.

[24]  Adil Alpkocak,et al.  DEMIR at ImageCLEFMed 2013: The Effects of Modality Classification to Information Retrieval , 2013, CLEF.

[25]  Preslav Nakov,et al.  BioText Search Engine: beyond abstract search , 2007, Bioinform..

[26]  Thomas Martin Deserno,et al.  Content-based image retrieval in medical applications: a novel multistep approach , 1999, Electronic Imaging.

[27]  Cheng Thao,et al.  GoldMiner: a radiology image search engine. , 2007, AJR. American journal of roentgenology.

[28]  David Dagan Feng,et al.  Content-Based Medical Image Retrieval: A Survey of Applications to Multidimensional and Multimodality Data , 2013, Journal of Digital Imaging.

[29]  Cor J. Veenman,et al.  Visual Word Ambiguity , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Julie Delon,et al.  Combining color and geometry for local image matching , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[31]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[32]  Andrew Zisserman,et al.  The devil is in the details: an evaluation of recent feature encoding methods , 2011, BMVC.

[33]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[35]  Michael Kohnen,et al.  Quality of DICOM header information for image categorization , 2002, SPIE Medical Imaging.

[36]  Theodore Kalamboukis,et al.  IPL at CLEF 2013 Medical Retrieval Task , 2013, CLEF.

[37]  Ivan Kitanovski,et al.  FCSE at Medical Tasks of ImageCLEF 2013 , 2013, CLEF.

[38]  Henning Müller,et al.  Overview of the ImageCLEF 2012 Medical Image Retrieval and Classification Tasks , 2012, CLEF.

[39]  H. Damasio,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence: Special Issue on Perceptual Organization in Computer Vision , 1998 .

[40]  Craig MacDonald,et al.  Terrier Information Retrieval Platform , 2005, ECIR.

[41]  Mark Sanderson,et al.  The CLEF Cross Language Image Retrieval Track (ImageCLEF) 2004 , 2004, CLEF.

[42]  Ivan Kitanovski,et al.  Merging words and concepts for medical articles retrieval , 2013, OAIR.

[43]  Erkki Oja,et al.  Medical Document Mining Combining Image Exploration and Text Characterization , 2014, Discovery Science.

[44]  Michael Krauthammer,et al.  Yale Image Finder (YIF): a new search engine for retrieving biomedical images , 2008, Bioinform..

[45]  Dina Demner-Fushman,et al.  Application of Information Technology: Essie: A Concept-based Search Engine for Structured Biomedical Text , 2007, J. Am. Medical Informatics Assoc..

[46]  C. Dye,et al.  Research for Universal Health Coverage , 2013, Science Translational Medicine.

[47]  Sašo Džeroski,et al.  Improved medical image modality classification using a combination of visual and textual features , 2015, Comput. Medical Imaging Graph..

[48]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[49]  Hugo Jair Escalante,et al.  Late fusion of heterogeneous methods for multimedia image retrieval , 2008, MIR '08.