An Approach for Multimodal Medical Image Retrieval using Latent Dirichlet Allocation

Modern medical practices are increasingly dependent on Medical Imaging for clinical analysis and diagnoses of patient illnesses. A significant challenge when dealing with the extensively available medical data is that it often consists of heterogeneous modalities. Existing works in the field of Content based medical image retrieval (CBMIR) have several limitations as they focus mainly on visual or textual features for retrieval. Given the unique manifold of medical data, we seek to leverage both the visual and textual modalities to improve the image retrieval. We propose a Latent Dirichlet Allocation (LDA) based technique for encoding the visual features and show that these features effectively model the medical images. We explore early fusion and late fusion techniques to combine these visual features with the textual features. The proposed late fusion technique achieved a higher mAP than the state-of-the-art on the ImageCLEF 2009 dataset, underscoring its suitability for effective multimodal medical image retrieval.

[1]  Muhammad Awais,et al.  Medical image retrieval using deep convolutional neural network , 2017, Neurocomputing.

[2]  Rainer Lienhart,et al.  Multilayer pLSA for multimodal image retrieval , 2009, CIVR '09.

[3]  H. Greenspan,et al.  Automated retrieval of CT images of liver lesions on the basis of image similarity: method and preliminary results. , 2010, Radiology.

[4]  Degui Xiao,et al.  Medical Image Retrieval: A Multimodal Approach , 2014, Cancer informatics.

[5]  Hermann Ney,et al.  The IRMA Project: A State of the Art Report on Content-Based Image Retrieval in Medical Applications , 2003 .

[6]  Xiaoying Tai,et al.  An Improved Approach Based on FCM Using Feature Fusion for Medical Image Retrieval , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[7]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[8]  Hayit Greenspan,et al.  Medical Image Categorization and Retrieval for PACS Using the GMM-KL Framework , 2007, IEEE Transactions on Information Technology in Biomedicine.

[9]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[10]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Joo-Hwee Lim,et al.  Latent semantic fusion model for image retrieval and annotation , 2007, CIKM '07.

[12]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[13]  Dorota Glowacka,et al.  Interactive Content-Based Image Retrieval with Deep Neural Networks , 2016, Symbiotic.

[14]  Yongwang Zhao,et al.  Medical Image Retrieval with Query-Dependent Feature Fusion Based on One-Class SVM , 2010, 2010 13th IEEE International Conference on Computational Science and Engineering.

[15]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[16]  L. Rodney Long,et al.  Multi-modal Query Expansion Based on Local Analysis for Medical Image Retrieval , 2009, MCBR-CDS.