Minimizing the semantic gap in biomedical content-based image retrieval

A major challenge in biomedical content-based image retrieval (CBIR) is to achieve meaningful mappings that minimize the semantic gap between high-level biomedical semantic concepts and the low-level visual features of images. This paper presents a comprehensive learning-based scheme for meeting this challenge and improving retrieval quality. The scheme comprises two algorithms: a learning-based feature selection and fusion algorithm, and the Ranking Support Vector Machine (Ranking SVM) algorithm. The feature selection algorithm selects 'good' features and fuses them using different similarity measurements, so that the low-level image features better represent the high-level concepts. Ranking SVM is then applied to learn the retrieval rank function and to associate the selected low-level features with query concepts, given the ground-truth ranking of the training samples. The proposed scheme addresses four major issues in CBIR to improve retrieval accuracy: image feature extraction, selection, and fusion; similarity measurements; the association of low-level features with high-level concepts; and the generation of the rank function to support high-level semantic image retrieval. The scheme models the relationship between semantic concepts and image features, and enables retrieval at the semantic level. We apply it to the problem of vertebra shape retrieval from a digitized spine x-ray image set collected by the second National Health and Nutrition Examination Survey (NHANES II). The experimental results show an improvement of up to 41.92% in mean average precision (MAP) over conventional image similarity computation methods.
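For readers unfamiliar with the evaluation metric, the following is a minimal sketch of how mean average precision is commonly computed over a set of ranked retrieval results. It is not taken from the paper: the function names and the toy relevance lists are illustrative assumptions, and it assumes each ranked list contains every relevant item for its query, so that average precision is normalized by the number of relevant items found in the list.

```python
from typing import Sequence


def average_precision(relevance: Sequence[int]) -> float:
    """Average precision for one ranked result list.

    relevance[i] is 1 if the item at rank i+1 is relevant to the query,
    and 0 otherwise. Assumes all relevant items appear in the list.
    """
    hits = 0
    precision_sum = 0.0
    for rank, rel in enumerate(relevance, start=1):
        if rel:
            hits += 1
            # Precision at this rank, accumulated only at relevant positions.
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(runs: Sequence[Sequence[int]]) -> float:
    """Mean of the per-query average precision values."""
    if not runs:
        return 0.0
    return sum(average_precision(r) for r in runs) / len(runs)


# Hypothetical relevance judgments for two queries (illustrative only).
query_a = [1, 0, 1, 0]  # relevant items at ranks 1 and 3
query_b = [0, 1, 0, 0]  # relevant item at rank 2

print(round(average_precision(query_a), 4))
print(round(average_precision(query_b), 4))
print(round(mean_average_precision([query_a, query_b]), 4))
```

Under this definition, a ranking that places relevant vertebra shapes nearer the top of the list scores higher, which is why MAP is a natural way to compare a learned rank function against a fixed similarity measure.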
