Parallel content-based sub-image retrieval using hierarchical searching

MOTIVATION The capacity to systematically search through large image collections and ensembles and detect regions exhibiting similar morphological characteristics is central to pathology diagnosis. Unfortunately, the primary methods used to search digitized, whole-slide histopathology specimens are slow and prone to inter- and intra-observer variability. The central objective of this research was to design, develop, and evaluate a content-based image retrieval system that helps doctors perform quick and reliable comparative searches for similar prostate image patches.

METHOD Given a representative image patch (sub-image), the algorithm returns a ranked ensemble of image patches from the entire whole-slide histology section that exhibit the most similar morphologic characteristics. This is accomplished by first performing a hierarchical search based on a newly developed hierarchical annular histogram (HAH). In the second stage of processing, the set of candidates is refined by computing a color histogram from eight equally divided segments within each square annular bin defined in the original HAH. A demand-driven master-worker parallelization approach is employed to speed up the search. Under this strategy, the query patch is broadcast to all worker processes, and the master process dynamically assigns each worker an image to search; the worker returns a ranked list of similar patches in that image.

RESULTS The algorithm was tested using digitized hematoxylin and eosin (H&E) stained prostate cancer specimens. Retrieval performance was strong: the recall rate within the top 40 ranked image patches was ∼90%.

AVAILABILITY AND IMPLEMENTATION Both the testing data and source code can be downloaded from http://pleiad.umdnj.edu/CBII/Bioinformatics/.
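
The two-stage search described under METHOD can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes small square grayscale patches stored as nested lists, uses an L1 distance, and stands in for the second stage with a finer two-ring histogram rather than the eight-segment color histogram of the paper. All function names (`ring_index`, `annular_histogram`, `rank_patches`) and parameters (`n_rings`, `n_bins`, `keep`, `stride`) are hypothetical.

```python
from typing import List, Tuple

Patch = List[List[int]]  # square grayscale patch, intensities 0..255


def ring_index(r: int, c: int, size: int, n_rings: int) -> int:
    """Square annular ring containing pixel (r, c); 0 is the innermost ring."""
    d_border = min(r, c, size - 1 - r, size - 1 - c)  # 0 on the patch border
    half = (size + 1) // 2  # number of distinct border distances
    return n_rings - 1 - (d_border * n_rings) // half


def annular_histogram(patch: Patch, n_rings: int, n_bins: int = 4) -> List[float]:
    """Concatenate one normalized intensity histogram per square ring."""
    size = len(patch)
    counts = [[0] * n_bins for _ in range(n_rings)]
    for r, row in enumerate(patch):
        for c, value in enumerate(row):
            counts[ring_index(r, c, size, n_rings)][value * n_bins // 256] += 1
    feature: List[float] = []
    for ring in counts:
        total = sum(ring) or 1  # guard against an empty ring
        feature.extend(n / total for n in ring)
    return feature


def l1(a: List[float], b: List[float]) -> float:
    return sum(abs(x - y) for x, y in zip(a, b))


def crop(image: Patch, top: int, left: int, size: int) -> Patch:
    return [row[left:left + size] for row in image[top:top + size]]


def rank_patches(query: Patch, image: Patch, keep: int = 5,
                 stride: int = 1) -> List[Tuple[float, int, int]]:
    """Return (distance, top, left) tuples, best match first."""
    size = len(query)
    windows = [(t, l)
               for t in range(0, len(image) - size + 1, stride)
               for l in range(0, len(image[0]) - size + 1, stride)]
    # Stage 1 (coarse): a single ring, i.e. one histogram over the whole window.
    q_coarse = annular_histogram(query, n_rings=1)
    coarse = sorted(
        (l1(q_coarse, annular_histogram(crop(image, t, l, size), 1)), t, l)
        for t, l in windows)
    # Stage 2 (fine): re-score only the surviving candidates with two rings.
    # Assumption: a simplified stand-in for the paper's segment-wise refinement.
    q_fine = annular_histogram(query, n_rings=2)
    fine = [(l1(q_fine, annular_histogram(crop(image, t, l, size), 2)), t, l)
            for _, t, l in coarse[:keep]]
    return sorted(fine)
```

Because each ring is symmetric about the patch center, the ring histograms are unchanged when the patch is rotated by 90 degrees, which is one reason annular bins suit tissue whose orientation on the slide is arbitrary. The coarse stage is cheap and prunes most windows, so the more expensive refinement runs on only `keep` candidates per image.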
