Combating Ambiguity for Hash-Code Learning in Medical Instance Retrieval

When encountering a dubious diagnostic case, medical instance retrieval can help radiologists make evidence-based diagnoses by finding images containing instances similar to a query case from a large image database. The similarity between the query case and retrieved similar cases is determined by visual features extracted from pathologically abnormal regions. However, the manifestation of these regions often lacks specificity, i.e., different diseases can have the same manifestation, and different manifestations may occur at different stages of the same disease. To combat the manifestation ambiguity in medical instance retrieval, we propose a novel deep framework called Y-Net, encoding images into compact hash-codes generated from convolutional features by feature aggregation. Y-Net can learn highly discriminative convolutional features by unifying the pixel-wise segmentation loss and classification loss. The segmentation loss allows exploring subtle spatial differences for good spatial-discriminability while the classification loss utilizes class-aware semantic information for good semantic-separability. As a result, Y-Net can enhance the visual features in pathologically abnormal regions and suppress the disturbing of the background during model training, which could effectively embed discriminative features into the hash-codes in the retrieval stage. Extensive experiments on two medical image datasets demonstrate that Y-Net can alleviate the ambiguity of pathologically abnormal regions and its retrieval performance outperforms the state-of-the-art method by an average of 9.27% on the returned list of 10.

[1]  Hayit Greenspan,et al.  Semi-supervised lung nodule retrieval , 2020, ArXiv.

[2]  Nassir Navab,et al.  Hashing with Residual Networks for Image Retrieval , 2017, MICCAI.

[3]  Nassir Navab,et al.  Metric hashing forests , 2016, Medical Image Anal..

[4]  David Stutz,et al.  Neural Codes for Image Retrieval , 2015 .

[5]  Jaeyoon Kim,et al.  Regional Attention Based Deep Feature for Image Retrieval , 2018, BMVC.

[6]  Ronan Sicre,et al.  Particular object retrieval with integral max-pooling of CNN activations , 2015, ICLR.

[7]  Xavier Giró-i-Nieto,et al.  Class-Weighted Convolutional Features for Visual Instance Search , 2017, BMVC.

[8]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Wan-Lei Zhao,et al.  Instance Search via Instance Level Segmentation and Feature Representation , 2018, J. Vis. Commun. Image Represent..

[10]  Henning Müller,et al.  Large‐scale retrieval for medical image analytics: A comprehensive review , 2018, Medical Image Anal..

[11]  Jon Almazán,et al.  Learning With Average Precision: Training Image Retrieval With a Listwise Loss , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[12]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[13]  Nassir Navab,et al.  Deep Multiple Instance Hashing for Scalable Medical Image Retrieval , 2017, MICCAI.

[14]  Atsuto Maki,et al.  Visual Instance Retrieval with Deep Convolutional Networks , 2014, ICLR.

[15]  Ian D. Reid,et al.  Fast Training of Triplet-Based Deep Binary Embedding Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Ş. Öztürk Image Inpainting based Compact Hash Code Learning using Modified U-Net , 2020, 2020 4th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT).

[17]  Qi Tian,et al.  Recent Advance in Content-based Image Retrieval: A Literature Survey , 2017, ArXiv.

[18]  Shiguang Shan,et al.  Deep Supervised Hashing for Fast Image Retrieval , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Simon Osindero,et al.  Cross-Dimensional Weighting for Aggregated Deep Convolutional Features , 2015, ECCV Workshops.

[20]  Daniel L Rubin,et al.  Content-based image retrieval in radiology: analysis of variability in human perception of similarity , 2015, Journal of medical imaging.

[21]  Victor S. Lempitsky,et al.  Aggregating Local Deep Features for Image Retrieval , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[22]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Ngai-Man Cheung,et al.  Compact Hash Code Learning With Binary Deep Neural Network , 2017, IEEE Transactions on Multimedia.

[24]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[25]  Zi Huang,et al.  Where to Focus: Query Adaptive Matching for Instance Retrieval Using Convolutional Feature Maps , 2016, ArXiv.

[26]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[27]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[28]  Yichen Wei,et al.  Circle Loss: A Unified Perspective of Pair Similarity Optimization , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  K. Doi,et al.  Investigation of new psychophysical measures for evaluation of similar images on thoracic computed tomography for distinction between benign and malignant nodules. , 2003, Medical physics.

[30]  Roberto Cipolla,et al.  Multi-task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31]  Bram van Ginneken,et al.  Segmentation of anatomical structures in chest radiographs using supervised methods: a comparative study on a public database , 2006, Medical Image Anal..

[32]  Trevor Darrell,et al.  Do Convnets Learn Correspondence? , 2014, NIPS.

[33]  Jing Liu,et al.  Deep Incremental Hashing Network for Efficient Image Retrieval , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Noel E. O'Connor,et al.  Bags of Local Convolutional Features for Scalable Instance Search , 2016, ICMR.

[35]  Juho Kannala,et al.  Context Aware Query Image Representation for Particular Object Retrieval , 2017, SCIA.

[36]  Şaban Öztürk,et al.  Stacked auto-encoder based tagging with deep features for content-based medical image retrieval , 2020, Expert Syst. Appl..

[37]  Krystian Mikolajczyk,et al.  SOLAR: Second-Order Loss and Attention for Image Retrieval , 2020, ECCV.

[38]  Andrew Zisserman,et al.  Smooth object retrieval using a bag of boundaries , 2011, 2011 International Conference on Computer Vision.

[39]  Svetlana Lazebnik,et al.  Locality-sensitive binary codes from shift-invariant kernels , 2009, NIPS.

[40]  Huazhu Fu,et al.  Applications of Deep Learning in Fundus Images: A Review , 2021, Medical Image Anal..

[41]  Jeff Johnson,et al.  Billion-Scale Similarity Search with GPUs , 2017, IEEE Transactions on Big Data.

[42]  Hervé Jégou,et al.  Orientation Covariant Aggregation of Local Descriptors with Embeddings , 2014, ECCV.

[43]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Jie Lin,et al.  Deeply Activated Salient Region for Instance Search , 2020, ArXiv.

[45]  Xiaoqing Zhang,et al.  Attention-based Saliency Hashing for Ophthalmic Image Retrieval , 2020, 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[46]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[47]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[48]  M. Slaney,et al.  Locality-Sensitive Hashing for Finding Nearest Neighbors [Lecture Notes] , 2008, IEEE Signal Processing Magazine.

[49]  Anton van den Hengel,et al.  The treasure beneath convolutional layers: Cross-convolutional-layer pooling for image classification , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Jianmin Wang,et al.  Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[51]  Yi Shi,et al.  Deep Supervised Hashing with Triplet Labels , 2016, ACCV.

[52]  Jiwen Lu,et al.  Order-Sensitive Deep Hashing for Multimorbidity Medical Image Retrieval , 2018, MICCAI.

[53]  Huazhu Fu,et al.  Deep Triplet Hashing Network for Case-based Medical Image Retrieval , 2021, Medical Image Anal..

[54]  Ş. Öztürk Two-Stage Sequential Losses based Automatic Hash Code Generation using Siamese Network , 2020 .

[55]  Qi Tian,et al.  SIFT Meets CNN: A Decade Survey of Instance Retrieval , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  Albert Gordo,et al.  End-to-End Learning of Deep Visual Representations for Image Retrieval , 2016, International Journal of Computer Vision.

[57]  K. Doi,et al.  Development of a digital image database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiologists' detection of pulmonary nodules. , 2000, AJR. American journal of roentgenology.

[58]  Michael A. Casey,et al.  Locality-Sensitive Hashing for Finding Nearest Neighbors , 2008 .

[59]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[60]  Yannis Avrithis,et al.  To Aggregate or Not to aggregate: Selective Match Kernels for Image Search , 2013, 2013 IEEE International Conference on Computer Vision.

[61]  Yuning Jiang,et al.  SOLO: Segmenting Objects by Locations , 2020, ECCV.

[62]  Shin'ichi Satoh,et al.  Faster R-CNN Features for Instance Search , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[63]  Tien Yin Wong,et al.  Similarity regularized sparse group lasso for cup to disc ratio computation. , 2017, Biomedical optics express.

[64]  Noel E. O'Connor,et al.  Saliency Weighted Convolutional Features for Instance Search , 2017, 2018 International Conference on Content-Based Multimedia Indexing (CBMI).

[65]  Dinggang Shen,et al.  Deep Disentangled Hashing with Momentum Triplets for Neuroimage Search , 2020, MICCAI.

[66]  Wu-Jun Li,et al.  Feature Learning Based Deep Supervised Hashing with Pairwise Labels , 2015, IJCAI.