Binary codes for tagging x-ray images via deep de-noising autoencoders

A Content-Based Image Retrieval (CBIR) system that identifies medical images similar to a query image can help clinicians reach a more accurate diagnosis. Recent CBIR research favors constructing and using binary codes to represent images. Deep architectures can adaptively learn the non-linear relationships among image pixels, allowing high-level features to be learned automatically from raw pixels. However, most of these architectures require class labels, which are expensive to obtain, particularly for medical images. Methods that do not need class labels use a deep autoencoder for binary hashing, but their code construction involves a specific training algorithm and an ad-hoc regularization technique. In this study, we explored using a deep de-noising autoencoder (DDA), trained with a new unsupervised scheme that uses only backpropagation and dropout, to hash images into binary codes. We conducted experiments on more than 14,000 x-ray images. Using class labels only to evaluate the retrieval results, we constructed a 16-bit DDA and a 512-bit DDA independently. Compared with other unsupervised methods, we obtained the lowest total error by using the 512-bit codes for retrieval via exhaustive search, and achieved a 9.27-fold speed-up with the 16-bit codes while keeping a comparable total error. We found that our new training scheme reduced the total retrieval error significantly, by 21.9%. To further boost image retrieval performance, we developed the Radon Autoencoder Barcode (RABC), which is learned from the Radon projections of images using a de-noising autoencoder. Experimental results demonstrated its superior retrieval performance when it was combined with DDA binary codes.
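As a rough illustration of retrieval with binary codes via exhaustive search, the sketch below thresholds real-valued code-layer activations into bits and ranks database entries by Hamming distance to a query code. It is not the authors' implementation: the 0.5 threshold, the function names `binarize` and `hamming_search`, and the random activations standing in for the output of a trained encoder are all assumptions made for this example.

```python
import numpy as np


def binarize(activations, threshold=0.5):
    """Threshold real-valued activations into {0, 1} bits.

    The 0.5 threshold is an assumption (it suits sigmoid-like outputs).
    """
    return (np.asarray(activations) >= threshold).astype(np.uint8)


def hamming_search(query_code, database_codes, k=3):
    """Exhaustive search: indices and distances of the k nearest codes."""
    # Hamming distance = number of bit positions where the codes differ.
    distances = np.count_nonzero(database_codes != query_code, axis=1)
    # A stable sort keeps database order among equally distant codes.
    order = np.argsort(distances, kind="stable")[:k]
    return order, distances[order]


# Hypothetical stand-in for encoder outputs: 1000 images, 16-bit code layer.
rng = np.random.default_rng(0)
activations = rng.random((1000, 16))
codes = binarize(activations)

# Use one database image as the query; its own code is at distance 0.
query = codes[42]
indices, distances = hamming_search(query, codes, k=3)
print(indices, distances)
```

Short codes keep each comparison cheap, while longer codes carry more information per image, which matches the speed versus error trade-off between the 16-bit and 512-bit codes described above.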