论文信息 - SDL: Saliency-Based Dictionary Learning Framework for Image Similarity

SDL: Saliency-Based Dictionary Learning Framework for Image Similarity

In image classification, obtaining adequate data to learn a robust classifier has often proven to be difficult in several scenarios. Classification of histological tissue images for health care analysis is a notable application in this context due to the necessity of surgery, biopsy or autopsy. To adequately exploit limited training data in classification, we propose a saliency guided dictionary learning method and subsequently an image similarity technique for histo-pathological image classification. Salient object detection from images aids in the identification of discriminative image features. We leverage the saliency values for the local image regions to learn a dictionary and respective sparse codes for an image, such that the more salient features are reconstructed with smaller error. The dictionary learned from an image gives a compact representation of the image itself and is capable of representing images with similar content, with comparable sparse codes. We employ this idea to design a similarity measure between a pair of images, where local image features of one image, are encoded with the dictionary learned from the other and vice versa. To effectively utilize the learned dictionary, we take into account the contribution of each dictionary atom in the sparse codes to generate a global image representation for image comparison. The efficacy of the proposed method was evaluated using three tissue data sets that consist of mammalian kidney, lung and spleen tissue, breast cancer, and colon cancer tissue images. From the experiments, we observe that our methods outperform the state of the art with an increase of 14.2% in the average classification accuracy over all data sets.

Scott T. Acton | Rituparna Sarkar | S. Acton | Rituparna Sarkar

[1] Pere-Pau Vázquez,et al. Using Normalized Compression Distance for image similarity measurement: an experimental study , 2011, The Visual Computer.

[2] Eamonn J. Keogh,et al. A Compression Based Distance Measure for Texture , 2010, SDM.

[3] Tanaya Guha,et al. Sparse representation-based image quality assessment , 2013, Signal Process. Image Commun..

[4] Kevin Skadron,et al. A meta-algorithm for classification by feature nomination , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[5] Péter Gács,et al. Information Distance , 1998, IEEE Trans. Inf. Theory.

[6] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[7] David Zhang,et al. Fisher Discrimination Dictionary Learning for sparse representation , 2011, 2011 International Conference on Computer Vision.

[8] Nanning Zheng,et al. Learning to Detect a Salient Object , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] C. Koch,et al. Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[10] S. Süsstrunk,et al. Frequency-tuned salient region detection , 2009, CVPR 2009.

[11] Scott T. Acton,et al. Region Based Segmentation in Presence of Intensity Inhomogeneity Using Legendre Polynomials , 2015, IEEE Signal Processing Letters.

[12] Aidong Zhang,et al. Semantics-Based Image Retrieval by Region Saliency , 2002, CIVR.

[13] Fabio A. González,et al. Histopathology Image Classification Using Bag of Features and Kernel Functions , 2009, AIME.

[14] R. A. Leibler,et al. On Information and Sufficiency , 1951 .

[15] Ming Li,et al. Clustering by compression , 2003, IEEE International Symposium on Information Theory, 2003. Proceedings..

[16] Guillermo Sapiro,et al. Online dictionary learning for sparse coding , 2009, ICML '09.

[17] Eduardo Romero,et al. A supervised visual model for finding regions of interest in basal cell carcinoma images , 2011, Diagnostic pathology.

[18] Scott T. Acton,et al. Slide: Saliency guided image dictionary and image similarity evaluation , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[19] Yihong Gong,et al. Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20] Larry S. Davis,et al. Learning a discriminative dictionary for sparse coding via label consistent K-SVD , 2011, CVPR 2011.

[21] David J. Field,et al. Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[22] King Ngi Ngan,et al. Saliency model-based face segmentation and tracking in head-and-shoulder video sequences , 2008, J. Vis. Commun. Image Represent..

[23] Yousef Al-Kofahi,et al. Improved Automatic Detection and Segmentation of Cell Nuclei in Histopathology Images , 2010, IEEE Transactions on Biomedical Engineering.

[24] Yael Pritch,et al. Saliency filters: Contrast based filtering for salient region detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[25] Pascal Fua,et al. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26] Francesco Bianconi,et al. Collection of textures in colorectal cancer histology , 2016 .

[27] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[28] Nuno Vasconcelos,et al. Saliency-based discriminant tracking , 2009, CVPR.

[29] Shiri Gordon,et al. An efficient image similarity measure based on approximations of KL-divergence between two gaussian mixtures , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[30] B. S. Manjunath,et al. Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[31] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[32] Chunhua Shen,et al. Real-time visual tracking using compressive sensing , 2011, CVPR 2011.

[33] Allen Y. Yang,et al. Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34] Haibin Ling,et al. An Efficient Earth Mover's Distance Algorithm for Robust Histogram Comparison , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35] Vishal Monga,et al. Simultaneous Sparsity Model for Histopathological Image Representation and Classification , 2014, IEEE Transactions on Medical Imaging.

[36] Kevin Skadron,et al. Image classification by multi-kernel dictionary learning , 2014, 2014 48th Asilomar Conference on Signals, Systems and Computers.

[37] Bing Li,et al. Active Contour External Force Using Vector Field Convolution for Image Segmentation , 2007, IEEE Transactions on Image Processing.

[38] Scott T. Acton,et al. Dictionary Learning Level Set , 2015, IEEE Signal Processing Letters.

[39] Ali Borji,et al. Scene classification with a sparse set of salient regions , 2011, 2011 IEEE International Conference on Robotics and Automation.

[40] Francesco Bianconi,et al. Multi-class texture analysis in colorectal cancer histology , 2016, Scientific Reports.

[41] Michael Elad,et al. Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[42] Pietro Perona,et al. Graph-Based Visual Saliency , 2006, NIPS.

[43] Christof Koch,et al. A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[44] B. S. Manjunath,et al. Use of imperfectly segmented nuclei in the classification of histopathology images of breast cancer , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[45] Guillermo Sapiro,et al. Discriminative learned dictionaries for local image analysis , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[46] Peter E. Hart,et al. Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[47] Scott T. Acton,et al. SSPARED: Saliency and sparse code analysis for rare event detection in video , 2016, 2016 IEEE 12th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP).

[48] Adrian G. Bors,et al. Image retrieval based on query by saliency content , 2015, Digit. Signal Process..

[49] Wilson S. Geisler,et al. Multichannel Texture Analysis Using Localized Spatial Filters , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[50] Huchuan Lu,et al. Saliency Detection via Graph-Based Manifold Ranking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[51] Joel A. Tropp,et al. Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit , 2007, IEEE Transactions on Information Theory.

[52] Gert R. G. Lanckriet,et al. Multi-class object localization by combining local contextual interactions , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[53] Kjersti Engan,et al. Learned dictionaries for sparse image representation: properties and results , 2011, Optical Engineering + Applications.

[54] Vishal Monga,et al. Histopathological Image Classification Using Discriminative Feature-Oriented Dictionary Learning , 2015, IEEE Transactions on Medical Imaging.

[55] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[56] Tanaya Guha,et al. Image Similarity Using Sparse Representation and Compression Distance , 2012, IEEE Transactions on Multimedia.

[57] Dimitris N. Metaxas,et al. Deformable segmentation via sparse representation and dictionary learning , 2012, Medical Image Anal..

[58] Guillermo Sapiro,et al. Classification and clustering via dictionary learning with structured incoherence and shared features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[59] Tony F. Chan,et al. Active contours without edges , 2001, IEEE Trans. Image Process..

[60] Baoxin Li,et al. Discriminative K-SVD for dictionary learning in face recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[61] Ming Li,et al. Normalized Information Distance , 2008, ArXiv.

[62] Namrata Vaswani,et al. Tracking sparse signal sequences from nonlinear/non-Gaussian measurements and applications in illumination-motion tracking , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[63] Patrick Le Callet,et al. A coherent computational approach to model bottom-up visual attention , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.