Learning Dictionaries of Discriminative Image Patches

Remarkable results have been obtained using image models based on image patches, for example sparse generative models for image inpainting, noise reduction and superresolution, sparse texture segmentation or texton models. In this paper we propose a powerful and yet simple approach for segmentation using dictionaries of image patches with associated label data. The approach is based on ideas from sparse generative image models and texton based texture modeling. The intensity and label dictionaries are learned from training images with associated label information of (a subset) of the pixels based on a modified vector quantization approach. For new images the intensity dictionary is used to encode the image data and the label dictionary is used to build a segmentation of the image. We demonstrate the algorithm on composite and real texture images and show how successful training is possible even for noisy image and low-quality label training data. In our experimental evaluation we achieve state-of-the-art performance for segmentation.

[1]  Guillermo Sapiro,et al.  Discriminative learned dictionaries for local image analysis , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[3]  Guillermo Sapiro,et al.  Supervised Dictionary Learning , 2008, NIPS.

[4]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[5]  Gabriel Peyré,et al.  Sparse Modeling of Textures , 2009, Journal of Mathematical Imaging and Vision.

[6]  D. Donoho,et al.  Simultaneous cartoon and texture image inpainting using morphological component analysis (MCA) , 2005 .

[7]  Michael Elad,et al.  Sparse and Redundant Representations - From Theory to Applications in Signal and Image Processing , 2010 .

[8]  Meena Mahajan,et al.  The Planar k-means Problem is NP-hard I , 2009 .

[9]  Michael Elad,et al.  Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[10]  Michael Elad,et al.  Compression of facial images using the K-SVD algorithm , 2008, J. Vis. Commun. Image Represent..

[11]  Thomas S. Huang,et al.  Image super-resolution as sparse representation of raw image patches , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Roberto Cipolla,et al.  Semantic texton forests for image categorization and segmentation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Trygve Randen,et al.  Filtering for Texture Classification: A Comparative Study , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Kannan Ramchandran,et al.  Denoising by Sparse Approximation: Error Bounds Based on Rate-Distortion Theory , 2006, EURASIP J. Adv. Signal Process..

[15]  B. Julesz Textons, the elements of texture perception, and their interactions , 1981, Nature.

[16]  Karl Skretting,et al.  Texture Classification Using Sparse Frame-Based Representations , 2006, EURASIP J. Adv. Signal Process..

[17]  Jitendra Malik,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.