Sparse coding in practice

The goal in sparse coding is to seek a linear basis representation where each image is represented by a small number of active coefficients. The learning algorithm involves adapting a basis vector set while imposing a low-entropy, or sparse, prior on the output coefficients. Sparse coding applied on natural images has been shown to extract wavelet-like structure [9, 4]. However, our experience in using sparse coding for extracting multi-scale structure in object-specific ensembles, such as face images or images of a gesturing hand, has been negative. In this paper we highlight three points about the reliability of sparse coding for extracting the desired structure: using an overcomplete representation projecting data into a low-dimensional subspace before attempting to resolve the sparse structure and applying sparsity constraint on the basis elements, as opposed to the output coefficients.

[1]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[2]  Daniel L. Ruderman,et al.  Origins of scaling in natural images , 1996, Vision Research.

[3]  R W Prager,et al.  Development of low entropy coding in a recurrent network. , 1996, Network.

[4]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[5]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[6]  M. Bartlett,et al.  Face image analysis by unsupervised learning and redundancy reduction , 1998 .

[7]  David Mumford,et al.  Statistics of natural images and models , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[8]  Terrence J. Sejnowski,et al.  Learning Overcomplete Representations , 2000, Neural Computation.

[9]  A. Jepson,et al.  Sparse PCA. Extracting multi-scale structure from data , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[10]  Allan D. Jepson,et al.  Sparse PCA: Extracting Multi-scale Structure from Data , 2001, ICCV.