论文信息 - An efficient initialization method for D-KSVD algorithm for image classification

An efficient initialization method for D-KSVD algorithm for image classification

In the fields of pattern recognition and signal processing, there has been a growing interest in task-driven dictionary learning, which is effective in applications in computer vision such as face recognition and image classification. Discriminative K-SVD (D-KSVD), a newly proposed dictionary learning method, has better discrimination ability since it incorporates the classification error into its object function and learns a discriminative dictionary and a linear classifier simultaneously. But D-KSVD is still a two-step iterative method, and its convergence speed is heavily influenced by the initialization values. In this paper, a novel initialization method is proposed for the D-KSVD dictionary learning algorithm, in which the naive Bayesian classifier is employed to initialize the linear classifier in D-KSVD. Then the D-KSVD problem is reformulated and the globally optimal solution for all the parameters can be found by an extended K-SVD algorithm. The reformulated problem also learns a multi-class classifier, which is particularly suitable for datasets with large number of categories. Experimental results show that D-KSVDs with initialization of our method converge faster and have better classification results compared with several baseline dictionary learning algorithms.

Zhongrong Shi | Zhongrong Shi | Yanting Lu | Yanting Lu

[1] Jean Ponce,et al. Learning mid-level features for recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2] Pietro Perona,et al. Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[3] Marc'Aurelio Ranzato,et al. Efficient Learning of Sparse Representations with an Energy-Based Model , 2006, NIPS.

[4] Michael Elad,et al. Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[5] Xuelong Li,et al. General Tensor Discriminant Analysis and Gabor Features for Gait Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Chengjun Liu,et al. Gabor feature based classification using the enhanced fisher linear discriminant model for face recognition , 2002, IEEE Trans. Image Process..

[7] A. Bruckstein,et al. K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[8] Svetha Venkatesh,et al. Joint learning and dictionary construction for pattern recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Yihong Gong,et al. Linear spatial pyramid matching using sparse coding for image classification , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[11] Baoxin Li,et al. Discriminative K-SVD for dictionary learning in face recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12] Zhiwei Li,et al. Max-Margin Dictionary Learning for Multiclass Image Categorization , 2010, ECCV.

[13] Guillermo Sapiro,et al. Discriminative learned dictionaries for local image analysis , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14] M. Elad,et al. $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[15] Ke Huang,et al. Sparse Representation for Signal Classification , 2006, NIPS.

[16] Y. C. Pati,et al. Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[17] David J. Field,et al. Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[18] J. Andrew Bagnell,et al. Differential Sparse Coding , 2008 .

[19] Guillermo Sapiro,et al. Supervised Dictionary Learning , 2008, NIPS.

[20] Lei Zhang,et al. Sparse representation or collaborative representation: Which helps face recognition? , 2011, 2011 International Conference on Computer Vision.

[21] Larry S. Davis,et al. Learning a discriminative dictionary for sparse coding via label consistent K-SVD , 2011, CVPR 2011.

[22] Michael Elad,et al. Compression of facial images using the K-SVD algorithm , 2008, J. Vis. Commun. Image Represent..

[23] Yu Qiao,et al. Face recognition based on Gradient Gabor feature , 2008, 2008 15th IEEE International Conference on Image Processing.

[24] Allen Y. Yang,et al. Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Michael J. Lyons,et al. Coding facial expressions with Gabor wavelets , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[26] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[27] Thomas S. Huang,et al. Supervised translation-invariant sparse coding , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28] Thomas G. Dietterich,et al. Learning non-redundant codebooks for classifying complex objects , 2009, ICML '09.

[29] Martial Hebert,et al. Discriminative Sparse Image Models for Class-Specific Edge Detection and Image Interpretation , 2008, ECCV.

[30] A. Martínez,et al. The AR face databasae , 1998 .

[31] Michael Elad,et al. Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[32] Aleix M. Martinez,et al. The AR face database , 1998 .