Non-negative Sparse Coding Using Independent Multi-Codebooks for Near-Duplicate Image Detection

In this paper, we propose an efficient approach for detecting near-duplicate images and make three contributions as follows. First, for each sub-region of spatial pyramid, we learn one distinct codebook such that independent multi-codebooks (IMC) are produced. IMC is more accurate than traditional codebook because it considers the spatial information of visual words to a certain extent. Second, we adopt non-negative sparse coding (NSC) technique to encode features. This encoding scheme can effectively encourage similar features to share similar sparse representations. Third, we design an improved intersection kernel (IIK) to compute image similarity. We validate our approach on two datasets respectively, namely our 6K dataset where images are collected from three web image search engines and publicly available University of Kentucky dataset. The experimental results demonstrate our technique achieves significant performance gain compared with state-of-the-art approaches.

[1]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[2]  Subhransu Maji,et al.  Classification using intersection kernel support vector machines is efficient , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Zhe Wang,et al.  High-confidence near-duplicate image detection , 2012, ICMR.

[4]  Shih-Fu Chang,et al.  Detecting image near-duplicate by stochastic attributed relational graph matching with learning , 2004, MULTIMEDIA '04.

[5]  Axel Pinz,et al.  Computer Vision – ECCV 2006 , 2006, Lecture Notes in Computer Science.

[6]  Trevor Darrell,et al.  The pyramid match kernel: discriminative classification with sets of image features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[7]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[10]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[12]  Edward Y. Chang,et al.  Enhancing DPF for near-replica image recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..