Principal object detection towards product image search

Online shopping is an attractive, convenient, and efficient shopping way for billions of web users. A disappointing fact is that it is usually difficult for users to find the products fitting their needs purely based on text search. Content-based product image search becomes a promising way to solve this problem. However, the presence of natural backgrounds and fashion models significantly affect the feature matching, which makes product image search a challenging task. To clean the background and minimize the influence of noises, in this paper, a graph-based principal object detection algorithm is proposed to extract the product items while removing backgrounds and noises. A product image retrieval system is then constructed to verify the effectiveness of the proposed approach. Experiments on a large scale dataset with 1.36 million product images crawled from Taobao demonstrate the proposed approach significantly improves the retrieval performance.

[1]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Shumeet Baluja,et al.  Pagerank for product image search , 2008, WWW.

[3]  Yan Ke,et al.  Efficient Near-duplicate Detection and Sub-image Retrieval , 2004 .

[4]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[5]  Chong-Wah Ngo,et al.  Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation , 2006, MM '06.

[6]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[7]  智一 吉田,et al.  Efficient Graph-Based Image Segmentationを用いた圃場図自動作成手法の検討 , 2014 .

[8]  Haojie Li,et al.  iSearch: towards precise retrieval of item image , 2011, ICIMCS '11.

[9]  Nenghai Yu,et al.  iLike: integrating visual and textual features for vertical search , 2010, ACM Multimedia.

[10]  VekslerOlga,et al.  Fast Approximate Energy Minimization via Graph Cuts , 2001 .

[11]  Chong-Wah Ngo,et al.  On the Annotation of Web Videos by Efficient Near-Duplicate Search , 2010, IEEE Transactions on Multimedia.

[12]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[13]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[14]  Hai Jin,et al.  Image segmentation with patch-pair density priors , 2010, ACM Multimedia.

[15]  Dorin Comaniciu,et al.  Mean shift analysis and applications , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[16]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[17]  Charles T. Zahn,et al.  Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters , 1971, IEEE Transactions on Computers.