UbiShop: Commercial item recommendation using visual part-based object representation

With the popularity of online shopping, people have used to shop commercial items on the online shopping websites for convenience. However, based on traditional text search methods, people usually can not find the interesting commercial item they want if they do not know its detailed information, e.g., the name and the seller. Therefore, a more convenient method to help people find the commercial item they want is desired. In this work, we develop a practical system, UbiShop, on mobile phones, whereby users can timely get the related information of interesting commercial items by taking pictures of them. Users can also obtain recommendations on visually similar commercial items to help their buying selections. With the observation that people’s preferences on commercial items usually simply depend on their partial visual styles, we propose a novel representation, Visual Part-based Object Representation (VPOR), for commercial item images. The concept of VPOR is to decompose an item image into a set of disjointed partitions, with each of them represents a meaningful semantic parts. User can thus assign non-uniform preferences on the different parts of the commercial item to obtain a personalized recommended results. The experimental results verify our observation and show that the proposed VPOR based commercial item recommendation can achieve better performance than existing text-based and visual-based methods according to the user study.

[1]  Xiaogang Wang,et al.  IntentSearch: Capturing User Intention for One-Click Internet Image Search , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Zibin Zheng,et al.  WSRec: A Collaborative Filtering Based Web Service Recommender System , 2009, 2009 IEEE International Conference on Web Services.

[3]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[5]  Sabine Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Alexander C. Berg,et al.  Automatic Attribute Discovery and Characterization from Noisy Web Data , 2010, ECCV.

[7]  Jitendra Malik,et al.  Semantic segmentation using regions and parts , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Hsuan-Tien Lin,et al.  Unsupervised Semantic Feature Discovery for Image Object Retrieval and Tag Refinement , 2012, IEEE Transactions on Multimedia.

[9]  Kyoung-jae Kim,et al.  A recommender system using GA K-means clustering in an online shopping market , 2008, Expert Syst. Appl..

[10]  Rafael C. González,et al.  Digital image processing using MATLAB , 2006 .

[11]  Xiao Wu,et al.  Interactive product image search with complex scenes , 2012, ICIMCS '12.

[12]  Bo Luo,et al.  iLike: Bridging the Semantic Gap in Vertical Image Search by Integrating Text and Visual Features , 2013, IEEE Transactions on Knowledge and Data Engineering.

[13]  D. Norman Emotional design : why we love (or hate) everyday things , 2004 .

[14]  Ming-Syan Chen,et al.  PISAR: Progressive image search and recommendation system by auto-interpretation and user behavior , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[15]  Tong Zhang,et al.  Clothes search in consumer photos via color matching and attribute learning , 2011, ACM Multimedia.

[16]  Chao Liu,et al.  Recommender systems with social regularization , 2011, WSDM '11.

[17]  Ronen Basri,et al.  Shape representation and classification using the Poisson equation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[18]  Philip S. Yu,et al.  An effective hash-based algorithm for mining association rules , 1995, SIGMOD '95.

[19]  Charles T. Meadow,et al.  Text information retrieval systems , 1992 .

[20]  Su Myeon Kim,et al.  Development of a recommender system based on navigational and behavioral patterns of customers in e-commerce sites , 2005, Expert Syst. Appl..

[21]  Mariëlle E. H. Creusen,et al.  The Different Roles of Product Appearance in Consumer Choice , 2005 .

[22]  A. M. Madni,et al.  Recommender systems in e-commerce , 2014, 2014 World Automation Congress (WAC).

[23]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Kristen Grauman,et al.  Reshaping Visual Datasets for Domain Adaptation , 2013, NIPS.

[26]  Xing Xie,et al.  Salient Region Detection Using Weighted Feature Maps Based on the Human Visual Attention Model , 2004, PCM.

[27]  Yi-Hsuan Yang,et al.  Unsupervised auxiliary visual words discovery for large-scale image object retrieval , 2011, CVPR 2011.

[28]  Haojie Li,et al.  iSearch: towards precise retrieval of item image , 2011, ICIMCS '11.

[29]  Christopher K. I. Williams,et al.  A Generative Model for Parts-based Object Segmentation , 2012, NIPS.

[30]  Fei-Fei Li,et al.  Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[31]  Pierre Soille,et al.  Morphological Image Analysis: Principles and Applications , 2003 .

[32]  Fei-Fei Li,et al.  Image Segmentation with Topic Random Field , 2010, ECCV.

[33]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[34]  Amy L. Parsons,et al.  Emotional Design: Why We Love (or Hate) Everyday Things , 2006 .

[35]  HongJiang Zhang,et al.  Contrast-based image attention analysis by using fuzzy growing , 2003, MULTIMEDIA '03.

[36]  Cordelia Schmid,et al.  Improving Bag-of-Features for Large Scale Image Search , 2010, International Journal of Computer Vision.

[37]  Kristen Grauman,et al.  Fine-Grained Visual Comparisons with Local Learning , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[39]  B. S. Manjunath,et al.  Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[40]  Ronald R. Yager,et al.  On ordered weighted averaging aggregation operators in multicriteria decisionmaking , 1988, IEEE Trans. Syst. Man Cybern..

[41]  Ming-Syan Chen,et al.  MOSRO: Enabling Mobile Sensing for Real-Scene Objects with Grid Based Structured Output Learning , 2014, MMM.

[42]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[43]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Andrew Zisserman,et al.  Efficient Visual Search of Videos Cast as Text Retrieval , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Min-Chun Hu,et al.  Learning and Recognition of On-Premise Signs From Weakly Labeled Street View Images , 2014, IEEE Transactions on Image Processing.

[46]  Andrew Zisserman,et al.  A Boundary-Fragment-Model for Object Detection , 2006, ECCV.

[47]  Ming-Syan Chen,et al.  Recommendation for online social feeds by exploiting user response behavior , 2013, WWW '13 Companion.

[48]  Ricardo de Borobia Pires Gonçalves,et al.  Consumer Behavior: Product Characteristics and Quality Perception , 2008 .