An Approach of Text-Based and Image-Based Multi-modal Search for Online Shopping

Nowadays, more and more people prefer online shopping to physical store shopping for its convenience, cheapness and timesaving. Customers visit some commercial shopping websites, and select their favorite commodities by accessing links or retrieving by search box. However, in our real life, most online shopping websites provide a simple and single text retrieval method only, to some extent it’s difficult for customers to submit query and retrieve satisfactory results. In this paper, a multi-modal search approach combining text-based and image-based search techniques is presented. Besides text search, a two-stage image search approach is proposed, which utilizes basic features consisting of color and textural features to filter mismatching images in first stage, and further uses SIFT features for accurate search in second stage. Moreover, a prototype system has been developed for multi-modal search on online shopping websites. By submitting some words, phrases, images or their combination, customers can search out what they want. The experiments compared with traditional algorithms based on single visual feature validate that our approach and multi-modal search prototype system are effective, and the retrieval results can satisfy customers’ requirements well for online shopping.

[1]  F. Parmiggiani,et al.  An investigation of the textural characteristics associated with gray level cooccurrence matrix statistical parameters , 1995, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[3]  Amarnath Gupta,et al.  Virage image search engine: an open framework for image management , 1996, Electronic Imaging.

[4]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[5]  Ying Liu,et al.  A survey of content-based image retrieval with high-level semantics , 2007, Pattern Recognit..

[6]  David G. Lowe,et al.  Scene modelling, recognition and tracking with invariant image features , 2004, Third IEEE and ACM International Symposium on Mixed and Augmented Reality.

[7]  Kannan Ramchandran,et al.  Multimedia Analysis and Retrieval System (MARS) Project , 1996, Data Processing Clinic.

[8]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[9]  Louise E. Moser,et al.  A Personal Handheld Multi-Modal Shopping Assistant , 2006, International conference on Networking and Services (ICNS'06).

[10]  Anil K. Jain,et al.  Shape-Based Retrieval: A Case Study With Trademark Image Databases , 1998, Pattern Recognit..

[11]  Josef Kittler,et al.  Defect detection in random colour textures , 1996, Image Vis. Comput..

[12]  Helen Couclelis,et al.  Pizza over the Internet: e-commerce, the fragmentation of activity and the tyranny of the region , 2004 .

[13]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[14]  Jitendra Malik,et al.  Contour and Texture Analysis for Image Segmentation , 2001, International Journal of Computer Vision.

[15]  Michael R. Ward,et al.  Consumer acquisition of product information and subsequent purchase channel decisions , 2002, The Economics of the Internet and E-commerce.

[16]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .