A Framework of Mobile Visual Search Based on the Weighted Matching of Dominant Descriptor

As a kind of interesting mobile application, Mobile Visual Search (MVS) has attracted extensive research efforts from both academy and industry. Most of the MVS systems adopt the client-server framework, in which transmission latency caused by the limited bandwidth in wireless network is a big problem. To address this problem, the state-of-the-art work focuses on designing low bit-rate descriptors for MVS. However, few work focuses on reducing the number of descriptors. To further reduce the latency, we propose a novel framework of MVS based on the weighted matching of dominant descriptor. Firstly, we present an affinity propagation based algorithm for dominant descriptor selection. Secondly, we propose a weighted feature matching method to consider the differences of dominant descriptors in feature matching. By the proposed framework, we not only reduce the network latency in MVS, but also avoid transmitting useless descriptors to improve the retrieval accuracy of MVS. The experimental results on Stanford MVS data set show that when using CHoG descriptors, the proposed framework outperforms the existing framework by reducing more than 40% of the amount of data transmission and increasing 5% of the average retrieval accuracy.

[1]  Bernd Girod,et al.  Mobile product recognition , 2010, ACM Multimedia.

[2]  Shih-Fu Chang,et al.  Mobile product search with Bag of Hash Bits and boundary reranking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Huizhong Chen,et al.  The stanford mobile visual search data set , 2011, MMSys.

[4]  Yan-Ying Chen,et al.  Enabling low bitrate mobile visual recognition: a performance versus bandwidth evaluation , 2013, MM '13.

[5]  Bernd Girod,et al.  Rate-efficient, real-time cd cover recognition on a camera-phone , 2008, ACM Multimedia.

[6]  Ke Gao,et al.  Geometric context-preserving progressive transmission in mobile visual search , 2012, ACM Multimedia.

[7]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[8]  Bernd Girod,et al.  Mobile Visual Search , 2011, IEEE Signal Processing Magazine.

[9]  Bernd Girod,et al.  Compressed Histogram of Gradients: A Low-Bitrate Descriptor , 2011, International Journal of Computer Vision.

[10]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[11]  Xin Chen,et al.  City-scale landmark identification on mobile devices , 2011, CVPR 2011.

[12]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..