Tree-based Shape Descriptor for scalable logo detection

Detecting logos in real-world images is a great challenging task due to a variety of viewpoint or light condition changes and real-time requirements in practice. Conventional object detection methods, e.g., part-based model, may suffer from expensively computational cost if it was directly applied to this task. A promising alternative, triangle structural descriptor associated with matching strategy, offers an efficient way of recognizing logos. However, the descriptor fails to the rotation of logo images that often occurs when viewpoint changes. To overcome this shortcoming, we propose a new Tree-based Shape Descriptor (TSD) in this paper, which is strictly invariant to affine transformation in real-world images. The core of proposed descriptor is to encode the shape of logos by depicting both appearance and spatial information of four local key-points. In the training stage, an efficient algorithm is introduced to mine a discriminate subset of four tuples from all possible key-point combinations. Moreover, a root indexing scheme is designed to enable to detect multiple logos simultaneously. Extensive experiments on three benchmarks demonstrate the superiority of proposed approach over state-of-the-art methods.

[1]  Alberto Del Bimbo,et al.  Context-Dependent Logo Matching and Recognition , 2013, IEEE Transactions on Image Processing.

[2]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[3]  Marjorie Skubic,et al.  Evaluation of an inexpensive depth camera for in-home gait assessment , 2011, J. Ambient Intell. Smart Environ..

[4]  Gianfranco Burzio,et al.  Automotive Vision Systems , 1999 .

[5]  Robert T. Collins,et al.  Gait Shape Estimation for Identification , 2003, AVBPA.

[6]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Deepen Sinha,et al.  On the optimal choice of a wavelet for signal representation , 1992, IEEE Trans. Inf. Theory.

[8]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[9]  Tsuhan Chen,et al.  Image retrieval with geometry-preserving visual phrases , 2011, CVPR 2011.

[10]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[11]  Y Sheng,et al.  Wavelet transform as a bank of the matched filters. , 1992, Applied optics.

[12]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Ram Chakka,et al.  Symlet2 Based Wavelet Filter Bank for Uncorrelated Digital Signal Recovery: Some Advantages over Matched Filter , 2011 .

[14]  Yannis Avrithis,et al.  Scalable triangulation-based logo recognition , 2011, ICMR.

[15]  Jiri Matas,et al.  Geometric min-Hashing: Finding a (thick) needle in a haystack , 2009, CVPR.

[16]  Bir Bhanu,et al.  Physical models for moving shadow and object detection in video , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Puteh Saad,et al.  Object Detection using Circular Hough Transform , 2005 .

[18]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[19]  Sheng Tang,et al.  Logo detection based on spatial-spectral saliency and partial spatial context , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[20]  Rainer Lienhart,et al.  Scalable logo recognition in real-world images , 2011, ICMR.

[21]  Claudia Linnhoff-Popien,et al.  Gait Recognition with Kinect , 2012 .

[22]  Marjorie Skubic,et al.  Passive in-home measurement of stride-to-stride gait variability comparing vision and Kinect sensing , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[23]  J. Benedetto,et al.  The Theory of Multiresolution Analysis Frames and Applications to Filter Banks , 1998 .

[24]  Ryo Kurazume,et al.  Gait-Based Person Identification Robust to Changes in Appearance , 2013, Sensors.

[25]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[26]  A. Aldroubi,et al.  Families of multiresolution and wavelet spaces with optimal properties , 1993 .