METU dataset: A big dataset for benchmarking trademark retrieval

Trademark retrieval (TR) is the problem of retrieving similar trademarks (logos) for a query, and the main aim is to detect copyright infringements in trademarks. Since there are millions of companies worldwide, automatically retrieving similar trademarks has become an important problem, and currently, checking trademark infringements is mostly performed manually by humans. However, although there have been many attempts for automated TR, as also acknowledged in the community, the problem is largely unsolved. One of the main reasons for that is the unavailability of a publicly available comprehensive dataset that includes the various challenges of the TR problem. In this article, we propose and introduce a large dataset composed of more than 930,000 trademarks, and evaluate the existing approaches in the literature on this dataset. We show that the existing methods are far from being useful in such a challenging dataset, and we hope that the dataset can facilitate the development of better methods to make progress in the performance of trademark retrieval systems.

[1]  Anil K. Jain,et al.  Shape-Based Retrieval: A Case Study With Trademark Image Databases , 1998, Pattern Recognit..

[2]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[3]  Babu M. Mehtre,et al.  STAR-A System for Trademark Archival and Retrieval , 1995 .

[4]  Lei Zhang,et al.  A CBIR method based on color-spatial feature , 1999, Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030).

[5]  Alberto Del Bimbo,et al.  Context-Dependent Logo Matching and Recognition , 2013, IEEE Transactions on Image Processing.

[6]  Dimitrios Androutsos,et al.  Content-based retrieval of logo and trademarks in unconstrained color image databases using Color Edge Gradient Co-occurrence Histograms , 2010, Comput. Vis. Image Underst..

[7]  Josep Lladós,et al.  Efficient logo retrieval through hashing shape context descriptors , 2010, DAS '10.

[8]  Chang-Tsun Li,et al.  Trademark image retrieval using synthetic features for describing global shape and interior structure , 2009, Pattern Recognit..

[9]  Matti Pietikäinen,et al.  A Generalized Local Binary Pattern Operator for Multiresolution Gray Scale and Rotation Invariant Texture Classification , 2001, ICAPR.

[10]  Toshikazu Kato,et al.  Database architecture for content-based image retrieval , 1992, Electronic Imaging.

[11]  Olivier Buisson,et al.  Logo retrieval with a contrario visual query expansion , 2009, ACM Multimedia.

[12]  Horst Bischof,et al.  Planar Trademark and Logo Retrieval , 2013 .

[13]  Rainer Lienhart,et al.  Scalable logo recognition in real-world images , 2011, ICMR.

[14]  Yannis Avrithis,et al.  Scalable triangulation-based logo recognition , 2011, ICMR.

[15]  Anil K. Jain,et al.  Image retrieval using color and shape , 1996, Pattern Recognit..

[16]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[17]  Josep Lladós,et al.  Interactive Trademark Image Retrieval by Fusing Semantic and Visual Content , 2011, ECIR.

[18]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[19]  David Doermann,et al.  Handbook of Document Image Processing and Recognition , 2014, Springer London.