论文信息 - Elliptical ASIFT Agglomeration in Class Prototype for Logo Detection

Elliptical ASIFT Agglomeration in Class Prototype for Logo Detection

Logo (graphic entity that contains colors, shapes, textures and identifies organizations, goods, etc.) localization and recognition is a subproblem of object detection and recognition and a challenging pattern recognition task. Applications are in the automotive industry, sports transmissions, legal or feedback for advertising. Logos in natural images are approached within retrieval systems [1] [6], [4], etc. or by integrated detection (localization and recognition) [2], [7], etc. Our contribution falls in the second category and consists in: (1) a new class prototyping method based on a central image extracted by analyzing the homographies graph and re-projecting the relevant keypoints on that image (2) a logo detection system that exhibits great performance. The main conceptual difference to previous systems is that they manually branched their process to deal with corner-cases, while we perform the branching automatically, proposing a compact and self-adjusting system. Class Description. In the training phase (detailed in Fig. 1) we construct models (prototypes) to describe classes. In the testing phase (Fig. 2) the query image is compared with class prototypes and if they are enough similar we count a detection. Feature extraction. The logo images are described by the Affine Difference of Gausssians (ADoG) [5] followed by the description with oriented SIFT elliptical local features. ADoG provides more keypoints on logos than other choices, while elliptical features are able to provide correctly the orientation even for circular logos. Class Graph. All the logo crops from the same class are grouped in a weighted graph: the nodes are the logos, while an edge is created if a homography is found between that pair of logos-nodes. The edge has an weight equal to the inverse of the number of keypoints pairs matched. The homography between two logos is found with the direct linear transformation (DLT) and 4 keypoint pairs are needed for this determination. The matching between the reference logo and the subject logo is found with RANSAC. Yet, to provide the best match, one needs to iterate more subsets than usual [3]. Next, the quality of the homographic fit is evaluated using an error map built as the Hellinger distance between the reference logo and the back-projected logo described with Dense SIFT. Given the class graph, the central image is the node with the most connections. Class Model. The class model is built by agglomerating onto the central image the suitable keypoints and their SIFT description. This information is taken from all the logo images from the main cluster of the class graph, by projecting them on the plane of the central image. Keypoints in images directly connected to the central image are backprojected (by inverting the matching homography) on the central one. The equivalent homography between images that are not directly connected to the central image is determined by composing the homographies placed on the path between that image and the central one. The chosen path is the one that ensures minimum cumulative weight. The corresponding SIFT Figure 2: The schematic of the system used to locate and classify a logo in a testing image.

[1] Andrea Vedaldi,et al. Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[2] Jonathon S. Hare,et al. Efficient clustering and quantisation of SIFT features: exploiting characteristics of the SIFT descriptor and interest region detectors under image inversion , 2011, ICMR '11.

[3] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[4] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[5] Wei Liang,et al. Individualized matching based on logo density for scalable logo recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6] Robert E. Tarjan,et al. Depth-First Search and Linear Graph Algorithms , 1972, SIAM J. Comput..

[7] Florent Perronnin,et al. Instance classification with prototype selection , 2014, ICMR.

[8] Alberto Del Bimbo,et al. Trademark matching and retrieval in sports video databases , 2007, MIR '07.

[9] Matthew A. Brown,et al. Automatic Panoramic Image Stitching using Invariant Features , 2007, International Journal of Computer Vision.

[10] David H. Reiley,et al. Measuring the Effects of Advertising: The Digital Frontier , 2013 .

[11] John C. Hart,et al. Accelerating arrays of linear classifiers using approximate range queries , 2014, IEEE Winter Conference on Applications of Computer Vision.

[12] Corneliu Florea,et al. Homographic Class Template for Logo Localization and Recognition , 2015, IbPRIA.

[13] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[14] Fan Yang,et al. Feature Fusion by Similarity Regression for Logo Retrieval , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[15] Rainer Lienhart,et al. Automatic object annotation from weakly labeled data with latent structured SVM , 2014, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI).

[16] Yannis Avrithis,et al. Scalable triangulation-based logo recognition , 2011, ICMR.

[17] Lizhuang Ma,et al. A new framework for feature descriptor based on SIFT , 2009, Pattern Recognit. Lett..

[18] Shaozi Li,et al. Logo detection with extendibility and discrimination , 2013, Multimedia Tools and Applications.

[19] Alberto Del Bimbo,et al. Context-Dependent Logo Matching and Recognition , 2013, IEEE Transactions on Image Processing.

[20] Rainer Lienhart,et al. Bundle min-hashing for logo recognition , 2013, ICMR '13.

[21] Olivier Buisson,et al. Logo retrieval with a contrario visual query expansion , 2009, ACM Multimedia.

[22] Xing Xie,et al. Spatial pyramid mining for logo detection in natural scenes , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[23] Rainer Lienhart,et al. Scalable logo recognition in real-world images , 2011, ICMR.

[24] Jean-Michel Morel,et al. ASIFT: A New Framework for Fully Affine Invariant Image Comparison , 2009, SIAM J. Imaging Sci..

[25] Eleftherios Kayafas,et al. Vehicle Logo Recognition Using a SIFT-Based Enhanced Matching Scheme , 2010, IEEE Transactions on Intelligent Transportation Systems.

[26] Cordelia Schmid,et al. Correlation-based burstiness for logo retrieval , 2012, ACM Multimedia.