Towards fully un-supervised methods for generating object detection classifiers using social data

In this work a framework for constructing object detection classifiers using weakly annotated social data is proposed. Social information is combined with computer vision techniques to automatically obtain a set of images annotated at region-detail. All assumptions made to automate the proposed framework are driven by the reasonable expectation that due to the collaborative aspect of social data, linguistic descriptions and visual representations will start to converge on common concepts, as the scale of the analyzed dataset increases. Comparison tests performed againstmanually trained object detectors showed that comparable performance can be achieved.

[1]  Yiannis Kompatsiaris,et al.  SEMSOC: SEMantic, SOcial and Content-Based Clustering in Multimedia Collaborative Tagging Systems , 2008, 2008 IEEE International Conference on Semantic Computing.

[2]  David A. Forsyth,et al.  Learning the semantics of words and pictures , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[3]  Bill Triggs,et al.  Region Classification with Markov Field Aspect Models , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Michael G. Strintzis,et al.  Still Image Segmentation Tools For Object-Based Multimedia Applications , 2004, Int. J. Pattern Recognit. Artif. Intell..

[5]  B. S. Manjunath,et al.  Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[6]  David A. Forsyth,et al.  Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[7]  Gustavo Carneiro,et al.  Weakly Supervised Top-down Image Segmentation , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Pietro Perona,et al.  Learning object categories from Google's image search , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.