Generic object classifiers based on real image selection from the web

In this paper we present our semi-supervised technique for building object category classifiers using real image data from the Internet. Our technique not only reduces the overhead of manual training by humans, but also achieves robust classifiers that can be evaluated in real time. Given a sample object and its name (keyword), we collect a large amount of object-related images from two main image sources: Google Images and the LabelMe website. We deal with the problem of separating good training samples from noisy images by performing two steps: similar image selection and non-real image filtering. We use a variant of Gaussian Discriminant Analysis (GDA) to filter out non-real images (drawings, cartoons, etc.) that tentatively affect classifier performance in real environments. In order to select true object-related training samples, we introduce a Simile Selector Classifier (SSC) that is constructed from a small set of images taken from the sample object. The SSC not only is able to select similar samples from the large unordered set of images, but also it can separate desired object category images from other categories that have the same name (polysemes), i.e. “apple” as a fruit, or as a company logo. Finally, the experiments which we performed in real environments demonstrate the performance of our object classifiers.

[1]  Kristen Grauman,et al.  Keywords to visual categories: Multiple-instance learning forweakly supervised object categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Pietro Perona,et al.  Learning object categories from Google's image search , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[3]  B. Schiele,et al.  Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[4]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[5]  Fei-Fei Li,et al.  OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Peter Auer,et al.  Weak Hypotheses and Boosting for Generic Object Detection and Recognition , 2004, ECCV.

[7]  Dan Roth,et al.  Learning to detect objects in images via a sparse, part-based representation , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Horst Bunke,et al.  Off-Line, Handwritten Numeral Recognition by Perturbation Method , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Robert C. Holte,et al.  C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling , 2003 .

[10]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.