An ontological bagging approach for image classification of crowdsourced data

In this paper, we study how to use semantic relationships for image classification in order to improve the classification accuracy. We achieve the goal by imitating the human visual system which classifies categories from coarse to fine grains based on different visual features. We propose an ontological bagging algorithm where most discriminative weak attributes are automatically learned for different semantic levels by multiple instance learning and the bagging idea is applied to reduce the error propagations of hierarchical classifiers. We also leverage ontological knowledge to augment crowdsourcing annotations (e.g., a hatchback is also a vehicle) in order to train hierarchical classifiers. Our method is tested on a vehicle dataset from the popular crowdsourcing dataset ImageNet. Experimental results show that our method not only achieves state-of-the-art results but also identifies semantically meaningful visual features.

[1]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Cordelia Schmid,et al.  Semantic Hierarchies for Visual Object Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Fei-Fei Li,et al.  Combining randomization and discrimination for fine-grained image categorization , 2011, CVPR 2011.

[4]  Charu C. Aggarwal,et al.  Towards semantic knowledge propagation from text corpus to web images , 2011, WWW.

[5]  Zhuowen Tu,et al.  Max-Margin Multiple-Instance Dictionary Learning , 2013, ICML.

[6]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8]  Charles A. Collin,et al.  Subordinate-level categorization relies on high spatial frequencies to a greater degree than basic-level categorization , 2005, Perception & psychophysics.

[9]  Kristen Grauman,et al.  Learning a Tree of Metrics with Disjoint Visual Features , 2011, NIPS.

[10]  Hao Su,et al.  Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.

[11]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[12]  Qi Tian,et al.  Multimedia LEGO: Learning Structured Model by Probabilistic Logic Ontology Tree , 2013, 2013 IEEE 13th International Conference on Data Mining.

[13]  Pietro Perona,et al.  Learning and using taxonomies for fast visual categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Bin Fan,et al.  Local Intensity Order Pattern for feature description , 2011, 2011 International Conference on Computer Vision.