Enhancing bag of visual words with color information for iconic image classification

Iconic images represent an abstract topic and use a presentation that is intuitively understood within a certain cultural context. For example, the abstract topic “global warming” may be represented by a polar bear standing alone on an ice floe. This paper presents a system for the classification of iconic images. It uses a variation of the Bag of Visual Words approach with enhanced feature descriptors. Our novel color pyramids feature incorporates color information into the classification scheme. It improves the average F1 measure of the classification by 0.118.
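The abstract does not spell out how the color pyramids feature is built, so the following is only a minimal sketch of the general idea: a Bag of Visual Words histogram concatenated with a color descriptor. It assumes the color feature is a spatial pyramid of per-cell RGB histograms. That construction, the function names (`bovw_histogram`, `color_pyramid`, `image_feature`), and the parameters `levels` and `bins` are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def bovw_histogram(descriptors, codebook):
    """Assign each local descriptor to its nearest visual word and count the words."""
    # Squared Euclidean distances, shape (n_descriptors, n_words).
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / max(hist.sum(), 1.0)  # L1-normalize

def color_pyramid(image, levels=2, bins=4):
    """Hypothetical color feature: RGB histograms over 1x1, 2x2, ... grids of cells."""
    h, w, _ = image.shape
    parts = []
    for level in range(levels):
        n = 2 ** level  # n x n cells at this pyramid level
        for i in range(n):
            for j in range(n):
                cell = image[i * h // n:(i + 1) * h // n,
                             j * w // n:(j + 1) * w // n]
                pixels = cell.reshape(-1, 3).astype(float)
                hist, _ = np.histogramdd(pixels, bins=bins, range=[(0, 256)] * 3)
                hist = hist.ravel()
                parts.append(hist / max(hist.sum(), 1.0))  # normalize per cell
    return np.concatenate(parts)

def image_feature(image, descriptors, codebook):
    """Concatenate the visual-word histogram with the assumed color feature."""
    return np.concatenate([bovw_histogram(descriptors, codebook),
                           color_pyramid(image)])

# Synthetic stand-ins for a real image, its local descriptors, and a learned codebook.
rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(16, 16, 3), dtype=np.uint8)
descriptors = rng.normal(size=(50, 16))
codebook = rng.normal(size=(8, 16))
feature = image_feature(image, descriptors, codebook)
```

In a full pipeline the codebook would typically come from clustering training descriptors, and the resulting vector would be passed to a classifier. Both steps are omitted here.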
