Bayesian-Based Probabilistic Architecture for Image Categorization Using Macro- and Micro-Sense Visual Vocabulary

Visual vocabulary representation approach has been successfully applied to many multimedia and vision applications, including visual recognition, image retrieval, and scene modeling/categorization. The idea behind the visual vocabulary representation is that an image can be represented by visual words, a collection of local features of images. In this work, we will develop a new scheme for the construction of visual vocabulary based on the analysis of visual word contents. By considering the content homogeneity of visual words, we design a visual vocabulary which contains macro-sense and micro-sense visual words. The two types of visual words are appropriately further combined to describe an image effectively. We also apply the visual vocabulary to construct image categorization system. The performance evaluation for the system indicates that the proposed visual vocabulary achieves promising results.

[1]  N. H. C. Yung,et al.  Scene categorization via contextual visual words , 2010, Pattern Recognit..

[2]  Stefan Poslad,et al.  An Enhanced Bag-of-Visual Word Vector Space Model to Represent Visual Content in Athletics Images , 2012, IEEE Transactions on Multimedia.

[3]  Antonio Torralba,et al.  Describing Visual Scenes using Transformed Dirichlet Processes , 2005, NIPS.

[4]  Keqiu Li,et al.  Object-based image retrieval with kernel on adjacency matrix and local combined features , 2012, TOMCCAP.

[5]  Dimitrios Rafailidis,et al.  Similarity Search over the Cloud Based on Image Descriptors' Dimensions Value Cardinalities , 2015, ACM Trans. Multim. Comput. Commun. Appl..

[6]  Chong-Wah Ngo,et al.  Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study , 2010, IEEE Transactions on Multimedia.

[7]  Tao Mei,et al.  Contextual Bag-of-Words for Visual Categorization , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Tinne Tuytelaars,et al.  Towards a more discriminative and semantic visual vocabulary , 2011, Comput. Vis. Image Underst..

[9]  Zhe-Ming Lu,et al.  Image retrieval based on histograms of EOPs and VQ indices , 2016 .

[10]  Florent Perronnin,et al.  Universal and Adapted Vocabularies for Generic Visual Categorization , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Chaur-Heh Hsieh,et al.  Constructing a discriminative visual vocabulary with macro and micro sense of visual words , 2015, Multimedia Tools and Applications.

[12]  Jianying Hu,et al.  Extraction of perceptually important colors and similarity measurement for image matching, retrieval and analysis , 2002, IEEE Trans. Image Process..

[13]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Gang Hua,et al.  Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications , 2011, IEEE Transactions on Image Processing.

[15]  Wei-Han Chang,et al.  A fast MPEG-7 dominant color extraction with new similarity measure for image retrieval , 2008, J. Vis. Commun. Image Represent..

[16]  Sabine Süsstrunk,et al.  Eigenregions for image classification , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Ioannis Pratikakis,et al.  Bag of spatio-visual words for context inference in scene classification , 2013, Pattern Recognit..

[18]  Peter Wonka,et al.  RAID: A Relation-Augmented Image Descriptor , 2016, ACM Trans. Graph..

[19]  Dong Liu,et al.  Image retrieval with query-adaptive hashing , 2013, TOMCCAP.

[20]  Nenghai Yu,et al.  Semantics-Preserving Bag-of-Words Models and Applications , 2010, IEEE Transactions on Image Processing.

[21]  Qi Tian,et al.  Principal Visual Word Discovery for Automatic License Plate Detection , 2012, IEEE Transactions on Image Processing.

[22]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[23]  Lei Zhu,et al.  Theory of keyblock-based image retrieval , 2002, TOIS.

[24]  Alberto Del Bimbo,et al.  Image Tag Assignment, Refinement and Retrieval , 2015, ACM Multimedia.

[25]  Alberto Del Bimbo,et al.  Socializing the Semantic Gap , 2015, ACM Comput. Surv..