论文信息 - Automatic image annotation using semi-supervised generative modeling

Automatic image annotation using semi-supervised generative modeling

Image annotation approaches need an annotated dataset to learn a model for the relation between images and words. Unfortunately, preparing a labeled dataset is highly time consuming and expensive. In this work, we describe the development of an annotation system in semi-supervised learning framework which by incorporating unlabeled images into training phase reduces the system demand to labeled images. Our approach constructs a generative model for each semantic class in two main steps. First, based on Gamma distribution, a generative model is constructed for each semantic class using labeled images in that class. The second step incorporates the unlabeled images by using a modified EM algorithm to update parameters of the constructed generative models. Performance evaluation of the proposed method on a standard dataset reveals that using unlabeled images will result in considerable improvement in accuracy of the annotation systems when a limited number of labeled images for each semantic class are available. We propose a modified EM algorithm to incorporate unlabeled images in training phase.Grouping images using spectral clustering improves prototypes and models of concepts.For noisy annotated images, semi-supervised mixture model outperforms graph learning.Incorporating unlabeled images will improve annotation performance significantly.

Mansour Jamzad | S. Hamid Amiri

[1] Jing Liu,et al. Image annotation via graph learning , 2009, Pattern Recognit..

[2] Tat-Seng Chua,et al. Image Annotation by Graph-Based Inference With Integrated Multiple/Single Instance Representations , 2010, IEEE Transactions on Multimedia.

[3] Daniel Gatica-Perez,et al. Modeling Semantic Aspects for Cross-Media Image Indexing , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] Peter J. Bickel,et al. The Earth Mover's distance is the Mallows distance: some insights from statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[5] James Ze Wang,et al. Real-time computerized annotation of pictures. , 2008, IEEE transactions on pattern analysis and machine intelligence.

[6] Hujun Bao,et al. Semi-supervised topic modeling for image annotation , 2009, MM '09.

[7] Mohan S. Kankanhalli,et al. Multimodal fusion for multimedia analysis: a survey , 2010, Multimedia Systems.

[8] Stanley Wasserman,et al. Social Network Analysis: Methods and Applications , 1994 .

[9] Yoshua Bengio,et al. Semi-supervised Learning by Entropy Minimization , 2004, CAP.

[10] Mansour Jamzad,et al. Large-scale image annotation using prototype-based models , 2011, 2011 7th International Symposium on Image and Signal Processing and Analysis (ISPA).

[11] Yuncai Liu,et al. Semi-Supervised Learning Model Based Efficient Image Annotation , 2009, IEEE Signal Processing Letters.

[12] Chris H. Q. Ding,et al. Image annotation using bi-relational graph of images and semantic labels , 2011, CVPR 2011.

[13] Yixin Chen,et al. MILES: Multiple-Instance Learning via Embedded Instance Selection , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Ying Liu,et al. A survey of content-based image retrieval with high-level semantics , 2007, Pattern Recognit..

[15] Xiaojin Zhu,et al. Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[16] Vladimir Pavlovic,et al. Baselines for Image Annotation , 2010, International Journal of Computer Vision.

[17] Ramesh C. Jain,et al. Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images , 2011, TIST.

[18] James Ze Wang,et al. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[19] Nenghai Yu,et al. Image Annotation in a Progressive Way , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[20] Hugo Jair Escalante,et al. The segmented and annotated IAPR TC-12 benchmark , 2010, Comput. Vis. Image Underst..

[21] James Ze Wang,et al. Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[22] Gustavo Carneiro,et al. Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23] Kristen Grauman,et al. What's it going to cost you?: Predicting effort vs. informativeness for multi-label image annotations , 2009, CVPR.

[24] Fabio Gagliardi Cozman,et al. Semi-Supervised Learning of Mixture Models , 2003, ICML.

[25] Dorin Comaniciu,et al. Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[26] Trevor Hastie,et al. The Elements of Statistical Learning , 2001 .

[27] Bernhard Schölkopf,et al. Learning with Local and Global Consistency , 2003, NIPS.

[28] Pietro Perona,et al. Self-Tuning Spectral Clustering , 2004, NIPS.

[29] B. S. Manjunath,et al. Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..