论文信息 - Image Annotation by Incorporating Word Correlations into Multi-class SVM

Image Annotation by Incorporating Word Correlations into Multi-class SVM

Image annotation systems aim at automatically annotating images with some predefined keywords. In this paper, we propose an automatic image annotation approach by incorporating word correlations into multi-class Support Vector Machine (SVM). At first, each image is segmented into five fixed-size blocks or tiles and MPEG-7 visual descriptors are applied to represent color and texture features of blocks. Keywords are manually assigned to every block of training images. Then, multi-class SVM classifier is trained for semantic concepts. Word or concept correlations are computed by a co-occurrence matrix. The probability outputs from SVM and word correlations are combined to obtain the final results. The minimal-redundancy-maximum-relevance (mRMR) method is used to reduce feature dimensions. The experiments on Corel 5000 dataset demonstrate our approach is effective and efficient.

Lei Zhang | Jun Ma

[1] Huan Liu,et al. Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution , 2003, ICML.

[2] Igor Kononenko,et al. Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[3] B. S. Manjunath,et al. Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[4] David A. Forsyth,et al. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[5] Michael I. Jordan,et al. Modeling annotated data , 2003, SIGIR.

[6] Chih-Jen Lin,et al. Probability Estimates for Multi-class Classification by Pairwise Coupling , 2003, J. Mach. Learn. Res..

[7] Fuhui Long,et al. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.

[9] Clement H. C. Leung,et al. Automatic Semantic Annotation of Real-World Web Images , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Yixin Chen,et al. Image Categorization by Learning and Reasoning with Regions , 2004, J. Mach. Learn. Res..

[11] Ian H. Witten,et al. Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[12] Gustavo Carneiro,et al. Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13] Jianping Fan,et al. Automatic image annotation by using concept-sensitive salient objects for image content representation , 2004, SIGIR '04.

[14] James Ze Wang,et al. Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[15] James Ze Wang,et al. SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[16] Chih-Jen Lin,et al. Combining SVMs with Various Feature Selection Strategies , 2006, Feature Extraction.

[17] Xiaojun Qi,et al. Incorporating multiple SVMs for automatic image annotation , 2007, Pattern Recognit..

[18] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[19] Edward Y. Chang,et al. Using one-class and two-class SVMs for multiclass image annotation , 2005, IEEE Transactions on Knowledge and Data Engineering.

[20] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[21] Horst M. Eidenberger,et al. How good are the visual MPEG-7 features? , 2003, Visual Communications and Image Processing.

[22] Mark A. Hall,et al. Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[23] Nuno Vasconcelos,et al. Bridging the Gap: Query by Semantic Example , 2007, IEEE Transactions on Multimedia.

[24] Raimondo Schettini,et al. Image annotation using SVM , 2003, IS&T/SPIE Electronic Imaging.

[25] Edward Y. Chang,et al. CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines , 2003, IEEE Trans. Circuits Syst. Video Technol..

[26] R. Manmatha,et al. Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[27] John Tait,et al. CLAIRE: A modular support vector image indexing and classification system , 2006, TOIS.

[28] Daniel Gatica-Perez,et al. On image auto-annotation with latent space models , 2003, ACM Multimedia.

[29] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[30] Chih-Jen Lin,et al. A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[31] Bipin C. Desai,et al. A Feature Level Fusion in Similarity Matching to Content-Based Image Retrieval , 2006, 2006 9th International Conference on Information Fusion.

[32] Qi Zhang,et al. Automatic image annotation by an iterative approach: incorporating keyword correlations and region matching , 2007, CIVR '07.

[33] Jiayu Tang,et al. A Study of Quality Issues for Image Auto-Annotation With the Corel Dataset , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[34] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[35] Markus A. Stricker,et al. Spectral covariance and fuzzy regions for image indexing , 1997, Machine Vision and Applications.