Semantic analysis of real-world images using support vector machine

Digital cameras and thus digital images are now ubiquitous. How to efficiently manage a large amount of images has become important. The semantic analysis of images is an important issue in multimedia processing. Region-based image retrieval systems attempt to reduce the gap between high-level semantics and low-level features by representing images at the object level. Recently, the support vector machine (SVM) has been proposed to solve the classification problem. It can generate a hyperplane to separate two sets of features and provides good generalization performance. In this paper, we propose a novel method which integrates principal component analysis (PCA) and SVM neural networks for analyzing the semantic content of natural images, in which principal component analysis (PCA) is applied to reduce the dimension of features. Experimental results show that the proposed method is capable of analyzing the components of photographs into semantic categories with high accuracy, resulting in photographic analysis that is similar to human perception. The performance of the proposed method is better than that of the traditional radial basis function (RBF) neural network.

[1]  Jitendra Malik,et al.  Blobworld: A System for Region-Based Image Indexing and Retrieval , 1999, VISUAL.

[2]  Anil K. Jain,et al.  On image classification: city vs. landscape , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[3]  Jacek M. Zurada,et al.  RBF-based neurodynamic nearest neighbor classification in real pattern space , 2006, Pattern Recognit..

[4]  Yue-Shi Lee,et al.  Robust and efficient multiclass SVM models for phrase pattern recognition , 2008, Pattern Recognit..

[5]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Proceedings of International Conference on Image Processing.

[6]  Myung-Mook Han,et al.  Learning similarity for semantic images classification , 2005, Neurocomputing.

[7]  Jamshid Shanbehzadeh,et al.  Image retrieval based on shape similarity by edge orientation autocorrelogram , 2003, Pattern Recognit..

[8]  B. S. Manjunath,et al.  Unsupervised Segmentation of Color-Texture Regions in Images and Video , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[10]  Thomas S. Huang,et al.  Supporting content-based queries over images in MARS , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[11]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Patrick Haffner,et al.  Support vector machines for histogram-based image classification , 1999, IEEE Trans. Neural Networks.

[13]  Anil K. Jain,et al.  Image classification for content-based indexing , 2001, IEEE Trans. Image Process..

[14]  Youji Iiguni,et al.  Image interpolation for progressive transmission by using radial basis function networks , 1999, IEEE Trans. Neural Networks.

[15]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Multimedia Systems.

[16]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[17]  K. Wakimoto,et al.  Efficient and Effective Querying by Image Content , 1994 .

[18]  Vicki Bruce,et al.  Face Recognition: From Theory to Applications , 1999 .

[19]  Jesse S. Jin,et al.  Combining intra-image and inter-class semantics for consumer image retrieval , 2005, Pattern Recognit..

[20]  Lei Wang,et al.  AdaBoost with SVM-based component classifiers , 2008, Eng. Appl. Artif. Intell..

[21]  Dianhui Wang,et al.  Semantics modeling based image retrieval system using neural networks , 2005, IEEE International Conference on Image Processing 2005.

[22]  Yixin Chen,et al.  CLUE: cluster-based retrieval of images by unsupervised learning , 2005, IEEE Transactions on Image Processing.

[23]  E. K. WONG,et al.  Model matching in robot vision by subgraph isomorphism , 1992, Pattern Recognit..

[24]  Stephen W. Smoliar,et al.  Video parsing, retrieval and browsing: an integrated and content-based solution , 1997, MULTIMEDIA '95.

[25]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[27]  Andrey Temko,et al.  Classification of acoustic events using SVM-based clustering schemes , 2006, Pattern Recognit..

[28]  Jae Won Lee,et al.  Content-based image classification using a neural network , 2004, Pattern Recognit. Lett..

[29]  Masafumi Hagiwara,et al.  An image retrieval system by impression words and specific object names - IRIS , 2002, Neurocomputing.

[30]  Amrane Houacine,et al.  Principal component analysis of multispectral images using neural network , 2001, Proceedings ACS/IEEE International Conference on Computer Systems and Applications.

[31]  David E. Booth,et al.  Multivariate statistical inference and applications , 1997 .

[32]  John Tait,et al.  Image classification using hybrid neural networks , 2003, SIGIR.

[33]  Chih-Fong Tsai,et al.  Image mining by spectral features: A case study of scenery image classification , 2007, Expert Syst. Appl..

[34]  Rama Chellappa,et al.  Principal components null space analysis for image and video classification , 2006, IEEE Transactions on Image Processing.

[35]  Veerendra Singh,et al.  Application of image processing and radial basis neural network techniques for ore sorting and ore classification , 2005 .

[36]  Jung-Hsien Chiang,et al.  Hierarchically SVM classification based on support vector clustering method and its application to document categorization , 2007, Expert Syst. Appl..

[37]  Shuyuan Yang,et al.  Ridgelet kernel regression , 2007, Neurocomputing.

[38]  Ying Liu,et al.  Region-Based Image Retrieval with High-Level Semantic Color Names , 2005, 11th International Multimedia Modelling Conference.