Visual Object Categorization Based on Orientation Descriptor

The demand of new fast technology and image investigation in many applications has made managing visual object categorization techniques extremely important. The main problem of visual object categorization is the semantic gap (categorization problem). Currently, several researches show that using a texture feature could improve the categorization problem especially when using orientation descriptors. Mainly, in this research the edge histogram descriptor has been selected to extract the texture feature. Obviously, the main demerit of using this kind of texture descriptor is it uses single orientation to extract the texture feature. Therefore, the Gabor filter has been proposed to improve the performance of this descriptor by constructing different feature maps based on different scale and orientation. To demonstrate the performance of the proposed method, the first 20 classes of the Caltech 101 dataset have been used. Moreover, we compared the performance recognition of the proposed method in two different domains, namely spatial and frequency domains. Finally, the result shows that the proposed method in the spatial domain outperforms the proposed method in the frequency domain. This is because of losing some of the basic raw data though using Fast Fourier Transform algorithm in converting the system to the frequency domain.

[1]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[2]  W. Sardha Wijesoma,et al.  Fast Vanishing-Point Detection in Unstructured Environments , 2012, IEEE Transactions on Image Processing.

[3]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[4]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[5]  Ying Liu,et al.  A survey of content-based image retrieval with high-level semantics , 2007, Pattern Recognit..

[6]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  A. Abdullah,et al.  CIREC : Cluster Correlogram Image Retrieval and Categorization using MPEG-7 Descriptors , 2007, 2007 IEEE Symposium on Computational Intelligence in Image and Signal Processing.

[8]  Peng Un Mak,et al.  A New Enhanced Nearest Feature Space (ENFS) Classifier for Gabor Wavelets Features-Based Face Recognition , 2004, ICBA.

[9]  Wan-Chi Siu,et al.  Multimedia Information Retrieval and Management: Technological Fundamentals and Applications , 2010 .

[10]  Alan R. Jones,et al.  Fast Fourier Transform , 1970, SIGP.

[11]  A. bin Abdullah,et al.  Supervised learning algorithms for visual object categorization , 2010 .

[12]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Muhammad Ikram,et al.  Image Retrieval in Multimedia Databases: A Survey , 2009, 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.