Kernel Visual Keyword Description for Object and Place Recognition

The most important aspects in computer and mobile robotics are both visual object and place recognition; they have been used to tackle numerous applications via different techniques as established previously in the literature, however, combining the machine learning techniques for learning objects to obtain best possible recognition and as well as to obtain its image descriptors for describing the content of the image fully is considered as another vital way which can be used in computer vision. Thus, the ability of the system is to learn and describe the structural features of objects or places more effectively, which in turn; it leads to a correct recognition of objects. This paper introduces a method that uses Naive Base to combine the Kernel Principle Component (KPCA) features with HOG features from the visual scene. According to this approach, a set of SURF features and Histogram of Gradient (HOG) are extracted from a given image. The minimum Euclidean Distance between all SURF features is computed from the visual codebook which was constructed by K-means previously to be combined with HOG features. A classification method such as Support Vector Machine (SVM) was used for data analysis and the results indicate that KPCA with HOG method significantly outperforms bag of visual keyword (BOW) approach on Caltech-101 object dataset and IDOL visual place dataset.

[1]  Chong-Wah Ngo,et al.  Towards optimal bag-of-features for object categorization and semantic video retrieval , 2007, CIVR '07.

[2]  Wenyu Chen,et al.  Method of item recognition based on SIFT and SURF , 2014, Math. Struct. Comput. Sci..

[3]  Santanu Kumar Rath,et al.  Protein superfamily classification using Kernel Principal Component Analysis and Probabilistic Neural Networks , 2011, 2011 Annual IEEE India Conference.

[4]  Binoy Pinto,et al.  Speeded Up Robust Features , 2011 .

[5]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6]  Gregory Dudek,et al.  Learning Landmarks for Robot Localization , 2000, AAAI/IAAI.

[7]  Horst Bischof,et al.  Mobile robot localization under varying illumination , 2002, Object recognition supported by user interaction for service robots.

[8]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[9]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[10]  Haryati Jaafar,et al.  Robust palm print verification system based on evolution of kernel principal component analysis , 2014, 2014 IEEE International Conference on Control System, Computing and Engineering (ICCSCE 2014).

[11]  Gabriela Csurka,et al.  Adapted Vocabularies for Generic Visual Categorization , 2006, ECCV.

[12]  Barbara Caputo,et al.  A realistic benchmark for visual indoor place recognition , 2010, Robotics Auton. Syst..

[13]  G. Baudat,et al.  Generalized Discriminant Analysis Using a Kernel Approach , 2000, Neural Computation.

[14]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[15]  Parinya Sanguansat,et al.  An improved feature extraction and combination of multiple classifiers for query-by-humming , 2014, Int. Arab J. Inf. Technol..

[16]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  Ales Leonardis,et al.  Mobile robot localization using an incremental eigenspace model , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[18]  Xiaoyi Ma,et al.  An Improved KPCA/GA-SVM Classication Model for Plant Leaf Disease Recognition ? , 2012 .

[19]  W. Förstner,et al.  A Metric for Covariance Matrices , 2003 .

[20]  Ben J. A. Kröse,et al.  Probabilistic localization by appearance models and active vision , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[21]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[22]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[23]  Michael Isard,et al.  Lost in quantization: Improving particular object retrieval in large scale image databases , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Ronald M. Summers,et al.  Multilevel image recognition using discriminative patches and kernel covariance descriptor , 2014, Medical Imaging.

[25]  Tapio Seppänen,et al.  ECG-derived respiration methods: adapted ICA and PCA. , 2015, Medical engineering & physics.

[26]  Xiang Li,et al.  Face Recognition Using KPCA and KFDA , 2013 .

[27]  Siti Anom Ahmad,et al.  Performance evaluation of feature detection and feature matching for stereo visual odometry using SIFT and SURF , 2014, 2014 IEEE REGION 10 SYMPOSIUM.