Object Templates for Visual Place Categorization

The Visual Place Categorization (VPC) problem refers to the categorization of the semantic category of a place using only visual information collected from an autonomous robot. Previous works on this problem only made use of the global configurations observation, such as the Bag-of-Words model and spatial pyramid matching. In this paper, we present a novel system solving the problem utilizing both global configurations observation and local objects information. To be specific, we propose a local objects classifier that can automatically and effectively select key local objects of a semantic category from randomly sampled patches by the structural similarity support vector machine; and further classify the test frames with the Local Naive Bayes Nearest Neighbors algorithm. We also improve the global configurations observation with histogram intersection codebook and a noisy codewords removal mechanism. The temporal smoothness of the classification results is ensured by employing a Bayesian filtering framework. Empirically, our system outperforms state-of-the-art methods on two large scale and difficult datasets, demonstrating the superiority of the system.

[1]  Jongwoo Lim,et al.  Visual place categorization in maps , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  John K. Tsotsos,et al.  Histogram of Oriented Uniform Patterns for robust place recognition and categorization , 2012, Int. J. Robotics Res..

[3]  Barbara Caputo,et al.  Towards robust place recognition for robot localization , 2008, 2008 IEEE International Conference on Robotics and Automation.

[4]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[5]  James M. Rehg,et al.  Visual Place Categorization: Problem, dataset, and algorithm , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7]  James J. Little,et al.  Place Classification Using Visual Object Categorization and Global Information , 2011, 2011 Canadian Conference on Computer and Robot Vision.

[8]  Jianxin Wu,et al.  Power mean SVM for large scale visual classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  James M. Rehg,et al.  Efficient and Effective Visual Codebook Generation Using Additive Kernels , 2011, J. Mach. Learn. Res..

[10]  Barbara Caputo,et al.  Visual Servoing to Help Camera Operators Track Better , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  David G. Lowe,et al.  Local Naive Bayes Nearest Neighbor for image classification , 2011, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Antonio Torralba,et al.  Recognizing indoor scenes , 2009, CVPR.

[13]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[14]  Barbara Caputo,et al.  Multi-modal Semantic Place Classification , 2010, Int. J. Robotics Res..

[15]  Ananth Ranganathan,et al.  PLISS: Detecting and Labeling Places Using Online Change-Point Detection , 2010, Robotics: Science and Systems.

[16]  Jianxin Wu,et al.  Balance Support Vector Machines Locally Using the Structural Similarity Kernel , 2011, PAKDD.

[17]  Hongbin Zha,et al.  Computer Vision - ACCV 2009, 9th Asian Conference on Computer Vision, Xi'an, China, September 23-27, 2009, Revised Selected Papers, Part III , 2010, Asian Conference on Computer Vision.

[18]  Antonio Torralba,et al.  Context-based vision system for place and object recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[19]  Svetlana Lazebnik,et al.  Scene recognition and weakly supervised object localization with deformable part-based models , 2011, 2011 International Conference on Computer Vision.

[20]  Tal Hassner,et al.  Similarity Scores Based on Background Samples , 2009, ACCV.

[21]  Ananth Ranganathan PLISS: labeling places using online changepoint detection , 2012, Auton. Robots.

[22]  Eli Shechtman,et al.  In defense of Nearest-Neighbor based image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  B. Caputo,et al.  Cold: the Cosy Localization Database Cold: the Cosy Localization Database , 2009 .

[24]  James M. Rehg,et al.  CENTRIST: A Visual Descriptor for Scene Categorization , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Barbara Caputo,et al.  Confidence-based cue integration for visual place recognition , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26]  Patric Jensfelt,et al.  Hierarchical Multi-Modal Place Categorization , 2011, ECMR.