Online context-based object recognition for mobile robots

This work proposes a robotic object recognition system that takes advantage of the contextual information latent in human-like environments in an online fashion. To fully leverage context, it is needed perceptual information from (at least) a portion of the scene containing the objects of interest, which could not be entirely covered by just an one-shot sensor observation. Information from a larger portion of the scenario could still be considered by progressively registering observations, but this approach experiences difficulties under some circumstances, e.g. limited and heavily demanded computational resources, dynamic environments, etc. Instead of this, the proposed recognition system relies on an anchoring process for the fast registration and propagation of objects' features and locations beyond the current sensor frustum. In this way, the system builds a graph-based world model containing the objects in the scenario (both in the current and previously perceived shots), which is exploited by a Probabilistic Graphical Model (PGM) in order to leverage contextual information during recognition. We also propose a novel way to include the outcome of local object recognition methods in the PGM, which results in a decrease in the usually high CRF learning complexity. A demonstration of our proposal has been conducted employing a dataset captured by a mobile robot from restaurant-like settings, showing promising results.

[1]  Daniel Huber,et al.  Using Context to Create Semantic 3D Models of Indoor Environments , 2010, BMVC.

[2]  Serge J. Belongie,et al.  Context based object categorization: A critical survey , 2010, Comput. Vis. Image Underst..

[3]  Bastian Leibe,et al.  Joint 2D-3D temporally consistent semantic segmentation of street scenes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  José-Raúl Ruiz-Sarmiento,et al.  A survey on learning approaches for Undirected Graphical Models. Application to scene object recognition , 2017, Int. J. Approx. Reason..

[5]  Thorsten Joachims,et al.  Contextually guided semantic labeling and search for three-dimensional point clouds , 2013, Int. J. Robotics Res..

[6]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[7]  José-Raúl Ruiz-Sarmiento,et al.  Scene object recognition for mobile robots through Semantic Knowledge and Probabilistic Graphical Models , 2015, Expert Syst. Appl..

[8]  Hema Swetha Koppula,et al.  Learning Spatio-Temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation , 2013, ICML.

[9]  Jos Elfring,et al.  Semantic world modeling using probabilistic multiple hypothesis anchoring , 2013, Robotics Auton. Syst..

[10]  Dieter Fox,et al.  RGB-(D) scene labeling: Features and algorithms , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Nico Blodow,et al.  Perception and probabilistic anchoring for dynamic world state logging , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[12]  Luc Van Gool,et al.  On-line semantic perception using uncertainty , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Antonios Gasteratos,et al.  Semantic mapping for mobile robotics tasks: A survey , 2015, Robotics Auton. Syst..

[14]  Matei T. Ciocarlie,et al.  Towards Reliable Grasping and Manipulation in Household Environments , 2010, ISER.

[15]  José-Raúl Ruiz-Sarmiento,et al.  Building Multiversal Semantic Maps for Mobile Robot Operation , 2017, Knowl. Based Syst..

[16]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[17]  Joachim Hertzberg,et al.  Model-based furniture recognition for building semantic object maps , 2017, Artif. Intell..

[18]  Gi Hyun Lim,et al.  A perceptual memory system for grounding semantic representations in intelligent service robots , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[20]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[21]  Ali Shahrokni,et al.  Mesh Based Semantic Modelling for Indoor and Outdoor Scenes , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  José-Raúl Ruiz-Sarmiento,et al.  Exploiting semantic knowledge for robot object recognition , 2015, Knowl. Based Syst..

[23]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[24]  Alessandro Saffiotti,et al.  An introduction to the anchoring problem , 2003, Robotics Auton. Syst..

[25]  Chong-Wah Ngo,et al.  Semantic context modeling with maximal margin Conditional Random Fields for automatic image annotation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26]  Cipriano Galindo,et al.  UPGMpp: a Software Library for Contextual Object Recognition , 2015 .