Joint categorization of objects and rooms for mobile robots

In general, the problems of objects' and rooms' categorizations for robotic applications have been addressed separately. The current trend is, however, towards a joint modelling of both issues in order to leverage their mutual contextual relations: object → room (e.g. the detection of a microwave indicates that the room is likely to be a kitchen), and room → object (e.g. if the robot is in a bathroom, it is probable to find a toilet). Probabilistic Graphical Models (PGMs) are typically employed to conveniently cope with such relations, relying on inference processes to hypothesize about objects' and rooms' categories. In this work we present a Conditional Random Field (CRF) model, a particular type of PGM, to jointly categorize objects and rooms from RGBD images exploiting object-object and object-room relations. The learning phase of the proposed CRF uses Human Knowledge (HK) to eliminate the necessity of gathering real training data. Concretely, HK is acquired through elicitation and codified into an ontology, which is exploited to effortless generate an arbitrary number of representative synthetic samples for training. The performance of the proposed CRF model has been assessed using the NYU2 dataset, achieving a success of ~ 70% categorizing both, objects and rooms.

[1]  José-Raúl Ruiz-Sarmiento,et al.  Exploiting semantic knowledge for robot object recognition , 2015, Knowl. Based Syst..

[2]  Michael Uschold,et al.  Ontologies: principles, methods and applications , 1996, The Knowledge Engineering Review.

[3]  Patric Jensfelt,et al.  Hierarchical Multi-Modal Place Categorization , 2011, ECMR.

[4]  James J. Little,et al.  Place Classification Using Visual Object Categorization and Global Information , 2011, 2011 Canadian Conference on Computer and Robot Vision.

[5]  Derek Hoiem,et al.  Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[6]  Cipriano Galindo,et al.  UPGMpp: a Software Library for Contextual Object Recognition , 2015 .

[7]  Barbara Caputo,et al.  Multi-modal Semantic Place Classification , 2010, Int. J. Robotics Res..

[8]  Antonio Torralba,et al.  Recognizing indoor scenes , 2009, CVPR.

[9]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[10]  D. Greig,et al.  Exact Maximum A Posteriori Estimation for Binary Images , 1989 .

[11]  Antonio Torralba,et al.  Context-based vision system for place and object recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  Cipriano Galindo,et al.  Multihierarchical Interactive Task Planning: Application to Mobile Robotics , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[13]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[14]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Cipriano Galindo,et al.  Mobile Robot Object Recognition through the Synergy of Probabilistic Graphical Models and Semantic Knowledge , 2014 .

[16]  Nicholas Roy,et al.  Indoor scene recognition through object detection , 2010, 2010 IEEE International Conference on Robotics and Automation.

[17]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[18]  Sanja Fidler,et al.  Holistic Scene Understanding for 3D Object Detection with RGBD Cameras , 2013, 2013 IEEE International Conference on Computer Vision.

[19]  Henrik I. Christensen,et al.  A conditional random field model for place and object classification , 2012, 2012 IEEE International Conference on Robotics and Automation.

[20]  Alessandro Saffiotti,et al.  Using semantic information for improving efficiency of robot task planning , 2007 .

[21]  Thorsten Joachims,et al.  Contextually guided semantic labeling and search for three-dimensional point clouds , 2013, Int. J. Robotics Res..