A model-based approach to finding substitute tools in 3D vision data

A robot can feasibly be given knowledge of a set of tools for manipulation activities (e.g. hammer, knife, spatula). If the robot then operates outside a closed environment it is likely to face situations where the tool it knows is not available, but alternative unknown tools are present. We tackle the problem of finding the best substitute tool based solely on 3D vision data. Our approach has simple hand-coded models of known tools in terms of superquadrics and relationships among them. Our system attempts to fit these models to point clouds of unknown tools, producing a numeric value for how good a fit is. This value can be used to rate candidate substitutes. We explicitly control how closely each part of a tool must match our model, under direction from parameters of a target task. We allow bottom-up information from segmentation to dictate the sizes that should be considered for various parts of the tool. These ideas allow for a flexible matching so that tools may be superficially quite different, but similar in the way that matters. We evaluate our system's ratings relative to other approaches and relative to human performance in the same task. This is an approach to knowledge transfer, via a suitable representation and reasoning engine, and we discuss how this could be extended to transfer in planning.

[1]  Ashok K. Goel,et al.  A Visual Analogy Approach to Source Case Retrieval in Robot Learning from Observation , 2014, AAAI 2014.

[2]  Franc Solina,et al.  Superquadrics and Their Geometric Properties , 2000 .

[3]  Yukie Nagai,et al.  Learning to grasp with parental scaffolding , 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots.

[4]  Franc Solina,et al.  Part-level object recognition using superquadrics , 2004, Comput. Vis. Image Underst..

[5]  Moritz Tenorth,et al.  Decomposing CAD models of objects of daily use and reasoning about their functional parts , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Markus Vincze,et al.  4D Space-Time Mereotopogeometry-Part Connectivity Calculus for Visual Object Representation , 2014, 2014 22nd International Conference on Pattern Recognition.

[7]  Justus H. Piater,et al.  Bottom-up learning of object categories, action effects and logical rules: From continuous manipulative exploration to symbolic planning , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Florentin Wörgötter,et al.  Bootstrapping the Semantics of Tools: Affordance Analysis of Real World Objects on a Per-part Basis , 2016, IEEE Transactions on Cognitive and Developmental Systems.

[9]  L. Barsalou,et al.  Whither structured representation? , 1999, Behavioral and Brain Sciences.

[10]  Daniel Cohen-Or,et al.  Contextual Part Analogies in 3D Objects , 2010, International Journal of Computer Vision.

[11]  Oliver Kroemer,et al.  A kernel-based approach to direct action perception , 2012, 2012 IEEE International Conference on Robotics and Automation.

[12]  Moritz Tenorth,et al.  KnowRob: A knowledge processing infrastructure for cognition-enabled robots , 2013, Int. J. Robotics Res..

[13]  Markus Vincze,et al.  3DNet: Large-scale object class recognition from CAD models , 2012, 2012 IEEE International Conference on Robotics and Automation.

[14]  Florentin Wörgötter,et al.  Constrained planar cuts - Object partitioning for point clouds , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Ashok K. Goel,et al.  A Similarity-Based Approach to Skill Transfer , 2015 .

[16]  Markus Vincze,et al.  Model-based 3D object detection , 2010, Machine Vision and Applications.

[17]  Tamim Asfour,et al.  Autonomous acquisition of visual multi-view object representations for object recognition on a humanoid robot , 2010, 2010 IEEE International Conference on Robotics and Automation.

[18]  Yiannis Aloimonos,et al.  Affordance detection of tool parts from geometric features , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[19]  B. Indurkhya Metaphor and Cognition: An Interactionist Approach , 1992 .

[20]  Justus H. Piater,et al.  Development of Object and Grasping Knowledge by Robot Exploration , 2010, IEEE Transactions on Autonomous Mental Development.

[21]  Michael Beetz,et al.  PR2 looking at things — Ensemble learning for unstructured information processing with Markov logic networks , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).