Paper information - Bootstrapping Relational Affordances of Object Pairs Using Transfer

Robots acting in everyday environments need a good knowledge of how a manipulation action can affect pairs of objects in a relationship, such as “inside” or “behind” or “on top.” These relationships afford certain means-end actions such as pulling a container to retrieve the contents, or pulling a tool to retrieve a desired object. We investigate how these relational affordances could be learned by a robot from its own action experience. A major challenge in this approach is to reduce the number of training samples needed to achieve accuracy, and hence we investigate an approach which can leverage past knowledge to accelerate current learning (which we call bootstrapping). We learn random forest-based affordance predictors from visual inputs and demonstrate two approaches to knowledge transfer for bootstrapping. In the first approach [direct bootstrapping (DB)], the state-space for a new affordance predictor is augmented with the output of previously learned affordances. In the second approach [category-based bootstrapping (CB)], we form categories that capture underlying commonalities of a pair of existing affordances and augment the state-space with this category classifier’s output. In addition, we introduce a novel heuristic, which suggests how a large set of potential affordance categories can be pruned to leave only those categories which are most promising for bootstrapping future affordances. Our results show that both bootstrapping approaches outperform learning without bootstrapping. We also show that there is no significant difference in performance between DB and CB.
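The direct bootstrapping idea described in the abstract, augmenting the state-space of a new affordance predictor with the output of a previously learned one, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the two-dimensional features (horizontal and vertical offset between the objects), the hand-written `on_top` predictor standing in for a previously learned affordance, the toy data, and the one-feature threshold classifier (`fit_stump`) used in place of a random forest to keep the example dependency-free.

```python
from typing import Callable, List, Sequence

Predictor = Callable[[Sequence[float]], float]


def augment(features: Sequence[float], prior: List[Predictor]) -> List[float]:
    """Append each previously learned predictor's output to the raw features."""
    base = list(features)
    return base + [p(base) for p in prior]


def fit_stump(X: List[List[float]], y: List[int]) -> Predictor:
    """Fit a one-feature threshold classifier (stand-in for a random forest)."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            for sign in (1, -1):
                preds = [1 if sign * (row[j] - t) >= 0 else 0 for row in X]
                acc = sum(p == label for p, label in zip(preds, y)) / len(y)
                if best is None or acc > best[0]:
                    best = (acc, j, t, sign)
    _, j, t, sign = best
    return lambda row: 1.0 if sign * (row[j] - t) >= 0 else 0.0


def accuracy(model: Predictor, X: List[List[float]], y: List[int]) -> float:
    """Fraction of samples the model labels correctly."""
    return sum(int(model(row)) == label for row, label in zip(X, y)) / len(y)


def on_top(f: Sequence[float]) -> float:
    """Hypothetical prior affordance: object rests on the support."""
    dx, dz = f[0], f[1]
    return 1.0 if abs(dx) < 0.05 and dz > 0 else 0.0


# Toy samples: [horizontal offset, vertical offset] of object relative to support.
X_raw = [[0.00, 0.10], [0.02, 0.12], [0.30, 0.10],
         [-0.30, 0.11], [0.01, -0.10], [0.25, -0.05]]
# Toy label for the new affordance: does pulling the support retrieve the object?
y = [1, 1, 0, 0, 0, 0]

# Without bootstrapping: learn from raw features only.
acc_raw = accuracy(fit_stump(X_raw, y), X_raw, y)

# With direct bootstrapping: the prior predictor's output becomes an extra feature.
X_aug = [augment(row, [on_top]) for row in X_raw]
acc_aug = accuracy(fit_stump(X_aug, y), X_aug, y)

print(f"training accuracy without bootstrapping: {acc_raw:.2f}")
print(f"training accuracy with bootstrapping:    {acc_aug:.2f}")
```

In this toy setting the weak learner cannot express the conjunction that defines the new affordance from the raw offsets alone, but it can once the prior predictor's output is available as a feature. Category-based bootstrapping would follow the same pattern, with the appended feature coming from a category classifier rather than from a single earlier affordance predictor.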
