Structural Bootstrapping—A Novel, Generative Mechanism for Faster and More Efficient Acquisition of Action-Knowledge

Humans, but also robots, learn to improve their behavior. Without existing knowledge, learning either needs to be explorative and, thus, slow or-to be more efficient-it needs to rely on supervision, which may not always be available. However, once some knowledge base exists an agent can make use of it to improve learning efficiency and speed. This happens for our children at the age of around three when they very quickly begin to assimilate new information by making guided guesses how this fits to their prior knowledge. This is a very efficient generative learning mechanism in the sense that the existing knowledge is generalized into as-yet unexplored, novel domains. So far generative learning has not been employed for robots and robot learning remains to be a slow and tedious process. The goal of the current study is to devise for the first time a general framework for a generative process that will improve learning and which can be applied at all different levels of the robot's cognitive architecture. To this end, we introduce the concept of structural bootstrapping-borrowed and modified from child language acquisition-to define a probabilistic process that uses existing knowledge together with new observations to supplement our robot's data-base with missing information about planning-, object-, as well as, action-relevant entities. In a kitchen scenario, we use the example of making batter by pouring and mixing two components and show that the agent can efficiently acquire new knowledge about planning operators, objects as well as required motor pattern for stirring by structural bootstrapping. Some benchmarks are shown, too, that demonstrate how structural bootstrapping improves performance.

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  J. Church Language in Childhood , 1962 .

[3]  Earl D. Sacerdoti,et al.  Planning in a Hierarchy of Abstraction Spaces , 1974, IJCAI.

[4]  Norman I. Badler,et al.  Temporal scene analysis: conceptual descriptions of object movements. , 1975 .

[5]  Austin Tate,et al.  Generating Project Networks , 1977, IJCAI.

[6]  Martin Kay,et al.  Syntactic Process , 1979, ACL.

[7]  Steven Pinker,et al.  Language learnability and language development , 1985 .

[8]  L. Gleitman The Structural Sources of Verb Meanings , 2020, Sentence First, Arguments Afterward.

[9]  James A. Hendler,et al.  HTN Planning: Complexity and Expressivity , 1994, AAAI.

[10]  Roberta Michnick Golinkoff,et al.  Action Meets Word: How Children Learn Verbs , 1995 .

[11]  C. Fisher Structural Limits on Verb Mapping: The Role of Analogy in Children's Interpretations of Sentences , 1996, Cognitive Psychology.

[12]  Letitia R. Naigles,et al.  The use of multiple frames in verb learning via syntactic bootstrapping , 1996, Cognition.

[13]  Gösta H. Granlund,et al.  The complexity of vision , 1999, Signal Process..

[14]  Stefan Schaal,et al.  Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[15]  J. Snedeker,et al.  Cross-situational observation and the semantic bootstrapping hypothesis , 2000 .

[16]  Fahiem Bacchus,et al.  A Knowledge-Based Approach to Planning with Incomplete Information and Sensing , 2002, AIPS.

[17]  Li Cuiyan,et al.  A survey of repetitive control , 2004, IROS.

[18]  L. Gleitman,et al.  Hard Words , 2005, Sentence First, Arguments Afterward.

[19]  Aude Billard,et al.  Discriminative and adaptive imitation in uni-manual and bi-manual tasks , 2006, Robotics Auton. Syst..

[20]  Tamim Asfour,et al.  ARMAR-III: An Integrated Humanoid Platform for Sensory-Motor Control , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[21]  Mark Steedman,et al.  CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank , 2007, CL.

[22]  Rüdiger Dillmann,et al.  Incremental Learning of Tasks From User Demonstrations, Past Experiences, and Vocal Comments , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[23]  Stefan Schaal,et al.  Dynamics systems vs. optimal control--a unifying view. , 2007, Progress in brain research.

[24]  John C. Trueswell,et al.  Learning to parse and its implications for language acquisition , 2007 .

[25]  Manuel Lopes,et al.  Learning Object Affordances: From Sensory--Motor Coordination to Imitation , 2008, IEEE Transactions on Robotics.

[26]  S. Crain,et al.  Language Acquisition , 2008 .

[27]  Anthony G. Cohn,et al.  Learning Functional Object-Categories from a Relational Spatio-Temporal Representation , 2008, ECAI.

[28]  Danica Kragic,et al.  Robot Learning from Demonstration: A Task-level Planning Approach , 2008 .

[29]  Tamim Asfour,et al.  Toward humanoid manipulation in human-centred environments , 2008, Robotics Auton. Syst..

[30]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[31]  Tamim Asfour,et al.  Combining Harris interest points and the SIFT descriptor for fast scale-invariant object recognition , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[32]  Alexander Bierbaum,et al.  Grasp affordances from multi-fingered tactile exploration using dynamic potential fields , 2009, 2009 9th IEEE-RAS International Conference on Humanoid Robots.

[33]  Florentin Wörgötter,et al.  Cognitive agents - a procedural perspective relying on the predictability of Object-Action-Complexes (OACs) , 2009, Robotics Auton. Syst..

[34]  Andrej Gams,et al.  On-line learning and modulation of periodic movements with nonlinear dynamical systems , 2009, Auton. Robots.

[35]  Tamim Asfour,et al.  Accurate shape-based 6-DoF pose estimation of single-colored objects , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[36]  Sylvia Yuan,et al.  Syntactic bootstrapping. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[37]  Eren Erdal Aksoy,et al.  Categorizing object-action relations from semantic scene graphs , 2010, 2010 IEEE International Conference on Robotics and Automation.

[38]  Andrej Gams,et al.  On-line periodic movement and force-profile learning for adaptation to new surfaces , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[39]  Jun Morimoto,et al.  Task-Specific Generalization of Discrete and Periodic Dynamic Movement Primitives , 2010, IEEE Transactions on Robotics.

[40]  Rüdiger Dillmann,et al.  Advances in Robot Programming by Demonstration , 2010, KI - Künstliche Intelligenz.

[41]  Eren Erdal Aksoy,et al.  Learning the semantics of object–action relations by observation , 2011, Int. J. Robotics Res..

[42]  Ales Ude,et al.  Exploiting previous experience to constrain robot sensorimotor learning , 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots.

[43]  Mark Steedman,et al.  Semi-supervised CCG Lexicon Extension , 2011, EMNLP.

[44]  Learning and Application of High-Level Concepts with Conceptual Spaces and PDDL , 2011 .

[45]  Klas Kronander,et al.  Learning to control planar hitting motions in a minigolf-like task , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[46]  Mark Steedman,et al.  Object-Action Complexes: Grounded abstractions of sensory-motor processes , 2011, Robotics Auton. Syst..

[47]  Danica Kragic,et al.  Visual object-action recognition: Inferring object affordances from human demonstration , 2011, Comput. Vis. Image Underst..

[48]  Tamim Asfour,et al.  6-DoF model-based tracking of arbitrarily shaped 3D objects , 2011, 2011 IEEE International Conference on Robotics and Automation.

[49]  Heinz Wörn,et al.  Haptic object recognition for multi-fingered robot hands , 2012, 2012 IEEE Haptics Symposium (HAPTICS).

[50]  R. Dillmann,et al.  Learning continuous grasp stability for a humanoid robot hand based on tactile sensing , 2012, 2012 4th IEEE RAS & EMBS International Conference on Biomedical Robotics and Biomechatronics (BioRob).

[51]  Jan Peters,et al.  Nonamemanuscript No. (will be inserted by the editor) Reinforcement Learning to Adjust Parametrized Motor Primitives to , 2011 .

[52]  Adam Prügel-Bennett,et al.  Kernel-Mapping Recommender system algorithms , 2012, Inf. Sci..

[53]  Mark Steedman,et al.  A Probabilistic Model of Syntactic and Semantic Acquisition from Child-Directed Utterances and their Meanings , 2012, EACL.

[54]  Minija Tamosiunaite,et al.  Joining Movement Sequences: Modified Dynamic Movement Primitives for Robotics Applications Exemplified on Handwriting , 2012, IEEE Transactions on Robotics.

[55]  Stefan Schaal,et al.  Encoding of periodic and their transient motions by a single dynamic movement primitive , 2012, 2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012).

[56]  Tamim Asfour,et al.  Action sequence reproduction based on automatic segmentation and Object-Action Complexes , 2013, 2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids).

[57]  Leon Bodenhagen,et al.  Statistical Identification of Composed Visual Features Indicating High Likelihood of Grasp Success , 2013, ICRA 2013.

[58]  Justus H. Piater,et al.  Homogeneity analysis for object-action relation reasoning in kitchen scenarios , 2013, MLIS '13.

[59]  Yiannis Aloimonos,et al.  Detection of Manipulation Action Consequences (MAC) , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[60]  Norbert Krüger,et al.  Multi-view object recognition using view-point invariant shape relations and appearance information , 2013, 2013 IEEE International Conference on Robotics and Automation.

[61]  Justus H. Piater,et al.  Continuous Pose Estimation in 2D Images at Instance and Category Levels , 2013, 2013 International Conference on Computer and Robot Vision.

[62]  Dirk Kraft,et al.  A Survey of the Ontogeny of Tool Use: From Sensorimotor Experience to Planning , 2013, IEEE Transactions on Autonomous Mental Development.

[63]  Jun Nakanishi,et al.  Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[64]  Christopher W. Geib,et al.  Structural bootstrapping at the sensorimotor level for the fast acquisition of action knowledge for cognitive robots , 2013, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[65]  Rüdiger Dillmann,et al.  ARMAR-III: Advances in Humanoid Grasping and Manipulation , 2013 .

[66]  Ales Ude,et al.  A Simple Ontology of Manipulation Actions Based on Hand-Object Relations , 2013, IEEE Transactions on Autonomous Mental Development.

[67]  Sinan Kalkan,et al.  Deep Hierarchies in the Primate Visual Cortex: What Can We Learn for Computer Vision? , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[68]  Andrej Gams,et al.  Rich periodic motor skills on humanoid robots: Riding the pedal racer , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[69]  Tamim Asfour,et al.  Learn to wipe: A case study of structural bootstrapping from sensorimotor experience , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[70]  Eren Erdal Aksoy,et al.  Using structural bootstrapping for object substitution in robotic executions of human-like manipulation tasks , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).