Task Transfer by a Developmental Robot

Scaffolding is a process of transferring learned skills to new and more complex tasks through arranged experience in open-ended development. In this paper, we propose a developmental learning architecture that enables a robot to transfer skills acquired in early learning settings to later more complex task settings. We show that a basic mechanism that enables this transfer is sequential priming combined with attention, which is also the driving mechanism for classical conditioning, secondary conditioning, and instrumental conditioning in animal learning. A major challenge of this work is that training and testing must be conducted in the same program operational mode through online, real-time interactions between the agent and the trainers. In contrast with former modeling studies, the proposed architecture does not require the programmer to know the tasks to be learned and the environment is uncontrolled. All possible perceptions and actions, including the actual number of classes, are not available until the programming is finished and the robot starts to learn in the real world. Thus, a predesigned task-specific symbolic representation is not suited for such an open-ended developmental process. Experimental results on a robot are reported in which the trainer shaped the behaviors of the agent interactively, continuously, and incrementally through verbal commands and other sensory signals so that the robot learns new and more complex sensorimotor tasks by transferring sensorimotor skills learned in earlier periods of open-ended development

[1]  Juyang Weng,et al.  Value system development for a robot , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[2]  B. Schwartz Psychology of Learning and Behavior , 1978 .

[3]  Peter Secretan Learning , 1965, Mental Health.

[4]  N. Mackintosh,et al.  Conditioning And Associative Learning , 1983 .

[5]  Juyang Weng,et al.  Online image classification using IHDR , 2003, International Journal on Document Analysis and Recognition.

[6]  Juyang Weng,et al.  A theory for mentally developing robots , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[7]  J. P. Byrnes,et al.  Cognitive Development and Learning in Instructional Contexts , 1995 .

[8]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[9]  Juyang Weng,et al.  Hierarchical Discriminant Regression , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  A. Klopf A neuronal model of classical conditioning , 1988 .

[11]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[12]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[13]  Jean-Arcady Meyer,et al.  Computational Models of Classical Conditioning: A Comparative Study , 1998 .

[14]  Juyang Weng,et al.  On developmental mental architectures , 2007, Neurocomputing.

[15]  Juyang Weng,et al.  Incremental Hierarchical Discriminant Regression , 2007, IEEE Transactions on Neural Networks.

[16]  Juyang Weng,et al.  Conjunctive Visual and Auditory Development via Real-Time Dialogue , 2003 .

[17]  C. L. Hull Goal attraction and directing ideas conceived as habit phenomena. , 1931 .

[18]  Juyang Weng,et al.  Developmental Robotics: Theory and Experiments , 2004, Int. J. Humanoid Robotics.

[19]  Christian Balkenius,et al.  Computational models of classical conditioning: a comparative study , 1998 .

[20]  Juyang Weng,et al.  Chained action learning through real-time interactions , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[21]  E. Kehoe,et al.  Rapid reacquisition in conditioning of the rabbit's nictitating membrane response. , 1992 .

[22]  M. Domjan The principles of learning and behavior , 1982 .

[23]  M. Alexander,et al.  Principles of Neural Science , 1981 .

[24]  James L. McClelland,et al.  Autonomous Mental Development by Robots and Animals , 2001, Science.

[25]  J. Bruner,et al.  The role of tutoring in problem solving. , 1976, Journal of child psychology and psychiatry, and allied disciplines.

[26]  Pattie Maes,et al.  Generalization in Instrumental Learning , 1996 .

[27]  Alex Pentland,et al.  Learning words from sights and sounds: a computational model , 2002, Cogn. Sci..

[28]  Olaf Sporns,et al.  Plasticity in Value Systems and its Role in Adaptive Behavior , 2000, Adapt. Behav..

[29]  Juyang Weng,et al.  Auditory learning: a developmental method , 2005, IEEE Transactions on Neural Networks.

[30]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[31]  Juyang Weng,et al.  Developmental Humanoids: Humanoids that Develop Skills Automatically , 2000 .

[32]  R. Rescorla A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement , 1972 .

[33]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[34]  M. Domjan The principles of learning and behavior, 4th ed. , 1998 .

[35]  B. Babkin Conditioned Reflexes; an Investigation of the Physiological Activity of the Cerebral Cortex. , 1929 .

[36]  J KONORSKI,et al.  [Chronic extinction and restoration of conditioned reflexes]. , 1953, Acta physiologica Polonica.

[37]  S. Zeger,et al.  Markov regression models for time series: a quasi-likelihood approach. , 1988, Biometrics.

[38]  David S. Touretzky,et al.  Operant Conditioning in Skinnerbots , 1997, Adapt. Behav..

[39]  Juyang Weng,et al.  Action chaining by a developmental robot with a value system , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[40]  Juyang Weng,et al.  A Theory of Developmental Mental Architecture and the Dav Architecture Design , 2005, Int. J. Humanoid Robotics.

[41]  Juyang Weng,et al.  Autonomous Mental Development: Workshop on Development and Learning (WDL) , 2002, AI Mag..

[42]  C. L. Hull Knowledge and purpose as habit mechanisms. , 1930 .

[43]  David S. Touretzky,et al.  Shaping robot behavior using principles from instrumental conditioning , 1997, Robotics Auton. Syst..

[44]  Juyang Weng,et al.  Grounded auditory development by a developmental robot , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[45]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[46]  J. Weng A Theory of Developmental Architecture , 2004 .

[47]  S. Engel Thought and Language , 1964 .

[48]  Ying Wu,et al.  Robot speech learning via entropy guided LVQ and memory association , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).