Developmental Robotics: Theory and Experiments

A hand-designed internal representation of the world cannot deal with unknown or uncontrolled environments. Motivated by human cognitive and behavioral development, this paper presents a theory, an architecture, and some experimental results for developmental robotics. By a developmental robot, we mean that the robot generates its “brain” (or “central nervous system,” including the information processor and controller) through online, real-time interactions with its environment (including humans). A new Self-Aware Self-Effecting (SASE) agent concept is proposed, based on our SAIL and Dav developmental robots. The manual and autonomous development paradigms are formulated along with a theory of representation suited for autonomous development. Unlike traditional robot learning, the tasks that a developmental robot ends up learning are unknown during the programming time so that the task-specific representation must be generated and updated through real-time “living” experiences. Experimental results with SAIL and Dav developmental robots are presented, including visual attention selection, autonomous navigation, developmental speech learning, range-based obstacle avoidance, and scaffolding through transfer and chaining.

[1]  Narendra Ahuja,et al.  Learning Recognition and Segmentation Using the Cresceptron , 1997, International Journal of Computer Vision.

[2]  Juyang Weng,et al.  Grounded auditory development by a developmental robot , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[3]  Jerome A. Feldman,et al.  Connectionist Models and Their Properties , 1982, Cogn. Sci..

[4]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[5]  Scott A Miller,et al.  Cognitive development, 3rd ed. , 1993 .

[6]  Roderic A. Grupen,et al.  Learning prospective pick and place behavior , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[7]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[8]  Larry S. Davis,et al.  An improved radial basis function network for visual autonomous road following , 1996, IEEE Trans. Neural Networks.

[9]  D. M. Hutton,et al.  Cambrian Intelligence: The Early History of the New AI , 2000 .

[10]  Juyang Weng,et al.  Online image classification using IHDR , 2003, International Journal on Document Analysis and Recognition.

[11]  J. Piaget The moral judgement in the child, New York (Harcourt, Brace & Company) 1932. , 1932 .

[12]  John R. Anderson,et al.  Rules of the Mind , 1993 .

[13]  N. Bayley Bayley Scales of Infant Development , 1999 .

[14]  Robin R. Murphy,et al.  Robot Learning a New Subfield? The Robolearn-96 Workshop , 1997, AI Mag..

[15]  Juyang Weng,et al.  Developmental Humanoids: Humanoids that Develop Skills Automatically , 2000 .

[16]  Xiao Huang,et al.  Novelty and Reinforcement Learning in the Value System of Developmental Robots , 2002 .

[17]  Michael Wooldridge,et al.  Intelligent Agents III , 1997 .

[18]  J. Bruner,et al.  The role of tutoring in problem solving. , 1976, Journal of child psychology and psychiatry, and allied disciplines.

[19]  Juyang Weng,et al.  Developmental Robots: Theory, Method and Experimental Results , 1999 .

[20]  Ronald C. Arkin,et al.  An Behavior-based Robotics , 1998 .

[21]  Juyang Weng,et al.  Developing early senses about the world: "Object Permanence" and visuoauditory real-time learning , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[22]  Giorgio Metta,et al.  Better Vision through Manipulation , 2003, Adapt. Behav..

[23]  James L. McClelland,et al.  Connectionist models of development , 2003 .

[24]  S. Harnad Categorical Perception: The Groundwork of Cognition , 1990 .

[25]  J. Haldane The interaction of nature and nurture. , 1946, Annals of eugenics.

[26]  David S. Touretzky,et al.  Operant Conditioning in Skinnerbots , 1997, Adapt. Behav..

[27]  Juyang Weng,et al.  Action chaining by a developmental robot with a value system , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[28]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[29]  M. Sidman,et al.  Conditional discrimination vs. matching to sample: an expansion of the testing paradigm. , 1982, Journal of the experimental analysis of behavior.

[30]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[31]  J. Piaget,et al.  The Moral Judgement of the Child , 1977 .

[32]  D. Laplane Thought and language. , 1992, Behavioural neurology.

[33]  M. Domjan The principles of learning and behavior, 4th ed. , 1998 .

[34]  R. Brooks,et al.  The cog project: building a humanoid robot , 1999 .

[35]  M. Alexander,et al.  Principles of Neural Science , 1981 .

[36]  Giulio Sandini,et al.  A developmental approach to visually-guided reaching in artificial systems , 1999, Neural Networks.

[37]  Juyang Weng,et al.  Conjunctive Visual and Auditory Development via Real-Time Dialogue , 2003 .

[38]  James L. McClelland,et al.  Autonomous Mental Development by Robots and Animals , 2001, Science.

[39]  M. Asada,et al.  A developmental approach accelerates learning of joint attention , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[40]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[41]  Arthur C. Graesser,et al.  Is it an Agent, or Just a Program?: A Taxonomy for Autonomous Agents , 1996, ATAL.

[42]  Alex Pentland,et al.  Learning words from sights and sounds: a computational model , 2002, Cogn. Sci..

[43]  Ying Wu,et al.  Robot speech learning via entropy guided LVQ and memory association , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[44]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[45]  David Klahr,et al.  Self-modifying production system model of cognitive development , 1987 .

[46]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[47]  Juyang Weng,et al.  Obstacle avoidance through incremental learning with attention selection , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[48]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[49]  Juyang Weng,et al.  State-based SHOSLIF for indoor visual navigation , 2000, IEEE Trans. Neural Networks Learn. Syst..

[50]  Alex Waibel,et al.  Readings in speech recognition , 1990 .

[51]  Nan Zhang,et al.  A developing sensory mapping for robots , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[52]  John K. Tsotsos Analyzing vision at the complexity level , 1990, Behavioral and Brain Sciences.

[53]  Juyang Weng,et al.  Candid Covariance-Free Incremental Principal Component Analysis , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[54]  Dean Pomerleau,et al.  ALVINN, an autonomous land vehicle in a neural network , 2015 .

[55]  Juyang Weng,et al.  Dav: a humanoid robot platform for autonomous mental development , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[56]  G. Mettaa,et al.  A developmental approach to visually-guided reaching in artificial systems , 1999 .

[57]  Juyang Weng,et al.  Progress in outdoor navigation by the SAIL developmental robot , 2002, SPIE Optics East.

[58]  S. Engel Thought and Language , 1964 .

[59]  G. Edelman,et al.  Behavioral constraints in the development of neuronal properties: a cortical model embedded in a real-world device. , 1998, Cerebral cortex.

[60]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[61]  Brian Scassellati,et al.  Infant-like Social Interactions between a Robot and a Human Caregiver , 2000, Adapt. Behav..

[62]  Allen Newell,et al.  SOAR: An Architecture for General Intelligence , 1987, Artif. Intell..

[63]  Giulio Sandini,et al.  A developmental approach to sensori-motor coordination in artificial systems , 1998, SMC'98 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.98CH36218).

[64]  Alexander I. Rudnicky,et al.  Survey of current speech technology , 1994, CACM.

[65]  Juyang Weng,et al.  Hierarchical Discriminant Regression , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[66]  M. Domjan The principles of learning and behavior , 1982 .

[67]  M. Tomasello The Role of Joint Attentional Processes in Early Language Development. , 1988 .

[68]  Juyang Weng,et al.  Vision-guided navigation using SHOSLIF , 1998, Neural Networks.