Paper Information - Developmental Robotics: Theory and Experiments

A hand-designed internal representation of the world cannot deal with unknown or uncontrolled environments. Motivated by human cognitive and behavioral development, this paper presents a theory, an architecture, and some experimental results for developmental robotics. By a developmental robot, we mean that the robot generates its “brain” (or “central nervous system,” including the information processor and controller) through online, real-time interactions with its environment (including humans). A new Self-Aware Self-Effecting (SASE) agent concept is proposed, based on our SAIL and Dav developmental robots. The manual and autonomous development paradigms are formulated along with a theory of representation suited for autonomous development. Unlike traditional robot learning, the tasks that a developmental robot ends up learning are unknown during the programming time so that the task-specific representation must be generated and updated through real-time “living” experiences. Experimental results with SAIL and Dav developmental robots are presented, including visual attention selection, autonomous navigation, developmental speech learning, range-based obstacle avoidance, and scaffolding through transfer and chaining.

Juyang Weng | J. Weng

[1] Narendra Ahuja,et al. Learning Recognition and Segmentation Using the Cresceptron , 1997, International Journal of Computer Vision.

[2] Juyang Weng,et al. Grounded auditory development by a developmental robot , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[3] Jerome A. Feldman,et al. Connectionist Models and Their Properties , 1982, Cogn. Sci..

[4] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[5] Scott A Miller,et al. Cognitive development, 3rd ed. , 1993 .

[6] Roderic A. Grupen,et al. Learning prospective pick and place behavior , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[7] Richard S. Sutton,et al. Reinforcement Learning , 1992, Handbook of Machine Learning.

[8] Larry S. Davis,et al. An improved radial basis function network for visual autonomous road following , 1996, IEEE Trans. Neural Networks.

[9] D. M. Hutton,et al. Cambrian Intelligence: The Early History of the New AI , 2000 .

[10] Juyang Weng,et al. Online image classification using IHDR , 2003, International Journal on Document Analysis and Recognition.

[11] J. Piaget. The Moral Judgement of the Child. New York: Harcourt, Brace & Company , 1932 .

[12] John R. Anderson,et al. Rules of the Mind , 1993 .

[13] N. Bayley. Bayley Scales of Infant Development , 1999 .

[14] Robin R. Murphy,et al. Robot Learning a New Subfield? The Robolearn-96 Workshop , 1997, AI Mag..

[15] Juyang Weng,et al. Developmental Humanoids: Humanoids that Develop Skills Automatically , 2000 .

[16] Xiao Huang,et al. Novelty and Reinforcement Learning in the Value System of Developmental Robots , 2002 .

[17] Michael Wooldridge,et al. Intelligent Agents III , 1997 .

[18] J. Bruner,et al. The role of tutoring in problem solving. , 1976, Journal of child psychology and psychiatry, and allied disciplines.

[19] Juyang Weng,et al. Developmental Robots: Theory, Method and Experimental Results , 1999 .

[20] Ronald C. Arkin,et al. Behavior-based Robotics , 1998 .

[21] Juyang Weng,et al. Developing early senses about the world: "Object Permanence" and visuoauditory real-time learning , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[22] Giorgio Metta,et al. Better Vision through Manipulation , 2003, Adapt. Behav..

[23] James L. McClelland,et al. Connectionist models of development , 2003 .

[24] S. Harnad. Categorical Perception: The Groundwork of Cognition , 1990 .

[25] J. Haldane. The interaction of nature and nurture. , 1946, Annals of eugenics.

[26] David S. Touretzky,et al. Operant Conditioning in Skinnerbots , 1997, Adapt. Behav..

[27] Juyang Weng,et al. Action chaining by a developmental robot with a value system , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[28] A. M. Turing,et al. Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[29] M. Sidman,et al. Conditional discrimination vs. matching to sample: an expansion of the testing paradigm. , 1982, Journal of the experimental analysis of behavior.

[30] John K. Tsotsos,et al. Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[31] J. Piaget,et al. The Moral Judgement of the Child , 1977 .

[32] D. Laplane. Thought and language. , 1992, Behavioural neurology.

[33] M. Domjan. The principles of learning and behavior, 4th ed. , 1998 .

[34] R. Brooks,et al. The cog project: building a humanoid robot , 1999 .

[35] M. Alexander,et al. Principles of Neural Science , 1981 .

[36] Giulio Sandini,et al. A developmental approach to visually-guided reaching in artificial systems , 1999, Neural Networks.

[37] Juyang Weng,et al. Conjunctive Visual and Auditory Development via Real-Time Dialogue , 2003 .

[38] James L. McClelland,et al. Autonomous Mental Development by Robots and Animals , 2001, Science.

[39] M. Asada,et al. A developmental approach accelerates learning of joint attention , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[40] Rodney A. Brooks,et al. A Robust Layered Control System For A Mobile Robot , 1986 .

[41] Arthur C. Graesser,et al. Is it an Agent, or Just a Program?: A Taxonomy for Autonomous Agents , 1996, ATAL.

[42] Alex Pentland,et al. Learning words from sights and sounds: a computational model , 2002, Cogn. Sci..

[43] Ying Wu,et al. Robot speech learning via entropy guided LVQ and memory association , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[44] Andrew G. Barto,et al. Reinforcement learning , 1998 .

[45] David Klahr,et al. Self-modifying production system model of cognitive development , 1987 .

[46] A. M. Turing,et al. Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[47] Juyang Weng,et al. Obstacle avoidance through incremental learning with attention selection , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[48] L. Baum,et al. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[49] Juyang Weng,et al. State-based SHOSLIF for indoor visual navigation , 2000, IEEE Trans. Neural Networks Learn. Syst..

[50] Alex Waibel,et al. Readings in speech recognition , 1990 .

[51] Nan Zhang,et al. A developing sensory mapping for robots , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[52] John K. Tsotsos. Analyzing vision at the complexity level , 1990, Behavioral and Brain Sciences.

[53] Juyang Weng,et al. Candid Covariance-Free Incremental Principal Component Analysis , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[54] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 2015 .

[55] Juyang Weng,et al. Dav: a humanoid robot platform for autonomous mental development , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.

[56] G. Metta,et al. A developmental approach to visually-guided reaching in artificial systems , 1999 .

[57] Juyang Weng,et al. Progress in outdoor navigation by the SAIL developmental robot , 2002, SPIE Optics East.

[58] S. Engel. Thought and Language , 1964 .

[59] G. Edelman,et al. Behavioral constraints in the development of neuronal properties: a cortical model embedded in a real-world device. , 1998, Cerebral cortex.

[60] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[61] Brian Scassellati,et al. Infant-like Social Interactions between a Robot and a Human Caregiver , 2000, Adapt. Behav..

[62] Allen Newell,et al. SOAR: An Architecture for General Intelligence , 1987, Artif. Intell..

[63] Giulio Sandini,et al. A developmental approach to sensori-motor coordination in artificial systems , 1998, SMC'98 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.98CH36218).

[64] Alexander I. Rudnicky,et al. Survey of current speech technology , 1994, CACM.

[65] Juyang Weng,et al. Hierarchical Discriminant Regression , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[66] M. Domjan. The principles of learning and behavior , 1982 .

[67] M. Tomasello. The Role of Joint Attentional Processes in Early Language Development. , 1988 .

[68] Juyang Weng,et al. Vision-guided navigation using SHOSLIF , 1998, Neural Networks.