Auditory learning: a developmental method

Motivated by the human autonomous development process from infancy to adulthood, we have built a robot that develops its cognitive and behavioral skills through real-time interactions with the environment. We call such a robot a developmental robot. In this paper, we present the theory and the architecture to implement a developmental robot and discuss the related techniques that address an array of challenging technical issues. As an application, experimental results on a real robot, self-organizing, autonomous, incremental learner (SAIL), are presented with emphasis on its audition perception and audition-related action generation. In particular, the SAIL robot conducts the auditory learning from unsegmented and unlabeled speech streams without any prior knowledge about the auditory signals, such as the designated language or the phoneme models. Neither available before learning starts are the actions that the robot is expected to perform. SAIL learns the auditory commands and the desired actions from physical contacts with the environment including the trainers.

[1]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[2]  D. N. Spinelli,et al.  Modification of the distribution of receptive field orientation in cats by selective visual exposure during development , 1971, Experimental Brain Research.

[3]  R. Held,et al.  MOVEMENT-PRODUCED STIMULATION IN THE DEVELOPMENT OF VISUALLY GUIDED BEHAVIOR. , 1963, Journal of comparative and physiological psychology.

[4]  J. Weng,et al.  Convergence Analysis of Complementary Candid Incremental Principal Component Analysis ∗ , 2001 .

[5]  N. L. Johnson,et al.  Linear Statistical Inference and Its Applications , 1966 .

[6]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[7]  Stephen E. Levinson,et al.  The Role of Sensorimotor Function, Associative Memory and Reinforcement Learning in Automatic Acquisition of Spoken Language by an Autonomous Robot , 1996 .

[8]  Andrew James Smith,et al.  Applications of the self-organising map to reinforcement learning , 2002, Neural Networks.

[9]  James S. Harris,et al.  Probability theory and mathematical statistics , 1998 .

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11]  Kazushi Ikeda,et al.  A new criterion using information gain for action selection strategy in reinforcement learning , 2004, IEEE Transactions on Neural Networks.

[12]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[13]  Erkki Oja,et al.  Subspace methods of pattern recognition , 1983 .

[14]  R. Spitz,et al.  Anaclitic depression; an inquiry into the genesis of psychiatric conditions in early childhood. , 1946, The Psychoanalytic study of the child.

[15]  O. Ekeberg,et al.  [Anaclitic depression]. , 1986, Tidsskrift for den Norske laegeforening : tidsskrift for praktisk medicin, ny raekke.

[16]  H. Wellman The Child's Theory of Mind , 1990 .

[17]  Ying Wu,et al.  Robot speech learning via entropy guided LVQ and memory association , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[18]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[19]  W. H. Williams,et al.  Probability Theory and Mathematical Statistics , 1964 .

[20]  D. Hubel,et al.  Plasticity of ocular dominance columns in monkey striate cortex. , 1977, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[21]  Terence D. Sanger,et al.  Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[22]  E. Oja,et al.  On stochastic approximation of the eigenvectors and eigenvalues of the expectation of a random matrix , 1985 .

[23]  Juyang Weng,et al.  Candid Covariance-Free Incremental Principal Component Analysis , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  L Sirovich,et al.  Low-dimensional procedure for the characterization of human faces. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[25]  Alex Pentland,et al.  Learning audio-visual associations using mutual information , 1999, Proceedings Integration of Speech and Image Understanding.

[26]  Juyang Weng,et al.  Developmental Humanoids: Humanoids that Develop Skills Automatically , 2000 .

[27]  Shyang Chang,et al.  An adaptive learning algorithm for principal component analysis , 1995, IEEE Trans. Neural Networks.

[28]  James L. McClelland,et al.  Autonomous Mental Development by Robots and Animals , 2001, Science.

[29]  S. Zeger,et al.  Markov regression models for time series: a quasi-likelihood approach. , 1988, Biometrics.

[30]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[31]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[32]  Juyang Weng,et al.  Hierarchical Discriminant Regression , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  M. Kupperman Linear Statistical Inference and Its Applications 2nd Edition (C. Radhakrishna Rao) , 1975 .

[34]  Lawrence R. Rabiner,et al.  Toward Vision 2001: Voice and audio processing considerations , 1995, AT&T Technical Journal.

[35]  G. Edelman,et al.  Behavioral constraints in the development of neuronal properties: a cortical model embedded in a real-world device. , 1998, Cerebral cortex.

[36]  Ralf Möller,et al.  Coupled principal component analysis , 2004, IEEE Transactions on Neural Networks.

[37]  P. L. Adams THE ORIGINS OF INTELLIGENCE IN CHILDREN , 1976 .

[38]  Juyang Weng,et al.  The developmental approach to multimedia speech learning , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[39]  Deb Roy,et al.  Learning from multimodal observations , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[40]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .