论文信息 - Rapid learning in robotics

Rapid learning in robotics

Robotics deals with the control of actuators using various types of sensors and control schemes. The availability of precise sensorimotor mappings – able to transform between various involved motor, joint, sensor, and physical spaces – is a crucial issue. These mappings are often highly non-linear and sometimes hard to derive analytically. Consequently, there is a strong need for rapid learning algorithms which take into account that the acquisition of training data is often a costly operation. The present book discusses many of the issues that are important to make learning approaches in robotics more feasible. Basis for the major part of the discussion is a new learning algorithm, the Parameterized Self-Organizing Maps, that is derived from a model of neural self-organization. A key feature of the new method is the rapid construction of even highly non-linear variable relations from rather modestly-sized training data sets by exploiting topology information that is not utilized in more traditional approaches. In addition, the author shows how this approach can be used in a modular fashion, leading to a learning architecture for the acquisition of basic skills during an " investment learning " phase, and, subsequently, for their rapid combination to adapt to new situational contexts. ii Foreword The rapid and apparently effortless adaptation of their movements to a broad spectrum of conditions distinguishes both humans and animals in an important way even from nowadays most sophisticated robots. Algorithms for rapid learning will, therefore, become an important prerequisite for future robots to achieve a more intelligent coordination of their movements that is closer to the impressive level of biological performance. The present book discusses many of the issues that are important to make learning approaches in robotics more feasible. A new learning algorithm , the Parameterized Self-Organizing Maps, is derived from a model of neural self-organization. It has a number of benefits that make it particularly suited for applications in the field of robotics. A key feature of the new method is the rapid construction of even highly non-linear variable relations from rather modestly-sized training data sets by exploiting topology information that is unused in the more traditional approaches. In addition, the author shows how this approach can be used in a modular fashion, leading to a learning architecture for the acquisition of basic skills during an " investment learning " phase, and, subsequently, for their rapid combination to adapt to new situational contexts. The …

Jörg A. Walter

[1] W. Cleveland. Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[2] C. S. G. Lee,et al. Robotics: Control, Sensing, Vision, and Intelligence , 1987 .

[3] R. Paul. Robot manipulators : mathematics, programming, and control : the computer control of robot manipulators , 1981 .

[4] Robert A. Jacobs,et al. Hierarchical Mixtures of Experts and the EM Algorithm , 1993, Neural Computation.

[5] Geoffrey E. Hinton,et al. Learning distributed representations of concepts. , 1989 .

[6] Helge J. Ritter,et al. Visual gesture-based robot guidance with a modular neural system , 1995, NIPS.

[7] Yann LeCun,et al. Optimal Brain Damage , 1989, NIPS.

[8] Vincent Hayward,et al. Robot Manipulator Control under Unix RCCL: A Robot Control "C" Library , 1986 .

[9] Enno Littmann. Strukturierung neuronaler Netze zwischen Biologie und Anwendung - biologische Modellierung, Kaskadierung und hybrider Ansatz , 1995, DISKI.

[10] 高等学校計算数学学報編輯委員会編,et al. 高等学校計算数学学報 = Numerical mathematics , 1979 .

[11] H. Ritter. Investment Learning with Hierarchical Psom , 1995 .

[12] J. Nadal,et al. Learning in feedforward layered networks: the tiling algorithm , 1989 .

[13] Mike Parker,et al. Real Time Control Under UNIX for RCCL , 1996 .

[14] George Cybenko,et al. Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[15] Teuvo Kohonen,et al. Self-Organization and Associative Memory , 1988 .

[16] Helge Ritter,et al. Service Object Request Management Architecture: SORMA Concepts and Examples , 1996 .

[17] Franz Kummert,et al. Recognition of 3D-Orientation from Monocular Color Images by Neural Semantic Networks , 1993 .

[18] Klaus Schulten,et al. Topology-conserving maps for learning visuo-motor-coordination , 1989, Neural Networks.

[19] Helge Ritter,et al. Topology conserving mappings for learning motor tasks , 1987 .

[20] James A. Anderson,et al. Neurocomputing: Foundations of Research , 1988 .

[21] Helge J. Ritter,et al. Rapid learning with parametrized self-organizing maps , 1996, Neurocomputing.

[22] Peter Craven,et al. Smoothing noisy data with spline functions , 1978 .

[23] Helge J. Ritter,et al. A tactile sensor system for a three-fingered robot manipulator , 1997, Proceedings of International Conference on Robotics and Automation.

[24] Marvin Minsky,et al. Perceptrons: An Introduction to Computational Geometry , 1969 .

[25] Klaus Schulten,et al. Implementation of self-organizing neural networks for visuo-motor control of an industrial robot , 1993, IEEE Trans. Neural Networks.

[26] M. F.,et al. Bibliography , 1985, Experimental Gerontology.

[27] J. Freidman,et al. Multivariate adaptive regression splines , 1991 .

[28] J J Hopfield,et al. Neurons with graded response have collective computational properties like those of two-state neurons. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[29] Klaus Pawelzik,et al. Quantifying the neighborhood preservation of self-organizing feature maps , 1992, IEEE Trans. Neural Networks.

[30] David E. Goldberg,et al. Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[31] M. Hutchinson,et al. Smoothing noisy data with spline functions , 1985 .

[32] E. Capaldi,et al. The organization of behavior. , 1992, Journal of applied behavior analysis.

[33] Bernd Fritzke,et al. Let It Grow - Self-Organizing Feature Maps With Problem Dependent Cell Structure , 1991 .

[34] Helge Ritter,et al. The NI Robotics Laboratory , 1996 .

[35] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[36] John C. Platt. A Resource-Allocating Network for Function Interpolation , 1991, Neural Computation.

[37] M Kuperstein,et al. Neural model of adaptive hand-eye coordination for single postures. , 1988, Science.

[38] J. Doyne Farmer,et al. Exploiting Chaos to Predict the Future and Reduce Noise , 1989 .

[39] Helge Ritter,et al. Local PSOMs and Chebyshev PSOMs -Improving the Parametrised Self-Organizing Maps , 1995 .

[40] Jörg A. Walter. SORMA: interoperating distributed robotics hardware , 1997, Proceedings of International Conference on Robotics and Automation.

[41] S. Gruber,et al. Robot hands and the mechanics of manipulation , 1987, Proceedings of the IEEE.

[42] Christian Lebiere,et al. The Cascade-Correlation Learning Architecture , 1989, NIPS.

[43] M. J. D. Powell,et al. Radial basis functions for multivariable interpolation: a review , 1987 .

[44] Eric Wan,et al. Finite Impulse Response Neural Networks for Autoregressive Time Series Prediction , 1993 .

[45] Peter M. Todd,et al. Designing Neural Networks using Genetic Algorithms , 1989, ICGA.

[46] C. Gielen,et al. Neural computation and self-organizing maps, an introduction , 1993 .

[47] Marcus Frean,et al. The Upstart Algorithm: A Method for Constructing and Training Feedforward Neural Networks , 1990, Neural Computation.

[48] F. A. Seiler,et al. Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[49] Helge Ritter,et al. Parametrized Self-Organizing Maps , 1993 .

[50] W. Cleveland,et al. Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting , 1988 .

[51] John A. Hertz,et al. Exploiting Neurons with Localized Receptive Fields to Learn Chaos , 1990, Complex Syst..

[52] H. Ritter,et al. A principle for the formation of the spatial structure of cortical feature maps. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[53] J J Hopfield,et al. Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[54] George A. Bekey,et al. On reducing learning time in context-dependent mappings , 1993, IEEE Trans. Neural Networks.

[55] A. A. Mullin,et al. Principles of neurodynamics , 1962 .

[56] C. Malsburg,et al. How to label nerve cells so that they can interconnect in an ordered fashion. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[57] Teuvo Kohonen,et al. The self-organizing map , 1990, Neurocomputing.

[58] Elie Bienenstock,et al. Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[59] P. Werbos,et al. Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[60] Thomas Martinetz,et al. 'Neural-gas' network for vector quantization and its application to time-series prediction , 1993, IEEE Trans. Neural Networks.

[61] Thomas Wengerek. Reinforcement-Lernen in der Robotik , 1996, DISKI.

[62] Stefan Schaal,et al. Robot learning by nonparametric regression , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[63] Bernd Fritzke. Incremental Learning of Local Linear Mappings , 1995 .

[64] Helge Ritter,et al. Neural Networks for Robotics , 1992 .

[65] Helge J. Ritter,et al. Associative Completion and Investment Learning Using PSOMs , 1996, ICANN.

[66] Gerd Hirzinger,et al. ROTEX-the first remotely controlled robot in space , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[67] Helge Ritter. Asymptotic level density for a class of vector quantization processes , 1991, IEEE Trans. Neural Networks.

[68] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.

[69] Joris De Schutter,et al. Compliant robot motion: task formulation and control , 1986 .

[70] Kurt Hornik,et al. Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[71] Andreas Baader,et al. Ein Umwelterfassungssystem für multisensorielle Montageroboter , 1995 .

[72] Helge Ritter,et al. Learning to recognize 3D-Hand Postures from Perspective Pixel Images , 1992 .