Self-Organizing Multi-Resolution Grid for Motion Planning and Control

A fully self-organizing neural network approach to low-dimensional control problems is described. We consider the problem of learning to control an object and solving the path planning problem at the same time. Control is based on the path planning model that follows the gradient of the stationary solution of a diffusion process working in the state space. Previous works are extended by introducing a self-organizing multigrid-like discretizing structure to represent the external world. Diffusion is simulated within a recurrent neural network built on this multigrid system. The novelty of the approach is that the diffusion on the multigrid is fast. Moreover, the diffusion process on the multigrid fits well the requirements of the path planning: it accelerates the diffusion in large free space regions while still keeps the resolution in small bottleneck-like labyrinths along the path. Control is achieved in the usual way: associative learning identifies the inverse dynamics of the system in a direct fashion. To this end there are introduced interneurons between neighboring discretizing units that detect the strength of the steady-state diffusion and forward control commands to the control neurons via modifiable connections. This architecture forms the Multigrid Position-and-Direction-to-Action (MPDA) map. The architecture integrates reactive path planning and continuous motion control. It is also shown that the scheme leads to population coding for the actual command vector.

[1]  L. Rosenhead Conduction of Heat in Solids , 1947, Nature.

[2]  Shun-ichi Amari,et al.  A Theory of Adaptive Pattern Classifiers , 1967, IEEE Trans. Electron. Comput..

[3]  S. Amari A Theory ofAdaptive Pattern Classifiers , 1967 .

[4]  F. Downton Stochastic Approximation , 1969, Nature.

[5]  M. T. Wasan Stochastic Approximation , 1969 .

[6]  A P Georgopoulos,et al.  On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex , 1982, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[7]  P. B. Allen Conduction of Heat. , 1983 .

[8]  W. Thomas Miller,et al.  Sensor-based control of robotic manipulators using a general learning algorithm , 1987, IEEE J. Robotics Autom..

[9]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[10]  PAUL J. WERBOS,et al.  Generalization of backpropagation with application to a recurrent gas market model , 1988, Neural Networks.

[11]  A. P. Georgopoulos,et al.  Primate motor cortex and free arm movements to visual targets in three- dimensional space. II. Coding of the direction of movement by a neuronal population , 1988, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[12]  Mitsuo Kawato,et al.  Feedback-error-learning neural network for trajectory control of a robotic manipulator , 1988, Neural Networks.

[13]  John Baillieul Sensor Based Control of Robotic Mechanisms , 1990 .

[14]  Bernard Widrow Adaptive inverse control , 1990, Defense, Security, and Sensing.

[15]  Michael P. Wellman,et al.  Planning and Control , 1991 .

[16]  Narendra Ahuja,et al.  Gross motion planning—a survey , 1992, CSUR.

[17]  V. VEMURI,et al.  Artificial Neural Networks in Control Applications , 1993, Adv. Comput..

[18]  T. Martínez,et al.  Competitive Hebbian Learning Rule Forms Perfectly Topology Preserving Maps , 1993 .

[19]  B. Widrow,et al.  Adaptive inverse control , 1987, Proceedings of 8th IEEE International Symposium on Intelligent Control.

[20]  Frank L. Lewis,et al.  Control of Robot Manipulators , 1993 .

[21]  Roderic A. Grupen,et al.  The applications of harmonic functions to robotics , 1993, J. Field Robotics.

[22]  Eduardo D. Sontag,et al.  Neural Networks for Control , 1993 .

[23]  Thomas Martinetz,et al.  Topology representing networks , 1994, Neural Networks.

[24]  András Lörincz,et al.  Topology Learning Solved by Extended Objects: A Neural Network Model , 1993, Neural Comput..

[25]  András Lörincz,et al.  Stabilizing Competitive Learning During On-Line Training with an Anti-Hebbian Weight Modulation , 1996, ICANN.

[26]  András Lörincz,et al.  Inverse Dynamics Controllers for Robust Control: Consequences for Neurocontrollers , 1996, ICANN.

[27]  András Lörincz,et al.  Approximate geometry representations and sensory fusion , 1996, Neurocomputing.

[28]  Terrence J. Sejnowski,et al.  Dynamic remapping , 1998 .

[29]  Michael I. Jordan Motor Learning and the Degrees of Freedom Problem , 2018, Attention and Performance XIII.