Paper information - Learning piecewise control strategies in a modular neural network architecture

The authors describe a multinetwork, or modular, neural network architecture that learns to perform control tasks using a piecewise control strategy. The architecture's networks compete to learn the training patterns. As a result, a plant's parameter space is adaptively partitioned into a number of regions, and a different network learns a control law in each region. This learning process is described in a probabilistic framework, and learning algorithms that perform gradient ascent on a log-likelihood function are discussed. Simulations show that the modular architecture outperforms a single network on a multipayload robot motion control task.
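The competitive learning idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes linear expert networks, a softmax gating network, and unit-variance Gaussian noise, which is one common mixture-of-experts formulation. All names (`softmax`, `forward`, `ascent_step`, `W`, `V`) and the toy piecewise-linear data are hypothetical and chosen only for illustration.

```python
import numpy as np

def softmax(z):
    # Row-wise softmax, shifted for numerical stability.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def forward(X, Y, W, V):
    # X: (n, d) inputs, Y: (n, m) targets.
    # W: (k, d, m) weights of k linear experts (assumed form).
    # V: (d, k) weights of the softmax gating network (assumed form).
    g = softmax(X @ V)                               # (n, k) gating probabilities
    out = np.einsum('nd,kdm->nkm', X, W)             # (n, k, m) expert outputs
    err = Y[:, None, :] - out                        # (n, k, m) per-expert errors
    lik = g * np.exp(-0.5 * (err ** 2).sum(axis=2))  # (n, k) weighted likelihoods
    ll = np.log(lik.sum(axis=1)).mean()              # mean log-likelihood (up to a constant)
    return ll, g, err, lik

def ascent_step(X, Y, W, V, lr=0.1):
    # One full-batch gradient ascent step on the mean log-likelihood.
    ll, g, err, lik = forward(X, Y, W, V)
    h = lik / lik.sum(axis=1, keepdims=True)         # posterior responsibility of each expert
    n = X.shape[0]
    grad_W = np.einsum('nk,nd,nkm->kdm', h, X, err) / n  # experts learn where they are responsible
    grad_V = X.T @ (h - g) / n                           # gate moves toward the posterior
    return W + lr * grad_W, V + lr * grad_V, ll

# Toy data (hypothetical): a target that follows different linear laws in two regions.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=(200, 1))
X = np.hstack([x, np.ones_like(x)])                  # add a bias feature
Y = np.where(x < 0, 2.0 * x, -3.0 * x)

W = 0.01 * rng.standard_normal((2, 2, 1))            # two experts
V = 0.01 * rng.standard_normal((2, 2))

ll_start = forward(X, Y, W, V)[0]
for _ in range(1000):
    W, V, _ = ascent_step(X, Y, W, V)
ll_end = forward(X, Y, W, V)[0]
print(f"mean log-likelihood: {ll_start:.3f} -> {ll_end:.3f}")
```

In this sketch the posterior `h` plays the role of the competition: the expert that already fits a pattern best receives most of that pattern's gradient, so the experts tend to specialize on different regions of the input space while the gating network learns the partition.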

Michael I. Jordan | Robert A. Jacobs
