Connectionist Learning Control at GTE Laboratories

At GTE Laboratories, we are advancing the theory of connectionist learning architectures for real-time control while exploring their relationships to animal learning models, applications in manufacturing quality control, and VLSI implementations. We seek connectionist-network architectures with improved convergence rate and scaling properties, as assessed on simulated and actual control problems. Our primary focus is on extensions to reinforcement learning. These include adaptive critics, feature/representation adaptation in multilayer networks, hybrid connectionist/conventional controllers, and modular networks for hierarchical control. We are also extending methods for system identification, or model learning, to include internal models learned using temporal-differences. We propose the integration of reinforcement and model learning based on their relationships to dynamic programming. We are working to resolve how connectionist systems should serve as a total systems concept or as tools in a larger architecture.

[1]  W. A. Clark,et al.  Simulation of self-organizing systems by digital computer , 1954, Trans. IRE Prof. Group Inf. Theory.

[2]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[3]  James S. Albus,et al.  Brains, behavior, and robotics , 1981 .

[4]  P. Anandan,et al.  Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  A G Barto,et al.  Learning by statistical cooperation of self-interested neuron-like computing elements. , 1985, Human neurobiology.

[6]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[7]  Michael Kuperstein,et al.  Adaptive visual-motor coordination in multijoint robots using parallel architecture , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[8]  R. Lumia,et al.  Hierarchical Control of Intelligent Machines Applied to Space Station Telerobots , 1987 .

[9]  Paul J. Werbos,et al.  Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[10]  Charles W. Anderson,et al.  Strategy Learning with Multilayer Connectionist Representations , 1987 .

[11]  W. Thomas Miller,et al.  Sensor-based control of robotic manipulators using a general learning algorithm , 1987, IEEE J. Robotics Autom..

[12]  Richard S. Sutton,et al.  Application of connectionist learning methods to manufacturing process monitoring , 1988, Proceedings IEEE International Symposium on Intelligent Control 1988.

[13]  R.E. Howard,et al.  A programmable analog neural network chip , 1988, Proceedings of the IEEE 1988 Custom Integrated Circuits Conference.

[14]  J. A. Franklin,et al.  Refinement of robot motor skills through reinforcement learning , 1988, Proceedings of the 27th IEEE Conference on Decision and Control.

[15]  M. Kawato,et al.  Hierarchical neural network model for voluntary movement with application to robotics , 1988, IEEE Control Systems Magazine.

[16]  A. Guez,et al.  A trainable neuromorphic controller , 1988 .

[17]  R. J. Williams,et al.  On the use of backpropagation in associative reinforcement learning , 1988, IEEE 1988 International Conference on Neural Networks.

[18]  Richard S. Sutton,et al.  Sequential Decision Problems and Neural Networks , 1989, NIPS 1989.

[19]  Judy A. Franklin,et al.  Historical perspective and state of the art in connectionist learning control , 1989, Proceedings of the 28th IEEE Conference on Decision and Control,.

[20]  Oliver G. Selfridge ADAPTIVE STRATEGIES OF LEARNING A STUDY OF TWO-PERSON ZERO-SUM COMPETITION , 1989, ICML 1989.

[21]  Charles W. Anderson Tower of Hanoi with Connectionist Networks: Learning New Features , 1989, ML.

[22]  Richard S. Sutton,et al.  Learning and Sequential Decision Making , 1989 .

[23]  J. Franklin Input space representation for refinement learning control , 1989, Proceedings. IEEE International Symposium on Intelligent Control 1989.

[24]  Richard S. Sutton,et al.  Time-Derivative Models of Pavlovian Reinforcement , 1990 .

[25]  Andrew G. Barto,et al.  Connectionist learning for control: an overview , 1990 .