论文信息 - Reinforcement learning - 字舞流文

Reinforcement learning

Andrew G. Barto | A. Barto

[1] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[2] Roderic A. Grupen,et al. Learning admittance mappings for force-guided assembly , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[3] K. P. Unnikrishnan,et al. Alopex: A Correlation-Based Learning Algorithm for Feedforward and Recurrent Neural Networks , 1994, Neural Computation.

[4] Jerry M. Mendel,et al. Reinforcement-learning control and pattern recognition systems , 1994 .

[5] Ron Meir,et al. A Parallel Gradient Descent Method for Learning in Analog VLSI Neural Networks , 1992, NIPS.

[6] Roderic A. Grupen,et al. Learning reactive admittance control , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[7] Vijaykumar Gullapalli,et al. A stochastic reinforcement learning algorithm for learning real-valued functions , 1990, Neural Networks.

[8] Michael I. Jordan,et al. Learning to Control an Unstable System with Forward Modeling , 1989, NIPS.

[9] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[10] P. Anandan,et al. Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[11] A G Barto,et al. Learning by statistical cooperation of self-interested neuron-like computing elements. , 1985, Human neurobiology.

[12] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[13] C. D. Gelatt,et al. Optimization by Simulated Annealing , 1983, Science.

[14] John H. Holland,et al. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[15] E Harth,et al. Alopex: a stochastic method for determining visual receptive fields. , 1974, Vision research.

[16] Bernard Widrow,et al. Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..

[17] A. H. Klopf,et al. Brain Function and Adaptive Systems: A Heterostatic Theory , 1972 .

[18] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..