Learning from Reinforcement and Advice Using Composite Reward Functions
[1] John McCarthy, et al. Programs with common sense, 1960.
[2] Philip Klahr, et al. Advice-Taking and Knowledge Refinement: An Iterative View of Skill Acquisition, 1980.
[3] Jude W. Shavlik, et al. Incorporating Advice into Agents that Learn from Reinforcements, 1994, AAAI.
[4] R. A. Grupen, et al. Harmonic control (robot applications), 1992, Proceedings of the 1992 IEEE International Symposium on Intelligent Control.
[5] Roderic A. Grupen, et al. The applications of harmonic functions to robotics, 1993, J. Field Robotics.
[6] R. Grupen, et al. Harmonic Control, 1992.
[7] Andrew W. Moore, et al. Reinforcement Learning: A Survey, 1996, J. Artif. Intell. Res.
[8] Long Ji Lin, et al. Self-improving reactive agents based on reinforcement learning, planning and teaching, 1992, Machine Learning.
[9] Sandip Sen, et al. Evolution and learning in multiagent systems, 1998, Int. J. Hum. Comput. Stud.
[10] C. I. Connolly, et al. Applications of harmonic functions to robotics, 1992, Proceedings of the 1992 IEEE International Symposium on Intelligent Control.
[11] Sandip Sen. IJCAI-95 Workshop on Adaptation and Learning in Multiagent Systems, 1996.
[12] Ben J. A. Kröse, et al. Learning from delayed rewards, 1995, Robotics Auton. Syst.