论文信息 - Reinforcement Learning and the Reward Engineering Principle

Reinforcement Learning and the Reward Engineering Principle

AI agents are becoming significantly more general and autonomous. We argue for the “Reward Engineering Principle”: as reinforcement-learning-based AI systems, become more general and autonomous, the design of reward mechanisms that elicit desired behaviours becomes both more important and more difficult. While early AI research could ignore reward design and focus solely on the problems of efficient, flexible, and effective achievement of arbitrary goals in varied environments, the reward engineering principle will affect modern AI research, both theoretical and applied, in the medium and long terms. We introduce some notation and derive preliminary results that formalize the intuitive landmarks of the area of reward design.

Daniel Dewey | Dan Dewey

[1] Nils J. Nilsson,et al. Artificial Intelligence , 1974, IFIP Congress.

[2] Stuart J. Russell. Rationality and Intelligence , 1995, IJCAI.

[3] Shane Legg,et al. Universal Intelligence: A Definition of Machine Intelligence , 2007, Minds and Machines.

[4] S. Legg. Machine super intelligence , 2008 .

[5] I. Arel. The Threat of a Reward-Driven Adversarial Artificial General Intelligence , 2012 .

[6] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[7] Jing Meng,et al. Abrupt rise of new machine ecology beyond human response time , 2013, Scientific Reports.