Customizing Scripted Bots: Sample Efficient Imitation Learning for Human-like Behavior in Minecraft

Although there have been many advances in machine learning and artificial intelligence, video games still define the behavior of characters through careful scripting. These scripted agents follow a fixed set of rules or behavior trees resulting in behavior which is predictable and discernibly different from human gameplay. We demonstrate how to combine imitation learning with scripted agents in order to efficiently train hierarchical policies. Integrating both programmed and learned components, these hierarchical polices improve upon the expressiveness of the original scripted agent, allowing more diverse and human-like behavior to emerge. Additionally, we demonstrate that learned policies also maintain behavior guarantees afforded by the scripted agent, allowing a developer to be confident in the behavior of the learned policy. We demonstrate this interplay between classical AI techniques and statistical machine learning through a case study in Minecraft, and explore open questions about the necessary and sufficient conditions to achieve a fruitful interplay between these two approaches.

[1]  Peter Stone,et al.  Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.

[2]  Peter Stone,et al.  Behavioral Cloning from Observation , 2018, IJCAI.

[3]  Neil Burch,et al.  Heads-up limit hold’em poker is solved , 2015, Science.

[4]  Darryl Charles,et al.  Machine learning in digital games: a survey , 2008, Artificial Intelligence Review.

[5]  David M. Bourg,et al.  Al for game developers - creating intelligent behavior in games , 2004 .

[6]  Noah A. Smith,et al.  Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2016, ACL 2016.

[7]  Nan Jiang,et al.  Hierarchical Imitation and Reinforcement Learning , 2018, ICML.

[8]  Peter Stone,et al.  Generative Adversarial Imitation from Observation , 2018, ArXiv.

[9]  Eric O. Postma,et al.  Adaptive game AI with dynamic scripting , 2006, Machine Learning.

[10]  Jianfeng Gao,et al.  Deep Reinforcement Learning with a Natural Language Action Space , 2015, ACL.

[11]  Martin Černý,et al.  Using Behavior Objects to Manage Complexity in Virtual Worlds , 2017, IEEE Transactions on Computational Intelligence and AI in Games.

[12]  Christopher De Sa,et al.  Data Programming: Creating Large Training Sets, Quickly , 2016, NIPS.

[13]  Demis Hassabis,et al.  A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play , 2018, Science.

[14]  Steven Rabin,et al.  Game AI Pro 2: Collected Wisdom of Game AI Professionals , 2013 .

[15]  Sergey Levine,et al.  Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[16]  Katja Hofmann,et al.  The Malmo Platform for Artificial Intelligence Experimentation , 2016, IJCAI.