How to train your dragon

Imaginary winged creatures in computer animation applications are expected to perform a variety of motor skills in a physically realistic and controllable manner. Designing physics-based controllers for flying creatures remains very challenging, particularly when the dynamic model is high-dimensional, with many degrees of freedom. In this paper, we present a control method for flying creatures that are aerodynamically simulated, interactively controllable, and equipped with a variety of motor skills such as soaring, gliding, hovering, and diving. Each motor skill is represented as a Deep Neural Network (DNN) and learned using Deep Q-Learning (DQL). Our control method is example-guided in the sense that it gives the user direct control over the learning process by allowing the user to specify keyframes of motor skills. Our learning algorithm is inspired by the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) to improve the convergence rate and the final quality of the control policy. The effectiveness of our evolutionary DQL method is demonstrated with imaginary winged creatures flying in a physically simulated environment, their motor skills learned automatically from user-provided keyframes.
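To make the combination of deep Q-learning and evolution strategies concrete, below is a minimal sketch in Python (NumPy only) of a CMA-ES-style outer loop over the weights of a small Q-network. Everything here is an illustrative assumption rather than the paper's actual system: the toy environment, network dimensions, reward, and hyperparameters are placeholders, and the covariance adaptation is simplified to a per-parameter (diagonal) spread instead of the full CMA-ES covariance matrix and step-size update.

```python
import numpy as np

# Sketch only (not the authors' implementation): an evolution-strategy
# outer loop over the flattened weights of a small Q-network, in the
# spirit of evolutionary DQL. Dimensions and rewards are placeholders.

rng = np.random.default_rng(0)

OBS_DIM, N_ACTIONS, HIDDEN = 8, 4, 16            # toy sizes (assumed)
N_PARAMS = OBS_DIM * HIDDEN + HIDDEN * N_ACTIONS

def q_values(theta, obs):
    """Two-layer Q-network whose parameters are one flat vector."""
    w1 = theta[:OBS_DIM * HIDDEN].reshape(OBS_DIM, HIDDEN)
    w2 = theta[OBS_DIM * HIDDEN:].reshape(HIDDEN, N_ACTIONS)
    return np.tanh(obs @ w1) @ w2

def episode_return(theta, steps=50):
    """Placeholder rollout: a real system would run the aerodynamic
    simulation and score how well the creature tracks the user's
    keyframes; here we use a dummy dynamics model and reward."""
    obs, total = rng.standard_normal(OBS_DIM), 0.0
    for _ in range(steps):
        action = int(np.argmax(q_values(theta, obs)))      # greedy action
        obs = 0.9 * obs + 0.1 * rng.standard_normal(OBS_DIM)
        total += -np.abs(obs).sum() + 0.1 * action         # dummy reward
    return total

# (mu, lambda) evolution strategy with a diagonal covariance -- a
# deliberate simplification of full CMA-ES covariance adaptation.
mean = 0.1 * rng.standard_normal(N_PARAMS)   # distribution mean
sigma = np.full(N_PARAMS, 0.1)               # per-parameter spread
LAM, MU = 16, 4                              # population and elite sizes

for gen in range(20):
    pop = mean + sigma * rng.standard_normal((LAM, N_PARAMS))
    fitness = np.array([episode_return(p) for p in pop])
    elite = pop[np.argsort(fitness)[-MU:]]   # keep the best MU samples
    mean = elite.mean(axis=0)                # recombine elites into new mean
    sigma = elite.std(axis=0) + 1e-3         # adapt per-parameter spread
    print(f"gen {gen:2d}  best return {fitness.max():.2f}")
```

A full CMA-ES variant would additionally adapt the sampling distribution's complete covariance matrix and a global step size from the ranked samples; the diagonal approximation above is only meant to convey the sample-rank-recombine loop that such an evolutionary wrapper adds around value-based policy learning.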
