Preparation behaviour synthesis with reinforcement learning

When humans perform a series of motions, they prepare for the next motion in advance so as to enhance the response time of their movements. This kind of preparation behaviour results in a natural and smooth transition of the overall movement. In this paper, we propose a new method to synthesize the behaviour using reinforcement learning. To create preparation movements, we propose a customized motion blending algorithm that is governed by a single numerical value, which we called the level of preparation. During the offline process, the system learns the optimal way to approach a target, as well as the realistic behaviour to prepare for interaction considering the level of preparation. At run-time, the trained controller indicates the character to move to a target with the appropriate level of preparation, resulting in human-like movements. We synthesized scenes in which the character has to move in a complex environment and interact with objects, such as a character crawling under and jumping over obstacles while walking. The method is useful not only for computer animation, but also for real-time applications such as computer games, in which the characters need to accomplish a series of tasks in a given environment.

[1]  Michael Gleicher,et al.  Retargetting motion to new characters , 1998, SIGGRAPH.

[2]  David A. Forsyth,et al.  Learning to move autonomously in a hostile world , 2005, SIGGRAPH '05.

[3]  Lucas Kovar,et al.  Motion graphs , 2002, SIGGRAPH Classes.

[4]  Daniel Thalmann,et al.  Integration of motion control techniques for virtual human and avatar real-time animation , 1997, VRST '97.

[5]  Alex M. Andrew,et al.  Reinforcement Learning: : An Introduction , 1998 .

[6]  Thore Graepel,et al.  LEARNING TO FIGHT , 2004 .

[7]  Tomohiko Mukai,et al.  Geostatistical motion interpolation , 2005, SIGGRAPH 2005.

[8]  Michael Gleicher,et al.  Constraint‐based motion adaptation , 1998 .

[9]  Mahesan Niranjan,et al.  On-line Q-learning using connectionist systems , 1994 .

[10]  Nancy S. Pollard,et al.  Efficient synthesis of physically valid human motion , 2003, ACM Trans. Graph..

[11]  Jessica K. Hodgins,et al.  Interactive control of avatars animated with human motion data , 2002, SIGGRAPH.

[12]  Manfred Lau,et al.  Behavior planning for character animation , 2005, SCA '05.

[13]  Taku Komura,et al.  Angular momentum guided motion concatenation , 2009, Comput. Animat. Virtual Worlds.

[14]  Nancy S. Pollard,et al.  Responsive characters from motion fragments , 2007, SIGGRAPH 2007.

[15]  J. Hodgins,et al.  Construction and optimal search of interpolated motion graphs , 2007, SIGGRAPH 2007.

[16]  Lucas Kovar,et al.  Automated extraction and parameterization of motions in large data sets , 2004, ACM Trans. Graph..

[17]  C. Karen Liu,et al.  Synthesis of complex dynamic character motion from simple animations , 2002, ACM Trans. Graph..

[18]  Hyun Joon Shin,et al.  Fat graphs: constructing an interactive character with continuous controls , 2006, SCA '06.

[19]  Aaron Hertzmann,et al.  Style machines , 2000, SIGGRAPH 2000.

[20]  Kari Pulli,et al.  Style translation for human motion , 2005, SIGGRAPH 2005.

[21]  Andrew P. Witkin,et al.  Spacetime constraints , 1988, SIGGRAPH.

[22]  Jehee Lee,et al.  Precomputing avatar behavior from human motion data , 2006, Graph. Model..

[23]  Taku Komura,et al.  Simulating interactions of avatars in high dimensional state space , 2008, I3D '08.

[24]  Taesoo Kwon,et al.  Motion modeling for on-line locomotion synthesis , 2005, SCA '05.

[25]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[26]  Taku Komura,et al.  Simulating Multiple Character Interactions with Collaborative and Adversarial Goals , 2012, IEEE Transactions on Visualization and Computer Graphics.

[27]  Michael Gleicher,et al.  Constraint-based motion adaptation , 1998, Comput. Animat. Virtual Worlds.

[28]  Bruno Arnaldi,et al.  Motion blending for real-time animation while accounting for the environment , 2004, Proceedings Computer Graphics International, 2004..

[29]  Zoran Popovic,et al.  Motion warping , 1995, SIGGRAPH.

[30]  Victor B. Zordan,et al.  Anticipation from example , 2007, VRST '07.

[31]  Thomas Jakobsen,et al.  Advanced Character Physics , 2003 .