论文信息 - Preparation behaviour synthesis with reinforcement learning

Preparation behaviour synthesis with reinforcement learning

When humans perform a series of motions, they prepare for the next motion in advance so as to enhance the response time of their movements. This kind of preparation behaviour results in a natural and smooth transition of the overall movement. In this paper, we propose a new method to synthesize the behaviour using reinforcement learning. To create preparation movements, we propose a customized motion blending algorithm that is governed by a single numerical value, which we called the level of preparation. During the offline process, the system learns the optimal way to approach a target, as well as the realistic behaviour to prepare for interaction considering the level of preparation. At run-time, the trained controller indicates the character to move to a target with the appropriate level of preparation, resulting in human-like movements. We synthesized scenes in which the character has to move in a complex environment and interact with objects, such as a character crawling under and jumping over obstacles while walking. The method is useful not only for computer animation, but also for real-time applications such as computer games, in which the characters need to accomplish a series of tasks in a given environment.

[1] Michael Gleicher,et al. Retargetting motion to new characters , 1998, SIGGRAPH.

[2] David A. Forsyth,et al. Learning to move autonomously in a hostile world , 2005, SIGGRAPH '05.

[3] Lucas Kovar,et al. Motion graphs , 2002, SIGGRAPH Classes.

[4] Daniel Thalmann,et al. Integration of motion control techniques for virtual human and avatar real-time animation , 1997, VRST '97.

[5] Alex M. Andrew,et al. Reinforcement Learning: : An Introduction , 1998 .

[6] Thore Graepel,et al. LEARNING TO FIGHT , 2004 .

[7] Tomohiko Mukai,et al. Geostatistical motion interpolation , 2005, SIGGRAPH 2005.

[8] Michael Gleicher,et al. Constraint‐based motion adaptation , 1998 .

[9] Mahesan Niranjan,et al. On-line Q-learning using connectionist systems , 1994 .

[10] Nancy S. Pollard,et al. Efficient synthesis of physically valid human motion , 2003, ACM Trans. Graph..

[11] Jessica K. Hodgins,et al. Interactive control of avatars animated with human motion data , 2002, SIGGRAPH.

[12] Manfred Lau,et al. Behavior planning for character animation , 2005, SCA '05.

[13] Taku Komura,et al. Angular momentum guided motion concatenation , 2009, Comput. Animat. Virtual Worlds.

[14] Nancy S. Pollard,et al. Responsive characters from motion fragments , 2007, SIGGRAPH 2007.

[15] J. Hodgins,et al. Construction and optimal search of interpolated motion graphs , 2007, SIGGRAPH 2007.

[16] Lucas Kovar,et al. Automated extraction and parameterization of motions in large data sets , 2004, ACM Trans. Graph..

[17] C. Karen Liu,et al. Synthesis of complex dynamic character motion from simple animations , 2002, ACM Trans. Graph..

[18] Hyun Joon Shin,et al. Fat graphs: constructing an interactive character with continuous controls , 2006, SCA '06.

[19] Aaron Hertzmann,et al. Style machines , 2000, SIGGRAPH 2000.

[20] Kari Pulli,et al. Style translation for human motion , 2005, SIGGRAPH 2005.

[21] Andrew P. Witkin,et al. Spacetime constraints , 1988, SIGGRAPH.

[22] Jehee Lee,et al. Precomputing avatar behavior from human motion data , 2006, Graph. Model..

[23] Taku Komura,et al. Simulating interactions of avatars in high dimensional state space , 2008, I3D '08.

[24] Taesoo Kwon,et al. Motion modeling for on-line locomotion synthesis , 2005, SCA '05.

[25] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[26] Taku Komura,et al. Simulating Multiple Character Interactions with Collaborative and Adversarial Goals , 2012, IEEE Transactions on Visualization and Computer Graphics.

[27] Michael Gleicher,et al. Constraint-based motion adaptation , 1998, Comput. Animat. Virtual Worlds.

[28] Bruno Arnaldi,et al. Motion blending for real-time animation while accounting for the environment , 2004, Proceedings Computer Graphics International, 2004..

[29] Zoran Popovic,et al. Motion warping , 1995, SIGGRAPH.

[30] Victor B. Zordan,et al. Anticipation from example , 2007, VRST '07.

[31] Thomas Jakobsen,et al. Advanced Character Physics , 2003 .