Paper information - DReCon: data-driven responsive control of physics-based characters

Interactive control of self-balancing, physically simulated humanoids is a long-standing problem in the field of real-time character animation. While physical simulation guarantees realistic interactions in the virtual world, simulated characters can appear unnatural if they perform unusual movements in order to maintain balance. Therefore, obtaining a high level of responsiveness to user control, runtime performance, and diversity has often been overlooked in exchange for motion quality. Recent work in the field of deep reinforcement learning has shown that training physically simulated characters to follow motion capture clips can yield high-quality tracking results. We propose a two-step approach for building responsive simulated character controllers from unstructured motion capture data. First, meaningful features from the data, such as movement direction, heading direction, speed, and locomotion style, are interactively specified and drive a kinematic character controller implemented using motion matching. Second, reinforcement learning is used to train a simulated character controller that is general enough to track the entire distribution of motion that can be generated by the kinematic controller. Our design emphasizes responsiveness to user input, visual quality, and low runtime cost for application in video games.
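
The first step of the approach relies on motion matching, which can be read as a nearest-neighbor search over per-frame feature vectors. The sketch below is a minimal illustration of that lookup only, not the authors' implementation: the feature layout, the toy `DATABASE`, and the function name `motion_match` are all assumptions made for the example, and the reinforcement learning step that tracks the resulting motion is not shown.

```python
def motion_match(database, query, weights=None):
    """Return the index of the database frame whose features best match the query.

    The cost is a weighted squared Euclidean distance between feature vectors.
    """
    if weights is None:
        weights = [1.0] * len(query)
    best_index, best_cost = -1, float("inf")
    for index, feature in enumerate(database):
        cost = sum(w * (f - q) ** 2 for w, f, q in zip(weights, feature, query))
        if cost < best_cost:
            best_index, best_cost = index, cost
    return best_index


# Hypothetical feature layout (an assumption for this sketch):
# [velocity_x, velocity_z, heading_x, heading_z, speed]
DATABASE = [
    [0.0, 0.0, 0.0, 1.0, 0.0],  # standing
    [0.0, 1.5, 0.0, 1.0, 1.5],  # walking forward
    [0.0, 4.0, 0.0, 1.0, 4.0],  # running forward
    [1.5, 0.0, 1.0, 0.0, 1.5],  # walking to the right
]

# The user asks for fast forward movement; the closest frame is the running one.
desired = [0.0, 3.6, 0.0, 1.0, 3.6]
best_frame = motion_match(DATABASE, desired)
```

In a real controller the query would be rebuilt from user input at a fixed rate, and the matched frame would supply the target pose that the simulated character is trained to track.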

J. Forbes | Daniel Holden | Kevin Bergamin | Simon Clavet

[1] Jessica K. Hodgins, et al. Animation of dynamic legged locomotion, 1991, SIGGRAPH.

[2] Thomas Schiex, et al. Soft Constraints, 2000, Handbook of Constraint Programming.

[3] Jessica K. Hodgins, et al. Motion capture-driven simulations that hit and react, 2002, SCA '02.

[4] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.

[5] Nikolaus Hansen, et al. The CMA Evolution Strategy: A Comparing Review, 2006, Towards a New Evolutionary Computation.

[6] KangKang Yin, et al. SIMBICON: simple biped locomotion control, 2007, ACM Trans. Graph.

[7] Jehee Lee, et al. Simulating biped behaviors from human motion data, 2007, SIGGRAPH 2007.

[8] Kwang Won Sok, et al. Simulating biped behaviors from human motion data, 2007, ACM Trans. Graph.

[9] M. V. D. Panne, et al. SIMBICON: simple biped locomotion control, 2007, SIGGRAPH 2007.

[10] Jovan Popovic, et al. Simulation of Human Motion Data using Short-Horizon Model-Predictive Control, 2008, Comput. Graph. Forum.

[11] Victor B. Zordan, et al. Momentum control for balance, 2009, ACM Trans. Graph.

[12] Zoran Popovic, et al. Contact-aware nonlinear control of dynamic characters, 2009, ACM Trans. Graph.

[13] Zoran Popović, et al. Contact-aware nonlinear control of dynamic characters, 2009, SIGGRAPH 2009.

[14] M. van de Panne, et al. Generalized biped walking control, 2010, ACM Trans. Graph.

[15] Yoonsang Lee, et al. Data-driven biped control, 2010, ACM Trans. Graph.

[16] Tianjia Shao, et al. Sampling-based contact-rich motion control, 2010, SIGGRAPH 2010.

[17] M. V. D. Panne, et al. Sampling-based contact-rich motion control, 2010, ACM Trans. Graph.

[18] C. Karen Liu, et al. Modal-space control for articulated characters, 2011, TOGS.

[19] Yuval Tassa, et al. Synthesis and stabilization of complex behaviors through online trajectory optimization, 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20] Eva Ostertagová, et al. Forecasting Using Simple Exponential Smoothing Method, 2012.

[21] Nicolas Pronost, et al. Interactive Character Animation Using Simulated Physics: A State-of-the-Art Review, 2012, Comput. Graph. Forum.

[22] Victor B. Zordan, et al. Control of rotational dynamics for ground behaviors, 2013, SCA '13.

[23] Michiel van de Panne, et al. Flexible muscle-based locomotion for bipedal creatures, 2013, ACM Trans. Graph.

[24] Taesoo Kwon, et al. Locomotion control for many-muscle humanoids, 2014, ACM Trans. Graph.

[25] Libin Liu, et al. Learning reduced-order feedback policies for motion skills, 2015, Symposium on Computer Animation.

[26] Glen Berseth, et al. Dynamic terrain traversal skills using reinforcement learning, 2015, ACM Trans. Graph.

[27] Baining Guo, et al. Improving Sampling-based Motion Control, 2015, Comput. Graph. Forum.

[28] Erwin Coumans, et al. Bullet physics simulation, 2015, SIGGRAPH Courses.

[29] Libin Liu, et al. Guided Learning of Control Graphs for Physics-Based Characters, 2016, ACM Trans. Graph.

[30] Glen Berseth, et al. Terrain-adaptive locomotion skills using deep reinforcement learning, 2016, ACM Trans. Graph.

[31] Wojciech Zaremba, et al. OpenAI Gym, 2016, ArXiv.

[32] Joaquim B. Cavalcante Neto, et al. Tunable Robustness: An Artificial Contact Strategy with Virtual Actuator Control for Balance, 2017, Comput. Graph. Forum.

[33] Taku Komura, et al. Phase-functioned neural networks for character control, 2017, ACM Trans. Graph.

[34] David Bollo. High performance animation in Gears of War 4, 2017, SIGGRAPH Talks.

[35] Glen Berseth, et al. DeepLoco: dynamic locomotion skills using hierarchical deep reinforcement learning, 2017, ACM Trans. Graph.

[36] Yuval Tassa, et al. Emergence of Locomotion Behaviours in Rich Environments, 2017, ArXiv.

[37] Alec Radford, et al. Proximal Policy Optimization Algorithms, 2017, ArXiv.

[38] J. Hodgins, et al. Learning to Schedule Control Fragments for Physics-Based Characters Using Deep Q-Learning, 2017, ACM Trans. Graph.

[39] Daniel Holden, et al. Robust solving of optical motion capture data by denoising, 2018, ACM Trans. Graph.

[40] Libin Liu, et al. Learning basketball dribbling skills using trajectory optimization and deep reinforcement learning, 2018, ACM Trans. Graph.

[41] Sergey Levine, et al. DeepMimic, 2018, ACM Trans. Graph.

[42] Stefan Jeschke, et al. Physics-based motion capture imitation with deep reinforcement learning, 2018, MIG.

[43] Dieter Fox, et al. GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning, 2018, CoRL.

[44] Sunmin Lee, et al. Learning predict-and-simulate policies from unorganized human motion data, 2019, ACM Trans. Graph.