论文信息 - Iterative Training of Dynamic Skills Inspired by Human Coaching Techniques

Iterative Training of Dynamic Skills Inspired by Human Coaching Techniques

Inspired by how humans learn dynamic motor skills through a progressive process of coaching and practices, we introduce an intuitive and interactive framework for developing dynamic controllers. The user only needs to provide a primitive initial controller and high-level, human-readable instructions as if s/he is coaching a human trainee, while the character has the ability to interpret the abstract instructions, accumulate the knowledge from the coach, and improve its skill iteratively. We introduce “control rigs” as an intermediate layer of control module to facilitate the mapping between high-level instructions and low-level control variables. Control rigs also utilize the human coach's knowledge to reduce the search space for control optimization. In addition, we develop a new sampling-based optimization method, Covariance Matrix Adaptation with Classification (CMA-C), to efficiently compute-control rig parameters. Based on the observation of human ability to “learn from failure”, CMA-C utilizes the failed simulation trials to approximate an infeasible region in the space of control rig parameters, resulting a faster convergence for the CMA optimization. We demonstrate the design process of complex dynamic controllers using our framework, including precision jumps, turnaround jumps, monkey vaults, drop-and-rolls, and wall-backflips.

[1] Steven Dubowsky,et al. A coordinated Jacobian transpose control for mobile multi-limbed robotic systems , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[2] David C. Brogan,et al. Animating human athletics , 1995, SIGGRAPH.

[3] Zoran Popovic,et al. Physically based motion transformation , 1999, SIGGRAPH.

[4] Jessica K. Hodgins,et al. Simulating leaping, tumbling, landing and balancing humans , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[5] Petros Faloutsos,et al. Composable controllers for physics-based character animation , 2001, SIGGRAPH.

[6] C. Karen Liu,et al. Synthesis of complex dynamic character motion from simple animations , 2002, ACM Trans. Graph..

[7] Nancy S. Pollard,et al. Efficient synthesis of physically valid human motion , 2003, ACM Trans. Graph..

[8] Jessica K. Hodgins,et al. Synthesizing physically realistic human motion in low-dimensional, behavior-specific spaces , 2004, ACM Trans. Graph..

[9] John Hart,et al. ACM Transactions on Graphics , 2004, SIGGRAPH 2004.

[10] Jessica K. Hodgins,et al. Synthesizing physically realistic human motion in low-dimensional, behavior-specific spaces , 2004, SIGGRAPH 2004.

[11] Nikolaus Hansen,et al. Evaluating the CMA Evolution Strategy on Multimodal Test Functions , 2004, PPSN.

[12] KangKang Yin,et al. SIMBICON: simple biped locomotion control , 2007, ACM Trans. Graph..

[13] Jehee Lee,et al. Simulating biped behaviors from human motion data , 2007, SIGGRAPH 2007.

[14] Kwang Won Sok,et al. Simulating biped behaviors from human motion data , 2007, ACM Trans. Graph..

[15] M. V. D. Panne,et al. SIMBICON: simple biped locomotion control , 2007, SIGGRAPH 2007.

[16] Philippe Beaudoin,et al. Continuation methods for adapting simulated skills , 2008, SIGGRAPH 2008.

[17] Philippe Beaudoin,et al. Continuation methods for adapting simulated skills , 2008, ACM Trans. Graph..

[18] David J. Fleet,et al. Optimizing walking controllers , 2009, SIGGRAPH 2009.

[19] David J. Fleet,et al. Optimizing walking controllers , 2009, ACM Trans. Graph..

[20] Frédo Durand,et al. Linear Bellman combination for control of character animation , 2009, SIGGRAPH 2009.

[21] Frédo Durand,et al. Linear Bellman combination for control of character animation , 2009, ACM Trans. Graph..

[22] David J. Fleet,et al. Optimizing walking controllers for uncertain inputs and environments , 2010, ACM Trans. Graph..

[23] M. van de Panne,et al. Generalized biped walking control , 2010, ACM Trans. Graph..

[24] Yoonsang Lee,et al. Data-driven biped control , 2010, ACM Trans. Graph..

[25] Jehee Lee,et al. Data-driven biped control , 2010, SIGGRAPH 2010.

[26] Aaron Hertzmann,et al. Robust physics-based locomotion using low-dimensional planning , 2010, SIGGRAPH 2010.

[27] Martin de Lasa,et al. Feature-based locomotion controllers , 2010, ACM Trans. Graph..

[28] M. V. D. Panne,et al. Sampling-based contact-rich motion control , 2010, ACM Trans. Graph..

[29] David J. Fleet,et al. Optimizing walking controllers for uncertain inputs and environments , 2010, SIGGRAPH 2010.

[30] Martin de Lasa,et al. Robust physics-based locomotion using low-dimensional planning , 2010, ACM Trans. Graph..

[31] Z. Popovic,et al. Terrain-adaptive bipedal locomotion control , 2010, ACM Trans. Graph..

[32] Philippe Beaudoin,et al. Generalized biped walking control , 2010, SIGGRAPH 2010.

[33] A. Karpathy,et al. Locomotion skills for simulated quadrupeds , 2011, SIGGRAPH 2011.

[34] Zoran Popovic,et al. Composite control of physically simulated characters , 2011, TOGS.

[35] Sehoon Ha,et al. Falling and landing motion control for character animation , 2012, ACM Trans. Graph..

[36] Baining Guo,et al. Terrain runner , 2012, ACM Trans. Graph..

[37] Vladlen Koltun,et al. Optimizing locomotion controllers using biologically-based actuators and objectives , 2012, ACM Trans. Graph..

[38] Aaron Hertzmann,et al. Trajectory Optimization for Full-Body Movements with Complex Contacts , 2013, IEEE Transactions on Visualization and Computer Graphics.