论文信息 - OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion

OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion

Muscle-actuated control is a research topic of interest spanning different fields, in particular biomechanics, robotics and graphics. This type of control is particularly challenging because models are often overactuated, and dynamics are delayed and non-linear. It is however a very well tested and tuned actuation model that has undergone millions of years of evolution and that involves interesting properties exploiting passive forces of muscle-tendon units and efficient energy storage and release. To facilitate research on muscle-actuated simulation, we release a 3D musculoskeletal simulation of an ostrich based on the MuJoCo simulator. Ostriches are one of the fastest bipeds on earth and are therefore an excellent model for studying muscle-actuated bipedal locomotion. The model is based on CT scans and dissections used to gather actual muscle data such as insertion sites, lengths and pennation angles. Along with this model, we also provide a set of reinforcement learning tasks, including reference motion tracking and a reaching task with the neck. The reference motion data are based on motion capture clips of various behaviors which we pre-processed and adapted to our model. This paper describes how the model was built and iteratively improved using the tasks. We evaluate the accuracy of the muscle actuation patterns by comparing them to experimentally collected electromyographic data from locomoting birds. We believe that this work can be a useful bridge between the biomechanics, reinforcement learning, graphics and robotics communities, by providing a fast and easy to use simulation. Figure 1: The models, tasks and data are available at https://github.com/vittorione94/ ostrichrl and visualizations at https://sites.google.com/view/ostrichrl. ∗Equal contribution. Deep Reinforcement Learning Workshop, NeurIPS 2021. ar X iv :2 11 2. 06 06 1v 1 [ cs .R O ] 1 1 D ec 2 02 1

[1] Walter Herzog,et al. Musculoskeletal Simulation Tools for Understanding Mechanisms of Lower-Limb Sports Injuries. , 2019, Current sports medicine reports.

[2] Takanobu Tsuihiji,et al. Homologies of the longissimus, iliocostalis, and hypaxial muscles in the anterior presacral region of extant diapsida , 2007, Journal of morphology.

[3] Pooi See Lee,et al. Recent Progress in Artificial Muscles for Interactive Soft Robotics , 2020, Advanced materials.

[4] Raia Hadsell,et al. CoMic: Complementary Task Learning & Mimicry for Reusable Skills , 2020, ICML.

[5] Dmitry Akimov. Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation , 2019, ArXiv.

[6] Alec Jacobson,et al. EMu: Efficient Muscle Simulation In Deformation Space , 2020, ArXiv.

[7] R. McN. Alexander,et al. Mechanics of running of the ostrich (Struthio camelus) , 2009 .

[8] Eftychios Sifakis,et al. Dexterous manipulation and control with volumetric muscles , 2018, ACM Trans. Graph..

[9] Michiel van de Panne,et al. Learning locomotion skills using DeepRL: does the choice of action space matter? , 2016, Symposium on Computer Animation.

[10] Matthew W. Hoffman,et al. Distributed Distributional Deterministic Policy Gradients , 2018, ICLR.

[11] Aaron D. Ames,et al. Dynamic Walking with Compliance on a Cassie Bipedal Robot , 2019, 2019 18th European Control Conference (ECC).

[12] Libin Liu,et al. Learning basketball dribbling skills using trajectory optimization and deep reinforcement learning , 2018, ACM Trans. Graph..

[13] Jerry E. Pratt,et al. FastRunner: A fast, efficient and robust bipedal robot. Concept and planar simulation , 2012, 2012 IEEE International Conference on Robotics and Automation.

[14] Michiel van de Panne,et al. Learning Locomotion Skills for Cassie: Iterative Design and Sim-to-Real , 2019, CoRL.

[15] Roger Quinn,et al. A three-dimensional musculoskeletal model of the dog , 2020, Scientific Reports.

[16] Ayman Habib,et al. OpenSim: Simulating musculoskeletal dynamics and neuromuscular control to study human and animal movement , 2018, PLoS Comput. Biol..

[17] Marc G. Bellemare,et al. A Distributional Perspective on Reinforcement Learning , 2017, ICML.

[18] Yuval Tassa,et al. dm_control: Software and Tasks for Continuous Control , 2020, Softw. Impacts.

[19] C. Karen Liu,et al. Synthesis of biologically realistic human motion using joint torque actuation , 2019, ACM Trans. Graph..

[20] Perttu Hämäläinen,et al. Converting Biomechanical Models from OpenSim to MuJoCo , 2020, Biosystems & Biorobotics.

[21] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.

[22] Matthew Millard,et al. Flexing computational muscle: modeling and simulation of musculotendon dynamics. , 2013, Journal of biomechanical engineering.

[23] Jonas Rubenson,et al. Musculoskeletal modelling of an ostrich (Struthio camelus) pelvic limb: influence of limb orientation on muscular capacity during locomotion , 2015, PeerJ.

[24] Glen Berseth,et al. Feedback Control For Cassie With Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[25] S. Buss. Introduction to Inverse Kinematics with Jacobian Transpose , Pseudoinverse and Damped Least Squares methods , 2004 .

[26] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[27] Herke van Hoof,et al. Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.

[28] Andreas Christian,et al. Flexibility along the neck of the ostrich (Struthio camelus) and consequences for the reconstruction of dinosaurs with extreme neck length , 2007, Journal of morphology.

[29] Sergey Levine,et al. MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies , 2019, NeurIPS.

[30] Vladlen Koltun,et al. Optimizing locomotion controllers using biologically-based actuators and objectives , 2012, ACM Trans. Graph..

[31] Valentin Khrulkov,et al. Sample Efficient Ensemble Learning with Catalyst.RL , 2020, ArXiv.

[32] Sergey Levine,et al. DeepMimic , 2018, ACM Trans. Graph..

[33] Koushil Sreenath,et al. Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[34] Takanobu Tsuihiji,et al. Homologies of the transversospinalis muscles in the anterior presacral region of Sauria (crown Diapsida) , 2005, Journal of morphology.

[35] Jonas Rubenson,et al. Inferring muscle functional roles of the ostrich pelvic limb during walking and running using computer optimization , 2016, Journal of The Royal Society Interface.

[36] Koushil Sreenath,et al. Animated Cassie: A Dynamic Relatable Robotic Character , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[37] Jim R. Potvin,et al. A motor unit-based model of muscle fatigue , 2017, PLoS Comput. Biol..

[38] Anick Abourachid,et al. Gulper, ripper and scrapper: anatomy of the neck in three species of vultures , 2019, Journal of anatomy.

[39] F. Zajac. Muscle and tendon: properties, models, scaling, and application to biomechanics and motor control. , 1989, Critical reviews in biomedical engineering.

[40] Victor Ng-Thow-Hing,et al. A 3D interactive method for estimating body segmental parameters in animals: application to the turning and running performance of Tyrannosaurus rex. , 2007, Journal of theoretical biology.

[41] J. Hodgins,et al. Control strategies for physically simulated characters performing two-player competitive sports , 2021, ACM Transactions on Graphics.

[42] Alan M. Wilson,et al. Mechanics of cutting maneuvers by ostriches (Struthio camelus) , 2007, Journal of Experimental Biology.

[43] Vitaly Levdik,et al. Time Limits in Reinforcement Learning , 2017, ICML.

[44] KangKang Yin,et al. Discovering diverse athletic jumping strategies , 2021, ACM Trans. Graph..

[45] Sergey M. Plis,et al. Run, skeleton, run: skeletal model in a physics-based simulation , 2018, AAAI Spring Symposia.

[46] Fabio Pardo,et al. Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking , 2020, ArXiv.

[47] Brian Wyvill,et al. VIPER: Volume Invariant Position-based Elastic Rods , 2019, PACMCGIT.

[48] Yuval Tassa,et al. Learning human behaviors from motion capture by adversarial imitation , 2017, ArXiv.

[49] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[50] Taylor Apgar,et al. Fast Online Trajectory Optimization for the Bipedal Robot Cassie , 2018, Robotics: Science and Systems.

[51] John R Hutchinson,et al. Relating neuromuscular control to functional anatomy of limb muscles in extant archosaurs , 2019, Journal of morphology.

[52] M. V. D. Panne,et al. Sampling-based contact-rich motion control , 2010, ACM Trans. Graph..

[53] Kyoungmin Lee,et al. Scalable muscle-actuated human simulation and control , 2019, ACM Trans. Graph..

[54] Yee Whye Teh,et al. Neural probabilistic motor primitives for humanoid control , 2018, ICLR.

[55] Hao Tian,et al. Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion , 2019, ArXiv.

[56] L. Margetts,et al. Exploring Diagonal Gait Using a Forward Dynamic Three-Dimensional Chimpanzee Simulation , 2013, Folia Primatologica.

[57] Ayman Habib,et al. OpenSim: Open-Source Software to Create and Analyze Dynamic Simulations of Movement , 2007, IEEE Transactions on Biomedical Engineering.

[58] Alec Jacobson,et al. Interactive modelling of volumetric musculoskeletal anatomy , 2021, ACM Trans. Graph..

[59] Michiel van de Panne,et al. Flexible muscle-based locomotion for bipedal creatures , 2013, ACM Trans. Graph..

[60] Stefan Jeschke,et al. Physics-based motion capture imitation with deep reinforcement learning , 2018, MIG.

[61] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.