Reinforcement Learning of Musculoskeletal Control from Functional Simulations

To diagnose, plan, and treat musculoskeletal pathologies, understanding and reproducing muscle recruitment for complex movements is essential. With muscle activations for movements often being highly redundant, nonlinear, and time dependent, machine learning can provide a solution for their modeling and control for anatomy-specific musculoskeletal simulations. Sophisticated biomechanical simulations often require specialized computational environments, being numerically complex and slow, hindering their integration with typical deep learning frameworks. In this work, a deep reinforcement learning (DRL) based inverse dynamics controller is trained to control muscle activations of a biomechanical model of the human shoulder. In a generalizable end-to-end fashion, muscle activations are learned given current and desired position-velocity pairs. A customized reward functions for trajectory control is introduced, enabling straightforward extension to additional muscles and higher degrees of freedom. Using the biomechanical model, multiple episodes are simulated on a cluster simultaneously using the evolving neural models of the DRL being trained. Results are presented for a single-axis motion control of shoulder abduction for the task of following randomly generated angular trajectories.

[1]  D. McAndrew,et al.  Muscles within muscles: Coordination of 19 muscle segments within three shoulder muscles during isometric motor tasks. , 2007, Journal of electromyography and kinesiology : official journal of the International Society of Electrophysiological Kinesiology.

[2]  Stephen James,et al.  3D Simulation for Robot Arm Control with Deep Q-Learning , 2016, ArXiv.

[3]  G. Giacomo,et al.  Atlas of functional shoulder anatomy , 2008 .

[4]  Takamitsu Matsubara,et al.  Deep reinforcement learning with smooth policy update: Application to robotic cloth manipulation , 2019, Robotics Auton. Syst..

[5]  Frans C. T. van der Helm,et al.  Modelling clavicular and scapular kinematics: from measurement to simulation , 2013, Medical & Biological Engineering & Computing.

[6]  Nick J. Little,et al.  Human evolution and tears of the rotator cuff , 2014, International Orthopaedics.

[7]  Sergey Levine,et al.  High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.

[8]  Tania Pizzari,et al.  Quantifying 'normal' shoulder muscle activity during abduction. , 2010, Journal of electromyography and kinesiology : official journal of the International Society of Electrophysiological Kinesiology.

[9]  S. Delp,et al.  A 3D model of muscle reveals the causes of nonuniform strains in the biceps brachii. , 2005, Journal of biomechanics.

[10]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[11]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[12]  Sergey Levine,et al.  Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning , 2018, ArXiv.

[13]  Jess G Snedeker,et al.  Supraspinatus tendon load during abduction is dependent on the size of the critical shoulder angle: A biomechanical analysis , 2014, Journal of orthopaedic research : official publication of the Orthopaedic Research Society.

[14]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[15]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[16]  Sidney Fels,et al.  ArtiSynth: A Fast Interactive Biomechanical Modeling Toolkit Combining Multibody and Finite Element Simulation , 2012 .

[17]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[18]  Amir H. Abdi,et al.  Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement Learning , 2018, ArXiv.

[19]  Orcun Goksel,et al.  Surface-based modeling of muscles: Functional simulation of the shoulder. , 2020, Medical engineering & physics.

[20]  Christian Duriez,et al.  SOFA: A Multi-Model Framework for Interactive Physical Simulation , 2012 .

[21]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[22]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[23]  Orcun Goksel,et al.  A comprehensive and volumetric musculoskeletal model for the dynamic simulation of the shoulder function , 2019, Computer methods in biomechanics and biomedical engineering.

[24]  Takuro Tamura,et al.  BodyParts3D: 3D structure database for anatomical concepts , 2008, Nucleic Acids Res..

[25]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[26]  J. W. Nieuwenhuis,et al.  Boekbespreking van D.P. Bertsekas (ed.), Dynamic programming and optimal control - volume 2 , 1999 .

[27]  Andrea Biscarini,et al.  Effects of scapular retraction/protraction position and scapular elevation on shoulder girdle muscle activity during glenohumeral abduction. , 2019, Human movement science.

[28]  Yuval Tassa,et al.  Emergence of Locomotion Behaviours in Rich Environments , 2017, ArXiv.

[29]  Christopher McCrum,et al.  Pectoralis major tendon transfer for the treatment of scapular winging due to long thoracic nerve palsy. , 2012, Journal of shoulder and elbow surgery.

[30]  Z. Artstein Discrete and Continuous Bang-Bang and Facial Spaces Or: Look for the Extreme Points , 1980 .

[31]  M. Spong,et al.  Robot Modeling and Control , 2005 .

[32]  Mark Halaki,et al.  Does supraspinatus initiate shoulder abduction? , 2013, Journal of electromyography and kinesiology : official journal of the International Society of Electrophysiological Kinesiology.

[33]  Ian Stavness,et al.  Automatic Prediction of Tongue Muscle Activations Using a Finite Element Model , 2022 .