Learning dynamic models for open loop predictive control of soft robotic manipulators.

The soft capabilities of biological appendages like the arms of Octopus vulgaris and elephants' trunks have inspired roboticists to develop their robotic equivalents. Although there have been considerable efforts to replicate their morphology and behavior patterns, we are still lagging behind in replicating the dexterity and efficiency of these biological systems. This is mostly due to the lack of development and application of dynamic controllers on these robots which could exploit the morphological properties that a soft-bodied manipulator possesses. The complexity of these high-dimensional nonlinear systems has deterred the application of traditional model-based approaches. This paper provides a machine learning-based approach for the development of dynamic models for a soft robotic manipulator and a trajectory optimization method for predictive control of the manipulator in task space. To the best of our knowledge this is the first demonstration of a learned dynamic model and a derived task space controller for a soft robotic manipulator. The validation of the controller is carried out on an octopus-inspired soft manipulator simulation derived from a piecewise constant strain approximation and then experimentally on a pneumatically actuated soft manipulator. The results indicate that such an approach is promising for developing fast and accurate dynamic models for soft robotic manipulators while being applicable on a wide range of soft manipulators.

[1]  Nicolas Schweighofer,et al.  Local Online Support Vector Regression for Learning Control , 2007, 2007 International Symposium on Computational Intelligence in Robotics and Automation.

[2]  Matteo Cianchetti,et al.  Learning Global Inverse Kinematics Solutions for a Continuum Robot , 2016 .

[3]  R. Fletcher Practical Methods of Optimization , 1988 .

[4]  Jan Peters,et al.  Model learning for robot control: a survey , 2011, Cognitive Processing.

[5]  Sergey Levine,et al.  Learning Complex Neural Network Policies with Trajectory Optimization , 2014, ICML.

[6]  Christopher D. Rahn,et al.  Geometrically exact dynamic models for soft robotic manipulators , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Lakmal Seneviratne,et al.  Screw-Based Modeling of Soft Manipulators With Tendon and Fluidic Actuation , 2017 .

[8]  Marc D. Killpack,et al.  A New Soft Robot Control Method: Using Model Predictive Control for a Pneumatically Actuated Humanoid , 2016, IEEE Robotics & Automation Magazine.

[9]  Jeff G. Schneider,et al.  Autonomous helicopter control using reinforcement learning policy search methods , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[10]  J. Kocijan,et al.  Gaussian process model based predictive control , 2004, Proceedings of the 2004 American Control Conference.

[11]  Mariangela Manti,et al.  Induced Vibrations of Soft Robotic Manipulators for Controller Design and Stiffness Estimation , 2018, 2018 7th IEEE International Conference on Biomedical Robotics and Biomechatronics (Biorob).

[12]  Carlos Canudas de Wit,et al.  Friction Models and Friction Compensation , 1998, Eur. J. Control.

[13]  Oliver Sawodny,et al.  Dynamic Control of the Bionic Handling Assistant , 2017, IEEE/ASME Transactions on Mechatronics.

[14]  Oliver Sawodny,et al.  Model-based feedforward position control of constant curvature continuum robots using feedback linearization , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Olivier Sigaud,et al.  On-line regression algorithms for learning mechanical models of robots: A survey , 2011, Robotics Auton. Syst..

[16]  D. Caleb Rucker,et al.  A Geometrically Exact Model for Externally Loaded Concentric-Tube Continuum Robots , 2010, IEEE Transactions on Robotics.

[17]  Michael I. Jordan,et al.  Forward dynamic models in human motor control: Psychophysical evidence , 1994, NIPS.

[18]  Ian D. Walker,et al.  A Neural Network Controller for Continuum Robots , 2007, IEEE Transactions on Robotics.

[19]  Frédéric Boyer,et al.  Poincaré’s Equations for Cosserat Media: Application to Shells , 2017, J. Nonlinear Sci..

[20]  Ian D. Walker,et al.  Task-space control of extensible continuum manipulators , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[21]  Daniela Rus,et al.  Dynamics and trajectory optimization for a soft spatial fluidic elastomer manipulator , 2016, Int. J. Robotics Res..

[22]  Guilherme De A. Barreto,et al.  Long-term time series prediction with the NARX network: An empirical evaluation , 2008, Neurocomputing.

[23]  Pieter Abbeel,et al.  An Application of Reinforcement Learning to Aerobatic Helicopter Flight , 2006, NIPS.

[24]  Mariangela Manti,et al.  Soft assistive robot for personal care of elderly people , 2016, 2016 6th IEEE International Conference on Biomedical Robotics and Biomechatronics (BioRob).

[25]  Athanasios S. Polydoros,et al.  A reservoir computing approach for learning forward dynamics of industrial manipulators , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[26]  Eugen Diaconescu,et al.  The use of NARX neural networks to predict chaotic time series , 2008 .

[27]  Lakmal Seneviratne,et al.  Discrete Cosserat approach for soft robot dynamics: A new piece-wise constant strain model with torsion and shears , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[28]  Jian Chen,et al.  A continuous asymptotic tracking control strategy for uncertain nonlinear systems , 2004, IEEE Transactions on Automatic Control.

[29]  J. M. Selig Geometric Fundamentals of Robotics , 2004, Monographs in Computer Science.

[30]  Reza Shadmehr,et al.  Evidence for a Forward Dynamics Model in Human Adaptive Motor Control , 1998, NIPS.

[31]  Christopher G. Atkeson,et al.  Nonparametric Model-Based Reinforcement Learning , 1997, NIPS.

[32]  Ben Tse,et al.  Autonomous Inverted Helicopter Flight via Reinforcement Learning , 2004, ISER.

[33]  Duy Nguyen-Tuong,et al.  Computed torque control with nonparametric regression models , 2008, 2008 American Control Conference.

[34]  Ian D. Walker,et al.  Empirical investigation of closed-loop control of extensible continuum manipulators , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[35]  Axel Gräser,et al.  Modelling and Identification for Control Design of Compliant Fluidic Actuators with Rotary Elastic Chambers: Hydraulic Case Study , 2008 .

[36]  Matteo Cianchetti,et al.  Learning Global Inverse Statics Solution for a Redundant Soft Robot , 2016, ICINCO.

[37]  Martijn Wisse,et al.  Open loop stable control in repetitive manipulation tasks , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[38]  Lakmal D. Seneviratne,et al.  Discrete Cosserat Approach for Multi-Section Soft Robots Dynamics , 2017, ArXiv.

[39]  S. W. Goode Differential Equations and Linear Algebra , 1999 .

[40]  Jun Morimoto,et al.  Nonparametric Representation of Policies and Value Functions: A Trajectory-Based Approach , 2002, NIPS.

[41]  Matteo Cianchetti,et al.  Dynamic Model of a Multibending Soft Robot Arm Driven by Cables , 2014, IEEE Transactions on Robotics.

[42]  Jan Peters,et al.  Model Learning with Local Gaussian Process Regression , 2009, Adv. Robotics.

[43]  Peter Englert,et al.  Multi-task policy search for robotics , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[44]  J. M. Selig Geometric Fundamentals of Robotics (Monographs in Computer Science) , 2004 .

[45]  Ian D. Walker,et al.  A model-based sliding mode controller for extensible continuum robots , 2010 .