Learning Deep Nets for Gravitational Dynamics With Unknown Disturbance Through Physical Knowledge Distillation: Initial Feasibility Study

Learning high-performance deep neural networks for dynamic modeling of high Degree-Of-Freedom (DOF) robots remains challenging due to the sampling complexity. Typical unknown system disturbance caused by unmodeled dynamics (such as internal compliance, cables) further exacerbates the problem. In this letter, a novel framework characterized by both high data efficiency and disturbance-adapting capability is proposed to address the problem of modeling gravitational dynamics using deep nets in feedforward gravity compensation control for high-DOF master manipulators with unknown disturbance. In particular, Feedforward Deep Neural Networks (FDNNs) are learned from both prior knowledge of an existing analytical model and observation of the robot system by Knowledge Distillation (KD). Through extensive experiments in high-DOF master manipulators with significant disturbance, we show that our method surpasses a standard Learning-from-Scratch (LfS) approach in terms of data efficiency and disturbance adaptation. Our initial feasibility study has demonstrated the potential of outperforming the analytical teacher model as the training data increases.

[1]  J.M. Hollerbach,et al.  Identifying mass parameters for gravity compensation and automatic torque sensor calibration , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[2]  Bruno Siciliano,et al.  Modelling and identification of the da Vinci Research Kit robotic arms , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[3]  Jun Wu,et al.  Review: An overview of dynamic parameter identification of robots , 2010 .

[4]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[5]  Oliver Sawodny,et al.  Model-based feedforward position control of constant curvature continuum robots using feedback linearization , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[6]  Aude Billard,et al.  Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models , 2011, IEEE Transactions on Robotics.

[7]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[8]  Giorgio Metta,et al.  Incremental semiparametric inverse dynamics learning , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[9]  Yasin Almalioglu,et al.  Distilling Knowledge From a Deep Pose Regressor Network , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[10]  Jan Peters,et al.  Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning , 2019, ICLR.

[11]  Marc Toussaint,et al.  Learning Multiple Models of Non-linear Dynamics for Control Under Varying Contexts , 2006, ICANN.

[12]  Yan Wang,et al.  A Convex Optimization-Based Dynamic Model Identification Package for the da Vinci Research Kit , 2019, IEEE Robotics and Automation Letters.

[13]  Peter Kazanzides,et al.  Neural Network based Inverse Dynamics Identification and External Force Estimation on the da Vinci Research Kit , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[14]  G. Oriolo,et al.  Robotics: Modelling, Planning and Control , 2008 .

[15]  Dana Kulic,et al.  On-Line Dynamic Model Learning for Manipulator Control , 2012, SyRoCo.

[16]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[17]  Nicolas Schweighofer,et al.  Local Online Support Vector Regression for Learning Control , 2007, 2007 International Symposium on Computational Intelligence in Robotics and Automation.

[18]  Howie Choset,et al.  Design and architecture of the unified modular snake robot , 2012, 2012 IEEE International Conference on Robotics and Automation.

[19]  J. Kocijan,et al.  Gaussian process model based predictive control , 2004, Proceedings of the 2004 American Control Conference.

[20]  Sami Haddadin,et al.  First-order-principles-based constructive network topologies: An application to robot inverse dynamics , 2017, 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids).

[21]  Rich Caruana,et al.  Model compression , 2006, KDD '06.

[22]  Stefan Schaal,et al.  Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning , 2002, Applied Intelligence.

[23]  Daniela Rus,et al.  Model-based dynamic feedback control of a planar soft robot: trajectory tracking and interaction with the environment , 2020, Int. J. Robotics Res..

[24]  Jan Peters,et al.  Using model knowledge for learning inverse dynamics , 2010, 2010 IEEE International Conference on Robotics and Automation.

[25]  Peter Kazanzides,et al.  A Reliable Gravity Compensation Control Strategy for dVRK Robotic Arms With Nonlinear Disturbance Forces , 2019, IEEE Robotics and Automation Letters.

[26]  Wen Yu,et al.  PD control of robot with RBF networks compensation , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[27]  Athanasios S. Polydoros,et al.  Real-time deep learning of robotic manipulator inverse dynamics , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[28]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[29]  Jan Peters,et al.  Model learning for robot control: a survey , 2011, Cognitive Processing.

[30]  Stefan Schaal,et al.  A new data source for inverse dynamics learning , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[31]  Peter Kazanzides,et al.  An open-source research kit for the da Vinci® Surgical System , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[32]  Jan Peters,et al.  Learning inverse dynamics models in O(n) time with LSTM networks , 2017, 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids).

[33]  M. Jansen Learning an Accurate Neural Model of the Dynamics of a Typical Industrial Robot , 1994 .

[34]  Mitsuo Kawato,et al.  MOSAIC Model for Sensorimotor Learning and Control , 2001, Neural Computation.