Enhancing the performance of adaptive iterative learning control with reinforcement learning

In this study we propose a new method to enhance the performance of iterative learning control (ILC). We focus on robotic tasks dealing with adaptation to the unknown or partially known environment, where the robot has to learn the environment geometry in order to perform the desired task with the given reference forces and torques. The initial motion trajectories are obtained by kinesthetic teaching, whereas the required forces and torques are prescribed by the task. We are interested in incremental learning, which assures smooth and safe operation, aiming at handling of delicate, fragile objects, such as objects made of glass. In order to achieve these goals we propose a new adaptive ILC scheme, where the adaptation is supervised by reinforcement learning. We also show how to apply ILC to orientational motion, taking into account the curved geometry of SO(3). The performance of the proposed algorithm is verified on a bi-manual glass wiping task.

[1]  Luigi Villani,et al.  Force Control , 2021, Springer Handbook of Robotics, 2nd Ed..

[2]  Sebastian Thrun,et al.  Lifelong robot learning , 1993, Robotics Auton. Syst..

[3]  Fengfeng Xi,et al.  Iterative Learning Control With Switching Gain Feedback for Nonlinear Systems , 2011 .

[4]  Stefan Schaal,et al.  A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[5]  Mikael Norrlöf,et al.  An adaptive iterative learning control algorithm with experiments on an industrial robot , 2002, IEEE Trans. Robotics Autom..

[6]  Ales Ude,et al.  Force adaptation with recursive regression Iterative Learning Controller , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7]  Jan Peters,et al.  Noname manuscript No. (will be inserted by the editor) Policy Search for Motor Primitives in Robotics , 2022 .

[8]  Seul Jung,et al.  Force tracking impedance control of robot manipulators under unknown environment , 2004, IEEE Transactions on Control Systems Technology.

[9]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[10]  Ales Ude,et al.  Filtering in a unit quaternion space for model-based object tracking , 1999, Robotics Auton. Syst..

[11]  Olivier Sigaud,et al.  Path Integral Policy Improvement with Covariance Matrix Adaptation , 2012, ICML.

[12]  Stefan Schaal,et al.  Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning , 2002, Applied Intelligence.

[13]  Pasquale Chiacchio,et al.  Task-space regulation of cooperative manipulators , 2000, Autom..

[14]  Bruno Siciliano,et al.  A survey of robot interaction control schemes with experimental comparison , 1999 .

[15]  Joerg F. Hipp,et al.  Time-Frequency Analysis , 2014, Encyclopedia of Computational Neuroscience.

[16]  Jun Morimoto,et al.  Task-Specific Generalization of Discrete and Periodic Dynamic Movement Primitives , 2010, IEEE Transactions on Robotics.

[17]  Jan Peters,et al.  Model learning for robot control: a survey , 2011, Cognitive Processing.

[18]  Giovanni Ulivi,et al.  A frequency-domain approach to learning control: implementation for a robot manipulator , 1992, IEEE Trans. Ind. Electron..

[19]  Nikos A. Aspragathos,et al.  Online Stability in Human-Robot Cooperation with Admittance Control , 2016, IEEE Transactions on Haptics.

[20]  N. Hogan,et al.  Impedance and Interaction Control , 2018 .

[21]  A.G. Alleyne,et al.  A survey of iterative learning control , 2006, IEEE Control Systems.

[22]  Xiao-gang Jia,et al.  Adaptive iterative learning control for robot manipulators , 2010, 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[23]  Sungchul Kang,et al.  Frequency Domain Stability Observer and Active Damping Control for Stable Haptic Interaction , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[24]  Jimmy A. Jørgensen,et al.  Adaptation of manipulation skills in physical contact with the environment to reference force profiles , 2015, Auton. Robots.

[25]  YangQuan Chen,et al.  Iterative Learning Control: A Tutorial and Big Picture View , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[26]  Bojan Nemec,et al.  Comparison of null-space and minimal null-space control algorithms , 2007, Robotica.

[27]  Blake Hannaford,et al.  Time domain passivity control of haptic interfaces , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[28]  Jun Nakanishi,et al.  Operational Space Control: A Theoretical and Empirical Comparison , 2008, Int. J. Robotics Res..

[29]  Oussama Khatib,et al.  A unified approach for motion and force control of robot manipulators: The operational space formulation , 1987, IEEE J. Robotics Autom..

[30]  Svante Gunnarsson,et al.  Experimental comparison of some classical iterative learning control algorithms , 2002, IEEE Trans. Robotics Autom..

[31]  Andrej Gams,et al.  Bimanual human robot cooperation with adaptive stiffness control , 2016, 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids).

[32]  Madan M. Gupta,et al.  An adaptive switching learning control method for trajectory tracking of robot manipulators , 2006 .

[33]  Jun Morimoto,et al.  Orientation in Cartesian space dynamic movement primitives , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).