A New Autoregressive Neural Network Model with Command Compensation for Imitation Learning Based on Bilateral Control

In the near future, robots are expected to work with humans or operate alone and may replace human workers in various fields such as homes and factories. In a previous study, we proposed bilateral control-based imitation learning that enables robots to utilize force information and operate almost simultaneously with an expert's demonstration. In addition, we recently proposed an autoregressive neural network model (SM2SM) for bilateral control-based imitation learning to obtain long-term inferences. In the SM2SM model, both master and slave states must be input, but the master states are obtained from the previous outputs of the SM2SM model, resulting in destabilized estimation under large environmental variations. Hence, a new autoregressive neural network model (S2SM) is proposed in this study. This model requires only the slave state as input and its outputs are the next slave and master states, thereby improving the task success rates. In addition, a new feedback controller that utilizes the error between the responses and estimates of the slave is proposed, which shows better reproducibility.

[1]  Toshiaki Tsuji,et al.  Bilateral control using functional electrical stimulation , 2015, IECON 2015 - 41st Annual Conference of the IEEE Industrial Electronics Society.

[2]  Toshiaki Tsuji,et al.  Resonance-suppression Control for Electro-hydrostatic Actuator as Two-inertia System , 2017 .

[3]  Darwin G. Caldwell,et al.  Imitation Learning of Positional and Force Skills Demonstrated via Kinesthetic Teaching and Haptic Input , 2011, Adv. Robotics.

[4]  Kouhei Ohnishi,et al.  Oblique coordinate control for advanced motion control - Applied to micro-macro bilateral control - , 2009, 2009 IEEE International Conference on Mechatronics.

[5]  Toshiaki Tsuji,et al.  Imitation Learning Based on Bilateral Control for Human–Robot Cooperation , 2020, IEEE Robotics and Automation Letters.

[6]  W. Marsden I and J , 2012 .

[7]  Toshiaki Tsuji,et al.  Estimation and Kinetic Modeling of Human Arm using Wearable Robot Arm , 2017 .

[8]  Wojciech Zaremba,et al.  Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Toshiaki Tsuji,et al.  Motion Generation Using Bilateral Control-Based Imitation Learning With Autoregressive Learning , 2021, IEEE Access.

[10]  Ken Goldberg,et al.  Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation , 2017, ICRA.

[11]  Sergey Levine,et al.  Learning force-based manipulation of deformable objects from multiple demonstrations , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[12]  Toshiaki Tsuji,et al.  Time Series Motion Generation Considering Long Short-Term Motion , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Sergey Levine,et al.  Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..

[14]  Fuchun Sun,et al.  Survey of imitation learning for robotic manipulation , 2019, International Journal of Intelligent Robotics and Applications.

[15]  S. Levine,et al.  Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems , 2020, ArXiv.

[16]  Toshiyuki Murakami,et al.  Torque sensorless control in multidegree-of-freedom manipulator , 1993, IEEE Trans. Ind. Electron..

[17]  Toshiaki Tsuji,et al.  Bilateral Control Between Electric and Hydraulic Actuators Using Linearization of Hydraulic Actuators , 2017, IEEE Transactions on Industrial Electronics.

[18]  Tsuyoshi Adachi,et al.  Imitation Learning for Object Manipulation Based on Position/Force Information Using Bilateral Control , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19]  Feng Gao,et al.  Feeling the force: Integrating force and pose for fluent discovery through imitation learning to open medicine bottles , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[20]  Toshiaki Tsuji,et al.  Bilateral Control in the Vertical Direction Using Functional Electrical Stimulation , 2016 .

[21]  Jonatan S. Dyrstad,et al.  Teaching a Robot to Grasp Real Fish by Imitation Learning from a Human Supervisor in Virtual Reality , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[22]  Toshiaki Tsuji,et al.  Optimized Trajectory Generation based on Model Predictive Control for Turning Over Pancakes , 2018 .

[23]  Kouhei Ohnishi,et al.  Motion control for advanced mechatronics , 1996 .

[24]  Rajesh P. N. Rao,et al.  Robotic imitation from human motion capture using Gaussian processes , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..