论文信息 - Motion Generation Using Bilateral Control-Based Imitation Learning With Autoregressive Learning

Motion Generation Using Bilateral Control-Based Imitation Learning With Autoregressive Learning

Robots that can execute various tasks automatically on behalf of humans are becoming an increasingly important focus of research in the field of robotics. Imitation learning has been studied as an efficient and high-performance method, and imitation learning based on bilateral control has been proposed as a method that can realize fast motion. However, because this method cannot implement autoregressive learning, this method may not generate desirable long-term behavior. Therefore, in this paper, we propose a method of autoregressive learning for bilateral control-based imitation learning. A new neural network model for implementing autoregressive learning is proposed. In this study, three types of experiments are conducted to verify the effectiveness of the proposed method. The performance is improved compared to conventional approaches; the proposed method has the highest rate of success. Owing to the structure and autoregressive learning of the proposed model, the proposed method can generate the desirable motion for successful tasks and have a high generalization ability for environmental changes.

[1] Ken Goldberg,et al. Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation , 2017, ICRA.

[2] K. Ohnishi,et al. Reproducibility and operationality in bilateral teleoperation , 2004, The 8th IEEE International Workshop on Advanced Motion Control, 2004. AMC '04..

[3] Sonia Chernova,et al. Recent Advances in Robot Learning from Demonstration , 2020, Annu. Rev. Control. Robotics Auton. Syst..

[4] Toshiaki Tsuji,et al. Bilateral Control in the Vertical Direction Using Functional Electrical Stimulation , 2016 .

[5] Darwin G. Caldwell,et al. Imitation Learning of Positional and Force Skills Demonstrated via Kinesthetic Teaching and Haptic Input , 2011, Adv. Robotics.

[6] Darwin G. Caldwell,et al. Learning optimal controllers in human-robot cooperative transportation tasks with position and force constraints , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.

[8] Darwin G. Caldwell,et al. Upper-body kinesthetic teaching of a free-standing humanoid robot , 2011, 2011 IEEE International Conference on Robotics and Automation.

[9] Tetsuya Ogata,et al. Learning Multiple Sensorimotor Units to Complete Compound Tasks using an RNN with Multiple Attractors , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[10] Tsuyoshi Adachi,et al. Imitation Learning for Object Manipulation Based on Position/Force Information Using Bilateral Control , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11] Toshiyuki Murakami,et al. Control Structure Determination of Bilateral System based on Reproducibility and Operationality , 2019 .

[12] Carme Torras,et al. A robot learning from demonstration framework to perform force-based manipulation tasks , 2013, Intelligent Service Robotics.

[13] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[14] Yoshua Bengio,et al. Professor Forcing: A New Algorithm for Training Recurrent Networks , 2016, NIPS.

[15] Toshiaki Tsuji,et al. Imitation Learning Based on Bilateral Control for Human–Robot Cooperation , 2020, IEEE Robotics and Automation Letters.

[16] Sergey Levine,et al. One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning , 2018, Robotics: Science and Systems.

[17] Shigeki Sugano,et al. Repeatable Folding Task by Humanoid Robot Worker Using Deep Learning , 2017, IEEE Robotics and Automation Letters.

[18] Yang Feng,et al. Bridging the Gap between Training and Inference for Neural Machine Translation , 2019, ACL.

[19] Rouhollah Rahmatizadeh,et al. Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-to-End Learning from Demonstration , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[20] Heni Ben Amor,et al. A system for learning continuous human-robot interactions from human-human demonstrations , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[21] Alberto Montebelli,et al. Incrementally assisted kinesthetic teaching for programming by demonstration , 2016, 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[22] Toshiyuki Murakami,et al. Torque sensorless control in multidegree-of-freedom manipulator , 1993, IEEE Trans. Ind. Electron..

[23] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[24] Sergey Levine,et al. Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning , 2019, CoRL.

[25] Sergey Levine,et al. Learning force-based manipulation of deformable objects from multiple demonstrations , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[26] Kensuke Harada,et al. Deep Learning Scooping Motion Using Bilateral Teleoperations , 2018, 2018 3rd International Conference on Advanced Robotics and Mechatronics (ICARM).

[27] Toshiaki Tsuji,et al. Estimation and Kinetic Modeling of Human Arm using Wearable Robot Arm , 2017 .

[28] Fuchun Sun,et al. Survey of imitation learning for robotic manipulation , 2019, International Journal of Intelligent Robotics and Applications.

[29] Jitendra Malik,et al. Learning to Poke by Poking: Experiential Learning of Intuitive Physics , 2016, NIPS.

[30] Kouhei Ohnishi,et al. Multi-DOF Micro-Macro Bilateral Controller Using Oblique Coordinate Control , 2011, IEEE Transactions on Industrial Informatics.

[31] Yifan Xu,et al. Rethinking Exposure Bias In Language Modeling , 2019, ArXiv.

[32] Kouhei Ohnishi,et al. Motion control for advanced mechatronics , 1996 .