Imitation Learning for Variable Speed Object Manipulation

To operate in a real-world environment, robots have several requirements including environmental adaptability. Moreover, the desired success rate for the completion of tasks must be achieved. In this regard, end-to-end learning for autonomous operation is currently being investigated. However, the issue of operating speed has not been investigated in detail. Therefore, in this paper, we propose a method for generating variable operating speeds while adapting to perturbations in the environment. When the work speed changes, there is a nonlinear relationship between the operating speed and force (e.g., inertial and frictional forces). However, the proposed method can be adapted to nonlinearities by utilizing minimal motion data. We experimentally evaluated the proposed method for erasing a line using an eraser fixed to the tip of a robot. Furthermore, the proposed method enables a robot to perform a task faster than a human operator.

[1]  Martin Jägersand,et al.  A Geometric Perspective on Visual Imitation Learning , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  Oliver Brock,et al.  Lessons from the Amazon Picking Challenge: Four Aspects of Building Robotic Systems , 2016, IJCAI.

[3]  Heinrich M. Jaeger,et al.  Universal robotic gripper based on the jamming of granular material , 2010, Proceedings of the National Academy of Sciences.

[4]  Kouhei Ohnishi,et al.  Multi-DOF Micro-Macro Bilateral Controller Using Oblique Coordinate Control , 2011, IEEE Transactions on Industrial Informatics.

[5]  Toshiaki Tsuji,et al.  Dynamic Object Manipulation Considering Contact Condition of Robot With Tool , 2016, IEEE Transactions on Industrial Electronics.

[6]  Kouhei Ohnishi,et al.  A Novel Motion Equation for General Task Description and Analysis of Mobile-Hapto , 2013, IEEE Transactions on Industrial Electronics.

[7]  Aude Billard,et al.  On Learning, Representing, and Generalizing a Task in a Humanoid Robot , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[8]  W. Marsden I and J , 2012 .

[9]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[10]  Toshiaki Tsuji,et al.  Bilateral Control Between Electric and Hydraulic Actuators Using Linearization of Hydraulic Actuators , 2017, IEEE Transactions on Industrial Electronics.

[11]  Jun Tani,et al.  Self-organization of behavioral primitives as multiple attractor dynamics: A robot experiment , 2003, IEEE Trans. Syst. Man Cybern. Part A.

[12]  Zhijun Zhang,et al.  Three Recurrent Neural Networks and Three Numerical Methods for Solving a Repetitive Motion Planning Scheme of Redundant Robot Manipulators , 2017, IEEE/ASME Transactions on Mechatronics.

[13]  Toshiaki Tsuji,et al.  Estimation and Kinetic Modeling of Human Arm using Wearable Robot Arm , 2017 .

[14]  Shigeki Sugano,et al.  Repeatable Folding Task by Humanoid Robot Worker Using Deep Learning , 2017, IEEE Robotics and Automation Letters.

[15]  Sergey Levine,et al.  Learning force-based manipulation of deformable objects from multiple demonstrations , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[16]  Maria Kyrarini,et al.  Robot learning of industrial assembly task via human demonstrations , 2019, Auton. Robots.

[17]  Mamoru Mitsuishi,et al.  Online Trajectory Planning and Force Control for Automation of Surgical Tasks , 2018, IEEE Transactions on Automation Science and Engineering.

[18]  Toshiaki Tsuji,et al.  Motion Generation Using Bilateral Control-Based Imitation Learning With Autoregressive Learning , 2021, IEEE Access.

[19]  Tsuyoshi Adachi,et al.  Imitation Learning for Object Manipulation Based on Position/Force Information Using Bilateral Control , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[20]  Ken Goldberg,et al.  Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation , 2017, ICRA.

[21]  Hermann Ney,et al.  From Feedforward to Recurrent LSTM Neural Networks for Language Modeling , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[22]  Toshiyuki Murakami,et al.  Torque sensorless control in multidegree-of-freedom manipulator , 1993, IEEE Trans. Ind. Electron..

[23]  Kensuke Harada,et al.  Deep Learning Scooping Motion Using Bilateral Teleoperations , 2018, 2018 3rd International Conference on Advanced Robotics and Mechatronics (ICARM).

[24]  W. Wong,et al.  On ψ-Learning , 2003 .

[25]  S. Katsura,et al.  Motion copying system based on real-world haptics in variable speed , 2008, 2008 13th International Power Electronics and Motion Control Conference.

[26]  Toshiaki Tsuji,et al.  Imitation Learning Based on Bilateral Control for Human–Robot Cooperation , 2020, IEEE Robotics and Automation Letters.

[27]  Toshiaki Tsuji,et al.  Time Series Motion Generation Considering Long Short-Term Motion , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[28]  Sergey Levine,et al.  Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..

[29]  Kouhei Ohnishi,et al.  Motion control for advanced mechatronics , 1996 .

[30]  Carme Torras,et al.  A robot learning from demonstration framework to perform force-based manipulation tasks , 2013, Intelligent Service Robotics.

[31]  Yu Zhang,et al.  Highway long short-term memory RNNS for distant speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32]  Leonel Rozo Interactive Trajectory Adaptation through Force-guided Bayesian Optimization , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[33]  Rouhollah Rahmatizadeh,et al.  From Virtual Demonstration to Real-World Manipulation Using LSTM and MDN , 2016, AAAI.

[34]  Darwin G. Caldwell,et al.  Learning optimal controllers in human-robot cooperative transportation tasks with position and force constraints , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[35]  Sergey Levine,et al.  Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning , 2019, CoRL.

[36]  Darwin G. Caldwell,et al.  Imitation Learning of Positional and Force Skills Demonstrated via Kinesthetic Teaching and Haptic Input , 2011, Adv. Robotics.