Sequence-to-Sequence Model for Trajectory Planning of Nonprehensile Manipulation Including Contact Model

Nonprehensile manipulation is necessary for robots to operate in humans’ daily lives. As nonprehensile manipulation should satisfy both kinematics and dynamics requirements simultaneously, it is difficult to manipulate objects along given paths. Previous studies have considered the problems with sequence-to-sequence models, which are neural networks for time-series conversion. However, they did not consider nonlinear contact models, such as friction models. When we train the seq2seq models using end-to-end backpropagation, training losses vanish owing to static friction. In this letter, we realize sequence-to-sequence models for trajectory planning of nonprehensile manipulation including contact models between the robots and target objects. This letter proposes a training curriculum that commences training without contact models to bring the seq2seq models outside of the gradient-vanishing zone. This letter discusses sliding manipulation, which includes a friction model between objects and tools, such as frying pans fixed onto the robots. We validated the proposed curriculum through a simulation. In addition, we observed that the trained seq2seq models could handle parameter fluctuations that did not exist during training.

[1]  Rouhollah Rahmatizadeh,et al.  Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-to-End Learning from Demonstration , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Sandra Hirche,et al.  Online deformation of optimal trajectories for constrained nonprehensile manipulation , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[3]  Bowen Zhou,et al.  Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.

[4]  Nicholas Roy,et al.  Sliding Manipulation of Rigid Bodies on a Controlled 6-DoF Plate , 2012 .

[5]  Kevin M. Lynch,et al.  Dynamic Nonprehensile Manipulation: Controllability, Planning, and Experiments , 1999, Int. J. Robotics Res..

[6]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[7]  M.T. Mason,et al.  Dynamic manipulation , 1993, Proceedings of 1993 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '93).

[8]  Quoc V. Le,et al.  A Neural Conversational Model , 2015, ArXiv.

[9]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[10]  Kevin M. Lynch,et al.  Planning and control for dynamic, nonprehensile, and hybrid manipulation tasks , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[11]  Darwin G. Caldwell,et al.  Robot motor skill coordination with EM-based Reinforcement Learning , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[13]  Shigeki Sugano,et al.  Learning to Perceive the World as Probabilistic or Deterministic via Interaction With Others: A Neuro-Robotics Experiment , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Jitendra Malik,et al.  Learning to Poke by Poking: Experiential Learning of Intuitive Physics , 2016, NIPS.

[15]  Sandra Hirche,et al.  Fast trajectory replanning using Laplacian mesh optimization , 2012, 2012 12th International Conference on Control Automation Robotics & Vision (ICARCV).

[16]  Marcin Andrychowicz,et al.  Sim-to-Real Transfer of Robotic Control with Dynamics Randomization , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[17]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[18]  Tetsuya Ogata,et al.  Representation Learning of Logic Words by an RNN: From Word Sequences to Robot Actions , 2017, Front. Neurorobot..

[19]  F. Gers,et al.  Long short-term memory in recurrent neural networks , 2001 .

[20]  Toshiaki Tsuji,et al.  Sequence-to-sequence models for trajectory deformation of dynamic manipulation , 2017, IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society.

[21]  Kevin M. Lynch,et al.  Nonprehensile robotic manipulation: controllability and planning , 1996 .

[22]  Makoto Kaneko,et al.  Dynamic Manipulation Inspired by the Handling of a Pizza Peel , 2009, IEEE Transactions on Robotics.

[23]  Maria Bauza,et al.  A probabilistic data-driven model for planar pushing , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Quang-Cuong Pham,et al.  Dynamic non-prehensile object transportation , 2014, 2014 13th International Conference on Control Automation Robotics & Vision (ICARCV).

[25]  Toshiaki Tsuji,et al.  Dynamic Object Manipulation Considering Contact Condition of Robot With Tool , 2016, IEEE Transactions on Industrial Electronics.

[26]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[27]  Jian Shi,et al.  Dynamic In-Hand Sliding Manipulation , 2017, IEEE Trans. Robotics.