Graph-based Task-specific Prediction Models for Interactions between Deformable and Rigid Objects

Capturing scene dynamics and predicting the future scene state is challenging but essential for robotic manipulation tasks, especially when the scene contains both rigid and deformable objects. In this work, we contribute a simulation environment and generate a novel dataset for task-specific manipulation, involving interactions between rigid objects and a deformable bag. The dataset incorporates a rich variety of scenarios including different object sizes, object numbers and manipulation actions. We approach dynamics learning by proposing an object-centric graph representation and two modules which are Active Prediction Module (APM) and Position Prediction Module (PPM) based on graph neural networks with an encode-process-decode architecture. At the inference stage, we build a two-stage model based on the learned modules for single time step prediction. We combine modules with different prediction horizons into a mixed-horizon model which addresses long-term prediction. In an ablation study, we show the benefits of the two-stage model for single time step prediction and the effectiveness of the mixed-horizon model for long-term prediction tasks. Supplementary material is available at https://github.com/wengzehang/deformable_rigid_interaction_prediction

[1]  Erkut Erdem,et al.  Belief Regulated Dual Propagation Nets for Learning Action Effects on Groups of Articulated Objects , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Yuval Tassa,et al.  Simulation tools for model-based robotics: Comparison of Bullet, Havok, MuJoCo, ODE and PhysX , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[3]  Jitendra Malik,et al.  Learning to Poke by Poking: Experiential Learning of Intuitive Physics , 2016, NIPS.

[4]  Jiajun Wu,et al.  Learning Compositional Koopman Operators for Model-Based Control , 2020, ICLR.

[5]  Nadia Magnenat-Thalmann,et al.  The simulation of cloth using accurate physical parameters , 2008 .

[6]  Jonathan Tompson,et al.  Learning to Rearrange Deformable Cables, Fabrics, and Bags with Goal-Conditioned Transporter Networks , 2020, ArXiv.

[7]  Tamim Asfour,et al.  Predicting Pushing Action Effects on Spatial Object Relations by Learning Internal Prediction Models , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[8]  In-Kwon Lee,et al.  Hierarchical Cloth Simulation using Deep Neural Networks , 2018, CGI 2018.

[9]  Yew Cheong Hou,et al.  A review on modeling of flexible deformable object for dexterous robotic manipulation , 2019, International Journal of Advanced Robotic Systems.

[10]  Ales Ude,et al.  Autonomous acquisition of pushing actions to support object grasping with a humanoid robot , 2009, 2009 9th IEEE-RAS International Conference on Humanoid Robots.

[11]  Belhassen-Chedli Bouzgarrou,et al.  Robotic manipulation and sensing of deformable objects in domestic and industrial applications: a survey , 2018, Int. J. Robotics Res..

[12]  S. Levine,et al.  Reasoning About Physical Interactions with Object-Centric Models , 2018 .

[13]  Wolfram Burgard,et al.  Learning to Singulate Objects using a Push Proposal Network , 2017, ISRR.

[14]  Dieter Fox,et al.  SE3-nets: Learning rigid body motion using deep neural networks , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[15]  David Held,et al.  SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation , 2020, CoRL.

[16]  Tae-Yong Kim,et al.  Unified particle physics for real-time applications , 2014, ACM Trans. Graph..

[17]  Alberto Rodriguez,et al.  Learning Synergies Between Pushing and Grasping with Self-Supervised Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[18]  Razvan Pascanu,et al.  Visual Interaction Networks: Learning a Physics Simulator from Video , 2017, NIPS.

[19]  Ruben Villegas,et al.  Learning Latent Dynamics for Planning from Pixels , 2018, ICML.

[20]  Razvan Pascanu,et al.  Interaction Networks for Learning about Objects, Relations and Physics , 2016, NIPS.

[21]  Rustam Stolkin,et al.  Learning to predict how rigid objects behave under simple manipulation , 2011, 2011 IEEE International Conference on Robotics and Automation.

[22]  Danica Kragic,et al.  Modeling, learning, perception, and control methods for deformable object manipulation , 2021, Science Robotics.

[23]  Sergey Levine,et al.  Unsupervised Learning for Physical Interaction through Video Prediction , 2016, NIPS.

[24]  Abhinav Gupta,et al.  Object-centric Forward Modeling for Model Predictive Control , 2019, CoRL.

[25]  In-Kwon Lee,et al.  Efficient Cloth Simulation using Miniature Cloth and Upscaling Deep Neural Networks , 2019, ArXiv.

[26]  Jeannette Bohg,et al.  Self-Supervised Learning of State Estimation for Manipulating Deformable Linear Objects , 2020, IEEE Robotics and Automation Letters.