Scene mover

We propose a novel approach for automatically generating a move plan for scene arrangement.1 Given a scene like an apartment with many furniture objects, to transform its layout into another layout, one would need to determine a collision-free move plan. It could be challenging to design this plan manually because the furniture objects may block the way of each other if not moved properly; and there is a large complex search space of move action sequences that grow exponentially with the number of objects. To tackle this challenge, we propose a learning-based approach to generate a move plan automatically. At the core of our approach is a Monte Carlo tree that encodes possible states of the layout, based on which a search is performed to move a furniture object appropriately in the current layout. We trained a policy neural network embedded with a LSTM module for estimating the best actions to take in the expansion step and simulation step of the Monte Carlo tree search process. Leveraging the power of deep reinforcement learning, the network learned how to make such estimations through millions of trials of moving objects. We demonstrated our approach for moving objects under different scenarios and constraints. We also evaluated our approach on synthetic and real-world layouts, comparing its performance with that of humans and other baseline approaches.

[1]  Danica Kragic,et al.  Multi-Object Rearrangement with Monte Carlo Tree Search: A Case Study on Planar Nonprehensile Sorting , 2019, ArXiv.

[2]  Matthew Garratt,et al.  Obstacle Avoidance for Multi-agent Path Planning Based on Vectorized Particle Swarm Optimization , 2017 .

[3]  Yelong Shen,et al.  ReinforceWalk: Learning to Walk in Graph with Monte Carlo Tree Search , 2018, ICLR.

[4]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[5]  Nils J. Nilsson,et al.  A Formal Basis for the Heuristic Determination of Minimum Cost Paths , 1968, IEEE Trans. Syst. Sci. Cybern..

[6]  Angel X. Chang,et al.  PlanIT , 2019, ACM Transactions on Graphics.

[7]  Danica Kragic,et al.  Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Matthias Nießner,et al.  PiGraphs , 2016, ACM Trans. Graph..

[9]  Siddhartha S. Srinivasa,et al.  Robust trajectory selection for rearrangement planning as a multi-armed bandit problem , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[10]  Peter Stone,et al.  Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.

[11]  H. Jaap van den Herik,et al.  Cross-Entropy for Monte-Carlo Tree Search , 2008, J. Int. Comput. Games Assoc..

[12]  Daniel Cohen-Or,et al.  GRAINS , 2018, ACM Trans. Graph..

[13]  Pat Hanrahan,et al.  Synthesizing open worlds with constraints using locally annealed reversible jump MCMC , 2012, ACM Trans. Graph..

[14]  Siddhartha S. Srinivasa,et al.  Unobservable Monte Carlo planning for nonprehensile rearrangement tasks , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Ligang Liu,et al.  MIQP‐based Layout Design for Building Interiors , 2018, Comput. Graph. Forum.

[16]  Pedro U. Lima,et al.  Robot formation motion planning using Fast Marching , 2011, Robotics Auton. Syst..

[17]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[18]  Siddhartha S. Srinivasa,et al.  Rearrangement planning using object-centric and robot-centric action spaces , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[19]  Gene Eu Jan,et al.  An $\bm{O(n\log n)}$ Shortest Path Algorithm Based on Delaunay Triangulation , 2014, IEEE/ASME Transactions on Mechatronics.

[20]  Danica Kragic,et al.  Learning Manipulation States and Actions for Efficient Non-prehensile Rearrangement Planning , 2019, ArXiv.

[21]  Vladlen Koltun,et al.  Computer-generated residential building layouts , 2010, SIGGRAPH 2010.

[22]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[23]  Chenfanfu Jiang,et al.  Human-Centric Indoor Scene Synthesis Using Stochastic Grammar , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Dinesh Manocha,et al.  Centralized path planning for multiple robots: Optimal decoupling into sequential plans , 2009, Robotics: Science and Systems.

[25]  Matthias Nießner,et al.  ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Murray Campbell,et al.  Deep Blue , 2002, Artif. Intell..

[27]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[28]  W. Ramsey,et al.  The Cambridge Handbook of Artificial Intelligence , 2014 .

[29]  Kai Wang,et al.  Fast and Flexible Indoor Scene Synthesis via Deep Convolutional Generative Models , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Peter Stone,et al.  Reinforcement learning , 2019, Scholarpedia.

[31]  Peter Wonka,et al.  Computing layouts with deformable templates , 2014, ACM Trans. Graph..

[32]  Rui Ma,et al.  Action-driven 3D indoor scene evolution , 2016, ACM Trans. Graph..

[33]  Paul M. Griffin,et al.  Path planning for a mobile robot , 1992, IEEE Trans. Syst. Man Cybern..

[34]  Chi-Keung Tang,et al.  Make it home: automatic optimization of furniture arrangement , 2011, ACM Trans. Graph..

[35]  C A Nelson,et al.  Learning to Learn , 2017, Encyclopedia of Machine Learning and Data Mining.

[36]  Rodney A. Brooks,et al.  A subdivision algorithm in configuration space for findpath with rotation , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[37]  Matthias Nießner,et al.  Activity-centric scene synthesis for functional 3D scene modeling , 2015, ACM Trans. Graph..

[38]  Siddhartha S. Srinivasa,et al.  Kinodynamic randomized rearrangement planning via dynamic transitions between statically stable states , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[39]  Dinesh Manocha,et al.  AutonoVi: Autonomous vehicle planning with dynamic maneuvers and traffic constraints , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[40]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[41]  David Barber,et al.  Thinking Fast and Slow with Deep Learning and Tree Search , 2017, NIPS.

[42]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[43]  Abdeslam Boularias,et al.  Object Rearrangement with Nested Nonprehensile Manipulation Actions , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[44]  Dinesh Manocha,et al.  Real-Time Path Planning in Dynamic Virtual Environments Using Multiagent Navigation Graphs , 2008, IEEE Transactions on Visualization and Computer Graphics.

[45]  Sergey Levine,et al.  Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..

[46]  Bin Zhou,et al.  Adaptive synthesis of indoor scenes via activity-associated object relation graphs , 2017, ACM Trans. Graph..

[47]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[48]  Mathieu Aubry,et al.  Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning , 2019, IEEE Robotics and Automation Letters.

[49]  Dinesh Manocha,et al.  Path Planning for Deformable Robots in Complex Environments , 2005, Robotics: Science and Systems.

[50]  Danica Kragic,et al.  End-to-end nonprehensile rearrangement with deep reinforcement learning and simulation-to-reality transfer , 2019, Robotics Auton. Syst..

[51]  Wei Liang,et al.  Deep Single-View 3D Object Reconstruction with Visual Hull Embedding , 2019, AAAI.

[52]  Angel X. Chang,et al.  Deep convolutional priors for indoor scene synthesis , 2018, ACM Trans. Graph..

[53]  Wenguan Wang,et al.  Active Visual Information Gathering for Vision-Language Navigation , 2020, ECCV.