Learning Physically Realizable Skills for Online Packing of General 3D Shapes

We study the problem of learning online packing skills for irregular 3D shapes, which is arguably the most challenging setting of bin packing problems. The goal is to consecutively move a sequence of 3D objects with arbitrary shapes into a designated container with only partial observations of the object sequence. We also take physical realizability into account, which involves the physical dynamics and constraints of a placement. The packing policy should understand the 3D geometry of the object to be packed and make effective decisions to accommodate it in the container in a physically realizable way. We propose a Reinforcement Learning (RL) pipeline to learn the policy. The complex irregular geometry and imperfect object placement together lead to a huge solution space, and direct training in such a space is prohibitively data intensive. We instead propose a theoretically provable method for candidate action generation to reduce the action space of RL and the learning burden. A parameterized policy is then learned to select the best placement from the candidates. Equipped with an efficient method of asynchronous RL acceleration and a data preparation process for simulation-ready training sequences, a mature packing policy can be trained in a physics-based environment within 48 hours. Through extensive evaluation on a variety of real-life shape datasets and comparisons with state-of-the-art baselines, we demonstrate that our method outperforms the best-performing baseline on all datasets by at least 12.8% in terms of packing utility. We also release our datasets and source code to support further research in this direction.
