A Multi-task Selected Learning Approach for Solving 3D Flexible Bin Packing Problem

The 3D flexible bin packing problem (3D-FBPP) arises from warehouse packing in e-commerce. An online customer's order usually contains several items, which must be packed together before shipping. In particular, 5% of the tens of millions of packages shipped every day use plastic wrapping as outer packaging, so minimizing the plastic surface area directly reduces logistics costs. Because of this practical significance, we focus on packing cuboid-shaped items orthogonally into a bin of least surface area. Existing heuristic methods for classic 3D bin packing do not work well for this particular NP-hard problem, and designing a good problem-specific heuristic is non-trivial. In this paper, rather than designing heuristics, we propose a novel multi-task framework based on Selected Learning that learns a heuristic-like policy, which simultaneously generates the sequence and the orientations of the items to be packed. Through comprehensive experiments on a large-scale real-world transaction order dataset and online A/B tests, we show that: 1) our selected learning method balances the imbalance and correlation among the tasks and significantly outperforms both the single-task Pointer Network and the multi-task network without selected learning; 2) our method achieves an average cost reduction of 5.47% compared with the well-designed greedy algorithm previously used in our online production system.
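
To make the objective concrete, the following is a minimal Python sketch of the quantities the abstract refers to: the surface area of a cuboid bin and the axis-aligned orientations of an item. The helper names (`orientations`, `surface_area`, `best_stack`) and the stacking rule are assumptions made for illustration only. The sketch stacks items along one axis and searches orientations by brute force; it is not the learned policy, nor the greedy production baseline, described above.

```python
from itertools import permutations, product


def orientations(item):
    """Distinct axis-aligned orientations of a cuboid given as (l, w, h)."""
    return sorted(set(permutations(item)))


def surface_area(dims):
    """Surface area of a cuboid bin with dimensions (l, w, h)."""
    l, w, h = dims
    return 2 * (l * w + w * h + l * h)


def best_stack(items):
    """Brute-force orientation search for a hypothetical stacking layout.

    Assumption: items are stacked on top of each other along the height
    axis, so the bin is the bounding box (max l, max w, sum of h).
    Only practical for a handful of items (up to 6**n combinations).
    """
    best_area, best_choice = None, None
    for choice in product(*(orientations(it) for it in items)):
        bin_dims = (
            max(o[0] for o in choice),
            max(o[1] for o in choice),
            sum(o[2] for o in choice),
        )
        area = surface_area(bin_dims)
        if best_area is None or area < best_area:
            best_area, best_choice = area, choice
    return best_area, best_choice


if __name__ == "__main__":
    area, choice = best_stack([(1, 2, 3), (1, 2, 3)])
    print(area, choice)
```

Even in this restricted layout the resulting surface area depends on how each item is rotated, which is why the orientation decision is treated as a task alongside the packing sequence.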
