Parallel B&B Algorithm for Hybrid Multi-core/GPU Architectures

Branch-and-bound (B&B) algorithms are well-known techniques for solving combinatorial optimization problems (COPs) exactly. They enumerate the search space implicitly rather than exhaustively: a pruning technique discards subproblems that cannot improve on the best solution found so far, which considerably reduces the time required to explore the whole search space. Nevertheless, these algorithms remain inefficient on large instances, where solving to optimality is time-intensive and requires huge computing power. Nowadays, multi-core processors and GPU accelerators are often coupled to achieve impressive performance. However, classical B&B algorithms must be rethought to deal with these two divergent architectures. In this paper, we propose a new B&B approach that exploits both the multiple cores of current processors and GPU accelerators. The proposed approach has been evaluated on instances of the flow-shop scheduling problem (FSP), a well-known combinatorial optimization benchmark. Experiments have been carried out on a quad-core 64-bit Intel Xeon E5520 processor coupled to an Nvidia Tesla C2075 GPU device. The results show that our hybrid B&B approach achieves speedups of up to ×123 over the sequential single-core B&B algorithm.
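To make the implicit enumeration and pruning concrete, the following is a minimal sequential sketch of a B&B for the permutation flow-shop problem. It is not the hybrid multi-core/GPU algorithm proposed here. The function names (`advance`, `lower_bound`, `branch_and_bound`), the depth-first exploration order, and the simple one-machine lower bound are all assumptions made for illustration; the bound actually used in the paper may differ.

```python
from typing import List, Sequence, Set, Tuple


def advance(fronts: Sequence[int], job_times: Sequence[int]) -> List[int]:
    """Completion time on each machine after appending one job to a partial schedule."""
    new_fronts = []
    prev = 0  # completion time of this job on the previous machine
    for machine, duration in enumerate(job_times):
        prev = max(prev, fronts[machine]) + duration
        new_fronts.append(prev)
    return new_fronts


def lower_bound(fronts: Sequence[int], remaining: Set[int],
                p: Sequence[Sequence[int]]) -> int:
    """Illustrative one-machine bound (an assumption, not the paper's bound).

    For each machine: time already committed, plus the load of all unscheduled
    jobs on that machine, plus the shortest tail any of them still needs on
    the following machines.
    """
    if not remaining:
        return fronts[-1]  # complete schedule: the bound is the makespan
    bound = 0
    for m in range(len(fronts)):
        load = sum(p[j][m] for j in remaining)
        tail = min(sum(p[j][m + 1:]) for j in remaining)
        bound = max(bound, fronts[m] + load + tail)
    return bound


def branch_and_bound(p: Sequence[Sequence[int]]) -> Tuple[int, List[int]]:
    """Return (optimal makespan, one optimal job order); p[j][m] is the
    processing time of job j on machine m. Assumes at least one job."""
    n_jobs, n_machines = len(p), len(p[0])
    best_cost = float("inf")
    best_order: List[int] = []

    def explore(prefix: List[int], remaining: Set[int], fronts: List[int]) -> None:
        nonlocal best_cost, best_order
        if not remaining:  # leaf: a complete permutation
            if fronts[-1] < best_cost:
                best_cost, best_order = fronts[-1], list(prefix)
            return
        for job in sorted(remaining):  # branching: fix the next job
            child_fronts = advance(fronts, p[job])
            rest = remaining - {job}
            # Pruning: skip subtrees that cannot beat the incumbent.
            if lower_bound(child_fronts, rest, p) < best_cost:
                explore(prefix + [job], rest, child_fronts)

    explore([], set(range(n_jobs)), [0] * n_machines)
    return best_cost, best_order
```

For example, `branch_and_bound([[3, 2], [1, 4], [2, 2]])` (three jobs, two machines) returns an optimal makespan of 9. In a parallel setting, the bound evaluations on the child nodes are independent of one another, which is what makes them a natural candidate for offloading.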
