论文信息 - Parallel B&B Algorithm for Hybrid Multi-core/GPU Architectures

Parallel B&B Algorithm for Hybrid Multi-core/GPU Architectures

B&B algorithms are well known techniques for exact solving of combinatorial optimization problems (COP). They perform an implicit enumeration of the search space instead of exhaustive one. Based on a pruning technique, they reduce considerably the computation time required to explore the whole search space. Nevertheless, these algorithms remain inefficient when dealing with large combinatorial optimization instances. They are time-intensive and they require a huge computing power to be solved optimally. Nowadays, multi-core-based processors and GPU accelerators are often coupled together to achieve impressive performances. However, classical B&B algorithms must be rethought to deal with their two divergent architectures. In this paper, we propose a new B&B approach exploiting both the multi-core aspect of actual processors and GPU accelerators. The proposed approaches have been executed to solve FSP instances that are well-known combinatorial optimization benchmarks. Real experiments have been carried out on an Intel Xeon 64-bit quad-core processor E5520 coupled to an Nvidia Tesla C2075 GPU device. The results show that our hybrid B&B approach speeds up the execution time up to ×123 over the sequential mono-core B&B algorithm.

[1] Teodor Gabriel Crainic,et al. PARALLEL BRANCH-AND-BOUND ALGORITHMS: SURVEY AND SYNTHESIS , 1993 .

[2] Imen Chakroun,et al. Reducing thread divergence in a GPU‐accelerated branch‐and‐bound algorithm , 2013, Concurr. Comput. Pract. Exp..

[3] S. M. Johnson,et al. Optimal two- and three-stage production schedules with setup times included , 1954 .

[4] Gustavo Augusto Lima de Campos,et al. A New Parallel Schema for Branch-and-Bound Algorithms Using GPGPU , 2011, 2011 23rd International Symposium on Computer Architecture and High Performance Computing.

[5] B. J. Lageweg,et al. A General Bounding Scheme for the Permutation Flow-Shop Problem , 1978, Oper. Res..

[6] James R. Jackson,et al. An extension of Johnson's results on job IDT scheduling , 1956 .

[7] Didier El Baz,et al. GPU Implementation of the Branch and Bound Method for Knapsack Problems , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[8] David S. Johnson,et al. Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[9] L. G. Mitten. Sequencing n Jobs on Two Machines with Arbitrary Time Lags , 1959 .

[10] Imen Chakroun,et al. Operator-Level GPU-Accelerated Branch and Bound Algorithms , 2013, ICCS.

[11] Bernard Gendron,et al. Parallel Branch-and-Branch Algorithms: Survey and Synthesis , 1994, Oper. Res..

[12] Laurent Péridy,et al. Enhanced Disjunctive Elimination Rules for the Flow-Shop and Permutation Flow-Shop Problems , 1998 .

[13] David P. Anderson,et al. SETI@home: an experiment in public-resource computing , 2002, CACM.