A parallel local search in CPU/GPU for scheduling independent tasks on large heterogeneous computing systems

This article presents the parallel implementation on CPU/GPU of two variants of a stochastic local search method to efficiently solve the scheduling problem in heterogeneous computing systems. Both methods are based on a set of simple operators to keep the computational complexity as low as possible, thus allowing large instances of the scheduling problem to be efficiently addressed. The experimental analysis demonstrates that both versions of the parallel CPU/GPU stochastic local search are able to compute accurate suboptimal schedules in significantly shorter execution times than state-of-the-art schedulers, while also outperforming a recently published GPU parallel evolutionary scheduler in terms of both efficiency and solution quality.

[1]  Che-Lun Hung,et al.  Parallel UPGMA Algorithm on Graphics Processing Units Using CUDA , 2012, 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems.

[2]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[3]  E.L. Lawler,et al.  Optimization and Approximation in Deterministic Sequencing and Scheduling: a Survey , 1977 .

[4]  John Levine,et al.  A fast, effective local search for scheduling independent jobs in heterogeneous computing environments , 2003 .

[5]  El-Ghazali Talbi,et al.  A GPU-based iterated tabu search for solving the quadratic 3-dimensional assignment problem , 2010, ACS/IEEE International Conference on Computer Systems and Applications - AICCSA 2010.

[6]  Petru Eles,et al.  Customizing Instruction Set Extensible Reconfigurable Processors Using GPUs , 2012, 2012 25th International Conference on VLSI Design.

[7]  Kamil Rocki,et al.  Accelerating 2-opt and 3-opt local search using GPU in the travelling salesman problem , 2012, 2012 International Conference on High Performance Computing & Simulation (HPCS).

[8]  Howard Jay Siegel,et al.  Representing Task and Machine Heterogeneities for Heterogeneous Computing Systems , 2000 .

[9]  Seif Haridi,et al.  A GPU-enabled solver for time-constrained linear sum assignment problems , 2010, 2010 The 7th International Conference on Informatics and Systems (INFOS).

[10]  Thiago Luís Lopes Siqueira,et al.  Querying data warehouses efficiently using the Bitmap Join Index OLAP Tool , 2012, CLEI Electron. J..

[11]  El-Ghazali Talbi,et al.  GPU Computing for Parallel Local Search Metaheuristic Algorithms , 2013, IEEE Transactions on Computers.

[12]  Jacek Blazewicz,et al.  G-MSA - A GPU-based, fast and accurate algorithm for multiple sequence alignment , 2013, J. Parallel Distributed Comput..

[13]  Kamil Rocki,et al.  An efficient GPU implementation of a multi-start TSP solver for large problem instances , 2012, GECCO '12.

[14]  Sergio Nesmachnow,et al.  Parallel implementations of the MinMin heterogeneous computing scheduler in GPU , 2012, CLEI Electron. J..

[15]  Enrique Alba,et al.  Parallel metaheuristics: recent advances and new trends , 2012, Int. Trans. Oper. Res..

[16]  Pascal Bouvry,et al.  A two-phase heuristic for the scheduling of independent tasks on computational grids , 2011, 2011 International Conference on High Performance Computing & Simulation.

[17]  Ladislau Bölöni,et al.  A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems , 2001, J. Parallel Distributed Comput..

[18]  Michaël Krajecki,et al.  Parallel GPU Implementation of Iterated Local Search for the Travelling Salesman Problem , 2012, LION.

[19]  El-Ghazali Talbi,et al.  ParadisEO-MO-GPU: a framework for parallel GPU-based local search metaheuristics , 2013, GECCO '13.

[20]  Enrique Alba,et al.  An Efficient Stochastic Local Search for Heterogeneous Computing Scheduling , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[21]  El-Ghazali Talbi,et al.  Neighborhood Structures for GPU-Based Local Search Algorithms , 2010, Parallel Process. Lett..

[22]  Howard Jay Siegel,et al.  Task execution time modeling for heterogeneous computing systems , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[23]  Y.-K. Kwok,et al.  Static scheduling algorithms for allocating directed task graphs to multiprocessors , 1999, CSUR.

[24]  Maxim Likhachev,et al.  High-dimensional planning on the GPU , 2010, 2010 IEEE International Conference on Robotics and Automation.

[25]  Jacques Carlier,et al.  Handbook of Scheduling - Algorithms, Models, and Performance Analysis , 2004 .

[26]  Stefano Cagnoni,et al.  libCudaOptimize: an open source library of GPU-based metaheuristics , 2012, GECCO '12.

[27]  El-Ghazali Talbi,et al.  GPU-Based Multi-start Local Search Algorithms , 2011, LION.

[28]  G. Croes A Method for Solving Traveling-Salesman Problems , 1958 .

[29]  Christian Schulz,et al.  Efficient local search on the GPU - Investigations on the vehicle routing problem , 2013, J. Parallel Distributed Comput..

[30]  Michal Czapinski,et al.  An effective Parallel Multistart Tabu Search for Quadratic Assignment Problem on CUDA platform , 2013, J. Parallel Distributed Comput..

[31]  Hesham H. Ali,et al.  Task scheduling in parallel and distributed systems , 1994, Prentice Hall series in innovative technology.

[32]  Pascal Bouvry,et al.  Solving very large instances of the scheduling of independent tasks problem on the GPU , 2013, J. Parallel Distributed Comput..

[33]  Enrique Alba,et al.  A New Local Search Algorithm for the DNA Fragment Assembly Problem , 2007, EvoCOP.

[34]  Enrique Alba,et al.  A parallel micro evolutionary algorithm for heterogeneous computing and grid scheduling , 2012, Appl. Soft Comput..

[35]  Sunil P. Khatri,et al.  Boolean satisfiability on a graphics processor , 2010, GLSVLSI '10.

[36]  Enrique Alba,et al.  Heterogeneous computing scheduling with evolutionary algorithms , 2010, Soft Comput..