Scalable linear programming based resource allocation for makespan minimization in heterogeneous computing systems

Resource management for large-scale high performance computing systems poses difficult challenges to system administrators. The extreme scale of these modern systems require task scheduling algorithms that are capable of handling at least millions of tasks and thousands of machines. Highly scalable algorithms are necessary to efficiently schedule tasks to maintain the highest level of performance from the system. In this study, we design a novel linear programming based resource allocation algorithm for heterogeneous computing systems to efficiently compute high quality solutions for minimizing makespan. The novel algorithm tightly bounds the optimal makespan from below with an infeasible schedule and from above with a fully feasible schedule. The new algorithms are highly scalable in terms of solution quality and computation time as the problem size increases because they leverage similarity in tasks and machines. This novel algorithm is compared to existing algorithms via simulation on a few example systems. We present a novel scheduling algorithm for heterogeneous computing environments.Uses groupings of similar tasks and machines to reduce the computational complexity.Computes upper and lower bounds on the optimal makespan.Schedule approaches a lower bound on the makespan as the number of tasks increases.Scheduling algorithm run time scales linearly with the number of tasks.

[1]  Oscar H. Ibarra,et al.  Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical Processors , 1977, JACM.

[2]  Min-You Wu,et al.  A high-performance mapping algorithm for heterogeneous computing systems , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[3]  Imtiaz Ahmad,et al.  An Integrated Technique for Task Matching and Scheduling onto Distributed Heterogeneous Computing Systems , 2002, J. Parallel Distributed Comput..

[4]  Emmanuel Jeannot,et al.  Optimizing performance and reliability on heterogeneous parallel systems: Approximation algorithms and heuristics , 2012, J. Parallel Distributed Comput..

[5]  Debasish Ghose,et al.  Divisible Load Theory: A New Paradigm for Load Scheduling in Distributed Systems , 2004, Cluster Computing.

[6]  Anthony A. Maciejewski,et al.  Energy and Makespan Tradeoffs in Heterogeneous Computing Systems using Efficient Linear Programming Techniques , 2016, IEEE Transactions on Parallel and Distributed Systems.

[7]  Henri Casanova,et al.  Resource allocation algorithms for virtualized service hosting platforms , 2010, J. Parallel Distributed Comput..

[8]  Jie Wu,et al.  Energy-Aware Scheduling for Frame-Based Tasks on Heterogeneous Multiprocessor Platforms , 2012, 2012 41st International Conference on Parallel Processing.

[9]  Viktor K. Prasanna,et al.  Heterogeneous computing: challenges and opportunities , 1993, Computer.

[10]  Howard Jay Siegel,et al.  Representing Task and Machine Heterogeneities for Heterogeneous Computing Systems , 2000 .

[11]  Gregory A. Koenig,et al.  An Analysis Framework for Investigating the Trade-Offs between System Performance and Energy Consumption in a Heterogeneous Computing Environment , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[12]  Salim Hariri,et al.  Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing , 2002, IEEE Trans. Parallel Distributed Syst..

[13]  Klaus Jansen,et al.  Improved Approximation Schemes for Scheduling Unrelated Parallel Machines , 2001, Math. Oper. Res..

[14]  Debasish Ghose,et al.  Scheduling Divisible Loads in Parallel and Distributed Systems , 1996 .

[15]  Ishfaq Ahmad,et al.  Optimal task assignment in heterogeneous distributed computing systems , 1998, IEEE Concurr..

[16]  Ronald L. Graham,et al.  Bounds on Multiprocessing Timing Anomalies , 1969, SIAM Journal of Applied Mathematics.

[17]  Ladislau Bölöni,et al.  A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems , 2001, J. Parallel Distributed Comput..

[18]  Arif Ghafoor,et al.  A distributed heterogeneous supercomputing management system , 1993, Computer.

[19]  R. F. Freund,et al.  Dynamic Mapping of a Class of Independent Tasks onto Heterogeneous Computing Systems , 1999, J. Parallel Distributed Comput..

[20]  Jan Karel Lenstra,et al.  Approximation algorithms for scheduling unrelated parallel machines , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[21]  Henri Casanova,et al.  Virtual Machine Resource Allocation for Service Hosting on Heterogeneous Distributed Platforms , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[22]  John N. Tsitsiklis,et al.  Introduction to linear optimization , 1997, Athena scientific optimization and computation series.

[23]  Anthony A. Maciejewski,et al.  Efficient and Scalable Pareto Front Generation for Energy and Makespan in Heterogeneous Computing Systems , 2013, WCO@FedCSIS.

[24]  Éva Tardos,et al.  Scheduling unrelated machines with costs , 1993, SODA '93.

[25]  Grey Giddins,et al.  Statistics , 2016, The Journal of hand surgery, European volume.

[26]  Yves Robert,et al.  Energy-aware scheduling under reliability and makespan constraints , 2011, 2012 19th International Conference on High Performance Computing.