Utility Functions and Resource Management in an Oversubscribed Heterogeneous Computing Environment

We model an oversubscribed heterogeneous computing system where tasks arrive dynamically and a scheduler maps the tasks to machines for execution. The environment and workloads are based on those being investigated by the Extreme Scale Systems Center at Oak Ridge National Laboratory. Utility functions that are designed based on specifications from the system owner and users are used to create a metric for the performance of resource allocation heuristics. Each task has a time-varying utility (importance) that the enterprise will earn based on when the task successfully completes execution. We design multiple heuristics, which include a technique to drop low utility-earning tasks, to maximize the total utility that can be earned by completing tasks. The heuristics are evaluated using simulation experiments with two levels of oversubscription. The results show the benefit of having fast heuristics that account for the importance of a task and the heterogeneity of the environment when making allocation decisions in an oversubscribed environment. The ability to drop low utility-earning tasks allow the heuristics to tolerate the high oversubscription as well as earn significant utility.

[1]  Ishfaq Ahmad,et al.  Optimal task assignment in heterogeneous distributed computing systems , 1998, IEEE Concurr..

[2]  S. Ghanbari,et al.  On-line Mapping Algorithms in Highly Heterogeneous Computational Grids : A Learning Automata Approach , 2005 .

[3]  Francine Berman,et al.  Adaptive Computing on the Grid Using AppLeS , 2003, IEEE Trans. Parallel Distributed Syst..

[4]  Anthony A. Maciejewski,et al.  Static allocation of resources to communicating subtasks in a heterogeneous ad hoc grid environment , 2006, J. Parallel Distributed Comput..

[5]  Ladislau Bölöni,et al.  A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems , 2001, J. Parallel Distributed Comput..

[6]  Arif Ghafoor,et al.  A distributed heterogeneous supercomputing management system , 1993, Computer.

[7]  R. F. Freund,et al.  Dynamic Mapping of a Class of Independent Tasks onto Heterogeneous Computing Systems , 1999, J. Parallel Distributed Comput..

[8]  Bora Uçar,et al.  Heuristics for scheduling file-sharing tasks on heterogeneous systems with distributed repositories , 2007, J. Parallel Distributed Comput..

[9]  Arif Ghafoor,et al.  Semi-Distributed Load Balancing For Massively Parallel Multicomputer Systems , 1991, IEEE Trans. Software Eng..

[10]  Anthony A. Maciejewski,et al.  Deadline and energy constrained dynamic resource allocation in a heterogeneous computing environment , 2011, 2011 40th International Conference on Parallel Processing Workshops.

[11]  Anthony A. Maciejewski,et al.  Dynamic Resource Management in Energy Constrained Heterogeneous Computing Systems Using Voltage Scaling , 2008, IEEE Transactions on Parallel and Distributed Systems.

[12]  Binoy Ravindran,et al.  Scheduling distributable real-time threads in Tempus middleware , 2004, Proceedings. Tenth International Conference on Parallel and Distributed Systems, 2004. ICPADS 2004..

[13]  David A. Bader,et al.  A Framework for Measuring Supercomputer Productivity , 2004, Int. J. High Perform. Comput. Appl..

[14]  Binoy Ravindran,et al.  On recent advances in time/utility function real-time scheduling and resource management , 2005, Eighth IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC'05).

[15]  Binoy Ravindran,et al.  A formally verified application-level framework for real-time scheduling on POSIX real-time operating systems , 2004, IEEE Transactions on Software Engineering.

[16]  Cynthia Bailey Lee,et al.  Precise and realistic utility functions for user-centric performance analysis of schedulers , 2007, HPDC '07.

[17]  Klara Nahrstedt,et al.  QoS and Contention-Aware Multi-Resource Reservation , 2004, Cluster Computing.

[18]  Gregory A. Koenig,et al.  Time Utility Functions for Modeling and Evaluating Resource Allocations in a Heterogeneous Computing System , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[19]  Ken Chen,et al.  A scheduling algorithm for tasks described by Time Value Function , 1996, Real-Time Systems.

[20]  Ali Movaghar-Rahimabadi,et al.  Performance Optimization Based on Analytical Modeling in a Real-Time System with Constrained Time/Utility Functions , 2011, IEEE Transactions on Computers.

[21]  Douglas G. Down,et al.  Linear Programming-Based Affinity Scheduling of Independent Tasks on Heterogeneous Computing Systems , 2008, IEEE Transactions on Parallel and Distributed Systems.

[22]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[23]  Anthony A. Maciejewski,et al.  Dynamically mapping tasks with priorities and multiple deadlines in a heterogeneous environment , 2007, J. Parallel Distributed Comput..

[24]  Sadiq M. Sait,et al.  Task matching and scheduling in heterogeneous systems using simulated evolution , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[25]  Guoliang Chen,et al.  A benefit function mapping heuristic for a class of meta-tasks in grid environments , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[26]  Anthony A. Maciejewski,et al.  Heuristics for Robust Resource Allocation of Satellite Weather Data Processing on a Heterogeneous Parallel System , 2011, IEEE Transactions on Parallel and Distributed Systems.

[27]  Anthony A. Maciejewski,et al.  Robust static allocation of resources for independent tasks under makespan and dollar cost constraints , 2007, J. Parallel Distributed Comput..

[28]  Viktor K. Prasanna,et al.  Heterogeneous computing: challenges and opportunities , 1993, Computer.

[29]  Anthony A. Maciejewski,et al.  Stochastic robustness metric and its use for static resource allocations , 2008, J. Parallel Distributed Comput..

[30]  Imtiaz Ahmad,et al.  An Integrated Technique for Task Matching and Scheduling onto Distributed Heterogeneous Computing Systems , 2002, J. Parallel Distributed Comput..

[31]  Howard Jay Siegel,et al.  Representing Task and Machine Heterogeneities for Heterogeneous Computing Systems , 2000 .

[32]  Francine Berman,et al.  Heuristics for scheduling parameter sweep applications in grid environments , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[33]  Hong Zhang,et al.  Segmented min-min: a static mapping algorithm for meta-tasks on heterogeneous computing systems , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).