Stochastic-Based Robust Dynamic Resource Allocation in a Heterogeneous Computing System

This research investigates the problem of robust dynamic resource allocation for heterogeneous distributed computing systems operating under imposed constraints. Often, such systems are expected to function in an environment where uncertainty in system parameters is common. In such an environment, the amount of processing required to complete an application may fluctuate substantially. Determining a resource allocation that accounts for this uncertainty---in a way that can provide a probability that a given level of service is achieved---is an important area of research. We define a mathematical model of stochastic robustness appropriate for a dynamic environment that can be used during resource allocation to aid heuristic decision making. In addition, we design a novel technique for maximizing stochastic robustness in this environment. Our performance results for this technique are compared with several well known resource allocation techniques in a simulated environment that models a heterogeneous distributed computing system.

[1]  Anthony A. Maciejewski,et al.  Dynamic resource allocation heuristics that manage tradeoff between makespan and robustness , 2007, The Journal of Supercomputing.

[2]  Ladislau Bölöni,et al.  A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems , 2001, J. Parallel Distributed Comput..

[3]  Dhabaleswar K. Panda,et al.  Characterization and enhancement of dynamic mapping heuristics for heterogeneous systems , 2000, Proceedings 2000. International Workshop on Parallel Processing.

[4]  Oscar H. Ibarra,et al.  Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical Processors , 1977, JACM.

[5]  David Wright,et al.  Probabilistic scheduling guarantees for fault-tolerant real-time systems , 1999, Dependable Computing for Critical Applications 7.

[6]  A. A. Maciejewski,et al.  Heterogeneous Computing , 2002 .

[7]  Eve A. Riskin,et al.  Signals, Systems, and Transforms , 1994 .

[8]  Johan Tordsson,et al.  An interoperable, standards-based grid resource broker and job submission service , 2005, First International Conference on e-Science and Grid Computing (e-Science'05).

[9]  Edward G. Coffman,et al.  Computer and job-shop scheduling theory , 1976 .

[10]  Anthony A. Maciejewski,et al.  Stochastic robustness metric and its use for static resource allocations , 2008, J. Parallel Distributed Comput..

[11]  Leon Garcia,et al.  Probability and Random Processes for Electrical Engineering , 1993 .

[12]  R. F. Freund,et al.  Dynamic Mapping of a Class of Independent Tasks onto Heterogeneous Computing Systems , 1999, J. Parallel Distributed Comput..

[13]  David Fernández-Baca,et al.  Allocating Modules to Processors in a Distributed System , 1989, IEEE Trans. Software Eng..

[14]  Peter F. Craigmile All of Statistics: A Concise Course in Statistical Inference , 2005 .

[15]  Lee C. Potter,et al.  Statistical Prediction of Task Execution Times through Analytic Benchmarking for Scheduling in a Heterogeneous Environment , 1999, IEEE Trans. Computers.

[16]  Emmanuel Jeannot,et al.  Robust task scheduling in non-deterministic heterogeneous computing systems , 2006, 2006 IEEE International Conference on Cluster Computing.

[17]  Jon B. Weissman,et al.  A new metric for robustness with application to job scheduling , 2005, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005..

[18]  David E. Irwin,et al.  Balancing risk and reward in a market-based task service , 2004, Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004..

[19]  Anurag Kumar,et al.  Performance Analysis and Scheduling of Stochastic Fork-Join Jobs in a Multicomputer System , 1993, IEEE Trans. Parallel Distributed Syst..

[20]  Atakan Dogan,et al.  Genetic Algorithm Based Scheduling of Meta-Tasks with Stochastic Execution Times in Heterogeneous Computing Systemst1 , 2004, Cluster Computing.

[21]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[22]  Behrouz A. Forouzan,et al.  Data Communications and Networking , 2000 .

[23]  A. Doğan,et al.  Genetic Algorithm Based Scheduling of Meta-Tasks with Stochastic Execution Times in Heterogeneous Computing Systems , 2004 .

[24]  Ladislau Bölöni,et al.  Characterizing Resource Allocation Heuristics for Heterogeneous Computing Systems , 2005, Adv. Comput..

[25]  Yan Alexander Li,et al.  Determining the Execution Time Distribution for a Data Parallel Program in a Heterogeneous Computing Environment , 1997, J. Parallel Distributed Comput..

[26]  SiegelHoward Jay,et al.  Task Matching and Scheduling in Heterogeneous Computing Environments Using a Genetic-Algorithm-Based Approach , 1997 .

[27]  Howard Jay Siegel,et al.  Representing Task and Machine Heterogeneities for Heterogeneous Computing Systems , 2000 .

[28]  Larry Wasserman,et al.  All of Statistics: A Concise Course in Statistical Inference , 2004 .

[29]  Anthony A. Maciejewski,et al.  Measuring the Robustness of Resource Allocations in a Stochastic Dynamic Environment , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.