Stochastic-based robust dynamic resource allocation for independent tasks in a heterogeneous computing system

Heterogeneous parallel and distributed computing systems frequently must operate in environments where there is uncertainty in system parameters. Robustness can be defined as the degree to which a system can function correctly in the presence of parameter values different from those assumed. In such an environment, the execution time of any given task may fluctuate substantially due to factors such as the content of data to be processed. Determining a resource allocation that is robust against this uncertainty is an important area of research. In this study, we define a stochastic robustness measure to facilitate resource allocation decisions in a dynamic environment where tasks are subject to individual hard deadlines and each task requires some input data to start execution. In this environment, the tasks that cannot meet their deadlines are dropped (i.e., discarded). We define methods to determine the stochastic completion times of tasks in the presence of the task dropping. The stochastic task completion time is used in the definition of the stochastic robustness measure. Based on this stochastic robustness measure, we design novel resource allocation techniques that work in immediate and batch modes, with the goal of maximizing the number of tasks that meet their individual deadlines. We compare the performance of our technique against several well-known approaches taken from the literature and adapted to our environment. Simulation results of this study demonstrate the suitability of our new technique in a dynamic heterogeneous computing system. Calculating stochastic task completion time in heterogeneous system with task dropping.A model to quantify resource allocation robustness and propose mapping heuristics.Evaluating immediate and batch mappings and optimizing queue-size limit of batch mode.Analyzing impact of over-subscription level on immediate and batch allocation modes.Providing a model in the batch mode to run mapping events before machines become idle.

[1]  Atakan Dogan,et al.  Genetic Algorithm Based Scheduling of Meta-Tasks with Stochastic Execution Times in Heterogeneous Computing Systemst1 , 2004, Cluster Computing.

[2]  Donald Yeung,et al.  Coherent Profiles: Enabling Efficient Reuse Distance Analysis of Multicore Scaling for Loop-based Parallel Programs , 2011, 2011 International Conference on Parallel Architectures and Compilation Techniques.

[3]  Kenli Li,et al.  A Multiple Priority Queueing Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems , 2012, 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems.

[4]  Howard Jay Siegel,et al.  Representing Task and Machine Heterogeneities for Heterogeneous Computing Systems , 2000 .

[5]  Oscar H. Ibarra,et al.  Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical Processors , 1977, JACM.

[6]  Ivan Stojmenovic,et al.  Optimal Power Allocation and Load Distribution for Multiple Heterogeneous Multicore Server Processors across Clouds and Data Centers , 2014, IEEE Transactions on Computers.

[7]  Stuart Barber,et al.  All of Statistics: a Concise Course in Statistical Inference , 2005 .

[8]  Gregory A. Koenig,et al.  Utility Functions and Resource Management in an Oversubscribed Heterogeneous Computing Environment , 2015, IEEE Transactions on Computers.

[9]  Jun Kong,et al.  High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[10]  Xiao Qin,et al.  A novel fault-tolerant scheduling algorithm for precedence constrained tasks in real-time heterogeneous systems , 2006, Parallel Comput..

[11]  Anthony A. Maciejewski,et al.  Measuring the Robustness of Resource Allocations in a Stochastic Dynamic Environment , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[12]  Anthony A. Maciejewski,et al.  Stochastic-Based Robust Dynamic Resource Allocation in a Heterogeneous Computing System , 2009, 2009 International Conference on Parallel Processing.

[13]  Borja Sotomayor,et al.  Resource Leasing and the Art of Suspending Virtual Machines , 2009, 2009 11th IEEE International Conference on High Performance Computing and Communications.

[14]  Anthony A. Maciejewski,et al.  Stochastic robustness metric and its use for static resource allocations , 2008, J. Parallel Distributed Comput..

[15]  Ladislau Bölöni,et al.  A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems , 2001, J. Parallel Distributed Comput..

[16]  Gregory A. Koenig,et al.  Utility maximizing dynamic resource management in an oversubscribed energy-constrained heterogeneous computing system , 2015, Sustain. Comput. Informatics Syst..

[17]  Vyas Sekar,et al.  Understanding the impact of video quality on user engagement , 2011, SIGCOMM.

[18]  Ya-Shu Chen,et al.  Online Real-Time Task Scheduling in Heterogeneous Multicore System-on-a-Chip , 2013, IEEE Transactions on Parallel and Distributed Systems.

[19]  Sufen Fong,et al.  MeshEye: A Hybrid-Resolution Smart Camera Mote for Applications in Distributed Intelligent Surveillance , 2007, 2007 6th International Symposium on Information Processing in Sensor Networks.

[20]  Gregory A. Koenig,et al.  Utility Driven Dynamic Resource Management in an Oversubscribed Energy-Constrained Heterogeneous System , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[21]  Edward G. Coffman,et al.  Computer and job-shop scheduling theory , 1976 .

[22]  Emmanuel Jeannot,et al.  Evaluation and Optimization of the Robustness of DAG Schedules in Heterogeneous Environments , 2010, IEEE Transactions on Parallel and Distributed Systems.

[23]  Douglas G. Down,et al.  Dynamic scheduling for heterogeneous Desktop Grids , 2008, Grid 2008.

[24]  Ewa Deelman,et al.  Community Resources for Enabling Research in Distributed Scientific Workflows , 2014, 2014 IEEE 10th International Conference on e-Science.

[25]  Leon S. Lasdon,et al.  Two-stage Stochastic Optimization for the Allocation of Medical Assets in Steady-state Combat Operations , 2010 .

[26]  R. F. Freund,et al.  Dynamic Mapping of a Class of Independent Tasks onto Heterogeneous Computing Systems , 1999, J. Parallel Distributed Comput..

[27]  Douglas G. Down,et al.  Linear Programming-Based Affinity Scheduling of Independent Tasks on Heterogeneous Computing Systems , 2008, IEEE Transactions on Parallel and Distributed Systems.

[28]  Christina Delimitrou,et al.  QoS-Aware scheduling in heterogeneous datacenters with paragon , 2013, TOCS.

[29]  Ladislau Bölöni,et al.  Robust scheduling of metaprograms , 2002 .

[30]  Kenli Li,et al.  Energy-Efficient Stochastic Task Scheduling on Heterogeneous Computing Systems , 2014, IEEE Transactions on Parallel and Distributed Systems.

[31]  Bora Uçar,et al.  Heuristics for scheduling file-sharing tasks on heterogeneous systems with distributed repositories , 2007, J. Parallel Distributed Comput..

[32]  Anurag Kumar,et al.  Performance Analysis and Scheduling of Stochastic Fork-Join Jobs in a Multicomputer System , 1993, IEEE Trans. Parallel Distributed Syst..

[33]  Anthony A. Maciejewski,et al.  Deadline and energy constrained dynamic resource allocation in a heterogeneous computing environment , 2011, 2011 40th International Conference on Parallel Processing Workshops.

[34]  Yan Alexander Li,et al.  Determining the Execution Time Distribution for a Data Parallel Program in a Heterogeneous Computing Environment , 1997, J. Parallel Distributed Comput..

[35]  Albert Y. Zomaya,et al.  Rescheduling for reliable job completion with the support of clouds , 2010, Future Gener. Comput. Syst..

[36]  G C Sharp,et al.  GPU-based streaming architectures for fast cone-beam CT image reconstruction and demons deformable registration , 2007, Physics in medicine and biology.

[37]  Albert Y. Zomaya,et al.  A Novel State Transition Method for Metaheuristic-Based Scheduling in Heterogeneous Computing Systems , 2008, IEEE Transactions on Parallel and Distributed Systems.

[38]  A. Leon-Garcia,et al.  Probability, statistics, and random processes for electrical engineering , 2008 .

[39]  Howard Jay Siegel,et al.  Robust resource allocation in a cluster based imaging system , 2009, Parallel Comput..

[40]  Wang Yi,et al.  Schedulability analysis for non-preemptive fixed-priority multiprocessor scheduling , 2011, J. Syst. Archit..

[41]  Dhabaleswar K. Panda,et al.  Characterization and enhancement of dynamic mapping heuristics for heterogeneous systems , 2000, Proceedings 2000. International Workshop on Parallel Processing.