Task scheduling strategy based on data replication in scientific Cloud workflows

The invention of Cloud computing as a new model of service provisioning in distributed systems encourages re- searchers to investigate its benefits and drawbacks in executing applications. In recent years, Cloud computing is fast evolving as the target platform for such applications among researchers. Furthermore, new pricing models have been pioneered by Cloud providers that allow users to provision resources and to use them in an efficient manner with significant cost reductions. Ap- proaches for scheduling and data placement is often highly correlated, which take into account a few factors at the same time, and what are the most often adapted to applications data medium and therefore doesn't go to scale. In this work, we propose an optimization approach that takes into account an effective data placement and scheduling of tasks grouped based on data replication in scientific Cloud environments. This proposed approach improve data placement and minimize response time due to scheduling tasks to data centers that contain the majority of the required data.

[1]  Fang Dong,et al.  BAR: An Efficient Data Locality Driven Task Scheduling Algorithm for Cloud Computing , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[2]  Baomin Xu,et al.  Job scheduling algorithm based on Berger model in cloud environment , 2011, Adv. Eng. Softw..

[3]  Jan Broeckhove,et al.  Online cost-efficient scheduling of deadline-constrained workloads on hybrid clouds , 2013, Future Gener. Comput. Syst..

[4]  Helen D. Karatza,et al.  Performance and cost evaluation of Gang Scheduling in a Cloud Computing system with job migrations and starvation handling , 2011, 2011 IEEE Symposium on Computers and Communications (ISCC).

[5]  Sachin D. Babar,et al.  An Efficient Data Locality Driven Task Scheduling Algorithm for Cloud Computing , 2012 .

[6]  Li-zhen Cui,et al.  A Multiple QoS Constrained Scheduling Strategy of Multiple Workflows for Cloud Computing , 2009, 2009 IEEE International Symposium on Parallel and Distributed Processing with Applications.

[7]  Yi Peng,et al.  The analytic hierarchy process: task scheduling and resource allocation in cloud computing environment , 2011, The Journal of Supercomputing.

[8]  Jesús Vigo-Aguiar,et al.  Preface to high performance computing applied to computational problems in science and engineering , 2012, The Journal of Supercomputing.

[9]  Meikang Qiu,et al.  Online optimization for scheduling preemptable tasks on IaaS cloud systems , 2012, J. Parallel Distributed Comput..

[10]  Sanjay Chaudhary,et al.  Policy based resource allocation in IaaS cloud , 2012, Future Gener. Comput. Syst..

[11]  Xiao Liu,et al.  A data placement strategy in scientific cloud workflows , 2010, Future Gener. Comput. Syst..

[12]  Mohamed Othman,et al.  A priority based job scheduling algorithm in cloud computing , 2012 .

[13]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[14]  Chao-Tung Yang,et al.  A Dynamic Resource Allocation Model for Virtual Machine Management on Cloud , 2011, FGIT-GDC.

[15]  Paul J. Schweitzer,et al.  Problem Decomposition and Data Reorganization by a Clustering Technique , 1972, Oper. Res..

[16]  Xiaorong Li,et al.  Hybrid Heuristic for Scheduling Data Analytics Workflow Applications in Hybrid Cloud Environment , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[17]  Ghalem Belalem,et al.  Optimization of Tasks Scheduling by an Efficacy Data Placement and Replication in Cloud Computing , 2013, ICA3PP.

[18]  Sateesh Kumar Peddoju,et al.  A Dynamic Optimization Algorithm for Task Scheduling in Cloud Environment , 2012 .

[19]  Shalini Ramanathan,et al.  Linear Scheduling Strategy for Resource Allocation in Cloud Environment , 2012, CloudCom 2012.

[20]  Rubén S. Montero,et al.  Scheduling strategies for optimal service deployment across multiple clouds , 2013, Future Gener. Comput. Syst..

[21]  Gábor Terstyánszky,et al.  Scientific Workflow Makespan Reduction through Cloud Augmented Desktop Grids , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.