Shared data-aware dynamic resource provisioning and task scheduling for data intensive applications on hybrid clouds using Aneka

Abstract In the recent years, data-intensive applications have been growing at an increasing rate and there is a critical need to solve the high-performance and scalability issues. Hybrid Cloud Computing paradigm provides a promising solution to harness local infrastructure and remote resources and provide high Quality of Service (QoS) for time sensitive and data-intensive applications. Generally, hybrid cloud deployments have a heterogeneous pool of resources and it becomes a challenging task to efficiently utilize resources to provide optimum results. In modern data hungry applications, it is crucial to optimize bandwidth consumption, latency and networking overheads. Moreover, most of them have large extent of file sharing capability. The existing algorithms do not explicitly consider file sharing scenarios that leads large data transmission times and has severe effects on latency. In this direction, this paper focuses on building upon existing dynamic resource provisioning and task scheduling algorithms to provide better QoS in hybrid cloud environments for data intensive applications in a shared file task environment. The efficiency of proposed algorithms is demonstrated by deploying them on Microsoft Azure using Aneka, a platform for developing scalable applications on the Cloud. Experiments using real-world applications and datasets show that proposed algorithms are able to allocate tasks and extend to public cloud resources more efficiently, reducing deadline violations and improving response times to give response time reduction of upto 40.12% for a sample local alignment search application on genome sequences.

[1]  Sarbjeet Singh,et al.  A review of metaheuristic scheduling techniques in cloud computing , 2015 .

[2]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[3]  Victor I. Chang,et al.  Online learning offloading framework for heterogeneous mobile edge computing system , 2019, J. Parallel Distributed Comput..

[4]  Li Yan,et al.  A Multi-Objective Hybrid Cloud Resource Scheduling Method Based on Deadline and Cost Constraints , 2017, IEEE Access.

[5]  Rajkumar Buyya,et al.  The Aneka platform and QoS-driven resource provisioning for elastic applications on hybrid Clouds , 2012, Future Gener. Comput. Syst..

[6]  Rajkumar Buyya,et al.  EdgeLens: Deep Learning based Object Detection in Integrated IoT, Fog and Cloud Computing Environments , 2019, 2019 4th International Conference on Information Systems and Computer Networks (ISCON).

[7]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[8]  David Abramson,et al.  The Virtual Laboratory: a toolset to enable distributed molecular modelling for drug design on the World‐Wide Grid , 2003, Concurr. Comput. Pract. Exp..

[9]  C. R. Tripathy,et al.  Deadline based task scheduling using multi-criteria decision-making in cloud environment , 2018, Ain Shams Engineering Journal.

[10]  Prasant Kumar Pattnaik,et al.  An enhanced deadline constraint based task scheduling mechanism for cloud environment , 2018, J. King Saud Univ. Comput. Inf. Sci..

[11]  Rajkumar Buyya,et al.  Future Generation Computer Systems Deadline-driven Provisioning of Resources for Scientific Applications in Hybrid Clouds with Aneka , 2022 .

[12]  D. A. Bromley,et al.  An advanced computer-based nuclear physics data acquisition system , 1967 .

[13]  Rajkumar Buyya,et al.  FogBus: A Blockchain-based Lightweight Framework for Edge and Fog Computing , 2018, J. Syst. Softw..

[14]  GaniAbdullah,et al.  The rise of "big data" on cloud computing , 2015 .

[16]  Rajkumar Buyya,et al.  Multi-cloud resource provisioning with Aneka: A unified and integrated utilisation of microsoft azure and amazon EC2 instances , 2015, 2015 International Conference on Computing and Network Communications (CoCoNet).

[17]  Mahmood Ahmadi,et al.  Cost minimization for deadline-constrained bag-of-tasks applications in federated hybrid clouds , 2017, Future Gener. Comput. Syst..

[18]  DeelmanEwa,et al.  Algorithms for cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds , 2015 .

[19]  Wang Dong,et al.  Deadline based scheduling for data-intensive applications in clouds , 2016 .

[20]  Rajkumar Buyya,et al.  HealthFog: An Ensemble Deep Learning based Smart Healthcare System for Automatic Diagnosis of Heart Diseases in Integrated IoT and Fog Computing Environments , 2019, Future Gener. Comput. Syst..

[21]  Xinghui Zhao,et al.  A Framework for Privacy-Aware Computing on Hybrid Clouds with Mixed-Sensitivity Data , 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.

[22]  Jarek Nabrzyski,et al.  Cost minimization for computational applications on hybrid cloud infrastructures , 2013, Future Gener. Comput. Syst..

[23]  Yue-Shan Chang,et al.  VM instance selection for deadline constraint job on agent-based interconnected cloud , 2018, Future Gener. Comput. Syst..

[24]  Richard O. Sinnott,et al.  Resource provisioning for data-intensive applications with deadline constraints on hybrid clouds using Aneka , 2018, Future Gener. Comput. Syst..

[25]  Marimuthu Palaniswami,et al.  Internet of Things (IoT): A vision, architectural elements, and future directions , 2012, Future Gener. Comput. Syst..

[26]  C. R. Tripathy,et al.  Deadline sensitive lease scheduling in cloud computing environment using AHP , 2016, J. King Saud Univ. Comput. Inf. Sci..

[27]  Jan Broeckhove,et al.  Online cost-efficient scheduling of deadline-constrained workloads on hybrid clouds , 2013, Future Gener. Comput. Syst..

[28]  What Are All Those Funny Symbols in a Blast Printout? Blast = Basic Local Alignment Search Tool , 2022 .

[29]  Rajkumar Buyya,et al.  An API for Development of User Defined Scheduling Algorithms in Aneka PaaS Cloud Software , 2018, Handbook of Research on Cloud Computing and Big Data Applications in IoT.

[30]  Rajkumar Buyya,et al.  Aneka: a Software Platform for .NET based Cloud Computing , 2009, High Performance Computing Workshop.

[31]  VanmechelenKurt,et al.  Online cost-efficient scheduling of deadline-constrained workloads on hybrid clouds , 2013 .

[32]  Helen D. Karatza,et al.  Multi-criteria scheduling of Bag-of-Tasks applications on heterogeneous interlinked clouds with simulated annealing , 2015, J. Syst. Softw..