Executing Bag of Distributed Tasks on the Cloud: Investigating the Trade-Offs between Performance and Cost

Bag of Distributed Tasks (BoDT) can benefit from decentralised execution on the Cloud. However, there is a trade-off between the performance that can be achieved by employing a large number of Cloud VMs for the tasks and the monetary constraints that are often placed by a user. The research reported in this paper is motivated towards investigating this trade-off so that an optimal plan for deploying BoDT applications on the cloud can be generated. A heuristic algorithm, which considers the user's preference of performance and cost is proposed and implemented. The feasibility of the algorithm is demonstrated by generating execution plans for a sample application. The key result is that the algorithm generates optimal execution plans for the application over 91% of the time.

[1]  Abhishek Chandra,et al.  Nebula: Distributed Edge Cloud for Data Intensive Computing , 2014, 2014 IEEE International Conference on Cloud Engineering.

[2]  Gueyoung Jung,et al.  Optimal Time-Cost Tradeoff of Parallel Service Workflow in Federated Heterogeneous Clouds , 2013, 2013 IEEE 20th International Conference on Web Services.

[3]  Johan Tordsson,et al.  Cloud brokering mechanisms for optimized placement of virtual machines across multiple providers , 2012, Future Gener. Comput. Syst..

[4]  Luiz Fernando Bittencourt,et al.  On the Performance-Cost Tradeoff for Workflow Scheduling in Hybrid Clouds , 2013, 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing.

[5]  Zibin Zheng,et al.  Scaling Service-Oriented Applications into Geo-distributed Clouds , 2013, 2013 IEEE Seventh International Symposium on Service-Oriented System Engineering.

[6]  Qi Zhang,et al.  Dynamic Service Placement in Geographically Distributed Clouds , 2013, IEEE J. Sel. Areas Commun..

[7]  Thilo Kielmann,et al.  Bag-of-Tasks Scheduling under Budget Constraints , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[8]  Quanyan Zhu,et al.  Dynamic Service Placement in Geographically Distributed Clouds , 2012, IEEE Journal on Selected Areas in Communications.

[9]  Zibin Zheng,et al.  A Latency-Aware Co-deployment Mechanism for Cloud-Based Services , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[10]  Rubén S. Montero,et al.  Scheduling strategies for optimal service deployment across multiple clouds , 2013, Future Gener. Comput. Syst..

[11]  Hangwei Qian,et al.  Towards Proximity-aware Application Deployment in Geo-distributed Clouds , 2013, CSA 2013.

[12]  Gabriel Antoniu,et al.  SAGE: Geo-Distributed Streaming Data Analysis in Clouds , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[13]  David E. Culler,et al.  PlanetLab: an overlay testbed for broad-coverage services , 2003, CCRV.

[14]  Adam Barker,et al.  Location, Location, Location: Data-Intensive Distributed Computing in the Cloud , 2013, 2013 IEEE 5th International Conference on Cloud Computing Technology and Science.

[15]  Scott Shenker,et al.  Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling , 2010, EuroSys '10.

[16]  Kavitha Ranganathan,et al.  Decoupling computation and data scheduling in distributed data-intensive applications , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.