Scheduling Workloads of Workflows in Clusters and Clouds

This dissertation addresses three key challenges that are characteristic to the online scheduling of workloads of workflows in modern distributed computing systems. The first challenge is the realistic estimation of the resource demand of a workflow, as it is important for making good task placement and resource allocation decisions. Usually, workflows consist of segments with different parallelism and different interconnection types between tasks which affect the order how the tasks become eligible. Moreover, realistic task runtime estimates are not always available. The second challenge is the efficient placement of workflow tasks on computing resources for minimizing average workflow slowdown while achieving fairness. A wrongly chosen task placement policy can easily degrade the performance and negatively affect the fair access of workflows to computing resources. The third challenge is the automatic allocation (autoscaling) of computing resources for workflows while meeting deadline and budget constraints. Computing clouds make it possible to easily lease and release resources. Such decisions should be made wisely to minimize slowdowns and deadline violations, and to efficiently use the leased resources to reduce incurred costs. To address these challenges, this dissertation proposes novel scheduling policies for workloads of workflows and investigates the applicability of relevant state-of-the-art policies to the online scenario. For new policies, implementation effort and suitability for production systems are kept in mind. The considered workflow scheduling policies are experimentally evaluated by conducting a wide set of simulation-based and real-world experiments on a private multicluster computer. Additionally, a Mixed Integer Programming (MIP) approach is used to validate the obtained real-world experimental results versus the optimal solution from a MIP solver.

[1]  et al,et al.  Search for gravitational waves from binary inspirals in S3 and S4 LIGO data , 2007, 0704.3368.

[2]  Thilo Kielmann,et al.  Autoscaling Web Applications in Heterogeneous Cloud Infrastructures , 2014, 2014 IEEE International Conference on Cloud Engineering.

[3]  David A. Lifka,et al.  The ANL/IBM SP Scheduling System , 1995, JSSPP.

[4]  Bingsheng He,et al.  Monetary Cost Optimizations for Hosting Workflow-as-a-Service in IaaS Clouds , 2013, IEEE Transactions on Cloud Computing.

[5]  Marty Humphrey,et al.  Scaling and Scheduling to Maximize Application Performance within Budget Constraints in Cloud Workflows , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[6]  Ravi Kumar,et al.  Pig latin: a not-so-foreign language for data processing , 2008, SIGMOD Conference.

[7]  Rajkumar Buyya,et al.  A Dynamic Critical Path Algorithm for Scheduling Scientific Workflow Applications on Global Grids , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).

[8]  Minhaj Ahmad Khan,et al.  Scheduling for heterogeneous Systems using constrained critical paths , 2012, Parallel Comput..

[9]  Alexandru Iosup,et al.  Ananke: A Q-Learning-Based Portfolio Scheduler for Complex Industrial Workflows , 2017, 2017 IEEE International Conference on Autonomic Computing (ICAC).

[10]  Johan Tordsson,et al.  PEAS , 2016, ACM Trans. Model. Perform. Evaluation Comput. Syst..

[11]  Dick H. J. Epema,et al.  The Impact of Task Runtime Estimate Accuracy on Scheduling Workloads of Workflows , 2018, 2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[12]  Cees T. A. M. de Laat,et al.  A Medium-Scale Distributed System for Computer Science Research: Infrastructure for the Long Term , 2016, Computer.

[13]  Alexandru Iosup,et al.  Serverless is More: From PaaS to Present Cloud Computing , 2018, IEEE Internet Computing.

[14]  Rajiv Ranjan,et al.  Osmotic Flow: Osmotic Computing + IoT Workflow , 2017, IEEE Cloud Computing.

[15]  Alexandru Iosup,et al.  Elasticity in Graph Analytics? A Benchmarking Framework for Elastic Graph Processing , 2018, 2018 IEEE International Conference on Cluster Computing (CLUSTER).

[16]  Eric Cano,et al.  IOP : An efficient, modular and simple tape archiving solution for LHC Run-3 , 2017 .

[17]  Rizos Sakellariou,et al.  A Performance Model to Estimate Execution Time of Scientific Workflows on the Cloud , 2014, 2014 9th Workshop on Workflows in Support of Large-Scale Science.

[18]  Hamid Arabnejad,et al.  Multi-workflow QoS-Constrained Scheduling for Utility Computing , 2015, 2015 IEEE 18th International Conference on Computational Science and Engineering.

[19]  Alexandru Iosup,et al.  A Trace-Based Performance Study of Autoscaling Workloads of Workflows in Datacenters , 2018, 2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[20]  Thomas Heinis,et al.  Design and Evaluation of an Autonomic Workflow Engine , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[21]  L. S. Baumann,et al.  Automated workflow control: a key to office productivity , 1980, AFIPS '80.

[22]  Dror G. Feitelson,et al.  The workload on parallel supercomputers: modeling the characteristics of rigid jobs , 2003, J. Parallel Distributed Comput..

[23]  Philipp Leitner,et al.  Patterns in the Chaos—A Study of Performance Variation and Predictability in Public IaaS Clouds , 2014, ACM Trans. Internet Techn..

[24]  Shigang Chen,et al.  Using Integer Programming for Workflow Scheduling in the Cloud , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[25]  Vijay K. Garg,et al.  Algorithms for Dilworth ’ s Chain Partition , 2004 .

[26]  I. Bird Computing for the Large Hadron Collider , 2011 .

[27]  Amit P. Sheth,et al.  An overview of workflow management: From process modeling to workflow automation infrastructure , 1995, Distributed and Parallel Databases.

[28]  Rizos Sakellariou,et al.  A hybrid heuristic for DAG scheduling on heterogeneous systems , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[29]  Robert Ricci,et al.  Taming Performance Variability , 2018, OSDI.

[30]  Jonathan Livny,et al.  Bioinformatic discovery of bacterial regulatory RNAs using SIPHT. , 2012, Methods in molecular biology.

[31]  Alexandru Iosup,et al.  Which Cloud Auto-Scaler Should I Use for my Application?: Benchmarking Auto-Scaling Algorithms , 2016, ICPE.

[32]  Cees T. A. M. de Laat,et al.  CYCLONE: The Multi-cloud Middleware Stack for Application Deployment and Management , 2017, 2017 IEEE International Conference on Cloud Computing Technology and Science (CloudCom).

[33]  Christina Delimitrou,et al.  Quasar: resource-efficient and QoS-aware cluster management , 2014, ASPLOS.

[34]  Luke M. Leslie,et al.  Supporting On-demand Elasticity in Distributed Graph Processing , 2016, 2016 IEEE International Conference on Cloud Engineering (IC2E).

[35]  Andrei Tchernykh,et al.  Multiple Workflow Scheduling Strategies with User Run Time Estimates on a Grid , 2012, Journal of Grid Computing.

[36]  DeelmanEwa,et al.  Algorithms for cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds , 2015 .

[37]  Dick H. J. Epema,et al.  Scheduling Workloads of Workflows with Unknown Task Runtimes , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[38]  Johan Tordsson,et al.  Workload Classification for Efficient Auto-Scaling of Cloud Resources , 2013 .

[39]  Alexandru Iosup,et al.  DGSim: Comparing Grid Resource Management Architectures through Trace-Based Simulation , 2008, Euro-Par.

[40]  Prashant J. Shenoy,et al.  Agile dynamic provisioning of multi-tier Internet applications , 2008, TAAS.

[41]  Philip J. Fleming,et al.  How not to lie with statistics: the correct way to summarize benchmark results , 1986, CACM.

[42]  Marian Bubak,et al.  Scheduling Multilevel Deadline-Constrained Scientific Workflows on Clouds Based on Cost Optimization , 2015, Sci. Program..

[43]  Joel H. Saltz,et al.  A Duplication Based Algorithm for Optimizing Latency Under Throughput Constraints for Streaming Workflows , 2008, 2008 37th International Conference on Parallel Processing.

[44]  Mark J. Clement,et al.  Core Algorithms of the Maui Scheduler , 2001, JSSPP.

[45]  Y.-K. Kwok,et al.  Static scheduling algorithms for allocating directed task graphs to multiprocessors , 1999, CSUR.

[46]  Alexandru Iosup,et al.  On the Performance Variability of Production Cloud Services , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[47]  Daniel S. Katz,et al.  Montage: a grid portal and software toolkit for science-grade astronomical image mosaicking , 2009, Int. J. Comput. Sci. Eng..

[48]  Edward R. Marsh The Harmonogram of Karol Adamiecki. , 1974 .

[49]  Dennis Gannon,et al.  Workflows for e-Science, Scientific Workflows for Grids , 2014 .

[50]  Cees T. A. M. de Laat,et al.  CYCLONE: A Platform for Data Intensive Scientific Applications in Heterogeneous Multi-cloud/Multi-provider Environment , 2016, 2016 IEEE International Conference on Cloud Engineering Workshop (IC2EW).

[51]  P. Sadayappan,et al.  Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement , 2003, JSSPP.

[52]  James E. Kelley,et al.  Critical-path planning and scheduling , 1899, IRE-AIEE-ACM '59 (Eastern).

[53]  Bernd Freisleben,et al.  On-Demand Resource Provisioning for BPEL Workflows Using Amazon's Elastic Compute Cloud , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[54]  J. R. Wieland,et al.  Queueing-network stability: simulation-based checking , 2003, Proceedings of the 2003 Winter Simulation Conference, 2003..

[55]  Jin-Soo Kim,et al.  Cost optimized provisioning of elastic resources for application workflows , 2011, Future Gener. Comput. Syst..

[56]  Andreas Neumann,et al.  Oozie: towards a scalable workflow management system for Hadoop , 2012, SWEET '12.

[57]  Rizos Sakellariou,et al.  Scheduling multiple DAGs onto heterogeneous systems , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[58]  Hamid Arabnejad,et al.  Multi-QoS constrained and Profit-aware scheduling approach for concurrent workflows on heterogeneous systems , 2017, Future Gener. Comput. Syst..

[59]  Dror G. Feitelson,et al.  Workload Modeling for Computer Systems Performance Evaluation , 2015 .

[60]  Adam Belloum,et al.  Execution Time Estimation for Workflow Scheduling , 2014, 2014 9th Workshop on Workflows in Support of Large-Scale Science.

[61]  Samuel Kounev,et al.  Elasticity in Cloud Computing: What It Is, and What It Is Not , 2013, ICAC.

[62]  Rajkumar Buyya,et al.  Budget-Driven Scheduling of Scientific Workflows in IaaS Clouds with Fine-Grained Billing Periods , 2017, ACM Trans. Auton. Adapt. Syst..

[63]  Alexandru Iosup,et al.  An Experimental Performance Evaluation of Autoscaling Policies for Complex Workflows , 2017, ICPE.

[64]  Jacques van Helden,et al.  Regulatory Sequence Analysis Tools , 2003, Nucleic Acids Res..

[65]  Ajay Mohindra,et al.  Dynamic Scaling of Web Applications in a Virtualized Cloud Computing Environment , 2009, 2009 IEEE International Conference on e-Business Engineering.

[66]  Wallace Clark,et al.  The Gantt chart : a working tool of management , 2022 .

[67]  Liu Jin,et al.  A Dynamic Scheduling Method of Earth-Observing Satellites by Employing Rolling Horizon Strategy , 2013, TheScientificWorldJournal.

[68]  Radu Prodan,et al.  ON THE CHARACTERISTICS OF GRID WORKFLOWS , 2008 .

[69]  George B. Dantzig,et al.  Linear Programming 1: Introduction , 1997 .

[70]  Zhenyu Wen,et al.  Fog Orchestration for Internet of Things Services , 2017, IEEE Internet Computing.

[71]  Dick H. J. Epema,et al.  Deadline-constrained workflow scheduling algorithms for Infrastructure as a Service Clouds , 2013, Future Gener. Comput. Syst..

[72]  Paul Watson,et al.  e-Science Central: Cloud-based e-Science and its application to chemical property modelling , 2010 .

[73]  Samuel Kounev,et al.  BUNGEE: An Elasticity Benchmark for Self-Adaptive IaaS Cloud Environments , 2015, 2015 IEEE/ACM 10th International Symposium on Software Engineering for Adaptive and Self-Managing Systems.

[74]  Tristan Glatard,et al.  Controlling fairness and task granularity in distributed, online, non‐clairvoyant workflow executions , 2014, Concurr. Comput. Pract. Exp..

[75]  Ishfaq Ahmad,et al.  Benchmarking and Comparison of the Task Graph Scheduling Algorithms , 1999, J. Parallel Distributed Comput..

[76]  Zhen Feng,et al.  Fairness scheduling with dynamic priority for multi workflow on heterogeneous systems , 2017, 2017 IEEE 2nd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA).

[77]  Kuo-Chan Huang,et al.  Online scheduling of workflow applications in grid environments , 2011, Future Gener. Comput. Syst..

[78]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[79]  Alexandru Iosup,et al.  Performance-Feedback Autoscaling with Budget Constraints for Cloud-based Workloads of Workflows , 2019, ArXiv.

[80]  David Mazières,et al.  EyeQ: Practical Network Performance Isolation for the Multi-tenant Cloud , 2012, HotCloud.

[81]  Jarek Nabrzyski,et al.  Cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[82]  Samuel Kounev,et al.  Chamulteon: Coordinated Auto-Scaling of Micro-Services , 2019, 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS).

[83]  Ishfaq Ahmad,et al.  Dynamic Critical-Path Scheduling: An Effective Technique for Allocating Task Graphs to Multiprocessors , 1996, IEEE Trans. Parallel Distributed Syst..

[84]  R. P. Dilworth,et al.  A DECOMPOSITION THEOREM FOR PARTIALLY ORDERED SETS , 1950 .

[85]  Hamid Arabnejad,et al.  FAIR RESOURCE SHARING FOR DYNAMIC SCHEDULING OF WORKFLOWS ON HETEROGENEOUS SYSTEMS , 2014, HiPC 2014.

[86]  Marian Bubak,et al.  Prediction-based auto-scaling of scientific workflows , 2011, MGC '11.

[87]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[88]  Scientific Management , 2008, Nature.

[89]  Waheed Iqbal,et al.  Adaptive resource provisioning for read intensive multi-tier applications in the cloud , 2011, Future Gener. Comput. Syst..

[90]  Lavanya Ramakrishnan,et al.  The future of scientific workflows , 2018, Int. J. High Perform. Comput. Appl..

[91]  Weisong Shi,et al.  A Planner-Guided Scheduling Strategy for Multiple Workflow Applications , 2008, 2008 International Conference on Parallel Processing - Workshops.

[92]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[93]  Dick H. J. Epema,et al.  Cost-driven scheduling of grid workflows using Partial Critical Paths , 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.

[94]  V PapadopoulosAlessandro,et al.  An Experimental Performance Evaluation of Autoscalers for Complex Workflows , 2018 .

[95]  Dragos Manolescu,et al.  Production workflow: concepts and techniques , 2001, SOEN.

[96]  Hamid Arabnejad,et al.  Fairness Resource Sharing for Dynamic Workflow Scheduling on Heterogeneous Systems , 2012, 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications.

[97]  Jun Qin,et al.  ASKALON: A Development and Grid Computing Environment for Scientific Workflows , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[98]  Alexandru Iosup,et al.  Inter-operating grids through delegated matchmaking , 2007, Proceedings of the 2007 ACM/IEEE Conference on Supercomputing (SC '07).

[99]  Bryan Ng,et al.  Scheduling deadline constrained scientific workflows on dynamically provisioned cloud resources , 2017, Future Gener. Comput. Syst..

[100]  Dennis Gannon,et al.  Scientific versus Business Workflows , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[101]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[102]  Douglas Thain,et al.  Distributed computing in practice: the Condor experience , 2005, Concurr. Pract. Exp..

[103]  Rouven Krebs,et al.  Ready for Rain? A View from SPEC Research on the Future of Cloud Metrics , 2016, ArXiv.

[104]  Ewa Deelman,et al.  The interplay of resource provisioning and workflow optimization in scientific applications , 2011, Concurr. Comput. Pract. Exp..

[105]  Ann L. Chervenak,et al.  Characterizing and profiling scientific workflows , 2013, Future Gener. Comput. Syst..

[106]  Steven Bohez,et al.  Dynamic auto-scaling and scheduling of deadline constrained service workloads on IaaS clouds , 2016, J. Syst. Softw..

[107]  Jin Sun,et al.  A Review of Cost and Makespan-Aware Workflow Scheduling in Clouds , 2019, J. Circuits Syst. Comput..

[108]  Arif Merchant,et al.  Minerva: An automated resource provisioning tool for large-scale storage systems , 2001, TOCS.

[109]  Domenico Talia,et al.  Clouds for Scalable Big Data Analytics , 2013, Computer.

[110]  Miron Livny,et al.  Online Task Resource Consumption Prediction for Scientific Workflows , 2015, Parallel Process. Lett..

[111]  Ashley Shade,et al.  Computing Workflows for Biologists: A Roadmap , 2015, PLoS biology.

[112]  Xiaorong Li,et al.  Hybrid Heuristic for Scheduling Data Analytics Workflow Applications in Hybrid Cloud Environment , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[113]  Jan Mendling,et al.  Process Simulation for Machine Reservation in Cloud Manufacturing , 2018, 2018 IEEE 16th International Conference on Industrial Informatics (INDIN).

[114]  Ramesh K. Sitaraman,et al.  Optimizing the video transcoding workflow in content delivery networks , 2015, MMSys.

[115]  Cynthia Bailey Lee,et al.  Are User Runtime Estimates Inherently Inaccurate? , 2004, JSSPP.

[116]  Ioannis Konstantinou,et al.  Dependable Horizontal Scaling Based on Probabilistic Model Checking , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[117]  Prashant Pandey,et al.  Cloud computing , 2010, ICWET.

[118]  Dick H. J. Epema,et al.  Towards a Realistic Scheduler for Mixed Workloads with Workflows , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[119]  Rizos Sakellariou,et al.  Scheduling Data-IntensiveWorkflows onto Storage-Constrained Distributed Resources , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[120]  Ajit Kembhavi Big Data in Astronomy and Beyond , 2018 .

[121]  Dror G. Feitelson,et al.  Supporting priorities and improving utilization of the IBM SP scheduler using slack-based backfilling , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[122]  J. T. Childers,et al.  Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC , 2012 .

[123]  Thomas Rauber,et al.  A source code analyzer for performance prediction , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[124]  Dror G. Feitelson,et al.  Backfilling with lookahead to optimize the packing of parallel jobs , 2005, J. Parallel Distributed Comput..

[125]  Dror G. Feitelson,et al.  Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling , 2001, IEEE Trans. Parallel Distributed Syst..

[126]  Miron Livny,et al.  Pegasus, a workflow management system for science automation , 2015, Future Gener. Comput. Syst..

[127]  Miron Livny,et al.  The cost of doing science on the cloud: The Montage example , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[128]  B. Cohen,et al.  Incentives Build Robustness in Bit-Torrent , 2003 .

[129]  Norman W. Paton,et al.  Adaptive Workflow Processing and Execution in Pegasus , 2008, 2008 The 3rd International Conference on Grid and Pervasive Computing - Workshops.

[130]  Andrea C. Arpaci-Dusseau,et al.  Towards transparent cpu scheduling , 2011 .

[131]  Prasanta K. Jana,et al.  A novel cost-efficient approach for deadline-constrained workflow scheduling by dynamic provisioning of resources , 2018, Future Gener. Comput. Syst..

[132]  Samuel Kounev,et al.  Evaluating approaches to resource demand estimation , 2015, Perform. Evaluation.

[133]  Noel M. O'Boyle,et al.  cclib: A library for package‐independent computational chemistry algorithms , 2008, J. Comput. Chem..

[134]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[135]  Dejan S. Milojicic,et al.  OpenNebula: A Cloud Management Tool , 2011, IEEE Internet Computing.

[136]  H. A. David Ranking from unbalanced paired-comparison data , 1987 .

[137]  Qingbo Wu,et al.  Workflow scheduling in cloud: a survey , 2015, The Journal of Supercomputing.

[138]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[139]  Дотдаева Ольга Николаевна Принципы организации и деятельности органов государственной власти Ставропольского края , 2012 .

[140]  Mei-Hui Su,et al.  Characterization of scientific workflows , 2008, 2008 Third Workshop on Workflows in Support of Large-Scale Science.

[141]  Eytan Modiano,et al.  Fairness and Optimal Stochastic Control for Heterogeneous Networks , 2005, IEEE/ACM Transactions on Networking.

[142]  The Ligo Scientific Collaboration,et al.  GW170817: Observation of Gravitational Waves from a Binary Neutron Star Inspiral , 2017, 1710.05832.

[143]  Christopher Olston,et al.  Stateful bulk processing for incremental analytics , 2010, SoCC '10.

[144]  Johan Tordsson,et al.  An adaptive hybrid elasticity controller for cloud infrastructures , 2012, 2012 IEEE Network Operations and Management Symposium.

[145]  Mor Harchol-Balter,et al.  AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers , 2012, TOCS.

[146]  Johan Tordsson,et al.  Efficient provisioning of bursty scientific workloads on the cloud using adaptive elasticity control , 2012, ScienceCloud '12.

[147]  Salim Hariri,et al.  Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing , 2002, IEEE Trans. Parallel Distributed Syst..

[148]  Alexandru Iosup,et al.  KOALA-C: A task allocator for integrated multicluster and multicloud environments , 2014, 2014 IEEE International Conference on Cluster Computing (CLUSTER).

[149]  Alexandru Iosup,et al.  The BTWorld use case for big data analytics: Description, MapReduce logical workflow, and empirical evaluation , 2013, 2013 IEEE International Conference on Big Data.

[150]  José Antonio Lozano,et al.  A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments , 2014, Journal of Grid Computing.