Optimised auto-scaling for cloud-based web service

Elasticity and cost-effectiveness are two key features for ensuring that cloud-based web services appeal to more businesses. However, true elasticity and cost-effectiveness in the pay-per-use cloud business model has not yet been fully achieved. The explosion of cloud-based web services brings new challenges to enable the automatic scaling up and down of service provision when the workload is time-varying. This research studies the problems associated with these challenges. It proposes a novel scheme to achieve optimised auto-scaling for cloudbased web services from three levels of cloud structure: Software as a Service (SaaS), Platform as a Service (PaaS), and Infrastructure as a Service (IaaS). At the various levels, auto-scaling for cloud-based web services has different problems and requires different solutions. At the SaaS level, this study investigates how to design and develop scalable web services, especially for time-consuming applications. To achieve the greatest efficiency, the optimisation of service provision problem is studied by providing the minimum functionality and fastest scalability performance concerning the speed-up curve and QoS (Quality of Service) of the SLA (Service-Level Agreement). At the PaaS level, this work studies how to support dynamic re-configuration when workloads change and the effective deployment of various kinds of web services to the cloud. To achieve optimised auto-scaling of this deployment, a platform is designed to deploy all web services automatically with the minimal number of cloud resources by satisfying the QoS of SLAs. At the IaaS level for two infrastructure resources of virtual machine (VM) and virtual network (VN), this research focuses on studying two types of cloud-based web service: computation-intensive and bandwidth-intensive. To address the optimised auto-scaling problem for computation-intensive cloud-based web service, data-driven VM auto-scaling approaches are proposed to handle the workload in both stable and dynamic environments. To address the optimised autoscaling problem for bandwidth-intensive cloud-based web service, this study proposes a novel approach to predict the volume of requests and dynamically adjust the software defined network (SDN)-based network configuration in the cloud to auto-scale the service with minimal cost. This research proposes comprehensive and profound perspectives to solve the auto-scaling optimisation problems for cloud-based web services. The proposed approaches not only enable cloud-based web services to minimise resource consumption while auto-scaling service provision to achieve satisfying performance, but also save energy consumption for the global realisation of green computing. The performance of the proposed approaches has been evaluated on a public platform (e.g. Amazon EC2) with the real dataset workload of web services. The experiment results demonstrate that the proposed approaches are practicable and achieve superior performance to other benchmark methods.

[1]  E. Madeira,et al.  Prediction-based Auto-scaling of Scientific Workflows , 2011 .

[2]  Autoflex: Service Agnostic Auto-scaling Framework for IaaS Deployment Models , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[3]  Steven Hand,et al.  Self-adaptive and self-configured CPU resource provisioning for virtualized servers using Kalman filters , 2009, ICAC '09.

[4]  Ying Zhang,et al.  Integrating Resource Consumption and Allocation for Infrastructure Resources on-Demand , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[5]  Waheed Iqbal,et al.  Adaptive resource provisioning for read intensive multi-tier applications in the cloud , 2011, Future Gener. Comput. Syst..

[6]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[7]  Rajkumar Buyya,et al.  Energy Efficient Allocation of Virtual Machines in Cloud Data Centers , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[8]  Jie Lu,et al.  Optimal Cloud Resource Auto-Scaling for Web Applications , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[9]  Mor Harchol-Balter,et al.  Distributed, Robust Auto-Scaling Policies for Power Management in Compute Intensive Server Farms , 2011, 2011 Sixth Open Cirrus Summit.

[10]  Sunilkumar S. Manvi,et al.  Resource management for Infrastructure as a Service (IaaS) in cloud computing: A survey , 2014, J. Netw. Comput. Appl..

[11]  Mor Harchol-Balter,et al.  AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers , 2012, TOCS.

[12]  Kenli Li,et al.  Customer-Satisfaction-Aware Optimal Multiserver Configuration for Profit Maximization in Cloud Computing , 2017, IEEE Transactions on Sustainable Computing.

[13]  Symeon Papavassiliou,et al.  Energy Aware Networked Cloud Mapping , 2013, 2013 IEEE 12th International Symposium on Network Computing and Applications.

[14]  Rajkumar Buyya,et al.  SLA-Based Resource Allocation for Software as a Service Provider (SaaS) in Cloud Computing Environments , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[15]  Ioannis Konstantinou,et al.  Automated, Elastic Resource Provisioning for NoSQL Clusters Using TIRAMOLA , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[16]  Massoud Pedram,et al.  SLA-based Optimization of Power and Migration Cost in Cloud Computing , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[17]  Cheng-Zhong Xu,et al.  URL: A unified reinforcement learning approach for autonomic cloud management , 2012, J. Parallel Distributed Comput..

[18]  Marin Litoiu,et al.  Exploring Alternative Approaches to Implement an Elasticity Policy , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[19]  Marin Litoiu,et al.  Feedback-based optimization of a private cloud , 2012, Future Gener. Comput. Syst..

[20]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[21]  Vipin Kumar,et al.  Scalability of Parallel Algorithms for the All-Pairs Shortest-Path Problem , 1991, J. Parallel Distributed Comput..

[22]  Gang Wang,et al.  Appliance-Based Autonomic Provisioning Framework for Virtualized Outsourcing Data Center , 2007, Fourth International Conference on Autonomic Computing (ICAC'07).

[23]  Andrzej Kochut,et al.  Dynamic Placement of Virtual Machines for Managing SLA Violations , 2007, 2007 10th IFIP/IEEE International Symposium on Integrated Network Management.

[24]  Rajkumar Buyya,et al.  Cloudbus Toolkit for Market-Oriented Cloud Computing , 2009, CloudCom.

[25]  Don H. Johnson,et al.  Symmetrizing the Kullback-Leibler Distance , 2001 .

[26]  Ramin Yahyapour,et al.  SDN-based cloud computing networking , 2013, 2013 15th International Conference on Transparent Optical Networks (ICTON).

[27]  Mor Harchol-Balter,et al.  Optimality analysis of energy-performance trade-off for server farm management , 2010, Perform. Evaluation.

[28]  Jie Yang,et al.  A Profile-Based Approach to Just-in-Time Scalability for Cloud Applications , 2009, 2009 IEEE International Conference on Cloud Computing.

[29]  Erik Elmroth,et al.  A virtual machine re-packing approach to the horizontal vs. vertical elasticity trade-off for cloud autoscaling , 2013, CAC.

[30]  Eyal de Lara,et al.  SnowFlock: rapid virtual machine cloning for cloud computing , 2009, EuroSys '09.

[31]  Claus Pahl,et al.  Autonomic resource provisioning for cloud-based software , 2014, SEAMS 2014.

[32]  Anand Sivasubramaniam,et al.  Issues in Understanding the Scalability of Parallel Systems , 1994 .

[33]  Xue Liu,et al.  Optimal multivariate control for differentiated services on a shared hosting platform , 2007, 2007 46th IEEE Conference on Decision and Control.

[34]  Soonwook Hwang,et al.  An allocation and provisioning model of science cloud for high throughput computing applications , 2013, CAC.

[35]  Gerald Tesauro,et al.  Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies , 2007, IEEE Internet Computing.

[36]  Alexandru Iosup,et al.  Performance Analysis of Cloud Computing Services for Many-Tasks Scientific Computing , 2011, IEEE Transactions on Parallel and Distributed Systems.

[37]  Aniruddha S. Gokhale,et al.  Efficient Autoscaling in the Cloud Using Predictive Models for Workload Forecasting , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[38]  Richard E. Brown,et al.  Report to Congress on Server and Data Center Energy Efficiency: Public Law 109-431 , 2008 .

[39]  Rajkumar Buyya,et al.  Dynamically scaling applications in the cloud , 2011, CCRV.

[40]  Kevin Lee,et al.  Empirical prediction models for adaptive resource provisioning in the cloud , 2012, Future Gener. Comput. Syst..

[41]  H. L. Gray,et al.  Applied time series analysis , 2011 .

[42]  Geoffrey C. Fox,et al.  MapReduce for Data Intensive Scientific Analyses , 2008, 2008 IEEE Fourth International Conference on eScience.

[43]  Dmytro Dyachuk,et al.  Maximizing Cloud Providers' Revenues via Energy Aware Allocation Policies , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[44]  Henning Schulzrinne,et al.  Predicting the Upper Bound of Web Traffic Volume Using a Multiple Time Scale Approach , 2003, WWW.

[45]  Kang G. Shin,et al.  Automated control of multiple virtualized resources , 2009, EuroSys '09.

[46]  Alexander L. Stolyar,et al.  Online algorithms for joint application-VM-physical-machine auto-scaling in a cloud , 2014, SIGMETRICS '14.

[47]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[48]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[49]  T. V. Lakshman,et al.  Network aware resource allocation in distributed clouds , 2012, 2012 Proceedings IEEE INFOCOM.

[50]  Sam Malek,et al.  A framework for utility-based service oriented design in SASSY , 2010, WOSP/SIPEW '10.

[51]  Marco Mellia,et al.  Dissecting Video Server Selection Strategies in the YouTube CDN , 2011, 2011 31st International Conference on Distributed Computing Systems.

[52]  Fang Hao,et al.  Unreeling netflix: Understanding and improving multi-CDN movie delivery , 2012, 2012 Proceedings IEEE INFOCOM.

[53]  Adnan Ashraf,et al.  Cost-Efficient Virtual Machine Provisioning for Multi-tier Web Applications and Video Transcoding , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[54]  Christopher Stewart,et al.  Exploiting nonstationarity for performance prediction , 2007, EuroSys '07.

[55]  Thomas Hofmann,et al.  Map-Reduce for Machine Learning on Multicore , 2007 .

[56]  Meng Wang,et al.  Consolidating virtual machines with dynamic bandwidth demand in data centers , 2011, 2011 Proceedings IEEE INFOCOM.

[57]  Baochun Li,et al.  Quality-assured cloud bandwidth auto-scaling for video-on-demand applications , 2012, 2012 Proceedings IEEE INFOCOM.

[58]  Marin Litoiu,et al.  Performance model driven QoS guarantees and optimization in clouds , 2009, 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing.

[59]  Zhuzhong Qian,et al.  A game theoretical method for auto-scaling of multi-tiers web applications in cloud , 2012, Internetware.

[60]  Jie Li,et al.  Cloud auto-scaling with deadline and budget constraints , 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.

[61]  Rakesh Bobba,et al.  Towards SDN enabled network control delegation in clouds , 2013, 2013 43rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN).

[62]  Artur Andrzejak,et al.  Decision Model for Cloud Computing under SLA Constraints , 2010, 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[63]  Jeffrey S. Chase,et al.  Automated control for elastic storage , 2010, ICAC '10.

[64]  Donald F. Specht,et al.  A general regression neural network , 1991, IEEE Trans. Neural Networks.

[65]  Yong Yan,et al.  Measuring and Analyzing Parallel Computing Scalability , 1994, ICPP.

[66]  Rajkumar Buyya,et al.  Energy-Efficient Management of Data Center Resources for Cloud Computing: A Vision, Architectural Elements, and Open Challenges , 2010, PDPTA.

[67]  Vijay K. Gurbani,et al.  Network-aware service placement in a distributed cloud environment , 2012, SIGCOMM '12.

[68]  Milind A. Bhandarkar,et al.  MapReduce programming with apache Hadoop , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[69]  Marin Litoiu,et al.  Optimal autoscaling in a IaaS cloud , 2012, ICAC '12.

[70]  Helen J. Wang,et al.  SecondNet: a data center network virtualization architecture with bandwidth guarantees , 2010, CoNEXT.

[71]  Nagarajan Kandasamy,et al.  Online control for self-management in computing systems , 2004, Proceedings. RTAS 2004. 10th IEEE Real-Time and Embedded Technology and Applications Symposium, 2004..

[72]  Bradley N. Miller,et al.  MovieLens unplugged: experiences with an occasionally connected recommender system , 2003, IUI '03.

[73]  Fang Shu-fen,et al.  Stochastic Linear Optimization for Modeling Uncertainty in Aggregate Production Planning , 2006, International Conference on Autonomic and Autonomous Systems (ICAS'06).

[74]  Hyong S. Kim,et al.  How to tame your VMs: an automated control system for virtualized services , 2010 .

[75]  Fermín Galán Márquez,et al.  From infrastructure delivery to service management in clouds , 2010, Future Gener. Comput. Syst..

[76]  Zhi-Li Zhang,et al.  Vivisecting YouTube: An active measurement study , 2012, 2012 Proceedings IEEE INFOCOM.

[77]  Carl M. Harris,et al.  Fundamentals of queueing theory , 1975 .

[78]  Ajay Mohindra,et al.  Dynamic Scaling of Web Applications in a Virtualized Cloud Computing Environment , 2009, 2009 IEEE International Conference on e-Business Engineering.

[79]  Hui Li,et al.  SLA-driven planning and optimization of enterprise applications , 2010, WOSP/SIPEW '10.

[80]  Alexander L. Stolyar,et al.  An Infinite Server System with General Packing Constraints , 2012, Oper. Res..

[81]  Chu-Fen Li,et al.  Cloud Computing System Management Under Flat Rate Pricing , 2011, Journal of Network and Systems Management.

[82]  Nagarajan Kandasamy,et al.  Power and performance management of virtualized computing environments via lookahead control , 2008, 2008 International Conference on Autonomic Computing.

[83]  ZhiHui Lv,et al.  CloudStreamMedia: A Cloud Assistant Global Video on Demand Leasing Scheme , 2013, 2013 IEEE International Conference on Services Computing.

[84]  Ashraf Aboulnaga,et al.  Automatic virtual machine configuration for database workloads , 2008, SIGMOD Conference.

[85]  Asser N. Tantawi,et al.  Dynamic placement for clustered web applications , 2006, WWW '06.

[86]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[87]  Qing He,et al.  Parallel K-Means Clustering Based on MapReduce , 2009, CloudCom.

[88]  Fang Hao,et al.  A tale of three CDNs: An active measurement study of Hulu and its CDNs , 2012, 2012 Proceedings IEEE INFOCOM Workshops.

[89]  Dmytro Dyachuk,et al.  Optimizing Cloud providers revenues via energy efficient server allocation , 2012, Sustain. Comput. Informatics Syst..

[90]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[91]  Marin Litoiu,et al.  Fast scalable optimization to configure service systems having cost and quality of service constraints , 2009, ICAC '09.

[92]  Rajkumar Buyya,et al.  Modeling and simulation of scalable Cloud computing environments and the CloudSim toolkit: Challenges and opportunities , 2009, 2009 International Conference on High Performance Computing & Simulation.

[93]  Douglas C. Schmidt,et al.  Model-driven auto-scaling of green cloud computing infrastructure , 2012, Future Gener. Comput. Syst..

[94]  Archana Ganapathi,et al.  Statistics-driven workload modeling for the Cloud , 2010, 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010).

[95]  Richard A. Davis,et al.  Time Series: Theory and Methods , 2013 .

[96]  Tao Li,et al.  Self-Adaptive Cloud Capacity Planning , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[97]  Themistoklis Charalambous,et al.  Applying Kalman Filters to Dynamic Resource Provisioning of Virtualized Server Applications , 2008 .

[98]  Yanghee Choi,et al.  How to realize CDN interconnection (CDNI) over OpenFlow? , 2012, CFI.

[99]  David Chiu,et al.  Reconciling Cost and Performance Objectives for Elastic Web Caches , 2012, 2012 International Conference on Cloud and Service Computing.

[100]  Christoforos E. Kozyrakis,et al.  On the energy (in)efficiency of Hadoop clusters , 2010, OPSR.

[101]  Johnny W. Wong,et al.  Autonomic Resource Management for a Cluster that Executes Batch Jobs , 2006, 2006 IEEE International Conference on Cluster Computing.

[102]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[103]  Moustafa Ghanem,et al.  Lightweight Resource Scaling for Cloud Applications , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[104]  Sanjay Chaudhary,et al.  Policy based resource allocation in IaaS cloud , 2012, Future Gener. Comput. Syst..

[105]  A. Federgruen,et al.  Approximations for the steady-state probabilities in the M/G/c queue , 1981, Advances in Applied Probability.