Cost-Efficient Virtual Machine Provisioning for Multi-tier Web Applications and Video Transcoding

Infrastructure as a Service (IaaS) clouds provide virtual machines (VMs) under the pay-per-use business model. The dynamic on-demand provisioning of VMs allows IaaS users to ensure scalability of their web applications and web-based services from really low to really high loads. However, VM provisioning must be done carefully because over-provisioning results in an increased operational cost, while under-provisioning leads to a sub par service. In this research work, our main focus is on cost-efficient VM provisioning for multi-tier web applications and video transcoding. Moreover, to prevent provisioned VMs from becoming overloaded, we augment VM provisioning with an admission control mechanism. Similarly, to ensure efficient use of provisioned VMs, under-utilized VMs are consolidated periodically. Since cost-efficient VM provisioning is an optimization problem, we apply metaheuristic approaches to find a near-optimal solution.

[1]  A. Robertsson,et al.  Admission control for Web server systems - design and experimental evaluation , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[2]  Chen-Hsiu Huang Video Transcoding Architectures and Techniques : An Overview , 2003 .

[3]  Jyh-Shing Roger Jang,et al.  Admission control schemes for proportional differentiated services enabled internet servers using machine learning techniques , 2006, Expert Syst. Appl..

[4]  Ítalo S. Cunha,et al.  Joint admission control and resource allocation in virtualized servers , 2010, J. Parallel Distributed Comput..

[5]  Chao Mei,et al.  CloudStream: Delivering high-quality streaming videos through a cloud-based SVC proxy , 2011, 2011 Proceedings IEEE INFOCOM.

[6]  Waheed Iqbal,et al.  Adaptive resource provisioning for read intensive multi-tier applications in the cloud , 2011, Future Gener. Comput. Syst..

[7]  Jun Han,et al.  A multi-model framework to implement self-managing control systems for QoS management , 2011, SEAMS '11.

[8]  Christian Blum,et al.  Hybrid metaheuristics in combinatorial optimization: A survey , 2011, Appl. Soft Comput..

[9]  Marin Litoiu,et al.  Resource provisioning for cloud computing , 2009, CASCON.

[10]  Gang Liu,et al.  Cloud transcoder: bridging the format and resolution gap between internet videos and mobile devices , 2012, NOSSDAV '12.

[11]  Jane Hillston,et al.  Cost-based admission control for Internet Commerce QoS enhancement , 2009, Electron. Commer. Res. Appl..

[12]  Werner Vogels,et al.  Beyond Server Consolidation , 2008, ACM Queue.

[13]  Pasi Tyrväinen,et al.  Hybrid Cloud Architecture for Short Message Services , 2012, CLOSER.

[14]  Sangyoon Oh,et al.  Sercon: Server Consolidation Algorithm using Live Migration of Virtual Machines for Green Computing , 2011 .

[15]  Dejun Mu,et al.  Feedback Control-Based QoS Guarantees in Web Application Servers , 2008, 2008 10th IEEE International Conference on High Performance Computing and Communications.

[16]  J. Lilius,et al.  Stream-Based Admission Control and Scheduling for Video Transcoding in Cloud Computing , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[17]  Ivan Porres,et al.  Feedback Control Algorithms to Deploy and Scale Multiple Web Applications per Virtual Machine , 2012, 2012 38th Euromicro Conference on Software Engineering and Advanced Applications.

[18]  Ivan Porres,et al.  A Session-Based Adaptive Admission Control Approach for Virtualized Application Servers , 2012, 2012 IEEE Fifth International Conference on Utility and Cloud Computing.

[19]  Henry H. Liu,et al.  Software Performance and Scalability - A Quantitative Approach , 2009, Wiley series on quantitative software engineering.

[20]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[21]  Timo Aho,et al.  Designing IDE as a Service , 2013 .

[22]  Moustafa Ghanem,et al.  Lightweight Resource Scaling for Cloud Applications , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[23]  Xiaobo Zhou,et al.  Coordinated session-based admission control with statistical learning for multi-tier internet applications , 2011, J. Netw. Comput. Appl..

[24]  Ivan Porres,et al.  Towards Automatic Performance and Scalability Testing of Rich Internet Applications in the Cloud , 2011, 2011 37th EUROMICRO Conference on Software Engineering and Advanced Applications.

[25]  Christine Morin,et al.  A case for fully decentralized dynamic VM consolidation in clouds , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[26]  Thiemo Voigt,et al.  Adaptive resource-based Web server admission control , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[27]  Prasant Mohapatra,et al.  ACES: An efficient admission control scheme for QoS-aware web servers , 2003, Comput. Commun..

[28]  Ajay Mohindra,et al.  Dynamic Scaling of Web Applications in a Virtualized Cloud Computing Environment , 2009, 2009 IEEE International Conference on e-Business Engineering.

[29]  Sébastien Lafond,et al.  Prediction-Based Dynamic Resource Allocation for Video Transcoding in Cloud Computing , 2013, 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[30]  Isis Truck,et al.  From Data Center Resource Allocation to Control Theory and Back , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[31]  Gerhard Meixner,et al.  TwoSpot: A Cloud Platform for Scaling Out Web Applications Dynamically , 2010, ServiceWave.

[32]  Mark Harman,et al.  Cloud engineering is Search Based Software Engineering too , 2013, J. Syst. Softw..

[33]  Sara Casolari,et al.  Load prediction models in web-based systems , 2006, valuetools '06.

[34]  Ivan Porres,et al.  CRAMP: Cost-efficient Resource Allocation for Multiple web applications with Proactive scaling , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[35]  Carlo Ghezzi,et al.  Service Provisioning on the Cloud: Distributed Algorithms for Joint Capacity Allocation and Admission Control , 2010, ServiceWave.