Control-theoretical load-balancing for cloud applications with brownout

Cloud applications are often subject to unexpected events like flash crowds and hardware failures. Without a predictable behaviour, users may abandon an unresponsive application. This problem has been partially solved on two separate fronts: first, by adding a self-adaptive feature called brownout inside cloud applications to bound response times by modulating user experience, and, second, by introducing replicas - copies of the applications having the same function-alities - for redundancy and adding a load-balancer to direct incoming traffic.

[1]  Javier García,et al.  TPC-W E-Commerce Benchmark Evaluation , 2003, Computer.

[2]  Adam Wierman,et al.  Open Versus Closed: A Cautionary Tale , 2006, NSDI.

[3]  Barbara Panicucci,et al.  Multi-timescale Distributed Capacity Allocation and Load Redirect Algorithms for Cloud System , 2011 .

[4]  Michael I. Jordan,et al.  Characterizing, modeling, and generating workload spikes for stateful services , 2010, SoCC '10.

[5]  John A. Stankovic,et al.  An Application of Bayesian Decision Theory to Decentralized Control of Job Scheduling , 1985, IEEE Transactions on Computers.

[6]  Maria Kihl,et al.  Web server performance modeling using an M/G/1/K*PS queue , 2003, 10th International Conference on Telecommunications, 2003. ICT 2003..

[7]  Karl-Erik Årzén,et al.  Control strategies for predictable brownouts in cloud computing , 2014 .

[8]  Jacques M. Bahi,et al.  Dynamic load balancing and efficient load estimators for asynchronous iterative algorithms , 2005, IEEE Transactions on Parallel and Distributed Systems.

[9]  Karl-Erik Årzén,et al.  Brownout: building more robust cloud applications , 2014, ICSE.

[10]  Supranamaya Ranjan,et al.  Wide area redirection of dynamic content by Internet data centers , 2004, IEEE INFOCOM 2004.

[11]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[12]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[13]  James R. Larus,et al.  Join-Idle-Queue: A novel load balancing algorithm for dynamically scalable web services , 2011, Perform. Evaluation.

[14]  Daniel A. Menascé,et al.  Efficient Response Time Approximations for Multiclass Fork and Join Queues in Open and Closed Queuing Networks , 2014, IEEE Transactions on Parallel and Distributed Systems.

[15]  Kevin Skadron,et al.  Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations , 2011, 2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[16]  Ladan Tahvildari,et al.  Self-adaptive software: Landscape and research challenges , 2009, TAAS.

[17]  Yixin Diao,et al.  Incorporating cost of control into the design of a load balancing controller , 2004, Proceedings. RTAS 2004. 10th IEEE Real-Time and Embedded Technology and Applications Symposium, 2004..

[18]  Jim Zelenka,et al.  Informed prefetching and caching , 1995, SOSP.

[19]  James R. Hamilton,et al.  On Designing and Deploying Internet-Scale Services , 2007, LISA.

[20]  Robert Shorten,et al.  Stratus: Load Balancing the Cloud for Carbon Emissions Control , 2013, IEEE Transactions on Cloud Computing.

[21]  Lachlan L. H. Andrew,et al.  Online algorithms for geographical load balancing , 2012, 2012 International Green Computing Conference (IGCC).

[22]  Sabato Manfredi,et al.  A Distributed Control Law for Load Balancing in Content Delivery Networks , 2013, IEEE/ACM Transactions on Networking.

[23]  Oliver W. W. Yang,et al.  Load balancing of multipath source routing in ad hoc networks , 2002, 2002 IEEE International Conference on Communications. Conference Proceedings. ICC 2002 (Cat. No.02CH37333).

[24]  Craig A. Tovey,et al.  On Honey Bees and Dynamic Server Allocation in Internet Hosting Centers , 2004, Adapt. Behav..

[25]  Philip S. Yu,et al.  Request Redirection Algorithms for Distributed Web Systems , 2003, IEEE Trans. Parallel Distributed Syst..

[26]  Yixin Diao,et al.  Comparative studies of load balancing with control and optimization techniques , 2005, Proceedings of the 2005, American Control Conference, 2005..

[27]  Leonard Kleinrock,et al.  Time-shared Systems: a theoretical treatment , 1967, JACM.

[28]  Jie Li,et al.  A performance comparison of dynamic vs. static load balancing policies in a mainframe-personal computer network model , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[29]  Michael Mitzenmacher,et al.  The Power of Two Choices in Randomized Load Balancing , 2001, IEEE Trans. Parallel Distributed Syst..

[30]  Philip S. Yu,et al.  On balancing the load in a clustered web farm , 2001, TOIT.

[31]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[32]  Anne M. Holler,et al.  Cloud Scale Resource Management: Challenges and Techniques , 2011, HotCloud.

[33]  Tarek F. Abdelzaher,et al.  Bounded-latency content distribution feasibility and evaluation , 2005, IEEE Transactions on Computers.

[34]  Michele Colajanni,et al.  Self-Adaptive Techniques for the Load Trend Evaluation of Internal System Resources , 2009, 2009 Fifth International Conference on Autonomic and Autonomous Systems.

[35]  Larry L. Peterson,et al.  Proceedings of the 5th Symposium on Operating Systems Design and Implementation the Effectiveness of Request Redirection on Cdn Robustness , 2022 .

[36]  Michele Colajanni,et al.  Autonomic Request Management Algorithms for Geographically Distributed Internet-Based Systems , 2008, 2008 Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems.

[37]  Asser N. Tantawi,et al.  Optimal static load balancing in distributed computer systems , 1985, JACM.

[38]  Shoichi Noguchi,et al.  An Analysis of the M/G/1 Queue Under Round-Robin Scheduling , 1971, Oper. Res..

[39]  Johan Tordsson,et al.  Improving cloud infrastructure utilization through overbooking , 2013, CAC.

[40]  Tsang-Long Pao,et al.  The Scalability of Heterogeneous Dispatcher-Based Web Server Load Balancing Architecture , 2006, 2006 Seventh International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'06).