Budget-based control for interactive services with adaptive execution

We study the problem of managing a class of interactive services to meet a response time target while achieving high service quality. We focus here on interactive services that support adaptive execution, such as web search engines and finance servers. With adaptive execution, when a request receives more processing time, its result improves, posing new challenges and opportunities for resource management. We propose a new budget-based control model for interactive services with adaptive execution. The budget represents the amount of resources assigned to all pending requests. The budget-based control model consists of two components: (1) a hybrid control mechanism, which combines adaptive and integral controllers and controls the budget in order to meet the response time target with small steady-state error, fast settling time and little runtime overhead, and (2) an optimization procedure, which takes advantage of adaptive execution to maximize the total response quality of all pending requests under a given budget. We implement and evaluate the budget-based control model experimentally in Microsoft Bing, a commercial web search engine. The experimental results show that it achieves more accurate control of mean response time and higher response quality than traditional static and dynamic admission control techniques that control the queue length. We also apply the model to a finance server that estimates option prices, and conduct a simulation study. The simulation results show large benefits for budget-based control. For example, under the same response time and quality requirements, the budget-based model accommodates double the system throughput compared to a traditional queue-based control model.

[1]  Chenyang Lu,et al.  Introduction to Control Theory And Its Application to Computing Systems , 2008 .

[2]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[3]  J. Hiriart-Urruty,et al.  Convex analysis and minimization algorithms , 1993 .

[4]  Stephen Blyth,et al.  An Introduction to Quantitative Finance , 2013 .

[5]  Jeffrey S. Chase,et al.  Automated control for elastic storage , 2010, ICAC '10.

[6]  Lui Sha,et al.  Queueing model based network server performance control , 2002, 23rd IEEE Real-Time Systems Symposium, 2002. RTSS 2002..

[7]  Xue Liu,et al.  Online adaptive utilization control for real-time embedded multiprocessor systems , 2008, CODES+ISSS '08.

[8]  Robert R. Reitano Introduction to Quantitative Finance: A Math Tool Kit , 2010 .

[9]  Henry Hoffmann,et al.  Dynamic knobs for responsive power-aware computing , 2011, ASPLOS XVI.

[10]  I. Adan,et al.  QUEUEING THEORY , 1978 .

[11]  Wojciech Szpankowski Bounds for Queue Lengths in a Contention Packet Broadcast System , 1986, IEEE Trans. Commun..

[12]  Armando Fox,et al.  Adapting to network and client variation using infrastructural process proxies: lessons and perspectives , 1999 .

[13]  Sang Hyuk Son,et al.  A feedback control approach for guaranteeing relative delays in Web servers , 2001, Proceedings Seventh IEEE Real-Time Technology and Applications Symposium.

[14]  Woongki Baek,et al.  Green: a framework for supporting energy-conscious programming using controlled approximation , 2010, PLDI '10.

[15]  Chen Ding,et al.  Quantifying the cost of context switch , 2007, ExpCS '07.

[16]  Prasant Mohapatra,et al.  Session-based overload control in QoS-aware Web servers , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[17]  Ciro D'Apice,et al.  Queueing Theory , 2003, Operations Research.

[18]  Mor Harchol-Balter,et al.  Web servers under overload: How scheduling can help , 2006, TOIT.

[19]  Eric A. Brewer,et al.  Adapting to network and client variation using infrastructural proxies: lessons and perspectives , 1998, IEEE Wirel. Commun..

[20]  Daniel Mossé,et al.  Stochastic approximation control of power and tardiness in a three-tier web-hosting cluster , 2010, ICAC '10.

[21]  Rong Zheng,et al.  Timing Performance Control in Web Server Systems Utilizing Server Internal State Information , 2005, Joint International Conference on Autonomic and Autonomous Systems and International Conference on Networking and Services - (icas-isns'05).

[22]  Zhen Liu,et al.  Performance Modeling and Engineering , 2010 .

[23]  C. Muthusamy,et al.  Control Systems application in Java based Enterprise and Cloud Environments – A Survey , 2011 .

[24]  Tarek F. Abdelzaher,et al.  Web Content Adaptation to Improve Server Overload Behavior , 1999, Comput. Networks.

[25]  Yixin Diao,et al.  Feedback Control of Computing Systems , 2004 .

[26]  Lui Sha,et al.  Queueing-Model-Based Adaptive Control of Multi-Tiered Web Applications , 2008, IEEE Transactions on Network and Service Management.

[27]  Anastasios Gounaris,et al.  Honoring SLAs on cloud computing services: A control perspective , 2009, 2009 European Control Conference (ECC).

[28]  Wei-Ying Ma,et al.  Detecting web page structure for adaptive viewing on small form factor devices , 2003, WWW '03.