Risk-Aware Limited Lookahead Control for Dynamic Resource Provisioning in Enterprise Computing Systems

Utility or on-demand computing, a provisioning model where a service provider makes computing infrastructure available to customers as needed, is becoming increasingly common in enterprise computing systems. Realizing this model requires making dynamic and sometimes risky, resource provisioning and allocation decisions in an uncertain operating environment to maximize revenue while reducing operating cost. This paper develops an optimization framework wherein the resource provisioning problem is posed as one of sequential decision making under uncertainty and solved using a limited lookahead control scheme. The proposed approach accounts for the switching costs incurred during resource provisioning and explicitly encodes risk in the optimization problem. Simulations using workload traces from the Soccer World Cup 1998 web site show that a computing system managed by our controller generates up to 20% more revenue than a system without dynamic control while incurring low control overhead.

[1]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[2]  Akhil Sahai,et al.  Automated policy-based resource construction in utility computing environments , 2004, 2004 IEEE/IFIP Network Operations and Management Symposium (IEEE Cat. No.04CH37507).

[3]  Virgílio A. F. Almeida,et al.  Capacity Planning for Web Services: Metrics, Models, and Methods , 2001 .

[4]  John Judge,et al.  A model for the marginal distribution of aggregate per second HTTP request rate , 1999, 10th IEEE Workshop on Local and Metropolitan Area Networks. Selected Papers (IEEE Cat. No.99EX512).

[5]  E. N. Elnozahy,et al.  Energy-Efficient Server Clusters , 2002, PACS.

[6]  Yixin Diao,et al.  Feedback Control of Computing Systems , 2004 .

[7]  Nagarajan Kandasamy,et al.  Online control for self-management in computing systems , 2004, Proceedings. RTAS 2004. 10th IEEE Real-Time and Embedded Technology and Applications Symposium, 2004..

[8]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[9]  Claudio Scordino,et al.  Energy-Efficient Real-Time Heterogeneous Server Clusters , 2006, 12th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS'06).

[10]  Robert Tappan Morris,et al.  Variance of aggregated Web traffic , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[11]  Asser N. Tantawi,et al.  Performance management for cluster based Web services , 2003 .

[12]  Kevin Skadron,et al.  Power-aware QoS management in Web servers , 2003, RTSS 2003. 24th IEEE Real-Time Systems Symposium, 2003.

[13]  A. Nerode,et al.  Hybrid Control Systems: An Introductory Discussion to the Special Issue , 1998, IEEE Trans. Autom. Control..

[14]  Kang G. Shin,et al.  Real-time dynamic voltage scaling for low-power embedded operating systems , 2001, SOSP.

[15]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[16]  Daniel A. Menascé,et al.  Resource Allocation for Autonomic Data Centers using Analytic Performance Models , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[17]  Paul Barford,et al.  Generating representative Web workloads for network and server performance evaluation , 1998, SIGMETRICS '98/PERFORMANCE '98.

[18]  Jan M. Maciejowski,et al.  Predictive control : with constraints , 2002 .

[19]  Martin F. Arlitt,et al.  Web server workload characterization: the search for invariants , 1996, SIGMETRICS '96.

[20]  Asser N. Tantawi,et al.  Optimal static load balancing in distributed computer systems , 1985, JACM.

[21]  Asser N. Tantawi,et al.  Performance management for cluster-based web services , 2005, IEEE Journal on Selected Areas in Communications.

[22]  Karl Brammer,et al.  Kalman-Bucy filters , 1989 .

[23]  Jian Zhang,et al.  Optimal resource allocation scheme for maximizing revenue in the future IP networks , 2004, APCC/MDMC '04. The 2004 Joint Conference of the 10th Asia-Pacific Conference on Communications and the 5th International Symposium on Multi-Dimensional Mobile Communications Proceeding.

[24]  Trevor N. Mudge,et al.  Power: A First-Class Architectural Design Constraint , 2001, Computer.

[25]  Alan Scheller-Wolf,et al.  Analysis of cycle stealing with switching cost , 2003, SIGMETRICS '03.

[26]  Joseph L. Hellerstein,et al.  A framework for applying inventory control to capacity management for utility computing , 2005, 2005 9th IFIP/IEEE International Symposium on Integrated Network Management, 2005. IM 2005..

[27]  Rami G. Melhem,et al.  Energy aware scheduling for distributed real-time systems , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[28]  T. Copeland,et al.  Financial Theory and Corporate Policy. , 1980 .

[29]  George M. Siouris,et al.  Applied Optimal Control: Optimization, Estimation, and Control , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[30]  Nagarajan Kandasamy,et al.  Self-optimization in computer systems via on-line control: application to power management , 2004 .

[31]  Anand Sivasubramaniam,et al.  Managing server energy and operational costs in hosting centers , 2005, SIGMETRICS '05.

[32]  Y. Bar-Shalom Stochastic dynamic programming: Caution and probing , 1981 .

[33]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[34]  Edward D. Lazowska,et al.  Adaptive load sharing in homogeneous distributed systems , 1986, IEEE Transactions on Software Engineering.

[35]  Prashant J. Shenoy,et al.  Dynamic Provisioning of Multi-tier Internet Applications , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[36]  Richard Murch,et al.  Autonomic Computing , 2004 .