Dynamic Resource Allocation and Active Predictive Models for Enterprise Applications

This work is concerned with dynamic resource allocation for multi-tiered, cluster-based web hosting environ- ments. Dynamic resource allocation is reactive, that is, when overloading occurs in one resource pool, servers are moved from another (quieter) pool to meet this demand. Switching servers comes with some overhead, so it is important to weigh up the costs of the switch against possible system gains. In this paper we combine the reactive behaviour of two well known switching policies – the Proportional Switching Policy (PSP) and the Bottleneck Aware Switching Policy (BSP) – with the proactive properties of several workload forecasting models. Seven forecasting models are used, including Last Observation, Simple Algorithm, Sample Moving Average, Exponential Moving Algorithm, Low Pass Filter and Autoregressive Moving Average. As each of the forecasting schemes has its own bias, we also develop three meta-forecasting algorithms (the Active Window Model, the Voting Model and the Selective Model) to ensure consistent and improved results. We show that request servicing capability can be improved by as much as 40% when the right combination of dynamic server switching and workload forecasting are used. As important is that we can generate consistently improved results, even when we apply this scheme to real-world, highly-variable workload traces from several sources.

[1]  Mon-Yen Luo,et al.  Realizing Fault Resilience in Web-Server Cluster , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[2]  Daniel A. Menascé,et al.  Scaling for E-Business: Technologies, Models, Performance, and Capacity Planning , 2000 .

[3]  Stephen S. Lavenberg,et al.  Mean-Value Analysis of Closed Multichain Queuing Networks , 1980, JACM.

[4]  Stephen A. Jarvis,et al.  A System for Dynamic Server Allocation in Application Server Clusters , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing with Applications.

[5]  Andrzej Zalewski,et al.  Evaluation of Dependability of Multi-tier Internet Business Applications with Queueing Networks , 2006, 2006 International Conference on Dependability of Computer Systems.

[6]  J. Little A Proof for the Queuing Formula: L = λW , 1961 .

[7]  Faraz Ahmad,et al.  Joint optimization of idle and cooling power in data centers while maintaining response time , 2010, ASPLOS 2010.

[8]  Stephen A. Jarvis,et al.  Predicting the performance of globus monitoring and discovery service (MDS-2) queries , 2003, Proceedings. First Latin American Web Congress.

[9]  Carlos Juiz,et al.  Comparison of predictive techniques in cluster-based network servers with resource allocation , 2004, The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, 2004. (MASCOTS 2004). Proceedings..

[10]  Simon S. Lam,et al.  PAM-a noniterative approximate solution method for closed multichain queueing networks , 1988, SIGMETRICS '88.

[11]  Xiaoyun Zhu,et al.  Statistical service assurances for applications in utility grid environments , 2004, Perform. Evaluation.

[12]  Prashant J. Shenoy,et al.  Dynamic Provisioning of Multi-tier Internet Applications , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[13]  Stephen A. Jarvis,et al.  Dynamic Resource Allocation in Enterprise Systems , 2008, 2008 14th IEEE International Conference on Parallel and Distributed Systems.

[14]  Virgílio A. F. Almeida,et al.  Capacity Planning for Web Services: Metrics, Models, and Methods , 2001 .

[15]  Mark S. Squillante,et al.  On maximizing service-level-agreement profits , 2001, EC.

[16]  Stephen A. Jarvis,et al.  Predictive and Dynamic Resource Allocation for Enterprise Applications , 2010, 2010 10th IEEE International Conference on Computer and Information Technology.

[17]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[18]  Marin Litoiu,et al.  A performance analysis method for autonomic computing systems , 2007, TAAS.

[19]  T. N. Vijaykumar,et al.  Joint optimization of idle and cooling power in data centers while maintaining response time , 2010, ASPLOS XV.

[20]  James C. French,et al.  Predicting Indexer Performance in a Distributed Digital Library , 1999, ECDL.

[21]  Mario Gerla,et al.  A Mean Value Analysis approach to transaction performance evaluation of multi-server systems , 2010 .

[22]  Awi Federgruen,et al.  The Greedy Procedure for Resource Allocation Problems: Necessary and Sufficient Conditions for Optimality , 1986, Oper. Res..

[23]  Asser N. Tantawi,et al.  Optimal allocation of multiple class resources in computer systems , 1988, SIGMETRICS 1988.

[24]  Daniel A. Menascé Workload Characterization , 2003, IEEE Internet Comput..

[25]  Martin F. Arlitt,et al.  Web server workload characterization: the search for invariants , 1996, SIGMETRICS '96.

[26]  Giuseppe Serazzi,et al.  Bottlenecks identification in multiclass queueing networks using convex polytopes , 2004, The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, 2004. (MASCOTS 2004). Proceedings..

[27]  Carey L. Williamson,et al.  Workload Characterization of a Large Systems Conference Web Server , 2009, 2009 Seventh Annual Communication Networks and Services Research Conference.

[28]  Raffaela Mirandola,et al.  Performance Prediction of Web Service Workflows , 2007, QoSA.