Using automatically derived load thresholds to manage compute resources on-demand

Dynamic computing environments that support a changing application set and load are now a reality. Within these environments the decision to add or remove resources, must be made in a timely manner, while at the same time they must avoid excessive resource rebalancing. When the overhead incurred by the resource reallocation process is significant, we should be confident that additional resources are necessary before initiating an allocation process. This process is complicated by the instability of application content and frequent changes in average request processing time. Load and response time thresholds need to be dynamically and automatically adjusted if they are to remain effective. We investigate automated methods for selecting and setting thresholds, estimating load, and analyzing response time. Specifically, these methods estimate the load on a set of application servers, and set appropriate resource allocation thresholds based on the relationship between projected site response times and server load. We describe a solution to the problem of determining the maximum load over multiple servers under changing conditions, which include both changing traffic and application characteristics.

[1]  Louis P. Slothouber,et al.  A Model of Web Server Performance , 1996 .

[2]  Benny Rochwerger,et al.  Oceano-SLA based management of a computing utility , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[3]  Fan Zhang,et al.  Characterizing Normal Operation of a Web Server: Application to Workload Forecasting and Problem Determination , 1998, Int. CMG Conference.

[4]  Germán S. Goldszmidt,et al.  Scaling Internet services by dynamic allocation of connections , 1999, Integrated Network Management VI. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on Integrated Network Management. (Cat. No.99EX302).

[5]  James J. Filliben,et al.  NIST/SEMATECH e-Handbook of Statistical Methods; Chapter 1: Exploratory Data Analysis , 2003 .

[6]  Aparna Gupta,et al.  Pricing of risk for loss guaranteed intra-domain internet service contracts , 2006, Comput. Networks.

[7]  Robert Krauthgamer,et al.  Online server allocation in a server farm via benefit task systems , 2001, STOC '01.

[8]  Yixin Diao,et al.  Generic On-Line Discovery of Quantitative Models for Service Level Management , 2003, Integrated Network Management.

[9]  John C. S. Lui,et al.  Bounding of performance measures for a threshold-based queueing system with hysteresis , 1997, SIGMETRICS '97.

[10]  Malgorzata Steinder,et al.  Yemanja-a layered event correlation engine for multi-domain server farms , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[11]  Fabio Q. B. da Silva,et al.  SALMON - An Architecture to Define, Store, Monitoring and Billing ISLAs in a Server Farm , 2001, LANOMS.

[12]  Virgílio A. F. Almeida,et al.  Capacity Planning for Web Performance: Metrics, Models, and Methods , 1998 .