Connection and performance model driven optimization of pageview response time

Managing client perceived pageview response time for multiple classes of service is essential in today's highly competitive, e-commerce environment. We present Connection and Performance Model Driven Optimization (CP-MDO), a novel approach for providing optimal QoS as defined by a cost objective based on client perceived pageview response time and pageview drop rate. Our approach combines two vital models: 1) a latency model for connection establishment that captures the interactions between web browsers and web servers across network protocol layers and 2) a server performance model based on queueing theory that models performance across all tiers of a server complex. An algorithm capable of enforcing the optimal admission control based on the inter-arrival time between pageview admissions is given. Our approach has been implemented and evaluated in an experimental setting, demonstrating how CP-MDO achieves the minimal cost while providing minimal pageview response times under minimal drop rates across multiple classes of service.

[1]  K. Shin,et al.  Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[2]  Mark S. Squillante,et al.  Workload service requirements analysis: a queueing network optimization approach , 2002, Proceedings. 10th IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunications Systems.

[3]  Stephen S. Lavenberg,et al.  Mean-Value Analysis of Closed Multichain Queuing Networks , 1980, JACM.

[4]  Kang G. Shin,et al.  Resynchronization and controllability of bursty service requests , 2004, IEEE/ACM Transactions on Networking.

[5]  Richard Wolski,et al.  Quorum: flexible quality of service for internet services , 2005, NSDI.

[6]  Carlo Ghezzi,et al.  Model Driven QoS Analyses of Composed Web Services , 2008, ServiceWave.

[7]  Erich M. Nahum,et al.  Achieving Class-Based QoS for Transactional Workloads , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[8]  Carl M. Harris,et al.  Fundamentals of queueing theory , 1975 .

[9]  Tao Yang,et al.  Selective early request termination for busy internet services , 2006, WWW '06.

[10]  Donald F. Towsley,et al.  Modeling TCP throughput: a simple model and its empirical validation , 1998, SIGCOMM '98.

[11]  Aniruddha S. Gokhale,et al.  NetQoPE: A Model-Driven Network QoS Provisioning Engine for Distributed Real-time and Embedded Systems , 2008, 2008 IEEE Real-Time and Embedded Technology and Applications Symposium.

[12]  Jason Nieh,et al.  Understanding the management of client perceived response time , 2006, SIGMETRICS '06/Performance '06.

[13]  Timofei Popkov,et al.  Queueing model based Qos management prototype for e-commerce systems , 2000, CASCON.

[14]  Peter Druschel,et al.  TCP Implementation Enhancements for Improving Webserver Performance , 1999 .

[15]  Lui Sha,et al.  Online response time optimization of Apache web server , 2003, IWQoS'03.

[16]  Maria Kihl,et al.  Admission control schemes guaranteeing customer QOS in commercial web sites , 2002, Net-Con.

[17]  Biplab Sikdar,et al.  Analytic models and comparative study of the latency and steady-state throughput of TCP Tahoe, Reno and SACK , 2001, GLOBECOM'01. IEEE Global Telecommunications Conference (Cat. No.01CH37270).

[18]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[19]  L. Cherkasova,et al.  Session-based admission control: a mechanism for improving performance of commercial Web sites , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[20]  Joseph D. Touch,et al.  The TIME-WAIT state in TCP and its effect on busy servers , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[21]  Cheng-Zhong Xu,et al.  eQoS: Provisioning of Client-Perceived End-to-End QoS Guarantees in Web Servers , 2006, IEEE Transactions on Computers.

[22]  Yixin Diao,et al.  Optimizing Quality of Service Using Fuzzy Control , 2002, DSOM.

[23]  Erich M. Nahum,et al.  A method for transparent admission control and request scheduling in e-commerce web sites , 2004, WWW '04.

[24]  Carey L. Williamson,et al.  An analysis of TCP reset behaviour on the internet , 2005, CCRV.

[25]  Douglas M. Freimuth,et al.  Kernel Mechanisms for Service Differentiation in Overloaded Web Servers , 2001, USENIX Annual Technical Conference, General Track.

[26]  Lui Sha,et al.  Queueing model based network server performance control , 2002, 23rd IEEE Real-Time Systems Symposium, 2002. RTSS 2002..

[27]  Tim Brecht,et al.  accept()able Strategies for Improving Web Server Performance , 2004, USENIX Annual Technical Conference, General Track.

[28]  Stefan Savage,et al.  Modeling TCP latency , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[29]  Jeffrey S. Chase,et al.  Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control , 2004, OSDI.

[30]  Asser N. Tantawi,et al.  CPU demand for web serving: Measurement analysis and dynamic estimation , 2008, Perform. Evaluation.

[31]  Mor Harchol-Balter,et al.  Web servers under overload: How scheduling can help , 2006, TOIT.

[32]  Nina Bhatti,et al.  Web server support for tiered services , 1999, IEEE Netw..

[33]  Anand Sivasubramaniam,et al.  QDSL: a queuing model for systems with differential service levels , 2008, SIGMETRICS '08.

[34]  Allan Kuchinsky,et al.  Integrating user-perceived quality into Web server design , 2000, Comput. Networks.

[35]  Kang G. Shin,et al.  Persistent dropping: an efficient control of traffic aggregates , 2003, SIGCOMM '03.