论文信息 - Connection and performance model driven optimization of pageview response time

Connection and performance model driven optimization of pageview response time

Managing client perceived pageview response time for multiple classes of service is essential in today's highly competitive, e-commerce environment. We present Connection and Performance Model Driven Optimization (CP-MDO), a novel approach for providing optimal QoS as defined by a cost objective based on client perceived pageview response time and pageview drop rate. Our approach combines two vital models: 1) a latency model for connection establishment that captures the interactions between web browsers and web servers across network protocol layers and 2) a server performance model based on queueing theory that models performance across all tiers of a server complex. An algorithm capable of enforcing the optimal admission control based on the inter-arrival time between pageview admissions is given. Our approach has been implemented and evaluated in an experimental setting, demonstrating how CP-MDO achieves the minimal cost while providing minimal pageview response times under minimal drop rates across multiple classes of service.

Dinesh Kumar | Li Zhang | David P. Olshefski

[1] K. Shin,et al. Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[2] Mark S. Squillante,et al. Workload service requirements analysis: a queueing network optimization approach , 2002, Proceedings. 10th IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunications Systems.

[3] Stephen S. Lavenberg,et al. Mean-Value Analysis of Closed Multichain Queuing Networks , 1980, JACM.

[4] Kang G. Shin,et al. Resynchronization and controllability of bursty service requests , 2004, IEEE/ACM Transactions on Networking.

[5] Richard Wolski,et al. Quorum: flexible quality of service for internet services , 2005, NSDI.

[6] Carlo Ghezzi,et al. Model Driven QoS Analyses of Composed Web Services , 2008, ServiceWave.

[7] Erich M. Nahum,et al. Achieving Class-Based QoS for Transactional Workloads , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[8] Carl M. Harris,et al. Fundamentals of queueing theory , 1975 .

[9] Tao Yang,et al. Selective early request termination for busy internet services , 2006, WWW '06.

[10] Donald F. Towsley,et al. Modeling TCP throughput: a simple model and its empirical validation , 1998, SIGCOMM '98.

[11] Aniruddha S. Gokhale,et al. NetQoPE: A Model-Driven Network QoS Provisioning Engine for Distributed Real-time and Embedded Systems , 2008, 2008 IEEE Real-Time and Embedded Technology and Applications Symposium.

[12] Jason Nieh,et al. Understanding the management of client perceived response time , 2006, SIGMETRICS '06/Performance '06.

[13] Timofei Popkov,et al. Queueing model based Qos management prototype for e-commerce systems , 2000, CASCON.

[14] Peter Druschel,et al. TCP Implementation Enhancements for Improving Webserver Performance , 1999 .

[15] Lui Sha,et al. Online response time optimization of Apache web server , 2003, IWQoS'03.

[16] Maria Kihl,et al. Admission control schemes guaranteeing customer QOS in commercial web sites , 2002, Net-Con.

[17] Biplab Sikdar,et al. Analytic models and comparative study of the latency and steady-state throughput of TCP Tahoe, Reno and SACK , 2001, GLOBECOM'01. IEEE Global Telecommunications Conference (Cat. No.01CH37270).

[18] Asser N. Tantawi,et al. An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[19] L. Cherkasova,et al. Session-based admission control: a mechanism for improving performance of commercial Web sites , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[20] Joseph D. Touch,et al. The TIME-WAIT state in TCP and its effect on busy servers , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[21] Cheng-Zhong Xu,et al. eQoS: Provisioning of Client-Perceived End-to-End QoS Guarantees in Web Servers , 2006, IEEE Transactions on Computers.

[22] Yixin Diao,et al. Optimizing Quality of Service Using Fuzzy Control , 2002, DSOM.

[23] Erich M. Nahum,et al. A method for transparent admission control and request scheduling in e-commerce web sites , 2004, WWW '04.

[24] Carey L. Williamson,et al. An analysis of TCP reset behaviour on the internet , 2005, CCRV.

[25] Douglas M. Freimuth,et al. Kernel Mechanisms for Service Differentiation in Overloaded Web Servers , 2001, USENIX Annual Technical Conference, General Track.

[26] Lui Sha,et al. Queueing model based network server performance control , 2002, 23rd IEEE Real-Time Systems Symposium, 2002. RTSS 2002..

[27] Tim Brecht,et al. accept()able Strategies for Improving Web Server Performance , 2004, USENIX Annual Technical Conference, General Track.

[28] Stefan Savage,et al. Modeling TCP latency , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[29] Jeffrey S. Chase,et al. Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control , 2004, OSDI.

[30] Asser N. Tantawi,et al. CPU demand for web serving: Measurement analysis and dynamic estimation , 2008, Perform. Evaluation.

[31] Mor Harchol-Balter,et al. Web servers under overload: How scheduling can help , 2006, TOIT.

[32] Nina Bhatti,et al. Web server support for tiered services , 1999, IEEE Netw..

[33] Anand Sivasubramaniam,et al. QDSL: a queuing model for systems with differential service levels , 2008, SIGMETRICS '08.

[34] Allan Kuchinsky,et al. Integrating user-perceived quality into Web server design , 2000, Comput. Networks.

[35] Kang G. Shin,et al. Persistent dropping: an efficient control of traffic aggregates , 2003, SIGCOMM '03.