Processing Rate Allocation for Proportional Slowdown Differentiation on Internet Servers

Summary form only given. A proportional differentiation model states that quality of service of different classes of Internet traffic should be kept proportional to their prespecified differentiation parameters, independent of the class loads. The model has been applied in the proportional queueing delay differentiation (FDD) in both network core and network edges. However, in the server side, an important and interesting performance metric is slowdown, the ratio of a request's queueing delay to its service time. Slowdown is important because it is desirable that a request's delay be proportional to its processing requirement. We investigate the problem of processing rate allocation for proportional slowdown differentiation (PSD) on Internet servers. Existing algorithms for FDD provisioning in the network side are not applicable to PSD provisioning in the server side because slowdown is not only dependent on a job's queueing delay but also on its service time, which varies significantly depending on the requested services. We first derive a closed form expression of the expected slowdown in an M/Gp/1 FCFS queue, which is an M/G/l FCFS queue with a typical heavy-tailed service time distribution (bounded Pareto distribution). PSD provisioning is realized by deploying a task server for handling each request class in a FCFS way. We then develop a strategy of processing rate allocation for the task servers for PSD provisioning. Simulation results have showed that the proposed rate allocation strategy can provide predictable and controllable PSD services on the servers.

[1]  Mor Harchol-Balter,et al.  On Choosing a Task Assignment Policy for a Distributed Server System , 1998, J. Parallel Distributed Comput..

[2]  Erich M. Nahum,et al.  ksniffer: Determining the Remote Client Perceived Response Time from Live Packet Streams , 2004, OSDI.

[3]  Mor Harchol-Balter,et al.  Connection Scheduling in Web Servers , 1999, USENIX Symposium on Internet Technologies and Systems.

[4]  Gene F. Franklin,et al.  Feedback Control of Dynamic Systems , 1986 .

[5]  Sang Hyuk Son,et al.  A feedback control approach for guaranteeing relative delays in Web servers , 2001, Proceedings Seventh IEEE Real-Time Technology and Applications Symposium.

[6]  Parameswaran Ramanathan,et al.  Proportional differentiated services: delay differentiation and packet scheduling , 2002, TNET.

[7]  Abhay Parekh,et al.  A generalized processor sharing approach to flow control in integrated services networks: the single-node case , 1993, TNET.

[8]  Prasant Mohapatra,et al.  Performance Evaluation of Service Differentiating Internet Servers , 2002, IEEE Trans. Computers.

[9]  Parameswaran Ramanathan,et al.  A case for relative differentiated services and the proportional differentiation model , 1999, IEEE Netw..

[10]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[11]  Tarek F. Abdelzaher,et al.  Differentiated caching services; a control-theoretical approach , 2001, Proceedings 21st International Conference on Distributed Computing Systems.

[12]  Vaduvur Bharghavan,et al.  Delay differentiation and adaptation in core stateless networks , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[13]  Robert B. Cooper,et al.  Queueing systems, volume II: computer applications : By Leonard Kleinrock. Wiley-Interscience, New York, 1976, xx + 549 pp. , 1977 .

[14]  John S. Heidemann,et al.  Application‐level differentiated services for Web servers , 1999, World Wide Web.

[15]  William E. Weihl,et al.  Lottery scheduling: flexible proportional-share resource management , 1994, OSDI '94.

[16]  Joseph L. Hellerstein,et al.  Using Control Theory to Achieve Service Level Objectives In Performance Management , 2002, Real-Time Systems.

[17]  Wei Sun,et al.  ADAPTLOAD: effective balancing in clustered web servers under transient load conditions , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[18]  Mor Harchol-Balter Task assignment with unknown duration , 2002, JACM.

[19]  Cheng-Zhong Xu,et al.  VirtualLength: a new packet scheduling algorithm for proportional delay differentiation , 2003, Proceedings. 12th International Conference on Computer Communications and Networks (IEEE Cat. No.03EX712).

[20]  C. Dovrolis,et al.  Proportional differentiated services, part II: loss rate differentiation and packet dropping , 2000, 2000 Eighth International Workshop on Quality of Service. IWQoS 2000 (Cat. No.00EX400).

[21]  Cheng-Zhong Xu,et al.  A Self-tuning Fuzzy Control Approach for End-to-End QoS Guarantees in Web Servers , 2005, IWQoS.

[22]  Paul Barford,et al.  Generating representative Web workloads for network and server performance evaluation , 1998, SIGMETRICS '98/PERFORMANCE '98.

[23]  Allen B. Downey A parallel workload model and its implications for processor allocation , 2004, Cluster Computing.

[24]  Mor Harchol-Balter,et al.  Task assignment in a distributed system (extended abstract): improving performance by unbalancing load , 1997, SIGMETRICS '98/PERFORMANCE '98.

[25]  Walter Willinger,et al.  On the self-similar nature of Ethernet traffic , 1993, SIGCOMM '93.

[26]  David K. Y. Yau,et al.  Adaptive proportional delay differentiated services: characterization and performance evaluation , 2001, TNET.

[27]  Van Jacobson,et al.  A Two-bit Differentiated Services Architecture for the Internet , 1999, RFC.

[28]  V. Paxson,et al.  WHERE MATHEMATICS MEETS THE INTERNET , 1998 .

[29]  David L. Black,et al.  An Architecture for Differentiated Service , 1998 .

[30]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[31]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[32]  Nina Bhatti,et al.  Web server support for tiered services , 1999, IEEE Netw..

[33]  Kang-Won Lee,et al.  Scalable service differentiation in a shared storage cache , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[34]  Ashish Goel,et al.  Multi-processor scheduling to minimize flow time with ε resource augmentation , 2004, STOC '04.

[35]  Michael A. Bender,et al.  Flow and stretch metrics for scheduling continuous job streams , 1998, SODA '98.

[36]  Mor Harchol-Balter,et al.  Size-based scheduling to improve web performance , 2003, TOCS.

[37]  Lui Sha,et al.  Queueing model based network server performance control , 2002, 23rd IEEE Real-Time Systems Symposium, 2002. RTSS 2002..

[38]  Amin Vahdat,et al.  Application-level differentiated multimedia Web services using quality aware transcoding , 2000, IEEE Journal on Selected Areas in Communications.

[39]  Lui Sha,et al.  Feedback control with queueing-theoretic prediction for relative delay guarantees in web servers , 2003, The 9th IEEE Real-Time and Embedded Technology and Applications Symposium, 2003. Proceedings..

[40]  Roch Guérin,et al.  A simple FIFO-based scheme for differentiated loss guarantees , 2004, Twelfth IEEE International Workshop on Quality of Service, 2004. IWQOS 2004..

[41]  Tao Yang,et al.  Demand-driven service differentiation in cluster-based network servers , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[42]  Nicolas Christin,et al.  JoBS: Joint Buffer Management and Scheduling for Differentiated Services , 2001, IWQoS.

[43]  Jerome A. Rolia,et al.  Characterizing the scalability of a large web-based shopping system , 2001, ACM Trans. Internet Techn..

[44]  Zhen Liu,et al.  Traffic model and performance evaluation of Web servers , 2001, Perform. Evaluation.

[45]  David K. Y. Yau,et al.  Admission control and dynamic adaptation for a proportional-delay diffserv-enabled web server , 2002, SIGMETRICS '02.

[46]  Ludmila Cherkasova Scheduling Strategy to improve Response Time for Web Applications , 1998, HPCN Europe.

[47]  Larry Rudolph,et al.  Metrics and Benchmarking for Parallel Job Scheduling , 1998, JSSPP.

[48]  Cheng-Zhong Xu,et al.  A robust packet scheduling algorithm for proportional delay differentiation services , 2004, IEEE Global Telecommunications Conference, 2004. GLOBECOM '04..

[49]  K. Shin,et al.  Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[50]  Sally Floyd,et al.  Wide area traffic: the failure of Poisson modeling , 1995, TNET.

[51]  Cheng-Zhong Xu,et al.  Modeling and analysis of 2D service differentiation on e-commerce servers , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..