Application‐level differentiated services for Web servers

The current World Wide Web service model treats all requests equivalently, both while being processed by servers and while being transmitted over the network. For some uses, such as Web prefetching or multiple priority schemes, different levels of service are desirable. This paper presents three simple, server‐side, application‐level mechanisms (limiting process pool size, lowering process priorities, limiting transmission rate) to provide two different levels of Web service (regular and low priority). We evaluated the performance of these mechanisms under combinations of two foreground workloads (light and heavy) and two levels of available network bandwidth (10 Mb/s and 100 Mb/s). Our experiments show that even with background traffic sufficient to saturate the network, foreground performance is reduced by at most 4–17%. Thus, our user‐level mechanisms can effectively provide different service classes even in the absence of operating system and network support.
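The three mechanisms named above can be illustrated with a minimal user-level sketch. This is not the authors' implementation: the names `BackgroundPool`, `lower_priority`, and `throttled_send`, along with the chunk size and niceness increment, are hypothetical choices made only for illustration, and the code assumes a server in which low-priority requests are handled by separate workers.

```python
import os
import threading
import time
from typing import Callable


class BackgroundPool:
    """Caps how many low-priority requests are served at once (hypothetical helper)."""

    def __init__(self, max_workers: int) -> None:
        self._slots = threading.BoundedSemaphore(max_workers)

    def try_acquire(self) -> bool:
        # Non-blocking: a low-priority request is refused or deferred when the pool is full.
        return self._slots.acquire(blocking=False)

    def release(self) -> None:
        self._slots.release()


def lower_priority(increment: int = 10) -> None:
    """Raise the niceness of the calling process so foreground work is scheduled first.

    os.nice is available on Unix only; on other platforms this is a no-op.
    """
    if hasattr(os, "nice"):
        os.nice(increment)


def throttled_send(
    data: bytes,
    send: Callable[[bytes], None],
    rate_bytes_per_s: float,
    chunk_size: int = 4096,
    sleep: Callable[[float], None] = time.sleep,
) -> float:
    """Send data in chunks, pausing after each chunk to bound the average rate.

    Returns the total pause time requested, in seconds.
    """
    if rate_bytes_per_s <= 0:
        raise ValueError("rate_bytes_per_s must be positive")
    total_pause = 0.0
    for offset in range(0, len(data), chunk_size):
        chunk = data[offset:offset + chunk_size]
        send(chunk)
        pause = len(chunk) / rate_bytes_per_s  # time this chunk "costs" at the target rate
        sleep(pause)
        total_pause += pause
    return total_pause
```

In such a sketch, a low-priority worker would first call `try_acquire()`, then `lower_priority()`, and write its response through `throttled_send`, while regular requests bypass all three steps. The `send` and `sleep` parameters are injectable so the pacing logic can be exercised without a real socket or real delays.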
