Multiple-Resource Request Scheduling for Differentiated QoS at Website Gateway

Differentiated quality of service lets a website operator provide different service levels to different classes of clients. Traditional HTTP request scheduling schemes can achieve this, but they schedule requests to manage only one server resource, such as CPU or disk I/O. In practice, processing a request consumes multiple server resources. This paper presents a multiple-resource request scheduling algorithm, called mQoS, that differentiates the utilization of each server resource among client classes. The mQoS scheduler consists of several sub-schedulers and a main scheduler. Each sub-scheduler manages one server resource and differentiates its utilization among the classes. The main scheduler checks the availability of every server resource and triggers the appropriate sub-scheduler so that utilization stays balanced across resources. The mQoS gateway is implemented on Squid and Linux. The evaluation compares mQoS scheduling with no scheduling (nQoS) and single-resource request scheduling (sQoS). mQoS scheduling achieves accurate differentiation on every server resource. In addition, total server throughput under mQoS scheduling is 21% higher than under sQoS scheduling, and the average user-perceived latency is shorter than under either of the other two schemes.
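
The two-level structure described above can be illustrated with a minimal sketch. It is not the mQoS implementation: the abstract does not specify the sub-scheduler policy or how availability is measured, so the code assumes a weighted-share policy (pick the class furthest below its weighted share) and approximates availability by the cumulative load charged to each resource. All names here (`SubScheduler`, `MainScheduler`, `submit`, `dispatch`, the class weights, and the per-request cost dictionaries) are hypothetical.

```python
from collections import deque


class SubScheduler:
    """Manages one resource (assumed policy: weighted share per class)."""

    def __init__(self, weights):
        self.weights = weights                    # class name -> share weight
        self.used = {c: 0.0 for c in weights}     # resource consumed per class

    def pick(self, queues):
        # Choose the backlogged class with the lowest usage relative to its weight.
        candidates = [c for c in self.weights if queues[c]]
        if not candidates:
            return None
        return min(candidates, key=lambda c: self.used[c] / self.weights[c])

    def charge(self, cls, amount):
        self.used[cls] += amount


class MainScheduler:
    """Checks every resource and triggers the sub-scheduler of the least-loaded one."""

    def __init__(self, weights, resources):
        self.queues = {c: deque() for c in weights}
        self.subs = {r: SubScheduler(weights) for r in resources}
        self.load = {r: 0.0 for r in resources}   # assumed proxy for availability

    def submit(self, cls, cost):
        # cost maps resource name -> estimated consumption of this request.
        self.queues[cls].append(cost)

    def dispatch(self):
        # The resource with the lowest cumulative load is treated as most available.
        resource = min(self.load, key=self.load.get)
        cls = self.subs[resource].pick(self.queues)
        if cls is None:
            return None
        cost = self.queues[cls].popleft()
        # A request consumes several resources, so every sub-scheduler is charged.
        for r, amount in cost.items():
            self.subs[r].charge(cls, amount)
            self.load[r] += amount
        return cls, resource
```

With two classes weighted 2:1 and identical per-request costs, repeated calls to `dispatch()` serve the higher-weighted class about twice as often. A real gateway would also need to release load when requests complete and to estimate per-request costs, both of which this sketch leaves out.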
