Resource Allocation for Session-Based Two-Dimensional Service Differentiation on e-Commerce Servers

A scalable e-commerce server should be able to provide different levels of quality of service (QoS) to different types of requests based on clients' navigation patterns and the server capacity. E-commerce workloads are composed of sessions. In this paper, we propose a session-based two-dimensional (2D) service differentiation model for online transactions: intersession and intrasession. The intersession model aims to provide different levels of QoS to sessions from different customer classes, and the intrasession model aims to provide different levels of QoS to requests in different states of a session. A primary performance metric of online transactions is slowdown. It measures the waiting time of a request relative to its service time. We present a processing rate allocation scheme for 2D proportional slowdown differentiation. We then introduce service slowdown as a systemwide QoS metric of an e-commerce server. It is defined as the weighted sum of request slowdown in different sessions and in different session states. We formulate the problem of 2D service differentiation as an optimization of processing rate allocation with the objective of minimizing the service slowdown of the server. We prove that the derived rate allocation scheme based on the optimization guarantees client requests' slowdown to be square-root proportional to their prespecified differentiation weights in both intersession and intrasession dimensions. We evaluate this square-root proportional rate allocation scheme and a proportional rate allocation scheme via extensive simulations. Results validate that both schemes can achieve predictable, controllable, and fair 2D service differentiation on e-commerce servers. The square-root proportional rate allocation scheme provides 2D service differentiation at a minimum cost of service slowdown

[1]  William E. Weihl,et al.  Lottery scheduling: flexible proportional-share resource management , 1994, OSDI '94.

[2]  David K. Y. Yau,et al.  A proportional-delay DiffServ-enabled Web server: admission control and dynamic adaptation , 2004, IEEE Transactions on Parallel and Distributed Systems.

[3]  Wei Sun,et al.  ADAPTLOAD: effective balancing in clustered web servers under transient load conditions , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[4]  Mor Harchol-Balter Task assignment with unknown duration , 2002, JACM.

[5]  Virgílio A. F. Almeida,et al.  A methodology for workload characterization of E-commerce sites , 1999, EC '99.

[6]  Prasant Mohapatra,et al.  Session-based overload control in QoS-aware Web servers , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[7]  Chenyang Lu,et al.  Feedback utilization control in distributed real-time systems with end-to-end tasks , 2005, IEEE Transactions on Parallel and Distributed Systems.

[8]  Vijay Karamcheti,et al.  Enforcing resource sharing agreements among distributed server clusters , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[9]  Virgílio A. F. Almeida,et al.  Resource management policies for e-commerce servers , 2000, PERV.

[10]  David K. Y. Yau,et al.  Adaptive proportional delay differentiated services: characterization and performance evaluation , 2001, TNET.

[11]  K. Shin,et al.  Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[12]  Prasant Mohapatra,et al.  Performance Evaluation of Service Differentiating Internet Servers , 2002, IEEE Trans. Computers.

[13]  Peter Druschel,et al.  Resource containers: a new facility for resource management in server systems , 1999, OSDI '99.

[14]  Wayne D. Smith,et al.  TPC-W: Benchmarking An Ecommerce Solution , 2001 .

[15]  Judy Kay,et al.  A fair share scheduler , 1988, CACM.

[16]  Calton Pu,et al.  A feedback-driven proportion allocator for real-rate scheduling , 1999, OSDI '99.

[17]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[18]  Jerome A. Rolia,et al.  Characterizing the scalability of a large web-based shopping system , 2001, ACM Trans. Internet Techn..

[19]  Vaduvur Bharghavan,et al.  Delay differentiation and adaptation in core stateless networks , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[20]  Cheng-Zhong Xu,et al.  A robust packet scheduling algorithm for proportional delay differentiation services , 2004, IEEE Global Telecommunications Conference, 2004. GLOBECOM '04..

[21]  Sang Hyuk Son,et al.  Differentiated Real-Time Data Services for E-Commerce Applications , 2003, Electron. Commer. Res..

[22]  Lui Sha,et al.  Feedback control with queueing-theoretic prediction for relative delay guarantees in web servers , 2003, The 9th IEEE Real-Time and Embedded Technology and Applications Symposium, 2003. Proceedings..

[23]  Sugih Jamin,et al.  A measurement-based admission-controlled Web server , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[24]  Daniel P. Siewiorek,et al.  A resource allocation model for QoS management , 1997, Proceedings Real-Time Systems Symposium.

[25]  Cheng-Zhong Xu,et al.  Modeling and analysis of 2D service differentiation on e-commerce servers , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[26]  Marcos José Santana,et al.  Using adaptive priority scheduling for service differentiation QoS-aware Web servers , 2004, IEEE International Conference on Performance, Computing, and Communications, 2004.

[27]  Nina Bhatti,et al.  Web server support for tiered services , 1999, IEEE Netw..

[28]  Gautam G. Rao,et al.  Application Level Differentiated Services for Web Servers , 2000 .

[29]  Riccardo Bettati,et al.  Differentiated services with statistical real-time guarantees in static-priority scheduling networks , 2001, Proceedings 22nd IEEE Real-Time Systems Symposium (RTSS 2001) (Cat. No.01PR1420).

[30]  Van Jacobson,et al.  A Two-bit Differentiated Services Architecture for the Internet , 1999, RFC.

[31]  Cheng-Zhong Xu,et al.  A Self-tuning Fuzzy Control Approach for End-to-End QoS Guarantees in Web Servers , 2005, IWQoS.

[32]  Mor Harchol-Balter,et al.  Size-based scheduling to improve web performance , 2003, TOCS.

[33]  Willy Zwaenepoel,et al.  Cluster reserves: a mechanism for resource management in cluster-based network servers , 2000, SIGMETRICS '00.

[34]  Michael A. Bender,et al.  Flow and stretch metrics for scheduling continuous job streams , 1998, SODA '98.

[35]  Erich M. Nahum,et al.  A method for transparent admission control and request scheduling in e-commerce web sites , 2004, WWW '04.

[36]  Amin Vahdat,et al.  Application-level differentiated multimedia Web services using quality aware transcoding , 2000, IEEE Journal on Selected Areas in Communications.

[37]  Tao Yang,et al.  Demand-driven service differentiation in cluster-based network servers , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[38]  Cheng-Zhong Xu,et al.  Harmonic proportional bandwidth allocation and scheduling for service differentiation on streaming servers , 2004, IEEE Transactions on Parallel and Distributed Systems.

[39]  Parameswaran Ramanathan,et al.  Proportional differentiated services: delay differentiation and packet scheduling , 1999, SIGCOMM '99.