Empirical study of session-based workload and reliability for Web servers

The growing availability of Internet access has led to a significant increase in the use of the World Wide Web. If we are to design dependable Web-based systems that deal effectively with the increasing number of clients and highly variable workload, it is important to be able to describe the Web workload and errors accurately. In this paper we focus on a detailed empirical analysis of session-based workload and reliability, based on data extracted from actual Web logs of ten Web servers. First, we address the data collection process and describe the methods for extracting workload and error data from Web log files. Then, we introduce and analyze several intra-session and inter-session metrics that collectively describe Web workload in terms of user sessions. Furthermore, we analyze Web error characteristics and estimate the request-based and session-based reliability of Web servers. Finally, we identify the invariants of Web workload and reliability that hold across all data sets considered. The results presented in this paper show that session-based workload and reliability are better indicators of the users' perception of Web quality than request-based metrics, and that they provide more useful measures for tuning and maintaining Web servers.

[1]  Virgílio A. F. Almeida,et al.  Business-oriented resource management policies for e-commerce servers , 2000, Perform. Evaluation.

[2]  Azer Bestavros,et al.  Self-similarity in World Wide Web traffic: evidence and possible causes , 1996, SIGMETRICS '96.

[3]  Mark Rosenstein What is actually taking place on web sites: e-commerce lessons from web server logs , 2000, EC '00.

[4]  B. M. Hill,et al.  A Simple General Approach to Inference About the Tail of a Distribution , 1975 .

[5]  Jeff Tian,et al.  Measuring and Modeling Usage and Reliability for Statistical Web Testing , 2001, IEEE Trans. Software Eng..

[6]  Virgílio A. F. Almeida,et al.  In search of invariants for e-business workloads , 2000, EC '00.

[7]  Giuseppe Serazzi,et al.  End-to-End Performance of Web Services , 2002, Performance.

[8]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[9]  Eldred Nelson,et al.  Estimating software reliability from test data , 1978 .

[10]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[11]  Wen-Li Wang,et al.  User-oriented reliability modeling for a Web system , 2003, 14th International Symposium on Software Reliability Engineering, 2003. ISSRE 2003..

[12]  Virgílio A. F. Almeida,et al.  A methodology for workload characterization of E-commerce sites , 1999, EC '99.

[13]  Martin Arlitt,et al.  Workload Characterization of the 1998 World Cup Web Site , 1999 .

[14]  Ludmila Cherkasova,et al.  Session Based Admission Control: A Mechanism for Improving the Performance of an Overloaded Web Server , 1998 .

[15]  Vangalur S. Alagar,et al.  Reliability assessment of web applications , 2002, Proceedings 26th Annual International Computer Software and Applications.

[16]  David A. Patterson,et al.  Architecture and Dependability of Large-Scale Internet Services , 2002, IEEE Internet Comput..