Another viewpoint on “evaluating web software reliability based on workload and failure data extracted from server logs”

An approach of determining a website’s reliability is evaluated in this paper. This technique extracts workload measures and error codes from the server’s data logs. This information is then used to calculate the reliability for a particular website. This study follows on from a previous study, and hence, can be regarded as a “partial replication” (technically, as both studies are case studies not formal experiments, this description is inaccurate. Unfortunately, no corresponding definition exists for case studies, and hence the term is used to convey a general sense of purpose) of the original study. Although the method proposed by the original study is feasible, the effectiveness of just using a specific error type and a specific workload to estimate the reliability of websites is questionable. In this study, different error types and their usefulness for reliability analysis are examined and discussed. After a thorough investigation, we believe that reliability analysis for websites must be based on more specific error definitions as they can provide a superior reliability estimate for today’s highly dynamic websites.

[1]  James E. Pitkow,et al.  Characterizing Browsing Behaviors on the World-Wide Web , 1995 .

[2]  Katerina Goseva-Popstojanova,et al.  Empirical study of session-based workload and reliability for Web servers , 2004, 15th International Symposium on Software Reliability Engineering.

[3]  Mark Rosenstein What is actually taking place on web sites: e-commerce lessons from web server logs , 2000, EC '00.

[4]  Li Ma,et al.  Analyzing Errors and Referral Pairs to Characterize Common Problems and Improve Web Reliability , 2003, ICWE.

[5]  Virgílio A. F. Almeida,et al.  In search of invariants for e-business workloads , 2000, EC '00.

[6]  Myra Spiliopoulou,et al.  Web Usage Analysis and User Profiling: International WEBKDD'99 Workshop San Diego, CA, USA, August 15, 1999 Revised Papers , 2000 .

[7]  Angelos D. Keromytis,et al.  SQLrand: Preventing SQL Injection Attacks , 2004, ACNS.

[8]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[9]  M. HamidR.Jamali,et al.  Website usage metrics: A re-assessment of session data , 2008, Inf. Process. Manag..

[10]  Wen-Li Wang,et al.  User-oriented reliability modeling for a Web system , 2003, 14th International Symposium on Software Reliability Engineering, 2003. ISSRE 2003..

[11]  Crispan Cowan,et al.  StackGuard: Automatic Adaptive Detection and Prevention of Buffer-Overflow Attacks , 1998, USENIX Security Symposium.

[12]  Steven Cook A Web Developer's Guide to Cross-Site Scripting , 2003 .

[13]  James E. Pitkow,et al.  Characterizing Browsing Strategies in the World-Wide Web , 1995, Comput. Networks ISDN Syst..

[14]  Giuseppe Serazzi,et al.  End-to-End Performance of Web Services , 2002, Performance.

[15]  Zhao Li,et al.  Evaluating Web software reliability based on workload and failure data extracted from server logs , 2004, IEEE Transactions on Software Engineering.

[16]  Azer Bestavros,et al.  Self-similarity in World Wide Web traffic: evidence and possible causes , 1996, SIGMETRICS '96.

[17]  J. Williams Avoiding the CNN moment , 2001 .

[18]  Jeff Tian,et al.  Measuring and Modeling Usage and Reliability for Statistical Web Testing , 2001, IEEE Trans. Software Eng..

[19]  Fiona Fui-Hoon Nah A study of web users' waiting time , 2000, IRMA Conference.

[20]  Kishor S. Trivedi Probability and Statistics with Reliability, Queuing, and Computer Science Applications , 1984 .

[21]  Gregory M. Rose,et al.  A refined view of download time impacts on e‐consumer attitudes and patronage intentions toward e‐retailers , 2001 .

[22]  David Evans,et al.  Improving Security Using Extensible Lightweight Static Analysis , 2002, IEEE Softw..

[23]  Shih-Kun Huang,et al.  Web application security assessment by fault injection and behavior monitoring , 2003, WWW '03.

[24]  Eldred Nelson,et al.  Estimating software reliability from test data , 1978 .

[25]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[26]  James Miller,et al.  Further investigations into evaluating Web site reliability , 2005, 2005 International Symposium on Empirical Software Engineering, 2005..

[27]  Dale Schuurmans,et al.  Dynamic Web log session identification with statistical language models , 2004, J. Assoc. Inf. Sci. Technol..

[28]  Daqing He,et al.  Detecting session boundaries from Web user logs , 2000 .

[29]  Dennis F. Galletta,et al.  Web Site Delays: How Tolerant are Users? , 2004, J. Assoc. Inf. Syst..

[30]  I. Jolliffe Principal Component Analysis , 2002 .

[31]  Katerina Goseva-Popstojanova,et al.  Empirical Characterization of Session–Based Workload and Reliability for Web Servers , 2006, Empirical Software Engineering.

[32]  David A. Wagner,et al.  A First Step Towards Automated Detection of Buffer Overrun Vulnerabilities , 2000, NDSS.

[33]  Ludmila Cherkasova,et al.  Session Based Admission Control: A Mechanism for Improving the Performance of an Overloaded Web Server , 1998 .

[34]  Vangalur S. Alagar,et al.  Reliability assessment of web applications , 2002, Proceedings 26th Annual International Computer Software and Applications.

[35]  A. Jefferson Offutt,et al.  Quality Attributes of Web Software Applications , 2002, IEEE Softw..

[36]  Martin Arlitt,et al.  Workload Characterization of the 1998 World Cup Web Site , 1999 .

[37]  James E. Pitkow Summary of WWW characterizations , 2004, World Wide Web.

[38]  Xuan Wang,et al.  A Contribution Towards Solving the Web Workload Puzzle , 2006, International Conference on Dependable Systems and Networks (DSN'06).

[39]  Virgílio A. F. Almeida,et al.  A methodology for workload characterization of E-commerce sites , 1999, EC '99.

[40]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[41]  PatternsYongjian,et al.  Clustering of Web Users Based on Access , 1999 .

[42]  John D. Musa,et al.  Software reliability - measurement, prediction, application , 1987, McGraw-Hill series in software engineering and technology.

[43]  Christos Faloutsos,et al.  Identifying Web Browsing Trends and Patterns , 2001, Computer.

[44]  Virgílio A. F. Almeida,et al.  Business-oriented resource management policies for e-commerce servers , 2000, Perform. Evaluation.