Non-intrusive Estimation of QoS Degradation Impact on E-Commerce User Satisfaction

With the massification of high speed Internet access, recent industry consumer reports show that Web site performance is increasingly becoming a key feature in determining user satisfaction, and finally, a decisive factor in whether a user will purchase on a Web site or even return to it. Traditional Web infrastructure capacity planning has focused on maintaining high throughput and availability on Web sites, optimizing the number of servers to serve peak hours to minimize costs. However, as we will show with our study, the conversion rate, the fraction of users that purchase on a site, is higher at peak hours, where systems are more exposed to suffer overload. In this article we propose a methodology to determine the thresholds of user satisfaction as the QoS delivered by an online business degrades, and to estimate its effects on actual sales. The novelty of the presented technique is that it does not involve any intrusive manipulation of production systems, but a learning process over historic sales data that is combined with system performance measurements. The methodology has been applied to Atrapalo.com, a top national Travel and Booking site. For our experiments, we were given access to a 3 year long sales history dataset, as well as actual HTTP and resource consumption logs for several weeks. Obtained results enable autonomic resource managers to set best performance goals and optimize the number of server according to the workload, without surpassing the thresholds of user satisfaction and maximizing revenue for the site.

[1]  Jordi Torres,et al.  Characterization of workload and resource consumption for an online travel and booking site , 2010, IEEE International Symposium on Workload Characterization (IISWC'10).

[2]  Jakob Nielsen,et al.  Usability engineering at a discount , 1989 .

[3]  Jordi Torres,et al.  Session-based adaptive overload control for secure dynamic Web applications , 2005, 2005 International Conference on Parallel Processing (ICPP'05).

[4]  Javier García,et al.  A QoS Control Mechanism to Provide Service Differentiation and Overload Protection to Internet Scalable Servers , 2009, IEEE Transactions on Services Computing.

[5]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[6]  Robert B. Miller,et al.  Response time in man-computer conversational transactions , 1899, AFIPS Fall Joint Computing Conference.

[7]  R. Gavaldà,et al.  Automatic Detection and Banning of Content Stealing Bots for E-commerce , 2007 .

[8]  Michele Mazzucco Towards Autonomic Service Provisioning Systems , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[9]  Stephen A. Jarvis,et al.  Predictive and Dynamic Resource Allocation for Enterprise Applications , 2010, 2010 10th IEEE International Conference on Computer and Information Technology.

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  Jordi Torres,et al.  Web Customer Modeling for Automated Session Prioritization on High Traffic Sites , 2007, User Modeling.

[12]  Dennis F. Galletta,et al.  Web Site Delays: How Tolerant are Users? , 2004, J. Assoc. Inf. Syst..

[13]  Daniel A. Menascé,et al.  Business-oriented autonomic load balancing for multitiered Web sites , 2009, 2009 IEEE International Symposium on Modeling, Analysis & Simulation of Computer and Telecommunication Systems.

[14]  Jordi Torres,et al.  Self-adaptive utility-based web session management , 2009, Comput. Networks.