Characterization of the E-commerce Storage Subsystem Workload

This paper characterizes the workload seen at the storage subsystem of an e-commerce system. Measurements are conducted on multi-tiered systems running three different benchmarks, i.e., TPC-W, TPC-C, and RUBiS. In this environment, TPC-W and RUBiS are used to represent web-based e-commerce applications (i.e., on-line shopping and auctioning). They generate mostly READ-dominated workloads. The TPC-C benchmark, although not directly an e-commerce benchmark, is used to represent an e-commerce system under heavy on-line transactions processing activity. Different from the TPC-W and RUBiS benchmarks, TPC-C generates WRITE-dominated workloads.For all three benchmarks, in addition to the system load, the workload mix causes the system resources such as memory to saturate,IO traffic to increase, and, consequently, overall system throughput to reduce.Generally, when a workload shifts from web-site browsing (i.e, reading) to transaction processing (i.e writing) the IO load reduces but the footprint of the IO working set increases, which slows down the IO subsystem.File system and device driver scheduling represent elements in the IO path that for a given set of system resources further improve user-level throughput. Their impact is visible for medium to high utilization and diminishes for light load or overload.

[1]  Mikko H. Lipasti,et al.  An architectural evaluation of Java TPC-W , 2001, Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture.

[2]  Yuanyuan Zhou,et al.  Eviction-based Cache Placement for Storage Caches , 2003, USENIX Annual Technical Conference, General Track.

[3]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[4]  C. Amza,et al.  Specification and implementation of dynamic Web site benchmarks , 2002, 2002 IEEE International Workshop on Workload Characterization.

[5]  Lili Qiu,et al.  The content and access dynamics of a busy Web site: findings and implications , 2000 .

[6]  Tzi-cker Chiueh,et al.  Charm: An I/O-Driven Execution Strategy for High-Performance Transaction Processing , 2001, USENIX Annual Technical Conference, General Track.

[7]  Erich M. Nahum,et al.  Controlling the performance of 3-tiered web sites: modeling, design and implementation , 2004, SIGMETRICS '04/Performance '04.

[8]  Qi Zhang,et al.  Bottlenecks and Their Performance Implications in E-commerce Systems , 2004, WCW.

[9]  Qi Zhang,et al.  Workload propagation - overload in bursty servers , 2005, Second International Conference on the Quantitative Evaluation of Systems (QEST'05).

[10]  Sandeep Pandey,et al.  Monitoring the dynamic web to respond to continuous queries , 2003, WWW '03.

[11]  Erich M. Nahum,et al.  A method for transparent admission control and request scheduling in e-commerce web sites , 2004, WWW '04.

[12]  Mahmut T. Kandemir,et al.  Interplay of energy and performance for disk arrays running transaction processing workloads , 2003, 2003 IEEE International Symposium on Performance Analysis of Systems and Software. ISPASS 2003..

[13]  Peter Druschel,et al.  Anticipatory scheduling: a disk scheduling framework to overcome deceptive idleness in synchronous I/O , 2001, SOSP.

[14]  Mor Harchol-Balter,et al.  Priority mechanisms for OLTP and transactional Web applications , 2004, Proceedings. 20th International Conference on Data Engineering.

[15]  Javier García,et al.  TPC-W E-Commerce Benchmark Evaluation , 2003, Computer.

[16]  Alan Jay Smith,et al.  I/O reference behavior of production database workloads and the TPC benchmarks—an analysis at the logical level , 1999, TODS.

[17]  Jerome A. Rolia,et al.  Characterizing the scalability of a large web-based shopping system , 2001, ACM Trans. Internet Techn..