Generating representative Web workloads for network and server performance evaluation
Abstract: One role for workload generation is as a means for understanding how servers and networks respond to variation in load. This enables management and capacity planning based on current and projected usage. This paper applies a number of observations of Web server usage to create a realistic Web workload generation tool that mimics a set of real users accessing a server. The tool, called Surge (Scalable URL Reference Generator), generates references matching empirical measurements of 1) server file size distribution; 2) request size distribution; 3) relative file popularity; 4) embedded file references; 5) temporal locality of reference; and 6) idle periods of individual users. This paper reviews the essential elements required in the generation of a representative Web workload. It also addresses the technical challenges of satisfying this large set of simultaneous constraints on the properties of the reference stream, the solutions we adopted, and their associated accuracy. Finally, we present evidence that Surge exercises servers in a manner significantly different from other Web server benchmarks.
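To make the idea of a simulated user concrete, the following is a minimal Python sketch of a reference generator that covers three of the six properties listed in the abstract: file size distribution, relative file popularity, and per-user idle periods. It is not Surge's implementation. The distribution families (Pareto for sizes and idle times, a Zipf-like law for popularity), every parameter value, and the names `zipf_cdf`, `make_file_sizes`, and `user_session` are assumptions chosen for illustration; embedded references and temporal locality are left out.

```python
import bisect
import itertools
import random


def zipf_cdf(n_files, exponent=1.0):
    """Cumulative probabilities for a Zipf-like popularity law over ranks 1..n.

    The exponent is an illustrative assumption, not a measured value.
    """
    weights = [1.0 / (rank ** exponent) for rank in range(1, n_files + 1)]
    total = sum(weights)
    return [c / total for c in itertools.accumulate(weights)]


def make_file_sizes(n_files, rng, shape=1.2, min_bytes=1000):
    """Heavy-tailed (Pareto) file sizes in bytes; parameters are assumptions."""
    return [int(min_bytes * rng.paretovariate(shape)) for _ in range(n_files)]


def user_session(n_requests, sizes, cdf, rng, idle_shape=1.5, idle_min=1.0):
    """Yield (idle_seconds, file_index, size_bytes) for one simulated user."""
    for _ in range(n_requests):
        # Idle period before the request, drawn from a heavy-tailed law.
        idle = idle_min * rng.paretovariate(idle_shape)
        # Inverse-CDF sampling of the file rank; index 0 is the most popular.
        idx = min(bisect.bisect_left(cdf, rng.random()), len(cdf) - 1)
        yield idle, idx, sizes[idx]


rng = random.Random(42)          # fixed seed so the trace is reproducible
sizes = make_file_sizes(200, rng)
cdf = zipf_cdf(200)
trace = list(user_session(1000, sizes, cdf, rng))
```

Each tuple in `trace` describes one request: how long the user was idle beforehand, which file was chosen, and how many bytes it has. Running several such sessions concurrently would be one way to vary the offered load, although matching all six constraints at once is the harder problem the paper addresses.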