Performance-Guided Load (Un)balancing under Autocorrelated Flows

Size-based policies have been shown in the literature to effectively balance the load and improve performance in cluster environments. Size-based policies assign jobs to servers based on the job size and their performance improvements are an outcome of separating ";short"; from ";long"; jobs, by avoiding having short jobs waiting behind long jobs for service. In this paper, we present evidence that performance improvements due to this separation quickly vanish if the arrival process to the cluster is autocorrelated. Based on our observations, we devise a new size-based policy called D_EQAL that still strives to separate jobs to servers according to job size but this separation is now biased by an effort to reduce performance loss due to autocorrelation in the arrival flows to each server. As a result of this bias, all servers may not be equally utilized (i.e., the load in the system may be ";unbalanced";), but performance benefits become significant. D_EQAL can be used on-line as it does not assume any a priori knowledge of the incoming workload. Extensive simulations show the effectiveness of D_EQAL under autocorrelated and uncorrelated arrival streams and illustrate that the policy successfully self- adjusts the degree of load unbalancing based on monitored performance measures.

[1]  Wei Sun,et al.  Workload-aware load balancing for clustered Web servers , 2005, IEEE Transactions on Parallel and Distributed Systems.

[2]  S. Wittevrongel,et al.  Queueing systems , 2019, Autom..

[3]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[4]  Evgenia Smirni,et al.  Characterizing the BMAP/MAP/1 Departure Process via the ETAQA Truncation , 2005 .

[5]  Daniel P. Heyman,et al.  Modeling multiple IP traffic streams with rate limits , 2003, TNET.

[6]  Vishal Misra,et al.  Optimal state-free, size-aware dispatching for heterogeneous M/G/-type systems , 2005, Perform. Evaluation.

[7]  Walter Willinger,et al.  Experimental queueing analysis with long-range dependent packet traffic , 1996, TNET.

[8]  Prasant Mohapatra,et al.  Characterization of E-Commerce Traffic , 2003, Electron. Commer. Res..

[9]  Mor Harchol-Balter,et al.  On Choosing a Task Assignment Policy for a Distributed Server System , 1998, J. Parallel Distributed Comput..

[10]  Yong Meng Teo,et al.  Comparison of Load Balancing Strategies on Cluster-based Web Servers , 2001, Simul..

[11]  Alma Riska,et al.  Disk Drive Level Workload Characterization , 2006, USENIX Annual Technical Conference, General Track.

[12]  Dennis Bushmitch,et al.  Thinning, striping and shuffling: traffic shaping and transport techniques for variable bit rate video , 2002, Global Telecommunications Conference, 2002. GLOBECOM '02. IEEE.

[13]  Kenneth J. Christensen,et al.  Reduction of self-similarity by application-level traffic shaping , 1997, Proceedings of 22nd Annual Conference on Local Computer Networks.

[14]  Alma Riska,et al.  Long-Range Dependence at the Disk Drive Level , 2006, Third International Conference on the Quantitative Evaluation of Systems - (QEST'06).

[15]  Mor Harchol-Balter,et al.  Exploiting process lifetime distributions for dynamic load balancing , 1995, SIGMETRICS.

[16]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .

[17]  Evgenia Smirni,et al.  FlexSplit: a workload-aware, adaptive load balancing strategy for media clusters , 2006, Electronic Imaging.

[18]  Qi Zhang,et al.  Performance impacts of autocorrelated flows in multi-tiered systems , 2007, Perform. Evaluation.

[19]  Victoria Ungureanu,et al.  Class-Dependent Assignment in cluster-based servers , 2004, SAC '04.

[20]  Martin Arlitt,et al.  Workload Characterization of the 1998 World Cup Web Site , 1999 .

[21]  Michele Colajanni,et al.  Performance study of dispatching algorithms in multi-tier web architectures , 2002, PERV.

[22]  Qi Zhang,et al.  Load Unbalancing to Improve Performance under Autocorrelated Traffic , 2006, 26th IEEE International Conference on Distributed Computing Systems (ICDCS'06).

[23]  Dirk Abendroth,et al.  Intelligent shaping: well shaped throughout the entire network? , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[24]  Amarnath Mukherjee,et al.  On resource management and QoS guarantees for long range dependent traffic , 1995, Proceedings of INFOCOM'95.

[25]  Fei Xue,et al.  Self-similar traffic shaping at the edge router in optical packet-switched networks , 2002, 2002 IEEE International Conference on Communications. Conference Proceedings. ICC 2002 (Cat. No.02CH37333).

[26]  Erich M. Nahum,et al.  Locality-aware request distribution in cluster-based network servers , 1998, ASPLOS VIII.

[27]  Ludmila Cherkasova,et al.  An SLA-oriented capacity planning tool for streaming media services , 2004, International Conference on Dependable Systems and Networks, 2004.