Task assignment policies in distributed server systems: A survey

Data intensive computing, in the Web environment, motivates the distributed designs of Web server systems (Web clusters) because of their scalability and cost-effectiveness instead of one Web server with high performance. The task assignment policy, in such systems, focuses on the manner of assigning the tasks that reach these systems (e.g. the case of intensive requests that reach the distributed Web server systems) in order to minimize the response time and thus, improve the performance. These tasks, generally, follow the ''heavy-tailed'' distribution which has the property that there is a tiny fraction (about 3%) of large tasks that makes half (50%) of the total load. Several policies were proposed in the literature to deal with the nature of this Web traffic. This paper presents a state-of-art of the existing task assignment policies. We classify these policies in two classes: policies which assume that the task size is known a priori, and policies which assume that the task size is not known a priori (like TAGS, TAPTF and TAPTF-WC). The first class of policies regroups policies which consider no knowledge of load information at the servers when assigning the incoming tasks, known as static policies (like Random, Round Robin, etc.) and, policies, known as dynamic policies (like Central Queue Policy, Least Loaded First ''LLF'', etc.) which use some load information (e.g. the processing capacity, the queue load, etc.) to process.

[1]  S. A. Nozaki,et al.  Approximations in finite-capacity multi-server queues by Poisson arrivals , 1978, Journal of Applied Probability.

[2]  Zahir Tari,et al.  Task assignment strategy for overloaded systems , 2003, Proceedings of the Eighth IEEE Symposium on Computers and Communications. ISCC 2003.

[3]  Victoria Ungureanu,et al.  Deferred Assignment Scheduling in Cluster-Based Servers , 2006, Cluster Computing.

[4]  Mor Harchol-Balter,et al.  On Choosing a Task Assignment Policy for a Distributed Server System , 1998, J. Parallel Distributed Comput..

[5]  Carlos Juiz,et al.  Scalable QoS Content-Aware Load Balancing Algorithm for a Web Switch Based on Classical Policies , 2008, 22nd International Conference on Advanced Information Networking and Applications (aina 2008).

[6]  Tapani Lehtonen,et al.  On the optimality of the shortest line discipline , 1984 .

[7]  Zahir Tari,et al.  Task Assignment Based on Prioritising Traffic Flows , 2004, OPODIS.

[8]  Yossi Azar,et al.  Minimizing the flow time without migration , 1999, STOC '99.

[9]  Mark S. Squillante,et al.  Analysis of task assignment with cycle stealing under central queue , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[10]  Zahir Tari,et al.  Task assignment with work-conserving migration , 2006, Parallel Comput..

[11]  Mor Harchol-Balter,et al.  Evaluation of Task Assignment Policies for Supercomputing Servers: The Case for Load Unbalancing and Fairness , 2004, Cluster Computing.

[12]  Mor Harchol-Balter,et al.  Analysis of SRPT scheduling: investigating unfairness , 2001, SIGMETRICS '01.

[13]  Roy T. Fielding,et al.  Hypertext Transfer Protocol - HTTP/1.1 , 1997, RFC.

[14]  Cheng-Zhong Xu,et al.  Processing Rate Allocation for Proportional Slowdown Differentiation on Internet Servers , 2004, IPDPS.

[15]  Martin F. Arlitt,et al.  Web server workload characterization: the search for invariants , 1996, SIGMETRICS '96.

[16]  Zahir Tari,et al.  Dynamic task assignment in server farms: better performance by task grouping , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[17]  Mor Harchol-Balter,et al.  Size-based scheduling to improve web performance , 2003, TOCS.

[18]  Gianfranco Ciardo,et al.  EQUILOAD: a load balancing policy for clustered web servers , 2001, Perform. Evaluation.

[19]  Erich M. Nahum,et al.  Enabling Content-Based Load Distribution for Scalable Services , 2002 .

[20]  Cheng-Zhong Xu,et al.  Robust processing rate allocation for proportional slowdown differentiation on Internet servers , 2004, IEEE Transactions on Computers.

[21]  Zahir Tari,et al.  A scalable Multi-Tier Task Assignment Policy with Minimum Excess Load , 2010, The IEEE symposium on Computers and Communications.

[22]  Mor Harchol-Balter,et al.  Task assignment in a distributed system (extended abstract): improving performance by unbalancing load , 1997, SIGMETRICS '98/PERFORMANCE '98.

[23]  Margo I. Seltzer,et al.  Web Facts and Fantasy , 1997, USENIX Symposium on Internet Technologies and Systems.

[24]  William LeFebvre,et al.  CNN.com: Facing a World Crisis , 2001, LiSA.

[25]  Azer Bestavros,et al.  Self-similarity in World Wide Web traffic: evidence and possible causes , 1996, SIGMETRICS '96.

[26]  M. Crovella,et al.  Heavy-tailed probability distributions in the World Wide Web , 1998 .

[27]  Alan Scheller-Wolf,et al.  Surprising results on task assignment in server farms with high-variability workloads , 2009, SIGMETRICS '09.

[28]  Zahir Tari,et al.  A dynamic load distribution strategy for systems under high task variation and heavy traffic , 2003, SAC '03.

[29]  Wei Sun,et al.  ADAPTLOAD: effective balancing in clustered web servers under transient load conditions , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[30]  Mor Harchol-Balter Task assignment with unknown duration , 2002, JACM.

[31]  Alan Scheller-Wolf,et al.  Why segregating short jobs from long jobs under high variability is not always a win , 2009, 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[32]  Martin Arlitt,et al.  Web Workload Characterization: Ten Years Later , 2005 .

[33]  Balachander Krishnamurthy,et al.  Web Protocols and Practice - HTTP/1.1, Networking Protocols, Caching, and Traffic Measurement , 2001 .

[34]  Zahir Tari,et al.  A least flow-time first load sharing approach for distributed server farm , 2005, J. Parallel Distributed Comput..