Paper information - On the efficiency of localized work stealing

On the efficiency of localized work stealing

Abstract: This paper investigates a variant of the work-stealing algorithm that we call the localized work-stealing algorithm. The intuition behind this variant is that because of locality, processors can benefit from working on their own work. Consequently, when a processor is free, it makes a steal attempt to get back its own work. We call this type of steal a steal-back. We show that the expected running time of the algorithm is $T_1/P + O(T_\infty P)$, and that under the "even distribution of free agents assumption", the expected running time of the algorithm is $T_1/P + O(T_\infty \lg P)$. In addition, we obtain another running-time bound based on ratios between the sizes of serial tasks in the computation. If $M$ denotes the maximum ratio between the largest and the smallest serial tasks of a processor after removing a total of $O(P)$ serial tasks across all processors from consideration, then the expected running time of the algorithm is $T_1/P + O(T_\infty M)$.
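To make the three bounds easier to compare, the following is a minimal sketch that evaluates their shapes for concrete inputs. The function name `running_time_bounds`, the dictionary keys, and the constant `c` are assumptions introduced here for illustration: the abstract states the overhead terms only up to an unspecified constant factor inside the $O(\cdot)$, so the numbers indicate relative growth rather than actual running times.

```python
import math


def running_time_bounds(t1, t_inf, p, m, c=1.0):
    """Evaluate the shapes of the three bounds stated in the abstract.

    t1    : total work T_1
    t_inf : span (critical-path length) T_infinity
    p     : number of processors P
    m     : task-size ratio M as described in the abstract
    c     : HYPOTHETICAL stand-in for the constant hidden by O(.);
            the abstract does not give its value.
    """
    base = t1 / p  # the T_1 / P term shared by all three bounds
    return {
        "general": base + c * t_inf * p,                     # T_1/P + O(T_inf * P)
        "even_free_agents": base + c * t_inf * math.log2(p),  # T_1/P + O(T_inf * lg P)
        "task_ratio": base + c * t_inf * m,                   # T_1/P + O(T_inf * M)
    }


if __name__ == "__main__":
    # Illustrative inputs only; they are not taken from the paper.
    for name, value in running_time_bounds(t1=1_000_000, t_inf=100, p=16, m=3).items():
        print(f"{name}: {value}")
```

With these illustrative inputs the shared $T_1/P$ term dominates, and the three bounds differ only in the overhead term, which scales with $P$, $\lg P$, or $M$ respectively.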

Charles E. Leiserson | Warut Suksompong | Tao B. Schardl
