Locality and Loop Scheduling on NUMA Multiprocessors

An important issue in the parallel execution of loops is how to partition and schedule the loops onto the available processors. While most existing dynamic scheduling algorithms manage load imbalance well, they fail to take locality into account and therefore perform poorly on parallel systems with non-uniform memory access times.
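As a rough illustration of the kind of dynamic scheduling the abstract refers to, the sketch below computes the chunk sizes that guided self-scheduling would hand out from a central work queue. It is an illustrative assumption, not the algorithm proposed in this paper: the function name `guided_chunks` and its parameters are hypothetical, and the rule used (each request takes the ceiling of the remaining iterations divided by the number of processors) is the commonly cited form of guided self-scheduling.

```python
import math


def guided_chunks(n_iters, n_procs):
    """Split iterations 0..n_iters-1 into guided self-scheduling chunks.

    Hypothetical helper for illustration: each chunk takes
    ceil(remaining / n_procs) iterations from a single shared queue.
    Returns a list of (start, size) pairs in the order they are handed out.
    """
    chunks = []
    start = 0
    while start < n_iters:
        remaining = n_iters - start
        size = math.ceil(remaining / n_procs)
        chunks.append((start, size))
        start += size
    return chunks


if __name__ == "__main__":
    # Chunks shrink as the loop drains, which evens out the finishing times.
    for start, size in guided_chunks(100, 4):
        print(f"iterations {start}..{start + size - 1} ({size} iterations)")
```

Shrinking chunks balance the load, but nothing ties a chunk to the processor whose local memory or cache already holds the data for those iterations. Whichever processor asks next gets the next chunk, and that is the locality problem the abstract describes for machines with non-uniform memory access times.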
