Re-examining Scheduling and Communication in Parallel Programs

Modern MPPs and NOWs have evolved in ways that affect both the scheduling of parallel applications and the communication layer. The presence of a full operating system on each processor and the need to support interactive users substantially alter the traditional environment. Parallel applications may no longer execute in the dedicated environment that fast communication layers, such as Active Messages, assume. In this paper we present a simulation-based study of the effects of a non-dedicated environment on parallel applications and investigate one method for reducing the resulting performance impact. Our results quantify the performance impact of the size of the flow-control window on parallel applications. We investigate increasing the size of this window to ameliorate the effect of various scheduling disturbances. Our results show that additional buffering in the communication layer significantly improves performance in the presence of large scheduling irregularities (e.g., those that occur when parallel applications are locally scheduled) but has a detrimental effect with smaller disturbances (e.g., quantum skew, daemon activity, and interactive users).
