论文信息 - Implementing malleability on MPI jobs

Implementing malleability on MPI jobs

Parallel jobs are characterized for having processes that communicate and synchronize with each other frequently. A processor allocation strategy widely used in parallel supercomputers is space-sharing, that is assigning a processors partition to each job for its exclusive use. We present a global solution to offer virtual malleability on message-passing parallel jobs, by applying a processor allocation strategy, the Folding by JobType (FJT). This technique is based on folding and moldability concepts and tries to decide the optimal initial number of processes, when to fold jobs and the number of folding times by analyzing the current and past system information. At processor level, we apply co-scheduling. We implement and evaluate the FJT under several workloads with different job sizes, classes and machine utilization. Results show that the FJT adapts easily to load changes, and can obtain better performance than the rest evaluated, on workloads with high coefficient variation and especially with burst arrivals.

[1] Uwe Schwiegelshohn,et al. Theory and Practice in Parallel Job Scheduling , 1997, JSSPP.

[2] Dror G. Feitelson,et al. Backfilling with Lookahead to Optimize the Performance of Parallel Job Scheduling , 2003, JSSPP.

[3] John Zahorjan,et al. Zahorjan processor allocation policies for message-passing parallel computers , 1994, SIGMETRICS 1994.

[4] Message P Forum,et al. MPI: A Message-Passing Interface Standard , 1994 .

[5] Chita R. Das,et al. A closer look at coscheduling approaches for a network of workstations , 1999, SPAA '99.

[6] David H. Bailey,et al. The Nas Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl..

[7] Dror G. Feitelson,et al. Job Characteristics of a Production Parallel Scientivic Workload on the NASA Ames iPSC/860 , 1995, JSSPP.

[8] Giuseppe Serazzi,et al. Analysis of Non-Work-Conserving Processor Partitioning Policies , 1995, JSSPP.

[9] Jitendra Padhye,et al. Dynamic versus Adaptive Processor Allocation Policies for Message Passing Parallel Computers: An Empirical Comparison , 1996, JSSPP.