On “Exploiting” Node-Heterogeneous Clusters Optimally

Abstract It is proved that “FIFO” worksharing protocols provide asymptotically optimal solutions to two problems related to sharing large collections of independent tasks in a heterogeneous network of workstations (HNOW) $\mathcal{N}$ . In the $\mathsf{HNOW-Exploitation Problem}$ , one seeks to accomplish as much work as possible on $\mathcal{N}$ during a prespecified fixed period of L time units. In the $\mathsf{HNOW-Rental Problem}$ , one seeks to complete W units of work by “renting” $\mathcal{N}$ for as short a time as necessary. The worksharing protocols we study are crafted within an architectural model that characterizes $\mathcal {N}$ via parameters that measure $\mathcal{N}$ ’s workstations’ computational and communicational powers. All valid protocols are self-scheduling, in the sense that they determine completely both an amount of work to allocate to each of $\mathcal{N}$ ’s workstations and a schedule for all related interworkstation communications. The schedules provide either a value for W given L, or a value for L given W, hence solve both of the motivating problems. A protocol observes a FIFO regimen if it has $\mathcal{N}$ ’s workstations finish their assigned work, and return their results, in the same order in which they are supplied with their workloads. The proven optimality of FIFO protocols resides in the fact that they accomplish at least as much work as any other protocol during all sufficiently long worksharing episodes, and that they complete sufficiently large given collections of tasks at least as fast as any other protocol. Simulation experiments illustrate that the superiority of FIFO protocols is often observed during worksharing episodes of only a few minutes’ duration.

[1]  Richard M. Karp,et al.  Optimal broadcast and summation in the LogP model , 1993, SPAA '93.

[2]  Henri Casanova,et al.  UMR: a multi-round algorithm for scheduling divisible workloads , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[3]  Gerassimos D. Barlas Collection-Aware Optimum Sequencing of Operations and Closed-Form Solutions for the Distribution of a Divisible Load on Arbitrary Processor Trees , 1998, IEEE Trans. Parallel Distributed Syst..

[4]  Dhabaleswar K. Panda,et al.  Efficient collective communication on heterogeneous networks of workstations , 1998, Proceedings. 1998 International Conference on Parallel Processing (Cat. No.98EX205).

[5]  John Galletly,et al.  In Search of Clusters:982Gregory F. Pfister. In Search of Clusters: The Coming Battle in Lowly Parallel Computing. Prentice‐Hall, 1995. xxiv + 415 pp, ISBN: 0‐13‐437625‐0: The Coming Battle in Lowly Parallel Computing , 1998 .

[6]  Ramesh Subramonian,et al.  LogP: a practical model of parallel computation , 1996, CACM.

[7]  Yves Robert,et al.  The Master-Slave Paradigm with Heterogeneous Processors , 2001, CLUSTER.

[8]  Ramesh Subramonian,et al.  LogP: towards a realistic model of parallel computation , 1993, PPOPP '93.

[9]  Arnold L. Rosenberg An algorithmic model for heterogeneous clusters: rationale and experience , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[10]  Yves Robert,et al.  Scheduling Strategies for Master-Slave Tasking on Heterogeneous Processor Grids , 2002, PARA.

[11]  Larry Carter,et al.  Scheduling strategies for master-slave tasking on heterogeneous processor platforms , 2004, IEEE Transactions on Parallel and Distributed Systems.

[12]  Viktor K. Prasanna,et al.  Adaptive Communication Algorithms for Distributed Heterogeneous Systems , 1999, J. Parallel Distributed Comput..

[13]  Thomas G. Robertazzi,et al.  Distributed computation for a tree network with communication delays , 1990 .

[14]  Tsan-sheng Hsu,et al.  Task Allocation on a Network of Processors , 2000, IEEE Trans. Computers.

[15]  Debasish Ghose,et al.  Optimal Sequencing and Arrangement in Distributed Single-Level Tree Networks with Communication Delays , 1994, IEEE Trans. Parallel Distributed Syst..

[16]  Ishwar K. Sethi,et al.  Modeling Cone-Beam Tomographic Reconstruction Using LogSMP: An Extenced LogP Model for Clusters of SMPs , 1999, HiPC.

[17]  Larry Carter,et al.  Bandwidth-centric allocation of independent tasks on heterogeneous platforms , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[18]  Pierre-François Dutot Master-slave tasking on heterogeneous processors , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[19]  Arnold L. Rosenberg,et al.  Efficient trigger-broadcasting in heterogeneous clusters , 2005, J. Parallel Distributed Comput..

[20]  Viktor K. Prasanna,et al.  Efficient collective communication in distributed heterogeneous systems , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems (Cat. No.99CB37003).

[21]  Dhabaleswar K. Panda,et al.  Multicast on irregular switch-based networks with wormhole routing , 1997, Proceedings Third International Symposium on High-Performance Computer Architecture.

[22]  David E. Culler,et al.  A case for NOW (networks of workstation) , 1995, PODC '95.

[23]  Debasish Ghose,et al.  Scheduling Divisible Loads in Parallel and Distributed Systems , 1996 .

[24]  Yves Robert,et al.  Scheduling Divisible Loads with Return Messages on Heterogeneous Master-Worker Platforms , 2005, HiPC.

[25]  Paul D. Gader,et al.  Image algebra techniques for parallel image processing , 1987 .

[26]  Debasish Ghose,et al.  Multi-installment load distribution in tree networks with delays , 1995 .

[27]  Arnold L. Rosenberg,et al.  Asymptotically optimal worksharing in HNOWs: how long is "sufficiently long?" , 2003, 36th Annual Simulation Symposium, 2003..

[28]  Arnold L. Rosenberg,et al.  Sharing partitionable workloads in heterogeneous NOWs: greedier is not better , 2001, Proceedings 42nd IEEE Symposium on Foundations of Computer Science.

[29]  Chris J. Scheiman,et al.  LogGP: Incorporating Long Messages into the LogP Model for Parallel Computation , 1997, J. Parallel Distributed Comput..

[30]  Franck Cappello,et al.  An algorithmic model for heterogeneous hyper-clusters: rationale and experience , 2005, Int. J. Found. Comput. Sci..