Analyzing performance of sequencing mechanisms for simple layered task systems

Considers the problem of sequencing a set of parallel activities in the presence of nonzero overheads. Mechanisms for sequence control may range from explicit inter-task synchronization to more restrictive mechanisms such as blocking barriers. It is highly desirable to be able to base the choice of a sequence control mechanism for a parallel algorithm upon specific characteristics of the algorithm's structure and the underlying architecture. The paper presents approximate models for simple layered task systems that can predict overall performance and provide a useful understanding of key performance parameters and tradeoffs. The analytic results are compared with simulation to demonstrate their accuracy. Some simple applications of the model are presented that demonstrate non-intuitive behavior of layered graphs with inter-task versus barrier-based sequencing.<<ETX>>

[1]  Laxmi N. Bhuyan,et al.  Performance Analysis of Layered Task Graphs , 1991, ICPP.

[2]  Larry Rudolph,et al.  Efficient synchronization of multiprocessors with shared memory , 1988, TOPL.

[3]  Vivek Sarkar Synchronization using counting semaphores , 1988, ICS '88.

[4]  Timothy S. Axelrod,et al.  Effects of synchronization barriers on multiprocessor performance , 1986, Parallel Comput..

[5]  Harry F. Jordan,et al.  Comparing barrier algorithms , 1989, Parallel Comput..

[6]  Edward D. Lazowska,et al.  The Effect of Scheduling Discipline on Spin Overhead in Shared Memory Parallel Systems , 1991, IEEE Trans. Parallel Distributed Syst..

[7]  S. Leigh,et al.  Probability and Random Processes for Electrical Engineering , 1989 .

[8]  S. Madala,et al.  Performance of Synchronous Parallel Algorithms with Regular Structures , 1991, IEEE Trans. Parallel Distributed Syst..

[9]  J. T. Robinson,et al.  Some Analysis Techniques for Asynchronous Multiprocessor Algorithms , 1979, IEEE Transactions on Software Engineering.

[10]  Anne Greenbaum Synchronization costs on multiprocessors , 1989, Parallel Comput..

[11]  Stephen F. Lundstrom,et al.  Predicting Performance of Parallel Computations , 1990, IEEE Trans. Parallel Distributed Syst..

[12]  Constantine D. Polychronopoulos,et al.  The Effect of Barrier Synchronization and Scheduling Overhead on Parallel Loops , 1989, ICPP.

[13]  Larry Rudolph,et al.  Efficient synchronization of multiprocessors with shared memory , 1986, PODC '86.