Modeling Overlapped Operation between the Control Unit and Processing Elements in an SIMD Machine

Abstract A model for overlapped operation between the control unit (CU) and processing elements (PEs) in an SIMD machine is presented. The major requirements and structure of the CU for overlapped operation in SIMD mode are described and overlapped operation is formally defined. To use the computing power of both the CU and the PEs most effectively to execute a single program, a balanced work load between the CU and PEs is required. It is assumed that certain computations (e.g., the manipulation of loop index variables, PE-common array index calculations) can be migrated from the PEs to the CU and vice versa.This research demonstrates how to increase the effectiveness of an SIMD machine by allowing overlapped operation between the CU and PEs. The best overlapping can be achieved ideally by assigning an equal amount of work to be executed concurrently on the CU and PEs, resulting in a 2N speedup for an N-PE system. The goal of this research is to develop a model of overlapped operation in SIMD mode so that the actual maximum possible performance of the SIMD machine can be attained.

[1]  G. Amdhal,et al.  Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[2]  M. Auguin,et al.  Experience using a SIMD/SPMD multiprocessor architecture , 1987 .

[3]  Richard M. Brown,et al.  The ILLIAC IV Computer , 1968, IEEE Transactions on Computers.

[4]  Howard Jay Siegel,et al.  Limitations Imposed on Mixed-Mode Performance of Optimized Phases Due to Temporal Juxtaposition , 1991, J. Parallel Distributed Comput..

[5]  L. W. Tucker,et al.  Architecture and applications of the Connection Machine , 1988, Computer.

[6]  D. J. Hunt AMT DAP—a processor array in a workstation environment , 1989 .

[7]  W. Daniel Hillis,et al.  The connection machine , 1985 .

[8]  Kenneth E. Batcher,et al.  Bit-Serial Parallel Processing Systems , 1982, IEEE Transactions on Computers.

[9]  John L. Gustafson,et al.  Reevaluating Amdahl's law , 1988, CACM.

[10]  Gérard Giraudon,et al.  Image processing on a SIMD/SPMD architecture: OPSILA , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[11]  Howard Jay Siegel,et al.  Execution mode management and CU/PE overlap in an SIMD/SPMD parallel language/compiler , 1991, [1991] Proceedings The Fifteenth Annual International Computer Software & Applications Conference.

[12]  Kai Hwang,et al.  Computer architecture and parallel processing , 1984, McGraw-Hill Series in computer organization and architecture.

[13]  Thomas L. Casavant,et al.  Experimental Analysis of a Mixed-Mode Parallel Architecture Using Bitonic Sequence Sorting , 1991, J. Parallel Distributed Comput..

[14]  Tom Blank,et al.  The MasPar MP-1 architecture , 1990, Digest of Papers Compcon Spring '90. Thirty-Fifth IEEE Computer Society International Conference on Intellectual Leverage.

[15]  Michael J. Flynn,et al.  Very high-speed computing systems , 1966 .

[16]  Pearl Y. Wang,et al.  A formal model for SIMD computation , 1988, Proceedings., 2nd Symposium on the Frontiers of Massively Parallel Computation.

[17]  Frederica Darema,et al.  A single-program-multiple-data computational model for EPEX/FORTRAN , 1988, Parallel Comput..

[18]  Richard S. Piepho,et al.  A comparison of RISC architectures , 1989, IEEE Micro.

[19]  Howard Jay Siegel,et al.  Instruction execution trade-offs for SIMD vs. MIMD vs. mixed mode parallelism , 1991, [1991] Proceedings. The Fifth International Parallel Processing Symposium.

[20]  Kenneth E. Batcher STARAN parallel processor system hardware , 1974, AFIPS '74.

[21]  Peter Christy,et al.  Software to support massively parallel computing on the MasPar MP-1 , 1990, Digest of Papers Compcon Spring '90. Thirty-Fifth IEEE Computer Society International Conference on Intellectual Leverage.

[22]  Kenneth E. Batcher,et al.  Design of a Massively Parallel Processor , 1980, IEEE Transactions on Computers.

[23]  Howard Jay Siegel,et al.  PASM: A Partitionable SIMD/MIMD System for Image Processing and Pattern Recognition , 1981, IEEE Transactions on Computers.

[24]  Thomas L. Casavant,et al.  Experimental Application-Driven Architecture Analysis of an SIMD/MIMD Parallel Processing System , 1990, IEEE Trans. Parallel Distributed Syst..