论文信息 - Balanced Distributed Memory Parallel Computers

Balanced Distributed Memory Parallel Computers

Mismatches between on-chip high performance CPU and data access times is the basic reason for the increasing gap between peak and sustained performance in distributed memory parallel computers. We propose the concept of balanced architectures, based on a network with a dynamic topology and communication patterns determined at compile time. The corresponding processing element is a cacheless CPU, which can achieve a 1 FLOP/clock cycle rate. Network and PE features are presented. An example shows that balanced architectures keep efficiency when scaling.

[1] Franck Cappello,et al. Data layouts impacts on the compilation of the communications for a synchronous MSIMD machine , 1992, Microprocess. Microprogramming.

[2] Franck Cappello,et al. Static computation of standard linear algebra subroutines for PTAH , 1993, 1993 Euromicro Workshop on Parallel and Distributed Processing.

[3] Geoffrey C. Fox,et al. Scheduling regular and irregular communication patterns on the CM-5 , 1992, Proceedings Supercomputing '92.

[4] Alain Lichnewsky,et al. Introducing symbolic problem solving techniques in the dependence testing phases of a vectorizer , 1988, ICS '88.

[5] John B. Shoven,et al. I , Edinburgh Medical and Surgical Journal.

[6] Franck Cappello,et al. PTAH: Introduction to a New Parallel Architecture for Highly Numeric Processing , 1992, PARLE.

[7] V. Néri,et al. Hardware features of the static communication network of a parallel architecture , 1993, Microprocess. Microprogramming.

[8] Zhiyu Shen,et al. An Empirical Study on Array Subscripts and Data Dependencies , 1989, ICPP.

[9] R. Sarnath,et al. Proceedings of the International Conference on Parallel Processing , 1992 .