A methodology for designing efficient on-chip interconnects on well-behaved communication patterns

As the level of chip integration continues to advance at a fast pace, the desire for efficient interconnects - whether on-chip or off-chip - is rapidly increasing. Traditional interconnects like buses, point-to-point wires and regular topologies may suffer from poor resource sharing in the time and space domains, leading to high contention or low resource utilization. In this paper, we propose a design methodology for constructing networks for special-purpose computer systems with well-behaved (known) communication characteristics. A temporal and spatial model is proposed to define the sufficient condition for contention-free communication. Based upon this model, a design methodology using a recursive bisection technique is applied to systematically partition a parallel system such that the required number of links and switches is minimized while achieving low contention. Results show that the design methodology can generate more optimized on-chip networks with up to 60% fewer resources than meshes or tori while providing blocking performance closer to that of a fully connected crossbar.

[1]  Timothy Mark Pinkston,et al.  Design issues for core-based optoelectronic chips: a case study of the WARRP network router , 1999 .

[2]  Timothy Mark Pinkston,et al.  Characterization of Deadlocks in Irregular Networks , 2002, J. Parallel Distributed Comput..

[3]  Niraj K. Jha,et al.  MOGAC: a multiobjective genetic algorithm for hardware-software cosynthesis of distributed embedded systems , 1998, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[4]  Ruby B. Lee,et al.  Efficient permutation instructions for fast software cryptography , 2001 .

[5]  Scott Hauck,et al.  Reconfigurable computing: a survey of systems and software , 2002, CSUR.

[6]  Yuanyuan Yang,et al.  Nonblocking WDM switching networks with full and limited wavelength conversion , 2001, Proceedings Tenth International Conference on Computer Communications and Networks (Cat. No.01EX495).

[7]  Luciano Lavagno,et al.  Hardware-software codesign of embedded systems , 1994, IEEE Micro.

[8]  Jeffrey S. Vetter,et al.  Communication characteristics of large-scale scientific applications for contemporary cluster architectures , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[9]  Jason Miller,et al.  The Raw Processor: A Composeable 32-Bit Fabric for Embedded and General Purpose Computing , 2001 .

[10]  A. O. Fernandes,et al.  Hardware-software codesign of embedded systems , 1998, Proceedings. XI Brazilian Symposium on Integrated Circuit Design (Cat. No.98EX216).

[11]  W. Dally,et al.  Route packets, not wires: on-chip interconnection networks , 2001, Proceedings of the 38th Design Automation Conference (IEEE Cat. No.01CH37232).

[12]  Stephen D. Brown,et al.  Flexibility of interconnection structures for field-programmable gate arrays , 1991 .

[13]  Lionel M. Ni,et al.  The effects of network contention on processor allocation strategies , 1996, Proceedings of International Conference on Parallel Processing.

[14]  Cécile Germain,et al.  Static Communications in Parallel Scientific Propgrams , 1994, PARLE.

[15]  Shubhendu S. Mukherjee,et al.  The Alpha 21364 network architecture , 2001, HOT 9 Interconnects. Symposium on High Performance Interconnects.

[16]  William Gropp,et al.  Users guide for mpich, a portable implementation of MPI , 1996 .

[17]  David H. Bailey,et al.  The Nas Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl..

[18]  Yuanyuan Yang,et al.  Routing permutations with link-disjoint and node-disjoint paths in a class of self-routable networks , 2002, Proceedings International Conference on Parallel Processing.

[19]  Ramesh Subramonian,et al.  LogP: a practical model of parallel computation , 1996, CACM.

[20]  Shietung Peng,et al.  Wavelengths requirement for permutation routing in all-optical multistage interconnection networks , 2000, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000.

[21]  Wayne Wolf,et al.  Hardware-software co-design of embedded systems , 1994, Proc. IEEE.