Improving the Performance of Collective Communication for the On-Chip Network

Efficiently executing the massively parallel applications has become an important goal of developing a modern high-performance multicore computer. In these parallel programs, the collective communication among these cores consume a large portion of inter-core communication. In order to prevent the collective communication from the performance bottleneck of the on-chip network, this paper proposed a new on-chip network, call Hierarchy Self Similar Cubic (HSSC), to reduce the latency of the collective communication on the multicore system. The corresponding transmission mechanisms and packet scheduling mechanism are proposed to analyze and grouping the packets, and determine a suitable transmission mechanism for each packet group on-the-fly. The experiments compare the performance of several on-chip networks. The advantages of proposed transmission mechanisms and packet scheduling mechanism are also discussed.

[1]  Jack J. Dongarra,et al.  EZTrace: A Generic Framework for Performance Analysis , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[2]  Vaclav Dvorak,et al.  Optimizing Collective Communications on 2D-Mesh and Fat Tree NoC , 2010, 2010 Ninth International Conference on Networks.

[3]  David H. Bailey,et al.  The Nas Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl..

[4]  Natalie D. Enright Jerger,et al.  Supporting efficient collective communication in NoCs , 2012, IEEE International Symposium on High-Performance Comp Architecture.

[5]  Jack Dongarra,et al.  MPI - The Complete Reference: Volume 1, The MPI Core , 1998 .

[6]  Slo-Li Chu,et al.  Self Similar Cubic: A Novel Interconnection Network for Many-Core Architectures , 2012, 2012 Fifth International Symposium on Parallel Architectures, Algorithms and Programming.

[7]  Kenichi Miura,et al.  Tofu Interconnect 2: System-on-Chip Integration of High-Performance Interconnect , 2014, ISC.

[8]  Jack Donovan,et al.  SystemC: From the Ground Up , 2004 .

[9]  Wim Vanderbauwhede,et al.  A Communication Model of Broadcast in Wormhole-Routed Networks on-Chip , 2009, 2009 International Conference on Advanced Information Networking and Applications.