Performance Evaluation of Modern Network-on-Chip Router Architectures

On chip interconnection networks simplify the challenges of integrating large number of processing elements. Routers are backbone of networks. Buffers and crossbar in router consumes significant area and power of network. They have huge impact on overall performance and cost of network. Dual Xbar router architecture combines buffered and bufferless feature to reduce buffer read/write energy with dual crossbars. While Switch folding technique introduced to reduce wire density and decrease muxes in crossbar by increasing resource utilization. In this paper, we propose Folded Dual Xbar architecture by combining the Dual Xbar and Folding technique in order to get advantages of both architectures. Performance of architectures is implemented and evaluated using OMNET++ platform by applying multiple traffic patterns under different load conditions. We further calculated buffered and bufferless events to estimate the reduction in buffer read/write energy. Simulation results shows that there is slight increase in throughput and reduction in buffer read/write energy by average 46% at high loads in proposed 2-Folded Dual Xbar as compared to conventional architecture. Proposed 3-Folded Dual Xbar results at least 16.6 % increase in throughput as compared to conventional architecture with 43-45% reduced buffer read/write energy but slight increase in crossbar. Throughput of 3-Folded Dual Xbar decreased only by 5-7% as compared to Dual Xbar with distributed wire density advantage..

[1]  Onur Mutlu,et al.  A case for bufferless routing in on-chip networks , 2009, ISCA '09.

[2]  Janet Roveda,et al.  Adaptive inter-router links for low-power, area-efficient and reliable Network-on-Chip (NoC) architectures , 2009, 2009 Asia and South Pacific Design Automation Conference.

[3]  Muhammad E. S. Elrabaa,et al.  Improved Modified Fat-Tree Topology Network-on-Chip , 2011, J. Circuits Syst. Comput..

[4]  Niraj K. Jha,et al.  GARNET: A detailed on-chip network model inside a full-system simulator , 2009, 2009 IEEE International Symposium on Performance Analysis of Systems and Software.

[5]  John Kim,et al.  Low-cost router microarchitecture for on-chip networks , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[6]  Bin Li,et al.  Configuring algorithm for reconfigurable Network-on-Chip architecture , 2012, 2012 2nd International Conference on Consumer Electronics, Communications and Networks (CECNet).

[7]  Chifeng Wang,et al.  Design and Evaluation of a High Throughput QoS-Aware and Congestion-Aware Router Architecture for Network-on-Chip , 2012, 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[8]  Bevan M. Baas,et al.  RoShaQ: High-performance on-chip router with shared queues , 2011, 2011 IEEE 29th International Conference on Computer Design (ICCD).

[9]  Atsushi Matsumoto,et al.  High-Throughput Compact Delay-Insensitive Asynchronous NoC Router , 2014, IEEE Transactions on Computers.

[10]  Israel Cidon,et al.  HNOCS: Modular open-source simulator for Heterogeneous NoCs , 2012, 2012 International Conference on Embedded Computer Systems (SAMOS).

[11]  Natalie D. Enright Jerger,et al.  SCARAB: A single cycle adaptive routing and bufferless network , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[12]  Kun-Lin Tsai,et al.  A priority based output arbiter for NoC router , 2011, 2011 IEEE International Symposium of Circuits and Systems (ISCAS).

[13]  Avinash Karanth Kodi,et al.  Design of a performance enhanced and power reduced dual-crossbar Network-on-Chip (NoC) architecture , 2011, Microprocess. Microsystems.

[14]  Siamak Mohammadi,et al.  Quota setting router architecture for quality of service in GALS NoC , 2013, 2013 International Symposium on Rapid System Prototyping (RSP).

[15]  Ayhan Demiriz,et al.  CPNoC: On Using Constraint Programming in Design of Network-on-Chip Architecture , 2013, 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[16]  Yonggon Kim,et al.  Clumsy Flow Control for High-Throughput Bufferless On-Chip Networks , 2013, IEEE Computer Architecture Letters.

[17]  George Michelogiannakis,et al.  Router designs for elastic buffer on-chip networks , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[18]  Emmanouil Kalligeros,et al.  Switch folding: Network-on-Chip routers with time-multiplexed output ports , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[19]  Bill Lin,et al.  A High-Throughput Distributed Shared-Buffer NoC Router , 2009, IEEE Computer Architecture Letters.