A Low-Cost Test Solution for Reliable Communication in Networks-on-Chip

Networks-on-chip (NoC) provide the communication infrastructure for high-speed and large-scale computation that integrates several IP-cores on a single die. Faults on network channels severely degrade system performance and throughput. This paper presents a distributed and online mechanism for detecting and locating stuck-at faults (SAFs) in NoC channels. We also study the effects of such faults on various network performance metrics. The inherent parallelism present in the architecture is utilized to design a scheduling scheme that reduces the overall test time and overhead significantly. The proposed test solution scales well with network size, channel width, and network topology. Hardware synthesis based on FPGA shows that it needs small area overhead and low test time compared to prior approaches. Furthermore, it improves packet latency and reduces energy consumption.

[1]  Santosh Biswas,et al.  An odd-even model for diagnosis of shorts on NoC interconnects , 2015, 2015 Annual IEEE India Conference (INDICON).

[2]  Jianwen Zhu,et al.  Saturating the transceiver bandwidth: switch fabric design on FPGAs , 2012, FPGA '12.

[3]  Shekhar Y. Borkar,et al.  Microarchitecture and Design Challenges for Gigascale Integration , 2004, MICRO.

[4]  Salvatore Monteleone,et al.  Cycle-Accurate Network on Chip Simulation with Noxim , 2016, ACM Trans. Model. Comput. Simul..

[5]  James C. Hoe,et al.  CONNECT: re-examining conventional wisdom for designing nocs in the context of FPGAs , 2012, FPGA '12.

[6]  Reza Faghih Mirzaee,et al.  Non-preemptive offline multi-job mapping for a photonic network on a chip , 2017, Nano Commun. Networks.

[7]  S. Madhava Rao,et al.  In-Field Test for Permanent Faults in FIFO Buffers of NOC Routers , 2016 .

[8]  Alexandre M. Amory,et al.  A scalable test strategy for network-on-chip routers , 2005, IEEE International Conference on Test, 2005..

[9]  Partha Pratim Pande,et al.  Testing Network-on-Chip Communication Fabrics , 2007, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[10]  Dhiraj K. Pradhan,et al.  Thermal-aware testing of network-on-chip using multiple-frequency clocking , 2006, 24th IEEE VLSI Test Symposium.

[11]  Radu Marculescu,et al.  On-chip traffic modeling and synthesis for MPEG-2 video applications , 2004, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[12]  Alexandre M. Amory,et al.  A High-Fault-Coverage Approach for the Test of Data, Control and Handshake Interconnects in Mesh Networks-on-Chip , 2008, IEEE Transactions on Computers.

[13]  Nachiket Kapre,et al.  Hoplite: Building austere overlay NoCs for FPGAs , 2015, 2015 25th International Conference on Field Programmable Logic and Applications (FPL).

[14]  Dipak Ghosal,et al.  Improving network performance on multicore systems: Impact of core affinities on high throughput flows , 2016, Future Gener. Comput. Syst..

[15]  Santosh Biswas,et al.  A Time-Optimized Scheme Towards Analysis of Channel-Shorts in on-Chip Networks , 2017, J. Electron. Test..

[16]  Santosh Biswas,et al.  An odd-even scheme to prevent a packet from being corrupted and dropped in fault tolerant NoCs , 2016, 2016 IEEE 22nd International Symposium on On-Line Testing and Robust System Design (IOLTS).

[17]  Santanu Chattopadhyay,et al.  In-Field Test for Permanent Faults in FIFO Buffers of NoC Routers , 2016, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[18]  Luca P. Carloni,et al.  Networks-on-chip in emerging interconnect paradigms: Advantages and challenges , 2009, 2009 3rd ACM/IEEE International Symposium on Networks-on-Chip.

[19]  Marcelo Lubaszewski,et al.  Concurrent test of Network-on-Chip interconnects and routers , 2010, 2010 11th Latin American Test Workshop.

[20]  L. Benini,et al.  Xpipes: a network-on-chip architecture for gigascale systems-on-chip , 2004, IEEE Circuits and Systems Magazine.

[21]  André DeHon,et al.  FPGA optimized packet-switched NoC using split and merge primitives , 2012, 2012 International Conference on Field-Programmable Technology.

[22]  Altamiro Amadeu Susin,et al.  RASoC: a router soft-core for networks-on-chip , 2004, Proceedings Design, Automation and Test in Europe Conference and Exhibition.

[23]  Armin Runge Fault-tolerant Network-on-Chip based on Fault-aware Flits and Deflection Routing , 2015, NOCS.

[24]  Sorin Cotofana,et al.  A Novel Flit Serialization Strategy to Utilize Partially Faulty Links in Networks-on-Chip , 2012, 2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip.

[25]  Ning Wu,et al.  Novel NoC Mapping Scheme Optimized for Testing Time , 2014 .

[26]  Fabrice Monteiro,et al.  Smart Reliable Network-on-Chip , 2014, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[27]  Luca Benini,et al.  A distributed and topology-agnostic approach for on-line NoC testing , 2011, Proceedings of the Fifth ACM/IEEE International Symposium.

[28]  Luca Benini,et al.  At-Speed Distributed Functional Testing to Detect Logic and Delay Faults in NoCs , 2014, IEEE Transactions on Computers.

[29]  Wei Liu,et al.  Towards High-Speed Real-Time HTTP Traffic Analysis on the Tilera Many-Core Platform , 2013, 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing.

[30]  Sujit Dey,et al.  An Interconnect Architecture for Networking Systems on Chips , 2002, IEEE Micro.

[31]  Armin Runge FaFNoC: A Fault-tolerant and Bufferless Network-on-chip , 2015, FNC/MobiSPC.

[32]  Z. Navabi,et al.  An HDL-Based Platform for High Level NoC Switch Testing , 2007, 16th Asian Test Symposium (ATS 2007).

[33]  Dhiraj K. Pradhan,et al.  Introduction to Energy-Efficient Fault-Tolerant Systems , 2014 .

[34]  Armin Alaghi,et al.  Online NoC Switch Fault Detection and Diagnosis Using a High Level Fault Model , 2007, 22nd IEEE International Symposium on Defect and Fault-Tolerance in VLSI Systems (DFT 2007).

[35]  Daniele Ludovici,et al.  Cooperative Built-in Self-Testing and Self-Diagnosis of NoC Bisynchronous Channels , 2012, 2012 IEEE 6th International Symposium on Embedded Multicore SoCs.

[36]  Xiaoqing Wen,et al.  VLSI Test Principles and Architectures: Design for Testability (Systems on Silicon) , 2006 .

[37]  Ahmad Khademzadeh,et al.  Online-Structural Testing of Routers in Network on Chip , 2011 .

[38]  J. W. McPherson,et al.  Reliability challenges for 45nm and beyond , 2006, 2006 43rd ACM/IEEE Design Automation Conference.

[39]  Axel Jantsch,et al.  Design of Fault-Tolerant and Reliable Networks-on-Chip , 2015, 2015 IEEE Computer Society Annual Symposium on VLSI.

[40]  Sajjad Ahmad Madani,et al.  Congestion-aware core mapping for Network-on-Chip based systems using betweenness centrality , 2016, Future Gener. Comput. Syst..

[41]  Sanghamitra Roy,et al.  Dynamic Choke Sensing for Timing Error Resilience in NTC Systems , 2018, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[42]  Gang Luo,et al.  A test method of interconnection online detection of NoC based on 2D Torus topology , 2011, 2011 International Conference on Applied Superconductivity and Electromagnetic Devices.

[43]  Li-Shiuan Peh,et al.  ARIADNE: Agnostic Reconfiguration in a Disconnected Network Environment , 2011, 2011 International Conference on Parallel Architectures and Compilation Techniques.

[44]  Jorg Henkel,et al.  Agent-based distributed power management for kilo-core processors , 2013, ICCAD.

[45]  Kwang-Ting Cheng,et al.  Comprehensive online defect diagnosis in on-chip networks , 2012, 2012 IEEE 30th VLSI Test Symposium (VTS).

[46]  Dimitrios Soudris,et al.  Deploying and monitoring hadoop MapReduce analytics on single-chip cloud computer , 2016, PARMA-DITAM '16.

[47]  Nanning Zheng,et al.  Fault-tolerant routing for on-chip network without using virtual channels , 2014, 2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC).

[48]  Christof Teuscher,et al.  Design and analysis of heterogeneous nanoscale on-chip communication networks , 2013, Nano Commun. Networks.

[49]  Jeffrey D. Smith,et al.  Design and Analysis of Algorithms , 2009, Lecture Notes in Computer Science.

[50]  Santosh Biswas,et al.  A packet address driven test strategy for stuck-at faults in networks-on-chip interconnects , 2015, 2015 23rd Mediterranean Conference on Control and Automation (MED).

[51]  Marcelo Lubaszewski,et al.  Diagnosis of interconnect shorts in mesh NoCs , 2009, 2009 3rd ACM/IEEE International Symposium on Networks-on-Chip.

[52]  Santosh Biswas,et al.  On-line detection and diagnosis of stuck-at faults in channels of NoC-based systems , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[53]  Kees G. W. Goossens,et al.  Bringing communication networks on a chip: test and verification implications , 2003, IEEE Commun. Mag..

[54]  Santosh Biswas,et al.  Reliability-Aware Test Methodology for Detecting Short-Channel Faults in On-Chip Networks , 2018, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[55]  Axel Jantsch,et al.  Addressing Transient and Permanent Faults in NoC With Efficient Fault-Tolerant Deflection Router , 2013, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[56]  Jim Harkin,et al.  Low cost fault-tolerant routing algorithm for Networks-on-Chip , 2015, Microprocess. Microsystems.

[57]  Marcelo Lubaszewski,et al.  Functional Test of Mesh-Based NoCs with Deterministic Routing: Integrating the Test of Interconnects and Routers , 2011, J. Electron. Test..

[58]  Erik Jan Marinissen,et al.  A structured and scalable mechanism for test access to embedded reusable cores , 1998, Proceedings International Test Conference 1998 (IEEE Cat. No.98CH36270).

[59]  Krishnendu Chakrabarty,et al.  Test-time optimization in NOC-based manycore SOCs using multicast routing , 2014, 2014 IEEE 32nd VLSI Test Symposium (VTS).

[60]  Paul Ampadu,et al.  Transient and Permanent Error Control for High-End Multiprocessor Systems-on-Chip , 2012, 2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip.

[61]  Sudeep Pasricha,et al.  SWIFTNoC: A Reconfigurable Silicon-Photonic Network with Multicast-Enabled Channel Sharing for Multicore Architectures , 2017, ACM J. Emerg. Technol. Comput. Syst..

[62]  Santanu Chattopadhyay,et al.  Fault-Tolerant Dynamic Task Mapping and Scheduling for Network-on-Chip-Based Multicore Platform , 2017, ACM Trans. Embed. Comput. Syst..

[63]  Giovanni De Micheli,et al.  Design, synthesis, and test of networks on chips , 2005, IEEE Design & Test of Computers.

[64]  Dimiter R. Avresky,et al.  Performance evaluation of the ServerNet(R) SAN under self-similar traffic , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[65]  Huaxi Gu,et al.  Square-Octagon interconnection architecture for Network-on-chips , 2014, 2014 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC).