Torus Topology based Fault-Tolerant Network-on-Chip Design with Flexible Spare Core Placement

Network-on-Chip (NoC) has been proposed as a promising solution to overcome the communication challenges of System-on-Chip (SoC) design in nanoscale technologies. With the advancement in the nanoscale technology, the integration density of Intellectual Property (IP) cores in a single chip have increased, leading to heat dissipation, which in turn makes the system unreliable. Therefore, efficient fault-tolerant methods are necessary at different levels to improve overall system performance and make the system to operate normally. This article presents a flexible spare core placement technique for mesh-based NoC by taking several benchmark applications into consideration. An Integer Linear Programming (ILP)-based solution has been proposed for the spare core placement problem. Also, Particle Swarm Optimisation (PSO)-based meta-heuristic has been proposed for the same. Experiments have been performed by taking several application benchmarks reported in the literature and the applications generated using the TGFF tool. Comparisons have been carried out using our approach and the approach followed in the literature (i) by varying the network size with fixed fault percentage in the network, and (ii) by fixing the network size while varying the percentage of faults in the network. We have also compared the overall communication cost and CPU runtime between ILP and PSO approaches. The results show significant reductions in the overall communication cost, average network latency, and network power consumption across all the cases using our approach over the approaches reported in the literature.

[1]  Hannu Tenhunen,et al.  A low-cost processing element recovery mechanism for fault tolerant Networks-on-Chip , 2011, 2011 NORCHIP.

[2]  Radu Marculescu,et al.  FARM: Fault-aware resource management in NoC-based multiprocessor platforms , 2011, 2011 Design, Automation & Test in Europe.

[3]  Santanu Chattopadhyay,et al.  Network-on-chip architecture design based on mesh-of-tree deterministic routing topology , 2008, Int. J. High Perform. Syst. Archit..

[4]  Andrew B. Kahng,et al.  ORION 2.0: A fast and accurate NoC power and area model for early-stage design space exploration , 2009, 2009 Design, Automation & Test in Europe Conference & Exhibition.

[5]  Chita R. Das,et al.  Exploring Fault-Tolerant Network-on-Chip Architectures , 2006, International Conference on Dependable Systems and Networks (DSN'06).

[6]  Axel Jantsch,et al.  Methods for fault tolerance in networks-on-chip , 2013, CSUR.

[7]  Jim Harkin,et al.  Fault-Tolerant Networks-on-Chip Routing With Coarse and Fine-Grained Look-Ahead , 2016, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[8]  Wayne H. Wolf,et al.  TGFF: task graphs for free , 1998, Proceedings of the Sixth International Workshop on Hardware/Software Codesign. (CODES/CASHE'98).

[9]  Zhao Hai,et al.  Convergence Analysis of a Dynamic Discrete PSO Algorithm , 2008, 2008 First International Conference on Intelligent Networks and Intelligent Systems.

[10]  Santanu Chattopadhyay,et al.  Thermal Uniformity-Aware Application Mapping for Network-on-Chip Design , 2014 .

[11]  Midia Reshadi,et al.  A fault tolerant approach for application-specific Network-on-Chip , 2013, 2013 NORCHIP.

[12]  Santanu Chattopadhyay,et al.  Design and evaluation of Mesh-of-Tree based Network-on-Chip using virtual channel router , 2012, Microprocess. Microsystems.

[13]  Masoud Daneshtalab,et al.  Fault-tolerant 3-D network-on-chip design using dynamic link sharing , 2016, 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[14]  Pasi Liljeberg,et al.  Multi network interface architectures for fault tolerant Network-on-Chip , 2009, 2009 International Symposium on Signals, Circuits and Systems.

[15]  Hamid R. Zarandi,et al.  A Fault-Tolerant Low-Energy Multi-Application Mapping onto NoC-based Multiprocessors , 2012, 2012 IEEE 15th International Conference on Computational Science and Engineering.

[16]  Ashish Sharma,et al.  Multi-Application Network-on-Chip Design using Global Mapping and Local Reconfiguration , 2014, TRETS.

[17]  Charles Addo-Quaye,et al.  Thermal-aware mapping and placement for 3-D NoC designs , 2005, Proceedings 2005 IEEE International SOC Conference.

[18]  Reiner Kolla,et al.  Using benes networks at fault-tolerant and deflection routing based network-on-chips , 2016, 2016 Tenth IEEE/ACM International Symposium on Networks-on-Chip (NOCS).

[19]  Santanu Chattopadhyay,et al.  Low Power Low Latency Floorplan‐aware Path Synthesis in Application-Specific Network-on-Chip Design , 2017, Integr..

[20]  Kumar Y. B. Nithin,et al.  A Gracefully Degrading and Energy-Efficient Fault Tolerant NoC Using Spare Core , 2016, 2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI).

[21]  Ruilian Xie,et al.  RFRA: Reconfigurable and Fault-tolerant Routing Algorithm without virtual channels for 2D network-on-chip , 2016, 2016 13th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT).

[22]  Gerhard Fettweis,et al.  Fault Tolerant Deadlock-Free Adaptive Routing Algorithms for Hexagonal Networks-on-Chip , 2016, 2016 Euromicro Conference on Digital System Design (DSD).

[23]  Ahmed Louri,et al.  QORE: A fault tolerant network-on-chip architecture with power-efficient quad-function channel (QFC) buffers , 2014, 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA).

[24]  Chi-Ying Tsui,et al.  A fault-tolerant network-on-chip design using dynamic reconfiguration of partial-faulty routing resources , 2011, 2011 IEEE/IFIP 19th International Conference on VLSI and System-on-Chip.

[25]  Pasi Liljeberg,et al.  Enhanced fault-tolerant Network-on-Chip architecture using hierarchical agents , 2013, 2013 IEEE 16th International Symposium on Design and Diagnostics of Electronic Circuits & Systems (DDECS).

[26]  Luca Benini,et al.  Networks on Chips : A New SoC Paradigm , 2022 .

[27]  En-Jui Chang,et al.  Path-Diversity-Aware Fault-Tolerant Routing Algorithm for Network-on-Chip Systems , 2017, IEEE Transactions on Parallel and Distributed Systems.

[28]  Wilm E. Donath Complexity Theory and Design Automation , 1980, 17th Design Automation Conference.

[29]  Sudeep Pasricha,et al.  CHARM: A checkpoint-based resource management framework for reliable multicore computing in the dark silicon era , 2016, 2016 IEEE 34th International Conference on Computer Design (ICCD).

[30]  Onur Derin,et al.  Online task remapping strategies for fault-tolerant Network-on-Chip multiprocessors , 2011, Proceedings of the Fifth ACM/IEEE International Symposium.

[31]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[32]  W. Dally,et al.  Route packets, not wires: on-chip interconnection networks , 2001, Proceedings of the 38th Design Automation Conference (IEEE Cat. No.01CH37232).

[33]  Jaan Raik,et al.  Logic-based implementation of fault-tolerant routing in 3D network-on-chips , 2016, 2016 Tenth IEEE/ACM International Symposium on Networks-on-Chip (NOCS).

[34]  Jinfu Xu,et al.  An unified online fault-tolerant mechanism for FIFO faults in network-on-chip router , 2016, 2016 13th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT).

[35]  Hideo Fujiwara,et al.  A unified test and fault-tolerant multicast solution for network-on-chip designs , 2016, 2016 IEEE International Test Conference (ITC).

[36]  Nacer-Eddine Zergainoh,et al.  Addressing transient routing errors in fault-tolerant Networks-on-Chips , 2016, 2016 21th IEEE European Test Symposium (ETS).

[37]  Sudeep Pasricha,et al.  ARTEMIS: An Aging-Aware Runtime Application Mapping Framework for 3D NoC-Based Chip Multiprocessors , 2017, IEEE Transactions on Multi-Scale Computing Systems.

[38]  Masaru Fukushi,et al.  A fault-tolerant routing method for 2D-mesh Network-on-Chips based on components of a router , 2016, 2016 IEEE 5th Global Conference on Consumer Electronics.

[39]  Jari Nurmi,et al.  Fault-tolerant 2-D Mesh Network-On-Chip for MultiProcessor Systems-on-Chip , 2006, 2006 IEEE Design and Diagnostics of Electronic Circuits and systems.

[40]  Martin Radetzki,et al.  Fault Tolerant Network on Chip Switching With Graceful Performance Degradation , 2010, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[41]  Terrence Mak,et al.  A Survey of Emerging Interconnects for On-Chip Efficient Multicast and Broadcast in Many-Cores , 2016, IEEE Circuits and Systems Magazine.

[42]  Jeffrey T. Draper,et al.  Performance Evaluation of Probe-Send Fault-tolerant Network-on-chip Router , 2007, 2007 IEEE International Conf. on Application-specific Systems, Architectures and Processors (ASAP).

[43]  Hamid R. Zarandi,et al.  A fault-tolerant core mapping technique in networks-on-chip , 2013, IET Comput. Digit. Tech..

[44]  Santanu Chattopadhyay,et al.  A survey on application mapping strategies for Network-on-Chip design , 2013, J. Syst. Archit..