Invited paper: System-wide fault management based on IEEE P1687 IJTAG

Fault tolerance and fault management mechanisms are necessary means to reduce the impact of soft errors and wear out in electronic devices. The semiconductor products manufactured with latest and emerging processes are increasingly affected by these effects. The paper describes a new general scalable fault management architecture based on the latest upcoming DFT standard IEEE P1687 IJTAG. The standard allows to create an efficient and regular network for handling fault detection information as well as to manage test and system resources as a system-wide background process during the system operation.

[1]  Janusz Sosnowski,et al.  Transient fault tolerance in digital systems , 1994, IEEE Micro.

[2]  H. Kopetz,et al.  The Evolution of Fault-Tolerant Computing , 1987, Dependable Computing and Fault-Tolerant Systems.

[3]  G. Poncelin,et al.  Development of a design-for-reliability method for complex systems , 2008, 2008 Annual Reliability and Maintainability Symposium.

[4]  Erik G. Larsson,et al.  Optimizing Fault Tolerance for Multi-Processor System-on-Chip , 2010 .

[5]  Jeff Rearick,et al.  IEEE P1687: Toward Standardized Access of Embedded Instrumentation , 2006, 2006 IEEE International Test Conference.

[6]  Ming Zhang,et al.  Design for Resilience to Soft Errors and Variations , 2007, 13th IEEE International On-Line Testing Symposium (IOLTS 2007).

[7]  斉藤 万裕 15.2.Design for Manufacturing(15.設計工学・システム)( 機械工学年鑑) , 2003 .

[8]  Andrew B. Kahng Design for manufacturability: Then and now , 2011, IEEE Design & Test of Computers.

[9]  Andrea Bondavalli,et al.  Threshold-Based Mechanisms to Discriminate Transient from Intermittent Faults , 2000, IEEE Trans. Computers.