On the feasibility of handling manufacturing faults in embedded memories by software means

This paper presents a software-based technique for handling permanent faults in low-cost memories for embedded systems. Basically the technique is based on an adaptation of the program code that avoids the usage of faulty memory words. With this technique it is possible to handle permanent manufacturing faults in simple and cheap memories without any kind of hardware-based fault management in the memory. This is of particular interest for long living low-cost embedded systems like sensor nodes that make use of such memories. Different methods for handling memory faults in data and program memory are discussed. The methods for program memories are evaluated with flash memory devices. Based on real test data for the flash memories it is shown that with the software-based technique a significant amount of hardware-overhead in the memory can be avoided without losing the capabilities of handling memory faults.

[1]  Mario Schölzel,et al.  Fine-Grained Software-Based Self-Repair of VLIW Processors , 2011, 2011 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems.

[2]  Yu Cao,et al.  Guest Editors' Introduction: Reliability Challenges in Nano-CMOS Design , 2009, IEEE Des. Test Comput..

[3]  Kiamal Z. Pekmestzi,et al.  A BISR Architecture for Embedded Memories , 2008, 2008 14th IEEE International On-Line Testing Symposium.

[4]  Sandeep K. Gupta,et al.  A novel software-based defect-tolerance approach for application-specific embedded systems , 2011, 2011 IEEE 29th International Conference on Computer Design (ICCD).

[5]  Cheng-Wen Wu,et al.  Economic Aspects of Memory Built-in Self-Repair , 2007, IEEE Design & Test of Computers.

[6]  Cheng-Wen Wu,et al.  A built-in self-repair scheme for NOR-type flash memory , 2006, 24th IEEE VLSI Test Symposium.

[7]  Patryk Skoncej WBR - word- and block-level hard error repair for memories , 2013, 2012 12th Annual Non-Volatile Memory Technology Symposium Proceedings.

[8]  Marco Torchiano,et al.  Soft-error detection through software fault-tolerance techniques , 1999, Proceedings 1999 IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems (EFT'99).

[9]  Jin-Fu Li,et al.  A built-in self-repair scheme for semiconductor memories with 2-d redundancy , 2003, International Test Conference, 2003. Proceedings. ITC 2003..

[10]  Hsie-Chia Chang,et al.  A fully parallel BCH codec with double error correcting capability for NOR flash applications , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Sebastian Müller,et al.  Towards a Graceful Degradable Multicore-System by Hierarchical Handling of Hard Errors , 2013, 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[12]  Cheng-Wen Wu,et al.  Built-In Self-Repair Schemes for Flash Memories , 2010, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[13]  Edward J. McCluskey,et al.  Control-flow checking by software signatures , 2002, IEEE Trans. Reliab..

[14]  P.P. Ankolekar,et al.  Multibit Error-Correction Methods for Latency-Constrained Flash Memory Systems , 2008, IEEE Transactions on Device and Materials Reliability.

[15]  Mario Scholzel Software-based self-repair of statically scheduled superscalar data paths , 2010 .

[16]  Massimo Violante,et al.  Soft-error detection using control flow assertions , 2003, Proceedings 18th IEEE Symposium on Defect and Fault Tolerance in VLSI Systems.