Single Event Upset: An Embedded Tutorial

With the continuous downscaling of CMOS technologies, the reliability has become a major bottleneck in the evolution of the next generation systems. Technology trends such as transistor down-sizing, use of new materials, and system on chip architectures continue to increase the sensitivity of systems to soft errors. These errors are random and not related to permanent hardware faults. Their causes may be internal (e.g., interconnect coupling) or external (e.g., cosmic radiation). To meet the system reliability requirements it is necessary for both the circuit designers and test engineers to get the basic knowledge of the soft errors. We present a tutorial study of the radiation-induced single event upset phenomenon caused by external radiation, which is a major source of soft errors. We summarize basic radiation mechanisms and the resulting soft errors in silicon. Soft error mitigation techniques with time and space redundancy are illustrated. An industrial design example, the IBM z990 system, shows how the industry is dealing with soft errors these days.

[1]  James L. Walsh,et al.  IBM experiments in soft fails in computer electronics (1978-1994) , 1996, IBM J. Res. Dev..

[2]  Ravishankar K. Iyer,et al.  Analog-digital simulation of transient-induced logic errors and upset susceptibility of an advanced control system , 1990 .

[3]  G. C. Messenger,et al.  Collection of Charge on Junction Nodes from Ion Tracks , 1982, IEEE Transactions on Nuclear Science.

[4]  P.N. Sanda,et al.  IBM z990 soft error detection and recovery , 2005, IEEE Transactions on Device and Materials Reliability.

[5]  Rajesh Raina Is the concern for soft-error overblown? , 2005, IEEE International Conference on Test, 2005..

[6]  J. Neumann Probabilistic Logic and the Synthesis of Reliable Organisms from Unreliable Components , 1956 .

[7]  Naresh R. Shanbhag,et al.  Sequential Element Design With Built-In Soft Error Resilience , 2006, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[8]  Sandip Kundu Is the concern for soft-error overblown? , 2005 .

[9]  G. C. Messenger,et al.  Single Event Phenomena , 1997 .

[10]  Ming Zhang,et al.  Combinational Logic Soft Error Correction , 2006, 2006 IEEE International Test Conference.

[11]  T. May,et al.  A New Physical Mechanism for Soft Errors in Dynamic Memories , 1978, 16th International Reliability Physics Symposium.

[12]  R. Baumann Soft errors in advanced semiconductor devices-part I: the three radiation sources , 2001 .

[13]  J. Ziegler,et al.  Effect of Cosmic Rays on Computer Memories , 1979, Science.

[14]  J. von Neumann,et al.  Probabilistic Logic and the Synthesis of Reliable Organisms from Unreliable Components , 1956 .

[15]  K.A. LaBel,et al.  Commercial microelectronics technologies for applications in the satellite radiation environment , 1996, 1996 IEEE Aerospace Applications Conference. Proceedings.

[16]  R. C. Baumann,et al.  Soft errors in commercial integrated circuits , 2004 .

[17]  C. Metra,et al.  A model for transient fault propagation in combinatorial logic , 2003, 9th IEEE On-Line Testing Symposium, 2003. IOLTS 2003..

[18]  O. Musseau Single-event effects in SOI technologies and devices , 1996 .

[19]  E. Simoen,et al.  Radiation Effects in Advanced Semiconductor Materials and Devices , 2002 .

[20]  Algirdas Avizienis Faulty-Tolerant Computing: An Overview , 1971, Computer.

[21]  E. A. Wolicki,et al.  Single Event Upset of Dynamic Rams by Neutrons and Protons , 1979, IEEE Transactions on Nuclear Science.

[22]  Kartik Mohanram,et al.  Gate sizing to radiation harden combinational logic , 2006, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[23]  M. Nicolaidis,et al.  Design for soft error mitigation , 2005, IEEE Transactions on Device and Materials Reliability.

[24]  David Burnett,et al.  Soft-error-rate improvement in advanced BiCMOS SRAMs , 1993, 31st Annual Proceedings Reliability Physics 1993.

[25]  A. J. van de Goor,et al.  Testing Semiconductor Memories: Theory and Practice , 1998 .

[26]  N. Seifert,et al.  Introduction to the Special Issue on Soft Errors and Data Integrity in Terrestrial Computer Systems , 2005 .

[27]  S Kundu,et al.  Is the concern for soft errors overblown , 2006 .

[28]  C. Detcheverry,et al.  SEU critical charge and sensitive area in a submicron CMOS technology , 1997 .

[29]  Effects of Neutrons on Programmable Logic , 2002 .

[30]  S. M. Marcus,et al.  Minimum Size and Maximum Packing Density of Nonredundant Semiconductor Devices , 1962, Proceedings of the IRE.

[31]  Robert Baumann,et al.  Soft errors in advanced computer systems , 2005, IEEE Design & Test of Computers.