Fault-Tolerance with Graceful Degradation in Quality: A Design Methodology and its Application to Digital Signal Processing Systems

The tremendous increase in device density of present day designs is accompanied by a corresponding increase in transistor failures (hard faults), posing a major challenge to current fault tolerant techniques and tools. We propose a novel "design-time" fault-tolerance methodology at architecture/circuit levels to improve the reliability of applications, where it is possible to classify computations into two categories- (i) those which contribute to quality degradation and, (ii) those which result in total system failure. The proposed scheme enhances system reliability by making appropriate trade-offs between area, output quality (signal to noise ratio or mean square error), and fault tolerance. This low-overhead generic methodology is suitable not only for scaled CMOS technologies, but is also applicable to future nanotechnologies (carbon nanotubes etc.) as well, where such defects are expected to be prevalent. We evaluated this technique on a widely used DSP system - Finite Impulse Response (FIR) filters (where minor degradation in quality can be tolerated). Results show that our technique achieves an improvement between 73.4%-450% (in terms of total system failure probability under iso-redundancy) compared to conventional fault tolerance techniques.

[1]  Chiang-Ju Chien,et al.  A novel common-subexpression-elimination method for synthesizing fixed-point FIR filters , 2004, IEEE Trans. Circuits Syst. I Regul. Pap..

[2]  S. Goldstein,et al.  Scalable Defect Tolerance for Molecular Electronics , 2002 .

[3]  D. Bhaduri,et al.  Using quantum model of computation for reliability evaluation of defect tolerant nano-architectures , 2004, 4th IEEE Conference on Nanotechnology, 2004..

[4]  Y. Lim,et al.  Discrete coefficient FIR digital filter design based upon an LMS criteria , 1983 .

[5]  Kaushik Roy,et al.  Enhancing yield at the end of the technology roadmap , 2004, IEEE Design & Test of Computers.

[6]  Mehdi Baradaran Tahoori,et al.  On the defect tolerance of nano-scale two-dimensional crossbars , 2004, 19th IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems, 2004. DFT 2004. Proceedings..

[7]  R. Ramaswami,et al.  Book Review: Design and Analysis of Fault-Tolerant Digital Systems , 1990 .

[8]  Seth Copen Goldstein,et al.  NanoFabrics: spatial computing using molecular electronics , 2001, Proceedings 28th Annual International Symposium on Computer Architecture.

[9]  Seth Copen Goldstein,et al.  Molecular electronics: from devices and interconnect to circuits and architecture , 2003, Proc. IEEE.

[10]  John G. Proakis,et al.  Digital Signal Processing: Principles, Algorithms, and Applications , 1992 .

[11]  Mehdi Baradaran Tahoori,et al.  Defects and faults in quantum cellular automata at nano scale , 2004, 22nd IEEE VLSI Test Symposium, 2004. Proceedings..

[12]  Gustavo de Veciana,et al.  Defect tolerant probabilistic design paradigm for nanotechnologies , 2004, Proceedings. 41st Design Automation Conference, 2004..

[13]  Barry W. Johnson Design & analysis of fault tolerant digital systems , 1988 .

[14]  John G. Proakis,et al.  Digital signal processing (3rd ed.): principles, algorithms, and applications , 1996 .