Machine Learning-Based Approach for Hardware Faults Prediction

Hardware failures are undesired but a common problem in circuits. Such failures are inherently due to the aging of circuitry or variation in circumstances. In critical systems, customers demand the system never to fail. Several self-healing and fault tolerance techniques have been proposed in the literature for recovering a circuitry from a fault. Such techniques come to the rescue when a fault has already occurred but they are typically uninformed about the possibility of an impending failure (i.e., fault prediction), which can be used as a pre-stage to fault tolerance and self-healing. This paper presents an approach to early fault prediction of circuits. The proposed method uses Fast Fourier Transform (FFT) to get the fault frequency signature, Principal Component Analysis (PCA) to get the most important data with reduced dimension, and Convolutional Neural Network (CNN) to learn and classify the fault. The proposed method is validated for working in different circuits by testing it using two circuits: comparator and amplifier. The comparator and amplifier are implemented using 45 nm technology on HSPICE to extract the failures dataset in terms of voltage, current, temperature, noise, and delay. This extracted data is used for training the proposed approach using Tensorflow. To the best of our knowledge, this is the first work of fault prediction at the transistor level for hardware system. The proposed approach considers aging, short-circuit, and open-circuit faults, and it provides a fault prediction accuracy of 98.93% and 98.91% for comparator and amplifier circuits, respectively. The proposed method is tested for two different circuits for its validation, and it consumes 1.08 W for Altera Arria 10 GX FPGA device.

[1]  Magdy A. Bayoumi,et al.  Flexible Self-Healing Router for Reliable and High-Performance Network-an-Chips Architecture , 2018, 2018 31st IEEE International System-on-Chip Conference (SOCC).

[2]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[3]  Rodrigo Possamai Bastos,et al.  Effects of Transient Faults in Integrated Circuits , 2020 .

[4]  Tao Wang,et al.  End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[5]  Klaas Bult,et al.  A 13-mW 64-dB SNDR 280-MS/s Pipelined ADC Using Linearized Integrating Amplifiers , 2018, IEEE Journal of Solid-State Circuits.

[6]  Georges G. E. Gielen,et al.  Transistor aging-induced degradation of analog circuits: Impact analysis and design guidelines , 2011, 2011 Proceedings of the ESSCIRC (ESSCIRC).

[7]  Xingwen Li,et al.  Series Arc Fault Identification for Photovoltaic System Based on Time-Domain and Time-Frequency-Domain Analysis , 2017, IEEE Journal of Photovoltaics.

[8]  Ashwani Kumar,et al.  Transistor level fault diagnosis in digital circuits using artificial neural network , 2016 .

[9]  Sarita V. Adve,et al.  The impact of technology scaling on lifetime reliability , 2004, International Conference on Dependable Systems and Networks, 2004.

[10]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[11]  T. Nigam,et al.  Accurate model for time-dependent dielectric breakdown of high-k metal gate stacks , 2009, 2009 IEEE International Reliability Physics Symposium.

[12]  H. P. Lee,et al.  PROPER ORTHOGONAL DECOMPOSITION AND ITS APPLICATIONS—PART I: THEORY , 2002 .

[13]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[14]  Huaguo Liang,et al.  Co-mitigating circuit PBTI and HCI aging considering NMOS transistor stacking effect , 2016, 2016 International Symposium on Integrated Circuits (ISIC).

[15]  Yide Wang,et al.  Fault diagnosis method based on FFT-RPCA-SVM for Cascaded-Multilevel Inverter. , 2016, ISA transactions.

[16]  Kasem Khalil,et al.  Intelligent Fault-Prediction Assisted Self-Healing for Embryonic Hardware , 2020, IEEE Transactions on Biomedical Circuits and Systems.

[17]  Kasem Khalil,et al.  Self-healing hardware systems: A review , 2019, Microelectron. J..

[18]  Magdy A. Bayoumi,et al.  An Efficient Approach for Neural Network Architecture , 2018, 2018 25th IEEE International Conference on Electronics, Circuits and Systems (ICECS).

[19]  J. Goette,et al.  Pseudo Asynchronous Level Crossing adc for ecg Signal Acquisition , 2017, IEEE Transactions on Biomedical Circuits and Systems.

[20]  G Jagadanand,et al.  Open-transistor Fault Detection and Diagnosis Based on Current Trajectory in a Two-level Voltage Source Inverter☆ , 2016 .

[21]  Georges G. E. Gielen,et al.  An analytical model for hot carrier degradation in nanoscale CMOS suitable for the simulation of degradation in analog IC applications , 2008, Microelectron. Reliab..

[22]  Magdy Bayoumi,et al.  A novel approach towards less area overhead self-healing hardware systems , 2017, 2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS).

[23]  Takayasu Sakurai,et al.  An 80 mV Startup Dual-Mode Boost Converter by Charge-Pumped Pulse Generator and Threshold Voltage Tuned Oscillator With Hot Carrier Injection , 2012, IEEE Journal of Solid-State Circuits.

[24]  Kasem Khalil,et al.  A low propagation delay dispersion comparator for low cost level-crossing ADCs , 2013, 2013 8th IEEE Design and Test Symposium.

[25]  Xiaoqing Wen,et al.  On estimation of NBTI-Induced delay degradation , 2010, 2010 15th IEEE European Test Symposium.

[26]  Jens Lienig,et al.  introduction to electromigration-aware physical design , 2006, ISPD '06.

[27]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[28]  Georges G. E. Gielen,et al.  Emerging Yield and Reliability Challenges in Nanometer CMOS Technologies , 2008, 2008 Design, Automation and Test in Europe.

[29]  Prabhat,et al.  Artificial Neural Network , 2018, Encyclopedia of GIS.

[30]  Mamun Bin Ibne Reaz,et al.  CMOS Instrumentation Amplifier with Offset Cancellation Circuitry for Biomedical Application , 2006 .

[31]  C. Saritha,et al.  ECG Signal Analysis Using Wavelet Transforms , 2008 .

[32]  Martin Vetterli,et al.  Fast Fourier transforms: a tutorial review and a state of the art , 1990 .

[33]  Janusz Nieznanski,et al.  Open-Transistor Fault Diagnostics in Voltage-Source Inverters by Analyzing the Load Currents , 2007, IEEE Transactions on Industrial Electronics.

[34]  Magdy A. Bayoumi,et al.  A Cost-Effective Self-Healing Approach for Reliable Hardware Systems , 2018, 2018 IEEE International Symposium on Circuits and Systems (ISCAS).

[35]  Magdy Bayoumi,et al.  Economic LSTM Approach for Recurrent Neural Networks , 2019, IEEE Transactions on Circuits and Systems II: Express Briefs.

[36]  Juha Kostamovaara,et al.  A low propagation delay dispersion comparator for a level-crossing AD converter , 2009 .