论文信息 - A Reliability Study on CNNs for Critical Embedded Systems

A Reliability Study on CNNs for Critical Embedded Systems

Deep learning systems such as Convolutional Neural Networks (CNNs) have shown remarkable efficiency in dealing with a variety of complex real life problems. To accelerate the execution of these heavy algorithms, a plethora of software implementations and hardware accelerators have been proposed. In a context of shrinking devices dimensions, reliability issues of CNN-hosting systems are under-explored. In this paper, we experimentally evaluate the inherent fault tolerance of CNNs by injecting errors within network modules, namely processing elements and memories. Our experiments demonstrate a non uniform sensitivity between different parts of the system. While CNNs are relatively resilient to errors occurring in processing elements, transient faults hitting memories lead to catastrophic degradation of accuracy.

[1] Gerd Ascheid,et al. Accurate neuron resilience prediction for a flexible reliability management in neural network accelerators , 2018, 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[2] James L. Walsh,et al. IBM experiments in soft fails in computer electronics (1978-1994) , 1996, IBM J. Res. Dev..

[3] Fei Qiao,et al. Evaluating Data Resilience in CNNs from an Approximate Memory Perspective , 2017, ACM Great Lakes Symposium on VLSI.

[4] Road vehicles — Functional safety — Part 10 : Guideline , 2009 .

[5] Sachin S. Talathi,et al. Fixed Point Quantization of Deep Convolutional Networks , 2015, ICML.

[6] B. L. Bhuva,et al. Impact of Technology Scaling on SRAM Soft Error Rates , 2014, IEEE Transactions on Nuclear Science.

[7] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.