Performance and Accuracy in Soft-Error Resilience Evaluation using the Multi-Level Processor Simulator ETISS-ML

Soft errors are a major safety concern in many devices, e.g., in automotive, industrial, control or medical applications. Ideally, safety-critical systems should be resilient against the impact of soft errors, but at a low cost. This requires to evaluate the soft error resilience, which is typically done by extensive fault injection. In this paper, we present ETISS-ML, a multi-level processor simulator, which manages to achieve both accuracy and performance for fault simulation by intelligently switching the level of abstraction between an Instruction Set Simulator (ISS) and an RTL simulator. For a given software testcase and fault scenario, the software is first executed in ISS-mode until shortly before the fault injection. Then ETISS-ML switches to RTL-mode for accurate fault simulation. Whenever the impact of the fault is propagated completely out of the processor's micro-architecture, the simulation can switch back to ISS-mode. This paper describes the methods needed to preserve accuracy during both of these switches. Experimental results show that ETISS-ML obtains near to ISS performance with RTL accuracy. It is also shown that ETISS-ML can be used as the processor model in SystemC / TLM virtual prototypes (VPs) and, hence, allows to investigate the impact of soft errors at system level.

[1]  Sheldon X.-D. Tan,et al.  Multi-Physics-based FEM Analysis for Post-voiding Analysis of Electromigration Failure Effects , 2018, 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[2]  Surya P. N. Singh,et al.  V-REP: A versatile and scalable robot simulation framework , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Sanjay J. Patel,et al.  Examining ACE analysis reliability estimates using fault-injection , 2007, ISCA '07.

[4]  Ulf Schlichtmann,et al.  ETISS-ML: A multi-level instruction set simulator with RTL-level fault injection support for the evaluation of cross-layer resiliency techniques , 2018, 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[5]  Norbert Wehn,et al.  A Cross-Layer Technology-Based Study of How Memory Errors Impact System Resilience , 2013, IEEE Micro.

[6]  Andrea Höller,et al.  A Virtual Fault Injection Framework for Reliability-Aware Software Development , 2015, 2015 IEEE International Conference on Dependable Systems and Networks Workshops.

[7]  Sarita V. Adve,et al.  Accurate microarchitecture-level fault modeling for studying hardware faults , 2009, 2009 IEEE 15th International Symposium on High Performance Computer Architecture.

[8]  Eric Cheng,et al.  CLEAR: Cross-layer exploration for architecting resilience: Combining hardware and software techniques to tolerate soft errors in processor cores , 2016, 2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC).

[9]  Mehdi Baradaran Tahoori,et al.  Fault injection acceleration by architectural importance sampling , 2015, 2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS).

[10]  Lixin Zhang,et al.  Mambo: a full system simulator for the PowerPC architecture , 2004, PERV.

[11]  Jörg Henkel,et al.  Estimating and Optimizing BTl Aging Effects: From Physics to CAD , 2018, 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[12]  Ulf Schlichtmann,et al.  The Extendable Translating Instruction Set Simulator (ETISS) Interlinked with an MDA Framework for Fast RISC Prototyping , 2017, 2017 International Symposium on Rapid System Prototyping (RSP).

[13]  Hidetoshi Onodera,et al.  PVT2: Process, Voltage, Temperature and Time-dependent Variability in Scaled CMOS Process , 2018, 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[14]  Jacob A. Abraham,et al.  Quantitative evaluation of soft error injection techniques for robust system design , 2013, 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC).

[15]  Jaume Abella,et al.  Analysis and RTL correlation of instruction set simulators for automotive microcontroller robustness verification , 2015, 2015 52nd ACM/EDAC/IEEE Design Automation Conference (DAC).