In the space radiation environment, a large number of cosmic rays often lead to transient faults on the on-board computer. These transient faults result in data flow errors or control flow errors during program running. The present software implemented hardware fault tolerant technology mainly uses the signature analysis method to realize the control flow checking, namely, through assigning signature for each basic block and inserting some instructions into every basic block to realize the control flow checking. Because the size of different basic blocks in one program usually exist obvious difference, applying unified checking method for these basic blocks will reduce the protection efficiency. To solve this problem, this paper has proposed a control flow checking optimization method named RCFG by reconstructing control flow graph. RCFG firstly merges basic blocks into larger logic blocks, then cuts the logic blocks into basic logic blocks with similar size. At last, control flow detection algorithm can be applied based on the control flow graph composed with the basic logic blocks. RCFG can effectively improve the protection efficiency of algorithm, and user can regulate the balance between performance and reliability by configuring the size of basic logic block. This paper has finished the fault injection experiment for a typical signature analysis algorithm named CFCSS. According to the experiment result, compared with the original CFCSS algorithm, the average performance expense of the CFCSS algorithm implemented based on RCFG increased by 16.6%, and the average memory expense increased by 13.5%, but the number of the faults resulting in the program outputting wrong result reduced by 47.67% equally.
[1]
Todd M. Austin,et al.
The SimpleScalar tool set, version 2.0
,
1997,
CARN.
[2]
Wang Ke-hui.
Partition method of control flow checking-based low-powered basic block
,
2007
.
[3]
Mahdi Fazeli,et al.
Transient Error Detection in Embedded Systems Using Reconfigurable Components
,
2006,
2006 International Symposium on Industrial Embedded Systems.
[4]
Massimo Violante,et al.
Software-Implemented Hardware Fault Tolerance
,
2010
.
[5]
Edward J. McCluskey,et al.
Concurrent Error Detection Using Watchdog Processors - A Survey
,
1988,
IEEE Trans. Computers.
[6]
David J. Lu.
Watchdog Processors and Structural Integrity Checking
,
1982,
IEEE Transactions on Computers.
[7]
Lorenzo Alvisi,et al.
Modeling the effect of technology trends on the soft error rate of combinational logic
,
2002,
Proceedings International Conference on Dependable Systems and Networks.
[8]
Edward J. McCluskey,et al.
Control-flow checking by software signatures
,
2002,
IEEE Trans. Reliab..
[9]
Robert E. Lyons,et al.
The Use of Triple-Modular Redundancy to Improve Computer Reliability
,
1962,
IBM J. Res. Dev..