Implementation of middleware fault tolerance support for real-time embedded applications

Critical real-time embedded systems need to apply fault tolerance strategies to deal with operation time errors, either in hardware or software. In this paper we present the ongoing work to provide application fault tolerance by means of implementing middleware transparent support over the BOSS embedded operating system. The middleware uses a publishersubscriber protocol and enables the execution of several fault tolerance strategies with minimum burden to the application level software.

[1]  Liming Chen,et al.  N-VERSION PROGRAMMINC: A FAULT-TOLERANCE APPROACH TO RELlABlLlTY OF SOFTWARE OPERATlON , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing, 1995, ' Highlights from Twenty-Five Years'..

[2]  Dhiraj K. Pradhan,et al.  Fault-tolerant computer system design , 1996 .

[3]  Brian Randell,et al.  System structure for software fault tolerance , 1975, IEEE Transactions on Software Engineering.

[4]  Sergio Montenegro,et al.  BOSS/EVERCONTROL OS / Middleware Target Ultra High Dependability , 2005 .

[5]  K. H. Kim,et al.  An efficient middleware architecture supporting time-triggered message-triggered objects and an NT-based implementation , 1999, Proceedings 2nd IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC'99) (Cat. No.99-61702).

[6]  K. H. Kim,et al.  Distributed Execution of Recovery Blocks: An Approach for Uniform Treatment of Hardware and Software Faults in Real-Time Applications , 1989, IEEE Trans. Computers.

[7]  Rami G. Melhem,et al.  Implementation of a transient-fault-tolerance scheme on DEOS-a technology transfer from an academic system to an industrial system , 1999, Proceedings of the Fifth IEEE Real-Time Technology and Applications Symposium.

[8]  Brian Randell,et al.  Fundamental Concepts of Dependability , 2000 .

[9]  K. H. Kim ROAFTS: a middleware architecture for real-time object-oriented adaptive fault tolerance support , 1998, Proceedings Third IEEE International High-Assurance Systems Engineering Symposium (Cat. No.98EX231).