A fault-tolerant scheduler for distributed real-time systems

A distributed algorithm for providing fault-tolerant optimal scheduling in a simply periodic distributed real-time system based on the deadline mechanism is proposed. Each job is serviced by a primary algorithm or by an alternative algorithm. The primary algorithm provides a desirable service that may not satisfy the timing constraints, whereas an alternate algorithm provides acceptable service, while always satisfying the timing constraints. After using the deadline mechanism for scheduling the jobs on individual nodes, the proposed algorithm is invoked to schedule additional primaries on the other nodes. Since primaries provide accurate results, the distributed scheduling algorithm provides better results without disturbing the timing accuracy. The proposed distributed scheduling algorithm was tested for various randomly generated data.<<ETX>>

[1]  R. H. Campbell,et al.  A fault-tolerant scheduling problem , 1989, IEEE Transactions on Software Engineering.

[2]  P. M. Melliar-Smith,et al.  A program structure for error detection and recovery , 1974, Symposium on Operating Systems.

[3]  S. K. Shrivastava,et al.  Sequential pascal with recovery blocks , 1978, Softw. Pract. Exp..

[4]  K. H. Kim,et al.  An Approach to Experimental Evaluation of Real-Time Fault-Tolerant Distributed Computing Schemes , 1989, IEEE Trans. Software Eng..

[5]  Hermann Kopetz,et al.  Fault tolerance, principles and practice , 1990 .

[6]  Aviziens Fault-Tolerant Systems , 1976, IEEE Transactions on Computers.

[7]  K. H. Kim,et al.  Distributed Execution of Recovery Blocks: An Approach for Uniform Treatment of Hardware and Software Faults in Real-Time Applications , 1989, IEEE Trans. Computers.

[8]  K. H. Kim,et al.  Designing fault tolerance capabilities into real-time distributed computer systems , 1988, [1988] Proceedings. Workshop on the Future Trends of Distributed Computing Systems in the 1990s.

[9]  Roy H. Campbell,et al.  APPLICATION OF THE FAULT-TOLERANT DEADLINE MECHANISM TO A SATELLITE ON-BOARD COMPUTER SYSTEM. , 1980 .

[10]  H. Hecht,et al.  Fault-Tolerant Software for Real-Time Applications , 1976, CSUR.

[11]  Geneva G. Belford,et al.  SIMULATIONS OF A FAULT-TOLERANT DEADLINE MECHANISM. , 1979 .

[12]  John C. Knight,et al.  A Framework for Software Fault Tolerance in Real-Time Systems , 1983, IEEE Transactions on Software Engineering.