Distributed real-time task monitoring in the safety-critical system Melody

Safety-critical systems typically operate in unpredictable environments. Requirements for safety and reliability are in conflict with those for real-time responsiveness. Due to unpredictable environmental needs there is no static trade-off between measures to accommodate the conflicting objectives. Instead every feature or operating system service has to be adaptive. The distributed operating system MELODY was developed for supporting safety-critical applications under such constraints. In earlier phases real-time monitoring features like file monitoring and integration control had been implemented. This paper focuses on distributed task monitoring. In order to as early as possible abort task instances which cannot meet their deadline sub-deadlines for their location, allocation, locking phases are derived. These are used for creating three different monitoring models in which task instances would be aborted whenever they are about to miss the sub-deadline corresponding to the model. Extensive distributed experiments are reported in which a variety of refined models are compared to a Base model. Despite the considerable overhead for task monitoring the results of employing the distributed task monitors are very favorable, with respect to minimizing the total number of deadline failures as well as regarding survivability.

[1]  Horst F. Wedde,et al.  Scheduling critical and sensitiue tasks with remote requests in mission-critical systems , 1992, Fourth Euromicro workshop on Real-Time Systems.

[2]  Kang G. Shin,et al.  Application of real-time monitoring to scheduling tasks with random execution times , 1989, [1989] Proceedings. Real-Time Systems Symposium.

[3]  Horst F. Wedde,et al.  Efficient Distributed Resource Scheduling for Adaptive Real-Time Operation Support , 1991, ICCI.

[4]  Horst F. Wedde,et al.  A Completely Integrated Approach to Developing, Implementing, Evaluating Distributed Active Database Management and its OS Support , 1999 .

[5]  Horst F. Wedde,et al.  Achieving dependability in mission-critical operating systems through adaptability and large-scale functional integration , 1994, Proceedings of 1994 International Conference on Parallel and Distributed Systems.

[6]  Douglas Carl Daniels The design and analysis of protocols for distributed resource scheduling under real-time constraints , 1992 .

[7]  D. Parnas,et al.  On satisfying timing constraints in hard-real-time systems , 1991, SIGSOFT '91.

[8]  Tei-Wei Kuo,et al.  Similarity-based load adjustment for real-time data-intensive applications , 1997, Proceedings Real-Time Systems Symposium.

[9]  Horst F. Wedde,et al.  Integration of Adaptive File Assignment into Distributed Safety-Critical Systems , 1996 .

[10]  David Lorge Parnas,et al.  On Satisfying Timing Constraints in Hard-Real-Time Systems , 1993, IEEE Trans. Software Eng..

[11]  Giuseppe Lipari,et al.  Schedulability analysis of periodic and aperiodic tasks with resource constraints , 2000, J. Syst. Archit..