Bounding the effectiveness of temporal redundancy in fault-tolerant real-time scheduling under error bursts

Reliability is a key requirement in many distributed real-time systems deployed in safety and mission critical applications, and temporal redundancy is a widely employed strategy towards guaranteeing it. The temporal redundancy approach is typically based on task re-executions in form of entire tasks, task alternates or, check-pointing blocks, and each of the re-execution strategies have different impacts on the Fault Tolerance feasibility (FT-feasibility) of the system, which is traditionally defined as the existence of a schedule that guarantees timeliness of all tasks under a specified fault hypothesis. In this paper, we propose the use of resource augmentation to quantify the FT-feasibility of real-time task sets and use it to derive limits on the effectiveness of temporal redundancy in fault-tolerant real-time scheduling under error bursts of bounded lengths. We derive the limits for the general case, and then show that for the specific case when the error burst length is no longer than half the shortest deadline, the lower limit on the effectiveness of temporal redundancy is given by the resource augmentation bound 2, while, the corresponding upper-limit is 6. Our proposed approach empowers a system designer to quantify the effectiveness of a particular implementation of temporal redundancy.

[1]  Hakan Aydin,et al.  Exact Fault-Sensitive Feasibility Analysis of Real-Time Tasks , 2007, IEEE Transactions on Computers.

[2]  Sasikumar Punnekkat,et al.  Fault Tolerant Scheduling on Controller Area Network (CAN) , 2010, 2010 13th IEEE International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing Workshops.

[3]  Rolf Johansson,et al.  Probabilistic Schedulability Guarantees for Dependable Real-Time Systems under Error Bursts , 2011, 2011IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications.

[4]  Sasikumar Punnekkat,et al.  Maximizing the Fault Tolerance Capability of Fixed Priority Schedules , 2008, 2008 14th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications.

[5]  Sasikumar Punnekkat,et al.  Quantifying the Sub-optimality of Non-preemptive Real-Time Scheduling , 2013, 2013 25th Euromicro Conference on Real-Time Systems.

[6]  Hans A. Hansson,et al.  A simulation based approach for estimating the reliability of distributed real-time systems , 2001, ETFA 2001. 8th International Conference on Emerging Technologies and Factory Automation. Proceedings (Cat. No.01TH8597).

[7]  Kang G. Shin,et al.  Optimal Checkpointing of Real-Time Tasks , 1987, IEEE Transactions on Computers.

[8]  Bala Kalyanasundaram,et al.  Speed is as powerful as clairvoyance , 2000, JACM.

[9]  Alan Burns,et al.  Feasibility analysis of fault-tolerant real-time task sets , 1996, Proceedings of the Eighth Euromicro Workshop on Real-Time Systems.

[10]  Jan Jonsson,et al.  Exact Fault-Tolerant Feasibility Analysis of Fixed-Priority Real-Time Tasks , 2010, 2010 IEEE 16th International Conference on Embedded and Real-Time Computing Systems and Applications.

[11]  Alan Burns,et al.  Quantifying the sub-optimality of uniprocessor fixed-priority scheduling , 2008 .

[12]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[13]  Petru Eles,et al.  Synthesis of fault-tolerant embedded systems with checkpointing and replication , 2006, Third IEEE International Workshop on Electronic Design, Test and Applications (DELTA'06).

[14]  Sanjoy K. Baruah,et al.  Algorithms and complexity concerning the preemptive scheduling of periodic, real-time tasks on one processor , 1990, Real-Time Systems.

[15]  Alan Burns,et al.  Analysis of Checkpointing for Real-Time Systems , 2004, Real-Time Systems.

[16]  Sasikumar Punnekkat,et al.  Resource augmentation for fault-tolerance feasibility of real-time tasks under error bursts , 2012, RTNS '12.

[17]  David Doose,et al.  Scheduling Analysis under Fault Bursts , 2011, 2011 17th IEEE Real-Time and Embedded Technology and Applications Symposium.

[18]  Alan Burns,et al.  Exact quantification of the sub-optimality of uniprocessor fixed priority pre-emptive scheduling , 2009, Real-Time Systems.