Towards Efficient Probabilistic Scheduling Guarantees for Real-Time Systems Subject to Random Errors and Random Bursts of Errors

Real-time computing and communication systems are often required to operate with prespecified levels of reliability in harsh environments, which may lead to the exposure of the system to random errors and random bursts of errors. The classical fault-tolerant schedulability analysis in such cases assumes a pseudo-periodic arrival of errors, and does not effectively capture any underlying randomness or burst characteristics. More modern approaches employ much richer stochastic error models to capture these behaviors, but this is at the expense of greatly increased complexity. In this paper, we develop a quantile-based approach to probabilistic schedulability analysis in a bid to improve efficiency whilst still retaining a rich stochastic error model capturing random errors and random bursts of errors. Our principal contribution is the derivation of a simple closed-form expression that tightly bounds the number of errors that a system must be able to tolerate at any time subsequent to its critical instant in order to achieve a specified level of reliability. We apply this technique to develop an efficient 'one-shot' schedulability analysis for a simple fault-tolerant EDF scheduler. The paper concludes that the proposed method is capable of giving efficient probabilistic scheduling guarantees, and may easily be coupled with more representative higher-level job failure models, giving rise to efficient analysis procedures for safety-critical fault-tolerant real-time systems.

[1]  David Doose,et al.  Scheduling Analysis under Fault Bursts , 2011, 2011 17th IEEE Real-Time and Embedded Technology and Applications Symposium.

[2]  Yili Hong,et al.  On computing the distribution function for the Poisson binomial distribution , 2013, Comput. Stat. Data Anal..

[3]  Julian Proenza,et al.  Probabilistic scheduling guarantees in distributed real-time systems under error bursts , 2012, Proceedings of 2012 IEEE 17th International Conference on Emerging Technologies & Factory Automation (ETFA 2012).

[4]  J. Davis Univariate Discrete Distributions , 2006 .

[5]  A. W. Kemp,et al.  Univariate Discrete Distributions , 1993 .

[6]  Thomas Nolte,et al.  Integrating reliability and timing analysis of CAN-based systems , 2002, IEEE Trans. Ind. Electron..

[7]  A. W. Kemp,et al.  Univariate Discrete Distributions: Johnson/Univariate Discrete Distributions , 2005 .

[8]  E. Gilbert Capacity of a burst-noise channel , 1960 .

[9]  Yeqiong Song,et al.  Worst-case deadline failure probability in real-time applications distributed over controller area network , 2000, J. Syst. Archit..

[10]  Hakan Aydin,et al.  Exact Fault-Sensitive Feasibility Analysis of Real-Time Tasks , 2007, IEEE Transactions on Computers.

[11]  Alan Burns,et al.  Comparing real-time communication under electromagnetic interference , 2004, Proceedings. 16th Euromicro Conference on Real-Time Systems, 2004. ECRTS 2004..

[12]  Friedrich Eisenbrand,et al.  EDF-schedulability of synchronous periodic task systems is coNP-hard , 2010, SODA '10.

[13]  Marco Spuri,et al.  Deadline Scheduling for Real-Time Systems: Edf and Related Algorithms , 2013 .

[14]  R. H. Campbell,et al.  A fault-tolerant scheduling problem , 1989, IEEE Transactions on Software Engineering.

[15]  H. Kopetz,et al.  Dependability: Basic Concepts and Terminology , 1992, Dependable Computing and Fault-Tolerant Systems.

[16]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[17]  G. Bennett Probability Inequalities for the Sum of Independent Random Variables , 1962 .

[18]  R. E. Barlow,et al.  Computing k-out-of-n System Reliability , 1984, IEEE Transactions on Reliability.

[19]  Alan Burns,et al.  An Optimal Fixed-Priority Assignment Algorithm for Supporting Fault-Tolerant Hard Real-Time Systems , 2003, IEEE Trans. Computers.

[20]  Friedrich Eisenbrand,et al.  Static-Priority Real-Time Scheduling: Response Time Computation Is NP-Hard , 2008, 2008 Real-Time Systems Symposium.

[21]  Jan Jonsson,et al.  Exact Fault-Tolerant Feasibility Analysis of Fixed-Priority Real-Time Tasks , 2010, 2010 IEEE 16th International Conference on Embedded and Real-Time Computing Systems and Applications.

[22]  M. Fernandez,et al.  Closed-Form Expression for the Poisson-Binomial Probability Density Function , 2010, IEEE Transactions on Aerospace and Electronic Systems.

[23]  David Wright,et al.  Probabilistic scheduling guarantees for fault-tolerant real-time systems , 1999, Dependable Computing for Critical Applications 7.

[24]  Hermann Kopetz,et al.  Real-time systems , 2018, CSC '73.

[25]  Torben Hagerup,et al.  A Guided Tour of Chernoff Bounds , 1990, Inf. Process. Lett..

[26]  UmaMaheswari Devi,et al.  An improved schedulability test for uniprocessor periodic task systems , 2003, 15th Euromicro Conference on Real-Time Systems, 2003. Proceedings..

[27]  Giorgio Buttazzo,et al.  Hard Real-Time Computing Systems: Predictable Scheduling Algorithms and Applications , 1997 .