Generation of fault-tolerant state-based communication schedules for real-time systems

State-based schedules use a time division multiple access (TDMA) mechanism that supports executing conditional semantics and making on-the-fly decisions at runtime in each communication cycle. Until now, state-based schedules are unable to tolerate transient faults due to the assumption that stations make the on-the-fly decision on which message to execute next. Stations may make a faulty decision at run time in an unreliable communication environment such as wireless medium due to the presence of transient faults. This faulty decision causes state inconsistency among the stations in the system.In this work, we extend state-based schedules to tolerate faulty decisions in environments where transient faults can occur at the communication layer. Our proposed approach generates fault-tolerant state-based schedules using an integer linear programming optimization model after reducing the possibility of state inconsistency through using a clock and a sampling rate synchronization mechanism. The optimization model maximizes the use of time slots to place checkpoints for fault tolerance and resolving state inconsistency.

[1]  Insup Lee,et al.  A Verifiable Language for Programming Real-Time Communication Schedules , 2007, IEEE Transactions on Computers.

[2]  Lihua Xie,et al.  Distributed Consensus With Limited Communication Data Rate , 2011, IEEE Transactions on Automatic Control.

[3]  Gonzalo Carvajal,et al.  Generation of communication schedules for multi-mode distributed real-time applications , 2014, 2014 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[4]  Insup Lee,et al.  Compositional Analysis Framework Using EDP Resource Models , 2007, 28th IEEE International Real-Time Systems Symposium (RTSS 2007).

[5]  Xi Chen,et al.  DTS: Dynamic TDMA scheduling for Networked Control Systems , 2014, J. Syst. Archit..

[6]  Rajeev Alur,et al.  Specification and Analysis of Network Resource Requirements of Control Systems , 2009, HSCC.

[7]  Insup Lee,et al.  Plug-and-play for medical devices: experiences from a case study. , 2009, Biomedical instrumentation & technology.

[8]  Donal Heffernan,et al.  Interactive Consistency on a Time-Triggered Real-Time Control Network , 2006, IEEE Transactions on Industrial Informatics.

[9]  Philip Koopman,et al.  Cyclic redundancy code (CRC) polynomial selection for embedded networks , 2004, International Conference on Dependable Systems and Networks, 2004.

[10]  Andrew S. Tanenbaum,et al.  Distributed systems: Principles and Paradigms , 2001 .

[11]  Sebastian Fischmeister,et al.  Resolving state inconsistency in distributed fault-tolerant real-time dynamic TDMA architectures , 2011, ETFA2011.

[12]  Tei-Wei Kuo,et al.  RTSS 2007: Preface , 2007, RTSS 2007.

[13]  Petru Eles,et al.  Design optimization of time- and cost-constrained fault-tolerant distributed embedded systems , 2005, Design, Automation and Test in Europe.

[14]  Insup Lee,et al.  Compositional Analysis Framework Using EDP Resource Models , 2007, RTSS 2007.

[15]  Samarjit Chakraborty,et al.  A Multi-mode Real-Time Calculus , 2008, 2008 Real-Time Systems Symposium.

[16]  Raoul Velazco,et al.  Injecting bit flip faults by means of a purely software approach: a case studied , 2002, 17th IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems, 2002. DFT 2002. Proceedings..

[17]  Yves Sorel,et al.  Fault-tolerant static scheduling for real-time distributed embedded systems , 2001, Proceedings 21st International Conference on Distributed Computing Systems.

[18]  Hermann Kopetz,et al.  Tolerating Arbitrary Node Failures in the Time-Triggered Architecture , 2001 .

[19]  RICHARD KOO,et al.  Checkpointing and Rollback-Recovery for Distributed Systems , 1986, IEEE Transactions on Software Engineering.

[20]  James H. Anderson,et al.  Supporting soft real-time parallel applications on multiprocessors , 2014, J. Syst. Archit..

[21]  Robert E. Lyons,et al.  The Use of Triple-Modular Redundancy to Improve Computer Reliability , 1962, IBM J. Res. Dev..

[22]  Akramul Azim Analyzing consensus in multi-mode real-time communication using history information , 2016, 2016 IEEE 21st International Conference on Emerging Technologies and Factory Automation (ETFA).

[23]  Insup Lee,et al.  Hardware Acceleration for Conditional State-Based Communication Scheduling on Real-Time Ethernet , 2009, IEEE Transactions on Industrial Informatics.

[24]  Andreas Steininger,et al.  Safely Stimulating the Clock Synchronization Algorithm in Time-Triggered Systems–A Combined Formal and Experimental Approach , 2009, IEEE Transactions on Industrial Informatics.

[25]  Samarjit Chakraborty,et al.  Event count automata: a state-based model for stream processing systems , 2005, 26th IEEE International Real-Time Systems Symposium (RTSS'05).

[26]  Hervé Lacheray,et al.  QBOT: An educational mobile robot controlled in MATLAB Simulink environment , 2009, 2009 Canadian Conference on Electrical and Computer Engineering.

[27]  Insup Lee,et al.  Generating Reliable Code from Hybrid-Systems Models , 2010, IEEE Transactions on Computers.

[28]  Gene H. Golub,et al.  Floating Point Fault Tolerance with Backward Error Assertions , 1995, IEEE Trans. Computers.

[29]  Donal Heffernan,et al.  TTCAN: a new time-triggered controller area network , 2002, Microprocess. Microsystems.