An Energy-Efficient Fault-Tolerant Scheduling Algorithm Based on Variable Data Fragmentation

In this article, we propose an approach to build fault-tolerant distributed real-time embedded systems. From a given system description and a given fault hypothesis, we generate automatically a fault tolerant distributed schedule that achieves low energy consumption and high reliability efficiency. Our scheduling algorithm is dedicated to multi-bus heterogeneous architectures with multiple processors linked by several shared buses, which take as input a given system description and a given fault hypothesis. It is based on active redundancy to mask a fixed number L of processor failures supported in the system, and passive redundancy based on variable data fragmentation to tolerate N buses failures. In order to maximize the systems reliability, the replicas of each operation are scheduled on different reliable processors and the size of each fragmented data depends on GSFR and the bus failure rates. Finally, we show with an example that our approach can maximize reliability and reduce energy consumption when using active redundancy.

[1]  P. Agrawal,et al.  Energy-aware scheduling of distributed systems using cellular automata , 2012, 2012 IEEE International Systems Conference SysCon 2012.

[2]  Alois Knoll,et al.  Energy-Aware Task Allocation for Network-on-Chip Based Heterogeneous Multiprocessor Systems , 2011, 2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing.

[3]  Rami G. Melhem,et al.  Analysis of an energy efficient optimistic TMR scheme , 2004, Proceedings. Tenth International Conference on Parallel and Distributed Systems, 2004. ICPADS 2004..

[4]  Hong Song,et al.  The applied research of support vector machine in bus fault identification , 2010, 2010 Sixth International Conference on Natural Computation.

[5]  Alberto L. Sangiovanni-Vincentelli,et al.  Fault-tolerant deployment of embedded software for cost-sensitive real-time feedback-control applications , 2004, Proceedings Design, Automation and Test in Europe Conference and Exhibition.

[6]  Edwin Hsing-Mean Sha,et al.  Efficient fault-tolerant scheduling on multiprocessor systems via replication and deallocation , 2014, Int. J. Embed. Syst..

[7]  Alain Girault,et al.  A Novel Bicriteria Scheduling Heuristics Providing a Guaranteed Global System Failure Rate , 2009, IEEE Transactions on Dependable and Secure Computing.

[8]  Pankaj Jalote,et al.  Fault tolerance in distributed systems , 1994 .

[9]  Petru Eles,et al.  Scheduling and Optimization of Fault-Tolerant Embedded Systems with Transparency/Performance Trade-Offs , 2012, TECS.

[10]  Hermann Kopetz,et al.  COMMUNICATION RELIABILITY IN DISTRIBUTED REAL-TIME SYSTEMS* , 1992 .

[11]  Paul J. M. Havinga,et al.  Trade-off between traffic overhead and reliability in multipath routing for wireless sensor networks , 2003, 2003 IEEE Wireless Communications and Networking, 2003. WCNC 2003..

[12]  Shrisha Rao,et al.  Energy-Aware Scheduling of Distributed Systems , 2014, IEEE Transactions on Automation Science and Engineering.

[13]  Yves Sorel,et al.  Off-line real-time fault-tolerant scheduling , 2001, Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing.

[14]  C. Mani Krishna,et al.  Fault-tolerant scheduling in homogeneous real-time systems , 2014, ACM Comput. Surv..

[15]  Hermann Kopetz,et al.  Real-time systems , 2018, CSC '73.

[16]  Nagarajan Kandasamy,et al.  Dependable communication synthesis for distributed embedded systems , 2003, Reliab. Eng. Syst. Saf..