FEST: Fault-Tolerant Energy-Aware Scheduling on Two-Core Heterogeneous Platform

Energy awareness and fault-tolerance have emerged as important design constraints in the development of modern safety-critical real-time systems. Devising scheduling strategies for such systems implemented on heterogeneous platforms is a challenging as well as a computationally demanding problem. As a consequence, today we face a scarcity of low-overhead scheduling techniques which are applicable to heterogeneous platforms. This work attempts to develop an efficient energy-aware heuristic scheme called, FEST, for the fault-tolerant scheduling of real-time tasks on a two-core heterogeneous platform, where one core is high-performance and the other core is power-efficient. In order to provide fault-tolerance against transient processor faults, we consider a standby-sparing approach where the power-efficient core is used to execute primary task versions while the high-performance core is operated as a spare to re-execute fault affected tasks (i.e., backups). Since, the execution of backups scheduled on the spare core will be cancelled subsequent to the fault-free execution of their primaries, we employ the Dynamic Power Management (DPM) technique on both cores to minimize energy consumption. Further, FEST utilizes backup-backup overloading to minimize energy consumption while guaranteeing tolerance against a given number of transient processor faults. Experimental results reveal that our scheme achieves good energy savings as compared to the state-of-the-art.

[1]  Petru Eles,et al.  A standby-sparing technique with low energy-overhead for fault-tolerant hard real-time systems , 2009, CODES+ISSS '09.

[2]  Rami G. Melhem,et al.  Fault-Tolerance Through Scheduling of Aperiodic Tasks in Hard Real-Time Multiprocessor Systems , 1997, IEEE Trans. Parallel Distributed Syst..

[3]  Rami G. Melhem,et al.  Fault-tolerant scheduling on a hard real-time multiprocessor system , 1994, Proceedings of 8th International Parallel Processing Symposium.

[4]  Dakai Zhu,et al.  Energy-aware Standby-Sparing Technique for periodic real-time applications , 2011, 2011 IEEE 29th International Conference on Computer Design (ICCD).

[5]  Rajesh Devaraj,et al.  Fault-Tolerant Scheduling of Non-preemptive Periodic Tasks using SCT of Timed DES on Uniprocessor Systems , 2017 .

[6]  Arnab Sarkar,et al.  A priori overload handling in ERfair scheduled embedded systems: Hybrid automata approach , 2014, 2014 Annual IEEE India Conference (INDICON).

[7]  Arnab Sarkar,et al.  ERfair Scheduler with Processor Suspension for Real-Time Multiprocessor Embedded Systems , 2016, TODE.

[8]  Alan Burns,et al.  A survey of hard real-time scheduling for multiprocessor systems , 2011, CSUR.

[9]  Rajesh Devaraj,et al.  Supervisory Control Approach and its Symbolic Computation for Power-Aware RT Scheduling , 2019, IEEE Transactions on Industrial Informatics.

[10]  Rajesh Devaraj,et al.  DES based Modeling and Fault Diagnosis in Safety-critical Semi-Partitioned Real-time Systems , 2017 .

[11]  Patrick Meumeu Yomsi,et al.  Energy-aware task mapping onto heterogeneous platforms using DVFS and sleep states , 2015, Real-Time Systems.

[12]  Yifeng Guo,et al.  Generalized Standby-Sparing techniques for energy-efficient fault tolerance in multiprocessor real-time systems , 2013, 2013 IEEE 19th International Conference on Embedded and Real-Time Computing Systems and Applications.

[13]  Abhishek Roy,et al.  Energy-efficient primary/backup scheduling techniques for heterogeneous multicore systems , 2017, 2017 Eighth International Green and Sustainable Computing Conference (IGSC).

[14]  Rajesh Devaraj,et al.  HETERO-SCHED: A Low-Overhead Heterogeneous Multi-core Scheduler for Real-Time Periodic Tasks , 2018, 2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[15]  Dakai Zhu,et al.  Shared recovery for energy efficiency and reliability enhancements in real-time applications with precedence constraints , 2013, TODE.

[16]  Rajesh Devaraj,et al.  Real-time scheduling of non-preemptive sporadic tasks on uniprocessor systems using Supervisory Control of timed DES , 2017, 2017 American Control Conference (ACC).

[17]  Hemangee K. Kapoor,et al.  Energy aware frame based fair scheduling , 2018, Sustain. Comput. Informatics Syst..

[18]  Arnab Sarkar,et al.  ERfair Scheduler with Processor Shutdown , 2009, 2009 International Conference on High Performance Computing (HiPC).

[19]  C. Mani Krishna,et al.  Fault-tolerant scheduling in homogeneous real-time systems , 2014, ACM Comput. Surv..

[20]  Rajesh Devaraj,et al.  Fault-Tolerant Preemptive Aperiodic RT Scheduling by Supervisory Control of TDES on Multiprocessors , 2017, ACM Trans. Embed. Comput. Syst..

[21]  Rajesh Devaraj,et al.  A Deadline-Partition Oriented Heterogeneous Multi-Core Scheduler for Periodic Tasks , 2017, 2017 18th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT).

[22]  Arnab Sarkar,et al.  Design of light weight exact discrete event system diagnosers using measurement limitation: case study of electronic fuel injection system , 2018, Int. J. Syst. Sci..

[23]  Sanjoy K. Baruah,et al.  Multiprocessor Scheduling for Real-Time Systems , 2015, Embedded Systems.

[24]  Abhishek Roy,et al.  Energy-aware standby-sparing on heterogeneous multicore systems , 2017, 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC).

[25]  Vanchinathan Venkataramani,et al.  Power-performance modeling on asymmetric multi-cores , 2013, 2013 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES).

[26]  Rajesh Devaraj,et al.  A design fix to supervisory control for fault-tolerant scheduling of real-time multiprocessor systems with aperiodic tasks , 2015, Int. J. Control.