Energy Optimization and Fault Tolerance to Embedded System Based on Adaptive Heterogeneous Multi-Core Hardware Architecture

Energy optimization and fault recovery are significant to ensure the embedded devices to work persistently and reliably. To achieve the energy optimization, a reconfigurable heterogeneous multi-core hardware platform is designed. Based on this hardware platform, a multi-core energy-efficient scheduling mechanism using reinforcement learning algorithm to search for the optimal scheduling solution is proposed. By the above hardware and software collaborative optimization mechanism, the energy cost of embedded system can be decreased significantly. To perform the fault recovery, a multi-core control-flow fault recovery mechanism is researched. This mechanism uses the Petri net model to detect the multi-core control-flow faults, and then recover these faults by a hardware-based quick recovery technique. The experimental results showed the energy cost of EMWSN could be optimized by more than 20% comparing to the traditional multi-core system. In addition, about 90% multi-core control-flow faults could be recovered, and the recovery time was nearly 40% less than the software-based recovery technique CFCSS.

[1]  Da He,et al.  A Heuristic Energy-Aware Approach for Hard Real-Time Systems on Multi-core Platforms , 2012, DSD.

[2]  Ioannis Dimitriou,et al.  A retrial queue for modeling fault-tolerant systems with checkpointing and rollback recovery , 2015, Comput. Ind. Eng..

[3]  Vinay Devadas,et al.  Coordinated power management of periodic real-time tasks on chip multiprocessors , 2010, International Conference on Green Computing.

[4]  Bishop Brock,et al.  Dynamic Power Management for Embedded Systems , 2003 .

[5]  Jean-Pierre Chanet,et al.  LiveNode: LIMOS versatile embedded wireless sensor node , 2007 .

[6]  Anuradha Pughat,et al.  A review on stochastic approach for dynamic power management in wireless sensor networks , 2015, Human-centric Computing and Information Sciences.

[7]  Qing Wang,et al.  Energy and Delay Optimization of Heterogeneous Multicore Wireless Multimedia Sensor Nodes by Adaptive Genetic-Simulated Annealing Algorithm , 2018, Wirel. Commun. Mob. Comput..

[8]  Jian-Jia Chen,et al.  Reliability Optimization on Multi-Core Systems with Multi-Tasking and Redundant Multi-Threading , 2018, IEEE Transactions on Computers.

[9]  Jacob A. Abraham,et al.  ACCE: Automatic correction of control-flow errors , 2007, 2007 IEEE International Test Conference.

[10]  Raffaela Mirandola,et al.  Dynamic power management for QoS-aware applications , 2013, Sustain. Comput. Informatics Syst..

[11]  Oliver Diessel,et al.  Fine-grained module-based error recovery in FPGA-based TMR systems , 2016, FPT.

[12]  Haiying Zhou,et al.  Memory and Energy Optimization Strategies for Multithreaded Operating System on the Resource-Constrained Wireless Sensor Node , 2014, Sensors.

[13]  Ali N. Akansu,et al.  FPGA, GPU, and CPU implementations of Jacobi algorithm for eigenanalysis , 2016, J. Parallel Distributed Comput..

[14]  Lu Wang,et al.  Deadlock Property Analysis of Concurrent Programs Based on Petri Net Structure , 2017, International Journal of Parallel Programming.

[15]  Gang Chen,et al.  Abstract: Energy optimization for real-time multiprocessor system-on-chip with optimal DVFS and DPM combination , 2013, ESTImedia.

[16]  Changjun Jiang,et al.  Energy Efficiency Aware Task Assignment with DVFS in Heterogeneous Hadoop Clusters , 2018, IEEE Transactions on Parallel and Distributed Systems.

[17]  Neil Weste,et al.  Principles of CMOS VLSI Design , 1985 .

[18]  D. T. Lee,et al.  Online Dynamic Power Management with Hard Real-Time Guarantees , 2014, STACS.

[19]  Seehwan Yoo An empirical validation of power-performance scaling: DVFS vs. multi-core scaling in big.LITTLE processor , 2015, IEICE Electron. Express.

[20]  Alireza Ejlali,et al.  Dataset supporting the article entitled "Two-Phase Low-Energy N-Modular Redundancy for Hard Real-Time Multi-Core Systems" , 2016 .

[21]  Chao Wang,et al.  WCET-Aware Partial Control-Flow Checking for Resource-Constrained Real-Time Embedded Systems , 2014, IEEE Transactions on Industrial Electronics.

[22]  Hans Hallez,et al.  Random Additive Signature Monitoring for Control Flow Error Detection , 2017, IEEE Transactions on Reliability.

[23]  Chia-Ming Wu,et al.  A green energy-efficient scheduling algorithm using the DVFS technique for cloud datacenters , 2014, Future Gener. Comput. Syst..

[24]  Shengwu Xiong,et al.  LiveWSN: A memory-efficient, energy-efficient, reprogrammable, and fault-tolerant platform for wireless sensor network , 2016, Int. J. Distributed Sens. Networks.

[25]  Wolfgang Schröder-Preikschat,et al.  Demystifying Soft-Error Mitigation by Control-Flow Checking -- A New Perspective on its Effectiveness , 2017, ACM Trans. Embed. Comput. Syst..

[26]  Naresh R. Shanbhag,et al.  Soft N-Modular Redundancy , 2012, IEEE Transactions on Computers.