A Fault Tolerant Mechanism for Partitioning and Offloading Framework in Pervasive Environments

Application partitioning and code offloading are being researched extensively during the past few years. Several frameworks for code offloading have been proposed. However, fewer works attempted to address issues occurred with its implementation in pervasive environments such as frequent network disconnection due to high mobility of users. Thus, in this paper, we proposed a fault tolerant algorithm that helps in consolidating the efficiency and robustness of application partitioning and offloading frameworks. To permit the usage of different fault tolerant policies such as replication and checkpointing, the devices are grouped into high and low reliability clusters. Experimental results shown that the fault tolerant algorithm can easily adapt to different execution conditions while incurring minimum overhead.

[1]  James E. Kelley,et al.  Critical-Path Planning and Scheduling: Mathematical Basis , 1961 .

[2]  John W. Young,et al.  A first order approximation to the optimum checkpoint interval , 1974, CACM.

[3]  Chong-Sun Hwang,et al.  Adaptive group scheduling mechanism using mobile agents in peer-to-peer grid computing environment , 2006, Applied Intelligence.

[4]  Naixue Xiong,et al.  A Survey on Fault-Tolerance in Distributed Network Systems , 2009, 2009 International Conference on Computational Science and Engineering.

[5]  Robert D. van der Mei,et al.  Dynamic Load Balancing and Job Replication in a Global-Scale Grid Environment: A Comparison , 2009, IEEE Transactions on Parallel and Distributed Systems.

[6]  Tristan Henderson,et al.  CRAWDAD dataset dartmouth/campus (v.2007-02-08) , 2007 .

[7]  Nawaz Mohamudally,et al.  A Dynamic Load Balancing Algorithm for Distributing Mobile Codes in Multi-Applications and Multi-Hosts Environment , 2020, ArXiv.

[8]  Gil Neiger,et al.  Automatically increasing the fault-tolerance of distributed systems , 1988, PODC '88.

[9]  Nawaz Mohamudally,et al.  Partitioning Application Using Graph Theory for Mobile Devices in Pervasive Computing Environments , 2016, FNC/MobiSPC.

[10]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[11]  Stephen L. Scott,et al.  Reliability-Aware Approach: An Incremental Checkpoint/Restart Model in HPC Environments , 2008, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID).

[12]  A. Varga,et al.  INET Framework , 2019, Recent Advances in Network Simulation.

[13]  Dimitrios Skoutas,et al.  Efficient task replication and management for adaptive fault tolerance in Mobile Grid environments , 2007, Future Gener. Comput. Syst..

[14]  Xiaomin Zhu,et al.  AGILE: A terminal energy efficient scheduling method in mobile cloud computing , 2015, Trans. Emerg. Telecommun. Technol..

[15]  Nawaz Mohamudally,et al.  Device Selection Decision Making using Multi-Criteria for Offloading Application Mobile Codes , 2020, 2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS).

[16]  Nawaz Mohamudally,et al.  Application Partitioning for Offloading in Mobile Pervasive Environments , 2019, EUSPN/ICTH.

[17]  Hyongsoon Kim,et al.  Dynamic group‐based fault tolerance technique for reliable resource management in mobile cloud computing , 2016, Concurr. Comput. Pract. Exp..