Review of State Based Approach Recovery Schemes in Mobile Distributed Environments

Fault tolerance techniques enable a system to perform tasks in the existence of faults. In a distributed system, hardware and software components are located at network computers and communication and coordination of their action is done by only passing messages. Mobile computing is progression of wireless network technology and portable information appliances such as laptops, handheld devices, PDAs etc. In the state based approach for recovery, known as snapshot, the entire state of a process is saved. When a recovery point is established, recovering a process involves reinstating its saved state and resuming the execution of the process from the state. Exhaustive research work has been carried out on designing efficient state based schemes for fault tolerance. In mobile distributed computing, due to mobility of MHs and limitations of wireless networks, there are new issues like mobility, catastrophic failure, limited battery life, low bandwidth, disconnections etc. That complicate the design of the snapshot algorithms. Recently, more attention has been given to providing state based approach of recovery for mobile systems. This paper surveys the algorithms reported in literature for introducing fault tolerance in mobile distributed systems and extension of it.

[1]  Nuno Neves,et al.  Adaptive recovery for mobile environments , 1997, CACM.

[2]  Mukesh Singhal,et al.  Mutable Checkpoints: A New Checkpointing Approach for Mobile Computing Systems , 2001, IEEE Trans. Parallel Distributed Syst..

[3]  Vijay K. Garg,et al.  Optimistic recovery in multi-threaded distributed systems , 1999, Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems.

[4]  Prashant Kumar,et al.  A synchronous checkpointing protocol for mobile distributed systems: probabilistic approach , 2007, Int. J. Inf. Comput. Secur..

[5]  Praveen Kumar,et al.  A Review of Fault Tolerant Checkpointing Protocols for Mobile Computing Systems , 2010 .

[6]  Lalit Kumar,et al.  Checkpointing in Distributed Computing Systems , 2002 .

[7]  Shahram Rahimi,et al.  A New High Performance Checkpointing Approach for Mobile Computing Systems , 2006 .

[8]  Parveen Kumar A low-cost hybrid coordinated checkpointing protocol for mobile distributed systems , 2008, Mob. Inf. Syst..

[9]  V. Rajaraman,et al.  A survey of checkpointing algorithms for parallel and distributed computers , 2000 .

[10]  Mukesh Singhal,et al.  Low-Cost Checkpointing and Failure Recovery in Mobile Computing Systems , 1996, IEEE Trans. Parallel Distributed Syst..

[11]  Tomasz Imielinski,et al.  Structuring distributed algorithms for mobile hosts , 1994, 14th International Conference on Distributed Computing Systems.

[12]  Leslie Lamport,et al.  Distributed snapshots: determining global states of distributed systems , 1985, TOCS.

[13]  Mukesh Singhal,et al.  Advanced Concepts In Operating Systems , 1994 .

[14]  Nuno Neves,et al.  Adaptive recovery for mobile environments , 1996, Proceedings. IEEE High-Assurance Systems Engineering Workshop (Cat. No.96TB100076).

[15]  Pushpendra Singh,et al.  A Checkpointing Algorithm for Mobile Computing Environment , 2003, PWC.

[16]  Parveen Kumar,et al.  Soft-Checkpointing Based Hybrid Synchronous Checkpointing Protocol for Mobile Distributed Systems , 2011, Int. J. Distributed Syst. Technol..

[17]  Parveen Kumar,et al.  Analysis of Recent Checkpointing Techniques for Mobile Computing Systems , 2011, ArXiv.

[18]  Mukesh Singhal,et al.  On the impossibility of min-process non-blocking checkpointing and an efficient checkpointing algorithm for mobile computing systems , 1998, Proceedings. 1998 International Conference on Parallel Processing (Cat. No.98EX205).

[19]  L. Alvisi,et al.  A Survey of Rollback-Recovery Protocols , 2002 .

[20]  Junguk L. Kim,et al.  An Efficient Protocol for Checkpointing Recovery in Distributed Systems , 1993, IEEE Trans. Parallel Distributed Syst..

[21]  RICHARD KOO,et al.  Checkpointing and Rollback-Recovery for Distributed Systems , 1986, IEEE Transactions on Software Engineering.

[22]  P Kanmani Fault tolerance using token ring checkpointing in distributed mobile computing systems , 2013 .

[23]  Parveen Kumar,et al.  The Design and Performance of a Checkpointing Scheme for Mobile Ad Hoc Networks , 2011 .

[24]  Rucha Ravindra Galgali,et al.  Predicting Resource Allocation in Distributed Environment by Using Online Predictive Approach: a Review , 2013 .

[25]  Mukesh Singhal,et al.  On Coordinated Checkpointing in Distributed Systems , 1998, IEEE Trans. Parallel Distributed Syst..

[26]  Bidyut Gupta,et al.  AN EFFICIENT CHECKPOINTING SCHEME FOR MOBILE COMPUTING SYSTEMS , 2001 .

[27]  Rachit Garg,et al.  A Review of Checkpointing Fault Tolerance Techniques in Distributed Mobile Systems , 2010 .

[28]  Parveen Kumar,et al.  A Comparative Study on Snapshot Protocols for Mobile Distributed Systems , 2014 .

[29]  Dhiraj K. Pradhan,et al.  Recoverable mobile environment: design and trade-off analysis , 1996, Proceedings of Annual Symposium on Fault Tolerant Computing.