A Comparative Study on Snapshot Protocols for Mobile Distributed Systems

In Mobile Distributed Systems (MDS), we come across issues such as the low bandwidth of wireless channels, mobility, the lack of stable storage on mobile nodes, limited battery power, disconnections, and the high failure rate of mobile nodes. Fault tolerance techniques enable systems to perform their tasks in the presence of faults. When a fault occurs in a Distributed System (DS), a snapshot enables the execution of a program to be resumed from a previous consistent global state rather than from the beginning. Thus, the amount of useful processing lost because of the fault is significantly reduced. Coordinated global snapshot is an effective Fault Tolerance Technique (FTT) in DS, as it avoids the domino effect and requires minimal storage. In this paper, we study the available snapshot collection schemes for DS and MDS. Then, we perform a comparative analysis of the different schemes.
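
To make the notion of a consistent global state concrete, the following is a minimal sketch in Python, not taken from any of the surveyed protocols. It assumes the standard definition that a global state is consistent when it contains no orphan message, that is, no message whose receipt is recorded by some process while its send is recorded by none. The names `LocalSnapshot`, `orphan_messages`, and `is_consistent` are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class LocalSnapshot:
    """State recorded by one process (hypothetical structure for illustration)."""
    process: str
    sent: set = field(default_factory=set)      # ids of messages recorded as sent
    received: set = field(default_factory=set)  # ids of messages recorded as received


def orphan_messages(snapshots):
    """Return ids of messages recorded as received but not recorded as sent."""
    all_sent = set().union(*(s.sent for s in snapshots))
    all_received = set().union(*(s.received for s in snapshots))
    return all_received - all_sent


def is_consistent(snapshots):
    """A set of local snapshots is consistent if it has no orphan message."""
    return not orphan_messages(snapshots)


# P1 recorded its snapshot after sending m1; P2 recorded receiving m1.
good = [LocalSnapshot("P1", sent={"m1"}), LocalSnapshot("P2", received={"m1"})]

# P1 recorded its snapshot before sending m1, yet P2 recorded receiving it.
bad = [LocalSnapshot("P1"), LocalSnapshot("P2", received={"m1"})]
```

Under these assumptions, `is_consistent(good)` holds while `is_consistent(bad)` does not: rolling back to `bad` would leave P2 having received a message that P1 never sent. Coordinated schemes synchronize the processes so that only states of the first kind are recorded.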
