A snapshot security protocol for radar network protection

Snapshot based rollback recovery is one of the widely used techniques for applications such as scientific computing, database, telecommunication and critical mission applications. The goal of this paper is to design and implement a snapshot security protocol, which calculates consistent global snapshot for distributed applications running on radar networks in order to add more reliability and high availability to these systems. Our protocol is non-blocking in nature and avoids unnecessary computation. We use the dependability concept to minimize the amount of information (that passes through the network) for asking only minimum number of processes to take snapshot. Moreover, the number of control messages is independent of the increase in the number of processes in the system. The size of control messages is also small.

[1]  Krishnendu Mukhopadhyaya,et al.  Concurrent checkpoint initiation and recovery algorithms on asynchronous ring network , 2004, J. Parallel Distributed Comput..

[2]  Jian Xu,et al.  Necessary and Sufficient Conditions for Consistent Global Snapshots , 1995, IEEE Trans. Parallel Distributed Syst..

[3]  D. Manivannan,et al.  An enhanced model-based checkpointing protocol for preventing useless checkpoints , 2009, Int. J. Parallel Emergent Distributed Syst..

[4]  Michel Raynal,et al.  Asynchronous Distributed Checkpointing , 2013 .

[5]  Makhlouf Aliouat,et al.  Adaptive Fault Tolerant Checkpointing Algorithm for Cluster Based Mobile Ad Hoc Networks , 2015 .

[6]  Bran Selic,et al.  A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems , 2013, The Journal of Supercomputing.

[7]  Praveen Kumar,et al.  Anti-message Logging based Check pointing Algorithm for Mobile Distributed Systems , 2013 .

[8]  D. Manivannan,et al.  HOPE: A Hybrid Optimistic checkpointing and selective Pessimistic mEssage logging protocol for large scale distributed systems , 2012, Future Gener. Comput. Syst..

[9]  V. Rajaraman,et al.  A survey of checkpointing algorithms for parallel and distributed computers , 2000 .

[10]  D. Manivannan,et al.  A fully informed model-based checkpointing protocol for preventing useless checkpoints , 2013, Int. J. Parallel Emergent Distributed Syst..

[11]  Leslie Lamport,et al.  Distributed snapshots: determining global states of distributed systems , 1985, TOCS.

[12]  Satoshi Matsuoka,et al.  Design and modeling of a non-blocking checkpointing system , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[13]  Makhlouf Aliouat,et al.  Checkpointing distributed application running on mobile ad hoc networks , 2018, Int. J. High Perform. Comput. Netw..

[14]  Zhibo Wu,et al.  A multi-cycle checkpointing protocol that ensures strict 1-rollback , 2012, Inf. Process. Lett..

[15]  S. Siva Sathya,et al.  Survey of fault tolerant techniques for grid , 2010, Comput. Sci. Rev..

[16]  Willy Zwaenepoel,et al.  The performance of consistent checkpointing , 1992, [1992] Proceedings 11th Symposium on Reliable Distributed Systems.

[17]  Shahram Rahimi,et al.  Design of High Performance Distributed Snapshot/Recovery Algorithms for Ring Networks , 2008, J. Comput. Inf. Technol..

[18]  Makhlouf Aliouat,et al.  A New Efficient Checkpointing Algorithm for Distributed Mobile Computing , 2015 .

[19]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[20]  D. Manivannan,et al.  An enhanced model-based checkpointing protocol , 2007, Parallel and Distributed Computing and Networks.

[21]  L. Alvisi,et al.  A Survey of Rollback-Recovery Protocols , 2002 .