Recoverable mobile environment: design and trade-off analysis

The mobile wireless environment poses challenging problems for the design of fault-tolerant systems because of the dynamics of mobility and the limited bandwidth available on wireless links. Traditional fault-tolerance schemes therefore cannot be applied directly to these systems. Mobile systems are often subject to environmental conditions that can cause loss of communications or data. Because most mobile systems are consumer-oriented, run-time faults must be corrected with minimal (if any) intervention from the user; the fault-tolerance capability must therefore be transparent to the user. This paper presents recovery schemes for the failure of a mobile host. It describes the limitations of the mobile wireless environment and their impact on recovery protocols. Adaptations of well-known recovery schemes that suit the mobile environment are presented. The performance of these schemes is analyzed to determine the environments for which each recovery scheme is best suited. That performance depends primarily on three factors: the wireless bandwidth, the communication-mobility ratio of the user, and the failure rate of the mobile host.
