A Dynamic Fault-Tolerant Model for Open Distributed Computing

Open distributed computing systems are among the most successful structures ever designed for the computing community, and their benefits for users are undisputed. However, their complexity has also introduced side effects, most notably the unpredictable nature of the underlying environments and the reconfiguration burden imposed by environmental changes. To achieve a high level of system performance, a required level of reliability must therefore be maintained. In this paper, we propose a model for analyzing faults and failures in the underlying environment. The model provides an adaptable fault-tolerant approach for addressing unanticipated events and unpredictable hazards in distributed systems: it maintains the required reliability by analyzing the environment and selecting the optimal replication strategy for the existing conditions. This pragmatic and theoretically appealing approach is part of the Juice system, which supports adaptation properties for open distributed environments.
