Active Fault-Tolerant System for Open Distributed Computing

Computer systems are growing in complexity and sophistication as open distributed systems and new technologies are adopted to achieve higher reliability and performance. Open distributed systems are among the most successful structures designed for the computing community, and their benefits for users are undisputed. However, this structure has also introduced side effects, most notably unanticipated runtime events and the reconfiguration burden imposed by environmental changes. In this paper, we design a model that exploits knowledge of pre-fault behavior to predict suspected environmental faults and failures. The model can also analyze the current behavior of the underlying environment in terms of the faults and failures occurring at present. It thus provides both proactive and real-time fault-tolerant approaches for addressing unanticipated events and unpredictable hazards in distributed systems. Active fault tolerance of this kind could have a major impact as demand grows for autonomic computing, which is needed to manage the rapidly growing complexity of these systems and to enable their further growth.
