A Failure Detection Model Based on Message Delay Prediction

Failure detection is a key technology to implement a high reliable system. It is usually based on overtime mechanism to determine whether a process is failure or not. With the development of network, old failure detectors without adaptive mechanism can not meet the requirements of QoS of application all the time. Adaptive failure detection requires that the failure detectors can dynamically adjust the detecting quality according to the requirements of applications and the variations of network. A new failure detection model based on the predicted message delay is proposed in this paper. An adaptive failure detection algorithm is discussed and realized, which is based on the prediction from historical messages delay time. Experimental results show that the algorithm can satisfy the user’s demand of QoS on the failure detector to some extent.

[1]  Pierre Sens,et al.  Implementation and performance evaluation of an adaptable failure detector , 2002, Proceedings International Conference on Dependable Systems and Networks.

[2]  Naohiro Hayashibara,et al.  Failure detectors for large-scale distributed systems , 2002, 21st IEEE Symposium on Reliable Distributed Systems, 2002. Proceedings..

[3]  Andrea Bondavalli,et al.  Experimental evaluation of the QoS of failure detectors on wide area network , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[4]  Sam Toueg,et al.  Unreliable failure detectors for reliable distributed systems , 1996, JACM.

[5]  Naixue Xiong,et al.  Design and analysis of quality of service on distributed fault-tolerant communication networks , 2008 .

[6]  Jean-Chrysotome Bolot End-to-end packet delay and loss behavior in the internet , 1993, SIGCOMM 1993.

[7]  Naohiro Hayashibara,et al.  Implementation and Performance Analysis of the φ-Failure Detector , 2003 .

[8]  Hong Jiang,et al.  Distributed systems of simple interacting agents , 2007 .

[9]  Marcos K. Aguilera,et al.  On the quality of service of failure detectors , 2000, Proceeding International Conference on Dependable Systems and Networks. DSN 2000.

[10]  Yun Xiao-chun Performance analysis and research of evaluation of failure detection , 2007 .

[11]  Michel Raynal,et al.  An adaptive failure detection protocol , 2001, Proceedings 2001 Pacific Rim International Symposium on Dependable Computing.

[12]  Piotr Zielinski Automatic Classification of Eventual Failure Detectors , 2007, DISC.