A Fine-Grained Parameter Configuration Model for Failure Detection in Overlay Network Systems

Failure detection is a significant challenge in most overlay network systems. And the choice of parameter configuration for a given failure detection scheme has a significant impact on the performance of the scheme. In the traditional coarse-grained parameter configuration (CGPC) model, the parameter configuration for each failure detecting process is uniform. In this paper, we analyze the disadvantages of CGPC model and then propose an alternate model, i.e. the fine-grained parameter configuration (FGPC) model, in which each detecting relationship is allocated with one independently configurable detecting process and the parameter configuration for each failure detecting process can be different. To make a tradeoff between detection time and probability of false positive for parameter configuration policies, we propose a new evaluation criterion, i.e. detection loss. Based on FGPC model, we discuss the two approaches, i.e. common approach and heuristic approach, to choose an optimal parameter configuration policy for failure detection scheme. Finally we show how to apply the FGPC model to a probe-based failure detection scheme in application-layer multicast systems as an example.

[1]  Indranil Gupta,et al.  On scalable and efficient distributed failure detectors , 2001, PODC '01.

[2]  David R. Oran,et al.  OSI IS-IS Intra-domain Routing Protocol , 1990, RFC.

[3]  Krishna P. Gummadi,et al.  The impact of DHT routing geometry on resilience and proximity , 2003, SIGCOMM '03.

[4]  Chen-Nee Chuah,et al.  Analysis of link failures in an IP backbone , 2002, IMW '02.

[5]  Zongming Fei,et al.  Cooperative Failure Detection in Overlay Multicast , 2005, NETWORKING.

[6]  Yakov Rekhter,et al.  A Border Gateway Protocol 4 (BGP-4) , 1994, RFC.

[7]  Hari Balakrishnan,et al.  Resilient overlay networks , 2001, SOSP.

[8]  Robbert van Renesse,et al.  A Gossip-Style Failure Detection Service , 2009 .

[9]  Robert Tappan Morris,et al.  Resilient overlay networks , 2001, SOSP.

[10]  Srinivasan Seshan,et al.  A case for end system multicast , 2002, IEEE J. Sel. Areas Commun..

[11]  Hui Zhang,et al.  A case for end system multicast (keynote address) , 2000, SIGMETRICS '00.

[12]  Steven McCanne,et al.  RMX: reliable multicast for heterogeneous networks , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[13]  Lixia Zhang,et al.  Host multicast: a framework for delivering multicast to end users , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[14]  Kirk L. Johnson,et al.  Overcast: reliable multicasting with on overlay network , 2000, OSDI.

[15]  Randy H. Katz,et al.  On failure detection algorithms in overlay networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..