Scalable stability detection using logical hypercube

This paper proposes using a logical hypercube structure to detect message stability in distributed systems. It presents a stability detection protocol built on this superimposed logical structure and compares its scalability with that of other known stability detection protocols. The main benefits of the logical hypercube approach are scalability, fault tolerance, and avoiding overload of any single node or link in the system. These benefits are evident both from an analytical comparison and from simulations. Another important feature of the logical hypercube approach is that the protocol's performance is, in general, not sensitive to the topology of the underlying physical network.
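To make the idea of a superimposed logical hypercube concrete, the following is a minimal sketch and not the protocol from the paper. It assumes the number of nodes is exactly 2^dim, that there are no failures, and that each node tracks a single acknowledged sequence number. The names `neighbors` and `hypercube_min_exchange` are hypothetical. Each node exchanges its current minimum with one logical neighbor per dimension, so after dim rounds every node knows the global minimum, which is the highest sequence number that can be considered stable under these assumptions.

```python
def neighbors(node: int, dim: int) -> list[int]:
    """Logical hypercube neighbors of `node`: flip one bit of its ID per dimension."""
    return [node ^ (1 << d) for d in range(dim)]


def hypercube_min_exchange(acks: list[int], dim: int) -> list[int]:
    """Simulate a dimension-by-dimension exchange over a logical hypercube.

    acks[i] is the highest sequence number node i has acknowledged
    (an illustrative assumption). In round d, node i exchanges its current
    value with the neighbor whose ID differs in bit d, and both keep the minimum.
    Returns the value each node holds after `dim` rounds.
    """
    n = 1 << dim
    if len(acks) != n:
        raise ValueError("this sketch assumes exactly 2**dim nodes")
    known = list(acks)
    for d in range(dim):
        # All exchanges of a round happen "in parallel": build the new list
        # from the previous round's values only.
        known = [min(known[i], known[i ^ (1 << d)]) for i in range(n)]
    return known


if __name__ == "__main__":
    acks = [7, 5, 9, 6, 8, 5, 10, 7]        # 8 nodes, dim = 3
    print(neighbors(5, 3))                  # logical neighbors of node 5
    print(hypercube_min_exchange(acks, 3))  # value held by each node after 3 rounds
```

In this sketch each node sends one message per dimension (log2 of the node count in total) and no node acts as a central collector, which illustrates how a hypercube overlay can avoid concentrating load on a single node or link.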
