Spinglass: secure and scalable communication tools for mission-critical computing

Most existing communications technologies are either not scalable at all or scale only under carefully controlled conditions. This threatens an emerging generation of mission-critical but very large computing systems, which need communication support for such purposes as system management and control, policy administration, data dissemination, and to initiate adaptation in demanding environments. Cornell University's Spinglass project has discovered that "gossip-based" protocols can overcome scalability problems, offering security and reliability even in the most demanding settings. Gossip protocols emulate the spread of an infection in a crowded population and are both reliable and stable under forms of stress that can disable more traditional protocols. Our effort is developing a new generation of gossip-based technology for secure, reliable large-scale collaboration and soft real-time communications - even over global networks.

[1]  Ranveer Chandra,et al.  Anonymous Gossip: improving multicast reliability in mobile ad-hoc networks , 2001, Proceedings 21st International Conference on Distributed Computing Systems.

[2]  Michael H. Kalantar,et al.  Causally ordered multicast: the conservative approach , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems (Cat. No.99CB37003).

[3]  Kenneth P. Birman,et al.  Bimodal multicast , 1999, TOCS.

[4]  Kenneth P. Birman,et al.  Scalable message stability detection protocols , 1998 .

[5]  David R. Cheriton,et al.  Understanding the limitations of causally and totally ordered communication , 1994, SOSP '93.

[6]  Dennis Shasha,et al.  The dangers of replication and a solution , 1996, SIGMOD '96.

[7]  Kenneth P. Birman,et al.  A gossip protocol for subgroup multicast , 2001, Proceedings 21st International Conference on Distributed Computing Systems Workshops.

[8]  Kenneth P. Birman,et al.  A randomized error recovery algorithm for reliable multicast , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[9]  Richard A. Golding,et al.  GROUP MEMBERSHIP IN THE EPIDEMIC STYLE , 1992 .

[10]  Indranil Gupta,et al.  Fighting fire with fire: using randomized gossip to combat stochastic scalability limits , 2002 .

[11]  Kenneth P. Birman,et al.  Tools for distributed application management , 1991, Computer.

[12]  Rico Piantoni,et al.  Implementing the Swiss Exchange trading system , 1997, Proceedings of IEEE 27th International Symposium on Fault Tolerant Computing.

[13]  Deborah Estrin,et al.  Error recovery in scalable reliable multicast , 1997 .

[14]  Matthew Thomas Lucas,et al.  Efficient data distribution in large-scale multicast networks , 1998 .

[15]  Dan Dumitriu,et al.  An overview of the Galaxy management framework for scalable enterprise cluster computing , 2000, Proceedings IEEE International Conference on Cluster Computing. CLUSTER 2000.

[16]  Sanjoy Paul,et al.  Reliable Multicast Transport Protocol (RMTP) , 1997, IEEE J. Sel. Areas Commun..

[17]  M. Gerla,et al.  Multicasting protocols for high-speed, wormhole-routing local area networks , 1996, SIGCOMM '96.