Evaluating Throughput Stability of Protocols for Distributed Middleware

Communicating large volumes of data is a core function of distributed systems middleware: it is needed to interconnect components, to distribute computation, and to provide fault tolerance. Different middleware platforms, however, implement this common function with various combinations of operating system and application-level protocols, both standardized and ad hoc, including implementations on managed runtime environments such as Java. In contrast with most previous work, which focuses on performance, this paper points out that architectural and implementation decisions affect throughput stability when the system is heavily loaded, which is precisely when such stability matters most. Specifically, we present an experimental evaluation of several communication protocol components under stress conditions and draw conclusions on the relative merits of several architectural options.
