An adaptive real-time group communication protocol

In this paper, we address the problem of having real-time group communication in a system. That is not completely synchronous (quasi-synchronous). This situation may happen when system load is not completely controlled and so, overload scenarios are not ruled out. In such scenarios, and for most of the networks used in this type of environment, temporal guarantees can only be made for the highest priority messages. The paper describes a group communication protocol that provides a total temporal order in such an environment. It adapts to the uncertainty of the environment by means of a failure detection service, in order to be able to achieve its safety properties in a timely fashion. The protocol also tries to provide early-delivery while preserving its ordering property.

[1]  Flaviu Cristian,et al.  Agreeing on who is present and who is absent in a synchronous distributed system , 1988, [1988] The Eighteenth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[2]  Sam Toueg,et al.  Inconsistency and contamination (preliminary version) , 1991, PODC '91.

[3]  David Powell,et al.  Failure mode assumptions and assumption coverage , 1992, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing.

[4]  Flaviu Cristian,et al.  Atomic Broadcast: From Simple Message Diffusion to Byzantine Agreement , 1995, Inf. Comput..

[5]  Flaviu Cristian,et al.  Early-delivery atomic broadcast , 1990, PODC '90.

[6]  José Rufino,et al.  ENFORCING REAL-TIME BEHAVIOUR ON LAN-BASED PROTOCOLS , 1992 .

[7]  Nancy A. Lynch,et al.  Consensus in the presence of partial synchrony , 1988, JACM.

[8]  Sam Toueg,et al.  Unreliable failure detectors for asynchronous systems (preliminary version) , 1991, PODC '91.

[9]  Kenneth P. Birman,et al.  Using process groups to implement failure detection in asynchronous environments , 1991, PODC '91.

[10]  Danny Dolev,et al.  On the minimal synchronism needed for distributed consensus , 1983, 24th Annual Symposium on Foundations of Computer Science (sfcs 1983).

[11]  Hermann Kopetz,et al.  TTP - A time-triggered protocol for fault-tolerant real-time systems , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[12]  Paulo Veríssimo,et al.  xAMp: a multi-primitive group communications service , 1992, [1992] Proceedings 11th Symposium on Reliable Distributed Systems.

[13]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1983, PODS '83.

[14]  José Rufino,et al.  A study on the inaccessibility characteristics of ISO 8802/4 token-bus LANs , 1992, [Proceedings] IEEE INFOCOM '92: The Conference on Computer Communications.

[15]  David Powell Failure mode assumptions and assumption coverage , 1992 .