Quasi-Synchronism: a step away from the traditional fault-tolerant real-time system models

Distributed fault-tolerant real-time system models have exhibited a trend to polarize themselves in extreme positions. In this paper, we assess the tness of current models to represent the attributes underlying the distributed fault-tolerance of real-time systems. Namely, we are concerned with the correctness issues arising from the temporal properties of interprocess communication: reliable and ordered group communication, replication management protocols, time services, etc. We are particularly concerned with best-eeort or mission-critical systems, where despite the hard need to fulll timing guarantees, this cannot be ensured at all times in a given operational envelope, mandating that the system be highly dynamic and adaptive. In the paper, we suggest quasi-synchronism as a framework to address this kind of systems. We nalize by pointing out some contributions to materialize the model.

[1]  Sam Toueg,et al.  Unreliable failure detectors for asynchronous systems (preliminary version) , 1991, PODC '91.

[2]  David Powell Extra Performance Architecture (XPA) , 1991 .

[3]  André Schiper,et al.  Lightweight causal and atomic group multicast , 1991, TOCS.

[4]  André Schiper,et al.  Uniform actions in asynchronous distributed systems , 1994, PODC '94.

[5]  Hermann Kopetz,et al.  Distributed fault-tolerant real-time systems: the Mars approach , 1989, IEEE Micro.

[6]  Fred B. Schneider,et al.  Understanding Protocols for Byzantine Clock Synchronization , 1987 .

[7]  Alan Burns,et al.  The drtee architecture for distributed hard real-time systems , 1992 .

[8]  Paulo Veríssimo,et al.  Real time and dependability concepts , 1993 .

[9]  Gérard Le Lann On Real-Time Distributed Computing , 1983, IFIP Congress.

[10]  David Powell Failure mode assumptions and assumption coverage , 1992 .

[11]  Parameswaran Ramanathan,et al.  Fault-tolerant clock synchronization in distributed systems , 1990, Computer.

[12]  Paulo Veríssimo,et al.  Timing failure detection and real-time group communication in quasi-synchronous systems , 1996, Proceedings of the Eighth Euromicro Workshop on Real-Time Systems.

[13]  Nancy A. Lynch,et al.  Consensus in the presence of partial synchrony , 1988, JACM.

[14]  P. Verissimo,et al.  An adaptive real-time group communication protocol , 1995, Proceedings 1995 IEEE International Workshop on Factory Communication Systems. WFCS'95.

[15]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1983, PODS '83.