Fail-aware datagram service

In timed asynchronous distributed systems, it is often useful for a process p to know that another process q will not use a certain piece of information p has sent to q beyond a certain deadline. Since p learns about the occurrence of the deadline by simply measuring the passage of time on its own local clock, we call this kind of interprocess communication “communication by time”. Knowledge of computed upper bounds on one-way message transmission delays is a necessary prerequisite for this kind of communication.

[1]  F. Cristian,et al.  A fail-aware membership service , 1997, Proceedings of SRDS'97: 16th IEEE Symposium on Reliable Distributed Systems.

[2]  Flaviu Cristian,et al.  FORTRESS: A System to Support Fail-Aware Real-Time Applications , 1997 .

[3]  Flaviu Cristian,et al.  A Highly Available Local Leader Election Service , 1999, IEEE Trans. Software Eng..

[4]  J. Arlat,et al.  PADRE: a Protocol for Asymmetric Duplex REdundancy , 1999, Dependable Computing for Critical Applications 7.

[5]  Marko Schuba,et al.  Performance Investigations of the IP Multicast Architecture , 1996, Comput. Networks ISDN Syst..

[6]  Flaviu Cristian,et al.  Fail-Aware Clock Synchronization , 1996 .

[7]  Flaviu Cristian,et al.  Fail-awareness: an approach to construct fail-safe applications , 1997, Proceedings of IEEE 27th International Symposium on Fault Tolerant Computing.

[8]  Flaviu Cristian,et al.  The Timed Asynchronous Distributed System Model , 1998, IEEE Trans. Parallel Distributed Syst..

[9]  Flaviu Cristian,et al.  The Timed Asynchronous Distributed System Model , 1999, IEEE Trans. Parallel Distributed Syst..

[10]  Flaviu Cristian,et al.  Synchronous and Asynchronous Group Communication. , 1996 .

[11]  Flaviu Cristian,et al.  Understanding fault-tolerant distributed systems , 1991, CACM.

[12]  Jon Postel,et al.  User Datagram Protocol , 1980, RFC.

[13]  Flaviu Cristian,et al.  Fail-awareness in timed asynchronous systems , 1996, PODC '96.

[14]  Christof Fetzer,et al.  The message classification model , 1998, PODC '98.

[15]  Frank B. Schmuck,et al.  Agreeing on Processor Group Membership in Timed Asynchronous Distributed Systems , 1995 .

[16]  Flaviu Cristian,et al.  Synchronous and asynchronous , 1996, CACM.

[17]  David Powell Failure mode assumptions and assumption coverage , 1992 .

[18]  Flaviu Cristian,et al.  Building fault-tolerant hardware clocks from COTS components , 1999, Dependable Computing for Critical Applications 7.