The Timely Computing Base

Real-time behavior is materialized by timeliness specifications, which in essence call for synchronous system models. However, systems many often rely on largescale, unpredictable and unreliable infrastructures, that suggest the use of asynchronous models. Several models in between have addressed these antagonistic aims, each in its own way. We propose an architectural construct that addresses the problem in a generic way. We assume the existence of a component that is capable of executing timely functions, however asynchronous the rest of the system may be. This component can be used by other components to execute timely services. There is a certain analogy to the trusted computing base principle used in security. We call it the Timely Computing Base, TCB. In this paper, we show that a TCB can be used to build dependable and timely applications exhibiting varying degrees of timing fault tolerance, under several synchrony models.

[1]  Paulo Veríssimo,et al.  Timing failure detection and real-time group communication in quasi-synchronous systems , 1996, Proceedings of the Eighth Euromicro Workshop on Real-Time Systems.

[2]  Farnam Jahanian,et al.  Real-time primary-backup (RTPB) replication with temporal consistency guarantees , 1998, Proceedings. 18th International Conference on Distributed Computing Systems (Cat. No.98CB36183).

[3]  Flaviu Cristian,et al.  The Timed Asynchronous Distributed System Model , 1998, IEEE Trans. Parallel Distributed Syst..

[4]  David Powell,et al.  Failure mode assumptions and assumption coverage , 1992, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing.

[5]  Zohar Manna,et al.  The Temporal Logic of Reactive and Concurrent Systems , 1991, Springer New York.

[6]  Flaviu Cristian,et al.  Fail-awareness: an approach to construct fail-safe applications , 1997, Proceedings of IEEE 27th International Symposium on Fault Tolerant Computing.

[7]  Private Communications , 2001 .

[8]  Paulo Veríssimo,et al.  Using light-weight groups to handle timing failures in quasi-synchronous systems , 1998, Proceedings 19th IEEE Real-Time Systems Symposium (Cat. No.98CB36279).

[9]  Lixia Zhang,et al.  Resource ReSerVation Protocol (RSVP) - Version 1 Functional Specification , 1997, RFC.

[10]  Bernadette Charron-Bost,et al.  On the impossibility of group membership , 1996, PODC '96.

[11]  Feller William,et al.  An Introduction To Probability Theory And Its Applications , 1950 .

[12]  Marcelo Lubaszewski,et al.  A Reliable Fail-Safe System , 1998, IEEE Trans. Computers.

[13]  Farnam Jahanian Fault Tolerance in Embedded Real-Time Systems , 1993, Hardware and Software Architectures for Fault Tolerance.

[14]  Martin de Prycker,et al.  Asynchronous Transfer Mode, Solution for Broadband Isdn , 1991 .

[15]  Gerhard Fohler,et al.  An engineering approach towards hard real-time system design , 1991 .

[16]  Matti A. Hiltunen,et al.  Real-Time Dependable Channels: Customizing QoS Attributes for Distributed Systems , 1999, IEEE Trans. Parallel Distributed Syst..

[17]  E. D. Jensen,et al.  Alpha: a nonproprietary OS for large, complex, distributed real-time systems , 1990, IEEE Workshop on Experimental Distributed Systems.

[18]  Alan Burns,et al.  Real-time distributed computing , 1995, Proceedings of the Fifth IEEE Computer Society Workshop on Future Trends of Distributed Computing Systems.

[19]  Krithi Ramamritham,et al.  Distributed Scheduling of Tasks with Deadlines and Resource Requirements , 1989, IEEE Trans. Computers.

[20]  Sam Toueg,et al.  Inconsistency and contamination (preliminary version) , 1991, PODC '91.

[21]  Paulo Veríssimo,et al.  Using Tailored Failure Suspectors to Support Distributed Cooperative Applications , 1995, Parallel and Distributed Computing and Systems.

[22]  Kang G. Shin,et al.  End-host architecture for QoS-adaptive communication , 1998, Proceedings. Fourth IEEE Real-Time Technology and Applications Symposium (Cat. No.98TB100245).

[23]  William H. Sanders,et al.  AQuA: an adaptive architecture that provides dependable distributed objects , 1998, Proceedings Seventeenth IEEE Symposium on Reliable Distributed Systems (Cat. No.98CB36281).

[24]  Paulo Veríssimo,et al.  Quasi-Synchronism: a step away from the traditional fault-tolerant real-time system models , 1995 .

[25]  Bernard Warner Bibliography on Time Series and Stochastic Processes , 1966 .

[26]  Danny Dolev,et al.  On the minimal synchronism needed for distributed consensus , 1983, 24th Annual Symposium on Foundations of Computer Science (sfcs 1983).

[27]  Henning Schulzrinne,et al.  RTP: A Transport Protocol for Real-Time Applications , 1996, RFC.

[28]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1967 .

[29]  Sam Toueg,et al.  Unreliable failure detectors for reliable distributed systems , 1996, JACM.

[30]  Flaviu Cristian,et al.  Fail-awareness in timed asynchronous systems , 1996, PODC '96.

[31]  Ravishankar K. Iyer,et al.  Chameleon: A Software Infrastructure for Adaptive Fault Tolerance , 1999, IEEE Trans. Parallel Distributed Syst..

[32]  Herman O. A. Wold,et al.  Bibliography on Time Series and Stochastic Processes. , 1966 .

[33]  David Powell Extra Performance Architecture (XPA) , 1991 .

[34]  Emmanuelle Anceaume,et al.  On the Formal Specification of Group Membership Services , 1994 .

[35]  Nancy A. Lynch,et al.  Consensus in the presence of partial synchrony , 1988, JACM.

[36]  J. Davenport Editor , 1960 .