MEAD: support for Real‐Time Fault‐Tolerant CORBA

The OMG's Real‐Time CORBA (RT‐CORBA) and Fault‐Tolerant CORBA (FT‐CORBA) specifications make it possible for today's CORBA implementations to exhibit either real‐time behavior or fault tolerance, but only in isolation. While real‐time operation requires a priori knowledge of the system's temporal behavior, fault tolerance must deal with faults that occur unexpectedly and with fault recovery times that may be unpredictable. The MEAD (Middleware for Embedded Adaptive Dependability) system attempts to identify and reconcile the conflicts between real‐time and fault tolerance, in a resource‐aware manner, for distributed CORBA applications. MEAD supports transparent yet tunable fault tolerance in real time, proactive dependability, and resource‐aware system adaptation to crash, communication, and timing faults, with bounded fault detection and fault recovery. Copyright © 2005 John Wiley & Sons, Ltd.
