Building and Evaluating a Fault-Tolerant CORBA Infrastructure

In this paper we explore the trade-offs involved in making one major middleware, CORBA, fault-tolerant. Our work builds on the FT-CORBA specification (April 2000), which is not in widespread use due to a lack of quantified knowledge about design rules and trade-offs. Recent results present a preliminary overhead and trade-off analysis for Eternal, a system for embedded applications that is not entirely FT-CORBA-compliant. In contrast to Eternal, we have extended an existing open source ORB and combined it with a collection of service objects and portable request interceptors following the FT-CORBA standard. The paper reports on extensive studies relating timing behavior to parameters such as the replication style, the number of replicas, and the checkpointing frequency. The experiments were conducted using a realistic telecom application.
