FAST TRANSPARENT FAILOVER FOR RELIABLE WEB SERVICE

Fault tolerance schemes can be used to increase the availability and reliability of network services. One aspect of such schemes is service failover — the reconfiguration of available resources and restoration of state required to continue providing the service despite the loss of some of the resources and corruption of parts of the state. We have previously presented CoRAL, a fault tolerance scheme for Web service based on a redundant standby backup server and logging. The focus of this paper is the implementation and evaluation of client-transparent failover for this scheme. In the event of a primary server failure, active client connections failover to a spare where their processing continues seamlessly. If extra server resources are available, a new server can be reintegrated into the system to reestablish fault-tolerant operation. Our performance results indicate short failover times and low overhead during fault-free operation.

[1]  Yuval Tamir,et al.  Implementation and evaluation of transparent fault-tolerant Web service with kernel-level support , 2002, Proceedings. Eleventh International Conference on Computer Communications and Networks.

[2]  Mon-Yen Luo,et al.  Constructing zero-loss Web services , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[3]  Fred B. Schneider,et al.  Byzantine generals in action: implementing fail-stop processors , 1984, TOCS.

[4]  Azer Bestavros,et al.  Load balancing a cluster of web servers: using distributed packet rewriting , 2000, Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (Cat. No.00CH37086).

[5]  Rachid Guerraoui,et al.  Implementing e-Transactions with asynchronous replication , 2000, Proceeding International Conference on Dependable Systems and Networks. DSN 2000.

[6]  Yuval Tamir,et al.  Client-transparent fault-tolerant Web service , 2001, Conference Proceedings of the 2001 IEEE International Performance, Computing, and Communications Conference (Cat. No.01CH37210).

[7]  Hari Balakrishnan,et al.  Fine-Grained Failover Using Connection Migration , 2001, USITS.

[8]  Jacques Durand,et al.  Web Services Reliable Messaging TC WS-Reliability 1.1 , 2004 .

[9]  Riccardo Bettati,et al.  HydraNet-FT: network support for dependable services , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[10]  Thomas P. Brisco DNS Support for Load Balancing , 1995, RFC.

[11]  Lorenzo Alvisi,et al.  Engineering fault-tolerant TCP/IP servers using FT-TCP , 2003, 2003 International Conference on Dependable Systems and Networks, 2003. Proceedings..

[12]  Roy T. Fielding,et al.  Hypertext Transfer Protocol - HTTP/1.0 , 1996, RFC.

[13]  정희영,et al.  IETF에서의 빠른 핸드오프 기술 표준화 동향 , 2002 .

[14]  Noel Burton-Krahn,et al.  HotSwap-Transparent Server Failover for Linux , 2002, LISA.

[15]  Lorenzo Alvisi,et al.  Wrapping server-side TCP to mask connection failures , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[16]  Acee Lindem,et al.  Virtual Router Redundancy Protocol , 1998, RFC.

[17]  N. Aghdaie,et al.  Performance optimization for transparent fault-tolerant Web service , 2003, 2003 IEEE Pacific Rim Conference on Communications Computers and Signal Processing (PACRIM 2003) (Cat. No.03CH37490).

[18]  Tony Li,et al.  Cisco Hot Standby Router Protocol (HSRP) , 1998, RFC.