Layer 4 Fault Tolerance: Reliability Techniques for Cluster System in Internet Services

Cluster systems are widely used in today's Internet services because of their high scalability and high availability. Linux Virtual Server (LVS), a popular technology for building cluster-based Internet services with a layer 4 scheduling strategy, is attracting increasing interest from ISPs and researchers alike. This paper proposes three new methods for achieving connection-level (layer 4) fault tolerance for the Load Balancer: one is called incremental backup, and the other two are forms of state mirroring. With these methods, the connections maintained on the primary machine can be safely restored on the backup machine if the primary fails, without losing unfinished work, thus ensuring high availability of the Load Balancer. Compared with previous work, our schemes are more efficient and flexible. Although they are based on LVS, these methods can also be applied to other architectures that use layer 4 switching.
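
The abstract does not spell out how incremental backup works, so the following is only a minimal sketch of the general idea, assuming the primary logs each change to its connection table and periodically ships the accumulated changes to the backup. The names `Primary`, `Backup`, `flush`, and `apply`, and the connection tuple layout, are hypothetical and are not taken from the paper or from LVS.

```python
from dataclasses import dataclass, field


@dataclass
class Primary:
    """Hypothetical primary load balancer holding the live connection table."""
    table: dict = field(default_factory=dict)    # connection -> chosen real server
    pending: list = field(default_factory=list)  # changes not yet sent to the backup

    def open(self, conn, real_server):
        self.table[conn] = real_server
        self.pending.append(("add", conn, real_server))

    def close(self, conn):
        # Only log a deletion if the connection was actually tracked.
        if self.table.pop(conn, None) is not None:
            self.pending.append(("del", conn, None))

    def flush(self):
        """Return the changes made since the last flush and clear the log."""
        delta, self.pending = self.pending, []
        return delta


@dataclass
class Backup:
    """Hypothetical backup that rebuilds the table from incremental deltas."""
    table: dict = field(default_factory=dict)

    def apply(self, delta):
        for op, conn, real_server in delta:
            if op == "add":
                self.table[conn] = real_server
            else:
                self.table.pop(conn, None)


# Assumed connection key: (client_ip, client_port, virtual_ip, virtual_port).
primary, backup = Primary(), Backup()
c1 = ("10.0.0.1", 40001, "192.0.2.10", 80)
c2 = ("10.0.0.2", 40002, "192.0.2.10", 80)

primary.open(c1, "rs1")
primary.open(c2, "rs2")
backup.apply(primary.flush())   # first increment: two new connections

primary.close(c1)
backup.apply(primary.flush())   # second increment: one deletion
```

After the second increment the backup's table matches the primary's, so in this simplified model a takeover could keep forwarding `c2` to the same real server. Connections opened after the last flush would still be lost on failure, which is one reason a scheme might instead mirror every state change as it happens.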
