Transparent Reliable Multicast for Ethernet-Based Storage Area Networks

As disk storage density increases and data availability requirements become ever more demanding, data replication is increasingly an indispensable feature of enterprise-class storage systems. For highly available storage systems, every disk block is typically replicated on a local mirror server and a remote mirror server in addition to being stored on the main storage server. In a network storage environment, this N-way data replication increases the write traffic load on the storage client's network interface by N times. Multicast is a natural solution to this problem. However, existing storage area network technologies such as fibre channel and Ethernet do not provide adequate support for linklayer multicast. This paper describes a novel reliable multicast mechanism for Ethernet-based storage area networks that effectively exploits the virtual LAN technology and is able to keep the traffic load of N-way replication roughly at the same level as the no-replication case regardless of the value of N. This technology greatly increases the appeal of using Ethernet as the physical-layer technology for storage area networking, as fibre channel networks currently do not support such in-network replication. Performance measurements on an iSCSI-based network storage system demonstrate that the proposed reliable multicast mechanism is able to reduce the end-to-end data transport time by a factor of 2.6 for three-way replication when the disk write size is more than 4 Mbytes.

[1]  Julie Ward,et al.  Appia: Automatic Storage Area Network Fabric Design , 2002, FAST.

[2]  Julian Satran,et al.  Internet Small Computer Systems Interface (iSCSI) , 2004, RFC.

[3]  Liviu Iftode,et al.  Sixth IEEE International Symposium on Network Computing and Applications (NCA 2007), 12 - 14 July 2007, Cambridge, MA, USA , 2007, IEEE International Symposium on Network Computing and Applications.

[4]  Tzi-cker Chiueh,et al.  Track-based disk logging , 2002, Proceedings International Conference on Dependable Systems and Networks.

[5]  Robert D. Russell,et al.  Evaluating the Effect of iSCSI Protocol Parameters on Performance , 2005, Parallel and Distributed Computing and Networks.

[6]  San-Yih Hwang,et al.  Data Replication in a Distributed System: A Performance Study , 1996, DEXA.

[7]  Lionel M. Ni,et al.  Network planning and tuning in switch-based LANs , 1998, Proceedings. 1998 International Conference on Parallel Processing (Cat. No.98EX205).

[8]  Tzi-cker Chiueh,et al.  Network-Wide Load Balancing Routing With Performance Guarantees , 2006, 2006 IEEE International Conference on Communications.

[9]  Garth A. Gibson,et al.  RAID: high-performance, reliable secondary storage , 1994, CSUR.

[10]  Andrea C. Arpaci-Dusseau,et al.  Association Proceedings of the Third USENIX Conference on File and Storage Technologies San Francisco , CA , USA March 31 – April 2 , 2004 , 2004 .

[11]  William C. Fenner Internet Group Management Protocol, Version 2 , 1997, RFC.

[12]  Albert G. Greenberg,et al.  A flexible model for resource management in virtual private networks , 1999, SIGCOMM '99.

[13]  M. W. Sachs,et al.  Fibre Channel and related standards , 1996, IEEE Commun. Mag..