DS-MOSD: A Dynamic Selectable Master OSD Model for Large-Scale Object-Based Storage System

I/O performance for very large-scale storage systems has become more and more important as the need for storage has grown dramatically. Replication is one of the most important approaches to improve the I/O performance of massive distributed storage system. The storage node to which the Read/Write request is directed is the critical issue when multiple replicas exist in some different storage nodes simultaneously. This paper proposes and evaluates a novel master OSD model, called Dynamic Selectable Master OSD (DS- MOSD), which allows client to direct the Read/Write request to the optimized OSD. This approach has the salient advantage of simultaneously improving I/O performance and balancing the loads of all the OSDs. This paper first describes the architecture of Master OSD and characterizes the advantages of this model contrasting to the pseudo random hash function. Then an improved model, which is considered as DS-MOSD, is presented and evaluated. The experimental results show that the load unbalance degree and network overheads of DS-MOSD model are smaller than that of Fixed MOSD model, while the success probability of Write/Update transactions of DS-MOSD model is larger than that of Fixed MOSD model.

[1]  Jianxi Chen,et al.  VISA: a virtual interface storage architecture for improved network performance , 2005, Second International Conference on Embedded Software and Systems (ICESS'05).

[2]  S. Weil Leveraging Intra-object Locality with EBOFS , 2022 .

[3]  GhemawatSanjay,et al.  The Google file system , 2003 .

[4]  Scott A. Brandt,et al.  OBFS: A File System for Object-Based Storage Devices , 2004, MSST.

[5]  Yong Khai Leong,et al.  Adaptive Replica Management for Large-scale Object-based Storage Devices , 2006 .

[6]  Drew Roselli,et al.  Characteristics of File System Workloads , 1998 .

[7]  Jim Zelenka,et al.  File server scaling with network-attached secure disks , 1997, SIGMETRICS '97.

[8]  Jim Zelenka,et al.  A cost-effective, high-bandwidth storage architecture , 1998, ASPLOS VIII.

[9]  Qun Liu,et al.  HUSt: a heterogeneous unified storage system for GIS grid , 2006, SC.

[10]  Thomas E. Anderson,et al.  A Comparison of File System Workloads , 2000, USENIX Annual Technical Conference, General Track.

[11]  S.A. Brandt,et al.  CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Data , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[12]  Tao Yang,et al.  The Panasas ActiveScale Storage Cluster - Delivering Scalable High Bandwidth Storage , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[13]  Amin Vahdat,et al.  Interposed request routing for scalable network storage , 2000, TOCS.

[14]  Carlos Maltzahn,et al.  Ceph: a scalable, high-performance distributed file system , 2006, OSDI '06.