HySF: A Striped File Assignment Strategy for Parallel File System with Hybrid Storage

Many algorithms have been proposed to statically assign files in parallel I/O systems. Most of them assume that the underlying storage devices are homogeneous, yet the wide use of SSDs has led many parallel I/O systems to adopt hybrid storage architectures composed of SSDs and HDDs. As a result, the previous static assignment algorithms can no longer meet the needs of this new storage architecture. In this paper, we first develop an extensive hybrid parallel I/O system in which the SSDs and HDDs are placed at the same system level. We then build performance models for both SSDs and HDDs in order to find appropriate roles for them in the parallel I/O system. Based on these performance models, we propose a novel static file assignment algorithm named HySF for the hybrid parallel I/O system, aiming at low response time and load balancing. We pursue low response time by assigning each file to the appropriate storage medium, and load balancing by sorting the files by access rate. The experimental results show that HySF achieves a much lower mean response time than the other algorithms and presents great flexibility.
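
The abstract names two ingredients: sorting files by access rate, and matching each file to a suitable storage medium. The Python sketch below shows how a generic greedy assignment built from these two ideas might look. It is not the HySF algorithm itself, whose details are not given here. The `Device` class, the `service_rate` parameter (a stand-in for a device performance model), and the cost formula `access_rate * size / service_rate` are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Device:
    """A storage device; service_rate is a hypothetical relative speed."""
    name: str
    service_rate: float
    load: float = 0.0
    files: list = field(default_factory=list)


def assign_files(files, devices):
    """Greedy static assignment (illustrative, not the paper's method).

    files: iterable of (name, access_rate, size) tuples.
    Files are visited in descending access-rate order; each one goes to
    the device whose accumulated load would be smallest after adding it.
    """
    for name, rate, size in sorted(files, key=lambda f: f[1], reverse=True):
        # Assumed cost model: faster devices absorb the same work more cheaply.
        best = min(devices, key=lambda d: d.load + rate * size / d.service_rate)
        best.load += rate * size / best.service_rate
        best.files.append(name)
    return devices


if __name__ == "__main__":
    devices = [Device("ssd0", service_rate=4.0), Device("hdd0", service_rate=1.0)]
    files = [("c", 1.0, 1.0), ("a", 10.0, 1.0), ("b", 5.0, 1.0)]
    for dev in assign_files(files, devices):
        print(dev.name, dev.files, dev.load)
```

Under this assumed cost model, frequently accessed files tend to land on the faster device until its accumulated load makes the slower device the cheaper choice, which gives a rough form of load balancing.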
