Efficient distributed algorithms for parallel I/O scheduling

In distributed systems, the lack of global information about data transfer between clients and servers makes implementation of parallel I/O a challenging task. In this paper, we propose two distributed algorithms for scheduling data transfer in parallel I/O with non-uniform data sizes, the maximum-size/maximum-load (MS/ML) algorithm and the minimum-size/earliest-completion-first (MS/ECF) algorithm. Experimental results indicate that both algorithms achieve good performance, compared with the results achieved by their centralized counterparts. Both algorithms yielded parallel performances within 6% of the centralized solutions. We also compare the performance of our algorithms with a distributed highest degree first (HDF) method, which handles non-uniform data transfers by dividing them into units of fixed-sized blocks which are then scheduled and transferred one at a time. Experimental results show that our algorithms require less scheduling and data transfer time, resulting in better overall parallel I/O performance. Our simulations also show that MS/ML is more suitable for parallel I/O with lighter data transfer traffic, while MS/ECF is more suitable for parallel I/O with heavy data transfer traffic.

[1]  Alessandro Panconesi,et al.  Nearly optimal distributed edge coloring in O (log log n ) rounds , 1997 .

[2]  Alessandro Panconesi,et al.  Near-Optimal, Distributed Edge Colouring via the Nibble Method , 1996, Theor. Comput. Sci..

[3]  Andrew A. Chien,et al.  PPFS: a high performance portable parallel file system , 1995, ICS '95.

[4]  Ravi Jain,et al.  Improving the Performance of Parallel I/O Using Distributed Scheduling Algorithms , 1996, Input/Output in Parallel and Distributed Computer Systems.

[5]  Shikharesh Majumdar,et al.  Performance of parallel I/O scheduling strategies on a network of workstations , 2001, Proceedings. Eighth International Conference on Parallel and Distributed Systems. ICPADS 2001.

[6]  Ravi Jain,et al.  Scheduling Parallel I/O Operations in Multiple Bus Systems , 1992, J. Parallel Distributed Comput..

[7]  Ravi Jain,et al.  Distributed scheduling algorithms to improve the performance of parallel data transfers , 1994, CARN.

[8]  Rajeev Thakur,et al.  Passion: Optimized I/O for Parallel Applications , 1996, Computer.

[9]  Alessandro Panconesi,et al.  Nearly optimal distributed edge colouring in O(log log n) rounds , 1997, SODA '97.

[10]  Ravi Jain,et al.  Parallel I/O scheduling using randomized, distributed edge coloring algorithms , 2003, J. Parallel Distributed Comput..

[11]  Ravi Jain,et al.  Heuristics for Scheduling I/O Operations , 1997, IEEE Trans. Parallel Distributed Syst..

[12]  Aravind Srinivasan,et al.  Randomized Distributed Edge Coloring via an Extension of the Chernoff-Hoeffding Bounds , 1997, SIAM J. Comput..

[13]  Marianne Winslett,et al.  Exploiting local data in parallel array I/O on a practical network of workstations , 1997, IOPADS '97.

[14]  Robert B. Ross,et al.  PVFS: A Parallel File System for Linux Clusters , 2000, Annual Linux Showcase & Conference.

[15]  Ravi Jain,et al.  Applying randomized edge coloring algorithms to distributed communication: an experimental study , 1995, SPAA '95.

[16]  Aravind Srinivasan,et al.  Fast randomized algorithms for distributed edge coloring , 1992, PODC '92.

[17]  Steven A. Moyer,et al.  PIOUS: a scalable parallel I/O system for distributed computing environments , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[18]  Peter Brezany,et al.  A Software Architecture for Massively Parallel Input-Output , 1996, PARA.

[19]  Teofilo F. Gonzalez,et al.  Open Shop Scheduling to Minimize Finish Time , 1976, JACM.

[20]  S. R. Subramanya,et al.  Routing and Scheduling I/O Transfers on Wormhole-Routed Mesh Networks , 1999, J. Parallel Distributed Comput..

[21]  Marianne Winslett,et al.  Server-Directed Collective I/O in Panda , 1995, Proceedings of the IEEE/ACM SC95 Conference.